C5: Information Density Aware Text-to-Speech Synthesis

Project C5 investigates how text-to-speech (TTS) synthesis techniques can be enhanced to take knowledge about information and encoding density into account. The project explores methods to connect and align the processing of high-level information with its encoding into low-level phonetic parameters in TTS synthesis. The approach is to encode information density in two stages: first, directly as high-level parameters during TTS voice building (offline) and, second, during runtime synthesis (online). Quantification of information density can also be used to develop a model of listeners’ susceptibility to synthesis artifacts, in order to automatically predict and pre-emptively improve the perceived output quality by selecting a sequence of acoustic units that forms the desired variation and density of encoding given a defined degree of information density.

Publications

2018

Raveh, Eran ; Steiner, Ingmar ; Gessinger, Iona ; Möbius, Bernd

Studying Mutual Phonetic Influence With a Web-Based Spoken Dialogue System Inproceedings Forthcoming

20th International Conference on Speech and Computer (SPECOM), Leipzig, Germany, Forthcoming.

BibTeX

Gessinger, Iona ; Raveh, Eran ; Möbius, Bernd ; Steiner, Ingmar

Phonetic Accommodation in HCI: Introducing a Wizard-of-Oz Experiment Inproceedings Forthcoming

Phonetik & Phonologie 14, Vienna, Austria, Forthcoming.

BibTeX

Gessinger, Iona ; Schweitzer, Antje ; Andreeva, Bistra ; Raveh, Eran ; Möbius, Bernd ; Steiner, Ingmar

Convergence of Pitch Accents in a Shadowing Task Inproceedings

9th International Conference on Speech Prosody, pp. 225-229, Poznán, Poland, 2018.

Links | BibTeX

Steiner, Ingmar ; Le Maguer, Sébastien

Creating New Language and Voice Components for the Updated MaryTTS Text-to-Speech Synthesis Platform Inproceedings

11th Language Resources and Evaluation Conference (LREC), pp. 3171-3175, Miyazaki, Japan, 2018.

Links | BibTeX

2017

Steiner, Ingmar ; Le Maguer, Sébastien ; Hewer, Alexander

Synthesis of Tongue Motion and Acoustics from Text using a Multimodal Articulatory Database Journal Article

IEEE/ACM Transactions on Audio, Speech, and Language Processing, 25 (12), pp. 2351-2361, 2017.

Links | BibTeX

Le Maguer, Sébastien ; Steiner, Ingmar

The "Uprooted" MaryTTS Entry for the Blizzard Challenge 2017 Inproceedings

Blizzard Challenge, Stockholm, Sweden, 2017.

Links | BibTeX

Gessinger, Iona ; Raveh, Eran ; Le Maguer, Sébastien ; Möbius, Bernd ; Steiner, Ingmar

Shadowing Synthesized Speech -- Segmental Analysis of Phonetic Convergence Inproceedings

Interspeech, pp. 3797-3801, Stockholm, Sweden, 2017.

Links | BibTeX

Le Maguer}, Sébastien ; Steiner, Ingmar ; Hewer, Alexander

An HMM/DNN comparison for synchronized text-to-speech and tongue motion synthesis Inproceedings

Interspeech, pp. 239-243, Stockholm, Sweden, 2017.

Links | BibTeX

Steiner, Ingmar ; Le Maguer, Sébastien ; Manzoni, Judith ; Gilles, Peter ; Trouvain, Jürgen

Developing new language tools for MaryTTS: the case of Luxembourgish Inproceedings

Trouvain, Jürgen ; Steiner, Ingmar ; Möbius, Bernd (Ed.): 28th Conference on Electronic Speech Signal Processing (ESSV), pp. 186-192, Saarbrücken, Germany, 2017.

Links | BibTeX

Raveh, Eran ; Gessinger, Iona ; Le Maguer, Sébastien ; Möbius, Bernd ; Steiner, Ingmar

Investigating Phonetic Convergence in a Shadowing Experiment with Synthetic Stimuli Inproceedings

Trouvain, Jürgen ; Steiner, Ingmar ; Möbius, Bernd (Ed.): 28th Conference on Electronic Speech Signal Processing (ESSV), pp. 254-261, Saarbrücken, Germany, 2017.

Links | BibTeX

Le Maguer, Sébastien ; Steiner, Ingmar

Uprooting MaryTTS: Agile Processing and Voicebuilding Inproceedings

Trouvain, Jürgen ; Steiner, Ingmar ; Möbius, Bernd (Ed.): 28th Conference on Electronic Speech Signal Processing (ESSV), pp. 152-159, Saarbrücken, Germany, 2017.

Links | BibTeX

2016

Le Maguer, Sébastien ; Steiner, Ingmar

The MaryTTS entry for the Blizzard Challenge 2016 Inproceedings

Blizzard Challenge, Cupertino, CA, USA, 2016.

Links | BibTeX

Le Maguer, Sébastien ; Möbius, Bernd; Steiner, Ingmar; Lolive, Damien

De l'utilisation de descripteurs issus de la linguistique computationnelle dans le cadre de la synthèse par HMM Inproceedings

Proc. Journées d'Etudes sur la Parole, Paris, 2016.

Links | BibTeX

Le Maguer, Sébastien ; Möbius, Bernd; Steiner, Ingmar

Toward the use of information density based descriptive features in HMM based speech synthesis Inproceedings

8th International Conference on Speech Prosody, pp. 1029–1033, Boston, MA, USA, 2016.

Links | BibTeX

2015

Le Maguer, Sébastien ; Steiner, Ingmar; Möbius, Bernd

Toward a Speech Synthesis Guided by the Modeling of Unexpected Events Inproceedings

Schweitzer, Antje ; Dogil, Grzegorz (Ed.): Workshop on Modeling Variability in Speech, Stuttgart, Germany, 2015.

Links | BibTeX

Ingmar Steiner

PI

Mail
Website

Sébastien LeMaguer

Postdoc

Mail
Website