The paper assesses the capability of an HMM-based TTS system to produce German speech. The results are discussed in qualitative terms, and compared over three different choices of context features. In addition, the system is adapted to a small set of football announcements, in an exploratory attempt to synthesise expressive speech. We conclude that the HMMs are able to produce highly intelligible neutral German speech, with a stable quality, and that the expressivity is partially captured in spite of the small size of the football dataset.
Cite as: Krstulović, S., Hunecke, A., Schröder, M. (2007) An HMM-based speech synthesis system applied to German and its adaptation to a limited set of expressive football announcements. Proc. Interspeech 2007, 1897-1900, doi: 10.21437/Interspeech.2007-527
@inproceedings{krstulovic07_interspeech, author={Sacha Krstulović and Anna Hunecke and Marc Schröder}, title={{An HMM-based speech synthesis system applied to German and its adaptation to a limited set of expressive football announcements}}, year=2007, booktitle={Proc. Interspeech 2007}, pages={1897--1900}, doi={10.21437/Interspeech.2007-527} }