This paper introduces the IBM Expressive Speech Synthesis system. We describe recent work on improving the quality of our baseline text-to-speech system and on extending its capabilities to generate expressive synthetic speech. We present results showing improved base quality, especially for sentences drawn from a limited domain. We also demonstrate our ability to convey good news and bad news, produce contrastive emphasis, and ask a question appropriately. To facilitate access to the expressive capabilities, we use some of our proposed extensions to the Speech Synthesis Markup Language (SSML).
Cite as: Hamza, W., Eide, E., Bakis, R., Picheny, M., Pitrelli, J. (2004) The IBM expressive speech synthesis system. Proc. Interspeech 2004, 2577-2580, doi: 10.21437/Interspeech.2004-485
@inproceedings{hamza04b_interspeech,
  author={Wael Hamza and Ellen Eide and Raimo Bakis and Michael Picheny and John Pitrelli},
  title={{The IBM expressive speech synthesis system}},
  year=2004,
  booktitle={Proc. Interspeech 2004},
  pages={2577--2580},
  doi={10.21437/Interspeech.2004-485}
}