ISCA Archive Eurospeech 1997
ISCA Archive Eurospeech 1997

Prosodic modelling in text-to-speech synthesis

Jan P. H. van Santen

This paper discusses three broad obstacles that must be overcome to improve prosodic quality in text-to-speech systems. First, direct and indirect limits set by the signal processing ("synthesis") components. Second, combinatorial and statistical constraints inherent in generalizing from training corpora to unrestricted domains, and that require the integration of contentspecific knowledge and detailed mathematical modeling. Third, the nature of many empirical research issues that must be solved for prosodic modeling to improve: they are often too focused and model-dependent for academe, and too long-term for development organizations.


doi: 10.21437/Eurospeech.1997-3

Cite as: Santen, J.P.H.v. (1997) Prosodic modelling in text-to-speech synthesis. Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997), kn19-kn28, doi: 10.21437/Eurospeech.1997-3

@inproceedings{santen97_eurospeech,
  author={Jan P. H. van Santen},
  title={{Prosodic modelling in text-to-speech synthesis}},
  year=1997,
  booktitle={Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997)},
  pages={kn19-kn28},
  doi={10.21437/Eurospeech.1997-3}
}