This paper describes an extension of maximum likelihood linear regression (MLLR) to hidden semi-Markov model (HSMM) and presents an adaptation technique of phoneme/state duration for an HMM-based speech synthesis system using HSMMs. The HSMMbased MLLR technique can realize the simultaneous adaptation of output distributions and state duration distributions. We focus on describing mathematical aspect of the technique and derive an algorithm of MLLR adaptation for HSMMs.
Cite as: Yamagishi, J., Masuko, T., Kobayashi, T. (2004) MLLR adaptation for hidden semi-Markov model based speech synthesis. Proc. Interspeech 2004, 1213-1216, doi: 10.21437/Interspeech.2004-449
@inproceedings{yamagishi04_interspeech, author={Junichi Yamagishi and Takashi Masuko and Takao Kobayashi}, title={{MLLR adaptation for hidden semi-Markov model based speech synthesis}}, year=2004, booktitle={Proc. Interspeech 2004}, pages={1213--1216}, doi={10.21437/Interspeech.2004-449} }