MLLR adaptation for hidden semi-Markov model based speech synthesis

Yamagishi, Junichi; Masuko, Takashi; Kobayashi, Takao

doi:10.21437/Interspeech.2004-449

This paper describes an extension of maximum likelihood linear regression (MLLR) to hidden semi-Markov models (HSMMs) and presents a technique for adapting phoneme/state durations in an HMM-based speech synthesis system that uses HSMMs. The HSMM-based MLLR technique adapts output distributions and state duration distributions simultaneously. We focus on the mathematical aspects of the technique and derive an MLLR adaptation algorithm for HSMMs.
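
As a rough illustration of what simultaneous adaptation means here, the following minimal sketch applies an affine transform to the mean of an output distribution and, separately, to the mean of a state duration distribution. It assumes Gaussian distributions whose means are adapted in the usual MLLR fashion. The function names, variable names, and toy numbers are hypothetical and do not come from the paper, and the maximum likelihood estimation of the transforms themselves is not shown.

```python
# Illustrative sketch only: names and toy values are assumptions, not the paper's notation.

def adapt_output_mean(W, b, mu):
    """Apply an affine transform to an output mean vector: W * mu + b."""
    return [sum(w_ij * m_j for w_ij, m_j in zip(row, mu)) + b_i
            for row, b_i in zip(W, b)]

def adapt_duration_mean(scale, bias, m):
    """Apply an affine transform to a scalar state-duration mean: scale * m + bias."""
    return scale * m + bias

# Toy transform for a 2-dimensional output mean (assumed values).
W = [[1.0, 0.0],
     [0.0, 2.0]]
b = [0.5, -1.0]
mu = [2.0, 3.0]

adapted_mu = adapt_output_mean(W, b, mu)

# Toy transform for a state-duration mean measured in frames (assumed values).
adapted_dur = adapt_duration_mean(1.5, 2.0, 4.0)

print(adapted_mu, adapted_dur)
```

In a real system the same transform would typically be shared across many states, so that a small amount of adaptation data can move all of their means at once.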

Cite as: Yamagishi, J., Masuko, T., Kobayashi, T. (2004) MLLR adaptation for hidden semi-Markov model based speech synthesis. Proc. Interspeech 2004, 1213-1216, doi: 10.21437/Interspeech.2004-449

@inproceedings{yamagishi04_interspeech,
  author={Junichi Yamagishi and Takashi Masuko and Takao Kobayashi},
  title={{MLLR adaptation for hidden semi-Markov model based speech synthesis}},
  year=2004,
  booktitle={Proc. Interspeech 2004},
  pages={1213--1216},
  doi={10.21437/Interspeech.2004-449}
}