ISCA Archive ICSLP 2002
ISCA Archive ICSLP 2002

Eigenvoices for HMM-based speech synthesis

Kengo Shichiri, Atsushi Sawabe, Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura

This paper describes an eigenvoice technique for an HMM-based speech synthesis system which can synthesize speech with various voice qualities. In the eigenvoice technique, which has successfully been applied to fast speaker adaptation in an HMM based speech recognition, a large number of speaker dependent HMM sets are represented by a few parameters through a dimensionality reduction technique, e.g., PCA. In this paper, we propose an eigenvoice technique for speech synthesis, and apply it to an HMM-based speech synthesis system in which spectrum and F0 are modeled by HMMs, and synthetic speech generated fromHMMs themselves. The generated spectrum and F0 pattern are shown, and the relation between weights for eigenvoices and voice quality is discussed.


doi: 10.21437/ICSLP.2002-390

Cite as: Shichiri, K., Sawabe, A., Yoshimura, T., Tokuda, K., Masuko, T., Kobayashi, T., Kitamura, T. (2002) Eigenvoices for HMM-based speech synthesis. Proc. 7th International Conference on Spoken Language Processing (ICSLP 2002), 1269-1272, doi: 10.21437/ICSLP.2002-390

@inproceedings{shichiri02_icslp,
  author={Kengo Shichiri and Atsushi Sawabe and Takayoshi Yoshimura and Keiichi Tokuda and Takashi Masuko and Takao Kobayashi and Tadashi Kitamura},
  title={{Eigenvoices for HMM-based speech synthesis}},
  year=2002,
  booktitle={Proc. 7th International Conference on Spoken Language Processing (ICSLP 2002)},
  pages={1269--1272},
  doi={10.21437/ICSLP.2002-390}
}