Synthesizing dialogue speech of Japanese based on the quantitative analysis of prosodic features

Hirose, Keikichi; Sakata, Mayumi; Kawanami, Hiromichi

doi:10.21437/ICSLP.1996-60

Synthesizing dialogue speech of Japanese based on the quantitative analysis of prosodic features

Keikichi Hirose, Mayumi Sakata, Hiromichi Kawanami

Through the analyses of fundamental frequency contours and speech rates of dialogue speech and also of read speech, prosodic rules were derived for the synthesis of spoken dialogue. As for the fundamental frequency contours, they were first decomposed into phrase and accent components based on the superpositional model, and then their command magnitudes/amplitudes were analyzed by the method of multiple regression analysis. As for the speech rate, the reduction rate of mora duration from reading-style to dialogue-style was calculated. After normalizing the sentence length, the mean reduction rate was calculated as an average over utterances without complicated syntactic structure. Results of the above analyses were incorporated in the prosodic rules for dialog speech synthesis. Using a formerly developed formant speech synthesizer, synthesis was conducted using both the former rules of read speech and the newly developed rules. A hearing test showed that the new rules can produce better prosody as dialogue speech.

doi: 10.21437/ICSLP.1996-60

Cite as: Hirose, K., Sakata, M., Kawanami, H. (1996) Synthesizing dialogue speech of Japanese based on the quantitative analysis of prosodic features. Proc. 4th International Conference on Spoken Language Processing (ICSLP 1996), 378-381, doi: 10.21437/ICSLP.1996-60

@inproceedings{hirose96_icslp,
  author={Keikichi Hirose and Mayumi Sakata and Hiromichi Kawanami},
  title={{Synthesizing dialogue speech of Japanese based on the quantitative analysis of prosodic features}},
  year=1996,
  booktitle={Proc. 4th International Conference on Spoken Language Processing (ICSLP 1996)},
  pages={378--381},
  doi={10.21437/ICSLP.1996-60}
}