ISCA Archive Eurospeech 1997

On representation of fundamental frequency of speech for prosody analysis using reliability function

Mitsuru Nakai, Hiroshi Shimodaira

This paper presents a method that provides a new prosodic feature, the 'F0 reliability field', based on a reliability function of the fundamental frequency (F0). The proposed method does not employ any correction process for the F0 estimation errors that occur during automatic F0 extraction. When this feature is applied as a score function in prosodic analyses such as prosodic structure estimation or superpositional modeling of prosodic commands, such prosodic information could be acquired with higher accuracy. The feature has been applied to the 'F0 template matching method', which detects accent phrase boundaries in Japanese continuous speech. The experimental results show that, compared with the conventional F0 contour, the proposed feature overcomes the harmful influence of F0 errors.
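The idea of scoring candidate F0 values without first committing to a single, possibly erroneous, F0 estimate can be illustrated with a minimal sketch. The abstract does not define the reliability function, so the code below assumes a clipped normalized autocorrelation as a stand-in; the names `reliability_field` and `template_score`, the frame sizes, and the F0 search range are all hypothetical and are not taken from the paper.

```python
import math

def reliability_field(signal, sample_rate, frame_len, hop,
                      f0_min=60.0, f0_max=400.0):
    """Return (lags, field); field[t][k] scores lag lags[k] in frame t.

    Assumption: reliability is the normalized autocorrelation,
    clipped to [0, 1]. The paper's actual function may differ.
    """
    lag_min = int(sample_rate / f0_max)
    lag_max = int(sample_rate / f0_min)
    lags = list(range(lag_min, lag_max + 1))
    field = []
    for start in range(0, len(signal) - frame_len - lag_max + 1, hop):
        frame = signal[start:start + frame_len]
        energy0 = sum(x * x for x in frame)
        row = []
        for lag in lags:
            shifted = signal[start + lag:start + lag + frame_len]
            energy1 = sum(x * x for x in shifted)
            denom = math.sqrt(energy0 * energy1)
            corr = sum(a * b for a, b in zip(frame, shifted))
            row.append(max(0.0, corr / denom) if denom > 0 else 0.0)
        field.append(row)
    return lags, field

def template_score(field, lags, sample_rate, template_hz):
    """Mean reliability along a candidate F0 trajectory (one value per frame)."""
    total = 0.0
    for row, f0 in zip(field, template_hz):
        lag = round(sample_rate / f0)
        # Clamp the lag to the range covered by the field.
        k = min(max(lag - lags[0], 0), len(lags) - 1)
        total += row[k]
    return total / len(template_hz)

# Synthetic check: a 100 Hz sine sampled at 8 kHz for 0.2 s.
sample_rate = 8000
signal = [math.sin(2 * math.pi * 100 * n / sample_rate) for n in range(1600)]
lags, field = reliability_field(signal, sample_rate, frame_len=240, hop=80)

# Score a flat template at the true F0 and one at a wrong F0.
score_true = template_score(field, lags, sample_rate, [100.0] * len(field))
score_wrong = template_score(field, lags, sample_rate, [150.0] * len(field))
print(score_true, score_wrong)
```

In this sketch every frame keeps a score for each candidate lag, so a template is matched against the whole field rather than against a single extracted contour. A frame where a conventional extractor would pick the wrong peak therefore still contributes evidence for the correct trajectory.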


doi: 10.21437/Eurospeech.1997-88

Cite as: Nakai, M., Shimodaira, H. (1997) On representation of fundamental frequency of speech for prosody analysis using reliability function. Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997), 243-246, doi: 10.21437/Eurospeech.1997-88

@inproceedings{nakai97_eurospeech,
  author={Mitsuru Nakai and Hiroshi Shimodaira},
  title={{On representation of fundamental frequency of speech for prosody analysis using reliability function}},
  year=1997,
  booktitle={Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997)},
  pages={243--246},
  doi={10.21437/Eurospeech.1997-88}
}