ISCA Archive Interspeech 2007
ISCA Archive Interspeech 2007

Voice source and vocal tract variations as cues to emotional states perceived from expressive conversational speech

Hiroki Mori, Hideki Kasuya

Speech parameters originating from voice source and vocal tract were analyzed to find acoustic correlates of dimensional descriptions of emotional states. To achieve this goal best, we adopted the Utsunomiya University Spoken Dialogue Database, which was designed for studies on paralinguistic information in expressive conversational speech. Analyses for four female and two male speakers showed: (i) Prosodic parameters were highly correlated especially with the activation dimension, (ii) The aperiodicity-related voice source parameter showed that breathy phonation was mainly used in unpleasant utterances for three females, (iii) Due to smiling facial expression, formant frequencies were higher in pleasant utterances for a female.


doi: 10.21437/Interspeech.2007-49

Cite as: Mori, H., Kasuya, H. (2007) Voice source and vocal tract variations as cues to emotional states perceived from expressive conversational speech. Proc. Interspeech 2007, 102-105, doi: 10.21437/Interspeech.2007-49

@inproceedings{mori07_interspeech,
  author={Hiroki Mori and Hideki Kasuya},
  title={{Voice source and vocal tract variations as cues to emotional states perceived from expressive conversational speech}},
  year=2007,
  booktitle={Proc. Interspeech 2007},
  pages={102--105},
  doi={10.21437/Interspeech.2007-49}
}