ISCA Archive Eurospeech 2003
ISCA Archive Eurospeech 2003

Frequency distribution based weighted sub-band approach for classification of emotional/stressful content in speech

Mandar A. Rahurkar, John H.L. Hansen

In this paper we explore the use of nonlinear Teager Energy Operator based features derived from multi-resolution sub-band analysis for classification of emotional/stressful speech. We propose a novel scheme for automatic sub-band weighting in an effort towards developing a generic algorithm for understanding emotion or stress in speech. We evaluate the proposed algorithm using a corpus of audio material from a military stressful Soldier of the Quarter Board evaluation panel. We establish classification performance of emotional/stressful speech using an open speaker set with open test tokens. With the new frequency distribution based scheme, we obtain a relative detection error reduction of series 81.3% in stress speech, and a series 75.4% relative detection rate reduction in neutral speech detection error rate. The results suggest a important step forward in establishing an effective processing scheme for developing generic models of neutral and emotional speech.


doi: 10.21437/Eurospeech.2003-305

Cite as: Rahurkar, M.A., Hansen, J.H.L. (2003) Frequency distribution based weighted sub-band approach for classification of emotional/stressful content in speech. Proc. 8th European Conference on Speech Communication and Technology (Eurospeech 2003), 721-724, doi: 10.21437/Eurospeech.2003-305

@inproceedings{rahurkar03_eurospeech,
  author={Mandar A. Rahurkar and John H.L. Hansen},
  title={{Frequency distribution based weighted sub-band approach for classification of emotional/stressful content in speech}},
  year=2003,
  booktitle={Proc. 8th European Conference on Speech Communication and Technology (Eurospeech 2003)},
  pages={721--724},
  doi={10.21437/Eurospeech.2003-305}
}