An acoustic subword unit approach to non-linguistic speech feature identification

Afify, Mohamed; Gong, Yifan; Haton, Jean-Paul

doi:10.21437/Eurospeech.1997-603

An acoustic subword unit approach to non-linguistic speech feature identification

Mohamed Afify, Yifan Gong, Jean-Paul Haton

Automatic identification of non-linguistic speech features (e.g. the speaker or the language of an utterance) are currently of practical interest. In this paper, we first impose a set of requirements that we think a statistical model used in non-linguistic feature identification should satisfy. Namely, these requirements are capturing both short and long term correlations in addition to maintaining a certain acoustic resolution. A model satisfying these requirements, and in the same time having the attractive feature of requiring no transcribed speech material during training is proposed. Experimental evaluation of the approach in speaker recognition on the TIMIT database is presented, where recognition rates up to 99.2 % are achieved.

doi: 10.21437/Eurospeech.1997-603

Cite as: Afify, M., Gong, Y., Haton, J.-P. (1997) An acoustic subword unit approach to non-linguistic speech feature identification. Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997), 2291-2294, doi: 10.21437/Eurospeech.1997-603

@inproceedings{afify97b_eurospeech,
  author={Mohamed Afify and Yifan Gong and Jean-Paul Haton},
  title={{An acoustic subword unit approach to non-linguistic speech feature identification}},
  year=1997,
  booktitle={Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997)},
  pages={2291--2294},
  doi={10.21437/Eurospeech.1997-603}
}