ISCA Archive Interspeech 2011
ISCA Archive Interspeech 2011

Estimation of perceptual spaces for speaker identities based on the cross-lingual discrimination task

Minoru Tsuzaki, Keiichi Tokuda, Hisashi Kawai, Jinfu Ni

This paper reconfirms that talker identity can be transmitted across languages. Talker discrimination was examined in the ABX paradigm, where the stimuli A and B were utterances by different talkers in the same language and the stimulus X was an utterance by either of A or B in the different language. The average hit rate of this discrimination task was as high as 0.89. The mutual distance matrices were generated using the discrimination index, d. By applying the multidimensional scaling, three-dimensional perceptual spaces were estimated. The features related with loudness and spectral centroid had high contribution to the perceptual dimensions.


doi: 10.21437/Interspeech.2011-71

Cite as: Tsuzaki, M., Tokuda, K., Kawai, H., Ni, J. (2011) Estimation of perceptual spaces for speaker identities based on the cross-lingual discrimination task. Proc. Interspeech 2011, 157-160, doi: 10.21437/Interspeech.2011-71

@inproceedings{tsuzaki11_interspeech,
  author={Minoru Tsuzaki and Keiichi Tokuda and Hisashi Kawai and Jinfu Ni},
  title={{Estimation of perceptual spaces for speaker identities based on the cross-lingual discrimination task}},
  year=2011,
  booktitle={Proc. Interspeech 2011},
  pages={157--160},
  doi={10.21437/Interspeech.2011-71}
}