ISCA Archive Interspeech 2011
ISCA Archive Interspeech 2011

Spoken language recognition in the latent topic simplex

Kong Aik Lee, Chang Huai You, Ville Hautamäki, Anthony Larcher, Haizhou Li

This paper proposes the use of latent topic modeling for spoken language recognition, where a topic is defined as a discrete distribution over phone n-grams. The latent topics are trained in an unsupervised manner using the latent Dirichlet allocation (LDA) technique. Language recognition is then performed in a low dimensional simplex defined by the latent topics. We apply the Bhattacharyya measure to compute the n-gram similarity in the topic simplex. Our study shows that some of the latent topics are language specific while others exhibit multilingual characteristic. Experiment conducted on the NIST 2007 language detection task shows that language cues can be sufficiently preserved in the topic simplex.


doi: 10.21437/Interspeech.2011-734

Cite as: Lee, K.A., You, C.H., Hautamäki, V., Larcher, A., Li, H. (2011) Spoken language recognition in the latent topic simplex. Proc. Interspeech 2011, 2933-2936, doi: 10.21437/Interspeech.2011-734

@inproceedings{lee11h_interspeech,
  author={Kong Aik Lee and Chang Huai You and Ville Hautamäki and Anthony Larcher and Haizhou Li},
  title={{Spoken language recognition in the latent topic simplex}},
  year=2011,
  booktitle={Proc. Interspeech 2011},
  pages={2933--2936},
  doi={10.21437/Interspeech.2011-734}
}