Language-Pair Scoring Method Based on SVM for Language Recognition

Article Preview

Abstract:

Support vector machine one vs. rest classification and Gaussian backend classifier are commonly used for language recognition. The LDA matrix of Gaussian backend classifier is often singular when the traditional one vs. one classification is used, and the recognition performance is very poor. In this paper, we present language-pair scoring method, and the performance improved significantly after re-modeling the one vs. one scores of support vector machine. Our experiments are carried on NIST 2011 language recognition evaluation 30s data corpus. Results indicate that the proposed language-pair scoring method obtains equal to or even better performance than traditional one vs. rest classification for ivector and SVM-GSV language recognition systems. The experimental period is also shorten, and the linear fusion result of the proposed method and one vs. rest obtains significantly better performance.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

737-741

Citation:

Online since:

July 2013

Export:

Price:

[1] P. Fung and T. Schultz. Multilingual spoken language processing. Signal Processing Magazine, IEEE, 2008, vol. 1053, p.5888.

DOI: 10.1109/msp.2008.918417

Google Scholar

[2] P. Torres-Carrasquillo, E. Singer, M. Kohler, et al. Approaches to language identification using Gaussian mixture models and shifted delta cepstral features. Seventh International Conference on Spoken Language Processing, Citeseer, (2002).

DOI: 10.21437/icslp.2002-74

Google Scholar

[3] W. Campbell, J. Campbell, D. Reynolds, et al. Support vector machines for speaker and language recognition. Computer Speech & Language, 2006, vol. 20, no. 2-3, pp.210-229.

DOI: 10.1016/j.csl.2005.06.003

Google Scholar

[4] Yun Lei, Hansen, J.H.L. Dialect Classification via Text-Independent Training and Testing for Arabic, Spanish, and Chinese[J]. Audio, Speech, and Language Processing, IEEE Transactions on , 2011, p.85 – 96.

DOI: 10.1109/tasl.2010.2045184

Google Scholar

[5] P. A. Torres-Carrasquillo, D. A. Reynolds, and J. R. Deller Jr. Language identification using Gaussian Mixture Model Tokenization. In ICASSP, Orlando, Fl., USA, (2002).

DOI: 10.1109/icassp.2002.1005850

Google Scholar

[6] M. Schmidt, H. Gish. Text-independent Speaker Identification. Singal Processing Magazine, 1994(11): 18-32.

DOI: 10.1109/79.317924

Google Scholar

[7] W. M. Cambell. A Covariance Kernel for SVM Language Recognition. in Proc. ICASSP[C]. Las Vegas, Nevada, USA, 2008. pp.4141-4144.

Google Scholar

[8] BALAKRISHNAMA S, GANAPATHIRAJU A. Linear Discriminant Analysis-A Brief Tutorial, (1998).

Google Scholar

[9] Ronan Collobertand and SamyBengio. SVMTorch: Support vector machines for large-scale regression problems. Journal of Machine Learning Research, 2001, vol. 1, pp.143-160.

Google Scholar

[10] 2011 Language Recognition Evaluation. http: /www. nist. gov/itl/iad/mig/lre11. cfm.

Google Scholar

[11] F. Castaldo, D. Colibro, E. Dalmasso, et al. Compensation of nuisance factors for speaker and language recognition. Audio, Speech, and Language Processing, IEEE Transactions on, 2007, 5(7): 1969-(1978).

DOI: 10.1109/tasl.2007.901823

Google Scholar

[12] D Martinez, O Plchot, L Burget, et al. Language recognition in ivectors space. Proceedings of Interspeech. Firenze, Italy, (2011).

DOI: 10.21437/interspeech.2011-329

Google Scholar