A Simple Way to Extract I-vector from Normalized Statastics

Lei, Zhenchun; Luo, Jian; Yang, Yingen

doi:10.1007/978-3-319-12484-1_41

Zhenchun Lei²⁰,
Jian Luo²⁰ &
Yingen Yang²⁰

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 8833))

Included in the following conference series:

Chinese Conference on Biometric Recognition

2260 Accesses

Abstract

In the i-vector model, the utterance statistics are extracted from features using universal background model. The utterance is mapped to a vector in the total variability space, which is called i-vector. The total variability space provides a basis to obtain a low dimensional fixed-length representation of a speech utterance. But, the processing is complicated for the interweaving of the statistics and machine learning method. So, we considered separating them and proposed a simple way to extract i-vector by classical principal component analysis, factor analysis and independent component analysis from normalized statistics. The results on NIST 2008 telephone data show that the performance is very close to the traditional method and they can be improved obviously after score fusion.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Kinnunena, T., Li, H.: An overview of text-independent speaker recognition: From features to supervectors. Speech Communication 52(1), 12–40 (2010)
Article Google Scholar
Dehak, N., Kenny, P., Dehak, R., Dumouchel, P., Ouellet, P.: Front-End Factor Analysis for Speaker Verification. IEEE Transactions on Audio, Speech and Language Processing 19(4), 788–798 (2011)
Article Google Scholar
Reynolds, D.A., Quatieri, T., Dunn, R.: Speaker verification using adapted Gaussian mixture models. Digital Signal Processing 10(3) (2000)
Google Scholar
Campbell, W.M., Sturim, D.E., Reynolds, D.A., Solomonoff, A.: SVM based speaker verification using a GMM supervector kernel and NAP variability compensation. In: Proc. ICASSP, vol. 1, pp. 97–100 (2006)
Google Scholar
Kenny, P., Boulianne, G., Ouellet, P., Dumouchel, P.: Joint factor analysis versus eigenchannels in speaker recognition. IEEE Transactions on Audio, Speech and Language Processing 15(4), 1435–1447 (2007)
Article Google Scholar
Kenny, P., Gilles, B., Pierre, D.: Eigenvoice Modeling With Sparse Training Data. IEEE Trans. Speech and Audio Proc. 13(3), 345–354 (2005)
Article Google Scholar
Tipping, M., Bishop, C.: Mixtures of probabilistic principal component analyzers. Neural Computation 11, 435–474 (1999)
Article Google Scholar
Glembekl, O., et al.: Simplification And Optimization of I-Vector Extraction. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Prague, Czech Republic, May 22-27, pp. 4516–4519 (2011)
Google Scholar
Li, M., et al.: Speaker Verification Using Simplified and Supervised I-Vector Modeling. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (2013)
Google Scholar
Prince, S.J.D., Elder, J.H.: Probabilistic Linear Discriminant Analysis for Inferences About Identity. In: IEEE 11th International Conference on Computer Vision (2007)
Google Scholar
Jiang, Y., Lee, K.A., Tang, Z., Ma, B., Larcher, A., Li, H.: PLDA Modeling in I-vector and Supervector Space for Speaker Verification. In: Annual Conference of the International Speech Communication Association, Interspeech (2012)
Google Scholar
Machlica, L., Zajıc, Z.: An efficient implementation of Probabilistic Linear Discriminant Analysis. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (2013)
Google Scholar
Garcia-Romero, D., Espy-Wilson, C.Y.: Analysis of i-vector length normalization in speaker recognition systems. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 249–252 (2011)
Google Scholar
Martinez, A.M., Kak, A.C.: PCA versus LDA. IEEE Transactions on Pattern Analysis and Machine Intelligence 23(2), 228–233 (2004)
Article Google Scholar
Johnson, R.A., Wichern, D.W.: Applied Multivariate Statistical Analysis, 6th edn. Pearson Education (2007)
Google Scholar
Hyvärinen, A., Oja, E.: Independent Component Analysis: Algorithms and Applications. Neural Networks 13(4-5), 411–430 (2000)
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer and Information Engineering, Jiangxi Normal University, Nanchang, China
Zhenchun Lei, Jian Luo & Yingen Yang

Authors

Zhenchun Lei
View author publications
You can also search for this author in PubMed Google Scholar
Jian Luo
View author publications
You can also search for this author in PubMed Google Scholar
Yingen Yang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, P.O. Box 2728, 100190, Beijing, China
Zhenan Sun
Institute of Computing Technology, Chinese Academy of Sciences, No.6 Kexueyuan South Road, P.O. Box 2704, 100190, HaiDian, Beijing, China
Shiguang Shan
School of Information Science and Engineering, Shenyang University of Technology, No.111, Shenliao West Road, Economic and Technological Development Zone, 110870, Shenyang, China
Haifeng Sang & Weiqi Yuan &
Software Engineering Institute, Tsinghua University, No. 1 Tsinghua Yuan, Haidian District, 100084, Beijing, China
Jie Zhou
School of Computer Science and Engineering, Beihang University, No. 37 Xueyuan Road, Haidian District, 100191, Beijing, China
Yunhong Wang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lei, Z., Luo, J., Yang, Y. (2014). A Simple Way to Extract I-vector from Normalized Statastics. In: Sun, Z., Shan, S., Sang, H., Zhou, J., Wang, Y., Yuan, W. (eds) Biometric Recognition. CCBR 2014. Lecture Notes in Computer Science, vol 8833. Springer, Cham. https://doi.org/10.1007/978-3-319-12484-1_41

Download citation

DOI: https://doi.org/10.1007/978-3-319-12484-1_41
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12483-4
Online ISBN: 978-3-319-12484-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics