Abstract
In the i-vector model, the utterance statistics are extracted from features using universal background model. The utterance is mapped to a vector in the total variability space, which is called i-vector. The total variability space provides a basis to obtain a low dimensional fixed-length representation of a speech utterance. But, the processing is complicated for the interweaving of the statistics and machine learning method. So, we considered separating them and proposed a simple way to extract i-vector by classical principal component analysis, factor analysis and independent component analysis from normalized statistics. The results on NIST 2008 telephone data show that the performance is very close to the traditional method and they can be improved obviously after score fusion.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Kinnunena, T., Li, H.: An overview of text-independent speaker recognition: From features to supervectors. Speech Communication 52(1), 12–40 (2010)
Dehak, N., Kenny, P., Dehak, R., Dumouchel, P., Ouellet, P.: Front-End Factor Analysis for Speaker Verification. IEEE Transactions on Audio, Speech and Language Processing 19(4), 788–798 (2011)
Reynolds, D.A., Quatieri, T., Dunn, R.: Speaker verification using adapted Gaussian mixture models. Digital Signal Processing 10(3) (2000)
Campbell, W.M., Sturim, D.E., Reynolds, D.A., Solomonoff, A.: SVM based speaker verification using a GMM supervector kernel and NAP variability compensation. In: Proc. ICASSP, vol. 1, pp. 97–100 (2006)
Kenny, P., Boulianne, G., Ouellet, P., Dumouchel, P.: Joint factor analysis versus eigenchannels in speaker recognition. IEEE Transactions on Audio, Speech and Language Processing 15(4), 1435–1447 (2007)
Kenny, P., Gilles, B., Pierre, D.: Eigenvoice Modeling With Sparse Training Data. IEEE Trans. Speech and Audio Proc. 13(3), 345–354 (2005)
Tipping, M., Bishop, C.: Mixtures of probabilistic principal component analyzers. Neural Computation 11, 435–474 (1999)
Glembekl, O., et al.: Simplification And Optimization of I-Vector Extraction. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Prague, Czech Republic, May 22-27, pp. 4516–4519 (2011)
Li, M., et al.: Speaker Verification Using Simplified and Supervised I-Vector Modeling. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (2013)
Prince, S.J.D., Elder, J.H.: Probabilistic Linear Discriminant Analysis for Inferences About Identity. In: IEEE 11th International Conference on Computer Vision (2007)
Jiang, Y., Lee, K.A., Tang, Z., Ma, B., Larcher, A., Li, H.: PLDA Modeling in I-vector and Supervector Space for Speaker Verification. In: Annual Conference of the International Speech Communication Association, Interspeech (2012)
Machlica, L., Zajıc, Z.: An efficient implementation of Probabilistic Linear Discriminant Analysis. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (2013)
Garcia-Romero, D., Espy-Wilson, C.Y.: Analysis of i-vector length normalization in speaker recognition systems. In: Annual Conference of the International Speech Communication Association (Interspeech), pp. 249–252 (2011)
Martinez, A.M., Kak, A.C.: PCA versus LDA. IEEE Transactions on Pattern Analysis and Machine Intelligence 23(2), 228–233 (2004)
Johnson, R.A., Wichern, D.W.: Applied Multivariate Statistical Analysis, 6th edn. Pearson Education (2007)
Hyvärinen, A., Oja, E.: Independent Component Analysis: Algorithms and Applications. Neural Networks 13(4-5), 411–430 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Lei, Z., Luo, J., Yang, Y. (2014). A Simple Way to Extract I-vector from Normalized Statastics. In: Sun, Z., Shan, S., Sang, H., Zhou, J., Wang, Y., Yuan, W. (eds) Biometric Recognition. CCBR 2014. Lecture Notes in Computer Science, vol 8833. Springer, Cham. https://doi.org/10.1007/978-3-319-12484-1_41
Download citation
DOI: https://doi.org/10.1007/978-3-319-12484-1_41
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12483-4
Online ISBN: 978-3-319-12484-1
eBook Packages: Computer ScienceComputer Science (R0)