ISCA Archive Interspeech 2014

Improving the performance of far-field speaker verification using multi-condition training: the case of GMM-UBM and i-vector systems

Anderson R. Avila, Milton Sarria-Paja, Francisco J. Fraga, Douglas O'Shaughnessy, Tiago H. Falk

While considerable work has been done to characterize the detrimental effects of channel variability on automatic speaker verification (ASV) performance, little attention has been paid to the effects of room reverberation. This paper investigates the effects of room acoustics on the performance of two far-field ASV systems: GMM-UBM (Gaussian mixture model - universal background model) and i-vector. We show that ASV performance is severely affected by reverberation, particularly for i-vector based systems. Three multi-condition training methods are then investigated to mitigate such detrimental effects. The first uses matched train/test speaker models based on estimated reverberation time (RT) values. The second uses two-condition training with clean and reverberant models. Lastly, a four-condition training setup is proposed in which models for clean, mild, moderate, and severe reverberation levels are used. Experimental results show that the first and third multi-condition training methods provide significant gains in performance relative to the baseline, with the latter being more suitable for practical resource-constrained far-field applications.
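As a rough illustration of the four-condition idea, the sketch below maps an estimated RT to one of the four reverberation levels named in the abstract and scores a test utterance against the speaker model for that level. It is not the authors' implementation: the abstract does not say how a model is chosen at test time, so selection by RT bin is an assumption, and the RT boundaries (`RT_BOUNDS`), the function names, and the per-condition scoring callables are all hypothetical.

```python
from bisect import bisect_right
from typing import Callable, Mapping, Tuple

# Hypothetical RT boundaries in seconds; the abstract does not give values.
RT_BOUNDS = (0.2, 0.5, 0.9)
# The four reverberation levels named in the abstract.
CONDITIONS = ("clean", "mild", "moderate", "severe")


def select_condition(rt_seconds: float) -> str:
    """Map an estimated reverberation time to one of the four conditions."""
    if rt_seconds < 0:
        raise ValueError("reverberation time must be non-negative")
    # bisect_right counts how many boundaries are <= rt_seconds,
    # which is the index of the matching condition.
    return CONDITIONS[bisect_right(RT_BOUNDS, rt_seconds)]


def verify(
    score_fns: Mapping[str, Callable[[object], float]],
    rt_seconds: float,
    features: object,
    threshold: float,
) -> Tuple[str, bool]:
    """Score features with the model matching the estimated RT.

    score_fns holds one scoring callable per condition (for example a
    GMM-UBM log-likelihood ratio or an i-vector similarity); these are
    placeholders supplied by the caller, not a real library API.
    """
    condition = select_condition(rt_seconds)
    score = score_fns[condition](features)
    return condition, score >= threshold
```

With this layout only four models per speaker need to be stored, and the RT estimate is used solely to pick among them.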


doi: 10.21437/Interspeech.2014-282

Cite as: Avila, A.R., Sarria-Paja, M., Fraga, F.J., O'Shaughnessy, D., Falk, T.H. (2014) Improving the performance of far-field speaker verification using multi-condition training: the case of GMM-UBM and i-vector systems. Proc. Interspeech 2014, 1096-1100, doi: 10.21437/Interspeech.2014-282

@inproceedings{avila14_interspeech,
  author={Anderson R. Avila and Milton Sarria-Paja and Francisco J. Fraga and Douglas O'Shaughnessy and Tiago H. Falk},
  title={{Improving the performance of far-field speaker verification using multi-condition training: the case of GMM-UBM and i-vector systems}},
  year=2014,
  booktitle={Proc. Interspeech 2014},
  pages={1096--1100},
  doi={10.21437/Interspeech.2014-282}
}