Abstract.
Speaker recognition has been an active research area for many years. Methods to represent and quantify information embedded in speech signal are termed as features of the signal. The features are obtained, modeled and stored for further reference when the system is to be tested. Decision whether to accept or reject speakers are taken based on parameters of the data modeling techniques. Real world offers various degradations to the signal that hamper the signal quality. The degradations may be due to ambient background noise, reverberation or multispeaker scenario. This paper presents a survey of various feature extraction, data modeling methods, metrics that are used to take the decisions and methods that can be used to preprocess the degraded data that have been used to perform the task of speaker recognition.
© 2013 by Walter de Gruyter Berlin Boston
This article is distributed under the terms of the Creative Commons Attribution Non-Commercial License, which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.