ISCA Archive Eurospeech 2003
ISCA Archive Eurospeech 2003

Robust speaker identification using posterior union models

Ji Ming, Darryl Stewart, Philip Hanna, Pat Corr, Jack Smith, Saeed Vaseghi

This paper investigates the problem of speaker identification in noisy conditions, assuming that there is no prior knowledge about the noise. To confine the effect of the noise on recognition, we use a multi-stream approach to characterize the speech signal, assuming that while all of the feature streams may be affected by the noise, there may be some streams that are less severely affected and thus still provide useful information about the speaker. Recognition decisions are based on the feature streams that are uncontaminated or least contaminated, thereby reducing the effect of the noise on recognition. We introduce a novel statistical method, the posterior union model, for selecting reliable feature streams. An advantage of the union model is that knowledge of the structure of the noise is not needed, thereby providing robustness to time-varying unpredictable noise corruption. We have tested the new method on the TIMIT database with additive corruption from real-world nonstationary noise; the results obtained are encouraging.


doi: 10.21437/Eurospeech.2003-722

Cite as: Ming, J., Stewart, D., Hanna, P., Corr, P., Smith, J., Vaseghi, S. (2003) Robust speaker identification using posterior union models. Proc. 8th European Conference on Speech Communication and Technology (Eurospeech 2003), 2645-2648, doi: 10.21437/Eurospeech.2003-722

@inproceedings{ming03_eurospeech,
  author={Ji Ming and Darryl Stewart and Philip Hanna and Pat Corr and Jack Smith and Saeed Vaseghi},
  title={{Robust speaker identification using posterior union models}},
  year=2003,
  booktitle={Proc. 8th European Conference on Speech Communication and Technology (Eurospeech 2003)},
  pages={2645--2648},
  doi={10.21437/Eurospeech.2003-722}
}