Using missing feature theory to actively select features for robust speech recognition with interruptions, filtering and noise

Lippmann, Richard; Carlson, Beth A.

doi:10.21437/Eurospeech.1997-6

Using missing feature theory to actively select features for robust speech recognition with interruptions, filtering and noise

Richard Lippmann, Beth A. Carlson

Speech recognizers trained with quiet wide-band speech degrade dramatically with high-pass, low-pass, and notch filtering, with noise, and with interruptions of the speech input. A new and simple approach to compensate for these degradations is presented which uses mel-filter-bank (MFB) magnitudes as input features and missing feature theory to dynamically modify the probability computations performed in Hidden Markov Model recognizers. When the identity of features missing due to filtering or masking is provided, recognition accuracy on a large talker-independent digit recognition task often rises from below 50% to above 95%. These promising results suggest future work to continuously estimate SNR's within MFB bands for dynamic adaptation of speech recognizers.

doi: 10.21437/Eurospeech.1997-6

Cite as: Lippmann, R., Carlson, B.A. (1997) Using missing feature theory to actively select features for robust speech recognition with interruptions, filtering and noise. Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997), kn37-kn40, doi: 10.21437/Eurospeech.1997-6

@inproceedings{lippmann97_eurospeech,
  author={Richard Lippmann and Beth A. Carlson},
  title={{Using missing feature theory to actively select features for robust speech recognition with interruptions, filtering and noise}},
  year=1997,
  booktitle={Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997)},
  pages={kn37-kn40},
  doi={10.21437/Eurospeech.1997-6}
}