ISCA Archive Interspeech 2004
ISCA Archive Interspeech 2004

Weighting observation vectors for robust speech recognition in noisy environments

Zhenyu Xiong, Fang Zheng, Wenhu Wu

In this paper, we propose a novel approach to robust speech recognition in noisy environments by discriminating the observation vectors. In conventional HMM-based speech recognition, all the observation vectors are treated with equal importance no matter how the corresponding speech segment is corrupted with noise. Our approach proposed here modifies the conventional decoder by weighting the likelihood scores for different observation vectors based on the signal to noise ratios (SNRs) of the corresponding speech frames when the probabilities of generating a sequence of observations are being calculated for some models. The proposed approach combined with spectral subtraction is evaluated with four different kinds of noises added to the clean speech. The experimental results show the superior performance of the proposed method over the method where only the spectral subtraction is applied, especially in the median SNR environments.


doi: 10.21437/Interspeech.2004-631

Cite as: Xiong, Z., Zheng, F., Wu, W. (2004) Weighting observation vectors for robust speech recognition in noisy environments. Proc. Interspeech 2004, 2069-2072, doi: 10.21437/Interspeech.2004-631

@inproceedings{xiong04b_interspeech,
  author={Zhenyu Xiong and Fang Zheng and Wenhu Wu},
  title={{Weighting observation vectors for robust speech recognition in noisy environments}},
  year=2004,
  booktitle={Proc. Interspeech 2004},
  pages={2069--2072},
  doi={10.21437/Interspeech.2004-631}
}