Speech feature compensation based on pseudo stereo codebooks for robust speech recognition in additive noise environments

Hsieh, Tsung-hsueh; Hung, Jeih-weih

doi:10.21437/Interspeech.2007-96

In this paper, we propose several compensation approaches to alleviate the effect of additive noise on speech features for speech recognition. These approaches are simple yet efficient noise reduction techniques that use online-constructed pseudo stereo codebooks to evaluate the statistics in both clean and noisy environments. The process yields transforms that bring noise-corrupted speech features closer to their clean counterparts. We apply these compensation approaches to various well-known speech features, including mel-frequency cepstral coefficients (MFCC), autocorrelation mel-frequency cepstral coefficients (AMFCC) and perceptual linear prediction cepstral coefficients (PLPCC). Experiments on the Aurora-2 database show that the proposed approaches give all of these feature types a significant performance gain over both the baseline results and those obtained with conventional utterance-based cepstral mean and variance normalization (CMVN).
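
For readers unfamiliar with the CMVN comparison point named in the abstract, the following is a minimal sketch of conventional utterance-based cepstral mean and variance normalization. It illustrates only that reference technique, not the proposed codebook-based compensation, whose details the abstract does not give. The function name `cmvn`, the list-of-lists feature layout, and the `eps` guard are assumptions made for this illustration.

```python
import math


def cmvn(frames, eps=1e-8):
    """Utterance-based cepstral mean and variance normalization (sketch).

    frames: list of feature vectors, one per frame, all of equal length
            (hypothetical layout chosen for this example).
    eps:    small constant (an assumption) that avoids division by zero
            when a coefficient is constant over the utterance.
    Returns a new list of frames in which each coefficient has
    approximately zero mean and unit variance over the utterance.
    """
    if not frames:
        return []
    n = len(frames)
    dim = len(frames[0])
    # Per-coefficient mean over all frames of the utterance.
    means = [sum(f[d] for f in frames) / n for d in range(dim)]
    # Per-coefficient (population) standard deviation over the utterance.
    stds = [
        math.sqrt(sum((f[d] - means[d]) ** 2 for f in frames) / n)
        for d in range(dim)
    ]
    # Subtract the mean and divide by the standard deviation.
    return [
        [(f[d] - means[d]) / (stds[d] + eps) for d in range(dim)]
        for f in frames
    ]


if __name__ == "__main__":
    # Toy two-dimensional "cepstral" frames, not real speech features.
    utterance = [[1.0, 10.0], [3.0, 20.0], [5.0, 30.0]]
    for frame in cmvn(utterance):
        print(frame)
```

Because the statistics are computed per utterance, this normalization needs no clean reference data; the abstract reports that the proposed approaches outperform it on Aurora-2.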

Cite as: Hsieh, T.-h., Hung, J.-w. (2007) Speech feature compensation based on pseudo stereo codebooks for robust speech recognition in additive noise environments. Proc. Interspeech 2007, 242-245, doi: 10.21437/Interspeech.2007-96

@inproceedings{hsieh07_interspeech,
  author={Tsung-hsueh Hsieh and Jeih-weih Hung},
  title={{Speech feature compensation based on pseudo stereo codebooks for robust speech recognition in additive noise environments}},
  year=2007,
  booktitle={Proc. Interspeech 2007},
  pages={242--245},
  doi={10.21437/Interspeech.2007-96}
}