ISCA Archive ICSLP 2002
ISCA Archive ICSLP 2002

Filter bank subtraction for robust speech recognition

Kazuo Onoe, Hiroyuki Segi, Takeshi Kobayakawa, Shoei Sato, Toru Imai, Akio Ando

In this paper, we propose a new technique of filter bank subtraction for robust speech recognition under various acoustic conditions. Spectral subtraction is a simple and useful technique for reducing the influence of additive noise. Conventional spectral subtraction assumes accurate estimation of the noise spectrum and no correlation between speech and noise. Those assumptions, however, are rarely satisfied in reality, leading to the degradation of speech recognition accuracy. Moreover, the recognition improvement attained by conventional methods is slight when the input SNR changes sharply. We propose a new method in which the output values of filter banks are used for noise estimation and subtraction. By estimating noise at each filter bank, instead of at each frequency point, the method alleviates the necessity for precise estimation of noise. We also take into consideration phase differences between the spectra of speech and noise in the subtraction. Recognition experiments on test sets at several SNRs showed that the filter bank subtraction technique improved the word accuracy significantly and got better results than conventional spectral subtraction on all the test sets. In other experiments, on recognizing speech from TV news field reports with environmental noise, the proposed subtraction method yielded better results than the conventional method.


doi: 10.21437/ICSLP.2002-27

Cite as: Onoe, K., Segi, H., Kobayakawa, T., Sato, S., Imai, T., Ando, A. (2002) Filter bank subtraction for robust speech recognition. Proc. 7th International Conference on Spoken Language Processing (ICSLP 2002), 1021-1024, doi: 10.21437/ICSLP.2002-27

@inproceedings{onoe02_icslp,
  author={Kazuo Onoe and Hiroyuki Segi and Takeshi Kobayakawa and Shoei Sato and Toru Imai and Akio Ando},
  title={{Filter bank subtraction for robust speech recognition}},
  year=2002,
  booktitle={Proc. 7th International Conference on Spoken Language Processing (ICSLP 2002)},
  pages={1021--1024},
  doi={10.21437/ICSLP.2002-27}
}