Evaluation of SPLICE on the Aurora 2 and 3 tasks

Droppo, Jasha; Deng, Li; Acero, Alex

doi:10.21437/ICSLP.2002-6

Evaluation of SPLICE on the Aurora 2 and 3 tasks

Jasha Droppo, Li Deng, Alex Acero

Stereo-based Piecewise Linear Compensation for Environments (SPLICE) is a general framework for removing distortions from noisy speech cepstra. It contains a non-parametric model for cepstral corruption, which is learned from two channels of training data. We evaluate SPLICE on both the Aurora 2 and 3 tasks. These tasks consist of digit sequences in five European languages. Noise corruption is both synthetic (Aurora 2) and realistic (Aurora 3). For both the Aurora 2 and 3 tasks, we use the same training and testing procedure provided with the corpora. By holding the back-end constant, we ensure that any increase in word accuracy is due to our front-end processing techniques. In the Aurora 2 task, we achieve a 76.86% average decrease in word error rate with clean acoustic models, and an overall improvement of 62.63%. For the Aurora 3 task, we achieve a 75.06% average decrease in word error rate for the high-mismatch experiment, and an overall improvement of 47.19%.

doi: 10.21437/ICSLP.2002-6

Cite as: Droppo, J., Deng, L., Acero, A. (2002) Evaluation of SPLICE on the Aurora 2 and 3 tasks. Proc. 7th International Conference on Spoken Language Processing (ICSLP 2002), 29-32, doi: 10.21437/ICSLP.2002-6

@inproceedings{droppo02_icslp,
  author={Jasha Droppo and Li Deng and Alex Acero},
  title={{Evaluation of SPLICE on the Aurora 2 and 3 tasks}},
  year=2002,
  booktitle={Proc. 7th International Conference on Spoken Language Processing (ICSLP 2002)},
  pages={29--32},
  doi={10.21437/ICSLP.2002-6}
}