ISCA Archive ICSLP 2002

Combining search spaces of heterogeneous recognizers for improved speech recognition

Xiang Li, Rita Singh, Richard M. Stern

In speech recognition systems, information from multiple sources, such as different feature streams or acoustic models, can be combined in many different ways to yield better recognition performance. In theory, the best performance is obtained by using all sources of information simultaneously, in a system capable of using them in parallel. Such systems, however, are extremely complex and difficult to construct. In this paper we propose a simple alternative criterion for combination that factorizes the complex recognizer into several simple recognizers, each based on a single source of information. We use this criterion in simple experiments that combine lattices from recognizers built with different feature streams. Experimental results obtained on five different corpora show that the proposed method is effective in improving recognition performance.


doi: 10.21437/ICSLP.2002-166

Cite as: Li, X., Singh, R., Stern, R.M. (2002) Combining search spaces of heterogeneous recognizers for improved speech recognition. Proc. 7th International Conference on Spoken Language Processing (ICSLP 2002), 405-408, doi: 10.21437/ICSLP.2002-166

@inproceedings{li02d_icslp,
  author={Xiang Li and Rita Singh and Richard M. Stern},
  title={{Combining search spaces of heterogeneous recognizers for improved speech recognition}},
  year=2002,
  booktitle={Proc. 7th International Conference on Spoken Language Processing (ICSLP 2002)},
  pages={405--408},
  doi={10.21437/ICSLP.2002-166}
}