ISCA Archive ICSLP 2002
ISCA Archive ICSLP 2002

Improving spoken language understanding using word confusion networks

Gokhan Tur, Jerry Wright, Allen Gorin, Giuseppe Riccardi, Dilek Hakkani-Tür

A natural language spoken dialog system includes a large vocabulary automatic speech recognition (ASR) engine, whose output is used as the input of a spoken language understanding component. Two challenges in such a framework are that the ASR component is far from being perfect and the users can say the same thing in very different ways. So, it is very important to be tolerant to recognition errors and some amount of orthographic variability. In this paper, we present our work on developing new methods and investigating various ways of robust recognition and understanding of an utterance. To this end, we exploit word-level confusion networks (sausages), obtained fromASR word graphs (lattices) instead of the ASR 1-best hypothesis. Using sausages with an improved confidence model, we decreased the call-type classification error rate for AT&T’s How May I Help You (HMIHY) natural dialog system by 38%.


doi: 10.21437/ICSLP.2002-374

Cite as: Tur, G., Wright, J., Gorin, A., Riccardi, G., Hakkani-Tür, D. (2002) Improving spoken language understanding using word confusion networks. Proc. 7th International Conference on Spoken Language Processing (ICSLP 2002), 1137-1140, doi: 10.21437/ICSLP.2002-374

@inproceedings{tur02_icslp,
  author={Gokhan Tur and Jerry Wright and Allen Gorin and Giuseppe Riccardi and Dilek Hakkani-Tür},
  title={{Improving spoken language understanding using word confusion networks}},
  year=2002,
  booktitle={Proc. 7th International Conference on Spoken Language Processing (ICSLP 2002)},
  pages={1137--1140},
  doi={10.21437/ICSLP.2002-374}
}