ISCA Archive Eurospeech 1995
ISCA Archive Eurospeech 1995

A statistical approach to language modelling for the ATIS task

Joshua Koppelman, Stephen Delia Pietra, Mark Epstein, Salim Roukos, Todd Ward

The goal of this research is to develop an effective natural language component for IBM's spoken language understanding system for the ATIS domain. We use training data to assign a probability distribution to the reference interpretation, the NLParse, which minimizes the observed perplexity of the test data. We limit our scope to deal only with those ATIS2 sentences which can be understood unambiguously out of context (the so-called "Class A" queries). The decoder component of the finished system will use the natural language probabilities to select the most probable NLParse translations for a given English input. The NLParse translation can then be deterministically converted to SQL to query the ATIS database for the correct answer. We use a number of different deleted interpolation and maximum entropy techniques to improve on the standard trigram model, and we achieve a reduction in test perplexity from 15.9 to 14.1 bits per item.


doi: 10.21437/Eurospeech.1995-445

Cite as: Koppelman, J., Pietra, S.D., Epstein, M., Roukos, S., Ward, T. (1995) A statistical approach to language modelling for the ATIS task. Proc. 4th European Conference on Speech Communication and Technology (Eurospeech 1995), 1785-1788, doi: 10.21437/Eurospeech.1995-445

@inproceedings{koppelman95_eurospeech,
  author={Joshua Koppelman and Stephen Delia Pietra and Mark Epstein and Salim Roukos and Todd Ward},
  title={{A statistical approach to language modelling for the ATIS task}},
  year=1995,
  booktitle={Proc. 4th European Conference on Speech Communication and Technology (Eurospeech 1995)},
  pages={1785--1788},
  doi={10.21437/Eurospeech.1995-445}
}