ISCA Archive ICSLP 2000
ISCA Archive ICSLP 2000

On enhancing katz-smoothing based back-off language model

Jian Wu, Fang Zheng

Though the statistical language modeling plays an important role in speech recognition, there are still problems that are difficult to be solved such as the sparseness of training data. Generally, two kinds of smoothing approaches, namely the back-off model and the interpolated model, have been proposed to solve the problem of the impreciseness of language models caused by the sparseness of training data. By expanding the idea of interpolation model to Katz-smoothing based re-estimation of the seen word pairs, a back-off model based modified method is proposed, referred to as the enhanced Katz smoothing with deleted interpolation (EKSWDI). A uniform expression and two simplified versions for this modified model are also given. Experiments on a Chinese pinyin-to-character conversion system and the perplexity measure show that the proposed model has a better performance than the Katz smoothing method does.


doi: 10.21437/ICSLP.2000-49

Cite as: Wu, J., Zheng, F. (2000) On enhancing katz-smoothing based back-off language model. Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000), vol. 1, 198-201, doi: 10.21437/ICSLP.2000-49

@inproceedings{wu00_icslp,
  author={Jian Wu and Fang Zheng},
  title={{On enhancing katz-smoothing based back-off language model}},
  year=2000,
  booktitle={Proc. 6th International Conference on Spoken Language Processing (ICSLP 2000)},
  pages={vol. 1, 198-201},
  doi={10.21437/ICSLP.2000-49}
}