Pronunciation lexicon modeling and design for Korean large vocabulary continuous speech recognition

Lee, Kyong-Nim; Chung, Minhwa

doi:10.21437/Interspeech.2004-576

Pronunciation lexicon modeling and design for Korean large vocabulary continuous speech recognition

Kyong-Nim Lee, Minhwa Chung

In this paper, we describe a pronunciation lexicon model which is especially useful for constructing morpheme-based pronunciation lexicon to improve the performance of a Korean LVCSR. There are a lot of pronunciation variations occurring at morpheme boundaries in continuous speech. For modeling of cross-morpheme pronunciation variations, we usually used a context-dependent multiple pronunciation lexicon with possible multiple phonetic transcriptions for each word. Since phonemic context together with morphological category and morpheme boundary information affect Korean pronunciation variations, we have distinguished phonological rules that can be applied to phonemes in within-morpheme and crossmorpheme. However, pronunciation variations in morpheme boundaries are increasing the lexicon size; we have designed the optimized pronunciation lexicon which is decreasing the confusability and increasing pronunciation coverage. The results of Korean Broadcast News Transcription experiments show that a reduction of 18% in pronunciation lexicon size and an absolute reduction of 0.27% in WER from the same lexical entries were achieved by building a proposed pronunciation lexicon.

doi: 10.21437/Interspeech.2004-576

Cite as: Lee, K.-N., Chung, M. (2004) Pronunciation lexicon modeling and design for Korean large vocabulary continuous speech recognition. Proc. Interspeech 2004, 1537-1540, doi: 10.21437/Interspeech.2004-576

@inproceedings{lee04n_interspeech,
  author={Kyong-Nim Lee and Minhwa Chung},
  title={{Pronunciation lexicon modeling and design for Korean large vocabulary continuous speech recognition}},
  year=2004,
  booktitle={Proc. Interspeech 2004},
  pages={1537--1540},
  doi={10.21437/Interspeech.2004-576}
}