ISCA Archive Interspeech 2009
ISCA Archive Interspeech 2009

Lexical and phonetic modeling for Arabic automatic speech recognition

Long Nguyen, Tim Ng, Kham Nguyen, Rabih Zbib, John Makhoul

In this paper, we describe the use of either words or morphemes as lexical modeling units and the use of either graphemes or phonemes as phonetic modeling units for Arabic automatic speech recognition (ASR). We designed four Arabic ASR systems: two wordbased systems and two morpheme-based systems. Experimental results using these four systems show that they have comparable state-of-the-art performance individually, but the more sophisticated morpheme-based system tends to be the best. However, they seem to complement each other quite well within the ROVER system combination framework to produce substantially-improved combined results.


doi: 10.21437/Interspeech.2009-244

Cite as: Nguyen, L., Ng, T., Nguyen, K., Zbib, R., Makhoul, J. (2009) Lexical and phonetic modeling for Arabic automatic speech recognition. Proc. Interspeech 2009, 712-715, doi: 10.21437/Interspeech.2009-244

@inproceedings{nguyen09_interspeech,
  author={Long Nguyen and Tim Ng and Kham Nguyen and Rabih Zbib and John Makhoul},
  title={{Lexical and phonetic modeling for Arabic automatic speech recognition}},
  year=2009,
  booktitle={Proc. Interspeech 2009},
  pages={712--715},
  doi={10.21437/Interspeech.2009-244}
}