ISCA Archive Interspeech 2011
ISCA Archive Interspeech 2011

Evaluation of tree-trellis based decoding in over-million LVCSR

Naoaki Ito, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda

Very large vocabulary continuous speech recognition (CSR) that can recognize every sentence is one of important goals in speech recognition. Several attempts have been made to achieve very large vocabulary CSR. However, very large vocabulary CSR using a tree-trellis based decoder has not been reported. We report the performance evaluation and improvement of the "Julius" treetrellis based decoder in large vocabulary CSR (LVCSR) involving more than one million vocabulary, referred to here as over-million LVCSR. Experiments indicated that Julius achieved a word accuracy of about 91% and a real time factor of about 2 in over-million LVCSR for Japanese newspaper speech transcription.


doi: 10.21437/Interspeech.2011-362

Cite as: Ito, N., Nankaku, Y., Lee, A., Tokuda, K. (2011) Evaluation of tree-trellis based decoding in over-million LVCSR. Proc. Interspeech 2011, 1937-1940, doi: 10.21437/Interspeech.2011-362

@inproceedings{ito11b_interspeech,
  author={Naoaki Ito and Yoshihiko Nankaku and Akinobu Lee and Keiichi Tokuda},
  title={{Evaluation of tree-trellis based decoding in over-million LVCSR}},
  year=2011,
  booktitle={Proc. Interspeech 2011},
  pages={1937--1940},
  doi={10.21437/Interspeech.2011-362}
}