Fast likelihood computation methods for continuous mixture densities in large vocabulary speech recognition

Ortmanns, Stefan; Firzlaff, Thorsten; Ney, Hermann

doi:10.21437/Eurospeech.1997-62

This paper studies algorithms for reducing the computational effort of the mixture density calculations in HMM-based speech recognition systems. These likelihood calculations take about 70% of the total recognition time in the RWTH system for large vocabulary continuous speech recognition. To reduce the computational cost of the likelihood calculations, we investigate several space partitioning methods. A detailed comparison of these techniques is given on the North American Business Corpus (NAB'94) for a 20 000-word task. As a result, the so-called projection search algorithm in combination with the VQ method reduces the cost of likelihood computation by a factor of about 8 with no significant loss in word recognition accuracy.
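To make the idea of space partitioning concrete, the following is a minimal sketch and not the implementation evaluated in the paper. It assumes diagonal-covariance Gaussian densities and scores a mixture by its best single density (a maximum approximation); both are assumptions made here for illustration. The function names, the window width `eps`, and the fallback to a full evaluation are hypothetical. The preselection step keeps only densities whose mean lies within `eps` of the observation on every axis, which is the general flavour of a projection-style search; the VQ component mentioned in the abstract is not shown.

```python
import math

def neg_log_gauss(x, mean, var):
    """Negative log-density of a diagonal-covariance Gaussian."""
    return 0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )

def score_full(x, densities):
    """Baseline: evaluate every density and keep the lowest score."""
    return min(-math.log(w) + neg_log_gauss(x, m, v) for w, m, v in densities)

def score_preselected(x, densities, eps):
    """Evaluate only densities whose mean is within eps of x on every axis.

    all() stops at the first axis that fails, so most distant densities
    are rejected after looking at one or two components.
    """
    candidates = [
        (w, m, v) for w, m, v in densities
        if all(abs(xi - mi) <= eps for xi, mi in zip(x, m))
    ]
    if not candidates:  # assumed fallback: nothing inside the window
        candidates = densities
    return score_full(x, candidates), len(candidates)

# Toy mixture: (weight, mean, variance) triples, made up for illustration.
densities = [
    (0.5, [0.0, 0.0], [1.0, 1.0]),
    (0.3, [5.0, 5.0], [1.0, 1.0]),
    (0.2, [-4.0, 3.0], [1.0, 1.0]),
]
x = [0.2, -0.1]
full = score_full(x, densities)
fast, evaluated = score_preselected(x, densities, eps=1.0)
```

In this toy case only one of the three densities survives the window test, and the preselected score matches the full one because the nearest density is also the best-scoring one. With a window that is too narrow the best density can be missed, which is the accuracy trade-off the abstract refers to.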

Cite as: Ortmanns, S., Firzlaff, T., Ney, H. (1997) Fast likelihood computation methods for continuous mixture densities in large vocabulary speech recognition. Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997), 139-142, doi: 10.21437/Eurospeech.1997-62

@inproceedings{ortmanns97_eurospeech,
  author={Stefan Ortmanns and Thorsten Firzlaff and Hermann Ney},
  title={{Fast likelihood computation methods for continuous mixture densities in large vocabulary speech recognition}},
  year=1997,
  booktitle={Proc. 5th European Conference on Speech Communication and Technology (Eurospeech 1997)},
  pages={139--142},
  doi={10.21437/Eurospeech.1997-62}
}