ISCA Archive Interspeech 2011
ISCA Archive Interspeech 2011

Optimization of the Gaussian mixture model evaluation on GPU

Jan Vaněk, Jan Trmal, Josef V. Psutka, Josef Psutka

In this paper we present a highly optimized implementation of Gaussian mixture acoustic model evaluation algorithm. Evaluation of these likelihoods is one of the most computationally intensive parts of automatics speech recognizers but it can be well-parallelized and offloaded to GPU devices. Our approach offers significant speed-up compared to the recently published approaches, since it exploits the GPU architecture better. All the recent implementations were programmed either in CUDA or OpenCL GPU programming frameworks. We present results for both; CUDA as well as OpenCL.

Results suggest that even very large acoustic models can be utilized in real-time speech recognition engines on computers and laptops equipped with a low-end GPU. Optimization of acoustic likelihoods computation on GPU enables to use the remaining GPU resources for offloading of other compute-intensive parts of LVCSR decoder. Other possible use of the freed GPU resources is to evaluate several acoustic models at the same time and use fusion techniques or model selection techniques to improve the quality of resulting conditional likelihoods under diverse conditions.


doi: 10.21437/Interspeech.2011-195

Cite as: Vaněk, J., Trmal, J., Psutka, J.V., Psutka, J. (2011) Optimization of the Gaussian mixture model evaluation on GPU. Proc. Interspeech 2011, 1737-1740, doi: 10.21437/Interspeech.2011-195

@inproceedings{vanek11_interspeech,
  author={Jan Vaněk and Jan Trmal and Josef V. Psutka and Josef Psutka},
  title={{Optimization of the Gaussian mixture model evaluation on GPU}},
  year=2011,
  booktitle={Proc. Interspeech 2011},
  pages={1737--1740},
  doi={10.21437/Interspeech.2011-195}
}