Memory-Efficient Modeling and Search Techniques for Hardware ASR Decoders

Price, Michael; Chandrakasan, Anantha; Glass, James

doi:10.21437/Interspeech.2016-287

Memory-Efficient Modeling and Search Techniques for Hardware ASR Decoders

Michael Price, Anantha Chandrakasan, James Glass

This paper gives an overview of acoustic modeling and search techniques for low-power embedded ASR decoders. Our design decisions prioritize memory bandwidth, which is the main driver in system power consumption. We evaluate three acoustic modeling approaches — Gaussian mixture model (GMM), subspace GMM (SGMM) and deep neural network (DNN) — and identify tradeoffs between memory bandwidth and recognition accuracy. We also present an HMM search scheme with WFST compression and caching, predictive beam width control, and a word lattice. Our results apply to embedded system implementations using microcontrollers, DSPs, FPGAs, or ASICs.

doi: 10.21437/Interspeech.2016-287

Cite as: Price, M., Chandrakasan, A., Glass, J. (2016) Memory-Efficient Modeling and Search Techniques for Hardware ASR Decoders. Proc. Interspeech 2016, 1893-1897, doi: 10.21437/Interspeech.2016-287

@inproceedings{price16_interspeech,
  author={Michael Price and Anantha Chandrakasan and James Glass},
  title={{Memory-Efficient Modeling and Search Techniques for Hardware ASR Decoders}},
  year=2016,
  booktitle={Proc. Interspeech 2016},
  pages={1893--1897},
  doi={10.21437/Interspeech.2016-287}
}