This paper presents research on integrating context-dependent durational knowledge into HMM-based speech recognition. The first part of the paper presents work on obtaining relations between the parameters of the context-free HMMs and their durational behaviour, in preparation for the context-dependent durational modelling presented in the second part. Duration integration is realised via rescoring in the post-processing step of our N-best monophone recogniser. We use the multi-speaker TIMIT database for our analyses.
Cite as: Wang, X., Bosch, L.F.M.t., Pols, L.C.W. (1996) Integration of context-dependent durational knowledge into HMM-based speech recognition. Proc. 4th International Conference on Spoken Language Processing (ICSLP 1996), 1073-1076, doi: 10.21437/ICSLP.1996-282
@inproceedings{wang96_icslp, author={Xue Wang and Louis F. M. ten Bosch and Louis C. W. Pols}, title={{Integration of context-dependent durational knowledge into HMM-based speech recognition}}, year=1996, booktitle={Proc. 4th International Conference on Spoken Language Processing (ICSLP 1996)}, pages={1073--1076}, doi={10.21437/ICSLP.1996-282} }