Dynamic features for segmental speech recognition

Harte, Naomi; Vaseghi, Saeed V.; Milner, Ben

doi:10.21437/ICSLP.1996-241

Dynamic features for segmental speech recognition

Naomi Harte, Saeed V. Vaseghi, Ben Milner

Speech models and features that emphasise the dynamic aspects of speech can provide improved speech recognition. The cepstral time matrix has been established as a successful method of encoding dynamics. This paper extends this set of dynamic features, considering cepstral time features on both a segmental and subsegmental level. This offers the potential of using a conditional pdf for the state observation within a HMM and incorporating this into the training stage. Methods of linear discriminative analysis are applied to the new feature set to identify the subset of features making the greatest contribution to the task of recognition.

doi: 10.21437/ICSLP.1996-241

Cite as: Harte, N., Vaseghi, S.V., Milner, B. (1996) Dynamic features for segmental speech recognition. Proc. 4th International Conference on Spoken Language Processing (ICSLP 1996), 933-936, doi: 10.21437/ICSLP.1996-241

@inproceedings{harte96_icslp,
  author={Naomi Harte and Saeed V. Vaseghi and Ben Milner},
  title={{Dynamic features for segmental speech recognition}},
  year=1996,
  booktitle={Proc. 4th International Conference on Spoken Language Processing (ICSLP 1996)},
  pages={933--936},
  doi={10.21437/ICSLP.1996-241}
}