ISCA Archive Interspeech 2004

Speaker dependent model order selection of spectral envelopes

Matthias Wölfel

This work introduces a maximum-likelihood based model order (MO) selection technique for spectral envelopes that applies speaker dependent adaptation in the feature space, similar to vocal tract length normalization. Speech recognition systems based on spectral envelopes use a fixed MO for the underlying linear parametric model. Using a fixed MO across different speakers or channels might not be optimal. To address this problem, we investigated the use of warped and scaled minimum variance distortionless response spectral estimation techniques with speaker dependent MOs chosen by a maximum-likelihood criterion. In experiments on the Translanguage English Database, we show a relative word error rate improvement of 1.9% over the fixed MO and of 3.5% over traditional Mel-frequency cepstral coefficients.
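
The selection step described in the abstract can be pictured as a grid search over candidate model orders, in the same spirit as the warp-factor search used in vocal tract length normalization. The following is only a minimal sketch under stated assumptions, not the paper's implementation: the function names, the candidate range, and the toy scoring function are all hypothetical. In a real system the score would come from the recognizer's acoustic model evaluated on features extracted with the given model order.

```python
from typing import Callable, Iterable, Sequence

Frames = Sequence[Sequence[float]]


def select_model_order(
    frames: Frames,
    candidate_orders: Iterable[int],
    log_likelihood: Callable[[Frames, int], float],
) -> int:
    """Return the candidate order with the highest log-likelihood score."""
    best_order = None
    best_score = float("-inf")
    for order in candidate_orders:
        # Hypothetical scorer: features for this order scored by a model.
        score = log_likelihood(frames, order)
        if score > best_score:
            best_order, best_score = order, score
    if best_order is None:
        raise ValueError("candidate_orders must not be empty")
    return best_order


def toy_log_likelihood(frames: Frames, order: int) -> float:
    # Toy stand-in for an acoustic-model score (assumption, not from the
    # paper): a concave curve that peaks at order 30.
    return -len(frames) * (order - 30) ** 2


# Placeholder data for one speaker: 100 frames of 13 coefficients each.
speaker_frames = [[0.0] * 13 for _ in range(100)]
chosen = select_model_order(speaker_frames, range(20, 41, 5), toy_log_likelihood)
```

Here `chosen` is the order that the toy scorer favors for this speaker. Running the same search once per speaker gives each speaker its own model order, which is the speaker dependent part of the idea.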


doi: 10.21437/Interspeech.2004-23

Cite as: Wölfel, M. (2004) Speaker dependent model order selection of spectral envelopes. Proc. Interspeech 2004, 2949-2952, doi: 10.21437/Interspeech.2004-23

@inproceedings{wolfel04_interspeech,
  author={Matthias Wölfel},
  title={{Speaker dependent model order selection of spectral envelopes}},
  year=2004,
  booktitle={Proc. Interspeech 2004},
  pages={2949--2952},
  doi={10.21437/Interspeech.2004-23}
}