ISCA Archive Eurospeech 2003
ISCA Archive Eurospeech 2003

Enhanced tree clustering with single pronunciation dictionary for conversational speech recognition

Hua Yu, Tanja Schultz

Modeling pronunciation variation is key for recognizing conversational speech. Rather than being limited to dictionary modeling, we argue that triphone clustering is an integral part of pronunciation modeling. We propose a new approach called enhanced tree clustering. This approach, in contrast to traditional decision tree based state tying, allows parameter sharing across phonemes. We show that accurate pronunciation modeling can be achieved through efficient parameter sharing in the acoustic model. Combined with a single pronunciation dictionary, a 1.8% absolute word error rate improvement is achieved on Switchboard, a large vocabulary conversational speech recognition task.


doi: 10.21437/Eurospeech.2003-563

Cite as: Yu, H., Schultz, T. (2003) Enhanced tree clustering with single pronunciation dictionary for conversational speech recognition. Proc. 8th European Conference on Speech Communication and Technology (Eurospeech 2003), 1869-1872, doi: 10.21437/Eurospeech.2003-563

@inproceedings{yu03d_eurospeech,
  author={Hua Yu and Tanja Schultz},
  title={{Enhanced tree clustering with single pronunciation dictionary for conversational speech recognition}},
  year=2003,
  booktitle={Proc. 8th European Conference on Speech Communication and Technology (Eurospeech 2003)},
  pages={1869--1872},
  doi={10.21437/Eurospeech.2003-563}
}