ISCA Archive Interspeech 2012
ISCA Archive Interspeech 2012

Evaluation of many-to-many alignment algorithm by automatic pronunciation annotation using web text mining

Keigo Kubo, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano

The need for robust pronunciation annotation over out-of-vocabulary (OOV) words has been increasing with the development of an application that deals with proper nouns and brand-new words, such as Voice Search. In robust pronunciation annotation over OOV words, the alignment between graphemes and phonemes is vital data. For a many-to-many alignment algorithm between graphemes and phonemes, we describe its problems and methods to overcome them. An evaluation experiment of a many-to-many alignment by automatic pronunciation annotation using Web text mining is also performed. That experimental result shows that the proposed many-to-many alignment produces an alignment that has the high generalization ability for OOV words while avoiding degradation of the accuracy of the pronunciation annotation compared with the conventional approach.

Index Terms: string alignment, out-of-vocabulary word, pronunciation annotation


doi: 10.21437/Interspeech.2012-608

Cite as: Kubo, K., Kawanami, H., Saruwatari, H., Shikano, K. (2012) Evaluation of many-to-many alignment algorithm by automatic pronunciation annotation using web text mining. Proc. Interspeech 2012, 2318-2321, doi: 10.21437/Interspeech.2012-608

@inproceedings{kubo12b_interspeech,
  author={Keigo Kubo and Hiromichi Kawanami and Hiroshi Saruwatari and Kiyohiro Shikano},
  title={{Evaluation of many-to-many alignment algorithm by automatic pronunciation annotation using web text mining}},
  year=2012,
  booktitle={Proc. Interspeech 2012},
  pages={2318--2321},
  doi={10.21437/Interspeech.2012-608}
}