Cross-lingual acoustic model adaptation based on transfer vector field smoothing with MAP

Saiko, Masahiro; Matsuda, Shigeki; Hanazawa, Ken; Isotani, Ryosuke; Hori, Chiori

doi:10.21437/Interspeech.2013-735

Cross-lingual acoustic model adaptation based on transfer vector field smoothing with MAP

Masahiro Saiko, Shigeki Matsuda, Ken Hanazawa, Ryosuke Isotani, Chiori Hori

We propose a method to adapt acoustic models for robust speech recognition in real environments using data from other languages. In real-world speech recognition systems, we can effectively adapt acoustic models using the speech data logged by the system. However, when developing a system for a new language, this step is impossible since we have no such speech data for it. Assuming that similar Gaussians of each language have similar transfer vectors, in our proposed method, we estimate the transfer vectors of each Gaussian of the language for acoustic model adaptation by the transfer vectors of the other language. We evaluated the performance of Indonesian acoustic models that were adapted using the transfer vectors estimated from Japanese transfer vectors. Our proposed method achieved a relative error reduction rate of 10.6% for real environmental speech data.

doi: 10.21437/Interspeech.2013-735

Cite as: Saiko, M., Matsuda, S., Hanazawa, K., Isotani, R., Hori, C. (2013) Cross-lingual acoustic model adaptation based on transfer vector field smoothing with MAP. Proc. Interspeech 2013, 3322-3326, doi: 10.21437/Interspeech.2013-735

@inproceedings{saiko13_interspeech,
  author={Masahiro Saiko and Shigeki Matsuda and Ken Hanazawa and Ryosuke Isotani and Chiori Hori},
  title={{Cross-lingual acoustic model adaptation based on transfer vector field smoothing with MAP}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={3322--3326},
  doi={10.21437/Interspeech.2013-735},
  issn={2308-457X}
}