ISCA Archive Interspeech 2014
ISCA Archive Interspeech 2014

Mixture of latent words language models for domain adaptation

Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi

This paper introduces a novel language model (LM) adaptation method based on mixture of latent word language models (LWLMs). LMs are often constructed as mixture of n-gram models, whose mixture weights are optimized using target domain data. However, n-gram mixture modeling is not flexible enough for domain adaptation because model merger is conducted on the observed word space. Since the words in out-of-domain LMs often differ from those in the target domain LM, it is hard for out-of domain LMs to offer adequate adaptation performance. Our solution is to carry out model merger in a latent variable space created from LWLMs. The latent variables in the LWLMs are represented as specific words selected from the observed word space, so LWLMs can share a common latent variable space and we can realize mixture modeling with consideration of the latent variable space. Following this change, this paper also describes a method to estimate mixture weights for LWLM mixture modeling. We use a sampling technique based on the Bayesian criterion in place of the conventional expectation maximization algorithm. Our experiments show that the LWLM mixture modeling is more effective than n-gram mixture modeling.


doi: 10.21437/Interspeech.2014-349

Cite as: Masumura, R., Asami, T., Oba, T., Masataki, H., Sakauchi, S. (2014) Mixture of latent words language models for domain adaptation. Proc. Interspeech 2014, 1425-1429, doi: 10.21437/Interspeech.2014-349

@inproceedings{masumura14_interspeech,
  author={Ryo Masumura and Taichi Asami and Takanobu Oba and Hirokazu Masataki and Sumitaka Sakauchi},
  title={{Mixture of latent words language models for domain adaptation}},
  year=2014,
  booktitle={Proc. Interspeech 2014},
  pages={1425--1429},
  doi={10.21437/Interspeech.2014-349}
}