Compact WFSA Based Language Model and Its Application in Statistical Machine Translation

Fu, Xiaoyin; Wei, Wei; Lu, Shixiang; Ke, Dengfeng; Xu, Bo

doi:10.1007/978-3-642-34456-5_15

Part of the book series: Communications in Computer and Information Science (CCIS, volume 333)

Included in the following conference series:

CCF International Conference on Natural Language Processing and Chinese Computing

Abstract

The authors explore fast query techniques for n-gram language models (LMs) in statistical machine translation (SMT) and propose a compact LM based on a weighted finite-state automaton (WFSA), motivated by the contextual features of the model query process. They demonstrate that WFSA-based querying effectively avoids redundant queries and accelerates query speed. They further show that a simple caching technique speeds up queries even more. Experimental results show that the method speeds up LM queries by 75% in relative terms. As the LM order increases, the performance benefit of the WFSA becomes more significant.
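The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, which is not shown on this page. It assumes a standard back-off n-gram model with log10 scores, in which every known context is an automaton state, every n-gram is a weighted arc, and a back-off weight is paid when moving to a shorter context. The class name `WfsaLM`, the toy model, the `unk_logprob` penalty, and the dictionary cache are all hypothetical choices made for illustration.

```python
class WfsaLM:
    """Toy back-off n-gram LM stored as a weighted finite-state automaton."""

    def __init__(self, ngrams, backoffs, order, unk_logprob=-10.0):
        self.order = order
        self.unk_logprob = unk_logprob  # assumed penalty for unseen words
        self.backoffs = backoffs        # state -> log10 back-off weight
        self.arcs = {(): {}}            # state -> {word: log10 probability}
        for ngram, logprob in ngrams.items():
            self.arcs.setdefault(ngram[:-1], {})[ngram[-1]] = logprob
        self.states = set(self.arcs) | set(backoffs)
        self.cache = {}                 # (state, word) -> (logprob, next state)
        self.hits = 0                   # number of cache hits so far

    def step(self, state, word):
        """Consume one word from `state`; return (log10 prob, next state)."""
        key = (state, word)
        if key in self.cache:
            self.hits += 1
            return self.cache[key]
        total, ctx = 0.0, state
        # Follow back-off arcs until some context has an arc for `word`.
        while word not in self.arcs.get(ctx, {}):
            if not ctx:  # no arc even from the empty context: unseen word
                result = (total + self.unk_logprob, ())
                self.cache[key] = result
                return result
            total += self.backoffs.get(ctx, 0.0)
            ctx = ctx[1:]
        total += self.arcs[ctx][word]
        # Next state: the longest suffix of the new history that is a state.
        keep = self.order - 1
        nxt = (state + (word,))[-keep:] if keep else ()
        while nxt not in self.states:
            nxt = nxt[1:]
        result = (total, nxt)
        self.cache[key] = result
        return result

    def score(self, words, state=()):
        """Sum log10 probabilities, carrying the state from word to word."""
        total = 0.0
        for word in words:
            logprob, state = self.step(state, word)
            total += logprob
        return total


# Hypothetical toy trigram model (log10 values chosen only for illustration).
TOY_NGRAMS = {
    ("the",): -1.0, ("cat",): -2.0, ("sat",): -2.0,
    ("the", "cat"): -0.5, ("cat", "sat"): -0.3,
    ("the", "cat", "sat"): -0.1,
}
TOY_BACKOFFS = {("the",): -0.4, ("cat",): -0.2, ("the", "cat"): -0.1}

if __name__ == "__main__":
    lm = WfsaLM(TOY_NGRAMS, TOY_BACKOFFS, order=3)
    print(lm.score(["the", "cat", "sat"]))
    print(lm.score(["the", "cat", "sat"]), "cache hits:", lm.hits)
```

Because `score` carries the automaton state forward, each word costs one arc lookup from the current state instead of a fresh search for the full n-gram. Repeated `(state, word)` pairs are served from the cache.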

This work was supported by the 863 Program in China (No. 2011AA01A207).

Author information

Authors and Affiliations

Interactive Digital Media Technology Research Center, Institute of Automation, Chinese Academy of Sciences, Beijing, China
Xiaoyin Fu, Wei Wei, Shixiang Lu, Dengfeng Ke & Bo Xu

Editor information

Editors and Affiliations

Microsoft Research Asia, Beijing, China
Ming Zhou
Soochow University, 215006, Suzhou, China
Guodong Zhou
Institute of Computer Science & Technology, Peking University, 100871, Beijing, China
Dongyan Zhao & Lei Zou
Institute of Computing Technology, Chinese Academy of Sciences, No.6 Kexueyuan South Road, Haidian District, 100190, Beijing, China
Qun Liu

About this paper

Cite this paper

Fu, X., Wei, W., Lu, S., Ke, D., Xu, B. (2012). Compact WFSA Based Language Model and Its Application in Statistical Machine Translation. In: Zhou, M., Zhou, G., Zhao, D., Liu, Q., Zou, L. (eds) Natural Language Processing and Chinese Computing. NLPCC 2012. Communications in Computer and Information Science, vol 333. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34456-5_15

DOI: https://doi.org/10.1007/978-3-642-34456-5_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34455-8
Online ISBN: 978-3-642-34456-5
eBook Packages: Computer Science, Computer Science (R0)
