ABSTRACT
We present a language model consisting of a collection of costed bidirectional finite state automata associated with the head words of phrases. The model is suitable for incremental application of lexical associations in a dynamic programming search for optimal dependency tree derivations. We also present a model and algorithm for machine translation involving optimal "tiling" of a dependency tree with entries of a costed bilingual lexicon. Experimental results are reported comparing methods for assigning cost functions to these models. We conclude with a discussion of the adequacy of annotated linguistic strings as representations for machine translation.
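The "tiling" idea can be illustrated with a minimal sketch. It assumes that a lexicon entry covers a head word plus zero or more of its direct dependents, and that the cost of a tiling is the sum of its entry costs. The names `Node`, `Entry`, `best_tiling`, and the toy lexicon are hypothetical and are not taken from the paper. The sketch only selects entries and does not model target word order.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Node:
    """A dependency tree node: a word and its dependent subtrees."""
    word: str
    children: tuple = ()


@dataclass(frozen=True)
class Entry:
    """A costed bilingual lexicon entry (hypothetical format)."""
    head: str      # source head word matched by this entry
    deps: tuple    # words of direct dependents also covered by the entry
    target: str    # target-language string (toy data)
    cost: float    # lower is better


def best_tiling(node, lexicon):
    """Return (cost, entries) for the cheapest tiling of the subtree at node.

    Every node must be covered by exactly one entry. The cost is infinite
    when no tiling exists.
    """
    best = (float("inf"), [])
    for entry in lexicon:
        if entry.head != node.word:
            continue
        # Match each dependent word of the entry to a distinct child.
        remaining = list(node.children)
        covered = []
        for word in entry.deps:
            match = next((c for c in remaining if c.word == word), None)
            if match is None:
                break
            remaining.remove(match)
            covered.append(match)
        else:
            # Subtrees still to be tiled: uncovered children, plus the
            # dependents of every child that this entry covered.
            frontier = remaining + [g for c in covered for g in c.children]
            total, used = entry.cost, [entry]
            for sub in frontier:
                sub_cost, sub_used = best_tiling(sub, lexicon)
                total += sub_cost
                used = used + sub_used
            if total < best[0]:
                best = (total, used)
    return best


# Toy example (invented data): "show me the flights".
LEXICON = [
    Entry("show", (), "muestre", 2.0),
    Entry("show", ("me",), "muestreme", 1.5),
    Entry("me", (), "me", 1.0),
    Entry("flights", (), "vuelos", 1.0),
    Entry("flights", ("the",), "los vuelos", 1.25),
    Entry("the", (), "los", 0.5),
]
TREE = Node("show", (Node("me"), Node("flights", (Node("the"),))))

cost, tiles = best_tiling(TREE, LEXICON)
print(cost, [t.target for t in tiles])
```

In this toy case the two multi-word entries together cost less than any tiling built from single-word entries, so the search prefers them. A fuller implementation would memoize subtree results and attach ordering information to each entry.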
Head automata and bilingual tiling: translation with minimal representations