Abstract
A common practice in operational Machine Translation (MT) and Natural Language Processing (NLP) systems is to assume that a verb has a fixed number of senses and rely on a precompiled lexicon to achieve large coverage. This paper demonstrates that this assumption is too weak to cope with the similar problems of lexical divergences between languages and unexpected uses of words that give rise to cases outside of the pre-compiled lexicon coverage. We first examine the lexical divergences between English verbs and Chinese verbs. We then focus on a specific lexical selection problem—translating Englishchange-of-state verbs into Chinese verb compounds. We show that an accurate translation depends not only on information about the participants, but also on contextual information. Therefore, selectional restrictions on verb arguments lack the necessary power for accurate lexical selection. Second, we examine verb representation theories and practices in MT systems and show that under the fixed sense assumption, the existing representation schemes are not adequate for handling these lexical divergences and extending existing verb senses to unexpected usages. We then propose a method of verb representation based on conceptual lattices which allows the similarities among different verbs in different languages to be quantitatively measured. A prototype system UNICON implements this theory and performs more accurate MT lexical selection for our chosen set of verbs. An additional lexical module for UNICON is also provided that handles sense extension.
Similar content being viewed by others
References
Church, K. 1991. Some Statistical Opportunities in Speech and Language. InProceedings of 23rd Symposium on the Interface, Computing Science and Statistics, Seattle, Washington, April.
Dong, Z.D. 1987. Language, Translation, Machine Translation. InProceedings of International Forum on Today's Translation, Hong Kong.
Dong, Z.D. 1994. A Categorization System of Chinese Movement Concepts. InProceedings of International Conference on Chinese Computing, Singapore.
Dorr, B.J. 1993.Machine Translation: A View from the Lexicon. MIT Press, Cambridge, Massachusetts.
Egedi, D. 1991. Learning Through Metaphor. In D. Nerin-Aime, editor,Knowledge Modeling and Expertise Transfer. ISO Press, Washington, DC.
Feng, Z. and X. Zhou, editors. 1989.New Chinese Multi-Purpose Dictionary. International Culture Publisher.
Grishman, R. and J. Sterling. 1989. Preference Semantics for Message Understanding. InProceedings of the DARPA Speech and Natural Language Workshop, Cape Cod, MA.
Henne, H. 1977.A handbook on Chinese Language Structure. Columbia University Press.
Hobbs, J. 1986. Overview of the TACITUS Project.Computational Linguistics, 12(3).
Jackendoff, R. 1990.Semantic Structures. MIT Press.
Kurohashi, S. and M. Nagao. 1992. Dynamic Programming Method for Analyzing Conjunctive Structures in Japanese. InProceedings of the 14th International Conference on Computational Linguistics (COLING-92), Nantes, France.
Langacker, R.W. 1988. An Overview of Cognitive Grammar. In B. Rudzka-Ostyn, editor,Topics in Cognitive Grammar. John Benjamins Publishing Company, Amsterdam/Philadelphia.
Levin, B. 1987. Approaches to Lexical Semantic representation. In B. Levin, editor,Readings for Lexical Semantics. Northwestern University.
Levin, B. 1993.English Verb Classes and Alternations: A Preliminary Investigation. University of Chicago Press, Chicago.
Nirenburg, S., J. Carbonell, M. Tomita, and K. Goodman. 1992.Machine Translation: A Knowledge-Based Approach. Morgan Kaufmann Publishers.
Onyshkevich, B. and S. Nirenburg. 1995. A Lexicon for Knowledge-Based MT.Machine Translation, 10(1–2).
P. Resnik. 1993.Selection and Information: A Class-Based Approach to Lexical Relationships. Ph.D. thesis, Department of Information and Computer Science, University of Pennsylvania.
Palmer, M. 1990. Customizing Verb Definitions for Specific Semantic Domains.Machine Translation, 5(30).
Palmer, M. and A. Polguère. 1995. A Preliminary Lexical and Conceptual Analysis of BREAK: a Computational Perspective. In P. Saint-Dizier and E. Viegas, editors,Computational Lexical Semantics. Cambridge University Press.
Procter, P. 1978.Longman Dictionary of Contemporary English. Longman, London.
Pustejovsky, J. 1991. The generative lexicon.Computational Linguistics, 17(4).
Wilks, Y. 1986. An intelligent analyzer and understander of English. In B.J. Grosz, K. Sparck-Jones, and B.L. Webber, editors,Readings in Natural Language Processing. Morgan Kaufmann.
Wu, D. 1991.On Serial verb Construction. Ph.D. thesis, Department of Information and Computer Science, University of Maryland.
Wu, Z. 1994.Verb Semantics and Lexical Selection. Ph.D. thesis, Department of Information System and Computer Science, National University of Singapore.
Wu, Z. and M. Palmer. 1994. Verb Semantics and Lexical Selection. InProceedings of ACL94, New Mexico State University, USA, May.
Y.R.Chao. 1968.A Grammar of Spoken Chinese. University of California Press.
Yuan, C., C. Huang, Y. Peng, and Z. Guo. 1994. Construction and Application of the Chinese Morpheme Database. InProceedings of International Conference on Chinese Computing, Singapore.
Zernik, U. 1987. The Self-Extending Phrasal Lexicon.Computational Linguistics, 13(3–4).
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Palmer, M., Wu, Z. Verb semantics for English-Chinese translation. Mach Translat 10, 59–92 (1995). https://doi.org/10.1007/BF00997232
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/BF00997232