Abstract
This paper describes a sense tagging technique for the automatic sense tagging of running Chinese text. The system takes as input running Chinese text, and outputs sense disambiguated text. Whereas previous work ((Yarowsky, 1992; Gale et al., 1992; Gale et al., 1993)) relies heavily on the role of statistics, the present system makes use of Machine Readable/Tractable Dictionaries ((Wilks et al., 1990; Guo, 1995)) and an example-based reasoning technique ((Nagao, 1984; Sumita et al., 1990); to treat novel words, compound words, and phrases found in the input text.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bai, S. H., Xia, Y., Huang, C-L. 1992, Research on Chinese Grammatical Tagging Method for Chinese Corpus. In: Chen (ed.). Development in Machine Translation. Dianzi Gongye Publishing House, Beijing, pp. 408–418.
Fu, X-L. 1987. Xiandai Hanyu Tongyong Zidian. Waiyu Jiaoxue Yu Yanjiu Publishing House, Beijing.
Gale, W. A., Church, K. W. and Yarowsky, D. 1992. Work on Statistical Methods for Word Sense Disambiguation. Working Notes for AAAI Fall Symposium on Probabilistic Approaches to Natural Language. pp. 54–60.
Gale, W. A., Church, K. W. and Yarowsky, D. 1992. ‘A Method for Disambiguating Word Senses in a Large Corpus’. Computers and the Humanities. 26 (5–6): 415–439.
Guo, C. M. 1995. Machine Tractable Dictionaries: Design and Construction. Ablex, Norwood, NJ.
Mei, J. Zh. 1983. Tongyici Cilin. Shanghai Cishu Publishing House, Shanghai.
Nagao, M. 1984. A Framework of a Mechanical Translation Between Japanese and English by Analogy Example. In: Elithorn and Benerji (eds). Artificial and Human Intelligence. Elsevier, Amsterdam.
Sinclair, J. 1991. Monitor Corpora. Corpus, Concordance, Collocation. Oxford University Press. pp. 24–26.
Sumita, E., Iida, H. and Kohyama, H. 1990. Translating with Examples: a New Approach to Machine Translation. The Third International Conference on Theoretical and Methodological Issues in Machine Translation of Natural Language. Austin, Texas.
Wilks, Y., Fass, D., Guo, C. M., McDonald, J., Plate, T., and Slator, B. M. 1990. Providing Machine Tractable Dictionary Tools. Journal of Machine Translation. Vol. 5, no. 2, pp. 99–151.
Yarowsky, D. 1992. Word-sense Disambiguation Using Statistical Models of Roget’s Categories Trained on Large Corpora. COLINC-92. Nantes, France. pp. 454–460.
Zhang, W. 1986. Character Meanings and Word Meanings. China Wuzi Publishing House,Beijing.
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Tong, X., Huang, C., Guo, C. (1999). Example-Based Sense Tagging of Running Chinese Text. In: Armstrong, S., Church, K., Isabelle, P., Manzi, S., Tzoukermann, E., Yarowsky, D. (eds) Natural Language Processing Using Very Large Corpora. Text, Speech and Language Technology, vol 11. Springer, Dordrecht. https://doi.org/10.1007/978-94-017-2390-9_5
Download citation
DOI: https://doi.org/10.1007/978-94-017-2390-9_5
Publisher Name: Springer, Dordrecht
Print ISBN: 978-90-481-5349-7
Online ISBN: 978-94-017-2390-9
eBook Packages: Springer Book Archive