Abstract
In this paper we show several experiments motivated by the hypothesis that counting the number of relationships each synset has in WordNet 2.0 is related to the senses that are the most frequent (MFS), because MFS usually has a longer gloss, more examples of usage, more relationships with other words (synonyms, hyponyms), etc. We present a comparison of finding the MFS through the relationships in a semantic network (WordNet) versus measuring only the number of characters, words and other features in the gloss of each sense. We found that counting only inbound relationships is different to counting both inbound and outbound relationships, and that second order relationships are not so helpful, despite restricting them to be of the same kind. We analyze the contribution of each different kind of relationship in a synset; and finally, we present an analysis of the different cases where our algorithm is able to find the correct sense in SemCor, being different from the MFS listed in WordNet.
We thank the support of Instituto Politécnico Nacional, and the Mexican Government (CONACyT-SNI, SIP-IPN, COFAA-IPN, and BEIFI-IPN).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Bidirectional count of hyponyms yields 43.20 %, being lower as the unidirectional count shown in Table 4.
References
Calvo, H., Gelbukh, A.: Finding the most frequent sense of a word by the length of its definition. In: Gelbukh, A., Espinoza, F.C., Galicia-Haro, S.N. (eds.) MICAI 2014, Part I. LNCS, vol. 8856, pp. 1–8. Springer, Heidelberg (2014)
Hawker, T., Honnibal, M.: Improved default sense selection for word sense disambiguation. In: Proceedings of the 2006 Australasian Language Technology Workshop (ALTW2006), pp 11–17 (2006)
Lesk, M.: Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone. In: Proceedings of the 5th annual International Conference on Systems Documentation, pp. 24–26. ACM (1986)
Lin, D.: An information-theoretic definition of similarity. Int. Conf. Mach. Learn. 98, 296–304 (1998)
Marcus, M.P., Marcinkiewicz, M.A., Santorini, B.: Building a large annotated corpus of English: the Penn treebank. Comput. Linguist. 19(2), 313–330 (1993)
Màrquez, L., Taulé, M., Martí, M.A., García, M., Artigas, N., Real, F.J., Ferrés, D.: Senseval-3: the Spanish lexical sample task. In: Senseval-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, Barcelona, Spain, Association for Computational Linguistics (2004)
McCarthy, D., Koeling, R., Weeds, J.: Carroll unsupervised acquisition of predominant word senses. Comput. Linguist. 33(4), 553–590 (2007)
Mihalcea, R., Chklovski, T., Kilgarriff, A.: The Senseval-3 English lexical sample task. In: Senseval-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, pp. 25–28 (2004)
Miller, G., Leacock, C., Tengi, R., Bunker, R.T.: A semantic concordance. In: Proceedings of ARPA Workshop on Human Language Technology, pp. 303–308 (1993)
Miller, G.A., Chodorow, M., Landes, S., Leacock, C., Thomas, R.G.: Using a semantic concordance for sense identification. In: Proceedings of the ARPA Human Language Technology Workshop, pp. 240–243 (1994)
Snyder, B., Palmer, M.: The English all-words task. In: ACL 2004 Senseval-3 Workshop, Barcelona, Spain (2004)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Calvo, H., Gelbukh, A. (2015). Is the Most Frequent Sense of a Word Better Connected in a Semantic Network?. In: Huang, DS., Han, K. (eds) Advanced Intelligent Computing Theories and Applications. ICIC 2015. Lecture Notes in Computer Science(), vol 9227. Springer, Cham. https://doi.org/10.1007/978-3-319-22053-6_52
Download citation
DOI: https://doi.org/10.1007/978-3-319-22053-6_52
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-22052-9
Online ISBN: 978-3-319-22053-6
eBook Packages: Computer ScienceComputer Science (R0)