Abstract
Previous definitions of semantic similarity can be classified into two approaches. The node(information content)-based approach uses an entropy measure that is computed on the basis of child node population. The edge-based approach involves the use of the number of edges between two concepts within a hierarchical conceptual structure. The edge-based distance method is more intuitive, while the node-based information content approach is more theoretically sound. We consider a combined model that is derived from the edge-based notion with the addition of the information content. In this paper, we propose a method for computerized conceptual similarity calculation in WordNet space. The proposed method provides a degree of conceptual dissimilarity between two concepts. It gives a higher correlation value with a criterion based on human similarity judgment.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Miller, G.A.: Introduction to WordNet: An On-line Lexical Database. International Journal of Lexicography (1990)
Sussna: WordSense Disambiguation for Free-text Indexing Using a Massive Semantic Network. In: Proceedings of the Second International Conference on Information and Knowledge Management, CIKM 1993 (1993)
Resnik, P.: Using Information Content to Evaluate Semantic Similarity in a Taxonomy. In: Proceedings of the 14th International Joint Conference on Artificial Intelligence (1995)
Richardson, R., Smeaton, A.F.: UsingWordNet as a Knowledge-Base for Measuring Semantic Similarity between Words. In: Working paper, CA-1294, School of Computer Applications. Dublin City University, Ireland (1995)
Agirre, E., Rigau, G.: Word sense disambiguation using conceptual density. In: Proceedings of the 16th International Conference on Computational Linguistics, pp. 16–22. Copenhagen (1996)
Jiang, J.J., Conrath, D.W.: Semantic Similarity Based Corpus Statistics and Lexical Taxonomy. In: Proc. Of International Conference Research on Computational Linguistics (1997)
Park, Y.C., Golshani, F., Panchanathan, S.: Conceptualization and Ontology:Tools for Efficient Storage and Retrieval of Semantic Visual Information. In: Internet Multimedia Management Systems Conference (2000)
Guarino, N., Welty, C.: Supporting Ontological Analysis of Taxonomic Relationships. Data and Knowledge Engineering (2001) (in press)
Budanitsky, A., Hirst, G.: Semantic Distance inWordNet:An Experimental, Applicationoriented Evaluation of Five Measures. In: Workshop on WordNet and Other Lexical Resources, in the North American Chapter of the Association for Computational Linguistics (NAACL 2000), Pittsburgh, PA (June 2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cho, M., Choi, J., Kim, P. (2003). An Efficient Computational Method for Measuring Similarity between Two Conceptual Entities. In: Dong, G., Tang, C., Wang, W. (eds) Advances in Web-Age Information Management. WAIM 2003. Lecture Notes in Computer Science, vol 2762. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45160-0_38
Download citation
DOI: https://doi.org/10.1007/978-3-540-45160-0_38
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40715-7
Online ISBN: 978-3-540-45160-0
eBook Packages: Springer Book Archive