Abstract
The need to have a search technique which will help a computer in understanding the user, and his requirements, has long been felt. This paper proposes a new technique, of doing so. It proceeds by first clustering the English language words into clusters of similar meaning, and then connecting those clusters according to their observed relationships and co-occurrences in web pages. These known relationships between word clusters are used to enhance the user’s query, and in effect ’understand’ it. This process will result in giving results of more value to the user. This procedure does not suffer with the problems faced by many of the presently used techniques.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
WordNet Lexical Database, http://www.cogsci.princeton.edu/~wn/
Miller, G.A., Beckwith, R., Fellbaum, C., et al.: Introduction to WordNet: An On-line Lexical Database (1993)
Brin, S., Page, L.: The Anatomy of a Large Scale Hypertextual Web Search Engine. In: Proceedings of the 7th International World Wide Web Conference, Brisbane, Australia, pp. 107–117. Elsevier Science, Amsterdam (April 1998)
Selberg, Etzioni: The MetaCrawler Architecture for Resource Aggregation on the Web. IEEE Expert 12(1), 8–14 (1997)
Howe, A.E., Dreilinger, D.: SavvySearch: A Meta-Search Engine that Learns which Search Engines to Query (1997)
Sahami, M.: Using Machine Learning to Improve Information Access. PhD dissertation. Stanford Univ. (December 1998)
Harmon, D.: Ranking algorithms. In: Frakes, W.B., Baeza-Yates, R. (eds.) Information Retrieval: Data Structures and Algorithms, pp. 363–392. Prentice Hall, Englewood Cliffs (1992)
Salton, G., Buckley, C.: Term Weighing approaches in Automatic Text Retrieval. Information Processing and Management 24(5), 513–523 (1988)
Salton, G., McGill, M.J.: Introduction to Modern Information Retrieval. McGraw-Hill Book Company, New York (1983)
Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing. Communications of the ACM 18, 613–620 (1975)
Salton, G.: The SMART Information Retrieval system. Prentice Hall, Englewood Cliffs (1975)
van Rijsbergen, C.J.: Information Retrieval. Butterworths, Butterworths (1979)
Lawrence, S.: Context in Web Search. IEEE Data Engineering Bulletin 23(3), 25–32 (2000)
McCarthy, J.: Notes on formalizing context. In: Proceedings of the 13th IJCAI, vol. 1, pp. 555–560 (1993)
Finkelstein, L., et al.: Placing Search in Context: The Concept revisited. In: 10th World Wide Web Conference, Hong Kong, May 2-5 (2001)
Bharat, K.: SearchPad: Explicit Capture of Search Context to Support Web Search. In: Proceedings of the 9th International World Wide Web Conference, WWW9, Amsterdam (May 2000)
Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by Latent Semantic Analysis. Journal of the American Society of Information Science (1990)
Richardson, R., Smeaton, A.: Using Wordnet in a Knowledge-Based Approach to Information Retrieval. In: Proceedings of the BCS-IRSG Colloquium, Crewe
Smeaton, A.F., Quigley, L.: Experiments on using semantic distances between words in image caption retrieval. In: Proceedings of the 19th International conference on research and Development in IR (1996)
Uchida, H., Zhu, M., Della, S.T.: UNL: A Gift for a Millennium. The United Nations University (1995)
Gonzalo, J., Verdejo, F., Chugur, I., Cigarran, J.: Indexing with WordNet synsets can improve Text Retrieval. In: Proceedings of the COLING/ACL 1998 Workshop on Usage of WordNet for NLP, Montreal (1998)
Bhattacharya, P., Choudhary, B.: Text Clustering using Semantics
Brezillon, P.: Context in Problem Solving: A Survey
Martin, S., Liermann, J., Ney, H.: Algorithms for Bigram and Trigram Word Clustering
Ushioda, A.: Hierarchical clustering of words and application to nlp tasks. In: Proceedings of the Fourth Workshop on Very Large Corpora (1996)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Khare, A. (2003). Connecting Word Clusters to Represent Concepts with Application to Web Searching. In: Palade, V., Howlett, R.J., Jain, L. (eds) Knowledge-Based Intelligent Information and Engineering Systems. KES 2003. Lecture Notes in Computer Science(), vol 2773. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45224-9_109
Download citation
DOI: https://doi.org/10.1007/978-3-540-45224-9_109
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40803-1
Online ISBN: 978-3-540-45224-9
eBook Packages: Springer Book Archive