Skip to main content

Connecting Word Clusters to Represent Concepts with Application to Web Searching

  • Conference paper
Knowledge-Based Intelligent Information and Engineering Systems (KES 2003)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2773))

  • 1183 Accesses

Abstract

The need to have a search technique which will help a computer in understanding the user, and his requirements, has long been felt. This paper proposes a new technique, of doing so. It proceeds by first clustering the English language words into clusters of similar meaning, and then connecting those clusters according to their observed relationships and co-occurrences in web pages. These known relationships between word clusters are used to enhance the user’s query, and in effect ’understand’ it. This process will result in giving results of more value to the user. This procedure does not suffer with the problems faced by many of the presently used techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 74.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. WordNet Lexical Database, http://www.cogsci.princeton.edu/~wn/

  2. Miller, G.A., Beckwith, R., Fellbaum, C., et al.: Introduction to WordNet: An On-line Lexical Database (1993)

    Google Scholar 

  3. Brin, S., Page, L.: The Anatomy of a Large Scale Hypertextual Web Search Engine. In: Proceedings of the 7th International World Wide Web Conference, Brisbane, Australia, pp. 107–117. Elsevier Science, Amsterdam (April 1998)

    Google Scholar 

  4. Selberg, Etzioni: The MetaCrawler Architecture for Resource Aggregation on the Web. IEEE Expert 12(1), 8–14 (1997)

    Article  Google Scholar 

  5. Howe, A.E., Dreilinger, D.: SavvySearch: A Meta-Search Engine that Learns which Search Engines to Query (1997)

    Google Scholar 

  6. Sahami, M.: Using Machine Learning to Improve Information Access. PhD dissertation. Stanford Univ. (December 1998)

    Google Scholar 

  7. Harmon, D.: Ranking algorithms. In: Frakes, W.B., Baeza-Yates, R. (eds.) Information Retrieval: Data Structures and Algorithms, pp. 363–392. Prentice Hall, Englewood Cliffs (1992)

    Google Scholar 

  8. Salton, G., Buckley, C.: Term Weighing approaches in Automatic Text Retrieval. Information Processing and Management 24(5), 513–523 (1988)

    Article  Google Scholar 

  9. Salton, G., McGill, M.J.: Introduction to Modern Information Retrieval. McGraw-Hill Book Company, New York (1983)

    MATH  Google Scholar 

  10. Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing. Communications of the ACM 18, 613–620 (1975)

    Article  MATH  Google Scholar 

  11. Salton, G.: The SMART Information Retrieval system. Prentice Hall, Englewood Cliffs (1975)

    Google Scholar 

  12. van Rijsbergen, C.J.: Information Retrieval. Butterworths, Butterworths (1979)

    Google Scholar 

  13. Lawrence, S.: Context in Web Search. IEEE Data Engineering Bulletin 23(3), 25–32 (2000)

    Google Scholar 

  14. McCarthy, J.: Notes on formalizing context. In: Proceedings of the 13th IJCAI, vol. 1, pp. 555–560 (1993)

    Google Scholar 

  15. Finkelstein, L., et al.: Placing Search in Context: The Concept revisited. In: 10th World Wide Web Conference, Hong Kong, May 2-5 (2001)

    Google Scholar 

  16. Bharat, K.: SearchPad: Explicit Capture of Search Context to Support Web Search. In: Proceedings of the 9th International World Wide Web Conference, WWW9, Amsterdam (May 2000)

    Google Scholar 

  17. Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by Latent Semantic Analysis. Journal of the American Society of Information Science (1990)

    Google Scholar 

  18. Richardson, R., Smeaton, A.: Using Wordnet in a Knowledge-Based Approach to Information Retrieval. In: Proceedings of the BCS-IRSG Colloquium, Crewe

    Google Scholar 

  19. Smeaton, A.F., Quigley, L.: Experiments on using semantic distances between words in image caption retrieval. In: Proceedings of the 19th International conference on research and Development in IR (1996)

    Google Scholar 

  20. Uchida, H., Zhu, M., Della, S.T.: UNL: A Gift for a Millennium. The United Nations University (1995)

    Google Scholar 

  21. Gonzalo, J., Verdejo, F., Chugur, I., Cigarran, J.: Indexing with WordNet synsets can improve Text Retrieval. In: Proceedings of the COLING/ACL 1998 Workshop on Usage of WordNet for NLP, Montreal (1998)

    Google Scholar 

  22. Bhattacharya, P., Choudhary, B.: Text Clustering using Semantics

    Google Scholar 

  23. Brezillon, P.: Context in Problem Solving: A Survey

    Google Scholar 

  24. Martin, S., Liermann, J., Ney, H.: Algorithms for Bigram and Trigram Word Clustering

    Google Scholar 

  25. Ushioda, A.: Hierarchical clustering of words and application to nlp tasks. In: Proceedings of the Fourth Workshop on Very Large Corpora (1996)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Khare, A. (2003). Connecting Word Clusters to Represent Concepts with Application to Web Searching. In: Palade, V., Howlett, R.J., Jain, L. (eds) Knowledge-Based Intelligent Information and Engineering Systems. KES 2003. Lecture Notes in Computer Science(), vol 2773. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45224-9_109

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-45224-9_109

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-40803-1

  • Online ISBN: 978-3-540-45224-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics