ABSTRACT
Effective organization of web search results can greatly improve the utility of search engine and enhance the quality of search results. However, the organization of search results is difficult because the sub-topics of a query are usually not explicitly given. In this paper, we propose a novel topic-driven search result organization method, which can first detect the sub-topics of a query by finding the coherent Wikipedia concept groups from its search results; then organize these results using a topic-driven clustering algorithm; in the end we score and rank the topics using the support vector regression model. Empirical results show that our method can achieve competitive performance.
- Hearst, M. A. and Pedersen, J. O. Reexamining the cluster hypothesis: scatter/gather on retrieval results. In Proc. of SIGIR, 1996. Google ScholarDigital Library
- Medelyan, O., Witten, I. H. and Milne, D. Topic indexing with Wikipedia. In Proc. of the AAAI WikiAI, 2008.Google Scholar
- Newman, M. and Girvan, M. Finding and evaluating community structure in networks. Physical review E, vol. 69, no. 2, p. 26113, 2004.Google Scholar
- Witten, D. M. and Milne, D. An effective, low-cost measure of semantic relatedness obtained from Wikipedia links, In Proc. of AAAI WikiAI, 2008.Google Scholar
- Everitt, B. S. Landau S., and Lesse M. Cluster Analysis, 4th Ed. Oxford University Press, 2001. Google ScholarDigital Library
- Zamir, O. and Etzioni, O. Web document clustering: A feasibility demonstration. In Proc. of SIGIR, 1998. Google ScholarDigital Library
- Masowska, I. Phrase-Based Hierarchical Clustering of Web Search Results. Advances in Information Retrieval, 2003. Google ScholarDigital Library
- Osiriski, S., Stefanowski, J. and Weiss, D. Lingo: Search results clustering algorithm based on singular value decomposition. In Proc.of the IIS: IIPWM'04, 2004.Google Scholar
- Lawrie, D. J. and Croft, W. B. Generating Hierarchical Summaries for Web Searches. In Proc. of SIGIR, 2003. Google ScholarDigital Library
- Smola, A. J. & Schlkopf, B. A tutorial on support vector regression. Statistics and Computing, vol. 14, no. 3, 2004. Google ScholarDigital Library
Index Terms
- Topic-driven web search result organization by leveraging wikipedia semantic knowledge
Recommendations
Improving Ranking Consistency for Web Search by Leveraging a Knowledge Base and Search Logs
CIKM '15: Proceedings of the 24th ACM International on Conference on Information and Knowledge ManagementIn this paper, we propose a new idea called ranking consistency in web search. Relevance ranking is one of the biggest problems in creating an effective web search system. Given some queries with similar search intents, conventional approaches typically ...
Learn from web search logs to organize search results
SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrievalEffective organization of search results is critical for improving the utility of any search engine. Clustering search results is an effective way to organize search results, which allows a user to navigate into relevant documents quickly. However, two ...
Semantic-based topic detection using Markov decision processes
In the field of text mining, topic modeling and detection are fundamental problems in public opinion monitoring, information retrieval, social media analysis, and other activities. Document clustering has been used for topic detection at the document ...
Comments