Abstract
We argue that the emphasis normally placed on query-similarity in Web search limits search precision. We draw on related work in case-based reasoning (CBR) and recommender systems research, which shows how enhancing diversity can improve the quality of retrieved cases and recommendations. We investigate the use of related diversity-enhancing retrieval techniques in Web search, showing that similar benefits are available, i.e. that result diversity can be significantly enhanced without compromising query similarity or result precision and recall.
The support of the Informatics Research Initiative of Enterprise Ireland is gratefully acknowledged
Chapter PDF
Similar content being viewed by others
References
K. Bradley and B. Smyth “Improving Recommendation Diversity”, Proceedings of the 12th National Conference in Artificial Intelligence and Cognitive Science (AICS-01), pp. 75–84, Maynooth, Ireland, 2001.
D. Bridge and A. Ferguson “Diverse Product Recommendations using an Expressive Language for Case Retrieval”, Proceedings of the 16th European Conference on Case-Based Reasoning, pp. 43–57, 2002.
S. Brin and L. Page “The Anatomy of A Large-Scale Hypertextual Web Search Engine”, Proceedings of the 7th International World-Wide Web Conference, 2001.
J. Budzik and K. Hammond “User Interactions with Everyday Applications as Context for Just-in-time Information Access”, Proceedings of the International Conference on Intelligent User Interfaces, pp. 44–51, ACM Press, 2000.
D. R. Cutting and D. R. Karger and J. O. Pedersen and J. W. Tukey “Scatter Gather: a cluster-based approach to browsing large document collections”, Proceedings of the 15h International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 318–29, ACM Press, 1992.
E. Glover and S. Lawrence and M. D. Gordon and W. P. Birmingham and C. Lee Giles “Web Search-Your Way”, Communications of the ACM, 44(12), pp. 97–102, 2000.
T. H. Haveliwala “Topic-Sensitive PageRank”, Proceedings of the 11th World-Wide Web Conference, ACM Press, 2002.
Z. Jiang and A. Joshi and R. Krishnapuram and L. Yi “Retriever: Improving Web Search Engine Results Using Clustering”, Managing Business with Electronic Commerce: Issues and Trends, Idea Press, 2001.
R. Krovetz and W. B. Croft “Lexical Ambiguity and Information Retrieval”, Information Systems, 10(2), pp. 115–141, 1992.
S. Lawrence “Context in Web Search”, IEEE Data Engineering Bulletin, 23(3), pp. 25–32, 2000.
S. Lawrence and C. Lee Giles “Searching the Web: General and Scientific Information Access”, IEEE Communications 37(1), pp. 116–122, 1999.
A. Leouski and W. Croft “An Evaluation of Techniques for Clustering Search Results”, Technical Report IR-76, Department of Computer Science, University of Massachusetts, Amherst, 1996.
D. McSherry “Diversity-Conscious Retrieval”, Proceedings of the 6th European Conference on Case-Based Reasoning, pp. 219–233, Aberdeen, Scotland, 2002.
H. Shimazu “ExpertClerk: Navigating Shoppers’ Buying Process with the Combination of Asking and Proposing”, Proceedings of the 17th International Joint Conference on Artificial Intelligence, pp. 1443–1448, Seattle, Washington, USA, 2001.
C. Silverstein and M. Henzinger and H. Marais and M. Moricz “Analysis of a Very Large AltaVista Query Log”, Technical Report 1998-014, Digital SRC Technical Notes http://gatekeeper.dec.com/pub/DEC/SRC/technical-notes/abstracts/src-tn-1998-014.html, 1998.
B. Smyth and E. Balfe and P. Briggs and M. Coyle and J. Freyne “Collaborative Web Search”, Proceedings of the 18th International Joint Conference on Artificial Intelligence, pp. 1417–1419, Acapulco, Mexico, 2003.
B. Smyth and E. Balfe and P. Briggs and M. Coyle and J. Freyne “I-SPY-Anonymous, Community-based Personalization by Collaborative Meta-search”, Proceedings of the 23rd SGAI International Conference on Innovative Techniques and Applications of Artificial Intelligence, Cambridge, UK, 2003.
B. Smyth and P. McClave “Similarity vs. Diversity”, Proceedings of the 4th International Conference on Case-Based Reasoning, Vancouver, Canada, 2001.
C. J. van Rijsbergen “Information Retrieval, 2nd Edition”, Department of Computer Science, University of Glasgow, 1979.
Y. Wang and M. Kitsuregawa “Link-based Clustering of Web Search Results”, Lecture Notes in Computer Science, Advances in Web-Age Information Management, Second International Conference, 2118. pp. 225–237, WAIM 2001.
O. Zamir and O. Etzioni “Web Document Clustering: A Feasibility Demonstration”, Research and Development in Information Retrieval, pp. 46–54, 1998.
O. Zamir and O. Etzioni “Grouper: A Dynamic Clustering Interface to Web Search Results”, Computer Networks, 31(11–16), pp. 1361–1374, Amsterdam, Netherlands, 1999.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 International Federation for Information Processing
About this paper
Cite this paper
Coyle, M., Smyth, B. (2005). On the Importance of Being Diverse. In: Shi, Z., He, Q. (eds) Intelligent Information Processing II. IIP 2004. IFIP International Federation for Information Processing, vol 163. Springer, Boston, MA. https://doi.org/10.1007/0-387-23152-8_43
Download citation
DOI: https://doi.org/10.1007/0-387-23152-8_43
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-23151-8
Online ISBN: 978-0-387-23152-5
eBook Packages: Computer ScienceComputer Science (R0)