ABSTRACT
Recent work introduced a probabilistic framework that measures search engine performance information-theoretically. This allows for novel meta-evaluation measures such as Information Difference, which measures the magnitude of the difference between search engines in their ranking of documents. for which we have relevance information. Using Information Difference we can compare the behavior of search engines-which documents the search engine prefers, as well as search engine performance-how likely the search engine is to satisfy a hypothetical user. In this work, we a) extend this probabilistic framework to precision-oriented contexts, b) show that Information Difference can be used to detect similar search engines at shallow ranks, and c) demonstrate the utility of the Information Difference methodology by showing that well-tuned search engines employing different retrieval models are more similar than a well-tuned and a poorly tuned implementation of the same retrieval model.
- Ben Carterette. System effectiveness, user models, and user utility: A conceptual framework for investigation. In SIGIR '11. Google ScholarDigital Library
- Peter B. Golbus and Javed A. Aslam. A mutual information-based framework for the analysis of information retrieval systems. In SIGIR '13. Google ScholarDigital Library
- Peter B. Golbus, Javed A. Aslam, and Charles L.A. Clarke. Increasing evaluation sensitivity to diversity. Information Retrieval, 16(4), 2013. Google ScholarDigital Library
- I. Ounis, G. Amati, V. Plachouras, B. He, C. Macdonald, and C. Lioma. Terrier: A High Performance and Scalable Information Retrieval Platform. In OSIR '06.Google Scholar
Index Terms
- On the information difference between standard retrieval models
Recommendations
Ranking, relevance judgment, and precision of information retrieval on children's queries: Evaluation of Google, Yahoo!, Bing, Yahoo! Kids, and ask Kids
This study employed benchmarking and intellectual relevance judgment in evaluating Google, Yahoo!, Bing, Yahoo! Kids, and Ask Kids on 30 queries that children formulated to find information for specific tasks. Retrieved hits on given queries were ...
MapReduce Based Information Retrieval Algorithms for Efficient Ranking of Webpages
In this paper, the authors discuss the MapReduce implementation of crawler, indexer and ranking algorithms in search engines. The proposed algorithms are used in search engines to retrieve results from the World Wide Web. A crawler and an indexer in a ...
Information retrieval on the web: improving relevancy by disambiguating user queries
ACST'06: Proceedings of the 2nd IASTED international conference on Advances in computer science and technologyIn this paper we present a system to improve the performance of web search engines. The system uses a sense disambiguation algorithm which is based on contextual ranking to improve the user queries. We tested the system using Google index and found that ...
Comments