ABSTRACT
Fields are a valuable auxiliary source of information in semi-structured HTML web documents. So, it is no surprise that ranking models have been designed to leverage this information to improve search effectiveness. We present the first (initial) study of utilizing field-based information in the relevance modeling framework. Fields play two different, and integrated, roles in our models: sources of information for inducing relevance models and units on which relevance models are applied for ranking. Our preliminary results suggest that field-based relevance modeling can improve precision at top ranks; specifically, to a greater extent than the commonly used BM25F and SDM-Fields field-based models. Further analysis shows that using field-based relevance models mainly improves the effectiveness of tail queries. Our findings suggest that using field-based information together with relevance modeling is a promising area of future exploration.
- N. Abdul-Jaleel, J. Allan, W. B. Croft, F. Diaz, L. Larkey, X. Li, M. D., and C. Wade. 2004. UMASS at TREC 2004 --- Novelty and HARD.Google Scholar
- F. Diaz. 2015. Condensed List Relevance Models. In Proc. ICTIR. 313--316. Google ScholarDigital Library
- L. Gallagher, R. Chen, J. Mackenzie, F. Scholer, R. Benham, and J. S. Culpepper. 2013. RMIT at the NTCIR-13 We Want Web Task. In Proc. NTCIR.Google Scholar
- J. Y. Kim and W. B. Croft. 2012. A Field Relevance Model for Structured Document Retrieval. In Proc. ECIR. 97--108. Google ScholarDigital Library
- V. Lavrenko and W. B. Croft. 2001. Relevance Based Language Models. In Proc. SIGIR. 120--127. Google ScholarDigital Library
- D. Metzler and W. B. Croft. 2005. A Markov Random Field Model for Term Dependencies. In Proc. SIGIR. 472--479. Google ScholarDigital Library
- D. Metzler, T. Strohman, Y. Zhou, and W. B. Croft. 2005. Indri at TREC 2005: Terabyte Track. In Proc. TREC.Google Scholar
- H. R. Mohammad, K. Xu, J. Callan, and J. S. Culpepper. 2018. Dynamic Shard Cutoff Prediction for Selective Search. In Proc. SIGIR. 85--94. Google ScholarDigital Library
- P. Ogilvie and J. P. Callan. 2003. Combining document representations for known-item search. In Proc. SIGIR. 143--150. Google ScholarDigital Library
- S. Robertson, S. Walker, S. Jones, M. M. Hancock-Beaulieu, M. Gatford, and others. 1994. Okapiat TREC-3. In Proc. TREC. 109--126.Google Scholar
- S. Robertson, H. Zaragoza, and M. Taylor. 2004. Simple BM25 Extension to Multiple Weighted Fields. In Proc. CIKM. 42--49. Google ScholarDigital Library
- H. Zamani, B. Mitra, X. Song, N. Craswell, and S. Tiwary. 2018. Neural Ranking Models with Multiple Document Fields. In Proc. WSDM. 700--708. Google ScholarDigital Library
- H. Zaragoza, N. Craswell, M. Taylor, S. Saria, and S. Robertson. 2004. Microsoft Cambridge at TREC 13: Web and Hard Tracks. In Proc. TREC.Google Scholar
- Improving Search Effectiveness with Field-based Relevance Modeling
Recommendations
Improving Ranking Consistency for Web Search by Leveraging a Knowledge Base and Search Logs
CIKM '15: Proceedings of the 24th ACM International on Conference on Information and Knowledge ManagementIn this paper, we propose a new idea called ranking consistency in web search. Relevance ranking is one of the biggest problems in creating an effective web search system. Given some queries with similar search intents, conventional approaches typically ...
A Linguistic Study on Relevance Modeling in Information Retrieval
WWW '21: Proceedings of the Web Conference 2021Relevance plays a central role in information retrieval (IR), which has received extensive studies starting from the 20th century. The definition and the modeling of relevance has always been critical challenges in both information science and computer ...
Passage relevance models for genomics search
CIKM '08: Proceedings of the 17th ACM conference on Information and knowledge managementWe present a passage relevance model for integrating semantic and statistical evidence of biomedical concepts and topics in context using the framework of a probabilistic graphical model. Component models of topics, concepts, terms, and document are ...
Comments