ABSTRACT
We investigate the effect of rewarding terms according to their locations in documents for probabilistic information retrieval. The intuition behind our approach is that a large amount of authors would summarize their ideas in some particular parts of documents. In this paper, we focus on the beginning part of documents. Several shape functions are defined to simulate the influence of term location information. We propose a Reward Term Retrieval model that combines the reward terms' information with BM25 to enhance probabilistic information retrieval performance.
- J. Gao, K. Toutanova, and W. Yih. Clickthrough-based latent semantic models for web search. In SIGIR'11, pages 675--684, New York, NY, USA, 2011. ACM. Google ScholarDigital Library
- S. Robertson, S. Walker, and M. Beaulieu. Okapi at TREC-7: automatic ad hoc, filtering, vlc and interactive track. In TREC'99, pages 253--264, 1999.Google Scholar
- S. Robertson, H. Zaragoza, and M. Taylor. Simple BM25 extension to multiple weighted fields. In CIKM'04, pages 42--49, New York, NY, USA, 2004. ACM. Google ScholarDigital Library
- U. Shah, T. Finin, A. Joshi, R. S. Cost, and J. Matfield. Information retrieval on the semantic web. In CIKM'02, pages 461--468, New York, NY, USA, 2002. ACM. Google ScholarDigital Library
- Q. Wang, Q. Li, S. Wang, and X. Du. Exploiting semantic tags in xml retrieval. In INEX'09, pages 133--144, Berlin, 2010. Google ScholarDigital Library
- L. Zhao and J. Callan. Effective and efficient structured retrieval. In CIKM'09, pages 1573--1576, New York, NY, USA, 2009. ACM. Google ScholarDigital Library
Index Terms
- Rewarding term location information to enhance probabilistic information retrieval
Recommendations
Modeling Term Associations for Probabilistic Information Retrieval
Traditionally, in many probabilistic retrieval models, query terms are assumed to be independent. Although such models can achieve reasonably good performance, associations can exist among terms from a human being’s point of view. There are some recent ...
Using Term Location Information to Enhance Probabilistic Information Retrieval
SIGIR '15: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information RetrievalNouns are more important than other parts of speech in information retrieval and are more often found near the beginning or the end of sentences. In this paper, we investigate the effects of rewarding terms based on their location in sentences on ...
Exploring term dependences in probabilistic information retrieval model
Most previous information retrieval (IR) models assume that terms of queries and documents are statistically independent from each another. However, this kind of conditional independence assumption is obviously and openly understood to be wrong, so we ...
Comments