A comparison of search term weighting: term relevance vs. inverse document frequency

Authors:
Harry Wu

Cornell University

Cornell University
View Profile

,
Gerard Salton

Cornell University

Cornell University
View Profile

Authors Info & Claims

ACM SIGIR Forum Volume 16 Issue 1Summer 1981pp 30–39https://doi.org/10.1145/1013228.511759

Published:31 May 1981Publication History

ACM SIGIR Forum

Abstract

The term relevance weighting method has been shown to produce optimal information retrieval queries under well-defined conditions. The parameters needed to generate the term relevance factors cannot unfortunately be estimated accurately in practice; futhermore, in realistic test situations, it appears difficult to obtain improved retrieval results using the term relevance weights over much simpler term weighting systems such as, for example, the inverse document frequency weights.It is shown in this study that the inverse document frequency weights and the term relevance weights are closely related over a wide range of the frequency spectrum. Methods are introduced for estimating the term relevance weights, and experimental results are given comparing the inverse document frequency with the estimated term relevance weights.

References

D. H. Kraft and A. Bookstein, Evaluation of Information Retrieval Systems: A Decision Theory Approach, Journal of the ASIS, Vol. 29, 1978, p. 31-34.Google Scholar
S. E. Robertson and K. Sparck Jones, Relevance Weighting of Search Terms, Journal of the ASIS, Vol. 27, No. 3, 1976, p. 129-146.Google Scholar
C. T. Yu, W. S. Luk and M. K. Siu, On Models of Information Retrieval Processes, Information Systems, Vol. 4, No. 3, 1979, p. 205-218.Google Scholar
C. T. Yu and G. Salton, Precision Weighting - An Effective Automatic Indexing Method, Journal of the ACM, Vol. 23, No. 1, 1976, p. 76-88. Google ScholarDigital Library
K. Sparck Jones, Experiments in Relevance Weighting of Search Terms, Information Processing and Management, Vol. 15, 1979, p. 133-144.Google ScholarCross Ref
K. Sparck Jones, Search Term Relevance Weighting Given Little Relevance Information, Journal of Documentation, Vol. 35, 1979, p. 30-48.Google ScholarCross Ref
K. Sparck Jones, Search Term Relevance Weighting - Some Recent Results, Journal of Information Science, Vol. 1, 1980, p. 325-332.Google ScholarDigital Library
S. E. Robertson, C. J. VanRijsbergen and M. F. Porter, Probabilistic Models of Indexing and Searching, Proc. of ACM-BCS Symposium on Research and Development in Information Retrieval, Cambridge, England, 1980. Google ScholarDigital Library
G. Salton, A. Wong, and C. T. Yu, Automatic Indexing Using Term Discrimination and Term Precision Measurements, Information Processing and Management, Vol. 12, 1976, p. 43-51.Google Scholar
G. Salton and R. K. Waldstein, Term Relevance Weights in On-Line Information Retrieval, Information Processing and Management, Vol. 14, 1978, p. 29-35.Google ScholarCross Ref
W. B. Croft and D. J. Harper, Using Probabilistic Models of Document Retrieval Without Relevance Information, Journal of Documentation, Vol. 35, 1979, p. 285-295.Google ScholarCross Ref
D. J. Harper and C. J. VanRijsbergen, An Evaluation of Feedback in Retrieval Using Co-Occurrence Data, Journal of Documentation, Vol. 34, 1978, p. 189-216.Google ScholarCross Ref
G. Salton, H. Wu, and C. T. Yu, The Measurement of Term Importance in Automatic Indexing, to be published in Journal of the ASIS.Google Scholar
C. T. Yu, K. Lam, and G. Salton, Optimum Term Weighting in Information Retrieval Using the Term Precision Model, to be published in Journal of the ACM. Google ScholarDigital Library
K. Sparck Jones, A Statistical Interpretation of Term Specificity and its Application in Retrieval, Journal of Documentation, Vol. 28, 1972, p. 11-21.Google ScholarCross Ref

Recommendations

Context-Aware Document Term Weighting for Ad-Hoc Search
WWW '20: Proceedings of The Web Conference 2020

Bag-of-words document representations play a fundamental role in modern search engines, but their power is limited by the shallow frequency-based term weighting scheme. This paper proposes HDCT, a context-aware document term weighting framework for ...
Read More
A comparison of search term weighting: term relevance vs. inverse document frequency
SIGIR '81: Proceedings of the 4th annual international ACM SIGIR conference on Information storage and retrieval: theoretical issues in information retrieval

The term relevance weighting method has been shown to produce optimal information retrieval queries under well-defined conditions. The parameters needed to generate the term relevance factors cannot unfortunately be estimated accurately in practice; ...
Read More
A Comparison of Search Term Weighting: Term Relevance vs. Inverse Document Frequency
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in
ACM SIGIR Forum Volume 16, Issue 1
Summer 1981
149 pages
ISSN:0163-5840
DOI:10.1145/1013228
Issue’s Table of Contents
SIGIR '81: Proceedings of the 4th annual international ACM SIGIR conference on Information storage and retrieval: theoretical issues in information retrieval
May 1981
149 pages
ISBN:0897910524
DOI:10.1145/511754
Conference Chair:
Carolyn J. Crouch
The University of Alabama
,
General Chair:
William Cooper
University of California at Berkeley
,
Program Chair:
Jessie Herr
Research Libraries Group
Copyright © 1981 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 31 May 1981
Check for updates
Qualifiers
- article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 49
  Total Citations
  View Citations
- 931
  Total Downloads
- Downloads (Last 12 months)102
- Downloads (Last 6 weeks)21
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

A comparison of search term weighting: term relevance vs. inverse document frequency

ACM SIGIR Forum

Abstract

References

Cited By

Recommendations

Context-Aware Document Term Weighting for Ad-Hoc Search

A comparison of search term weighting: term relevance vs. inverse document frequency

A Comparison of Search Term Weighting: Term Relevance vs. Inverse Document Frequency