skip to main content
article
Free Access

A comparison of search term weighting: term relevance vs. inverse document frequency

Published:31 May 1981Publication History
Skip Abstract Section

Abstract

The term relevance weighting method has been shown to produce optimal information retrieval queries under well-defined conditions. The parameters needed to generate the term relevance factors cannot unfortunately be estimated accurately in practice; futhermore, in realistic test situations, it appears difficult to obtain improved retrieval results using the term relevance weights over much simpler term weighting systems such as, for example, the inverse document frequency weights.It is shown in this study that the inverse document frequency weights and the term relevance weights are closely related over a wide range of the frequency spectrum. Methods are introduced for estimating the term relevance weights, and experimental results are given comparing the inverse document frequency with the estimated term relevance weights.

References

  1. D. H. Kraft and A. Bookstein, Evaluation of Information Retrieval Systems: A Decision Theory Approach, Journal of the ASIS, Vol. 29, 1978, p. 31-34.Google ScholarGoogle Scholar
  2. S. E. Robertson and K. Sparck Jones, Relevance Weighting of Search Terms, Journal of the ASIS, Vol. 27, No. 3, 1976, p. 129-146.Google ScholarGoogle Scholar
  3. C. T. Yu, W. S. Luk and M. K. Siu, On Models of Information Retrieval Processes, Information Systems, Vol. 4, No. 3, 1979, p. 205-218.Google ScholarGoogle Scholar
  4. C. T. Yu and G. Salton, Precision Weighting - An Effective Automatic Indexing Method, Journal of the ACM, Vol. 23, No. 1, 1976, p. 76-88. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. K. Sparck Jones, Experiments in Relevance Weighting of Search Terms, Information Processing and Management, Vol. 15, 1979, p. 133-144.Google ScholarGoogle ScholarCross RefCross Ref
  6. K. Sparck Jones, Search Term Relevance Weighting Given Little Relevance Information, Journal of Documentation, Vol. 35, 1979, p. 30-48.Google ScholarGoogle ScholarCross RefCross Ref
  7. K. Sparck Jones, Search Term Relevance Weighting - Some Recent Results, Journal of Information Science, Vol. 1, 1980, p. 325-332.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. S. E. Robertson, C. J. VanRijsbergen and M. F. Porter, Probabilistic Models of Indexing and Searching, Proc. of ACM-BCS Symposium on Research and Development in Information Retrieval, Cambridge, England, 1980. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. G. Salton, A. Wong, and C. T. Yu, Automatic Indexing Using Term Discrimination and Term Precision Measurements, Information Processing and Management, Vol. 12, 1976, p. 43-51.Google ScholarGoogle Scholar
  10. G. Salton and R. K. Waldstein, Term Relevance Weights in On-Line Information Retrieval, Information Processing and Management, Vol. 14, 1978, p. 29-35.Google ScholarGoogle ScholarCross RefCross Ref
  11. W. B. Croft and D. J. Harper, Using Probabilistic Models of Document Retrieval Without Relevance Information, Journal of Documentation, Vol. 35, 1979, p. 285-295.Google ScholarGoogle ScholarCross RefCross Ref
  12. D. J. Harper and C. J. VanRijsbergen, An Evaluation of Feedback in Retrieval Using Co-Occurrence Data, Journal of Documentation, Vol. 34, 1978, p. 189-216.Google ScholarGoogle ScholarCross RefCross Ref
  13. G. Salton, H. Wu, and C. T. Yu, The Measurement of Term Importance in Automatic Indexing, to be published in Journal of the ASIS.Google ScholarGoogle Scholar
  14. C. T. Yu, K. Lam, and G. Salton, Optimum Term Weighting in Information Retrieval Using the Term Precision Model, to be published in Journal of the ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. K. Sparck Jones, A Statistical Interpretation of Term Specificity and its Application in Retrieval, Journal of Documentation, Vol. 28, 1972, p. 11-21.Google ScholarGoogle ScholarCross RefCross Ref

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in

Full Access

  • Published in

    cover image ACM SIGIR Forum
    ACM SIGIR Forum  Volume 16, Issue 1
    Summer 1981
    149 pages
    ISSN:0163-5840
    DOI:10.1145/1013228
    Issue’s Table of Contents
    • cover image ACM Conferences
      SIGIR '81: Proceedings of the 4th annual international ACM SIGIR conference on Information storage and retrieval: theoretical issues in information retrieval
      May 1981
      149 pages
      ISBN:0897910524
      DOI:10.1145/511754

    Copyright © 1981 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 31 May 1981

    Check for updates

    Qualifiers

    • article

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader