Abstract
MEDLINE database is most resourceful of biomedical literatures. Lay users may get difficulty to formulate a query. Query expansion technique reformulates user query by adding more significant and related terms to original terms to retrieve more relevant results. Finding related terms are explored form external resources, collection and query context. Since each MEDLINE document is manually assigned with controlled vocabularies which is called MeSH (Medical Subject Headings). These controlled vocabularies may be beneficial for query expansion. This paper proposes pseudo-relevance feedback by using MeSH terms in documents for query expansion. Additionally, re-weighting scheme called RABAM-PRF (Rank-Based MeSH Pseudo-Relevance Feedback) for filtering misleading terms is studied. In experiment, we use Lucene to retrieve the OHSUMED collection as baseline. The proposed method improves retrieval performance in MAP, P@10, and B-pref. Furthermore, the experiment showed that not all MeSH terms should be included to the query.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Fontaine, J.-F., Barbosa-Silva, A., Schaefer, M., Huska, M.R., Muro, E.M., Andrade-Navarro, M.A.: MedlineRanker: Flexible ranking of biomedical literature. Nucleic Acids Res. 37, W141-W146 (2009)
Yoo, S., Choi, J.: On the query reformulation technique for effective MEDLINE document retrieval. J. Biomed. Inform. 43, 686–693 (2010)
Jalali, V., Matash Borujerdi, M.: Information retrieval with concept-based pseudo-relevance feedback in MEDLINE. J. Knowl. Inf. Syst. 29, 237–248 (2011)
Rocchio, J.J.: Relevance feedback in information retrieval. In: Salton, G. (ed.) The SMART Retrieval System: Experiments in Automatic Document Processing, pp. 313–323. Prentice-Hall, Englewood Cliffs (1971)
William Hersh, S.P., Donohoe, L.: Assessing thesaurus-based query expansion using the UMLS Metathesaurus. AMIA, 344–348 (2000)
Baeza-Yates, R., Ribeiro-Neto, B.: Modern information retrieval. ACM press, New York (1999)
Xu, X., Zhu, W., Zhang, X., Hu, X., Song, I.-Y.: A comparison of local analysis, global analysis and ontology-based query expansion strategies for bio-medical literature search. In: IEEE International Conference on Systems, Man and Cybernetics, SMC 2006, pp. 3441–3446. IEEE (2006)
Xu, X., Zhang, X., Hu, X.: Using Two-Stage Concept-Based Singular Value Decomposition Technique as a Query Expansion Strategy. In: 21st International Conference on Advanced Information Networking and Applications Workshops, AINAW 2007, pp. 295–300 (2007)
Abdou, S., Savoy, J.: Searching in Medline: Query expansion and manual indexing evaluation. J. Infoproman. 44, 781–789 (2008)
Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by Latent Semantic Analysis. JASIST 41, 391–407 (1990)
Witten, I.H., Frank, E., Hall, M.A.: Data Mining: Practical Machine Learning Tools and Techniques: Practical Machine Learning Tools and Techniques. Elsevier (2011)
Xu, X., Hu, X.: Cluster-based query expansion using language modeling in the biomedical domain. In: 2010 IEEE International Conference on International Conference on Bioinformatics and Biomedicine Workshops (BIBMW), pp. 185–188. IEEE (2010)
Zhu, W., Xu, X., Hu, X., Song, I.-Y., Allen, R.B.: Using UMLS-based Re-Weighting Terms as a Query Expansion Strategy. In: IEEE International Conference on Granular Computing, pp. 217–222. IEEE (2006)
Benjamin King, L.W., Provalor, I., Zhou, J.: Cengage Learning at TREC 2011 Medical Track. In: The 20th Text REtrieval Conference (TREC). National Institute for Standards and Technology (2011)
Jalali, V., Borujerdi, M.R.M.: The Effect of Using Domain Specific Ontologies in Query Expansion in Medical Field. In: IEEE Innovations in Information Technology (2008)
Jalali, V., Borujerdi, M.R.M.: A Hybrid Information Retrieval System for Medical Field Using MeSH Ontology. In: Prasad, S.K., Routray, S., Khurana, R., Sahni, S. (eds.) ICISTM 2009. CCIS, vol. 31, pp. 31–40. Springer, Heidelberg (2009)
Unified Medical Language Systems, http://www.nlm.nih.gov/research/umls
Hersh, W., Buckley, C., Leone, T.J., Hickam, D.: OHSUMED: An Interactive Retrieval Evaluation and New Large Test Collection for Research. In: Croft, B., Rijsbergen, C.J. (eds.) SIGIR 1994, pp. 192–201. Springer, London (1994)
Text REtrieval Conference. The trec eval Evaluation Package (2004), http://trec.nist.gov/trec_eval/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Thesprasith, O., Jaruskulchai, C. (2014). Query Expansion Using Medical Subject Headings Terms in the Biomedical Documents. In: Nguyen, N.T., Attachoo, B., Trawiński, B., Somboonviwat, K. (eds) Intelligent Information and Database Systems. ACIIDS 2014. Lecture Notes in Computer Science(), vol 8397. Springer, Cham. https://doi.org/10.1007/978-3-319-05476-6_10
Download citation
DOI: https://doi.org/10.1007/978-3-319-05476-6_10
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-05475-9
Online ISBN: 978-3-319-05476-6
eBook Packages: Computer ScienceComputer Science (R0)