Abstract
Text classification is becoming an interesting research field because of the growing availability of documents in digital form that need to be organized. The machine learning paradigm is usually applied to text classification: a general inductive process automatically builds a text classifier from a set of pre-classified documents. In this paper we investigate the application of Bayesian networks to the classification of MedLine documents, where each document is identified by a set of MeSH ontology terms. Bayesian networks were selected for their ability to describe conditional independencies between variables and because they provide clear methodologies for learning from observations. Our experimental evaluation of these ideas is based on the relevance judgments of the 2004 TREC workshop Genomics track.
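As an illustration of the general idea, the sketch below trains a Bernoulli naive Bayes classifier, the simplest Bayesian network structure (one class node with conditionally independent term nodes), on documents represented as sets of MeSH terms. It is not the network structure or learning procedure used in the paper; the function names `train` and `predict`, the labels, the smoothing parameter `alpha`, and the toy documents are all assumptions made for this example.

```python
import math

def train(docs, alpha=1.0):
    """Estimate class priors and per-term presence probabilities.

    docs  : list of (set_of_mesh_terms, label) pairs (hypothetical toy format)
    alpha : Laplace smoothing constant (an assumption, not from the paper)
    """
    vocab = sorted({t for terms, _ in docs for t in terms})
    labels = sorted({label for _, label in docs})
    prior, cond = {}, {}
    for c in labels:
        class_docs = [terms for terms, label in docs if label == c]
        prior[c] = len(class_docs) / len(docs)
        # P(term present | class), smoothed so that no probability is 0 or 1
        cond[c] = {
            t: (sum(t in terms for terms in class_docs) + alpha)
               / (len(class_docs) + 2 * alpha)
            for t in vocab
        }
    return {"vocab": vocab, "prior": prior, "cond": cond}

def predict(model, terms):
    """Return the label with the highest log-posterior for a set of terms."""
    scores = {}
    for c, p_c in model["prior"].items():
        score = math.log(p_c)
        for t in model["vocab"]:
            p = model["cond"][c][t]
            # Both present and absent terms contribute (Bernoulli model);
            # terms never seen in training are ignored.
            score += math.log(p if t in terms else 1.0 - p)
        scores[c] = score
    return max(scores, key=scores.get)

# Toy, made-up training data: MeSH-style term sets with relevance labels.
TOY_DOCS = [
    ({"Genes", "Mice"}, "relevant"),
    ({"Genes", "Mutation"}, "relevant"),
    ({"Humans", "Surgery"}, "not_relevant"),
    ({"Humans", "Nursing"}, "not_relevant"),
]
MODEL = train(TOY_DOCS)
print(predict(MODEL, {"Genes", "Mice"}))
```

A more general Bayesian network would also allow dependencies between terms, which this naive structure deliberately ignores.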
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Glez-Peña, D., López, S., Pavón, R., Laza, R., Iglesias, E.L., Borrajo, L. (2009). Classification of MedLine Documents Using MeSH Terms. In: Omatu, S., et al. Distributed Computing, Artificial Intelligence, Bioinformatics, Soft Computing, and Ambient Assisted Living. IWANN 2009. Lecture Notes in Computer Science, vol 5518. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02481-8_141
DOI: https://doi.org/10.1007/978-3-642-02481-8_141
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02480-1
Online ISBN: 978-3-642-02481-8
eBook Packages: Computer Science (R0)