Machine Learning Methods for Mortality Prediction of Polytraumatized Patients in Intensive Care Units – Dealing with Imbalanced and High-Dimensional Data

Moreno García, María N.; González Robledo, Javier; Martín González, Félix; Sánchez Hernández, Fernando; Sánchez Barba, Mercedes

doi:10.1007/978-3-319-10840-7_38


Part of the book series: Lecture Notes in Computer Science (LNISA, volume 8669)

Included in the following conference series:

International Conference on Intelligent Data Engineering and Automated Learning


Abstract

The aim of this study is to predict the death of polytraumatized patients from epidemiological, clinical and health treatment variables by means of data mining methods. The main problems to be addressed were high dimensionality and imbalanced data. Since the techniques usually applied to these problems, feature selection methods and sampling strategies respectively, did not provide satisfactory results, the study sought to identify the data mining algorithms that behave best in this kind of scenario. The study was carried out with data from 497 patients diagnosed with severe trauma who were hospitalized in the Intensive Care Unit (ICU) of the University Hospital of Salamanca. The results show that multiclassifiers behave better than simple classifiers in contexts of high dimensionality and imbalanced datasets, without the need to resort to undersampling or oversampling strategies, which can lead to the loss of valuable data and to overfitting problems, respectively.
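
The sampling trade-off mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure: the function names are hypothetical, and the toy class counts (90 survivors, 10 deaths) are assumed for illustration only, since the abstract does not report the class distribution of the 497 patients.

```python
import random
from collections import Counter


def group_by_class(rows, labels):
    """Map each class label to the list of rows carrying it."""
    groups = {}
    for row, label in zip(rows, labels):
        groups.setdefault(label, []).append(row)
    return groups


def random_undersample(rows, labels, seed=0):
    """Keep a random subset of every class, sized to the smallest class."""
    rng = random.Random(seed)
    groups = group_by_class(rows, labels)
    target = min(len(members) for members in groups.values())
    out_rows, out_labels = [], []
    for label, members in groups.items():
        out_rows.extend(rng.sample(members, target))  # sampling without replacement
        out_labels.extend([label] * target)
    return out_rows, out_labels


def random_oversample(rows, labels, seed=0):
    """Pad every class with random duplicates up to the size of the largest class."""
    rng = random.Random(seed)
    groups = group_by_class(rows, labels)
    target = max(len(members) for members in groups.values())
    out_rows, out_labels = [], []
    for label, members in groups.items():
        extra = [rng.choice(members) for _ in range(target - len(members))]
        out_rows.extend(members + extra)
        out_labels.extend([label] * target)
    return out_rows, out_labels


# Toy data (assumed, not the study's): 100 record ids, 90 survivors and 10 deaths.
rows = list(range(100))
labels = ["survived"] * 90 + ["died"] * 10

under_rows, under_labels = random_undersample(rows, labels)
over_rows, over_labels = random_oversample(rows, labels)

print(Counter(under_labels))
print(len(rows) - len(under_rows), "records discarded by undersampling")
print(Counter(over_labels))
distinct_deaths = {r for r, l in zip(over_rows, over_labels) if l == "died"}
print(len(distinct_deaths), "distinct minority records after oversampling")
```

On this toy set, undersampling balances the classes by discarding 80 of the 100 records, while oversampling balances them by repeating the same 10 minority records until that class reaches 90 rows. The first illustrates the loss of data and the second the duplication that can encourage overfitting.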



Author information

Authors and Affiliations

Department of Computing and Automation, University of Salamanca, Salamanca, Spain
María N. Moreno García
Intensive Care Unit, University Hospital of Salamanca, Salamanca, Spain
Javier González Robledo & Félix Martín González
School of Nursing and Physiotherapy, University of Salamanca, Prehospital Emergency Services, Salamanca, Spain
Fernando Sánchez Hernández
Department of Statistics, University of Salamanca, Salamanca, Spain
Mercedes Sánchez Barba


Editor information

Editors and Affiliations

University of Salamanca, Plaza de la Merced S/N, 37008, Salamanca, Spain
Emilio Corchado & Héctor Quintián
University of the Basque Country, Paseo Manuel de Lardizábal 1, 20018, San Sebastián, Spain
José A. Lozano
The University of Manchester, Sackville Street, M13 9PL, Manchester, UK
Hujun Yin


About this paper

Cite this paper

Moreno García, M.N., González Robledo, J., Martín González, F., Sánchez Hernández, F., Sánchez Barba, M. (2014). Machine Learning Methods for Mortality Prediction of Polytraumatized Patients in Intensive Care Units – Dealing with Imbalanced and High-Dimensional Data. In: Corchado, E., Lozano, J.A., Quintián, H., Yin, H. (eds) Intelligent Data Engineering and Automated Learning – IDEAL 2014. IDEAL 2014. Lecture Notes in Computer Science, vol 8669. Springer, Cham. https://doi.org/10.1007/978-3-319-10840-7_38


DOI: https://doi.org/10.1007/978-3-319-10840-7_38
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10839-1
Online ISBN: 978-3-319-10840-7
eBook Packages: Computer Science, Computer Science (R0)
