Skip to main content

Advertisement

Log in

Towards Building a Global Robust Model for Heart Disease Detection

  • Original Research
  • Published:
SN Computer Science Aims and scope Submit manuscript

Abstract

Heart disease is one of the most menacing non-communicable diseases. Heart disease prediction is one of the most popular topics in the field of biomedical engineering. Almost all the models for heart disease detection are based on centralized training of data where data are concentrated in a single server. Centralized training of the model may give a good accuracy to predict/detect, yet the models are not robust as only a single dataset is used to train and test the model. This is a major problem with centralized training. Therefore, in order to solve this problem of robustness, we decided to use and simulate federated learning (FL) to build a heart disease detection model through which we can use multiple datasets from different regions to train the model. We created four virtual workers where each one is representing a hospital present in a different location containing the local data. We have used the dataset downloaded from IEEE Dataport. It is a combination of five different datasets, including the Cleveland, Hungarian, Switzerland, Long Beach VA, and Statlog Heart Dataset. The final dataset contains 11 input features and the output label. We divided the dataset into training and testing data. The training data are divided into four sub-datasets using both IID (Independent and Identically Distributed) and non-IID approaches. Then, the sub-datasets are sent to these four hospitals differently for both approaches. We have used the artificial neural network as the FL base model. We trained both the IID and non-IID models and then tested the models with test data and got an accuracy of 84.87 and 86.55% for the IID and non-IID approaches, respectively. We have also checked the AUC score, sensitivity, and specificity for the IID and non-IID approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Data availability

The data that support the findings of this study are openly available at https://www.kaggle.com/datasets/sid321axn/heart-statlog-cleveland-hungary-final, [20].

References

  1. Cardiovascular diseases. Available at: https://www.who.int/india/health-topics/cardiovascular-diseases. Accessed: 2022.

  2. Zhang C, Xie Y, Bai H, Yu B, Li W, Gao Y. A survey on federated learning. Knowl-Based Syst. 2021;216: 106775.

    Article  Google Scholar 

  3. Brisimi TS, Chen R, Mela T, Olshevsky A, Paschalidis IC, Shi W. Federated learning of predictive models from federated electronic health records. Int J Med Inform. 2018;112:59–67.

    Article  Google Scholar 

  4. Li Q, Wen Z, Wu Z, Hu S, Wang N, Li Y, Liu X, He B. A survey on federated learning systems: vision, hype and reality for data privacy and protection. IEEE Trans Knowl Data Eng. 2021.

  5. Saxena Purushottam K, Sharma R. Efficient heart disease prediction system. Procedia Comput Sci. 2016;85:962–9 (International Conference on Computational Modelling and Security (CMS 2016)).

    Article  Google Scholar 

  6. Nikhar S, Karandikar AM. Prediction of heart disease using machine learning algorithms. Intl J Adv Eng Manag Sci. 2016;2(6): 239484.

    Google Scholar 

  7. Karayılan T, Kılıç Ö. Prediction of heart disease using neural network. In: 2017 International Conference on Computer Science and Engineering (UBMK). IEEE; 2017. p. 719–723.

  8. Khateeb N, Usman M. Efficient heart disease prediction system using k-nearest neighbor classification technique. In: Proceedings of the International Conference on Big Data and Internet of Thing, 2017, p. 21–6.

  9. Dinesh KG, Arumugaraj K, Santhosh KD, Mareeswari V. Prediction of cardiovascular disease using machine learning algorithms. In: 2018 International Conference on Current Trends towards Converging Technologies (ICCTCT). IEEE; 2018. p. 1–7.

  10. Repaka AN, Ravikanti SD, Franklin RG. Design and implementing heart disease prediction using Naives Bayesian. In: 2019 3rd International Conference on Trends in Electronics and Informatics (ICOEI), 2019, p. 292–297.

  11. Mohan S, Thirumalai C, Srivastava G. Effective heart disease prediction using hybrid machine learning techniques. IEEE Access. 2019;7:81542–54.

    Article  Google Scholar 

  12. Ali F, El-Sappagh S, Islam SMR, Kwak D, Ali A, Imran M, Kwak K-S. A smart healthcare monitoring system for heart disease prediction based on ensemble deep learning and feature fusion. Inf Fusion. 2020;63:208–22.

    Article  Google Scholar 

  13. Shah D, Patel S, Bharti SK. Heart disease prediction using machine learning techniques. SN Comput Sci. 2020;1(6):1–6.

    Article  Google Scholar 

  14. Kavitha M, Gnaneswar G, Dinesh R, Sai YR, Sai SR. Heart disease prediction using hybrid machine learning model. In: 2021 6th International Conference on Inventive Computation Technologies (ICICT). IEEE; 2021. p. 1329–33.

  15. Sajja GS, Mustafa M, Phasinam K, Kaliyaperumal K, Ventayen Randy JM, Kassanuk T. Towards application of machine learning in classification and prediction of heart disease. In: 2021 Second International Conference on Electronics and Sustainable Communication Systems (ICESC). IEEE; 2021. p. 1664–9.

  16. Saranya G, Pravin A. A novel feature selection approach with integrated feature sensitivity and feature correlation for improved prediction of heart disease. J Ambient Intell Humaniz Comput. 2022: 1–15.

  17. Grama M, Musat M, Muñoz-González L, Passerat-Palmbach J, Rueckert D, Alansary A. Robust aggregation for adaptive privacy preserving federated learning in healthcare. 2020. arXiv preprint arXiv:2009.08294.

  18. Li J, Meng Y, Ma L, Du S, Zhu H, Pei Q, Shen S. A federated learning based privacy-preserving smart healthcare system. IEEE Trans Indus Inform. 2021.

  19. Linardos A, Kushibar K, Walsh S, Gkontra P, Lekadir K. Federated learning for multi-center imaging diagnostics: a simulation study in cardiovascular disease. Sci Rep. 2022;12(1):1–12.

    Article  Google Scholar 

  20. Siddhartha M. Heart disease dataset (comprehensive). Kaggle Inc; 2019. https://www.kaggle.com/datasets/sid321axn/heart-statlog-cleveland-hungary-final

  21. McMahan B, Moore E, Ramage D, Hampson S, y Arcas BA. Communication-efficient learning of deep networks from decentralized data. In: Artificial intelligence and statistics. PMLR; 2017. p. 1273–82.

  22. Mothukuri V, Parizi RM, Pouriyeh S, Huang Y, Dehghantanha A, Srivastava G. A survey on security and privacy of federated learning. Future Gener Comput Syst. 2021;115:619–40.

    Article  Google Scholar 

  23. PyTorch. Available at: https://en.wikipedia.org/wiki/PyTorch. Accessed: 2022.

  24. PySyft. Available at: https://blog.openmined.org/install/. Accessed: 2022.

  25. Li L, Fan Y, Tse M, Lin K-Y. A review of applications in federated learning. Comput Indus Eng. 2020;149: 106854.

    Article  Google Scholar 

  26. Rieke N, Hancox J, Li W, Milletari F, Roth HR, Albarqouni S, Bakas S, Galtier MN, Landman BA, Maier-Hein K, et al. The future of digital health with federated learning. NPJ Digit Med. 2020;3(1):1–7.

    Article  Google Scholar 

  27. Liu D, Miller T, Sayeed R, Mandl KD. FADL: federated-autonomous deep learning for distributed electronic health record. 2018. arXiv preprint arXiv:1811.11400.

  28. Nguyen PT, Ha DH, Avand M, Jaafari A, Al-Ansari N, Van Phong T, Sharma R, Kumar R, Le HV, Le HV, et al. Soft computing ensemble models based on logistic regression for groundwater potential mapping. Appl Sci. 2020;10(7):2469.

    Article  Google Scholar 

  29. Bahman AM, Ebrahim SA. Prediction of the minimum film boiling temperature using artificial neural network. Int J Heat Mass Transf. 2020;155:119834.

    Article  Google Scholar 

  30. Abdolrasol MGM, Hussain SM, Ustun TS, Sarker MR, Hannan MA, Mohamed R, Ali JA, Mekhilef S, Milad A. Artificial neural networks based optimization techniques: a review. Electronics. 2021;10(21):2689.

    Article  Google Scholar 

  31. Movassagh AA, Alzubi JA, Gheisari M, Rahimi M, Mohan S, Abbasi AA, Nabipour N. Artificial neural networks training algorithm integrating invasive weed optimization with differential evolutionary model. J Ambient Intell Humaniz Comput. 2021:1–9 .

  32. Hamadneh NN, Khan WA, Ashraf W, Atawneh SH, Khan I, Hamadneh BN. Artificial neural networks for prediction of Covid-19 in Saudi Arabia. Comput Mater Continua. 2021:2787–96.

  33. Bukhari MM, Alkhamees BF, Hussain S, Gumaei A, Assiri A, Ullah SS. An improved artificial neural network model for effective diabetes prediction. Complexity. 2021;2021.

  34. Bhambhvani HP, Zamora A, Shkolyar E, Prado K, Greenberg DR, Kasman AM, Liao J, Shah S, Srinivas S, Skinner EC, et al. Development of robust artificial neural networks for prediction of 5-year survival in bladder cancer. In: Urologic oncology: seminars and original investigations, vol. 39. Elsevier; 2021. p. 193–e7

  35. Hara P, Piekutowska M, Niedbała G. Selection of independent variables for crop yield prediction using artificial neural network models with remote sensing data. Land. 2021;10(6):609.

    Article  Google Scholar 

  36. Mwaura JI, Kenduiywo BK. County level maize yield estimation using artificial neural network. Model Earth Syst Environ. 2021;7(3):1417–24.

    Article  Google Scholar 

  37. Amin ZMM, Ali Assistant SAK. Climate change prediction using artificial neural network. In: IOP Conference Series. Earth and Environmental Science, vol. 961. IOP Publishing; 2022.

  38. Moghanlo S, Alavinejad M, Oskoei V, Saleh HN, Mohammadi AA, Mohammadi H, DerakhshanNejad Z. Using artificial neural networks to model the impacts of climate change on dust phenomenon in the Zanjan Region, North-West Iran. Urban Clim. 2021;35:100750.

    Article  Google Scholar 

  39. Avand M, Moradi HR, Ramazanzadeh Lasboyee M. Spatial prediction of future flood risk: an approach to the effects of climate change. Geosciences. 2021;11(1):25.

    Article  Google Scholar 

  40. Classification: ROC curve and AUC. Available at: https://developers.google.com/machine-learning/crash-course/classification/roc-and-auc. Accessed: 2022.

  41. Lee GH, Shin S-Y. Federated learning on clinical benchmark data: performance assessment. J Med Internet Res. 2020;22(10): e20891.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Pranav Kumar Singh.

Ethics declarations

Conflict of Interest

On behalf of all the authors, the corresponding author states that there is no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Mali, B., Saha, S., Brahma, D. et al. Towards Building a Global Robust Model for Heart Disease Detection. SN COMPUT. SCI. 4, 596 (2023). https://doi.org/10.1007/s42979-023-02083-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s42979-023-02083-7

Keywords

Navigation