ABSTRACT
Healthcare research priorities predictive modelling accuracy and effectiveness, especially in cardiovascular disease (CVD) risk assessment and early detection. This transformative study uses advanced data mining techniques to build an unmatched predictive model with 99.37% accuracy. This novel hybrid framework relies on the powerful Random Forest algorithm, which is adaptable and robust. The Random Forest paradigm, which trains many decision trees on a random subset of the data, has been successful in uncovering complex feature interactions and data relationships. The above trait allows it to navigate the complex CVD risk assessment conundrum. Due to its ability to iteratively improve performance, Gradient Boosting is added to the Random Forest framework. Gradient Boosting is used to improve model prediction in this study. This method refines predictive performance by iteratively fixing previous model errors. This study uses an iterative refinement process to capture even the smallest dataset nuances, improving analysis precision. Creating the hybrid model in this study required skill and planning. Random Forest's ability to understand complex feature interactions and Gradient Boosting's iterative precision improve its prediction. The convergence of data mining methods creates a predictive model that outperforms its components, predicting cardiovascular disease risk with 99.37% accuracy. This revolutionary hybrid model has two major implications. This tool helps doctors identify potential cases of CVD and implement personalized care strategies by assessing risk accurately and quickly. This phenomenon also shows the potential that data mining's complex methods unleash when seamlessly integrated. This harmonious integration helps explore uncharted territories and opens new predictive modelling precision frontiers. The convergence of Random Forest and Gradient Boosting methods for cardiovascular disease risk assessment advances predictive modelling. A novel hybrid model with 98.737% accuracy is presented in this study. This model solves complex cardiovascular disease detection and risk mitigation problems, making it a major data mining advancement. By using this innovative approach, data mining techniques can be used to address CVD's formidable challenges. The concept is a symbol that guides the healthcare community towards better diagnostic precision and patient-centered care.
- WHO, “Cardiovascular diseases (CVDs).” .Google Scholar
- C. Krittanawong, H. J. Zhang, Z. Wang, M. Aydar, and T. Kitai, “Artificial Intelligence in Precision Cardiovascular Medicine,” J. Am. Coll. Cardiol., vol. 69, no. 21, pp. 2657–2664, 2017, doi: 10.1016/j.jacc.2017.03.571.Google ScholarCross Ref
- M. Swathy and K. Saruladha, “A comparative study of classification and prediction of Cardio-Vascular Diseases (CVD) using Machine Learning and Deep Learning techniques,” ICT Express, vol. 8, no. 1, pp. 109–116, 2022, doi: 10.1016/j.icte.2021.08.021.Google ScholarCross Ref
- J. Azmi, M. Arif, M. T. Nafis, M. A. Alam, S. Tanweer, and G. Wang, “A systematic review on machine learning approaches for cardiovascular disease prediction using medical big data,” Med. Eng. Phys., vol. 105, no. February, p. 103825, 2022, doi: 10.1016/j.medengphy.2022.103825.Google ScholarCross Ref
- H. B. Kibria and A. Matin, “The severity prediction of the binary and multi-class cardiovascular disease − A machine learning-based fusion approach,” Comput. Biol. Chem., vol. 98, no. November 2021, p. 107672, 2022, doi: 10.1016/j.compbiolchem.2022.107672.Google ScholarDigital Library
- J. Moon, H. F. Posada-Quintero, and K. H. Chon, “A literature embedding model for cardiovascular disease prediction using risk factors, symptoms, and genotype information,” Expert Syst. Appl., vol. 213, no. PA, p. 118930, 2023, doi: 10.1016/j.eswa.2022.118930.Google ScholarDigital Library
- U. Ahmed, J. C. W. Lin, and G. Srivastava, “Multivariate time-series sensor vital sign forecasting of cardiovascular and chronic respiratory diseases,” Sustain. Comput. Informatics Syst., vol. 38, no. December 2021, 2023, doi: 10.1016/j.suscom.2023.100868.Google ScholarCross Ref
- J. Femila Roseline, G. B. S. R. Naidu, V. Samuthira Pandi, S. Alamelu alias Rajasree, and D. N. Mageswari, “Autonomous credit card fraud detection using machine learning approach☆,” Comput. Electr. Eng., vol. 102, no. December 2021, p. 108132, 2022, doi: 10.1016/j.compeleceng.2022.108132.Google ScholarDigital Library
- W. Yu , “Associations between residential greenness and the predicted 10-year risk for atherosclerosis cardiovascular disease among Chinese adults,” Sci. Total Environ., vol. 868, no. July 2022, p. 161643, 2023, doi: 10.1016/j.scitotenv.2023.161643.Google ScholarCross Ref
- I. El Boujnouni, B. Harouchi, A. Tali, S. Rachafi, and Y. Laaziz, “Automatic diagnosis of cardiovascular diseases using wavelet feature extraction and convolutional capsule network,” Biomed. Signal Process. Control, vol. 81, no. May 2022, p. 104497, 2023, doi: 10.1016/j.bspc.2022.104497.Google ScholarCross Ref
- P. Srinivas and R. Katarya, “hyOPTXg: OPTUNA hyper-parameter optimization framework for predicting cardiovascular disease using XGBoost,” Biomed. Signal Process. Control, vol. 73, no. November 2021, p. 103456, 2022, doi: 10.1016/j.bspc.2021.103456.Google ScholarCross Ref
- S. Lv , “Long-term effects of particulate matter on incident cardiovascular diseases in middle-aged and elder adults: The CHARLS cohort study,” Ecotoxicol. Environ. Saf., vol. 262, no. June, p. 115181, 2023, doi: 10.1016/j.ecoenv.2023.115181.Google ScholarCross Ref
- J. O. Klompmaker , “Long-term exposure to specific humidity and cardiovascular disease hospitalizations in the US Medicare population,” ISEE Conf. Abstr., vol. 2021, no. 1, p. 108182, 2021, doi: 10.1289/isee.2021.p-597.Google ScholarCross Ref
- Y. Tian , “Short-term exposure to reduced specific-size ambient particulate matter increase the risk of cause-specific cardiovascular disease: A national-wide evidence from hospital admissions,” Ecotoxicol. Environ. Saf., vol. 263, no. August, p. 115327, 2023, doi: 10.1016/j.ecoenv.2023.115327.Google ScholarCross Ref
- N. Kumaraswamy, M. K. Markey, J. C. Barner, and K. Rascati, “Feature engineering to detect fraud using healthcare claims data,” Expert Syst. Appl., vol. 210, no. August, p. 118433, 2022, doi: 10.1016/j.eswa.2022.118433.Google ScholarDigital Library
- M. E. Hossain, S. Uddin, and A. Khan, “Network analytics and machine learning for predictive risk modelling of cardiovascular disease in patients with type 2 diabetes,” Expert Syst. Appl., vol. 164, no. April 2020, p. 113918, 2021, doi: 10.1016/j.eswa.2020.113918.Google ScholarCross Ref
- B. P. Doppala, D. Bhattacharyya, M. Chakkravarthy, and T. hoon Kim, “A hybrid machine learning approach to identify coronary diseases using feature selection mechanism on heart disease dataset,” Distrib. Parallel Databases, vol. 41, no. 1–2, pp. 1–20, 2023, doi: 10.1007/s10619-021-07329-y.Google ScholarDigital Library
- C. Guo, J. Zhang, Y. Liu, Y. Xie, Z. Han, and J. Yu, “Recursion Enhanced Random Forest with an Improved Linear Model (RERF-ILM) for Heart Disease Detection on the Internet of Medical Things Platform,” IEEE Access, vol. 8, pp. 59247–59256, 2020, doi: 10.1109/ACCESS.2020.2981159.Google ScholarCross Ref
- N. L. Fitriyani, M. Syafrudin, G. Alfian, and J. Rhee, “HDPM: An Effective Heart Disease Prediction Model for a Clinical Decision Support System,” IEEE Access, vol. 8, pp. 133034–133050, 2020, doi: 10.1109/ACCESS.2020.3010511.Google ScholarCross Ref
- G. T. Reddy, M. P. K. Reddy, K. Lakshmanna, D. S. Rajput, R. Kaluri, and G. Srivastava, “Hybrid genetic algorithm and a fuzzy logic classifier for heart disease diagnosis,” Evol. Intell., vol. 13, no. 2, pp. 185–196, 2020, doi: 10.1007/s12065-019-00327-1.Google ScholarCross Ref
- S. ULIANOVA, “Cardiovascular Disease dataset | Kaggle.” [Online]. Available: https://www.kaggle.com/datasets/sulianova/cardiovascular-disease-dataset.Google Scholar
- Singh, U. P., Saxena, V., Kumar, A., Bhari, P., & Saxena, D. (2022, December). Unraveling the Prediction of Fine Particulate Matter over Jaipur, India using Long Short-Term Memory Neural Network. In Proceedings of the 4th International Conference on Information Management & Machine Intelligence (pp. 1-5).Google ScholarDigital Library
- Kumar, A., Bhari, P. L., Singh, U. P., & Saxena, V. (2022, December). Comparative Study of different Machine Learning Algorithms to Analyze Sentiments with a Case Study of Two Person's Microblogs on Twitter. In Proceedings of the 4th International Conference on Information Management & Machine Intelligence (pp.1-6).Google ScholarDigital Library
- Saxena, V., Saxena, D., & Singh, U. P. (2022, December). Security Enhancement using Image verification method to Secure Docker Containers. In Proceedings of the 4th International Conference on Information Management & Machine Intelligence (pp. 1-5).Google ScholarDigital Library
- Chauhan, M., Malhotra, R., Pathak, M., & Singh, U. P. (2012). Different aspects of cloud security. International Journal of Engineering Research and Applications, 2, 864-869.Google Scholar
- Mittal, A. K., Singh, U. P., Tiwari, A., Dwivedi, S., Joshi, M. K., & Tripathi, K. C. (2015). Short-term predictions by statistical methods in regions of varying dynamical error growth in a chaotic system. Meteorology and Atmospheric Physics, 127, 457-465.Google Scholar
- Singh, U. P., Mittal, A. K., Dwivedi, S., & Tiwari, A. (2015). Predictability study of forced Lorenz model: an artificial neural network approach. History, 40(181), 27-33.Google Scholar
- Singh, U. P., Mittal, A. K., Dwivedi, S., & Tiwari, A. (2020). Evaluating the predictability of central Indian rainfall on short and long timescales using theory of nonlinear dynamics. Journal of water and Climate Change, 11(4), 1134-1149.Google Scholar
- Singh, U., Pathak, M., Malhotra, R., & Chauhan, M. (2012). Secure communication protocol for ATM using TLS handshake. Journal of Engineering Research and Applications (IJERA), 2(2), 838-948.Google Scholar
- Singh, U. P., & Mittal, A. K. (2021). Testing reliability of the spatial Hurst exponent method for detecting a change point. Journal of Water and Climate Change, 12(8), 3661-3674.Google Scholar
- Tiwari, A., Mittal, A. K., Dwivedi, S., & Singh, U. P. (2015). Nonlinear time series analysis of rainfall over central Indian region using CMIP5 based climate model. Climate Change, 1(4), 411-417.Google Scholar
Recommendations
An agent-based self risk assessment and monitoring system for cardiovascular disease patients
Telehealth '07: The Third IASTED International Conference on TelehealthCardiovascular disease has become one of the most critical chronic diseases today. In this paper, we proposed an agent-based system for cardiovascular disease patients to assess the risk and to monitor their health situation by themselves. There are two ...
Artificial intelligence framework for predictive cardiovascular and stroke risk assessment models: A narrative review of integrated approaches using carotid ultrasound
Abstract Recent findingsCardiovascular disease (CVD) is the leading cause of mortality and poses challenges for healthcare providers globally. Risk-based approaches for the management of CVD are becoming popular for recommending ...
Highlights- Cardiovascular disease (CVD) risk prediction algorithms are becoming popular for recommending treatment plans for asymptomatic individuals.
Ensemble framework for cardiovascular disease prediction
AbstractHeart disease is the major cause of non-communicable and silent death worldwide. Heart diseases or cardiovascular diseases are classified into four types: coronary heart disease, heart failure, congenital heart disease, and ...
Highlights- The proposed framework consists of Stacking Based Ensemble learning which adds diversity to the classifier.
Comments