Skip to main content

Comparing Performance of Ensemble-Based Machine Learning Algorithms to Identify Potential Obesity Risk Factors from Public Health Datasets

  • Conference paper
  • First Online:
Emerging Technologies in Data Mining and Information Security

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1286))

Abstract

Societal factors such as globalization, supermarket growth, rapid unplanned urbanization, sedentary lifestyle, economical distribution, and social position gradually develop behavioral risk factors in humans. Behavioral risk factors are unhealthy habits (consumption of tobacco and alcohol), improper diet (consumption of high calorific discretionary fast foods, sweet beverages), and physical inactivity. The behavioral risks may lead to physiological risks, body–energy imbalance. Obesity is one of the foremost lifestyle diseases that leads to other health conditions, such as cardiovascular disease (CVDs), chronic obstructive pulmonary disease (COPD), cancer, diabetes type II, hypertension, and depression. It is not restricted within the boundary of age and socio-economic background. “World health organization (WHO)” has predicted that lifestyle diseases will claim 71–73% of the global death, by the end of 2020. It can be prevented with proper identification of associated risk factors and appropriate behavioral intervention plans. The key determinants of obesity are—a. age, b. weight, c. height, and d. body mass index (BMI). This paper addresses the potential of ensemble machine learning approaches to assess the associated risk factors of obesity through the evaluation of existing, publicly accessible health datasets, such as “Kaggle”, and “UCI”. Followed by, we compared our identified risk factors with the obtained risk factors from literature study. In future, we are intending to reuse the obtained knowledge to collect data from a controlled trial of adult population (age between 20 and 60) in south Norway to generate personalized, contextual, and behavioral recommendations with a smart electronic coaching (eCoaching) system for behavioral intervention for the promotion of healthy lifestyle.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Butler, É.M., et al.: Prediction models for early childhood obesity: applicability and existing issues. In: Hormone Research in Paediatrics, pp. 358–367 (2018)

    Google Scholar 

  2. Singh, B., Tawfik, H.: A machine learning approach for predicting weight gain risks in young adults. In: 2019 10th International Conference on Dependable Systems, Services and Technologies (DESSERT), pp. 231–234 IEEE (2019)

    Google Scholar 

  3. Grabner, M.: BMI trends, socioeconomic status, and the choice of dataset. In: Obesity Facts, pp. 112–126 (2012)

    Google Scholar 

  4. WHO page. https://www.who.int/news-room/fact-sheets/detail/obesity-and-overweight

  5. Csige, I., Ujvárosy, D., Szabó, Z., Lőrincz, I., Paragh, G., Harangi, M., Somodi, S.: The impact of obesity on the cardiovascular system. J. Diabet. Res. (2018)

    Google Scholar 

  6. Gerdes, M., Martinez, S., Tjondronegoro, D.: Conceptualization of a personalized ecoach for wellness promotion. In: Proceedings of the 11th EAI International Conference on Pervasive Computing Technologies for Healthcare, pp. 365–374 (2017)

    Google Scholar 

  7. Chatterjee, A., Gerdes, M.W., Martinez, S.: eHealth initiatives for the promotion of healthy lifestyle and allied implementation difficulties. In: 2019 International Conference on Wireless and Mobile Computing, Networking and Communications (WiMob), pp. 1–8. IEEE (2019)

    Google Scholar 

  8. Chatterjee, A., Gerdes, M.W., Martinez, S.G.: Identification of risk factors associated with obesity and overweight—a machine learning overview. Sensors 20(9), 2734 (2020)

    Article  Google Scholar 

  9. Padmanabhan, M., Yuan, P., Chada, G., Van Nguyen, H.: Physician-friendly machine learning: a case study with cardiovascular disease risk prediction. J. Clin. Med., 1050 (2019)

    Google Scholar 

  10. Selya, A.S., Anshutz, D.: Machine learning for the classification of obesity from dietary and physical activity patterns. In: Advanced Data Analytics in Health, pp. 77–97. Springer, Cham (2018)

    Google Scholar 

  11. Jindal, K., Baliyan, N., Rana, P.S.: Obesity prediction using ensemble machine learning approaches. In: Recent Findings in Intelligent Computing Techniques, pp. 355–362. Singapore (2018)

    Google Scholar 

  12. Schapire, R.E., Freund, Y.: Boosting: foundations and algorithms. In: Kybernetes (2013)

    Google Scholar 

  13. Brandt, S.: Statistical and computational methods in data analysis. No. 04; QA273, B73 1976. In: Amsterdam: North-Holland Publishing Company (1976)

    Google Scholar 

  14. Sklearn page. https://scikit-learn.org/stable/supervised_learning.html

  15. Kaggle data page. https://www.kaggle.com/data

  16. Eating-health-module-dataset description. https://www.bls.gov/tus/ehmintcodebk1416.pdf

  17. Chatterjee, A., Gerdes, M.W., Martinez, S.G.: Statistical explorations and univariatetimeseries Analysis on COVID-19 datasets to understand the trend of disease spreading and death. Sensors 20(11), 3089 (2020)

    Article  Google Scholar 

  18. Python page. https://docs.python.org/

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ayan Chatterjee .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Chatterjee, A., Gerdes, M.W., Prinz, A., Martinez, S.G. (2021). Comparing Performance of Ensemble-Based Machine Learning Algorithms to Identify Potential Obesity Risk Factors from Public Health Datasets. In: Hassanien, A.E., Bhattacharyya, S., Chakrabati, S., Bhattacharya, A., Dutta, S. (eds) Emerging Technologies in Data Mining and Information Security. Advances in Intelligent Systems and Computing, vol 1286. Springer, Singapore. https://doi.org/10.1007/978-981-15-9927-9_26

Download citation

Publish with us

Policies and ethics