Skip to main content

Advertisement

Log in

Time to CARE: a collaborative engine for practical disease prediction

  • Published:
Data Mining and Knowledge Discovery Aims and scope Submit manuscript

Abstract

The monumental cost of health care, especially for chronic disease treatment, is quickly becoming unmanageable. This crisis has motivated the drive towards preventative medicine, where the primary concern is recognizing disease risk and taking action at the earliest signs. However, universal testing is neither time nor cost efficient. We propose CARE, a Collaborative Assessment and Recommendation Engine, which relies only on patient’s medical history using ICD-9-CM codes in order to predict future disease risks. CARE uses collaborative filtering methods to predict each patient’s greatest disease risks based on their own medical history and that of similar patients. We also describe an Iterative version, ICARE, which incorporates ensemble concepts for improved performance. Also, we apply time-sensitive modifications which make the CARE framework practical for realistic long-term use. These novel systems require no specialized information and provide predictions for medical conditions of all kinds in a single run. We present experimental results on a large Medicare dataset, demonstrating that CARE and ICARE perform well at capturing future disease risks.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  • Burges C (1998) A tutorial on support vector machines for pattern recognition. Data Min Knowl Disc 2(2): 121–167

    Article  Google Scholar 

  • Barabasi A-L (2007) Network medicine—from obesity to the dieaseasome. N Engl J Med 357: 404–407

    Article  Google Scholar 

  • Breese JS, Heckerman D, Kadie C (1998) Empirical analysis of predictive algorithms for collaborative filtering. Technical Report MSR-TR-98-12, Microsoft Research, May

  • Cherry DK, Burt CW, Woodwell D (2001) A national ambulatory medical care survey: 2001 summary. Adv Data 337: 1–16

    Google Scholar 

  • Christakis NA, Allison PD (2006) Mortality after the hospitalization of a spouse. N Engl J Med 354(7): 719–730

    Article  Google Scholar 

  • Cordn O, Herrera F, de la Montab́1a J, Sd́fnchez A, Villar P (2002) A prediction system for cardiovascularity diseases using genetic fuzzy rule-based systems. In: Proceedings of the 8th Ibero–American Conference on AI, pp 381–391. Springer, Berlin

  • Coyle P, Hartung H-P (2002) Use of interferon beta in multiple sclerosis: rationale for early treatment and evidence of dose- and frequency-dependent effects on clinical response. Multiple Scler 8(1): 2–9

    Article  Google Scholar 

  • Davis D, Chawla NV, Blumm N, Christakis N, Barabasi A-L (2008a) Care for your future: prospective disease prediction using collaborative filtering. In: Proceedings of the KDD 2008 workshop on mining medical data

  • Davis D, Chawla NV, Blumm N, Christakis N, Barabasi A-L (2008b) Predicting individual disease risk based on medical history. In: Proceedings of the ACM conference on information and knowledge management

  • Dietterich TG (2000) Ensemble methods in machine learning. In: Proceedings of the first international workshop on multiple classifier systems, pp 1–15, June

  • Edelman D et al (2006) A multidimensional integrative medicine intervention to improve cardiovascular risk. J Gen Intern Med 21(7): 728–734

    Article  Google Scholar 

  • Glasgow RE et al (2001) Does the chronic care model serve also as a template for improving prevention?. Milbank Q 79(4): 579–612

    Article  Google Scholar 

  • Goldberg K, Roeder T, Gupta D, Perkins C (2000) Eigentaste: a constant time collaborative filtering algorithm. Technical report, University of California, Berkley, August

  • Grcar M, Fortuna B, Mladenic D (2005) Knn versus svm in the collaborative filtering framework. In: WebKDD August

  • Heckerman D, Chickering DM, Meek C, Rounthwaite R, Kadie C (2001) Dependency networks for inference, collaborative filtering, and data visualization. Technical Report MSR-TR-2000-16, Microsoft Research, February

  • Herlocker JL, Konstan JA, Terveen LG, Riedl JT (2004) Evaluating collaborative filtering recommender systems. ACM Trans Inf Sys 22: 5–53

    Article  Google Scholar 

  • Hofmann T (2004) Latent semantic models for collaborative filtering. ACM Trans Inf Sys 22: 89–115

    Article  Google Scholar 

  • Hofmann T, Puzicha J (1999) Latent class models for collaborative filtering. In: Proceedings of the 16th international joint conference on artificial intelligence, pp 688–693

  • Hunt J, Kristal A, White E, Lynch J, Fries E (1995) Physician recommendations for dietary change: their prevalence and impact in a population-based sample. Am J Public Health 85: 722–726

    Article  Google Scholar 

  • Kahn CE Jr (2005) Collaborative filtering to improve navigation of large radiology knowledge resources. J Digit Imaging 18(2): 131–137

    Article  Google Scholar 

  • Kannel WB, Dawber TR, Kagan A, Revotskie N, Stokes J III (1961) Factors of risk in the development of coronary heart disease: six-year follow-up experience: The Framingham Study. Ann Intern Med 55: 33–50

    Google Scholar 

  • Koertge J et al (2003) Improvement in medical risk factors and quality of life in women and men with coronary heart disease in the Multicenter Lifestyle Demonstration Project. Am J Cardiol 91(11): 1316–1322

    Article  Google Scholar 

  • Konstan JA, Miller BN, Maltz D, Herlocker JL, Gordon LR, Riedl J (1997) Grouplens: applying collaborative filtering to usenet news. Commun ACM 40: 77–87

    Article  Google Scholar 

  • Lauderdale DS, Furner SE, Miles TP, Goldberg J (1993) Epidemiologic uses of medicare data. Epidemiol Rev 15: 319–327

    Google Scholar 

  • Liu Y, Teverovskiy L, Lopez O, Aizenstein H, Meltzer C, Becker J (2007) Discovery of biomarkers for alzheimer’s disease prediction from structural mr images. In: 2007 IEEE international symposium on biomedical imaging, April

  • Loscalzo J (2007) Association studies in an era of too much information - clinical analysis of new biomarker andgenetic data. Circulation 116(17): 1866–1870

    Article  Google Scholar 

  • Loscalzo J, Kohane I, Barabasi A-L (2007) Human disease classification in the postgenomic era. Mol Syst Biol

  • Mitchell JB, Bubolz T, Paul JE, Pashos CI, Escarce JJ, Muhlbaier LH, Wiesman JM, Young WW, Epstein RS, Javitt JC (1994) Using medicare claims for outcomes research. Medical Care 32: 38–51

    Article  Google Scholar 

  • Mould R (2003) Prediction of long-term survival rates of cancer patients. Lancet 361: 262

    Article  Google Scholar 

  • NC for Health Statistics (2007) International Classification of Diseases, 9th Revision, Clinical modification (icd-9-cm). http://www.cdc.gov/nchs/about/otheract/icd9/abticd9.htm

  • NC Institute (2007) Cancer trends progress report—2007 update

  • Paterek A (2007) Improving regularized singular value decomposition for collaborative filtering. In: KDDCup August

  • Pennock DM, Horvitz E (1999) Collaborative filtering by personality diagnosis: a hybrid memory- and model-based approach. In: Proceedings of the IJCAI workshop on machine learning for information filtering

  • Resnick P, Iancovou N, Sushak M, Bergstrom P, Riedl J (1994) Grouplens: an open architecture for collaborative filtering of netnews. In: Proceedings of the ACM conference on computer supported cooperative, pp 175–186

  • Salton G, McGill M (1983) Introduction to modern information retrieval. McGraw-Hill, New York

    MATH  Google Scholar 

  • Shardanand U, Maes P (1995) Social information filtering: algorithms for automating “word of mouth”. In: Proceedings of the computer human interaction, pp 210–217, May

  • Si L, Jin R (2003) Flexible mixture model for collaborative filtering. In: Proceedings of ICML

  • Snyderman R, Williams RS (2003) Prospective medicine: the next health care transformation. Future Med

  • Starfield B, Lemke KW, Bernhardt T, Foldes SS, Forrest CB, Weiner JP (2003) Comorbidity: implications for the the importance of primary care in case management. Ann Fam Med 1: 8–14

    Article  Google Scholar 

  • van den Akker M, Buntinx F, Metsemakers JF, Roos S, Knottnerus JA (1998) Multimorbidity in general practice: prevalence, incidence, and determinants of co-occuring chronic and recurrent diseases. J Clin Epidemiol 51: 367–375

    Article  Google Scholar 

  • Weston AD, Hood L (2004) Systems biology, proteomics, and the future of health care: toward predictive, preventative, and personalized medicine. J Proteome Res 3(2): 179–196

    Article  Google Scholar 

  • Wong DT, Knaus WA (1991) Predicting outcome in critical care: the current status of the apache prognostic scoring system. Can J Anesth 38: 374–383

    Article  Google Scholar 

  • WTC Consortium (2007) Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature 447:661–678

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nitesh V. Chawla.

Additional information

Responsible editor: R. Bharat Rao and Romer Rosales.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Davis, D.A., Chawla, N.V., Christakis, N.A. et al. Time to CARE: a collaborative engine for practical disease prediction. Data Min Knowl Disc 20, 388–415 (2010). https://doi.org/10.1007/s10618-009-0156-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10618-009-0156-z

Keywords

Navigation