Abstract
Accurate prediction of future onset of disease from Electronic Health Records (EHRs) has important clinical and economic implications. In this domain the arrival of data comes at semi-irregular intervals and makes the prediction task challenging. We propose a method called multiplicative-forest point processes (MFPPs) that learns the rate of future events based on an event history. MFPPs join previous theory in multiplicative forest continuous-time Bayesian networks and piecewise-continuous conditional intensity models. We analyze the advantages of using MFPPs over previous methods and show that on synthetic and real EHR forecasting of heart attacks, MFPPs outperform earlier methods and augment off-the-shelf machine learning algorithms.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Nodelman, U., Shelton, C.R., Koller, D.: Learning continuous time Bayesian networks. In: Uncertainty in Artificial Intelligence (2003)
Gunawardana, A., Meek, C., Xu, P.: A model for temporal dependencies in event streams. In: Advances in Neural Information Processing Systems (2011)
Weiss, J., Natarajan, S., Page, D.: Multiplicative forests for continuous-time processes. In: Advances in Neural Information Processing Systems 25, pp. 467–475 (2012)
Brown, E.N., Kass, R.E., Mitra, P.P.: Multiple neural spike train data analysis: state-of-the-art and future challenges. Nature Neuroscience 7(5), 456–461 (2004)
Engle, R.: The econometrics of ultra-high-frequency data. Econometrica 68(1), 1–22 (2000)
Diggle, P., Rowlingson, B.: A conditional approach to point process modelling of elevated risk. Journal of the Royal Statistical Society. Series A (Statistics in Society), 433–440 (1994)
Nodelman, U., Shelton, C., Koller, D.: Continuous time Bayesian networks. In: Uncertainty in Artificial Intelligence, pp. 378–387. Morgan Kaufmann Publishers Inc. (2002)
Rajaram, S., Graepel, T., Herbrich, R.: Poisson-networks: A model for structured point processes. In: AI and Statistics, vol. 10 (2005)
Ripley, B.: The second-order analysis of stationary point processes. Journal of Applied Probability, 255–266 (1976)
Wilson, P., D’Agostino, R., Levy, D., Belanger, A., Silbershatz, H., Kannel, W.: Prediction of coronary heart disease using risk factor categories. Circulation 97(18), 1837–1847 (1998)
Tzoulaki, I., Liberopoulos, G., Ioannidis, J.: Assessment of claims of improved prediction beyond the Framingham risk score. JAMAÂ 302(21), 2345 (2009)
Weiss, J., Natarajan, S., Peissig, P., McCarty, C., Page, D.: Machine learning for personalized medicine: predicting primary myocardial infarction from electronic health records. AI Magazine 33(4), 33 (2012)
McCarty, C.A., Wilke, R.A., Giampietro, P.F., Wesbrook, S.D., Caldwell, M.D.: Marshfield clinic personalized medicine research project (pmrp): design, methods and recruitment for a large population-based biobank. Personalized Medicine 2(1), 49–79 (2005)
Breiman, L.: Random forests. Machine Learning 45(1), 5–32 (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Weiss, J.C., Page, D. (2013). Forest-Based Point Process for Event Prediction from Electronic Health Records. In: Blockeel, H., Kersting, K., Nijssen, S., Železný, F. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2013. Lecture Notes in Computer Science(), vol 8190. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40994-3_35
Download citation
DOI: https://doi.org/10.1007/978-3-642-40994-3_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40993-6
Online ISBN: 978-3-642-40994-3
eBook Packages: Computer ScienceComputer Science (R0)