Closed-Form Estimation of Multiple Change-Point Models

Greg Jensen

doi:10.7287/peerj.preprints.90v3

Closed-Form Estimation of Multiple Change-Point Models

Greg Jensen

Department of Psychology, Columbia University, New York, NY, United States

DOI: 10.7287/peerj.preprints.90v3

Published: 2013-12-09
Accepted: 2013-12-08

Subject Areas: Statistics
Keywords: change-point analysis, Bayesian statistics, time series analysis, marginal likelihood, model selection

Copyright: © 2013 Jensen
Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Cite this article: Jensen G. 2013. Closed-Form Estimation of Multiple Change-Point Models. PeerJ PrePrints 1:e90v3 https://doi.org/10.7287/peerj.preprints.90v3

Abstract

Identifying discontinuities (or change-points) in otherwise stationary time series is a powerful analytic tool. This paper outlines a general strategy for identifying an unknown number of change-points using elementary principles of Bayesian statistics. Using a strategy of binary partitioning by marginal likelihood, a time series is recursively subdivided on the basis of whether adding divisions (and thus increasing model complexity) yields a justified improvement in the marginal model likelihood. When this approach is combined with the use of conjugate priors, it yields the Conjugate Partitioned Recursion (CPR) algorithm, which identifies change-points without computationally intensive numerical integration. Using the CPR algorithm, methods are described for specifying change-point models drawn from a host of familiar distributions, both discrete (binomial, geometric, Poisson) and continuous (exponential, Gaussian, uniform, and multiple linear regression), as well as multivariate distribution (multinomial, multivariate normal, and multivariate linear regression). Methods by which the CPR algorithm could be extended or modified are discussed, and several detailed applications to data published in psychology and biomedical engineering are described.

Author Comment

See also the supplemental material, for further analytic examples, a sensitivity analysis of the described algorithm, implementation of two operational modifications, further mathematical support, and a Matlab package that includes all data presented in the manuscript and implements the algorithm as a function.

This version includes a number of small changes throughout the manuscript.

Supplemental Information

Example: Curvilinear vs. Linear Change-Point Analysis of Reaction Times

Reaction times from a single subject learning a psychophysical task, originally reported by Palmeri (1997). The dashed line corresponds to a four-parameter “learning curve,” reported by Heathcote et al. (2000), while the solid lines interpret the same data as approximately linear, with two change-points.

Supplemental Information

Example: Curvilinear vs. Linear Change-Point Analysis of Reaction Times

Prior vs. Posterior Probability Densities Given Different Hyperparameters

Maximum Likelihood vs. Marginal Model Likelihood

Uniform vs. Non-Uniform Event Times

Binomial Data, Uncorrected vs. Corrected With Respect To Small Sample Bias

Gaussian Data, Uncorrected vs. Corrected With Respect To Small Sample Bias

Applying the CPR Algorithm To A String of Binary Data

Initial Learning For 25 Simultaneous Chains

Reaction Times As A Function Of Trials And Task Difficulty

Stimulus-Specific Reaction Times as a Function of Trials

3D Position Data, Assessed With Respect To A Multivariate Normal Distribution

3D Movement Data Visualized With Respect To Time

Detailed View of 3D Movement Data, Given a Multivariate Normal Model

Detailed View of 3D Movement Data, Given a Multiple Linear Regression Model

Supplemental Material

Supplemental: British Coal-Mining Disaster Analysis

Supplemental: Well-Log Data

Supplemental: US Treasury Bill Data

Supplemental: Sensitivity Analysis with Respect to τ

Supplemental: Simulated Data Showcasing A Failure Condition For The CPR Algorithm

Supplemental: Simulation Data Analyzed Using the 'Dicing' Operation and the Forward-Retrospective Strategy

Matlab Implementation of the CPR Algorithm

Matlab Data and Instructional Vignettes

Add your feedback

Top referrals unique visitors

Share this preprint

Metrics

Download article