Abstract
Recurrent event data are frequently encountered in clinical and observational studies related to biomedical science, econometrics, reliability and demography. In some situations, recurrent events serve as important indicators for evaluating disease progression, health deterioration, or insurance risk. In statistical literature, non informative censoring is typically assumed when statistical methods and theories are developed for analyzing recurrent event data. In many applications, however, there may exist a terminal event, such as death, that stops the follow-up, and it is the correlation of this terminal event with the recurrent event process that is of interest. This work considers joint modeling and analysis of recurrent event and terminal event data, with the focus primarily on determining how the terminal event process and the recurrent event process are correlated (i.e. does the frequency of the recurrent event influence the risk of the terminal event). We propose a joint model of the recurrent event process and the terminal event, linked through a common subject-specific latent variable, in which the proportional intensity model is used for modeling the recurrent event process and the additive hazards model is used for modeling the terminal event time.
Similar content being viewed by others
References
Andersen PK, Gill RD (1982) Cox’s regression model for counting processes: a large sample study. Ann Stat 10:1100–1120
Andersen PK, Borgan O, Gill RD, Keiding N (1993) Statistical models based on counting process. Springer, New York
Breslow NE, Day NE (1987) Statistical methods in cancer research. Volume II: the design and analysis of cohort studies. IARC Sci Publ 82:1–406
Byar DP (1980) The Veterans administration study of chemoprophylaxis for recurrent stage I bladder tumors: comparisons of placebo, pyridoxine and topical thiotepa. In: Pavone-Macaluso M, Smith PH, Edsmyr F (eds) Bladder tumors and other topics in urological oncology. Plenum, New York, pp 363–370
Chang SH, Wang MC (1999) Conditional regression analysis for recurrence time data. J Am Stat Assoc 94:1221–1230
Chiang CL (1968) Introduction to stochastic processes in biostatistics. Wiley, New York
Cook RJ, Lawless JF (1997) Marginal analysis of recurrent events and a terminating event. Stat Med 16:911–924
Cook RJ, Lawless JF (2007) The statistical analysis of recurrent events. Springer, New York
Dunson DB, Herring AH (2005) Bayesian model selection and averaging in additive and proportional hazards models. Lifetime Data Anal 11:213–232
Gail MH, Santner TJ, Brown CC (1980) An analysis of comparative carcinogenesis experiments based on multiple times to tumor. Biometrics 36:255–266
Ghosh D (2004) Accelerated rates regression models for recurrent failure time data. Lifetime Data Anal 10:247–261
Ghosh D, Lin DY (2000) Nonparametric analysis of recurrent events and death. Biometrics 56:554–562
Ghosh D, Lin DY (2002) Marginal regression models for recurrent and terminal events. Stat Sin 12:663–688
Henderson R, Diggle P, Dobson A (2000) Joint modelling of longitudinal measurements and event time data. Biostatistics 1:465–480
Huang CY, Wang MC (2004) Joint modeling and estimation of recurrent event processes and failure time data. J Am Stat Assoc 99:1153–1165
Lawless JF, Nadeau C (1995) Some simple robust methods for the analysis of recurrent events. Technometrics 37:158–168
Lee EW, Wei LJ, Amato DA (1992) Cox-type regression analysis for large numbers of small groups of correlated failure time observations. In: Klein JP, Goel PK (eds) Survival analysis: state of the art. Kluwer Acad. Publ, Dordrecht, pp 237–247
Liang KY, Zeger SL (1986) Longitudinal data analysis using generalized linear models. Biometrika 73:13–22
Lin DY, Ying Z (1994) Semiparametric analysis of the additive risk model. Biometrika 81:61–71
Lin DY, Ying Z (2001) Semiparametric and nonparametric regression analysis of longitudinal data. J Am Stat Assoc 96:103–126
Lin DY, Wei LJ, Yang I, Ying Z (2000) Semiparametric regression for the mean and rate function of recurrent events. J R Stat Soc B 69:711–730
Lin DY, Wei LJ, Ying Z (2001) Semiparametric transformation models for point processes. J Am Stat Assoc 96:620–628
Lin H, Turnbull BW, McCulloch CE, Slate EH (2002) Latent class models for joint analysis of longitudinal biomarker and event process data: Application to longitudinal prostate-specific antigen readings and prostate cancer. J Am Stat Assoc 97:53–65
Liu L, Wolfe RA, Huang X (2004) Shared frailty models for recurrent events and a terminal event. Biometrics 60:747–756
Liu YY, Wu YS, Cai JW, Zhou HB (2010) Additive-multiplicative rates model for recurrent events. Lifetime Data Anal 16:353–373
Mazroui Y, Mathoulin-Pelissier S, Soubeyran P, Rondeau V (2010) General joint frailty model for recurrent event data with a dependent terminal event: application to follicular lymphoma data. Stat Med 31:1162–1176
Miloslavsky M, Keles S, Van der Laan MJ, Butler S (2004) Recurrent events analysis in the presence of time-dependent covariates and dependent censoring. J R Stat Soc B 66:239–257
Murphy SA (1994) Consistency in a proportional hazards model incorporating a random effect. Ann Stat 22:712–731
Murphy SA (1995) Asymptotic theory for the frailty model. Ann Stat 23:182–198
Nielsen GG, Gill RD, Andersen PK, Sorensen TIA (1992) A counting process approach to maximum likelihood estimation in frailty models. Scand J Stat 19:25–43
Pan Q, Schaubel DE (2009) Flexible estimation of differences in treatment-specific recurrent event means in the presence of a terminating event. Biometrics 65:753–761
Pepe MS, Cai J (1993) Some graphical displays and marginal regression analysis for recurrent failure times and time-dependent covariates. J Am Stat Assoc 88:811–820
Prentice RL, Williams BJ, Peterson AV (1981) On the regression analysis of multivariate failure time data. Biometrika 68:373–379
Ross SM (1989) Introduction to probability models. Academic Press, New York
Schaubel DE, Zeng D, Cai J (2006) A semiparametric additive rates model for recurrent event data. Lifetime Data Anal 12:389–406
Sun L, Su B (2008) A class of accelerated means regression models for recurrent event data. Lifetime Data Anal 14:357–375
Sun L, Tong X, Zhou X (2011) A class of Box–Cox transformation models for recurrent event data. Lifetime Data Anal 17:280–301
Sun L, Zhao X, Zhou J (2012) A class of mixed models for recurrent event data. Can J Stat (in press)
van der Vaart AW, Wellner JA (1996) Weak convergence and empirical processes. Springer, New York
Wang MC, Qin J, Chiang CT (2001) Analyzing recurrent event data with informative censoring. J Am Stat Assoc 96:1057–1065
Wei LJ, Lin DY, Weissfeld L (1989) Regression analysis of multivariate incomplete failure time data by modeling marginal distributions. J Am Stat Assoc 84:1065–1073
Ye Y, Kalbfleisch JD, Schaubel DE (2007) Semiparametric analysis of correlated recurrent and terminal events. Biometrics 63:78–87
Zeng D, Cai J (2010) Semiparametric additive rate model for recurrent events with informative terminal event. Biometrika 97:699–712
Zeng D, Lin DY (2006) Efficient estimation of semiparametric transformation models for counting processes. Biometrika 93:627–640
Zeng D, Lin DY (2007) Semiparametric transformation models with random effects for recurrent events. J Am Stat Assoc 102:167–180
Zeng D, Lin DY (2009) Semiparametric transformation models with random effects for joint analysis of recurrent and terminal events. Biometrics 65:746–752
Acknowledgments
We are deeply indebted to three anonymous referees for their considerable time and effort in reviewing this work. Their conscientiousness and expertise led to numerous significant improvements of the manuscript. We would also like to thank Dr. Norbert Henze, Editor for Metrika, for his responsiveness, efficiency, and excellence in the management of the editorial process.
Author information
Authors and Affiliations
Corresponding author
Additional information
This work represents portions of Xiaoyu Che’s 2013 Ph.D. thesis at the Institute of Mathematical Sciences, Claremont Graduate University.
Appendix: Proofs of asymptotic results
Appendix: Proofs of asymptotic results
In this section, we will use the same notation defined above in Sect. 2.1, and all limits are taken as \(n\rightarrow \infty \). Let \(V_{i}=m_{i}\{\Lambda _{0}(Y_{i})e^{\gamma _{0}'X_{i}}\}^{-1}\) and \(\Omega _{i}=m_{i}(m_{i}-1)\{\Lambda _{0}(Y_{i})e^{\gamma _{0}'X_{i}}\}^{-2}\). Recall that \(\left( N_{i}(\cdot ),Y_{i},\delta _{i},X_{i},D_{i},C_{i},v_{i}\right) ,i=1,2,\ldots \) are assumed to be independent and identically distributed copies of \(\left( N(\cdot ),Y,\delta ,X,D,C,v\right) \), each i corresponding to a subject sampled at random.
In order to study the asymptotic distributions of \(\widehat{\eta }\) and \(\widehat{\theta },\) we need the following regularity conditions:
-
(R1)
\(P(Y\ge \tau ,v>0)>0,\) \(P(Y>\tau _{\varepsilon })=1,\) where \(\tau _{\varepsilon }=\inf \{t:\Lambda _{0}(t)>\varepsilon \}\) for some \(\varepsilon >0\), and \(E\{N(\tau )^{2}\}<\infty .\)
-
(R2)
\(G(t)=E\{vI(Y\ge t)\exp (\gamma _{0}'X)\}\) is a continuous function for \(t\in [0,\tau ].\)
-
(R3)
The weight function Q(t) has bounded variation and converges to a deterministic function q(t) in probability uniformly in \(t\in [0,\tau ];\)
-
(R4)
A is non singular and \(\Sigma =E(\xi _{i}\xi _{i}')\) exists.
\(A=\left( \begin{array}{cc} A_{11} &{} \quad A_{12}\\ A_{12}' &{} \quad A_{22} \end{array}\right) \),
where \(\Delta (t)=I(t\le Y)\), \(\xi _{i}\) is defined in Lemma 5.2 below, and \(\bar{x}(t)\) and \(\bar{v}(t)\) are the limits (in probability) respectively, of \(\bar{X}(t)\) and \(\bar{V}(t)\), which are well defined by regularity condition R1.
Define \(R(t)=G(t)\Lambda _{0}(t)\),
and
where \(T_{ij}\) denotes the occurrence time of the jth event of the ith subject, and \(P_{1}(x^{*},y,m)\) is the joint probability measure of \((X_{i}^{*},Y_{i},m_{i}).\) Note that \(X_{i}\) and \(X_{i}^{*}\) share the same random variables, hereafter we use \(P_{1}(x,y,m)\) to denote the joint probability measure of \((X_{i},Y_{i},m_{i}).\) Let \(\phi _{1i}\) denote the vector \(D_{1}^{-1}e_{i}\) without the first entry and \(\phi _{2i}\) denote the first entry of \(D_{1}^{-1}e_{i}\). Set \(\varphi _{i}(t)=\kappa _{i}(t)+\phi _{2i}\), and \(b_{i}(y,x)=\varphi _{i}(y)+\phi _{1i}'x\).
Under conditions (R1) and (R2), it follows from Wang et al. (2001) that
and
Lemma 5.1
(Lemma A.1 of Lin and Ying 2001) Let \(H_{n}(t)\) and \(M_{n}(t)\) be two sequences of bounded processes. Suppose that \(H_{n}(t)\) is monotone and converges uniformly to H(t) in probability (i.e., \(\sup _{t}|H_{n}(t)-H(t)|\) converges to 0 in probability) and that \(M_{n}(t)\) converges weakly to a zero-mean process with continuous sample paths. Then \(\int _{0}^{t}\{H_{n}(s)-H(s)\}dM_{n}(s)\) converges in probability to 0 uniformly in t.
Lemma 5.2
Under the joint models specified in Sect. 2 and condition (R4), \(n^{-1/2}\sum _{i=1}^{n}\xi _{i}\) has an asymptotically normal distribution with mean zero and covariance matrix \(\Sigma =E(\xi _{i}\xi _{i}')\), where \(\xi _{i}=(\xi _{1i}',\xi _{2i}')'\) with
and
where \(dM_{i}^{D}(t)=dN_{i}^{D}(t)-\Delta _{i}(t)\{\eta _{0}'X_{i}+\theta _{0}V_{i}+\alpha _{0}(t)\}dt\), and \(P_{1}(x,y,m)\) and \(P_{2}(x,y,m,\delta )\) denote the joint probability measures of \((X_{i},Y_{i},m_{i})\) and \((X_{i},Y_{i},m_{i},\delta _{i})\), respectively.
Proof
Note that under the joint models specified in Sect. 2.1, \(\xi _{i},i=1,\ldots ,n\) are independent and identically distributed. Recalling the Central Limit Theorem and condition (R4), it suffices to verify that \(E(\xi _{i})=0\). By the definition of \(dM_{i}^{D}(t)\), under model (2) and the Poisson model (1), it has mean zero given the covariates. Hence, the first term of \(\xi _{1i}\) has mean zero. In addition, for fixed x and y, under model (1), we have
By the Fubini theorem, we exchange the order of expectation, and can verify that the second term of \(\xi _{1i}\) has mean zero. So, we have verified that \(E(\xi _{1i})=0\). Similarly, we can verify \(E(\xi _{2i})=0\), which completes the proof.
Lemma 5.3
Under the regularity conditions (R1)-(R3), we have
and
Proof
By using the Taylor expansion theorem and (7), (8), we have for (9)
where \(o_{p}(.)\) is independent of i since (7) holds uniformly in t (Huang and Wang 2004). Similarly, we can show that (10) holds.
Lemma 5.4
Under the regularity conditions (R1)-(R3),we have
and
Proof
Straightforward calculations give
To obtain (11), it suffices to show that the third term is \(o_{p}(1)\). We first show that
To see this, let \(M^{n}(t)=\int _{0}^{t}{-1/2}\sum _{i=1}^{n}dM_{i}^{D}(t)\). Then, by the functional Central Limit Theorem(Vaart and Wellner 1996), under conditions(R1)-(R3), \(M^{n}(t)\) converges weakly to a Gaussian process G with continuous paths. By the Skorohod embedding theorem, there exists a sequence \(\tilde{M}^{n}(t)\) which is equal to \(M^{n}(t)\) in law and converges uniformly to a Gaussian process \(\tilde{G}\) with continuous paths. In addition, by the uniform law of large numbers, \(Q(t)\{\bar{X}(t)-\bar{x}(t)\}\) converges uniformly to 0 almost surely. Then, integrating by parts and applying Lemma 5.1, we have that
which implies that
Hence, (13) is obtained. Applying the expansion (9), we have
By the uniform law of large numbers, we have that \(\bar{X}(t)-\bar{x}(t)=o_{p}(1)\), \(n^{-1}\sum _{i=1}^{n}m_{i}\{\Lambda _{0}(Y_{i})e^{\gamma _{0}'X_{i}}\}^{-1}\Delta _{i}(t)=O_{p}(1)\) and \(n^{-1}\sum _{i=1}^{n}m_{i}\{\Lambda _{0}(Y_{i})e^{\gamma _{0}'X_{i}}\}^{-1}X_{i}\Delta _{i}(t)=O_{p}(1)\) uniformly in t. By the functional form of the Central Limit Theorem, we have \(n^{-1/2}\sum _{j=1}^{n}\varphi _{j}(t)=O_{p}(1)\) uniformly in t. \(n^{-1/2}\sum _{j=1}^{n}\phi _{1j}=O_{p}(1)\) follows by the Central Limit Theorem. Note that Q(t) converges uniformly to q(t) by assumption (R3), so applying Lemma 5.1 again we obtain
which completes the proof of (11). In a similar way, we can show that (12) also holds.
Lemma 5.5
Under the regularity conditions (R1)-(R3), \(n^{-1/2}U(\eta _{0},\theta _{0})\) has asymptotically a normal distribution with mean zero and covariance matrix \(\Sigma =E(\xi _{i}\xi _{i}')\), where \(U(\eta ,\theta )=(U_{1}(\eta ,\theta )',U_{2}(\eta ,\theta )')'\) and \(\xi _{i}\) is defined in Lemma 5.2.
Proof
Combing the results of Lemmas 5.3 and 5.4, with justifications similar to those in the proof of (11), we obtain that
Hence, the proof can be completed by applying Lemma 5.2.
Lemma 5.6
Under the regularity conditions (R1)–(R4),
Proof
Note that \(-n^{-1}\partial {U}(\eta _{0},\theta _{0})/\partial (\eta ',\theta ')=\widehat{A}\) is independent of \(\eta _{0}\) and \(\theta _{0}\) and converges in probability to A, which is defined in condition (R4). Straightforward calculation gives
Therefore, based on the consistency of \(\widehat{\eta }\) and \(\widehat{\theta }\) proved in Sect. 2.1, we complete the proof.
Proof of Theorem 2.1
Proof
Combining Lemmas 5.5 and 5.6, we have, \(\widehat{\eta }\) and \(\widehat{\theta }\) converge in probability to \(\eta _{0}\) and \({\theta }_{0}\), respectively. In addition \(n^{1/2}(\widehat{\eta }-\eta _{0})\) and \(n^{1/2}(\widehat{\theta }-\theta _{0})\) have asymptotically a joint normal distribution with mean zero and covariance matrix \(A^{-1}\Sigma A^{-1}.\)
Rights and permissions
About this article
Cite this article
Che, X., Angus, J. A new joint model of recurrent event data with the additive hazards model for the terminal event time. Metrika 79, 763–787 (2016). https://doi.org/10.1007/s00184-016-0577-9
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00184-016-0577-9