Abstract
We propose methods for Bayesian inference for missing covariate data with a novel class of semi-parametric survival models with a cure fraction. We allow the missing covariates to be either categorical or continuous and specify a parametric distribution for the covariates that is written as a sequence of one dimensional conditional distributions. We assume that the missing covariates are missing at random (MAR) throughout. We propose an informative class of joint prior distributions for the regression coefficients and the parameters arising from the covariate distributions. The proposed class of priors are shown to be useful in recovering information on the missing covariates especially in situations where the missing data fraction is large. Properties of the proposed prior and resulting posterior distributions are examined. Also, model checking techniques are proposed for sensitivity analyses and for checking the goodness of fit of a particular model. Specifically, we extend the Conditional Predictive Ordinate (CPO) statistic to assess goodness of fit in the presence of missing covariate data. Computational techniques using the Gibbs sampler are implemented. A real data set involving a melanoma cancer clinical trial is examined to demonstrate the methodology.
Similar content being viewed by others
References
M.-H. Chen and J. G. Ibrahim, “Maximum likelihood methods for cure rate models with missing covariates,” Biometrics vol. 57 pp. 43–52, 2001.
M.-H. Chen, J. G. Ibrahim, and D. Sinha, “A new Bayesian model for survival data with a surviving fraction,” Journal of the American Statistical Association vol. 94 pp. 909–919, 1999.
M.-H. Chen and Q.-M. Shao, “Monte Carlo estimation of Bayesian credible and HPD intervals,” Journal of Computational and Graphical Statistics vol. 8 pp. 69–92, 1999a.
M.-H. Chen and Q.-M. Shao, “Propriety of posterior distribution for Dichotomous Quantal Response models with general link functions,” Proceedings of the American Mathematical Society, to appear, 1999b.
M.-H. Chen, Q.-M. Shao, and J. G. Ibrahim, Monte Carlo Methods in Bayesian Computation, Springer-Verlag: New York, 2000.
D. K. Dey, M.-H. Chen, and H. Chang, “Bayesian approach for nonlinear random effects models,” Biometrics vol. 53 pp. 1239- 1252, 1997.
S. Geisser, Predictive Inference: An Introduction, Chapman and Hall: London, 1993.
A. E. Gelfand, D. K. Dey, and H. Chang, “Model determination using predictive distributions with implementation via sampling based methods” (with discussion), in Bayesian Statistics, J. M. Bernardo, J. O. Berger, A. P. Dawid, and A. F. M. Smith (4 eds.), Oxford University Press: Oxford, U.K., pp. 147–167, 1992.
A. E. Gelfand and B. Mallick, “Bayesian analysis of proportional hazards models built from monotone functions,” Biometrics vol. 51 pp. 843–852, 1995.
W. R. Gilks and P. Wild, “Adaptive rejection sampling for Gibbs sampling,” Applied Statistics vol. 41 pp. 337–348, 1992.
J. G. Ibrahim and M.-H. Chen, “Power prior distributions for regression models,” Statistical Science vol. 15 pp. 46–60, 2000.
J. G. Ibrahim, M.-H. Chen, and D. Sinha. Bayesian Survival Analysis. New York: Springer-Verlag, 2001.
J. G. Ibrahim, S. R. Lipsitz, and M.-H. Chen, “Missing covariates in generalized linear models when the missing data mechanism is nonignorable,” Journal of the Royal Statistical Society, Series B vol. 61 pp. 173–190, 1999.
J. M. Kirkwood, J. G. Ibrahim, V. K. Sondak, J. Richards, L. E. Flaherty, M. S. Ernstoff, T. J. Smith, U. Rao, M. Steele, and R. H. Blum, “The role of high-and low-dose interferon alfa-2b in high-risk melanoma: First analysis of intergroup trial E1690/S9111/C9190,” submitted, 1999.
J. M. Kirkwood, M. H. Strawderman, M. S. Ernstoff, T. J. Smith, E. C. Borden, and R. H. Blum, “Interferon alfa-2b adjuvant therapy of high-risk resected cutaneous melanoma: The Eastern Cooperative Oncology group trial EST 1684,” Journal of Clinical Oncology vol. 14 pp. 7–17, 1996.
R. J. A. Little and D. B. Rubin, Statistical Analysis with Missing Data. New York: John-Wiley, 1987.
D. B. Rubin, “Inference and missing data,” Biometrika vol. 63 pp. 581–592, 1976.
D. Sinha and D. K. Dey, “Semiparametric Bayesian analysis of survival data,” Journal of the American Statistical Association vol. 92 pp. 1195–1212, 1997.
A. Tsodikov, “A proportional hazards model taking account of long-term survivors,” Biometrics vol. 54 pp. 1508–1516, 1998.
A. Y. Yakovlev, B. Asselain, V. J. Bardou, A. Fourquet, T. Hoang, A. Rochefediere, and A. D. Tsodikov, “A simple stochastic model of tumor recurrence and its applications to data on premenopausal breast cancer,” in Biometrie et Analyse de Donnees Spatio-Temporelles, # 12, B. Asselain, M. Boniface, C. Duby, C. Lopez, J. P. Masson, and J. Tranchefort (eds.), Rennes: France, pp. 66–82, 1993.
A. Y. Yakovlev and A. D. Tsodikov, Stochastic Models of Tumor Latency and Their Biostatistical Applications, World Scientific: New Jersey, 1996.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Chen, MH., Ibrahim, J.G. & Lipsitz, S.R. Bayesian Methods for Missing Covariates in Cure Rate Models. Lifetime Data Anal 8, 117–146 (2002). https://doi.org/10.1023/A:1014835522957
Issue Date:
DOI: https://doi.org/10.1023/A:1014835522957