Abstract
We develop a novel modeling strategy for analyzing data with repeated binary responses over time as well as time-dependent missing covariates. We assume that covariates are missing at random (MAR). We use the generalized linear mixed logistic regression model for the repeated binary responses and then propose a joint model for time-dependent missing covariates using information from different sources. A Monte Carlo EM algorithm is developed for computing the maximum likelihood estimates. We propose an extended version of the AIC criterion to identify the important factors that m a y explain the binary responses. A real plant dataset is used to motivate and illustrate the proposed methodology.
Similar content being viewed by others
References
Akaike, H. (1973), “Information Theory and an Extension of the Maximum Likelihood Principle,” in International Symposium on Information Theory, eds. B. N. Petrov and F. Csaki, Budapest: Akademia Kiado, pp. 267–281.
Booth, J. G., and Hobert, J. P. (1999), “Maximizing Generalized Linear Mixed Model Likelihoods with an Automated Monte Carlo E M Algorithm,” Journal of the Royal Statistical Society, Ser. B, 61, 265–285.
Breslow, N. E., and Clayton, D. G. (1993), “Approximate Inference in Generalized Linear Mixed Models,” Journal of the American Statistical Association, 88, 9–25.
Chen, M.-H., and Shao, Q.-M. (2001), “Propriety of Posterior Distribution for Dichotomous Quantal Response Models with General Link Functions,” Proceedings of the American Mathematical Society, 129, 293–302.
Chen, M.-H., Shao, Q.-M., and Xu, D. (2002), “Suffcient and Necessary Conditions on the Propriety of Posterior Distributions for Generalized Linear Mixed Models,” Sankhyā, Series A, Pt. 1, 64, 57–85.
Diggle, P. J., Heagerty, P. J., Liang, K.-Y., and Zeger, S. L. (2002), Analysis of Longitudinal Data (2nd ed.), Oxford Science Publications, Oxford: Clarendon Press.
Gilks, W. R., and Wild, P. (1992), “Adaptive Rejection Sampling for Gibbs Sampling,” Applied Statistics, 41, 337–348.
Harper, J. L. (1977), Population Biology of Plants, London: Academic Press.
Horton, N. J., and Laird, N. M. (1999), “Maximum Likelihood Analysis of Generalized Linear Models with Missing Covariates,” Statistical Methods in Medical Research, 8, 37–50.
Ibrahim, J. G., Chen, M.-H., and Lipsitz, S.R. (2002), “Bayesian Methods for Generalized Linear Models With Missing Covariates,” The Canadian Journal of Statistics, 30, 55–78.
Ibrahim, J. G., Chen, M.-H., Lipsitz, S. R., and Herring, A. H. (2005), “Missing Data Methods in Regression Models,” Journal of the American Statistical Association, 100, 332–346.
Kenward, M. G. (1998), “Selection Models for Repeated Measurements with Nonrandom Dropout: An Illustration of Sensitivity,” Statistics in Medicine, 17, 2723–2732.
Lindstrom, J., Ranta, E., Kokko, H., Lundberg, P., and Kaitala, V. (2001), “From Arctic Lemmings to Adaptive Dynamics: Charles Elton’s Legacy in Population Ecology,” Biological Reviews, 76, 129–158.
Lipsitz, S. R., and Ibrahim, J. G. (1996), “A Conditional Model for Incomplete Covariates in Parametric Regression Models,” Biometrika, 83, 916–922.
Little, R. J. A. (1992), “Regression with Missing X’s: A Review,” Journal of the American Statistical Association, 87, 1227–1237.
Little, R. J. A., and Rubin, D. B. (2002), Statistical Analysis With Missing Data (2nd ed.), New York: Wiley.
Louis, T. (1982), “Finding the Observed Information Matrix when Using the E M Algorithm,” Journal of the Royal Statistical Society, Ser. B, 44, 226–233.
McCullagh, P., and Nelder, J. A. (1989), Generalized Linear Models (2nd ed.), London: Chapman and Hall.
Molenberghs, G., and Verbeke, G. (2005), Models for Discrete Longitudinal Data, New York: Springer.
Monselise, S. P., and Goldschmidt, E. E. (1982), “Alternative Bearing in Fruit Trees,” Horticultural Reviews, 4, 128–173.
O’Neill, R. (1971), “Algorithm AS47-function Minimization Using a Simplex Procedure,” Applied Statistics, 20, 338–345.
Rubin, D. B. (1976), “Inference and Missing Data,” Biometrika, 63, 581–592.
Stubbendick, A. L. (2003), “Likelihood-Based Inference for Longitudinal Models with Nonignorable Missing Response and Covariate Data,” Unpublished Ph.D. Dissertation, The Harvard School of Public Health, Boston, Massachusetts.
Verbeke, G., Lesaffre, E., and Spiessens, B. (2001), “The Practical Use of Different Strategies to Handle Dropout in Longitudinal Studies,” Drug Information Journal, 35, 419–434.
Verbeke, G., and Molenberghs, G. (2000), Linear Mixed Models for Longitudinal Data, New York: Springer.
Williams, K. S., and Simon, C. (1995), “The Ecology, Behavior, and Evolution of Periodical Cicadas,” Annual Review of Entomology, 40, 269–295.
Wolfnger, R., and O’Connell, M. (1993), “Generalized Linear Mixed Models: A Pseudo-Likelihood Approach,” Journal of Statistical Computation and Simulation, 48, 233–243.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Huang, L., Chen, MH., Yu, F. et al. On modeling repeated binary responses and time-dependent missing covariates. JABES 13, 270–293 (2008). https://doi.org/10.1198/108571108X338023
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1198/108571108X338023