Abstract
This paper aims at investigating model checking for parametric models with response missing at random which is a more general missing mechanism than missing completely at random. Different from existing approaches, two tests have normal distributions as the limiting null distributions no matter whether the inverse probability weight is estimated parametrically or nonparametrically. Thus, \(p\) values can be easily determined. This observation shows that slow convergence rate of nonparametric estimation does not have significant effect on the asymptotic behaviors of the tests although it may have impact in finite sample scenarios. The tests can detect the alternatives distinct from the null hypothesis at a nonparametric rate which is an optimal rate for locally smoothing-based methods in this area. Simulation study is carried out to examine the performance of the tests. The tests are also applied to analyze a data set on monozygotic twins for illustration.
Similar content being viewed by others
References
Aerts, M., Claeskens, G., Hart, J. D. (1999). Testing lack of fit in multiple regression. Journal of the American Statistical Association, 94, 869–879.
Chown, J., Müller, U. U. (2013). Efficiently estimating the error distribution in nonparametric regression with responses missing at random. Journal of Nonparametric Statistics, 25, 665–677.
Dette, H. (1999). A consistent test for the functional form of a regression based on a difference of variance estimators. Annals of Statistics, 27, 1012–1050.
Dette, H. (2002). A consistent test for heteroscedasticity in nonparametric regression based on the kernel method. Journal of Statistical Planning and Inference, 103, 311–329.
Dette, H., Hildebrandt, T. (2012). A note on testing hypotheses for stationary processes in the frequency domain. Journal of Multivariate Analysis, 104, 101–114.
Dette, H., Spreckelsen, I. (2003). A note on a specification test for time series models based on spectral density estimation. Scandinavian Journal of Statistics, 30, 481–491.
Dette, H., Spreckelsen, I. (2004). Some comments on specification tests in nonparametric absolutely regular processes. Journal of Time Series Analysis, 25, 159–172.
Dette, H., von Lieres und Wilkau, C. (2001). Testing additivity by kernel-based methods-what is a reasonable test? Bernoulli, 7, 669–697.
Eubank, R. L., Li, C. S., Wang, S. J. (2005). Testing lack-of-fit of parametric regression models using nonparametric regression techniques. Statistica Sinica, 15, 135–152.
Fan, J., Huang, L. (2001). Goodness-of-fit tests for parametric regression models. Journal of the American Statistical Association, 96, 640–652.
Gao, J., Wang, Q., Yin, J. (2011). Specification testing in nonlinear time series with long-range dependence. Econometric Theory, 27, 260–284.
González-Manteiga, W., Crujeiras, R. M. (2013). An updated review of Goodness-of-Fit tests for regression models. Test, 22, 361–411.
González-Manteiga, W., Pérez-González, A. (2006). Goodness-of-fit tests for linear regression models with missing response data. Canadian Journal of Statistics, 34, 149–170.
Guo, X., Xu, W. L. (2012). Goodness-of-fit tests for general linear models with covariates missed at random. Journal of Statistical Planning and Inference, 142, 2047–2058.
Hall, P. (1984). Central limit theorem for integrated square error of multivariate nonparametric density estimators. Journal of Multivariate Analysis, 14, 1–16.
Härdle, W., Mammen, E. (1993). Comparing nonparametric versus parametric regression fits. Annals of Statistics, 21, 1926–1947.
Koul, H. L., Ni, P. P. (2004). Minimum distance regression model checking. Journal of Statistical Planning and Inference, 119, 109–141.
Koul, H. L., Müller, U. U., Schick, A. (2012). The transfer principle: a tool for complete case analysis. Annals of Statistics, 40, 3031–3049.
Lavergne, P., Vuong, Q. H. (2000). Nonparametric significance testing. Econometric Theory, 16, 576–601.
Li, X. Y. (2012). Lack-of-fit testing of regression model with response missing at random. Journal of Statistical Planning and Inference, 142, 155–170.
Little, R. J. A., Rubin, D. B. (1987). Statistical analysis with missing data. New York: Wiley.
Lopez, O., Patilea, V. (2009). Nonparametric lack-of-fit tests for parametric mean-regression models with censored data. Journal of Multivariate Analysis, 100, 210–230.
Müller, U. U., Van Keilegom, I. (2012). Efficient parameter estimation in regression with missing responses. Electronic Journal of Statistics, 6, 1200–1219.
Sperlich, S. (2014). On the choice of regularization parameters in specification testing: a critical discussion. Empirical Economics. doi:10.1007/s00181-013-0752-z.
Stute, W., Zhu, L. X. (2002). Model checks for generalized linear models. Scandinavian Journal of Statistics, 29, 535–546.
Stute, W., Gonzáles-Manteiga, W., Presedo-Quindimil, M. (1998a). Bootstrap approximation in model checks for regression. Journal of American Statistical Association, 93, 141–149.
Stute, W., Thies, S., Zhu, L. X. (1998b). Model checks for regression: An innovation process approach. Annals of Statistics, 26, 1916–1934.
Stute, W., Xu, W. L., Zhu, L. X. (2008). Model diagnosis for parametric regression in high-dimensional spaces. Biometrika, 95, 451–467.
Sun, Z. H., Wang, Q. H. (2009). Checking the adequacy of a general linear model with responses missing at random. Journal of Statistical Planning and Inference, 139, 3588–3604.
Sun, Z., Wang, Q., Dai, P. (2009). Model checking for partially linear models with missing responses at random. Journal of Multivariate Analysis, 100, 636–651.
Van Keilegom, I., Gonzáles-Manteiga, W., Sánchez Sellero, C. (2008). Goodness-of-fit tests in parametric regression based on the estimation of the error distribution. Test, 17, 401–415.
White, H. (1981). Consequences and detection of misspecified nonlinear regression models. Journal of the American Statistical Association, 76, 419–433.
Xu, W. L., Guo, X., Zhu, L. X. (2012). Goodness-of-fitting for partial linear model with missing response at random. Journal of Nonparametric Statistics, 24, 103–118.
Xue, L. G. (2009). Empirical likelihood for linear models with missing responses. Journal of Multivariate Analysis, 100, 1353–1366.
Zheng, J. X. (1996). A consistent test of functional form via nonparametric estimation techniques. Journal of Econometrics, 75, 263–289.
Zhu, L. X., Ng, K. W. (2003). Checking the adequacy of a partial linear model. Statistica Sinica, 13, 763–781.
Acknowledgments
The authors thank the editor, the associate editor and two referees for their constructive comments and suggestions which led a substantial improvement of an early manuscript. The research described here was supported by a grant from the University Grants Council of Hong Kong, Hong Kong and National Natural Science Foundation of China (No. 11071253).
Author information
Authors and Affiliations
Corresponding author
About this article
Cite this article
Guo, X., Xu, W. & Zhu, L. Model checking for parametric regressions with response missing at random. Ann Inst Stat Math 67, 229–259 (2015). https://doi.org/10.1007/s10463-014-0451-3
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10463-014-0451-3