Model checking for parametric regressions with response missing at random

Guo, Xu; Xu, Wangli; Zhu, Lixing

doi:10.1007/s10463-014-0451-3

Model checking for parametric regressions with response missing at random

Published: 27 March 2014

Volume 67, pages 229–259, (2015)
Cite this article

Annals of the Institute of Statistical Mathematics Aims and scope Submit manuscript

Xu Guo^1,2,
Wangli Xu³ &
Lixing Zhu⁴

471 Accesses
9 Citations
Explore all metrics

Abstract

This paper aims at investigating model checking for parametric models with response missing at random which is a more general missing mechanism than missing completely at random. Different from existing approaches, two tests have normal distributions as the limiting null distributions no matter whether the inverse probability weight is estimated parametrically or nonparametrically. Thus, \(p\) values can be easily determined. This observation shows that slow convergence rate of nonparametric estimation does not have significant effect on the asymptotic behaviors of the tests although it may have impact in finite sample scenarios. The tests can detect the alternatives distinct from the null hypothesis at a nonparametric rate which is an optimal rate for locally smoothing-based methods in this area. Simulation study is carried out to examine the performance of the tests. The tests are also applied to analyze a data set on monozygotic twins for illustration.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Empirical likelihood inference in general linear model with missing values in response and covariates by MNAR mechanism

Article 05 April 2019

Doubly robust estimation and robust empirical likelihood in generalized linear models with missing responses

Article 14 November 2023

Estimation in partially linear varying-coefficient errors-in-variables models with missing response variables

Article 14 February 2020

References

Aerts, M., Claeskens, G., Hart, J. D. (1999). Testing lack of fit in multiple regression. Journal of the American Statistical Association, 94, 869–879.
Google Scholar
Chown, J., Müller, U. U. (2013). Efficiently estimating the error distribution in nonparametric regression with responses missing at random. Journal of Nonparametric Statistics, 25, 665–677.
Google Scholar
Dette, H. (1999). A consistent test for the functional form of a regression based on a difference of variance estimators. Annals of Statistics, 27, 1012–1050.
Google Scholar
Dette, H. (2002). A consistent test for heteroscedasticity in nonparametric regression based on the kernel method. Journal of Statistical Planning and Inference, 103, 311–329.
Google Scholar
Dette, H., Hildebrandt, T. (2012). A note on testing hypotheses for stationary processes in the frequency domain. Journal of Multivariate Analysis, 104, 101–114.
Google Scholar
Dette, H., Spreckelsen, I. (2003). A note on a specification test for time series models based on spectral density estimation. Scandinavian Journal of Statistics, 30, 481–491.
Google Scholar
Dette, H., Spreckelsen, I. (2004). Some comments on specification tests in nonparametric absolutely regular processes. Journal of Time Series Analysis, 25, 159–172.
Google Scholar
Dette, H., von Lieres und Wilkau, C. (2001). Testing additivity by kernel-based methods-what is a reasonable test? Bernoulli, 7, 669–697.
Eubank, R. L., Li, C. S., Wang, S. J. (2005). Testing lack-of-fit of parametric regression models using nonparametric regression techniques. Statistica Sinica, 15, 135–152.
Google Scholar
Fan, J., Huang, L. (2001). Goodness-of-fit tests for parametric regression models. Journal of the American Statistical Association, 96, 640–652.
Google Scholar
Gao, J., Wang, Q., Yin, J. (2011). Specification testing in nonlinear time series with long-range dependence. Econometric Theory, 27, 260–284.
Google Scholar
González-Manteiga, W., Crujeiras, R. M. (2013). An updated review of Goodness-of-Fit tests for regression models. Test, 22, 361–411.
Google Scholar
González-Manteiga, W., Pérez-González, A. (2006). Goodness-of-fit tests for linear regression models with missing response data. Canadian Journal of Statistics, 34, 149–170.
Google Scholar
Guo, X., Xu, W. L. (2012). Goodness-of-fit tests for general linear models with covariates missed at random. Journal of Statistical Planning and Inference, 142, 2047–2058.
Google Scholar
Hall, P. (1984). Central limit theorem for integrated square error of multivariate nonparametric density estimators. Journal of Multivariate Analysis, 14, 1–16.
Google Scholar
Härdle, W., Mammen, E. (1993). Comparing nonparametric versus parametric regression fits. Annals of Statistics, 21, 1926–1947.
Google Scholar
Koul, H. L., Ni, P. P. (2004). Minimum distance regression model checking. Journal of Statistical Planning and Inference, 119, 109–141.
Google Scholar
Koul, H. L., Müller, U. U., Schick, A. (2012). The transfer principle: a tool for complete case analysis. Annals of Statistics, 40, 3031–3049.
Google Scholar
Lavergne, P., Vuong, Q. H. (2000). Nonparametric significance testing. Econometric Theory, 16, 576–601.
Google Scholar
Li, X. Y. (2012). Lack-of-fit testing of regression model with response missing at random. Journal of Statistical Planning and Inference, 142, 155–170.
Google Scholar
Little, R. J. A., Rubin, D. B. (1987). Statistical analysis with missing data. New York: Wiley.
Lopez, O., Patilea, V. (2009). Nonparametric lack-of-fit tests for parametric mean-regression models with censored data. Journal of Multivariate Analysis, 100, 210–230.
Google Scholar
Müller, U. U., Van Keilegom, I. (2012). Efficient parameter estimation in regression with missing responses. Electronic Journal of Statistics, 6, 1200–1219.
Google Scholar
Sperlich, S. (2014). On the choice of regularization parameters in specification testing: a critical discussion. Empirical Economics. doi:10.1007/s00181-013-0752-z.
Stute, W., Zhu, L. X. (2002). Model checks for generalized linear models. Scandinavian Journal of Statistics, 29, 535–546.
Google Scholar
Stute, W., Gonzáles-Manteiga, W., Presedo-Quindimil, M. (1998a). Bootstrap approximation in model checks for regression. Journal of American Statistical Association, 93, 141–149.
Google Scholar
Stute, W., Thies, S., Zhu, L. X. (1998b). Model checks for regression: An innovation process approach. Annals of Statistics, 26, 1916–1934.
Google Scholar
Stute, W., Xu, W. L., Zhu, L. X. (2008). Model diagnosis for parametric regression in high-dimensional spaces. Biometrika, 95, 451–467.
Google Scholar
Sun, Z. H., Wang, Q. H. (2009). Checking the adequacy of a general linear model with responses missing at random. Journal of Statistical Planning and Inference, 139, 3588–3604.
Google Scholar
Sun, Z., Wang, Q., Dai, P. (2009). Model checking for partially linear models with missing responses at random. Journal of Multivariate Analysis, 100, 636–651.
Google Scholar
Van Keilegom, I., Gonzáles-Manteiga, W., Sánchez Sellero, C. (2008). Goodness-of-fit tests in parametric regression based on the estimation of the error distribution. Test, 17, 401–415.
White, H. (1981). Consequences and detection of misspecified nonlinear regression models. Journal of the American Statistical Association, 76, 419–433.
Google Scholar
Xu, W. L., Guo, X., Zhu, L. X. (2012). Goodness-of-fitting for partial linear model with missing response at random. Journal of Nonparametric Statistics, 24, 103–118.
Google Scholar
Xue, L. G. (2009). Empirical likelihood for linear models with missing responses. Journal of Multivariate Analysis, 100, 1353–1366.
Google Scholar
Zheng, J. X. (1996). A consistent test of functional form via nonparametric estimation techniques. Journal of Econometrics, 75, 263–289.
Google Scholar
Zhu, L. X., Ng, K. W. (2003). Checking the adequacy of a partial linear model. Statistica Sinica, 13, 763–781.
Google Scholar

Download references

Acknowledgments

The authors thank the editor, the associate editor and two referees for their constructive comments and suggestions which led a substantial improvement of an early manuscript. The research described here was supported by a grant from the University Grants Council of Hong Kong, Hong Kong and National Natural Science Foundation of China (No. 11071253).

Author information

Authors and Affiliations

Department of Mathematics, Hong Kong Baptist University, Fong Shu-Chuen Library 1110, Kowloon Tong, Hong Kong
Xu Guo
College of Economics and Management, Nanjing University of Aeronautics and Astronautics, 29 Jiangjun Avenue, 210016, Nanjing, China
Xu Guo
School of Statistics, Center for Applied Statistics, Renmin University of China, Zhongguancun Street 59, 100872 , Beijing, China
Wangli Xu
Department of Mathematics, Hong Kong Baptist University, Fong Shu Chuen Library 1208 Waterloo Road 224, Kowloon Tong, Hong Kong
Lixing Zhu

Authors

Xu Guo
View author publications
You can also search for this author in PubMed Google Scholar
Wangli Xu
View author publications
You can also search for this author in PubMed Google Scholar
Lixing Zhu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Lixing Zhu.

About this article

Cite this article

Guo, X., Xu, W. & Zhu, L. Model checking for parametric regressions with response missing at random. Ann Inst Stat Math 67, 229–259 (2015). https://doi.org/10.1007/s10463-014-0451-3

Download citation

Received: 19 December 2012
Revised: 24 November 2013
Published: 27 March 2014
Issue Date: April 2015
DOI: https://doi.org/10.1007/s10463-014-0451-3

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Model checking for parametric regressions with response missing at random

Abstract

Access this article

Similar content being viewed by others

Empirical likelihood inference in general linear model with missing values in response and covariates by MNAR mechanism

Doubly robust estimation and robust empirical likelihood in generalized linear models with missing responses

Estimation in partially linear varying-coefficient errors-in-variables models with missing response variables

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

About this article

Cite this article

Keywords

Navigation

Model checking for parametric regressions with response missing at random

Abstract

Access this article

Similar content being viewed by others

Empirical likelihood inference in general linear model with missing values in response and covariates by MNAR mechanism

Doubly robust estimation and robust empirical likelihood in generalized linear models with missing responses

Estimation in partially linear varying-coefficient errors-in-variables models with missing response variables

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

About this article

Cite this article

Share this article

Keywords

Search

Navigation