Abstract
In small area estimation, linear mixed models are frequently used. Variable selection methods for linear mixed models are available. However, in many applications such as small area estimation data users often apply variable selection methods that ignore the random effects. In this paper, we first evaluate the accuracy of such variable selection method for the Fay-Herriot model, a regression model when dependent variable is subject to sampling error variability. We show that the approximation error, that is, the difference between the standard variable selection criterion and the corresponding ideal variable selection criterion without any sampling error variability, does not converge to zero in probability even for a large sample size. In our simulation, we notice that standard variable selection criterion could severely underestimate the ideal adjusted R 2 and BIC variable selection criteria in presence of high sampling error variability. We propose a simple adjustment to the standard variable selection method for the Fay-Herriot model that reduces the approximation errors. In particular, we show that the approximation error for our new variable selection criteria converge to zero in probability for large sample size. Using a Monte Carlo simulation, we demonstrate that our proposed variable selection criterion tracks the corresponding ideal variable selection criterion very well compared to the standard variable selection method.
Similar content being viewed by others
References
Bell, W.R. (1999). Accounting for uncertainty about variances in small area estimation. Bulletin of the International Statistical Institute. 52. available at www.census.gov/hhes/www/saipe under “Publications”.
Carter, G.M. and Rolph, J.E. (1974). Empirical Bayes methods applied to estimating fire alarm probabilities. J. Am. Stat. Assoc. 69, 880–885.
Chatterjee, S. and Lahiri, P. (2007). A simple computational method for estimating mean squared prediction error in general small-area model. JSM Proceedings, Section on Survey Research Methods, American Statistical Association 3486–3493.
Claeskens, G. and Hjort, N.L. (2008). Model Selection and Model Averaging. Cambridge University Press.
Efron, B. and Morris, C. (1975). Data analysis using Steins estimator and its generalizations. J. Am. Stat. Assoc. 70, 311–319.
Fay, R.E. and Herriot, R.A. (1979). Estimates of income for small places: an application of James-Stein procedure to census data. J. Am. Stat. Assoc. 74, 269–277.
Jiang, J., Rao J.S., Gu, Z. and Nguyen, T. (2008). Fence methods for mixed model selection. Ann. Stat. 36, 1669–1692.
Kutner, M.H., Nachtsheim, C.J. and Neter, J. (2004). Applied Linear Regression Models. McGraw-Hill/Irwin Series Operations and Decision Sciences.
Pfeffermann, D (2002). Small area estimation - new developments and directions. Int. Statist. Rev. 70, 125–143.
Rao, J.N.K. (2003). Small Area E stimation. Wiley, New York.
Rao, C.R. and Wu, Y. (2001) On model Selection. In Model Selection, (P. Lahiri, eds.). Institute of Mathematical Statistics Lecture Notes-Monograph Series, 38.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Lahiri, P., Suntornchost, J. Variable Selection for Linear Mixed Models with Applications in Small Area Estimation. Sankhya B 77, 312–320 (2015). https://doi.org/10.1007/s13571-015-0096-0
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13571-015-0096-0