Abstract
By evolving science, knowledge, and technology, new and precise methods for measuring, collecting, and recording information have been innovated, which have been resulted in the appearance and developing of high-dimensional data, in which the number of explanatory variables is much larger than the number of observations. Analysis and modeling the high-dimensional data is one of the most challenging problems faced by the world today. Interpreting such data is not easy and needs to use the modern methods. Penalized methods are one of the most popular ways to analyze the high-dimensional data. Semiparametric models, which a combination of both parametric and nonparametric models, are very flexible models. They are useful when the model contains both parametric and nonparametric elements in the data set. As known, the LASSO approach is a popular technique for variable selection in high-dimensional sparse regression models. Here, we show that the prediction performance of the LASSO method can be improved by eliminating the structured noises. The main purpose of this research is to introduce a modified variable selection or estimation method for a high-dimensional semiparametric regression model through a nonlinear mixed-integer programming technique. Finally, the performance of the proposed method is examined through a real-data analysis about the production of vitamin B2 and some Monte-Carlo simulation studies.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Akdeniz, F., & Roozbeh, M. (2017). Efficiency of the generalized difference-based weighted mixed almost unbiased two parameter estimator in Partially Linear Model. Communications in Statistics - Theory & Methods, 46, 12259–12280.
Akdeniz, F., & Roozbeh, M. (2019). Generalized difference-based weighted mixed almost unbiased ridge estimator in partially linear models. Statistical Papers, 60, 1717–1739.
Akdeniz Duran, E., Akdeniz, F., & Roozbeh, M. (2018). A new difference-based weighted mixed Liu estimator in partially linear models. Statistics: A Journal of Theoretical and Applied Statistics, 52, 1309–1327.
Amini, M., & Roozbeh, M. (2019). Improving the prediction performance of the LASSO by subtracting the additive structural noises. Computational Statistics, 34, 415–432.
Arashi, M., & Roozbeh, M. (2019). Some improved estimation strategies in high-dimensional semiparametric regression models with application to riboflavin production data. Statistical Papers, 60, 317–336.
Arashi, M. (2012). Preliminary test and Stein estimators in simultaneous linear equations. Linear Algebra and its Applications, 436(5), 1195–1211.
Arashi, M., & Tabatabaey, S. M. M. (2009). Improved variance estimation under sub-space restriction. Journal of Multivariate Analysis, 100, 1752–1760.
Bertsimas, D., & Tsitsiklis, J. N. (1997). Introduction to linear optimization. Massachusetts: Athena Scientific.
Bühlmann, P., Kalisch, M., & Meier, L. (2014). High-dimensional statistics with a view towards applications in biology. Annual Review of Statistics and Its Application, 1, 255–278.
Efron, B., & Hastie, T. (2017). Computer age statistical inference. Cambridge: Cambridge University Press.
Gibbons, D. G. (1981). A simulation study of some ridge estimators. Journal of the American Statistical Association, 76, 131–139.
Hall, P., Kay, J. W., & Titterington, D. M. (1990). On estimation of noise variance in two-dimensional signal processing. Advances in Applied Probability, 23, 476–495.
Härdle, W., Liang, H., & Gao, J. (2000). Partially linear models. Heidelberg: Physika Verlag.
Hoerl, A. E., & Kennard, R. W. (1970). Ridge regression: Biased estimation for non-orthogonal problems. Technometrics, 12, 55–67.
McDonald, G. C., & Galarneau, D. I. (1975). A monte carlo evaluation of some ridge-type estimators. Journal of the American Statistical Association, 70, 407–416.
Roozbeh, M. (2015). Shrinkage ridge estimators in semiparametric regression models. Journal of Multivariate Analysis, 136, 56–74.
Roozbeh, M. (2018). Optimal QR-based estimation in partially linear regression models with correlated errors using GCV criterion. Computational Statistics & Data Analysis, 117, 45–61.
Roozbeh, M., Maanavi, M., & Babaie-Kafaki, S. (2020). Robust high-dimensional semiparametric regression using optimized differencing method applied to vitamin B2 production data. Iranian Journal of Health Sciences, 8, 9–22.
Roozbeh, M., Babaie-Kafaki, S., & Aminifard, Z. (2021). A nonlinear mixed-integer programming approach for variable selection in linear regression model. Communications in Statistics - Simulation & Computation. https://doi.org/10.1080/03610918.2021.1990323
Roozbeh, M., Babaie-Kafaki, S., & Maanavi, M. (2021). A heuristic algorithm to combat outliers and multicollinearity in regression model analysis. Iranian Journal of Numerical Analysis and Optimization. https://doi.org/10.22067/IJNAO.2021.68160.1008.
Saleh, A. KMd. E. (2006). Theory of preliminary test and stein-type estimation with applications. New York: Wiley.
Speckman, P. (1988). Kernel somoothing in partial linear models. Journal of the Royal Statistical Society, Series B, 50, 413–436.
Tibshirani, R. (1996). Regression shrinkage and selection via the Lasso. Journal of the Royal Statistical Society, Series B, 58, 267–288.
Wang, L., Brown, L. D., & Cai, T. T. (2011). A difference based approach to semiparametric partial linear model. Electronic Journal of Statistics, 5, 619–641.
Yatchew, A. (1997). An elemantary estimator of the partial linear model. Economics Letters, 57, 135–143.
Yatchew, A. (2003). Semiparametric regression for the applied econometrican. Cambridge: Cambridge University Press.
Yüzbaşi, B., & Arashi, M. (2019). Double shrunken selection operator. Communications in Statistics - Simulation & Computation, 48, 666–674.
Yüzbaşi, B., Arashi, M., & Ahmed, S. E. (2020). Shrinkage estimation strategies in generalized ridge regression models under low/high-dimension regime. International Statistical Review, 88, 229–251.
Yüzbaşi, B., Arashi, M., & Akdeniz, F. (2021). Penalized regression via the restricted bridge estimator. Soft Computing, 25, 8401–8416.
Acknowledgements
The author thank the anonymous Reviewers and the Associate Editor for their valuable comments and suggestions that helped to improve the quality of this work.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Roozbeh, M. (2022). High-Dimensional Nonlinear Optimization Problem in Semiparametric Regression Model. In: Bekker, A., Ferreira, J.T., Arashi, M., Chen, DG. (eds) Innovations in Multivariate Statistical Modeling. Emerging Topics in Statistics and Biostatistics . Springer, Cham. https://doi.org/10.1007/978-3-031-13971-0_16
Download citation
DOI: https://doi.org/10.1007/978-3-031-13971-0_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-13970-3
Online ISBN: 978-3-031-13971-0
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)