Abstract
An ensemble, a model-independent technique based on combining several models for classification/regression tasks, allows us to achieve a high accuracy that is often not achievable with single models. Such combinations have gained increasing attention in many fields. This paper proposes the use of random subspace (RS)-based regression ensemble as an alternative method for near-infrared (NIR) spectroscopic calibration of tobacco samples. Because of the considerable reduction of variables in a random subspace, multiple linear regression (MLR) is used as the base algorithm and the method is therefore also referred to as RS-MLR. The overall performance of the proposed RS-MLR method is compared to those of partial least square regression (PLSR), kernel principal component regression (KPCR) and kernel partial least square regression (KPLSR). The results reveal that the RS-MLR method not only has a simple concept but also can produce a more parsimonious and more accurate calibration model than PLSR, KPCR and KPLSR, at a lower computational cost. Besides, we also found that the RS-MLR method is very appropriate for the so-called small sample problems and that the calibration models built by RS-MLR are less sensitive to overfitting.
Similar content being viewed by others
References
J. Sádecká and J. Polonský, J. Chromatogr., A, 2003, 988, 161.
J. M. Garrigues, A. Pérez-Ponce, S. Garrigues, and M. Guardia, Anal. Chim. Acta, 1998, 373, 63.
H. P. Xie, J. H. Jiang, Z. Q. Chen, G. L. Shen, and R. Q. Yu, Anal. Sci., 2006, 22, 1111.
X. G. Shao and Y. D. Zhuang, Anal. Sci., 2004, 20, 451.
C. Tan and M. L. Li, Anal. Sci., 2007, 23, 201.
J. Rantanen, H. Wikström, R. Turner, and L. S. Taylor, Anal. Chem., 2005, 77, 556.
W. R Li, H. Wang, T. X. Yang, and H. S. Zhang, Anal. Bioanal. Chem., 2003, 377, 350.
M. Casale, M. J. S. Abajo, J. M. G. Sáiz, C. Pizarro, and M. Forina, Anal. Chim. Acta, 2006, 557, 360.
Q. S. Chen, J. W. Zhao, X. Y. Fang, H. D. Zhang, and M. H. Liu, Microchem. J., 2006, 83, 42.
L. R. O. Jose, H. Maria, and C. Juan, J. Agric. Food Chem., 1997, 45, 2815.
J. H. Jiang, R. J. Berry, H. W. Siesier, and Y. Ozaki, Anal. Chem., 2002, 74, 3555.
Y. P. Du, Y. Z. Liang, J. H. Jiang, R. J. Berry, and Y. Ozaki, Anal. Chim. Acta, 2004, 501, 183.
R. Leardi, J. Chemometr., 2000, 14, 643.
V. Centner and D. L. Massart, Anal. Chem., 1996, 68, 3851.
A. Borin and R. J. Poppi, Vib. Spectrosc., 2005, 37, 27.
R. Leardi and L. Nørgaard, J. Chemometr., 2004, 18, 486.
Q. Ding and G. W. Small, Anal. Chem., 1998, 70, 4472.
R. Leardi, M. B. Seasholtz, and R. J. Pell, Anal. Chim. Acta, 2002, 461, 189.
F. Rossi, A. Lendasse, D. Francois, V. Wertz, and M. Verleysen, Chemom. Intell. Lab. Syst., 2006, 80, 215.
J. A. Hageman, M. Streppel, R. Wehrens, and L. M. C. Buydens, J. Chemometr., 2003, 17, 427.
K. Saeki, K. Funatsu, and K. Tanabe, Anal. Sci., 2003, 19, 309.
L. Xiang, G. Q. Fan, J. H. Li, H. Kang, Y. L. Yan, J. H. Zheng, and D. Guo, Phytochem. Anal., 2002, 13, 272.
X. G. Shao, F. Wang, D. Chen, and Q. D. Su, Anal. Bioanal. Chem., 2003, 19, 309.
C. E. W. Gributs and D. H. Burns, Chemom. Intell. Lab. Syst., 2006, 83, 44.
A. Borin, M. F. Ferrão, C. Mello, D. A. Maretto, and R. J. Poppi, Anal. Chim. Acta, 2006, 579, 25.
U. Thissen, B. Üstun, W. J. Melssen, and L. M. C. Buydens, Anal. Chem., 2004, 76, 3099.
K. Kim, J. M. Lee, and I. B. Lee, Chemom. Intell. Lab. Syst., 2005, 79, 22.
J. Cheng, Q. S. Liu, H. Q. Lu, and Y. W. Chen, Pattern Recogn., 2006, 39, 81.
M. J. Sáiz-Abajo, B.-H. Mevikb, V. H. Segtnan, and T. Næs, Anal. Chim. Acta, 2005, 533, 147.
B. H.-Mevik, V. H. Segtnan, and T. Næs, J. Chemometr., 2004, 18, 498.
M. H. Zhang, Q. S. Xu, and D. L. Massart, Anal. Chem., 2005, 77, 1423.
H. Shinzawa, J. H. Jiang, P. Ritthiruangdej, and Y. Ozaki, J. Chemometr., 2006, 20, 436.
M. Christian, M. Harald, S. G. Tanja, R. Olivier, S. Martin, and L. Thomas, J. Chem. Inf. Comput. Sci., 2004, 44, 1971.
A. Tsymbal, M. Pechenizkiy, and P. Cunningham, Inf. Fusion, 2005, 6, 83.
G. Brown, J. L. Wyatt, and P. Timo, J. Mach. Learn. Res., 2005, 6, 1621.
P. He, C. J. Xu, Y. Z. Liang, and K. T. Fang, Chemom. Intell. Lab. Syst., 2004, 70, 39.
N. Benoudjita, E. Cools, M. Meurens, and M. Verleysen, Chemom. Intell. Lab. Syst., 2004, 70, 47.
R. Rosipal and L. Trejo, J. Mach. Learn. Res., 2001, 2, 97.
R. K. H. Galváo, M. C. U. Araújo, G. E. José, M. J. C. Pontes, E. C. Silva, and T. C. B. Saldanha, Talanta, 2005, 67, 736.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Tan, C., Li, M. & Qin, X. Random Subspace Regression Ensemble for Near-Infrared Spectroscopic Calibration of Tobacco Samples. ANAL. SCI. 24, 647–653 (2008). https://doi.org/10.2116/analsci.24.647
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.2116/analsci.24.647