Hybrid ensemble selection algorithm incorporating GRASP with path relinking

Zhang, Ting; Dai, Qun

doi:10.1007/s10489-015-0724-4

Published: 14 November 2015

Volume 44, pages 704–724, (2016)

Applied Intelligence

Abstract

The greedy randomized adaptive search procedure (GRASP) is an iterative, two-phase, multi-start metaheuristic for combinatorial optimization problems, and path relinking is an intensification procedure applied to the solutions that GRASP generates. In this paper, a hybrid ensemble selection algorithm incorporating GRASP with path relinking (PRelinkGraspEnS) is proposed for credit scoring. The base learner of the proposed method is the extreme learning machine (ELM). Bootstrap aggregation (bagging) is used to produce multiple diversified ELMs, and GRASP with path relinking performs the ensemble selection. The new algorithm inherits the advantages of the ELM, including fast learning speed, good generalization performance, and easy implementation. The PRelinkGraspEnS algorithm realizes a multi-start search and is able to escape from local optima. Incorporating path relinking into GRASP, and using the result as the ensemble selection method of PRelinkGraspEnS, gives the proposed algorithm a memory and a high convergence speed. Three credit datasets are used to verify the efficiency of the proposed PRelinkGraspEnS algorithm. Experimental results demonstrate that PRelinkGraspEnS achieves significantly better generalization performance than the classical directed hill climbing ensemble pruning algorithm, support vector machines, multi-layer perceptrons, and a baseline method, the best single model. The experimental results further show that, by decreasing the average time needed to find a good-quality subensemble for the credit scoring problem, GRASP with path relinking outperforms pure GRASP (i.e., without path relinking).
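
The selection scheme described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the objective function, the candidate-list rule, or the elite-set policy, so majority-vote accuracy on a validation set, a value-based restricted candidate list controlled by alpha, and a small elite pool are all assumptions made here. The function names and the toy library of noisy binary classifiers (a stand-in for bagged ELMs) are hypothetical.

```python
import random


def vote_accuracy(subset, preds, y):
    """Majority-vote accuracy of the models in `subset` (ties predict class 0)."""
    if not subset:
        return 0.0
    hits = 0
    for j, label in enumerate(y):
        ones = sum(preds[i][j] for i in subset)
        hits += int((1 if 2 * ones > len(subset) else 0) == label)
    return hits / len(y)


def construct(preds, y, alpha, rng):
    """Greedy randomized construction using a restricted candidate list (RCL)."""
    subset, current = frozenset(), 0.0
    while len(subset) < len(preds):
        scored = [(vote_accuracy(subset | {i}, preds, y), i)
                  for i in range(len(preds)) if i not in subset]
        best = max(s for s, _ in scored)
        worst = min(s for s, _ in scored)
        if best <= current:  # no candidate improves the current ensemble
            break
        # alpha = 0 is purely greedy, alpha = 1 is purely random (assumed rule)
        threshold = best - alpha * (best - worst)
        rcl = [i for s, i in scored if s >= threshold]
        subset = subset | {rng.choice(rcl)}
        current = vote_accuracy(subset, preds, y)
    return subset


def local_search(subset, preds, y):
    """Best-improvement search over single add/remove flips."""
    current = vote_accuracy(subset, preds, y)
    while True:
        neighbours = [subset ^ {i} for i in range(len(preds))]
        cand = max(neighbours, key=lambda s: vote_accuracy(s, preds, y))
        cand_acc = vote_accuracy(cand, preds, y)
        if cand_acc <= current:
            return subset
        subset, current = cand, cand_acc


def path_relink(start, guide, preds, y):
    """Walk from `start` to `guide` one flip at a time; return the best subset seen."""
    current = best = start
    best_acc = vote_accuracy(start, preds, y)
    while current != guide:
        steps = [current ^ {i} for i in current ^ guide]
        current = max(steps, key=lambda s: vote_accuracy(s, preds, y))
        acc = vote_accuracy(current, preds, y)
        if acc > best_acc:
            best, best_acc = current, acc
    return best


def grasp_path_relinking(preds, y, iters=30, alpha=0.3, elite_size=5, seed=0):
    """Multi-start loop: construct, improve, relink with an elite solution."""
    rng = random.Random(seed)
    elite = []  # (accuracy, subset) pairs; this pool is the "memory"
    for _ in range(iters):
        sol = local_search(construct(preds, y, alpha, rng), preds, y)
        if elite:
            _, guide = rng.choice(elite)
            sol = local_search(path_relink(sol, guide, preds, y), preds, y)
        entry = (vote_accuracy(sol, preds, y), sol)
        if entry not in elite:
            elite.append(entry)
            elite.sort(key=lambda e: e[0], reverse=True)
            del elite[elite_size:]
    return elite[0][1], elite[0][0]


def make_toy_library(n_models, n_samples, noise, seed):
    """Hypothetical stand-in for a bagged classifier library: noisy copies of y."""
    rng = random.Random(seed)
    y = [rng.randint(0, 1) for _ in range(n_samples)]
    preds = [[1 - label if rng.random() < noise else label for label in y]
             for _ in range(n_models)]
    return preds, y


if __name__ == "__main__":
    preds, y = make_toy_library(n_models=15, n_samples=80, noise=0.3, seed=1)
    subset, acc = grasp_path_relinking(preds, y)
    print(sorted(subset), round(acc, 3))
```

In this sketch the elite pool plays the role of the memory mentioned in the abstract: each new local optimum is relinked toward a previously found good subensemble, and the best subset met along that path is kept. Selecting on the same data used to score the subsets, as the toy example does, would overfit in practice; a held-out validation set is assumed.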


Acknowledgments

This work is supported by the National Natural Science Foundation of China under Grant no. 61473150.

Author information

Authors and Affiliations

College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, 210016, China
Ting Zhang & Qun Dai

Corresponding author

Correspondence to Qun Dai.

Ethics declarations

Compliance with Ethical Standards

1. Disclosure of potential conflict of interest

Conflict of interest: The authors declare that they have no conflict of interest.
2. Research involving Human Participants and/or Animals

All procedures performed in studies involving human participants were carried out in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards.

For this type of study formal consent was not required.

All applicable international, national, and/or institutional guidelines for the care and use of animals were followed.
3. Informed consent

Informed consent was obtained from all individual participants involved in the study.

Additional informed consent was obtained from all individual participants for whom identifying information is included in this article.

About this article

Cite this article

Zhang, T., Dai, Q. Hybrid ensemble selection algorithm incorporating GRASP with path relinking. Appl Intell 44, 704–724 (2016). https://doi.org/10.1007/s10489-015-0724-4

Published: 14 November 2015
Issue Date: April 2016
DOI: https://doi.org/10.1007/s10489-015-0724-4
