Ensemble Data Classification based on Diversity of Classifiers Optimized by Genetic Algorithm

Article Preview

Abstract:

In this research we propose an ensemble classification technique base on creating classification from a variety of techniques such as decision trees, support vector machines, neural networks and then choosing optimize the appropriate classifiers by genetic algorithm and also combined by a majority vote in order to increase classification accuracy. From classification accuracy test on Australian Credit, German Credit and Bankruptcy Data, we found that the proposed ensemble classification models selected by genetic algorithm yields highest performance and our algorithms are effective in building ensemble.

You might also be interested in these eBooks

Info:

Periodical:

Advanced Materials Research (Volumes 433-440)

Pages:

6572-6578

Citation:

Online since:

January 2012

Export:

Price:

[1] Y. Yao, Z. Fu, X. Zhao and W. Cheng, Combining Classifier Based on Decision Tree, icie, vol. 2, pp.37-40, 2009 WASE International Conference on Information Engineering, (2009).

Google Scholar

[2] J. Wang, J. Yang, S. Li, Q. Dai, and J. Xie, Number Image Recognition Based on Neural Network Ensemble, in Proceedings of the Third International Conference on Natural Computation - Volume 01: IEEE Computer Society, (2007).

DOI: 10.1109/icnc.2007.506

Google Scholar

[3] K. C. Ying, S. -W. Lin, Z. J. Lee, and Y. T. Lin, An ensemble approach applied to classify spam e-mails, Expert Syst. Appl., vol. 37, pp.2197-2201, (2010).

DOI: 10.1016/j.eswa.2009.07.080

Google Scholar

[4] X. Pei-Yong, D. X. Qian, and J. B. Ning, A GA-based feature selection and ensemble learning for high-dimensional datasets, in Machine Learning and Cybernetics, 2009 International Conference on, (2009), pp.7-12.

DOI: 10.1109/icmlc.2009.5212542

Google Scholar

[5] L. Zhou, K. K. Lai, and L. Yu, Least squares support vector machines ensemble models for credit scoring, Expert Systems with Applications, vol. 37, pp.127-133, (2010).

DOI: 10.1016/j.eswa.2009.05.024

Google Scholar

[6] G. Martinez-Muoz, D. Hernandez-Lobato, and A. Suarez, An Analysis of Ensemble Pruning Techniques Based on Ordered Aggregation, IEEE Trans. Pattern Anal. Mach. Intell., vol. 31, pp.245-259, (2009).

DOI: 10.1109/tpami.2008.78

Google Scholar

[7] K. -H. Liu, B. Li, J. Zhang, and J. -X. Du, Ensemble component selection for improving ICA based microarray data prediction models, Pattern Recognition, vol. 42, pp.1274-1283, (2009).

DOI: 10.1016/j.patcog.2009.01.021

Google Scholar

[8] X. B. C. B. Ning, Y.W. Xu, J. Zhang, Virus-Evolutionary Genetic Algorithm Based Selective Ensemble Classifier for Pedestrian Detection, " in GEC, 09 Shanghai, China, (2009).

DOI: 10.1145/1543834.1543893

Google Scholar

[9] M. T. Mitchell, Machine Learning. New York: McGraw-Hill, (1997).

Google Scholar

[10] Quinlan, J. R. Induction of decision trees. Machine Learning, 81–106, (1986).

Google Scholar

[11] Defu Zhang, Qingshan Jiang, and Xin Li. A Hybrid Mining Model Based on Neural Network and Kernel Smoothing Technique. Lecture Notes in Computer Science. (2005), 3516: 801–805.

DOI: 10.1007/11428862_110

Google Scholar

[12] V. Vapnik, The nature of statistical learning theory, New York: Wiley, (1998).

Google Scholar

[13] B. SchP olkopf, A. Smola, Learning with Kernels, MIT Press, Cambridge, MA, (2002).

Google Scholar

[14] Breiman, L. Bagging predictors. Machine Learning, 24, 123–140, (1996).

Google Scholar

[15] J. H. Holland, Adaptation in Natural and Artificial Systems An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence. The MIT Press, (1992).

DOI: 10.7551/mitpress/1090.001.0001

Google Scholar

[16] Murphy, P. M., Aha, D. W. UCI repository of machine learning databases. Department of Information and Computer Science, Uni-versity of California Irvine, CA. Available from http: /archive. ics. uci. edu/ml/, (2001).

Google Scholar

[17] W. Pietruszkiewicz, Application of discrete Predicting structures in an earlywarning expert system for financial distress. Ph.D. Thesis, Szczecin, Technical University, Szczecin., (2004).

Google Scholar

[18] Rayner Alfred, Knowledge Discovery: Enhancing Data Mining and Decision Support Integration, (2005).

Google Scholar

[19] L. Breiman, Bagging predictors, Machine Learning., vol. 24, pp.123-140, (1996).

Google Scholar