Abstract
We exploit the merits of C4.5 decision tree classifier with two stacking meta-learners: back-propagation multilayer perceptron neural network and naive-Bayes respectively. The performance of these two hybrid classification schemes have been empirically tested and compared with C4.5 decision tree using two US data sets (raw data set and new data set incorporated with domain knowledge) simultaneously to predict US bank failure. Significant improvements in prediction accuracy and training efficiency have been achieved in the schemes based on new data set. The empirical test results suggest that the proposed hybrid schemes perform marginally better in term of AUC criterion.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bradley, A.P.: The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition 30, 1145–1159 (1997)
Cherkassky, V., Lari-Najafi, H.: Data representation for diagnostic neural networks. IEEE Expert 7, 43–53 (1992)
Fawcett, T.: ROC Graphs: Notes and Practical Considerations for Researchers (2004), http://www.hpl.hp.com/personal/Tom_Fawcett/papers/ROC101.pdf
George, H.H., Donald, G.S., Alan, B.C.: Bank management: text and cases. John Wiley & Sons, Inc, Chichester (1994)
Hirsh, H., Noordewier, M.: Using background knowledge to improve inductive learning of DAS sequences. In: Proceedings of IEEE Conference on AI for Applications (1994)
John, G., Kohavi, R., Pfleger, K.: Irrelevant features and subset selection problem. In: Proceedings of 11th International Conference on Machine Learning (1994)
Koller, D., Sahami, M.: Toward optimal feature selection. In: Proceedings of the 13th International Conference on Machine Learning (1996)
Ledezma, A., Aler, R., Borrajo, D.: Empirical study of a stacking state-space - Tools with Artificial Intelligence. In: Proceedings of the 13th International Conference. IEEE Expert, vol. 7-9, pp. 210–217 (2001)
Piramuthu, S., Shaw, M.J., Gentry, J.A.: A classification approach using multi-layered neural networks. Decision Support Systems 11, 509–525 (1994)
Radcliffe, N.J., Surry, P.D.: Fundamental limitations on search algorithms: Evolutionary computing in perspective. In: van Leeuwen, J. (ed.) Computer Science Today. LNCS, vol. 1000, Springer, Heidelberg (1995)
Witten, I.H., Frank, E.: Data mining—Practical machine learning tools and techniques with Java implementation. Morgan Kaufmann Publisher, San Francisco (1999)
Zhou, Z.H., Jiang, Y.: NeC4.5: Neural Ensemble Based C4.5. IEEE Transactions on knowledge and data engineering 16(6), 770–773 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wu, W., Lee, V.C., Tan, T. (2004). Contributions of Domain Knowledge and Stacked Generalization in AI-Based Classification Models. In: Webb, G.I., Yu, X. (eds) AI 2004: Advances in Artificial Intelligence. AI 2004. Lecture Notes in Computer Science(), vol 3339. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30549-1_100
Download citation
DOI: https://doi.org/10.1007/978-3-540-30549-1_100
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24059-4
Online ISBN: 978-3-540-30549-1
eBook Packages: Computer ScienceComputer Science (R0)