Contributions of Domain Knowledge and Stacked Generalization in AI-Based Classification Models

Wu, Weiping; Lee, Vincent ChengSiong; Tan, TingYean

doi:10.1007/978-3-540-30549-1_100

Weiping Wu²⁰,
Vincent ChengSiong Lee²⁰ &
TingYean Tan²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3339))

Included in the following conference series:

Australasian Joint Conference on Artificial Intelligence

2574 Accesses
1 Citations

Abstract

We exploit the merits of C4.5 decision tree classifier with two stacking meta-learners: back-propagation multilayer perceptron neural network and naive-Bayes respectively. The performance of these two hybrid classification schemes have been empirically tested and compared with C4.5 decision tree using two US data sets (raw data set and new data set incorporated with domain knowledge) simultaneously to predict US bank failure. Significant improvements in prediction accuracy and training efficiency have been achieved in the schemes based on new data set. The empirical test results suggest that the proposed hybrid schemes perform marginally better in term of AUC criterion.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 149.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bradley, A.P.: The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition 30, 1145–1159 (1997)
Article Google Scholar
Cherkassky, V., Lari-Najafi, H.: Data representation for diagnostic neural networks. IEEE Expert 7, 43–53 (1992)
Article Google Scholar
Fawcett, T.: ROC Graphs: Notes and Practical Considerations for Researchers (2004), http://www.hpl.hp.com/personal/Tom_Fawcett/papers/ROC101.pdf
George, H.H., Donald, G.S., Alan, B.C.: Bank management: text and cases. John Wiley & Sons, Inc, Chichester (1994)
Google Scholar
Hirsh, H., Noordewier, M.: Using background knowledge to improve inductive learning of DAS sequences. In: Proceedings of IEEE Conference on AI for Applications (1994)
Google Scholar
John, G., Kohavi, R., Pfleger, K.: Irrelevant features and subset selection problem. In: Proceedings of 11^th International Conference on Machine Learning (1994)
Google Scholar
Koller, D., Sahami, M.: Toward optimal feature selection. In: Proceedings of the 13^th International Conference on Machine Learning (1996)
Google Scholar
Ledezma, A., Aler, R., Borrajo, D.: Empirical study of a stacking state-space - Tools with Artificial Intelligence. In: Proceedings of the 13th International Conference. IEEE Expert, vol. 7-9, pp. 210–217 (2001)
Google Scholar
Piramuthu, S., Shaw, M.J., Gentry, J.A.: A classification approach using multi-layered neural networks. Decision Support Systems 11, 509–525 (1994)
Article Google Scholar
Radcliffe, N.J., Surry, P.D.: Fundamental limitations on search algorithms: Evolutionary computing in perspective. In: van Leeuwen, J. (ed.) Computer Science Today. LNCS, vol. 1000, Springer, Heidelberg (1995)
Chapter Google Scholar
Witten, I.H., Frank, E.: Data mining—Practical machine learning tools and techniques with Java implementation. Morgan Kaufmann Publisher, San Francisco (1999)
Google Scholar
Zhou, Z.H., Jiang, Y.: NeC4.5: Neural Ensemble Based C4.5. IEEE Transactions on knowledge and data engineering 16(6), 770–773 (2004)
Article MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

School of Business Systems,
Weiping Wu & Vincent ChengSiong Lee
Department of Accounting and Finance, Monash University, Wellington Road, Clayton, Victoria, 3800, Australia
TingYean Tan

Authors

Weiping Wu
View author publications
You can also search for this author in PubMed Google Scholar
Vincent ChengSiong Lee
View author publications
You can also search for this author in PubMed Google Scholar
TingYean Tan
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Information Technology, Monash University, VIC 3800, Australia
Geoffrey I. Webb
Science, Engineering and Technology Portfolio, Royal Melbourne Institute of Technology, VIC 3001, Melbourne, Australia
Xinghuo Yu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wu, W., Lee, V.C., Tan, T. (2004). Contributions of Domain Knowledge and Stacked Generalization in AI-Based Classification Models. In: Webb, G.I., Yu, X. (eds) AI 2004: Advances in Artificial Intelligence. AI 2004. Lecture Notes in Computer Science(), vol 3339. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30549-1_100

Download citation

DOI: https://doi.org/10.1007/978-3-540-30549-1_100
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24059-4
Online ISBN: 978-3-540-30549-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics