Abstract
Induction of useful rules from databases has been studied by several researchers. There remains need for systematic comparison of alternative such methods, especially considering the available variety of rule representation strategies, genetic operators, evolutionary algorithm designs, and so forth. Here, the performance of five commonly employed evolutionary algorithms are examined on a collection of 100 separate rule induction tasks on five freely available datasets. All tasks require the generation of rules in disjunctive normal form with either a fixed or free consequent maximising an accuracy/applicability tradeoff measure; tasks differ in terms of the dataset used, the identity of a fixed consequent (or no fixed consequent), and the maximum number of disjuncts allowed in the antecedent. Results generally indicate that single-member based methods (hill climbing, simulated annealing, tabu search) fare at least as well as population based techniques when rules are restricted to fairly low complexity, but this situation is reversed as rules are allowed to be more complex. These results are of import to data mining application developers and researchers wishing to find the appropriate search strategy for rule induction with respect to their particular needs.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
M.W. Carter, G. Laporte, and S.Y. Lee. Examination timetabling: Algorithmic strategies and applications. Operational Research Society, 47(3):373–383, 1996.
B. Iglesia, J.C.M. Debuse, and V.J. Rayward-Smith. Discovering knowledge in commercial database using modern heuristic techniques. Technical report, Department of Information Systems, University of East Anglia, 1996.
M. Klemettinen, H. Mannila, P. Roukainen, M. Toivonen, and I. Verkamo. Finding interesting rules from large sets of discovered association rules. In N. Adam, B. Bhargava, and Y. Tesha, editors, Third International Conference on Information and Knowledge Management (CIKM94), pages 401–407. ACM Press, 1994.
H. Mannila, M. Toivonen, and I. Verkamo. Efficient algorithms for discovering association rules. In Fayyad and Uthurusamy, editors, Knowledge Discovery in Databases, pages 181–192. AAAI Press, 1994.
M. Hosheimer and A.P.J.M. Siebes. Data mining: the search for knowledge in databases. Technical Report Report CR-R9406, CWI, The Netherlands, 1994. (available via:ftp://ftp.cwi.nl/pub/CWIreports/AA/CSR9406.ps.Z).
R. Quinlan. C4.5 Programs for Machine Learning. Morgan Kauffman, San Mateo, CA, 1993.
N.J. Radcliffe. GA-miner: Parallel data mining with hierarchical genetic algorithms. Technical Report GR/J99278, EPSRC AIKMS Grant, 1996.
N.J. Radcliffe and P.D. Surry. Co-operation through hierarchical competition in genetic data mining. Technical Report EPCC-TR94-09, EPCC, 1994.
C. Voudrais and E. Tsang. Partial constraint satisfaction problems and guided local search. Technical Report TR CSM-250, Dept. Computer Science, University of Essex, 1995.
D. Whitley. The GENITOR algorithms and selection pressure. In J.D. Schaffer, editor, The Proceedings of the Third International Conference on Genetic Algorithms, San Mateo, 1989. Morgan Kaufmann.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 1998 Springer-Verlag Wien
About this paper
Cite this paper
Corne, D. (1998). Comparing a Variety of Evolutionary Algorithm Techniques on a Collection of Rule Induction Tasks. In: Artificial Neural Nets and Genetic Algorithms. Springer, Vienna. https://doi.org/10.1007/978-3-7091-6492-1_121
Download citation
DOI: https://doi.org/10.1007/978-3-7091-6492-1_121
Publisher Name: Springer, Vienna
Print ISBN: 978-3-211-83087-1
Online ISBN: 978-3-7091-6492-1
eBook Packages: Springer Book Archive