ABSTRACT
This work presents a new evolutionary ensemble method for data classification. Inspired by the concepts of bagging and boosting, it aims to combine their strengths while avoiding their weaknesses. The approach is based on a distributed multiple-population genetic programming (GP) algorithm that exploits coevolution at two levels: on the inter-population level, the populations cooperate in a semi-isolated fashion, whereas on the intra-population level, the candidate classifiers coevolve competitively with the training data samples. The final classifier is a voting committee composed of the best members of all the populations. Experiments performed with varying numbers of populations show that our approach outperforms both bagging and boosting on a number of benchmark problems.
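The two-level scheme described above can be illustrated with a deliberately simplified sketch. This is not the authors' algorithm: random threshold "stumps" stand in for evolved GP trees, the evolutionary loop is a crude keep-the-fitter-half routine, and the sample reweighting rule is a hypothetical boosting-like stand-in for the competitive coevolution of classifiers and training samples. All names and parameters are illustrative assumptions.

```python
import random

random.seed(0)

# Toy 1-D dataset: the true label is 1 when x > 0.5.
X = [random.random() for _ in range(200)]
y = [1 if x > 0.5 else 0 for x in X]

def make_stump():
    """Random threshold classifier standing in for a GP individual."""
    t = random.random()
    return lambda x, t=t: 1 if x > t else 0

def weighted_accuracy(clf, weights):
    """Classifier fitness: accuracy where 'hard' samples count more."""
    hit = sum(w for x, lab, w in zip(X, y, weights) if clf(x) == lab)
    return hit / sum(weights)

def evolve_population(weights, pop_size=20, generations=10):
    """Crude evolutionary loop: keep the fitter half, refill randomly."""
    pop = [make_stump() for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=lambda c: weighted_accuracy(c, weights), reverse=True)
        pop = pop[: pop_size // 2] + [make_stump() for _ in range(pop_size // 2)]
    return max(pop, key=lambda c: weighted_accuracy(c, weights))

n_pops = 5
weights = [1.0] * len(X)   # sample "fitness": grows when a sample is missed
committee = []
for _ in range(n_pops):
    best = evolve_population(weights)
    committee.append(best)
    # Competitive coevolution of the training data (simplified): samples
    # the latest best classifier still misclassifies gain weight, so later
    # populations are pressured to handle them.
    for i, (x, lab) in enumerate(zip(X, y)):
        if best(x) != lab:
            weights[i] *= 1.5

def committee_predict(x):
    """Majority vote of the best member from each population."""
    votes = sum(clf(x) for clf in committee)
    return 1 if votes * 2 > len(committee) else 0

acc = sum(committee_predict(x) == lab for x, lab in zip(X, y)) / len(X)
```

The design point the sketch captures is the division of labor: each population contributes one committee member (the bagging-like part), while the shared sample weights couple the populations sequentially (the boosting-like part).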
Index Terms
- Coevolutionary multi-population genetic programming for data classification