skip to main content
10.1145/1830483.1830650acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
research-article

Coevolutionary multi-population genetic programming for data classification

Published:07 July 2010Publication History

ABSTRACT

This work presents a new evolutionary ensemble method for data classification, which is inspired by the concepts of bagging and boosting, and aims at combining their good features while avoiding their weaknesses. The approach is based on a distributed multiple-population genetic programming (GP) algorithm which exploits the technique of coevolution at two levels. On the inter-population level the populations cooperate in a semi-isolated fashion, whereas on the intra-population level the candidate classifiers coevolve competitively with the training data samples. The final classifier is a voting committee composed by the best members of all the populations. The experiments performed in a varying number of populations show that our approach outperforms both bagging and boosting for a number of benchmark problems.

References

  1. D. A. Augusto, H. J. Barbosa, and N. F. Ebecken. Coevolution of data samples and classifiers integrated with grammatically-based genetic programming for data classification. In Proc. of the 2008 Conf. on Genetic and Evolutionary Computation, pages 1171--1178, New York, NY, USA, 2008. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. M. Brameier and W. Banzhaf. Evolving teams of predictors with linear genetic programming. Genetic Programming and Evolvable Machines, 2(4):381--407, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. T. G. Dietterich. Ensemble methods in machine learning. LNCS, 1857:1--15, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. G. Folino, C. Pizzuti, and G. Spezzano. Gp ensembles for large-scale data classification. IEEE Trans. Evolutionary Computation, 10(5):604--616, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. G. Folino, C. Pizzuti, G. Spezzano, L. Vanneschi, and M. Tomassini. Diversity analysis in cellular and multipopulation genetic programming. In IEEE Congress on Evolutionary Computation, pages 305--311, 2003.Google ScholarGoogle Scholar
  6. Y. Freund. An adaptive version of the boost by majority algorithm. In In Proc. of the Twelfth Annual Conf. on Computational Learning Theory, pages 102--113, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Y. Freund and R. E. Schapire. A decision-theoretic generalization of on-line learning and an application to boosting. In Proc. of the 2nd European Conf. on Computational Learning Theory, pages 23--37, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. H. Iba. Bagging, boosting, and bloating in genetic programming. In W. Banzhaf et al., editors, Proc. of the 1999 Genetic and Evolutionary Computation Conf., volume 2, pages 1053--1060, Orlando, Fl, USA.Google ScholarGoogle Scholar
  9. R. Kohavi. A study of cross-validation and bootstrap for accuracy estimation and model selection. In IJCAI, pages 1137--1145, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. S. Kotsiantis and P. Pintelas. Combining bagging and boosting. International Journal of Computational Intelligence and Applications, 1(4):324--333, 2004.Google ScholarGoogle Scholar
  11. L. I. Kuncheva and C. J. Whitaker. Measures of diversity in classifier ensembles. Machine Learning, 51:181--207, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. J. Paredis. Steps towards co-evolutionary classification neural networks. In Proc. of the Fourth International Workshop on the Synthesis and Simulation of Living Systems, pages 102--108, 1994.Google ScholarGoogle Scholar
  13. G. Paris, D. Robilliard, and C. Fonlupt. Applying boosting techniques to genetic programming. In Selected Papers from the 2002 European Conf. on Artificial Evolution, pages 267--280, London, UK. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. R. Poli and N. F. McPhee. Parsimony pressure made easy. In GECCO '08: Proceedings of the 10th annual conference on Genetic and evolutionary computation, pages 1267--1274, New York, NY, USA, 2008. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. M. Skurichina, L. Kuncheva, and R. P. W. Duin. Bagging and boosting for the nearest mean classifier: Eects of sample size on diversity and accuracy. In Proc. of the 3th International Workshop on Multiple Classifier Systems, pages 62--71, London, UK, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. A. C. Tan and D. Gilbert. An empirical comparison of supervised machine learning techniques in bioinformatics. In Proc. of the First Asia-Pacific Bioinformatics Conf., pages 219--222, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. M. Tomassini. Spatially Structured Evolutionary Algorithms: Artificial Evolution in Space and Time. Springer-Verlag, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. J. Zhu, S. Rosset, H. Zou, and T. Hastie. Multi-class AdaBoost. Technical report, Univ. of Michigan, 2006.Google ScholarGoogle Scholar

Index Terms

  1. Coevolutionary multi-population genetic programming for data classification

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          GECCO '10: Proceedings of the 12th annual conference on Genetic and evolutionary computation
          July 2010
          1520 pages
          ISBN:9781450300728
          DOI:10.1145/1830483

          Copyright © 2010 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 7 July 2010

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate1,669of4,410submissions,38%

          Upcoming Conference

          GECCO '24
          Genetic and Evolutionary Computation Conference
          July 14 - 18, 2024
          Melbourne , VIC , Australia

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader