Integrating Categorical Variables with Multiobjective Genetic Programming for Classifier Construction

  • Conference paper
Genetic Programming (EuroGP 2008)

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 4971)

Abstract

Genetic programming (GP) has proved successful at evolving pattern classifiers. Although the paradigm lends itself easily to continuous pattern attributes, the incorporation of categorical attributes has been little studied. Here we construct two synthetic datasets specifically to investigate the use of categorical attributes in GP and consider two possible approaches: indicator variables and integer mapping. We conclude that for ordered attributes, integer mapping yields the lowest errors, whereas for purely nominal attributes, indicator variables give the lowest misclassification errors.
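
As an illustration of the two encodings named in the abstract, the following is a minimal sketch of integer mapping and indicator variables in plain Python. The function names, the example attributes, and their levels are hypothetical and chosen only for illustration; the paper's own encoding details and datasets are not reproduced here.

```python
from typing import Dict, List, Sequence


def integer_mapping(values: Sequence[str], levels: Sequence[str]) -> List[int]:
    """Map each category to its position in `levels` (0, 1, 2, ...)."""
    index: Dict[str, int] = {level: i for i, level in enumerate(levels)}
    return [index[v] for v in values]


def indicator_variables(values: Sequence[str], levels: Sequence[str]) -> List[List[int]]:
    """Encode each category as a 0/1 vector with one slot per level."""
    return [[1 if v == level else 0 for level in levels] for v in values]


# Hypothetical ordered attribute: the order of the levels carries meaning.
sizes = ["small", "large", "medium"]
size_levels = ["small", "medium", "large"]
size_codes = integer_mapping(sizes, size_levels)

# Hypothetical nominal attribute: no meaningful order between the levels.
colours = ["red", "blue", "red"]
colour_levels = ["red", "green", "blue"]
colour_indicators = indicator_variables(colours, colour_levels)

print(size_codes)
print(colour_indicators)
```

Integer mapping produces a single numeric input but imposes an ordering on the levels, which is natural only when the attribute is ordered. Indicator variables avoid implying any order, at the cost of one input per level.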

Editor information

Michael O’Neill, Leonardo Vanneschi, Steven Gustafson, Anna Isabel Esparcia Alcázar, Ivanoe De Falco, Antonio Della Cioppa, Ernesto Tarantino

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Badran, K., Rockett, P. (2008). Integrating Categorical Variables with Multiobjective Genetic Programming for Classifier Construction. In: O’Neill, M., et al. Genetic Programming. EuroGP 2008. Lecture Notes in Computer Science, vol 4971. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78671-9_26

  • DOI: https://doi.org/10.1007/978-3-540-78671-9_26

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-78670-2

  • Online ISBN: 978-3-540-78671-9

  • eBook Packages: Computer Science, Computer Science (R0)
