research-article

Characterizing fault tolerance in genetic programming

Authors:
Daniel Lombraña González

University of Extremadura, Mérida, Spain

University of Extremadura, Mérida, Spain
View Profile

,
Francisco Fernández de Vega

University of Extremadura, Mérida, Spain

University of Extremadura, Mérida, Spain
View Profile

,
Henri Casanova

University of Hawai'i, Manoa, USA

University of Hawai'i, Manoa, USA
View Profile

BADS '09: Proceedings of the 2009 workshop on Bio-inspired algorithms for distributed systemsJune 2009Pages 1–10https://doi.org/10.1145/1555284.1555286

Published:19 June 2009Publication History

BADS '09: Proceedings of the 2009 workshop on Bio-inspired algorithms for distributed systems

Pages 1–10

ABSTRACT

Evolutionary Algorithms (EAs), and particularly Genetic Programming (GP), are techniques frequently employed to solve difficult real-life problems, which can require up to days or months of computation. One approach to reduce the time to solution is to use parallel computing on distributed platforms. Distributed platforms are prone to failures, and when these platforms are large and/or low-cost, failures are expected events rather than catastrophic exceptions. Therefore, fault tolerance and recovery techniques often become necessary. It turns out that Parallel GP (PGP) applications have an inherent ability to tolerate failures. This ability is quantified via simulation experiments performed using failure traces from real-world distributed platforms, namely, desktop grids (DGs), for two well-known GP problems. A simple technique is then proposed by which PGP applications can better tolerate the different, and often high, failures rates seen in different platforms.

References

D. Anderson. Boinc: a system for public-resource computing and storage. In Grid Computing, 2004. Proceedings. Fifth IEEE/ACM International Workshop on, pages 4--10, 2004. Google ScholarDigital Library
D. Andre and J. R. Koza. Parallel genetic programming: a scalable implementation using the transputer network architecture. pages 317--337, 1996. Google ScholarDigital Library
S. B. and G. G. A. A Large-Scale Study of Failures in High-Performance Computing Systems. In Proceedings of the International Conference on Dependable Systems, pages 249--258, 2006. Google ScholarDigital Library
W. Banzhaf and W. B. Langdon. Some considerations on the reason for bloat. Genetic Programming and Evolvable Machines, 3(1):81--91, Mar. 2002. Google ScholarDigital Library
A. Baratloo, P. Dasgupta, and Z. Kedem. Calypso: a novel software system for fault-tolerant parallel processing on distributed platforms. hpdc, 00:122, 1995. Google ScholarDigital Library
C. C., H. T., L. P., P. L., R. A., R. E., and C. F. Blocking vs. Non-Blocking Coordinated Checkpointing for Large-Scale Fault Tolerant MPI. In Proceedings of the ACM/IEEE SC Conference, Nov. 2006. Google ScholarDigital Library
S. Cahon, N. Melab, and E. Talbi. ParadisEO: A Framework for the Reusable Design of Parallel and Distributed Metaheuristics. Journal of Heuristics, 10(3):357--380, 2004. Google ScholarDigital Library
K. D., F. G., C. F., C. A. A., and C. H. Resource Availability in Enterprise Desktop Grids. Journal of Future Generation Computer Systems, 23(7):888--903, 2007. Google ScholarDigital Library
F. F. de Vega. A fault tolerant optimization algorithm based on evolutionary computation. In Proceedings of the International Conference on Dependability of Computer Systems, 2006. Google ScholarDigital Library
M. L. Douglas Thain. The Grid 2, chapter 19, pages 285--318. Morgan Kaufmann, 2004.Google Scholar
E. Elnozahy, L. Alvisi, Y. Wang, and D. Johnson. A survey of rollback-recovery protocols in message-passing systems. ACM Computing Surveys (CSUR), 34(3):375--408, 2002. Google ScholarDigital Library
L. V. F. Fernández, M. Tomassini. Saving computational effort in genetic programming by means of plagues. Evolutionary Computation, 2003. CEC'03. The 2003 Congress on, 2003.Google ScholarCross Ref
F. Fernandez, G. Spezzano, M. Tomassini, and L. Vanneschi. Parallel genetic programming. In E. Alba, editor, Parallel Metaheuristics, Parallel and Distributed Computing, chapter 6, pages 127--153. Wiley-Interscience, Hoboken, New Jersey, USA, 2005.Google Scholar
F. Fernández and D. Lombraña. Algoritmos evolutivos tolerantes a fallos en entornos de computación distribuida. In XVII Jornadas de Paralelismo, volume 1, pages 401--406, Albacete, Spain, September 2006.Google Scholar
G. Folino, C. Pizzuti, and G. Spezzano. CAGE: A tool for parallel genetic programming applications. In J. F. M. et. al., editor, Genetic Programming, Proceedings of EuroGP'2001, volume 2038 of LNCS, pages 64--73, Lake Como, Italy, 18-20 Apr. 2001. Springer-Verlag. Google ScholarDigital Library
F. G., G. E., B. G., A. T., C. Z., P.-G. J., L. K., and D. J. Extending the MPI Specification for Process Fault Tolerance on High Performance Computing Systems. In Proceedings of International Supercomputer Conference, June 2004.Google Scholar
C. Gagné, M. Parizeau, and M. Dubreuil. Distributed beagle: An environment for parallel and distributed evolutionary computations. In Proc. of the 17th Annual International Symposium on High Performance Computing Systems and Applications (HPCS) 2003, pages 201--208, May 11-14 2003.Google Scholar
F. C. Gartner. Fundamentals of fault-tolerant distributed computing in asynchronous environments. ACM Computing Surveys, 31(1):1--26, 1999. Google ScholarDigital Library
S. Ghosh. Distributed systems: an algorithmic approach. Chapman & Hall/CRC, 2006.Google Scholar
I. Hidalgo, F. Fernández, J. Lanchares, and D. Lombraña. Is the island model fault tolerant? In Genetic and Evolutionary Computation Conference, volume 2, page 1519, London, England, July 2007. Google ScholarDigital Library
D. Kondo, G. Fedak, F. Cappello, A. Chien, and H. Casanova. Characterizing resource availability in enterprise desktop grids. volume 23, pages 888--903. Elsevier, 2007. Google ScholarDigital Library
J. R. Koza. Genetic Programming: On the Programming of Computers by Means of Natural Selection. MIT Press, Cambridge, MA, USA, 1992. Google ScholarDigital Library
D. Lombraña and F. Fernández. Analyzing fault tolerance on parallel genetic programming by means of dynamic-size populations. In Congress on Evolutionary Computation, volume 1, pages 4392--4398, Singapore, September 2007.Google Scholar
D. Lombraña, F. Fernández, L. Trujillo, G. Olague, and B. Segal. Customizable execution environments with virtual desktop grid computing. Parallel and Distributed Computing and Systems, PDCS, 2007. Google ScholarDigital Library
S. Luke and L. Panait. A comparison of bloat control methods for genetic programming. Evolutionary Computation, 14(3):309--344, Fall 2006. Google ScholarDigital Library
J. Pruyne and M. Livny. Managing checkpoints for parallel programs. In Workshop on Job Scheduling Strategies for Parallel Processing (IPPS'96), Honolulu, HI, April 1996. Google ScholarDigital Library
G. R. and S. A. Software-Based Replication for Fault Tolerance. IEEE Computer, 30(4):68--74, 1997. Google ScholarDigital Library
Sullivan, Werthimer, Bowyer, Cobb, Gedye, and Anderson. A New Major SETI Project based on project SERENDIP data and 100,000 Personal Computers. In Astronomical and Biochemical Origins and the Search for Life in the Universe, 1997.Google Scholar
A. T. Tai and K. S. Tso. A performability-oriented software rejuvenation framework for distributed applications. In DSN'05: Proceedings of the 2005 International Conference on Dependable Systems and Networks (DSN'05), pages 570--579, Washington, DC, USA, 2005. IEEE Computer Society. Google ScholarDigital Library
M. Tomassini. Spatially Structured Evolutionary Algorithms. Springer, 2005. Google ScholarDigital Library
Top 500 Supercomputer Sites. http://www.top500.org/, 2009.Google Scholar
L. Trujillo and G. Olague. Automated Design of Image Operators that Detect Interest Points. volume 16, pages 483--507. MIT Press, 2008. Google ScholarDigital Library

Index Terms

Characterizing fault tolerance in genetic programming
1. Computing methodologies
  1. Artificial intelligence
    1. Search methodologies
      1. Heuristic function construction

Recommendations

Characterizing fault tolerance in genetic programming

Evolutionary algorithms, including genetic programming (GP), are frequently employed to solve difficult real-life problems, which can require up to days or months of computation. An approach for reducing the time-to-solution is to use parallel computing ...
Read More
Characterizing fault-tolerance of genetic algorithms in desktop grid systems
EvoCOP'10: Proceedings of the 10th European conference on Evolutionary Computation in Combinatorial Optimization

This paper presents a study of the fault-tolerant nature of Genetic Algorithms (GAs) on a real-world Desktop Grid System, without implementing any kind of fault-tolerance mechanism. The aim is to extend to parallel GAs previous works tackling fault-...
Read More
Low-Overhead Fault-Tolerance Technique for a Dynamically Reconfigurable Softcore Processor

In this paper, we propose a new approach to implement a reliable softcore processor on SRAM-based FPGAs, which can mitigate radiation-induced temporary faults (single-event upsets (SEUs)) at moderate cost. A new Enhanced Lockstep scheme built using a ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
BADS '09: Proceedings of the 2009 workshop on Bio-inspired algorithms for distributed systems
June 2009
114 pages
ISBN:9781605585840
DOI:10.1145/1555284
Program Chairs:
Gianluigi Folino
ICAR-CNR, National Research Council, Italy
,
Natalio Krasnogor
University of Nottingham, United Kingdom
,
Carlo Mastroianni
ICAR-CNR, National Research Council, Italy
,
Franco Zambonelli
Università di Modena e Reggio Emilia, Italy
Copyright © 2009 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 19 June 2009
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
desktop grids.
fault-tolerance
parallel genetic programming
Qualifiers
- research-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 6
  Total Citations
  View Citations
- 175
  Total Downloads
- Downloads (Last 12 months)5
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Characterizing fault tolerance in genetic programming

BADS '09: Proceedings of the 2009 workshop on Bio-inspired algorithms for distributed systems

ABSTRACT

References

Cited By

Index Terms

Recommendations

Characterizing fault tolerance in genetic programming

Characterizing fault-tolerance of genetic algorithms in desktop grid systems

Low-Overhead Fault-Tolerance Technique for a Dynamically Reconfigurable Softcore Processor