skip to main content
10.1145/1276958.1277376acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
Article

An improved restricted growth function genetic algorithm for the consensus clustering of retinal nerve fibre data

Published:07 July 2007Publication History

ABSTRACT

This paper describes an extension to the Restricted Growth Function grouping Genetic Algorithm applied to the Consensus Clustering of a retinal nerve fibre layer data-set. Consensus Clustering is an optimisation based method which combines the results of a number of data clustering methods, and is used when it is unknown which clustering method is expected to perform the best. Consensus Clustering has been shown to produce results which are better than the averaged results of the input methods, but could benefit from a more efficient optimisation method. A Restricted Growth Function grouping Genetic Algorithm is a new method of grouping a number of objects into mutually exclusive subsets based upon a fitness function. This method does not suffer from degeneracy, and thus could be applied to the Consensus Clustering problem more efficiently than Simulated Annealing, the current optimisation method. Within this paper it is shown that this type of Genetic Algorithm can indeed improve the performance of Consensus Clustering, and in fact can be improved further by taking advantage of some application specific properties. These findings are demonstrated on a retinal nerve fibre layer data-set and on a synthetic data-set.

References

  1. Altman, D.G., Practical Statistics for Medical Research. Chapman and Hall, London, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Cuff, J.A., Clamp, M.E., Siddiqui, S.A., Finlay, M., and Barton, G.J., JPred: A consensus secondary structure prediction server. Bioinformatics, 14 (1998), 892--893.Google ScholarGoogle ScholarCross RefCross Ref
  3. Er, M., A fast algorithm for generating set partitions. The Computer Journal, 31, 3 (1988), 283--284. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Falkenauer, E., Genetic Algorithms and Grouping Problems. Wiley, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Fraley, C., Raftery, A.E., Model--based clustering, discriminant analysis, and density estimation. Journal of the American Statistical Association 97 (2002), 611--631Google ScholarGoogle ScholarCross RefCross Ref
  6. Garey, M. and Johnson, D., Computers and Intractability. W. H. Freeman and Company, New York, NY, 1979. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Garway-Heath, D.F., Poinoosawmy, D., Fitzke, F., Hitchings, R.A., Mapping the Visual Field to the Optic Disc. Opthalmology 107 (2000), 1809--1815.Google ScholarGoogle ScholarCross RefCross Ref
  8. Goldberg, D. and Lingle, R., Alleles, loci, and the travelling salesman problem. In Proceedings of the First International Conference on Genetic Algorithms and their Applications (1985), 154--159. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Hackworth, T., Genetic algorithms; Some effects of redundancy in chromosomes, In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO--1999) (Orlando, Florida, USA, 1999), 99--106.Google ScholarGoogle Scholar
  10. Hall, L.O., Ozyurt, I.B. and Bezdek, J.C., "Clustering with a genetically optimized approach", IEEE Transactions on Evolutionary Computation 3, 2 (1999), 103--112. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Healey P.R. and Mitchell P., Visibility of lamina cribrosa pores and open-angle glaucoma, American Journal of Ophthalmology 138, 5 (2004), 871--872.Google ScholarGoogle ScholarCross RefCross Ref
  12. Jain, A., Murty, M., and Flynn, P., Data clustering: A review. ACM Computing Surveys 31, 3 (1999), 264--323. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Kaufman, L., Rousseeuw P.J., Clustering by means of medoids, In Statistical Analysis Based Upon the L1 Norm. Edited by: Dodge Y., Amsterdam, Holland, 1987, 405--416.Google ScholarGoogle Scholar
  14. Kellam, P., Liu, X., Martin, N., Orengo, C., Swift, S., and Tucker, A., Comparing, Contrasting and Combining Clusters in Viral Gene Expression Data. In Proceedings of the Intelligent Data Analysis in Medicine and Pharmacology Workshop (IDAMAP--2001) (London, UK, 2001), 56--62.Google ScholarGoogle Scholar
  15. Kirkpatrick, S., Gelatt Jr, C.D., and Vecchi M.P., Optimization by simulated annealing. Science, 220 (1983), 671--680.Google ScholarGoogle ScholarCross RefCross Ref
  16. Kohonen, T., Self Organization and Associative Memory. 3rd edition, Springer-Verlag, New York, 1989. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Lukashin, A.V., and Fuchs, R., Analysis of temporal gene expression profiles: clustering by simulated annealing and determining the optimal number of clusters. Bioinformatics, 17 (2001), 405--414.Google ScholarGoogle ScholarCross RefCross Ref
  18. Ma P.C.H., Chan K.C.C., Xao, X. and Chiu K.Y., An Evolutionary Clustering Algorithm for Gene Expression Microarray Data Analysis, IEEE Transactions on Evolutionary Computation 10,3 (2006), 296--314. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. McQueen, J., Some methods for classification and analysis of multivariate observations. In Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability (Berkeley, 1967), 281--297, 1967.Google ScholarGoogle Scholar
  20. Monti, S., Tamayo, P., Mesirov, J., and Golub, T., Consensus clustering: a resampling-based method for class discovery and visualization of gene expression, microarray data. Machine Learning, 52 (2003), 91--118. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Park, Y. and Song, M., A genetic algorithm for clustering problems. In Proceedings of the 3rd Annual Conference on Genetic Programming, 1998, Morgan Kaufmann, 568--575.Google ScholarGoogle Scholar
  22. Proskurowski, A., Ruskey, F., and Smith, M., Analysis of algorithms for listing equivalence classes of k ary strings. SIAM Journal on Discrete Mathematics, 11, 1 (1998), 94 109. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Radcliffe, N. and Surry, P., Fitness variance of formae and performance prediction. In Whitley, D. and Vose, M., editors, Foundations of Genetic Algorithms 3, (San Mateo, 1995), Morgan Kaufmann, 51--72.Google ScholarGoogle Scholar
  24. Radcliffe, N., Equivalence class analysis of genetic algorithms. Complex Systems 5 (1991), 183--205.Google ScholarGoogle Scholar
  25. Reeves, C. and Yamada, T., Genetic algorithms, path relinking, and the flowshop sequencing problem. Evolutionary Computation 6,1 (1998), 45--60. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Strehl, A., and Ghosh, J., Cluster Ensembles -- A Knowledge Reuse Framework for Combining Multiple Partitions. Journal of Machine Learning Research, 3 (2002), 583--617. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Swift, S., Tucker, A., Vinciotti, V., Martin, N., Orengo, C., Liu, X., and P. Kellam, Consensus Clustering and Functional Interpretation of Gene Expression Data. Genome Biology 5, 11 (2004), R94.1--R94.16.Google ScholarGoogle ScholarCross RefCross Ref
  28. Tucker, A., Crampton, J., and Swift, S., RGFGA: An Efficient Representation and Crossover for Grouping Genetic Algorithms. Evolutionary Computation, 13, 4 (2005), 477--499. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Tucker, A., Swift, S., and Liu, X., Grouping multivariate time series via correlation. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics 31 (2001), 235 245. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Ward, J.H., Hierarchical grouping to optimize an objective function. Journal of the American Statistical Association 58 (1963), 236--244.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. An improved restricted growth function genetic algorithm for the consensus clustering of retinal nerve fibre data

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      GECCO '07: Proceedings of the 9th annual conference on Genetic and evolutionary computation
      July 2007
      2313 pages
      ISBN:9781595936974
      DOI:10.1145/1276958

      Copyright © 2007 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 7 July 2007

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

      Acceptance Rates

      GECCO '07 Paper Acceptance Rate266of577submissions,46%Overall Acceptance Rate1,669of4,410submissions,38%

      Upcoming Conference

      GECCO '24
      Genetic and Evolutionary Computation Conference
      July 14 - 18, 2024
      Melbourne , VIC , Australia

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader