A High-Speed Two Dimensional Hierarchical Clustering of Microarray Gene Expression Data

Priscilla, R.; Swamynathan, S.

doi:10.1007/978-3-642-27443-5_62


Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 132))


Abstract

DNA microarray technology has become the most widely used functional genomics approach in bioinformatics after genome sequencing. Uncovering the patterns hidden in gene expression data offers an opportunity for a better understanding of functional genomics. However, the large number of genes and the complexity of biological networks make it considerably harder to comprehend and interpret the resulting mass of data, which often consists of millions of measurements. The first step in addressing this challenge is the use of clustering techniques. Many clustering methods have been devised and applied to microarray data, but less effort has gone into speeding up those methods algorithmically. This research presents a quad-tree-based, high-speed, two-dimensional hierarchical clustering method. In hierarchical clustering, constructing the closest-pair data structure at each level is the main factor that determines the processing time. The proposed method uses a quad-tree-based data structure to find the closest pair of elements, which reduces processing time and yields a better analysis of gene expression data.
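The abstract does not give the algorithm's details, so the following Python sketch only illustrates the general idea of using a quad tree to find the closest pair at each level of an agglomerative clustering. Everything in it is an assumption made for illustration, not the authors' method: the names `QuadTree`, `closest_pair`, and `cluster` are hypothetical, each element is assumed to be a point in the plane, clusters are merged into their weighted centroid, and the tree is simply rebuilt at every level.

```python
import math


class QuadTree:
    """Region quadtree over a square; a leaf holds up to `cap` points (x, y, id)."""

    def __init__(self, x0, y0, size, cap=4):
        self.x0, self.y0, self.size, self.cap = x0, y0, size, cap
        self.points = []       # points stored while this node is a leaf
        self.children = None   # four sub-squares once the node has split

    def _child(self, p):
        h = self.size / 2
        ix = 1 if p[0] >= self.x0 + h else 0
        iy = 1 if p[1] >= self.y0 + h else 0
        return self.children[iy * 2 + ix]

    def insert(self, p):
        if self.children is not None:
            self._child(p).insert(p)
            return
        self.points.append(p)
        # Split an over-full leaf; the size guard stops endless splitting
        # when many points share the same coordinates.
        if len(self.points) > self.cap and self.size > 1e-9:
            h = self.size / 2
            self.children = [QuadTree(self.x0 + dx * h, self.y0 + dy * h, h, self.cap)
                             for dy in (0, 1) for dx in (0, 1)]
            pts, self.points = self.points, []
            for q in pts:
                self._child(q).insert(q)

    def _box_dist(self, q):
        """Lower bound on the distance from q to any point inside this square."""
        dx = max(self.x0 - q[0], 0.0, q[0] - (self.x0 + self.size))
        dy = max(self.y0 - q[1], 0.0, q[1] - (self.y0 + self.size))
        return math.hypot(dx, dy)

    def nearest(self, q, best=(math.inf, None)):
        """Return (distance, point) for the stored point nearest to q, skipping q's own id."""
        if self._box_dist(q) >= best[0]:
            return best  # nothing in this square can beat the current best
        if self.children is None:
            for p in self.points:
                if p[2] != q[2]:
                    d = math.hypot(p[0] - q[0], p[1] - q[1])
                    if d < best[0]:
                        best = (d, p)
            return best
        for c in sorted(self.children, key=lambda c: c._box_dist(q)):
            best = c.nearest(q, best)
        return best


def closest_pair(points):
    """Build a quadtree over `points` and return (distance, p, q) for the closest pair."""
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    size = max(max(xs) - min(xs), max(ys) - min(ys)) or 1.0
    tree = QuadTree(min(xs), min(ys), size)
    for p in points:
        tree.insert(p)
    best = (math.inf, None, None)
    for p in points:
        d, q = tree.nearest(p)
        if d < best[0]:
            best = (d, p, q)
    return best


def cluster(coords):
    """Agglomerate 2D points by centroid; returns merges as (id_a, id_b, distance)."""
    active = {i: (x, y, 1) for i, (x, y) in enumerate(coords)}  # id -> (cx, cy, weight)
    merges, next_id = [], len(coords)
    while len(active) > 1:
        pts = [(x, y, i) for i, (x, y, _) in active.items()]
        d, a, b = closest_pair(pts)  # the tree is rebuilt at every level
        xa, ya, wa = active.pop(a[2])
        xb, yb, wb = active.pop(b[2])
        w = wa + wb
        active[next_id] = ((xa * wa + xb * wb) / w, (ya * wa + yb * wb) / w, w)
        merges.append((a[2], b[2], d))
        next_id += 1
    return merges


if __name__ == "__main__":
    for id_a, id_b, dist in cluster([(0, 0), (1, 0), (10, 0), (10, 1.5)]):
        print(id_a, id_b, round(dist, 4))
```

New clusters receive ids that continue after the input indices, so the merge list can be read as a dendrogram. Rebuilding the tree at each level keeps the sketch short; an implementation aimed at speed would more plausibly update the tree in place.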



Author information

Authors and Affiliations

Department of Computer Science and Engineering, Anna University, Chennai, India
R. Priscilla
Department of Information Science and Technology, Anna University, Chennai, India
S. Swamynathan


Editor information

Editors and Affiliations

Dept of Computer Science and Engineering ANITS, Andhra University, Sangivalasa, 530003, Vishakapatnam, India
Suresh Chandra Satapathy
College of Engineering Dept. of CS&SE ANITS, Andhra University, Sangivalasa, 530003, Vishakapatnam, India
P. S. Avadhani
Machine Intelligence Research Labs, Auburn, WA, USA
Ajith Abraham


About this paper

Cite this paper

Priscilla, R., Swamynathan, S. (2012). A High-Speed Two Dimensional Hierarchical Clustering of Microarray Gene Expression Data. In: Satapathy, S.C., Avadhani, P.S., Abraham, A. (eds) Proceedings of the International Conference on Information Systems Design and Intelligent Applications 2012 (INDIA 2012) held in Visakhapatnam, India, January 2012. Advances in Intelligent and Soft Computing, vol 132. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27443-5_62


DOI: https://doi.org/10.1007/978-3-642-27443-5_62
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-27442-8
Online ISBN: 978-3-642-27443-5
eBook Packages: Engineering, Engineering (R0)
