Abstract
Recently Fourier Transform Infrared (FTIR) spectroscopic imaging has been used as a tool to detect the changes in cellular composition that may reflect the onset of a disease. This approach has been investigated as a mean of monitoring the change of the biochemical composition of cells and providing a diagnostic tool for various human cancers and other diseases. The discrimination between different types of tissue based upon spectroscopic data is often achieved using various multivariate clustering techniques. However, the number of clusters is a common unknown feature for the clustering methods, such as hierarchical cluster analysis, k-means and fuzzy c-means. In this study, we apply a FCM based clustering algorithm to obtain the best number of clusters as given by the minimum validity index value. This often results in an excessive number of clusters being created due to the complexity of this biochemical system. A novel method to automatically merge clusters was developed to try to address this problem. Three lymph node tissue sections were examined to evaluate our new method. These results showed that this approach can merge the clusters which have similar biochemistry. Consequently, the overall algorithm automatically identifies clusters that accurately match the main tissue types that are independently determined by the clinician.
Similar content being viewed by others
References
Johnson KS, Chicken DW, Pickard DCO, Lee AC, Briggs G, Falzon M, Bigio IJ, Keshtgar MR, Bown SG (2004) Elastic scattering spectroscopy for intraoperative determination of sentinel lymph node status in the breast. J Biomed Opt 9:1122–1128
Godavarty A, Thompson AB, Roy R, Eppstein MJ, Zhang C, Gurfinkel M, Sevick-Muraca EM (2004) Diagnostic imaging of breast cancer using fluorescence-enhanced optical tomography: phantom studies. J Biomed Opt 9:486–496
Fernandez DC, Bhargava R, Hewitt SM, Levin IW (2005) Infrared spectroscopic imaging for histopathologic recognition. Nat Biotechnol 23:469–474
Lasch P, Haensch W, Lewis EN, Kidder LH, Naumann D (2002) Characterization of colorectal adenocarcinoma sections by spatially resolved FT-IR microspectroscopy. Appl Spectr 56:1–9
Chiriboga L, Xie P, Yee H, Zarou D, Zakim D, Diem M (1998) Infrared spectroscopy of human tissue. IV: detection of dysplastic and neoplastic changes of human cervical tissue via infrared microscopy. Cell Mol Biol 44:219–230
Choo LP, Wetzel DL, Halliday WC, Jackson M, LeVine SM, Mantsch HH (1996) In situ characterization of beta-amyloid in Alzheimer’s diseased tissue by synchrotron Fourier transform infrared microspectroscopy. Biophys J 71:1672–1679
Wood BR, Chiriboga L, Yee H, Quinn MA, McNaughton D, Diem M (2004) Fourier transform infrared (FTIR) spectral mapping of the cervical transformation zone, and dysplastic squamous epithelium. Gynecol Oncol 93:59–68
Romeo MJ, Diem M (2005) Infrared spectral imaging of lymph nodes: strategies for analysis and artifact reduction. Vib Spectr 38:115–119
Lasch P, Haensch W, Naumann D, Diem M (2004) Imaging of colorectal adenocarcinoma using FT-IR microspectroscopy and cluster analysis. Biochim Biophys Acta Mol Basis Dis 1688:176–186
Wang XY, Whitwell G, Garibaldi JM (2004) The application of a simulated annealing fuzzy clustering algorithm for cancer diagnosis. In: Proceedings of the IEEE 4th international conference on intelligent systems design and applications, Budapest, Hungary, pp 467–472
Wang XY, Garibaldi JM (2005) Simulated annealing fuzzy clustering in cancer diagnosis. Eur J Inf 29:61–70
Wang XY, Garibaldi JM, Bird B, George MW (2005) Fuzzy clustering in biochemical analysis of cancer cells. In: Proceedings of the fourth conference of the European society for fuzzy logic and technologies (EUSSFLAT 2005), Barcelona, Spain, pp 1118–1123
Sun H, Wang S, Jiang Q (2004) FCM-based model selection algorithms for determining the number of clusters. Pattern Recognit 37:2027–2037
Jolliffe IT (1986) Principal component analysis. Springer, New York
Ward JH (1963) Hierarchical grouping to optimize an objective function. J Am Stat Assoc 58(301):236–244
Xie Y, Raghavan VV, Zhao X (2002) 3M algorithm: finding an optimal fuzzy cluster scheme for proximity data. In: Proceedings of the FUZZY—IEEE conference—2002 IEEE world congress on computational intelligence, Honolulu, HI, vol 1, pp 627–632
Richter T, Steiner G, Abu-Id M, Salzer R, Bergmann R, Rodig H, Johannsen B (2002) Identification of tumour tissue by FTIR spectroscopy in combination with positron emission tomography. Vib Spectr 28:103–110
Stone N, Kendall C, Shepherd N, Crow P, Barr H (2002) Near-infrared Raman spectroscopy for the classification of epithelial pre-cancers and cancers. J Raman Spectr 33:564–573
Lasch P, Wasche W, McCarthy WJ, Muller G, Naumann D (1998) Imaging of human colon carcinoma thin sections by FTIR microspectrometry. SPIE Proc 3257(3):187–197
Kim SW, Ban SH, Chung H, Cho S, Chung HJ, Choi PS, Yoo OJ, Liu JR (2004) Taxonomic discrimination of flowering plants by multivariate analysis of Fourier transform infrared spectroscopy data. Plant Cell Rep 23(4):246–250
Liu H, Motoda H (1998) Feature selection for knowledge discovery and data mining. Kluwer Academic, Boston
Bezdek J (1981) Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum, New York
Bezdek J (1998) Pattern Recognition. In: Ruspini, EH, Bonissone PP Pedrycz W (eds) Handbook of fuzzy computation. IOP, Boston
Dunn JC (1973) A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters. J Cybern 3(3):32–57
Rezaee MR, Lelieveldt BPF, ReiBer JHC (1998) A new cluster validity index for the fuzzy c-means. Pattern Recognit Lett 19:237–246
Ray S, Turi R (1999) Determination of number of clusters in k-means clustering and application in colour image segmentation. In: Proceedings of the 4th international conference on advances in pattern recognition and digital techniques, pp 137–143
Xu L (1996) How many clusters? A Ying–Yang machine based theory for a classical open problem in pattern recognition. In: Proceedings of 1996 IEEE international conference on neural networks, pp 1546–1551
Berkhin P (2002) Survey of clustering data mining techniques. Technique Report, Accrue Software, San Jose, CA
Xu R, Wunsch II D (2005) Survey of clustering algorithm. IEEE Trans Neural Netw 16(3):645–678
Ball G, Hall D (1965) ISODATA, a novel method of data analysis and classification. Technical Report AD-699616, Stanford University, Stanford, CA
Turi RH (2001) Clustering-based colour image segmentation. PhD thesis, School of Computer Science and Software Engineering, Monash University, Australia
Chaudhuri D, Chaudhuri BB, Murthy CA (1992) A new split-and-merge clustering technique. Pattern Recognit Lett 13:399–409
Tou JT (1979) DYNOC—A dynamic optimal cluster-seeking technique. Int J Parallel Prog 8(6):541–547
Tseng L, Yang S (2001) A genetic approach to the automatic clustering problem. Pattern Recognit 34:415–424
Garai G, Chaudhuri BB (2004) A novel genetic algorithm for automatic clustering. Pattern Recognit Lett 25:173–187
Zhang DQ, Chen SC (2004) A novel kernelized fuzzy c-means algorithm with application in medical image segmentation. Artif Intell Med 32:37–50
Chen SC, Zhang DQ (2004) Robust image segmentation using FCM with spatial constraints based on new kernel-induced distance measure. IEEE Trans Syst Man Cybern Part B Cybern 34(4):1907–1916
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Wang, XY., Garibaldi, J.M., Bird, B. et al. A novel fuzzy clustering algorithm for the analysis of axillary lymph node tissue sections. Appl Intell 27, 237–248 (2007). https://doi.org/10.1007/s10489-007-0065-z
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-007-0065-z