Abstract
This article provides the performance evaluation of the Silhouette index, which is based on the so called silhouette width. However, the index can be calculated in two ways, and so, the first approach uses the mean of the mean silhouettes through all the clusters. On the other hand, the second one is realized by averaging the silhouettes over the whole data set. These various approaches of the index have significant influence on indicating the proper number of clusters in a data set. To study the performance of the index, as the underlying clustering algorithms, two popular hierarchical methods were applied, that is, the complete-linkage and the single-linkage algorithm. These methods have been used for artificial and real-life data sets, and the results confirm very good performances of the index and they also allow to choose the best approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bache K., Lichman M.: UCI Machine Learning Repository. Irvine, CA: University of California, School of Information and Computer Science (2013), http://archive.ics.uci.edu/ml
Bezdek, J.C.: Numerical taxonomy with fuzzy sets. J.Match. Biol. 1, 57–71 (1974)
Bilski, J., Smoląg, J.: Parallel Approach to Learning of the Recurrent Jordan Neural Network. In: Rutkowski, L., Korytkowski, M., Scherer, R., Tadeusiewicz, R., Zadeh, L.A., Zurada, J.M. (eds.) ICAISC 2013, Part I. LNCS (LNAI), vol. 7894, pp. 32–40. Springer, Heidelberg (2013)
Bilski, J., Smoląg, J., Galushkin, A.I.: The Parallel Approach to the Conjugate Gradient Learning Algorithm for the Feedforward Neural Networks. In: Rutkowski, L., Korytkowski, M., Scherer, R., Tadeusiewicz, R., Zadeh, L.A., Zurada, J.M. (eds.) ICAISC 2014, Part I. LNCS, vol. 8467, pp. 12–21. Springer, Heidelberg (2014)
Bradley, P., Fayyad, U.: Refining initial points for k-means clustering. In: Proceedings of the Fifteenth International Conference on Knowledge Discovery and Data Mining, pp. 9–15. AAAI Press, New York (1998)
Chaibakhsh, A., Chaibakhsh, N., Abbasi, M., Norouzi, A.: Orthonormal basis function fuzzy systems for biological wastewater treatment processes modeling. Journal of Artificial Intelligence and Soft Computing Research 2(4), 343–356 (2012)
Cpałka, K., Rebrova, O., Nowicki, R., et al.: On design of flexible neuro-fuzzy systems for nonlinear modelling. International Journal of General Systems 42(6 special issue: sI), 706–720 (2013)
Davies, D.L., Bouldin, D.W.: A cluster separation measure. Trans. Pattern Analysis and Machine Intelligence 1(2), 224–227 (1979)
Dunn, J.C.: Well separated clusters and optimal fuzzy Partitions. Journal of Cybernetica 4, 95–104 (1974)
Faber, V.: Clustering and the continuous k-means algorithm. Los Alamos Science 22, 138–144 (1994)
Fukuyama, Y., Sugeno, M.: A new method of choosing the number of clusters for the fuzzy c-means method. In: Proceedings of the 5th Fuzzy Systems Symposium, Japan, pp. 247–250 (1989)
Gałkowski, T.: Kernel Estimation of Regression Functions in the Boundary Regions. In: Rutkowski, L., Korytkowski, M., Scherer, R., Tadeusiewicz, R., Zadeh, L.A., Zurada, J.M. (eds.) ICAISC 2013, Part II. LNCS (LNAI), vol. 7895, pp. 158–166. Springer, Heidelberg (2013)
Georgiou, D.A., Botsios, S., Mitropoulou, V., Papaioannou, M., Schizas, C., Tsoulouhas, G.: Learning style recognition based on an adjustable three-layer fuzzy cognitive map. Journal of Artificial Intelligence and Soft Computing Research 1(4), 333–347 (2011)
Jain, A., Dubes, R.: Algorithms for clustering data. Prentice-Hall, Englewood Cliffs (1988)
Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surveys 31(3), 264–323 (1999)
Korytkowski, M., Rutkowski, L., Scherer, R.: From ensemble of fuzzy classifiers to single fuzzy rule base classifier. In: Rutkowski, L., Tadeusiewicz, R., Zadeh, L.A., Zurada, J.M. (eds.) ICAISC 2008. LNCS (LNAI), vol. 5097, pp. 265–272. Springer, Heidelberg (2008)
Kroll, A.: On choosing the fuzziness parameter for identifying ts models with multidimensional membership function. Journal of Artificial Intelligence and Soft Computing Research 1(4), 283–300 (2011)
Murtagh, F.: A survey of recent advantces in hierarchical clustering algorithms. The Computer Journal 26(4), 354–359 (1983)
Naim, S., Hagras, H.: A big-bang big-crunch optimized general type-2 fuzzy logic approach for multi-criteria group decision making. Journal of Artificial Intelligence and Soft Computing Research 3(2), 117–132 (2013)
Nowicki, R., Pokropińska, A.: Information criterions applied to neuro-fuzzy architectures design. In: Rutkowski, L., Siekmann, J.H., Tadeusiewicz, R., Zadeh, L.A. (eds.) ICAISC 2004. LNCS (LNAI), vol. 3070, pp. 332–337. Springer, Heidelberg (2004)
Nowicki, R., Scherer, R., Rutkowski, L.: A method for learning of hierarchical fuzzy systems. Frontiers in Artificial Intelligence and Applications 76, 124–129 (2002)
Pal, N.R., Bezdek, J.C.: On cluster validity for the fuzzy c-means model. IEEE Trans. Fuzzy Systems 3(3), 370–379 (1995)
Rohlf, F.: Single link clustering algorithms. In: Krishnaiah, P., Kanal, L. (eds.) Handbook of Statistics, pp. 267–284. North-Holland, Amsterdam (1982)
Rutkowski, L.: Generalized regression neural networks in time-varying environment. IEEE Transactions on Neural Networks 15(3), 576–596 (2004)
Rutkowski, L.: Adaptive probabilistic neural networks for pattern classification in time-varying environment. IEEE Transactions on Neural Networks 15(4), 811–827 (2004)
Rutkowski, L., Przybył, A., Cpałka, K., Er, M.J.: Online speed profile generation for industrial machine tool based on neuro-fuzzy approach. In: Rutkowski, L., Scherer, R., Tadeusiewicz, R., Zadeh, L.A., Zurada, J.M. (eds.) ICAISC 2010, Part II. LNCS (LNAI), vol. 6114, pp. 645–650. Springer, Heidelberg (2010)
Rutkowski, L., Jaworski, M., Pietruczuk, L., Duda, P.: Decision trees for mining data streams based on the gaussian approximation. IEEE Transactions on Knowledge and Data Engineering 26(1), 108–119 (2014)
Rousseeuw, P.J.: Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20, 53–65 (1987)
Theodoridis, D.C., Boutalis, Y.S., Christodoulou, M.A.: Robustifying analysis of the direct adaptive control of unknown multivariable nonlinear systems based on a new neuro-fuzzy method. Journal of Artificial Intelligence and Soft Computing Research 1(1), 59–79 (2011)
Xie, X.I., Beni, G.: A validity measure for fuzzy clustering. IEEE Trans. Pattern Anal. Mach. Intell. 13, 841–847 (1991)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Starczewski, A., Krzyżak, A. (2015). Performance Evaluation of the Silhouette Index. In: Rutkowski, L., Korytkowski, M., Scherer, R., Tadeusiewicz, R., Zadeh, L., Zurada, J. (eds) Artificial Intelligence and Soft Computing. ICAISC 2015. Lecture Notes in Computer Science(), vol 9120. Springer, Cham. https://doi.org/10.1007/978-3-319-19369-4_5
Download citation
DOI: https://doi.org/10.1007/978-3-319-19369-4_5
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-19368-7
Online ISBN: 978-3-319-19369-4
eBook Packages: Computer ScienceComputer Science (R0)