Abstract
This paper proposes a novel hierarchical clustering algorithm based on high order dissimilarities. These dissimilarity increments are measures computed over triplets of nearest neighbor points. Recently, the distribution of these dissimilarity increments was derived analytically. We propose to incorporate this distribution in a hierarchical clustering algorithm to decide whether two clusters should be merged or not. The proposed algorithm is parameter-free and can identify classes as the union of clusters following the dissimilarity increments distribution. Experimental results show that the proposed algorithm has excellent performance over well separated clusters, also providing a good hierarchical structure insight into touching clusters.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Aidos, H., Fred, A.: On the distribution of dissimilarity increments. In: IbPRIA 2011 (to appear, 2011)
Figueiredo, M.A.T., Jain, A.K.: Unsupervised learning of finite mixture models. IEEE Transactions on Pattern Analysis and Machine Intelligence 24, 381–396 (2002)
Fred, A., Jain, A.: Cluster validation using a probabilistic attributed graph. In: Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), Tampa, Florida, USA (2008)
Fred, A., Leitão, J.: A new cluster isolation criterion based on dissimilarity increments. IEEE Transactions on Pattern Analysis and Machine Intelligence 25, 944–958 (2003)
Grünwald, P.D.: A Tutorial Introduction to the Minimum Description Length Principle. In: Advances in Minimum Description Length: Theory and Applications. MIT Press, Cambridge (2005)
Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd edn. Springer, Heidelberg (2009)
Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Computing Surveys 31, 264–323 (1999)
Johnson, N.L., Kotz, S., Balakrishnan, N.: Continuous Univariate Distributions, Applied Probability and Statistics, 2nd edn., vol. 1. John Wiley & Sons Ltd., Chichester (1994)
Theodoridis, S., Koutroumbas, K.: Pattern Recognition, 2nd edn. Elsevier Academic Press (2003)
Ueda, N., Nakano, R., Ghahramani, Z., Hinton, G.E.: Smem algorithm for mixture models. Neural Computation 12(9), 2109–2128 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Aidos, H., Fred, A. (2011). Hierarchical Clustering with High Order Dissimilarities. In: Perner, P. (eds) Machine Learning and Data Mining in Pattern Recognition. MLDM 2011. Lecture Notes in Computer Science(), vol 6871. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23199-5_21
Download citation
DOI: https://doi.org/10.1007/978-3-642-23199-5_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23198-8
Online ISBN: 978-3-642-23199-5
eBook Packages: Computer ScienceComputer Science (R0)