Skip to main content
Log in

A particular Gaussian mixture model for clustering and its application to image retrieval

  • Focus
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

We introduce a new method for data clustering based on a particular Gaussian mixture model (GMM). Each cluster of data, modeled as a GMM into an input space, is interpreted as a hyperplane in a high dimensional mapping space where the underlying coefficients are found by solving a quadratic programming (QP) problem. The main contributions of this work are (1) an original probabilistic framework for GMM estimation based on QP which only requires finding the mixture parameters, (2) this QP is interpreted as the minimization of the pairwise correlations between cluster hyperplanes in a high dimensional space and (3) it is solved easily using a new decomposition algorithm involving trivial linear programming sub-problems. The validity of the method is demonstrated for clustering 2D toy examples as well as image databases.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  • Bach FR, Jordan MI (2003) Learning spectral clustering. Neural information processing systems

  • Ben-Hur A, Horn D, Siegelmann HT, Vapnik V (2000) Support vector clustering. Neural information processing systems, pp 367–373

  • Bezdek JC (1981). Pattern recognition with fuzzy objective function algorithms. Plenum Press, New York

    MATH  Google Scholar 

  • Bishop CM (1995). Neural networks for pattern recognition. Clarendon Press, Oxford

    Google Scholar 

  • Carson C, Belongie S, Greenspan H and Malik J (2002). Blobworld: image segmentation using expectation-maximization and its application to image querying. IEEE Trans Pattern Anal Mach Intell 24(8): 1026–1038

    Article  Google Scholar 

  • Cristianini N and Shawe-Taylor J (2000). An introduction to support vector machines. Cambridge University Press, Cambridge

    Google Scholar 

  • Dave RN (1991). Characterization and detection of noise in clustering. In Pattern Recognit 12(11): 657–664

    Article  Google Scholar 

  • Dempster A, Laird N and Rubin D (1977). Maximum likelihood from incomplete data via the em algorithm. J R Stat Soc B 39(1): 1–38

    MATH  MathSciNet  Google Scholar 

  • Fraley C (1998). Algorithms for model-based gaussian hierarchical clustering. SIAM J Sci Comput 20(1): 270–281

    Article  MATH  MathSciNet  Google Scholar 

  • Frigui H, Krishnapuram R (1997) Clustering by competitive agglomeration. Pattern recognition, vol 30, no. 7

  • Frigui H and Krishnapuram R (1999). A robust competitive clustering algorithm with applications in computer vision. IEEE Trans Pattern Anal Mach Intell 21(5): 450–465

    Article  Google Scholar 

  • Halkidi M, Batistakis Y, Vazirgiannis M (2002) Cluster validity methods: Part i and ii. SIGMOD Record

  • Ichihashai H, Honda K, Tani N (2000) Gaussian mixture pdf approximation and fuzzy c-means clustering with entropy regularization. In: Proceedings of the 4th Asian fuzzy system symposium, pp 217–221

  • Jain AK and Dubes RC (1988). Algorithms for clustering data. Prentice Hall, Englewood Cliffs

    MATH  Google Scholar 

  • Le-Saux B, Boujemaa N (2002) Unsupervised robust clustering for image database categorization. IEEE-IAPR International Conference on Pattern Recognition, pp 259–262

  • MacQueen J (1965) Some methods for classification and analysis of multivariate observations. In: Proceedings of the 5th Berkeley symposium on mathematical statistics and probability, vol 1

  • Orengo CA, Jones DT, Thornton JM (2003) Bioinformatics—genes, protein and computers. bios. ISBN: 1-85996-054-5

  • Osuna E, Freund R, Girosi F (1997) Training support vector machines: an application to face detection. In: Proceedings of the international conference on computer vision and pattern recognition, pp 130–136

  • Platt J (1999). Fast training of support vector machines using sequential minimal optimization. In: Schlkopf, B, Burges, C and Smola, AJ (eds) Advances in kernel methods—support vector learning, pp 185–208. MIT Press, Cambridge

    Google Scholar 

  • Posse C (2001). Hierarchical model-based clustering for large datasets. J Comput Graph Stat 10(3): 464–486

    Article  MathSciNet  Google Scholar 

  • Sahbi H, Boujemaa N (2005) Validity of fuzzy clustering using entropy regularization. In: proceedings of the IEEE conference on fuzzy systems

  • Sneath PH and Sokal RR (1973). Numerical taxonomy—the principles and practice of numerical classification. W. H. Freeman, San Francisco

    MATH  Google Scholar 

  • Tanenbaum J, de Silva V and Langford J (2000). A global geometric framework for non-linear dimensionality reduction. Science 290(5500): 2319–2323

    Article  Google Scholar 

  • Tsao EK, Bezdek JC and Pal NR (1994). Fuzzy kohonen clustering networks. PR 27(5): 757–764

    Google Scholar 

  • Vanderbei RJ (1999). LOQO: an interior point code for quadratic programming. Optim Methods Softw 11: 451–484

    Article  MathSciNet  Google Scholar 

  • Wu M, Schoelkopf B (2006) A local learning approach for clustering. Advances neural information processing systems

  • Yang MS (1993). A survey of fuzzy clustering. MathCompMod 18: 1–16

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hichem Sahbi.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Sahbi, H. A particular Gaussian mixture model for clustering and its application to image retrieval. Soft Comput 12, 667–676 (2008). https://doi.org/10.1007/s00500-007-0247-y

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-007-0247-y

Keywords

Navigation