Abstract
For the exploratory analysis of a matrix of proximities or (dis)similarities between objects, one often uses cluster analysis (CA) or multidimensional scaling (MDS). Solutions resulting from such analyses are sometimes interpreted using external information on the objects. Usually the procedures of CA, MDS and using external information are carried out independently and sequentially, although combinations of two of the three procedures (CA and MDS, or multidimensional scaling and using external information) have been proposed in the literature. The present paper offers a procedure that combines all three procedures in one analysis, using a model that describes a partition of objects with cluster centroids represented in a low-dimensional space, which in turn is related to the information in the external variables. A simulation study is carried out to demonstrate that the method works satisfactorily for data with a known underlying structure. Also, to illustrate the method, it is applied to two empirical data sets.
Similar content being viewed by others
References
Abelson R.P., Sermat V. (1962) Multidimensional scaling of facial expressions. Journal of Experimental Psychology 63:546–554
Bock H.H. (1987) On the interface between cluster analysis, principal component analysis, and multidimensional scaling. In: Bozdogan H., Gupta A.K. (eds) Multivariate Statistical Modeling and Data Analysis. Reidel, New York, pp 17–34
Borg I., Groenen P. (1997) Modern Multidimensional Scaling: Theory and Applications. Springer, Berlin Heidelberg New York
Carroll J.D., Chang J.-J. (1970) Analysis of individual differences in multidimensional scaling via an N-way generalization of “Eckart-Young" decomposition. Psychometrika 35:283–319
Cox T.F., Cox M.A.A. (1994) Multidimensional Scaling. Chapman & Hall, London
De Leeuw J. (1977) Applications of convex analysis to multidimensional scaling. In: Barra J.R., Brodeau F., Romier G., van Cutsem B. (eds) Recent Developments in Statistics. North Holland, Amsterdam, pp 133–145
De Leeuw J., Heiser W. (1977) Convergence of correction matrix algorithms for multidimensional scaling, In: Lingoes J.C. (ed) Geometric Representations of Relational Data. Mathesis Press, Ann Arbor Michigan, pp 735–752
De Leeuw J., Heiser W. (1982) Theory of multidimensional scaling. In: Krishnaiah P.R., Kanai L.N. (eds) Handbook of Statistics vol. 2. North Holland, Amsterdam
Diederich G., Messick S.J., Tucker L.R. (1957) A general least squares solution for successive intervals. Psychometrika 22:159–173
Ekman G. (1954) Dimensions of color vision. Journal of Psychology 38:467–474
Engen T., Levy N., Schlosberg H. (1958) The dimensional analysis of a new series of facial expressions. Journal of Experimental Psychology 55:454–458
Escoufier Y. (1973) Le traitement des variables vectorielles [The treatment of vectorial variables]. Biometrics 29:751–760
Gordon A.D. (1990) Constructing dissimilarity measures. Journal of Classification 7:257–269
Gower J.C. (1966) Some distance properties of latent root and vector methods used in multivariate analysis. Biometrika 53:325–338
Green P.E., Carmone F., Smith S.M. (1989) Multidimensional Scaling: Concept and Applications. Allyn & Bacon, Boston
Groenen P.J.F. (1993) The Majorization Approach to Multidimensional Scaling: Some Problems and Extensions. DSWO Press, Leiden
Gutman L., Levy S. (1991) Two structural laws for intelligence tests. Intelligence 15:79–103
Hair J.F., Anderson R.E., Tatham R.L., Black W.C. (1998) Multivariate Data Analysis. Prentice Hall Inc, New Jersey
Heiser W.J. (1993) Clustering in low-dimensional space. In: Opitz O., Lausen B., Klar R. (eds) Information and Classification: Concepts, Methods and Applications. Springer, Berlin Heidelberg NewYork, pp 162–173
Heiser W.J., Groenen P.J.F. (1997) Cluster differences scaling with a within-clusters loss component and a fuzzy successive approximation strategy to avoid local minima. Psychometrika 62:63–83
Heiser W.J., Meulman J.J. (1983) Constrained multidimensional scaling including confirmation. Applied Psychological Measurement 7:381–404
Hubert L., Arabie P. (1985) Comparing partitions. Journal of Classification 2:193–218
Keller J.B. (1962) Factorization of matrices by least squares. Biometrika 49:239–242
Kruskal J.B. (1964) Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis. Psychometrika 29:1–27
MacQueen J.B. (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the 5th Symposumon Mathemaical Statistics and Probability Vol 1. University of California, Berkeley
Mathar R. (1985) The best Euclidean fit to a given distance matrix in prescribed dimensions. Linear Algebra and Its Applications 67:1–6
Milligan W.M., Cooper M.C. (1985) An examination of procedures for determining the number of clusters in a data set. Psychometrika 50:159–179
Penrose R. (1956) On best approximate solutions of linear matrix equations. Proceedings of the Cambridge Philosophical Society 52:17–19
Shepard R.N. (1962) The analysis of proximities: multidimensional scaling with an unknown distance function. Psychometrika 27:125–139, 219–246
Torgerson W.S. (1958) Theory and Methods of Scaling. Wiley, New York
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Kiers, H.A.L., Vicari, D. & Vichi, M. Simultaneous classification and multidimensional scaling with external information. Psychometrika 70, 433–460 (2005). https://doi.org/10.1007/s11336-002-0998-4
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11336-002-0998-4