Graphical representations and odds ratios in a distance-association model for the analysis of cross-classified data

de Rooij, Mark; Heiser, Willem J.

doi:10.1007/s11336-000-0848-1

Graphical representations and odds ratios in a distance-association model for the analysis of cross-classified data

Published: 03 April 2005

Volume 70, pages 99–122, (2005)
Cite this article

Psychometrika Aims and scope Submit manuscript

Mark de Rooij^1,3 &
Willem J. Heiser²

295 Accesses
33 Citations
Explore all metrics

Abstract

Although RC(M)-association models have become a generally useful tool for the analysis of cross-classified data, the graphical representation resulting from such an analysis can at times be misleading. The relationships present between row category points and column category points cannot be interpreted by inter point distances but only through projection. In order to avoid incorrect interpretation by a distance rule, joint plots should be made that either represent the row categories or the column categories as vectors. In contrast, the present study proposes models in which the distances between row and column points can be interpreted directly, with a large (small) distance corresponding to a small (large) value for the association. The models provide expressions for the odds ratios in terms of distances, which is a feature that makes the proposed models attractive reparametrizations to the usual RC(M)-parametrization. Comparisons to existing data analysis techniques plus an overview of related models and their connections are also provided.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Comprehensive Survey of Clustering Algorithms

Article 01 June 2015

Density-Based Clustering Based on Hierarchical Density Estimates

Learning from imbalanced data: open challenges and future directions

Article Open access 22 April 2016

References

Andersen E.B. (1980) Discrete statistical models with social science applications. Amsterdam: North-Holland
Google Scholar
Becker M.P. (1990) Maximum likelihood estimation of the RC(M) association model. Applied statistics 39:152–167
Google Scholar
Becker M.P., Clogg C.C. (1989) Analysis of sets of two-way contingency tables using association models. Journal of the American Statistical Association 84:142–151
MathSciNet Google Scholar
Borg I., Groenen P. (1997) Modern multidimensional scaling; theory and applications. New York: Springer
Google Scholar
Bradley R.A., Terry M.E. (1952) Rank analysis of incomplete designs I. the method of paired comparisons. Biometrika 39:324–345
Google Scholar
Breiman L., Friedman J.H., Olshen R.A., Stone C.J. (1984) Classification and regression trees. Belmont, CA: Wadsworth
Google Scholar
Carroll J.D., Green P.E., Schaffer C.M. (1986) Interpoint distance comparison in correspondence analysis.Journal of Marketing Research 23:271–280
Google Scholar
Carroll J.D., Green P.E., Schaffer C.M. (1987) Comparing interpoint distance in correspondence analysis: A clarification.Journal of Marketing Research 24:445–450
Google Scholar
Carroll J.D., Green P.E., Schaffer C.M. (1989) Reply to Greenacre’s commentary on the Carroll-Green-Schaffer scaling of two-way correspondence analysis solutions. Journal of Marketing Research 26:366–368
Google Scholar
Caussinus H. (1965) Contribution á l’analyse statistique des tableaux de corrélation [contributions to the statistical analysis of correlation matrices]. Annals of the faculty of Science, University of Toulouse 29:715–720
Google Scholar
Clogg C.C., Eliason S.R., Wahl R.J. (1990) Labor-market experiences and labor force outcomes. American Journal of Sociology, 95:1536–1576
Article Google Scholar
Coombs C.H. (1964) A theory of data.New York: Wiley
Google Scholar
De Leeuw, J. Heiser W.J. (1977) Convergence of correction–matrix algorithms for multidimensional scaling. In: J.C. Lingoes, E.E. Roskam & I.Borg (Eds) Geometric representations of relational data (pp 735-752) Ann Arbor, MI: Mathesis Press
Google Scholar
De Rooij M. (2001). Distance models for the analysis of transition frequencies. Unpublished doctoral dissertation, Leiden University
De Rooij, M., Heiser W.J. (2002) A distance representation of the quasi-symmetry model and related distance models. H.Yanai, A.Okada, K.Shigemasu, Y.Kano, J.J. Meulman (eds), New developments on psychometrics: proceedings of the international meeting of the psychometric society pp 487–494).Tokyo: Springer-Verlag
Google Scholar
Defays D. (1978). A short note on a method of seriation. British Journal of Mathematical and Statistical Psychology 3:49–53
Google Scholar
Fienberg S.E., Larntz K. (1976) Loglinear representation of paired and multiple comparison models.Biometrika 63:245–254
Google Scholar
Gifi A. (1990) Nonlinear multivariate analysis.New York: Wiley
Google Scholar
Gilula Z., Haberman S.J. (1986) Canonical analysis of contingency tables by maximum likelihood. Journal of the American Statistical Association, 81:780–788
Google Scholar
Goodman L.A. (1971) The analysis of multidimensional contingency tables: stepwise procedures and direct estimation methods for building models for multiple classifications. Technometrics, 13:33–61
Google Scholar
Goodman L.A. (1972) Some multiplicative models for the analysis of cross-classified data. Sixth Berkely Symposium, 1:649–696
Google Scholar
Goodman L.A. (1979) Simple models for the analysis of association in cross classifications having ordered categories. Journal of the American Statistical Association, 74:537–552
Google Scholar
Goodman L.A. (1981) Association models and cannonical correlation in the analysis of cross-classifications having ordered categories. Journal of the American Statistical Association, 76:320–334
Google Scholar
Goodman L.A. (1985) The analysis of cross-classified data having ordered and or unordered categories: Association models, correlation models, and asymmetric models for contingency tables with or without missing entries.The Annals of Statistics, 13:10–69
Google Scholar
Goodman L.A. (1986) Some useful extensions to the usual correspondence analysis approach and the usual log-linear models approach in the analysis of contingency tables. International Statistical Review, 54:243–309
MathSciNet Google Scholar
Greenacre, M.J (1984) Theory and applications of correspondence analysis London: Academic Press.
Google Scholar
Greenacre M. J (1989) The Carroll-Green-Schaffer scaling in correspondence analysis: A theoretical and expirical appraisal Journal of Marketing Research, 26:358–365
Google Scholar
Groenen P. J.F., De Leeuw J., Mathar R (1996) Least squares multidimensional scaling with transformed distances W.Gaul, D.Pfeifer (eds), Studies in classification, data analysis, and knowledge organization (pp177–185) Berlin: Springer
Google Scholar
Haberman S.J (1974) The analysis of frequency data Chicago: university of Chicago Press
Google Scholar
Haberman S.J (1978) Analysis of qualitative data, (vol. 1) New York: Academic Press
Google Scholar
Haberman S.J (1979) Analysis of qualitative data (vol. 2) New York: Academic Press
Google Scholar
Haberman S.J (1995) Computation of maximum likelihood estimates in association models Journal of the American Statistical Association, 90:1438–1446
Google Scholar
Heiser, W.J. (1981). Unfolding analysis of proximity data. Unpublished doctoral dissertation, Leiden University
Heiser W.J (1987) Joint ordination of species and sites: The unfolding technique P.Legendre, L.Legendre (eds), Developments in numerical ecology pp 189–221) Berlin: Springer Verlag
Google Scholar
Heiser W.J (1988) Selecting a stimulus set with prescribed structure from empirical confusion frequencies British Journal of Mathematical and Statistical Psychology, 41:37–51
Google Scholar
Heiser W.J., Meulman, J (1983) Analyzing rectangular tables by joint and constrained multidimensional scaling Journal of Econometrics, 22:139–167
Article Google Scholar
Hubert L.J., Arabie P (1986) Unidimensional scaling and combinatorial optimization J.De Leeuw, W.J. Heiser, J.Meulman, F.Critchley (eds) Multidimensional data analysis pp 181–196) Leiden: DSWO press
Google Scholar
Ihm P. Van, Groenewoud H (1975) A multivariate ordering of vegetation data based on Gaussian type gradient response curves Journal of Ecology, 63:767–778
Google Scholar
Ihm P. Van, Groenewoud H (1984) Correspondence analysis and Gaussian ordination COMPSTAT Lectures, 3:5–60
Google Scholar
Meulman J.J., Heiser W.J (1998) Visual display of interaction in multiway contingency tables by use of homogeneity analysis: the 2x2x2x2 case J.Blasius M.J. Greenacre (eds) Visualization of categorical data pp (277–296) New York: Academic Press
Google Scholar
Meulman J.J., Heiser W.J. SPSS Inc (1999) Spss Categories 10.0 Chicago, IL: SPSS Inc
Google Scholar
Nishisato S (1980) Analysis of categorical data: Dual scaling and its applications Toronto: University of Toronto Press
Google Scholar
Nosofsky R.M (1985) Overall similarity and the identification of separable-dimension stimuli: A choice model analysis Perception & Psychophysics, 38:415-432
Google Scholar
Shepard R.N (1957) Stimulus and response generalization: A stochastic model relating generalization to distance in psychological space Psychometrika, 22:325–345
Google Scholar
Srole L., Langner T.S., Michael S.T., Opler M.K., Rennie T. A.C (1962) Mental health in the metropolis: The midtown Manhattan study New York: McGraw-Hill
Google Scholar
Takane Y (1987) Analysis of contingency tables by ideal point discriminant analysis Psychometrika, 52:493–513
MathSciNet Google Scholar
Takane Y (1998) Visualization in ideal point discriminant analysis J.Blasius, M.J. Greenacre (eds). Visualization of categorical data (pp 441–459)New York: Academic Press
Google Scholar
Takane Y., Shibayama T (1986) Comparison of models for the stimulus recognition data J.De Leeuw, W.J. Heiser, J.Meulman, F.Critchley (eds), Multidimensional data analysis (PP 119–138) Leiden: DSWO Press
Google Scholar
Takane Y., Shibayama T (1992) Structures in stimulus identification data F.G. Ashby (eds), Probabilistic multidimensional models of perception and cognition (pp 335–362) Hillsdale, NJ: Erlbaum
Google Scholar
Ter Braak C. J.F (1985) Correspondence analysis of incidence and abundance data: properties in terms of a unimodal response model Biometrics, 41:859–873
Google Scholar
Van der Heijden P. G.M (1987) Correspondence analysis of longitudonal categorical data Leiden: DSWO
Google Scholar
Van der Heijden P. G.M., Mooijaart A., Takane Y (1994) Correspondence analysis and contingency models M.J. Greenacre, J.Blasius (eds) Correspondence analysis in the social sciences (pp 79–111) New York: Academic Press
Google Scholar
Wiepkema P.R (1961) An ethological analysis of the reproductive behavior of the bitterling (rhodeus amarus bloch) Archives Neerlandais Zoologique, 14:103–199
Google Scholar
Winsberg S., Carroll J.D (1989) A quasi-nonmetric method for multidimensional scaling via an extended Euclidean model Psychometrika, 54:217–229
MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Leiden University, The Netherlands
Mark de Rooij
Leiden University, The Netherlands
Willem J. Heiser
Department of Psychology, Leiden University, 9555, 2300, RB, Leiden, The Netherlands
Mark de Rooij

Authors

Mark de Rooij
View author publications
You can also search for this author in PubMed Google Scholar
Willem J. Heiser
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mark de Rooij.

Additional information

The authors are indebted to the reviewers, Ab Mooijaart, Patrick Groenen, and Lawrence Hubert for their comments on earlier versions of this manuscript. Netherlands Organization for Scientific Research (NWO) is gratefully acknowledged for funding this project. This research was conducted while the first author was supported by a grant of the Foundation for Behavioral and Educational Sciences of this organization (575-30-006). This paper was completed while the second author was research fellow at the Netherlands Institute in the Advanced Study in the Humanities and Social Sciences (NIAS) in Wassenaar, The Netherlands.

Rights and permissions

Reprints and permissions

About this article

Cite this article

de Rooij, M., Heiser, W.J. Graphical representations and odds ratios in a distance-association model for the analysis of cross-classified data. Psychometrika 70, 99–122 (2005). https://doi.org/10.1007/s11336-000-0848-1

Download citation

Published: 03 April 2005
Issue Date: March 2005
DOI: https://doi.org/10.1007/s11336-000-0848-1

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Graphical representations and odds ratios in a distance-association model for the analysis of cross-classified data

Abstract

Access this article

Similar content being viewed by others

A Comprehensive Survey of Clustering Algorithms

Density-Based Clustering Based on Hierarchical Density Estimates

Learning from imbalanced data: open challenges and future directions

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Graphical representations and odds ratios in a distance-association model for the analysis of cross-classified data

Abstract

Access this article

Similar content being viewed by others

A Comprehensive Survey of Clustering Algorithms

Density-Based Clustering Based on Hierarchical Density Estimates

Learning from imbalanced data: open challenges and future directions

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation