Abstract
This paper seeks to meet the need for a general treatment of the problem of error in classification. Within an m-attribute classificatory system, an object's typical subclass is that subclass to which it is most often allocated under repeated experimentally independent applications of the classificatory criteria. In these terms, an error of classification is an atypical subclass allocation. This leads to definition of probabilities O of occasional subclass membership, probabilities T of typical subclass membership, and probabilities E of error or, more generally, occasional subclass membership conditional upon typical subclass membership. In the relationship f: (O, T, E) the relative incidence of independent O, T, and E values is such that generally one can specify O values given T and E, but one cannot generally specify T and E values given O. Under the restrictions of homogeneity of E values for all members of a given typical subclass, mutual stochastic independence of errors of classification, and suitable conditions of replication, one can find particular systems O = f(T, E) which are solvable for T and E given O. A minimum of three replications of occasional classification is necessary for a solution of systems for marginal attributes, and a minimum of two replications is needed with any cross-classification. Although for such systems one can always specify T and E values given O values, the solution is unique for dichotomous systems only.
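The forward direction, specifying O values given T and E, can be illustrated with a minimal sketch for a single attribute. It is not taken from the paper. It assumes that T is a vector of typical-subclass probabilities, that E is a matrix whose row i holds the occasional-allocation probabilities for objects of typical subclass i, and that errors are homogeneous within a typical subclass and independent across replications. The function name `occasional_probs` and all numerical values are hypothetical.

```python
from itertools import product


def occasional_probs(T, E, replications):
    """Joint probabilities O of occasional allocations over independent replications.

    T[i]    : probability that an object's typical subclass is i (assumed notation)
    E[i][j] : probability of occasional allocation to j given typical subclass i
    Returns a dict mapping each allocation pattern (a tuple) to its probability.
    """
    k = len(T)
    O = {}
    for pattern in product(range(k), repeat=replications):
        total = 0.0
        for i in range(k):
            term = T[i]
            # Independence of errors: multiply conditional probabilities.
            for j in pattern:
                term *= E[i][j]
            total += term
        O[pattern] = total
    return O


# Two hypothetical dichotomous systems with different T and E values.
T_a, E_a = [0.5, 0.5], [[0.8, 0.2], [0.2, 0.8]]
T_b, E_b = [0.25, 0.75], [[0.8, 0.2], [0.4, 0.6]]

# One replication: both systems yield the same O values.
one_a = occasional_probs(T_a, E_a, 1)
one_b = occasional_probs(T_b, E_b, 1)

# Three replications: the O values of the two systems differ.
three_a = occasional_probs(T_a, E_a, 3)
three_b = occasional_probs(T_b, E_b, 3)
```

In this example the two systems give identical O values under a single replication, so T and E cannot be recovered from O alone, while their O values under three replications differ. The sketch only computes O from T and E; it does not attempt the inverse solution discussed in the abstract.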
Additional information
With grateful acknowledgement to the Rockefeller Foundation; and to the United States Department of Health, Education, and Welfare, Public Health Service, for N. I. M. H. Grant M-3950.
Cite this article
Sutcliffe, J.P. A probability model for errors of classification. I. General considerations. Psychometrika 30, 73–96 (1965). https://doi.org/10.1007/BF02289748
DOI: https://doi.org/10.1007/BF02289748