Abstract
Variants of a widely discussed problem related (but not restricted) to causal inference are called Simpson’s paradox. In one version, the paradox is that while a new drug may be better than the old drug for both male and female patients, when the data are combined, for all individuals, the old drug appears better. In these cases, the odds ratio is used to determine which treatment is better. First, the paradox is illustrated, and a brief overview of some of the published arguments is given, which aim at explaining what is wrong. Most of these theories say that the paradox occurs as a result of properties of the data or of the data collection procedure. This chapter takes a different position. It is argued that the odds ratio may not be appropriate to measure effect size, because it fails to take into account how popular the compared treatments were, which is a relevant information collected in observational studies. A competing, consistent measure of effect (and a concept of effect) is developed, which never commits the paradox. Finally, the last section does not suggest neither the odds ratio nor the measure developed in the previous section to be used universally; rather, it is argued that for a good choice of the better treatment, additional aspects, not only the numbers of positive and negative responses, need to be taken into account.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
In the biostatistics literature, a somewhat different phenomenon is called Simpson’s paradox, and the properties and the main results are different.
- 2.
This was only approximately true, in the sense that this was the case in the largest departments and where the difference in admission rates was not negligible. Other examples, to be discussed, showed exact Simpson’s paradoxes.
- 3.
Of course, the paradox will not go away, in general, just by changing the order of conditioning with all data sets. This is a particular feature of these data.
- 4.
Consistent, therefore, means that Simpson’s paradox never occurs.
- 5.
This was seen as a desirable property, when the odds ratio was used to measure the strength of association.
References
Bickel, P.J., Hammel, E.A., O’Connell, J.W.: Sex bias in graduate admissions: Data from Berkeley, Science 187, 398–404 (1975)
Curley, S.P., Browne, G.J.: Normative and Descriptive Analyses of Simpson’s Paradox in Decision Making. Organizational Behavior and Human Decision Process, 84, 308–333 (2001)
Neutel, C. I.: The Potential for Simpson’s Paradox in Drug Utilization Studies. Annals of Epidemiology, 7, 517–521 (1997)
Reintjes, R, de Boer, A., van Pelt, W., Mintjes-de Groot, J.: Simpson’s Paradox: An Example from Hospital Epidemiology. Epidemiology, 11, 81–83 (2000)
Rudas, T.: Informative allocation and consistent treatment selection. Statistical Methodology, 7, 323–337 (2010)
Rudas, T.: Effects and interactions. Methodology, 11, 142–149 (2015)
Rudas, T.: Directionally collapsible parameterizations of multivariate binary distributions. Statistical Methodology, 27, 132–145 (2015)
Wainer, H., Brown, L.M.: Two statistical paradoxes in the interpretation of group differences illustrated with medical school admission and licensing data, The American Statistician 58, 117–123 (2004)
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Science+Business Media, LLC, part of Springer Nature
About this chapter
Cite this chapter
Rudas, T. (2018). Simpson’s Paradox. In: Lectures on Categorical Data Analysis. Springer Texts in Statistics. Springer, New York, NY. https://doi.org/10.1007/978-1-4939-7693-5_9
Download citation
DOI: https://doi.org/10.1007/978-1-4939-7693-5_9
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4939-7691-1
Online ISBN: 978-1-4939-7693-5
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)