Simpson’s Paradox

Rudas, Tamás

doi:10.1007/978-1-4939-7693-5_9

Tamás Rudas^5,6

Part of the book series: Springer Texts in Statistics ((STS))

3075 Accesses

Abstract

Variants of a widely discussed problem related (but not restricted) to causal inference are called Simpson’s paradox. In one version, the paradox is that while a new drug may be better than the old drug for both male and female patients, when the data are combined, for all individuals, the old drug appears better. In these cases, the odds ratio is used to determine which treatment is better. First, the paradox is illustrated, and a brief overview of some of the published arguments is given, which aim at explaining what is wrong. Most of these theories say that the paradox occurs as a result of properties of the data or of the data collection procedure. This chapter takes a different position. It is argued that the odds ratio may not be appropriate to measure effect size, because it fails to take into account how popular the compared treatments were, which is a relevant information collected in observational studies. A competing, consistent measure of effect (and a concept of effect) is developed, which never commits the paradox. Finally, the last section does not suggest neither the odds ratio nor the measure developed in the previous section to be used universally; rather, it is argued that for a good choice of the better treatment, additional aspects, not only the numbers of positive and negative responses, need to be taken into account.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 54.99; Price excludes VAT (USA)

Softcover Book: USD 69.99; Price excludes VAT (USA)

Hardcover Book: USD 99.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
In the biostatistics literature, a somewhat different phenomenon is called Simpson’s paradox, and the properties and the main results are different.
2.
This was only approximately true, in the sense that this was the case in the largest departments and where the difference in admission rates was not negligible. Other examples, to be discussed, showed exact Simpson’s paradoxes.
3.
Of course, the paradox will not go away, in general, just by changing the order of conditioning with all data sets. This is a particular feature of these data.
4.
Consistent, therefore, means that Simpson’s paradox never occurs.
5.
This was seen as a desirable property, when the odds ratio was used to measure the strength of association.

References

Bickel, P.J., Hammel, E.A., O’Connell, J.W.: Sex bias in graduate admissions: Data from Berkeley, Science 187, 398–404 (1975)
Google Scholar
Curley, S.P., Browne, G.J.: Normative and Descriptive Analyses of Simpson’s Paradox in Decision Making. Organizational Behavior and Human Decision Process, 84, 308–333 (2001)
Article Google Scholar
Neutel, C. I.: The Potential for Simpson’s Paradox in Drug Utilization Studies. Annals of Epidemiology, 7, 517–521 (1997)
Article Google Scholar
Reintjes, R, de Boer, A., van Pelt, W., Mintjes-de Groot, J.: Simpson’s Paradox: An Example from Hospital Epidemiology. Epidemiology, 11, 81–83 (2000)
Article Google Scholar
Rudas, T.: Informative allocation and consistent treatment selection. Statistical Methodology, 7, 323–337 (2010)
Article MathSciNet MATH Google Scholar
Rudas, T.: Effects and interactions. Methodology, 11, 142–149 (2015)
Article Google Scholar
Rudas, T.: Directionally collapsible parameterizations of multivariate binary distributions. Statistical Methodology, 27, 132–145 (2015)
Article MathSciNet Google Scholar
Wainer, H., Brown, L.M.: Two statistical paradoxes in the interpretation of group differences illustrated with medical school admission and licensing data, The American Statistician 58, 117–123 (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

Center for Social Sciences, Hungarian Academy of Sciences, Budapest, Hungary
Tamás Rudas
Eötvös Loránd University, Budapest, Hungary
Tamás Rudas

Authors

Tamás Rudas
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Rudas, T. (2018). Simpson’s Paradox. In: Lectures on Categorical Data Analysis. Springer Texts in Statistics. Springer, New York, NY. https://doi.org/10.1007/978-1-4939-7693-5_9

Download citation

DOI: https://doi.org/10.1007/978-1-4939-7693-5_9
Published: 31 March 2018
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4939-7691-1
Online ISBN: 978-1-4939-7693-5
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics