Simpson&#39;s Paradox

Sarah Boslaugh

doi:10.4135/9781412953948

Entry
Reader's guide
Entries A-Z
Subject index

Return to Entries

Simpson's Paradox

Edited by:
Sarah Boslaugh
In:Encyclopedia of Epidemiology
Chapter DOI:https://doi.org/10.4135/9781412953948.n424
Subject:Epidemiology & Biostatistics, Public Health (general), Public Health Research Methods

Request Permissions

Show page numbers Hide page numbers

Simpson's paradox is an extreme form of confounding, where the association between two variables in a full group is in the opposite direction of the association found within every subcategory of a third variable. This paradox was first described by G. U. Yule in 1903 and later developed and popularized by E. H. Simpson in 1951.

By way of example, consider a new drug treatment that initially appears to be effective, with 54% of treated patients recovering, as compared with 46% of patients receiving a placebo. However, when the sample is divided by gender, it is found that 20% of treated males recover compared with 25% of placebo males, and 75% of treated females recover as compared with 80% of placebo females. So the apparent paradox is that the drug is found to be more effective than the placebo in the full group but less effective than the placebo in each of the two gender-specific subgroups that fully comprise the combined group.

The key to unraveling this puzzle involves the gender confound—differing numbers of patients of each gender receiving the treatment versus placebo, combined with differing overall recovery rates for males versus females. Table 1 shows that in this example males are 1.6 times more likely to receive the placebo than the treatment, whereas females are 1.6 times [Page 974]more likely to receive the treatment than the placebo. At the same time, females are more than three times as likely to recover as males within both the treatment group and the placebo group. In other words, females are relatively easy to cure. So the fact that the placebo is more effective than the treatment in both groups is obscured when the groups are combined, due to the disproportionate number of easy-to-cure females in the treatment group.

Table 1 Recovery Rates
	Treatment	Placebo
Male	10/50 (20%)	20/80 (25%)
Female	60/80 (75%)	40/50 (80%)
All patients	70/130 (54%)	60/130 (46%)

In this particular example, it would be commonly agreed that the correct conclusion involves the subgroupspecific results—the drug is not effective—and that the apparent effectiveness found in the combined group is merely a statistical artifact of the study design due to the gender confound.

Simpson's paradox can be problematic when not recognized, leading to naive and misleading conclusions regarding effectiveness or other relations studied. Perhaps more ominously, knowledge of Simpson's paradox can be intentionally used to present or emphasize results that support a desired conclusion, when that conclusion is not valid. More generally, Simpson's paradox has been shown to have implications for the philosophical study of causation and causal inference. In practical terms, it is prudent for both researchers and research consumers to be on guard for this potentially perilous paradox.

Norman A.Constantine

http://dx.doi.org/10.4135/9781412953948.n424

Sign in to access this content

Get a 30 day FREE TRIAL

Watch videos from a variety of sources bringing classroom topics to life
Read modern, diverse business cases
Explore hundreds of books and reference titles

Entry

Reader's guide

Entries A-Z

Subject index

Simpson's Paradox

Further Readings

Sign in to access this content

Get a 30 day FREE TRIAL

Sage Recommends

No internet connection.

All search filters on the page have been cleared.

Your search has been saved.

Entry

Reader's guide

Entries A-Z

Subject index

Simpson's Paradox

Further Readings

Sign in to access this content

Get a 30 day FREE TRIAL

Read next

More like this

Sage Recommends