Bonferroni Test

Neil J.Salkind

doi:10.4135/9781412952644

Entry
Reader's guide
Entries A-Z
Subject index

Return to Entries

Bonferroni Test

Edited by:
Neil J. Salkind
In:Encyclopedia of Measurement and Statistics
Chapter DOI:https://doi.org/10.4135/9781412952644.n60
Subject:Quantitative/Statistical Research, Test & Measurement

Request Permissions

Show page numbers Hide page numbers

The more tests we perform on a set of data, the more likely we are to reject the null hypothesis when it is true (a Type I error). This is a consequence of the logic of hypothesis testing: We reject the null hypothesis if we witness a rare event. But the larger the number of tests, the easier it is to find rare events, and therefore, the easier it is to make the mistake of thinking that there is an effect when there is none. This problem is called the inflation of the alpha level. One strategy for preventing it is to correct the alpha level when performing multiple tests. Making the alpha level more stringent (i.e., smaller) will create fewer errors, but it may also make real effects harder to detect.

The Different Meanings of Alpha

Maybe researchers perform more and more statistical tests on one set of data because computers make statistical analyses easy to run. For example, brain imaging researchers will routinely run millions of tests to analyze an experiment. Running so many tests increases the risk of false alarms. To illustrate, imagine the following “pseudoexperiment”:

I toss 20 coins, and I try to force the coins to fall heads up. I know that, from the binomial test, the null hypothesis is rejected at the α= .05 level if the number of heads is greater than 14. I repeat this experiment 10 times.

Suppose that one trial gives the “significant” result of 16 heads versus 4 tails. Did I influence the coins on that occasion? Of course not, because the larger the number of experiments, the greater the probability of encountering a low-probability event (like 16 versus 4). In fact, waiting long enough is a sure way of detecting rare events!

Probability in the Family

A family of tests is the technical term for a series of tests performed on a set of data. In this section, we show how to compute the probability of rejecting the null hypothesis at least once in a family of tests when the null hypothesis is true.

For convenience, suppose that we set the significance level at α = .05. For each test (i.e., one trial in the example of the coins), the probability of making a Type I error is equal to α= .05. The events “making a Type I error” and “not making a Type I error” are complementary events (they cannot occur simultaneously). Therefore the probability of not making a Type I error on one trial is equal to

Recall that when two events are independent, the probability of observing these two events together is the product of their probabilities. Thus, if the tests are independent, the probability of not making a Type I error on the first and the second tests is

With three tests, we find that the probability of not making a Type I error on all tests is

For a family of C tests, the probability of not making a Type I error for the whole family is

For our example, the probability of not making a Type I error on the family

...

Sign in to access this content

Get a 30 day FREE TRIAL

Watch videos from a variety of sources bringing classroom topics to life
Read modern, diverse business cases
Explore hundreds of books and reference titles

No internet connection.

All search filters on the page have been cleared.

Your search has been saved.

Entry

Reader's guide

Entries A-Z

Subject index

Bonferroni Test

The Different Meanings of Alpha

Probability in the Family

Sign in to access this content

Get a 30 day FREE TRIAL

Read next

More like this

Sage Recommends