Skip to main content icon/video/no-internet

The chi-square test is used for categorical data. There are three situations in which one can use the chi-square test: to test for independence, for equality of proportions, and for goodness of fit. The test statistic and the expected values for the first two cases are identical, but the hypotheses and sampling situations are distinctly different. For the goodness-of-fit situation, the test statistic and expected values are similar but not identical, and the hypotheses and sampling situation are also different from the other cases. The first two cases are presented first. All data tables were created by the author for this entry.

Test for Independence

Situation. One has a single random sample, and this sample is cross-categorized by two variables each with 2 or more categories. The null hypothesis is that there is no relationship between Variable 1 and Variable 2, that is, the two variables are independent of each other. The alternative hypothesis is that there is a relationship between the two variables, that is, the two variables are dependent.

For example, one has a random sample of people who are cross-categorized by race and blood type. The null hypothesis is that there is no relationship between race and blood type, and the alternative hypothesis is that there is a relationship between race and blood type.

Test for Equality of Proportions

Situation. One has a single variable of interest with 2 or more categories and multiple independent random samples. The null hypothesis is that there is no difference in the proportion of each category across the different populations. The alternative hypothesis is that there is at least one difference in a proportion across the different populations.

For example, one has four treatment groups. These treatment groups are the multiple independent random samples. The variable of interest has two categories survived or died. The null hypothesis is that the proportions that survived are the same for each treatment group and the proportion that died are the same for each treatment group. The alternative hypothesis is that at least one proportion is different between the treatment groups. The test for equality of proportions is analogous to the one-way analysis of variance test for equality of means for quantitative data.

Expected Values and Tests Statistic: Independence and Equality of Proportions

With the tests for independence and equality of proportions, the data should be given in table form where i represents the ith row and j represents the jth column of the table. One finds the expected value (Eij) for each ijth cell. The expected value for the ijth cell is the value that one would expect if the null hypothesis is true.

None

Reasoning behind the Expected Value for Test for Independence

For the test for equality of proportions, the null hypothesis states that Variable 1 and Variable 2 are independent. By independence,

None

So the best estimate of the P[ijth cell] is the estimate of the proportion of the ith row multiplied proportion of the jth column, that is,

None

To find the expected count, one would multiply by the grand total, which gives the above formula for Eij.

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading