Skip to main content icon/video/no-internet

The chi-square test for independence is a significance test of the relationship between categorical variables. This test is sometimes known as the “Pearson's chi-square” in honor of its developer, Karl Pearson. As an example of this test, consider an experiment by David M. Lane, S. Camille Peres, Aniko Sándor, and H. Al Napier that evaluated the effectiveness of a new method for initiating computer commands. One group of participants was tested using a mouse; a second group was tested using a track pad. Although for both groups, the new method led to faster performance than did the standard method, the question addressed here is whether there was any relationship between preference for the new method and the type of pointing device (mouse or track pad).

Table 1 is a contingency table showing whether preference is contingent on the device used. As can be seen in Table 1, 4 of the 12 participants in the mouse group (33%), compared with 9 of the 10 participants (90%) in the track pad group, preferred the new method. Therefore, in this sample, there is an association between the pointing device used and the method preferred. A key question is whether this association in the sample justifies the conclusion that there is an association in the population.

The chi-square test for independence, as applied to this experiment, tests the null hypothesis that the preferred method (standard or new) is independent of the pointing device used (mouse or track pad). Another way of stating this null hypothesis is that there is no association between the categorical variables of preferred method and pointing device. If the null hypothesis is rejected, then one can conclude that there is an association in the population.

Calculations

The first step in the calculation is to find the expected frequencies in each cell under the assumption that there is no association between the variables. Since 9 of the 22 participants (0.409) preferred the standard method, then if there were no association between pointing device and preferred method, one would expect 0.409 of the participants in the both the mouse condition and track pad condition to prefer the standard method. Of the 12 participants in the mouse condition, one would therefore expect (0.409)(12) = 4.91 participants to prefer the standard method. Similarly, in the track pad condition, one would expect (0.409)(10) = 4.09 participants to prefer the standard method. Note that this expected frequency is a mathematical concept; the number of participants in a sample with a specific preference would be a whole number.

Table 1 Data From the Example Experiment
Preferred Method
DeviceStandardNewTotal
Mouse8412
Track pad1910
Total91322

An easy way to compute the expected frequency for a cell is to multiply the row total for the cell by the column total and then divide by the grand total. For the cell representing preference for the standard method when using the mouse, the expected frequency is (12)(9)/22 = 4.91. Table 2 shows the expected frequencies in parentheses.

The next step is to subtract, for each cell, the observed frequency from the expected frequency, square the difference, and then divide by the expected frequency. For the first cell, this is equal to (4.91–8.00)2/4.91 = 1.94. The chi-square statistic is then computed by summing the values for all the cells. The formula can be written

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading