Multiple Correspondence Analysis

Dale Southerton

doi:10.4135/9781412994248


Edited by:
Dale Southerton
In: Encyclopedia of Consumer Culture
Chapter DOI: https://doi.org/10.4135/9781412994248.n374
Subject: Sociology of Consumption, Consumer Culture, Consumer Psychology
Keywords: multiple correspondence analysis


Correspondence analysis is a method for interpreting tabular data visually in the form of spatial maps in which the rows and columns of the table are depicted as points. The basic form of the method visualizes cross-tabulations of the kind typically found in the social sciences, for example, education groups cross-tabulated with the political party voted for, or a table of counts of the consumers who associate each of a set of brands with each of a set of attributes. Multiple correspondence analysis generalizes this method to many variables, typically questions in a survey, showing how the response categories interrelate.

As a first example, we use data from Tawnya Covert's article “Consumption and Citizenship during World War II: Product Advertisements in Women's Magazines,” a study in the Journal of Consumer Culture on consumption during the period of the Second World War after Pearl Harbor, as observed through a sample of advertisements aimed at American women. From their wording, the advertisements could be categorized into three types of advertising appeal: unrestricted consumption ads, rationing ads, deferred payment ads, and an additional fourth category gathering other appeals. Table 1 reproduces two tables from this article, stacked one on top of the other: cross-tabulations of product type by appeal and of year by appeal. Since there are no missing data, the column totals of the two tables are identical. The author interprets these data by calculating percentages in each row; for example, of the 540 advertisements for food, 439 (81.3%) correspond to unrestricted consumption, 11 (2.0%) to deferred spending, 82 (15.2%) to rationed supplies, and 8 (1.5%) to others. This type of table is well suited to correspondence analysis, a method for visualizing count data.

Table 1 Cross-Tabulations of Product Type by Appeal and Year by Appeal; World War II Data
	Unrestricted Consumption	Deferred Spending	Rationed Supplies	Other Appeals	Sum
Cosmetics	219	0	0	1	220
Personal hygiene	254	1	14	4	273
Household	173	9	15	4	201
Baby	65	0	2	1	68
Food	439	11	82	8	540
Small appliances	1	11	2	1	15
Large appliances	1	55	9	7	72
Clothes	99	4	17	1	121
Cigarettes	45	0	0	0	45
Linens	13	9	18	2	42
Mattresses	14	5	3	0	22
Silverware	0	32	2	0	34
Home decor	45	24	4	3	76
Miscellaneous	42	45	5	26	118
Sum	1410	206	173	58	1847
1942	126	7	4	3	140
1943	549	59	65	18	691
1944	549	112	87	25	773
1945	186	28	17	12	243
Sum	1410	206	173	58	1847
Source: Covert 2003, 327, 330.
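
The row percentages quoted for food can be reproduced directly from the counts in Table 1. The following is a minimal sketch in Python; the function name `row_percentages` and the variable names are illustrative assumptions and do not come from the original article.

```python
# Column labels and the Food row of Table 1.
appeals = [
    "Unrestricted consumption",
    "Deferred spending",
    "Rationed supplies",
    "Other appeals",
]
food = [439, 11, 82, 8]


def row_percentages(counts):
    """Express each count as a percentage of the row total (one decimal)."""
    total = sum(counts)
    return [round(100 * c / total, 1) for c in counts]


food_pct = row_percentages(food)
for appeal, pct in zip(appeals, food_pct):
    print(f"{appeal}: {pct}%")
```

Applying the same function to any other row of the table gives that row's profile, which is the set of relative frequencies that correspondence analysis goes on to display.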

Figure 1 Correspondence Analysis Map of the Cross-Tabulations in Table 1

Note: The map is determined by the first table, and the rows (years) of the second table are added as supplementary points.

Source: Based on data from Covert 2003, 327, 330.

Figure 1 is the correspondence analysis (CA) of the first table (product type by appeal), with the second table (years by appeal) also visualized as so-called supplementary, or passive, points. The basic properties of the method are explained through the interpretation of this map, followed by a description of the extension to multivariate categorical data, called multiple correspondence analysis (MCA).

Simple Correspondence Analysis

The simple form of correspondence analysis (CA) applies primarily to cross-tabulations such as those in Table 1. The method visualizes the information in the table by depicting the rows and columns as points in a spatial map (see Greenacre 2007). In the same way that this table is interpreted numerically by calculating proportions, or equivalently percentages, relative to the row (product) totals, CA visualizes these sets of relative frequencies as points in a space to facilitate comparison of the products. Silverware and large and small appliances are grouped together on the right side of Figure 1 because their proportions across the appeal categories are similar. Personal hygiene, baby, cosmetics, et cetera, on the left side, lie far from those on the right because their proportions are quite different. In fact, the horizontal axis in this map coincides with the largest differences in the data set. The value 0.5302 on this axis quantifies how much of the total interproduct difference is “explained” by this axis, being 80.8% of that total. The vertical axis is less important, as shown by its value of 0.0694 (10.6% of the total), but together the two axes explain 91.4% of the interproduct differences. This percentage is analogous to the explained variance concept in multiple regression: the two axes can be considered two new variables, with values for the products equal to their coordinates, and these two variables predict the proportions in the data with an accuracy of 91.4%, with only 8.6% of the “variance” unexplained.
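
The values attached to the two axes can be checked against the product-by-appeal counts in Table 1. The sketch below assumes the usual formulation of CA, in which the total inertia is the chi-square statistic divided by the sample size and the axis values (principal inertias) are the eigenvalues of the cross-product matrix of standardized residuals. These are normally obtained from a singular value decomposition; to stay within the Python standard library, this sketch substitutes power iteration with deflation, and all names in it (`largest_eigenpair`, `inertias`, and so on) are illustrative assumptions.

```python
import math

# Product-by-appeal counts from Table 1.
# Columns: unrestricted, deferred, rationed, other.
table = [
    [219, 0, 0, 1],    # Cosmetics
    [254, 1, 14, 4],   # Personal hygiene
    [173, 9, 15, 4],   # Household
    [65, 0, 2, 1],     # Baby
    [439, 11, 82, 8],  # Food
    [1, 11, 2, 1],     # Small appliances
    [1, 55, 9, 7],     # Large appliances
    [99, 4, 17, 1],    # Clothes
    [45, 0, 0, 0],     # Cigarettes
    [13, 9, 18, 2],    # Linens
    [14, 5, 3, 0],     # Mattresses
    [0, 32, 2, 0],     # Silverware
    [45, 24, 4, 3],    # Home decor
    [42, 45, 5, 26],   # Miscellaneous
]

n = sum(map(sum, table))
row_mass = [sum(row) / n for row in table]
col_mass = [sum(col) / n for col in zip(*table)]

# Standardized residuals: (p_ij - r_i * c_j) / sqrt(r_i * c_j).
S = [
    [(x / n - r * c) / math.sqrt(r * c) for x, c in zip(row, col_mass)]
    for row, r in zip(table, row_mass)
]

# Total inertia: sum of squared residuals (chi-square statistic / n).
total_inertia = sum(s * s for row in S for s in row)

# Cross-product matrix S^T S (4 x 4); its eigenvalues are the principal inertias.
k = len(col_mass)
A = [[sum(row[a] * row[b] for row in S) for b in range(k)] for a in range(k)]


def largest_eigenpair(M, iterations=2000):
    """Power iteration for a symmetric positive semidefinite matrix."""
    v = [1.0 + 0.1 * i for i in range(len(M))]  # arbitrary start vector
    for _ in range(iterations):
        w = [sum(m * x for m, x in zip(row, v)) for row in M]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    # Rayleigh quotient of the converged unit vector.
    value = sum(v[a] * M[a][b] * v[b]
                for a in range(len(M)) for b in range(len(M)))
    return value, v


inertias = []
M = [row[:] for row in A]  # working copy that is deflated step by step
for _ in range(2):
    value, v = largest_eigenpair(M)
    inertias.append(value)
    # Deflation: subtract the component just found.
    M = [[M[a][b] - value * v[a] * v[b] for b in range(k)] for a in range(k)]

for axis, value in enumerate(inertias, start=1):
    share = 100 * value / total_inertia
    print(f"axis {axis}: inertia {value:.4f} ({share:.1f}% of total)")
```

The printed inertias and percentages can be compared with the 0.5302 (80.8%) and 0.0694 (10.6%) reported for the horizontal and vertical axes of Figure 1.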

...
