Skip to main content icon/video/no-internet

The term categorical response data refers to data for outcome variables whose values represent distinct categories as opposed to continuous quantities. Categorical variables are pervasive in political science. Examples are as diverse as party affiliation (“Democrat,” “Republican,” “other,” “none”), union membership (“yes,” “no”), form of government (“republic,” “monarchy,” “military dictatorship,” etc.), voter participation (“yes,” “no”), or confidence in the government (“a great deal,” “quite a lot,” “not very much,” “none at all”). Although examples for the collection of categorical data can probably be traced back as far as the ancient censuses in Egypt, it was not until the early 20th century that a systematic development of statistical methods for the analysis of categorical data began. The starting point was the work of Karl Pearson and G. Udny Yule, who debated over how best to analyze associations between categorical variables. Many fundamental contributions that prepared the ground for the emergence of a wide variety of approaches in the subsequent decades were also made by R. A. Fisher in the 1920s and 1930s. This entry provides a brief overview of the rich collection of methods for the analysis of categorical data available today. First, a description of different types of categorical data is given. Second, various ways to analyze categorical data are summarized.

Typology of Categorical Data

The most important differentiation of categorical data distinguishes between nominal and ordinal variables. The values of a nominal variable identify categories that do not have a natural rank order. Examples are ethnicity (“Black,” “White,” “Hispanic,” etc.) or marital status (“single,” “married,” “divorced,” “widowed”). Methods using nominal data should not depend on the numerical values assigned to the categories. The categories of ordinal data, in contrast, possess a natural ranking. Examples are social class (“lower,” “middle,” and “upper”), agreement with the statement “Democracies are indecisive” (“strongly agree,” “agree,” “disagree,” and “strongly disagree”), or self-reported political left–right orientation (on a scale from 1 = left to 10 = right). Methods using ordinal data take into account the order of the values, but the exact spacing does not matter. The distinction between nominal and ordinal variables is not always clear. For example, political parties can be ranked on a left–right scale, but the order may be ambiguous for some parties (because they may rank differently on different policies). Moreover, whether a variable is treated as ordinal can depend on the research question.

A further distinction is made between qualitative and quantitative data. Some authors use the former as a synonym for categorical data. However, while nominal data are clearly qualitative, ordinal data are an intermediate type, often treated as quantitative in practice. Moreover, quantitative data can sometimes be treated as categorical. Quantitative data are either continuous or discrete. Continuous variables can take on any real value in a given interval, whereas discrete variables are restricted to a fixed set of distinct values (e.g., integers). Discrete quantitative variables are often considered as categorical when the range of observed values is small. Count data are an example.

Categorical variables can be divided into dichotomous variables that only have two possible outcomes (“yes”/“no”; “male”/“female”) and polytomous variables with more than two categories (“Christian,” “Muslim,” “Hindu,” “Buddhist,” “Jewish,” etc.). A dichotomous variable that is coded 0 and 1 is called an indicator or binary variable. Dichotomous variables are often technically and conceptually easier to handle than polytomous variables.

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading