Skip to main content icon/video/no-internet

Errors of Measurement: Dichotomization of a Continuous Variable

A variable is any concept that has more than one possible value. This is contrasted with a constant whose value never changes (e.g., pi, the speed of light). A variable, at a minimum, must have two possible values (e.g., gender has male or female). A variable, at the other end, can have an unlimited set of potential values (distance between two places measured in meters). The question is how to organize or set up the measurement of some quantity using an operationalization. This entry introduces dichotomization of a continuous variable as one approach to addressing errors of measurement, paying specific attention to the justifications for and costs of deploying this method.

An operationalization takes the conceptual variable and provides a means of measurement using some device. When measuring an attitude toward chocolate, for example, the device could be as simple as (a) like and (b) dislike. A more sophisticated measurement may involve a number of choices and a person could rate the degree of preference (like or dislike) on a 1–3, 1–5, 1–7 basis, for instance. Essentially, one can use a simple dichotomous measure or use a more refined measurement tool.

What typically happens is that measurements use more than one instance, particularly in the case of a Likert statement. For example, one could rate the desire for chocolate using a Likert statement (e.g., where 1 refers to very much agree and 5 to very much disagree) as a response to the statement, “Chocolate is my favorite food.” Then, the participants are asked a second item, such as, “Chocolate is a great treat to eat.” Basically, one can add a series of statements and then create a scale to measure the attitude. The usual practice is to add up the individual scores from each statement to form a composite score to represent the attitude (usually after a variety of measurement tests examining the factor structure and/or reliability evaluations).

The use of such approaches generates a set of scores that form a continuous variable with a potential range. The expectation is that the estimates of the mean (as well as other measures of central tendency) and the measures of variability (variance, range, standard deviation) reflect the assumption of a normal distribution (bell-shaped curve).

Justification for Dichotomizing a Continuous Variable

The justification or reason for dichotomizing a continuous variable usually involves the convenience related to using particular statistical analytic tools. Many scholars prefer using analysis of variance (ANOVA) or t-tests. Most ANOVA routines expect that the independent variables will be categorical and not continuous, because the participants are placed within groups based on a particular characteristic. The use of categories means that a continuous variable cannot serve as the independent set of conditions used to predict the dependent variable.

The easiest solution to have the variable fit the form required becomes a dichotomization of the variable based on a median split. The reason for the median split is the implication that each level (high and low) becomes equal in size because the median is the score dividing the sample into the top and bottom 50% of values. The use of the median creates essentially two groups of equal size, making the results of the ANOVA omnibus tests orthogonal to each other (considered desirable).

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading