Entry
Reader's guide
Entries A-Z
Subject index
Dichotomous Variables
A dichotomous variable is one that takes on one of only two possible values when observed or measured. The value is most often a representation for a measured variable (e.g., age: under 65/65 and over) or an attribute (e.g., gender: male/female).
If the dichotomous variable represents another variable, that underlying variable may be either observed or unobserved. For example, a dichotomous variable may be used to indicate whether a piece of legislation passed. The dichotomous variable (pass/fail) is a representation of the actual, and observable, vote on the legislation. In contrast, each legislator's vote is the result of an individual, and unobserved, probability distribution between voting either for or against the legislation.
Dichotomous variables are most commonly measured using 1 and 0 as the two possible values. The use of 1 and 0 usually has no specific meaning relating to the variable itself. One could just as easily choose 2 and 0, or 1 and −1 as the two values. Even so, the 1, 0 coding does have the advantage of showing, when cases are summed and averaged, the proportion of cases with a score of 1. The choice and assignment of 1 and 0 is usually based on ease of interpretation and the nature of the specific research question.
Dichotomous variables may be used as either dependent (endogenous)or independent (exogenous) variables. Many research questions in the social sciences use dichotomous dependent variables. Such questions often involve individual binary choices or outcomes (e.g., to purchase or not, to vote for or against, to affirm or reverse). Estimating models with dichotomous dependent variables often requires the use of nonlinear techniques because linear models (such as ordinary least squares) may be unrealistic on a theoretical level and produce inefficient or inconsistent results.
When used as independent variables, dichotomous variables are often referred to as “controlling” or “dummy” variables. Unfortunately, referring to dichotomous independent variables in this way may lead to inadequately justifying the measurement of the variable or its inclusion in the model. As with any other variable, a dichotomous independent variable should have a sound theoretical basis and be properly operationalized.
As a cautionary note, one must not be overly eager to “dichotomize” variables (i.e., convert a measurable underlying variable to dichotomous form) for three reasons. First, such conversions may not be theoretically justifiable. The popularity of estimation techniques such as probit analysis and logit caused some to unnecessarily convert continuous dependent variables to dichotomous form. Second, even when theoretically justified, such conversions necessarily result in a loss of information. It is one thing, for example, to know that the Supreme Court affirmed a decision, but quite another to know whether it did so by a 9–0 or 5–4 vote. Third, forcing a variable into just two categories may not always be easy or appropriate if there is a nontrivial residual category. For example, if one measures handedness dichotomously (left/right), the number of ambidextrous individuals or those with arm injuries that required converting to their other hand may pose coding or analytical problems.
References
...
- Analysis of Variance
- Association and Correlation
- Association
- Association Model
- Asymmetric Measures
- Biserial Correlation
- Canonical Correlation Analysis
- Correlation
- Correspondence Analysis
- Intraclass Correlation
- Multiple Correlation
- Part Correlation
- Partial Correlation
- Pearson's Correlation Coefficient
- Semipartial Correlation
- Simple Correlation (Regression)
- Spearman Correlation Coefficient
- Strength of Association
- Symmetric Measures
- Basic Qualitative Research
- Basic Statistics
- F Ratio
- N(n)
- t-Test
- X¯
- Y Variable
- z-Test
- Alternative Hypothesis
- Average
- Bar Graph
- Bell-Shaped Curve
- Bimodal
- Case
- Causal Modeling
- Cell
- Covariance
- Cumulative Frequency Polygon
- Data
- Dependent Variable
- Dispersion
- Exploratory Data Analysis
- Frequency Distribution
- Histogram
- Hypothesis
- Independent Variable
- Measures of Central Tendency
- Median
- Null Hypothesis
- Pie Chart
- Regression
- Standard Deviation
- Statistic
- Causal Modeling
- Discourse/Conversation Analysis
- Econometrics
- Epistemology
- Ethnography
- Evaluation
- Event History Analysis
- Experimental Design
- Factor Analysis and Related Techniques
- Feminist Methodology
- Generalized Linear Models
- Historical/Comparative
- Interviewing in Qualitative Research
- Latent Variable Model
- Life History/Biography
- Log-Linear Models (Categorical Dependent Variables)
- Longitudinal Analysis
- Mathematics and Formal Models
- Measurement Level
- Measurement Testing and Classification
- Multilevel Analysis
- Multiple Regression
- Qualitative Data Analysis
- Sampling in Qualitative Research
- Sampling in Surveys
- Scaling
- Significance Testing
- Simple Regression
- Survey Design
- Time Series
- ARIMA
- Box-Jenkins Modeling
- Cointegration
- Detrending
- Durbin-Watson Statistic
- Error Correction Models
- Forecasting
- Granger Causality
- Interrupted Time-Series Design
- Intervention Analysis
- Lag Structure
- Moving Average
- Periodicity
- Serial Correlation
- Spectral Analysis
- Time-Series Cross-Section (TSCS) Models
- Time-Series Data (Analysis/Design)
- Trend Analysis
- Loading...
Get a 30 day FREE TRIAL
-
Watch videos from a variety of sources bringing classroom topics to life
-
Read modern, diverse business cases
-
Explore hundreds of books and reference titles
Sage Recommends
We found other relevant content for you on other Sage platforms.
Have you created a personal profile? Login or create a profile so that you can save clips, playlists and searches