Skip to main content icon/video/no-internet

Model specification refers to the process of expressing a theory in mathematical (functional) form. The choice of model specification affects the validity of causal inferences. Arguably, severe specification errors impede the validity of inferences more than the choice of a suboptimal estimator. Yet applied researchers rarely follow a strategy when trying to develop and potentially improve the specification of their model. Rather, the choice of a model specification usually depends on a crude mixture of what is common in the field of research, methodological fads, and individual intuition. Misspecification occurs when the assumptions underlying an empirical analysis deviate from the true data-generating process. Nonrandom sampling, measurement error, model uncertainty, and a lack of independence of observations rank most prominently among the sources for model misspecification. Since researchers hardly, if ever, know the true model, most models analyzed in the social sciences will, necessarily, be misspecified. The most common misspecifications result from insufficient theoretical guidance. The resulting model uncertainty usually implies that researchers do not know the correct set of regressors (independent variables), their optimal operationalization, or the accurate functional form. This entry discusses the origins and types of specification problems and reviews the standard solutions to misspecification. The entry concludes with a discussion of the way in which researchers deal with specification issues.

Model Uncertainty

Model uncertainty results from underspecified theories. In the social sciences, theories aim at simplifying reality, therefore making generous use of ceteris paribus clauses and of highly stylized assumptions. In other words, theories neither provide full guidance about model specification to applied researchers nor do they aim at providing such information. Usually, they do not say much about which variables to include in the list of regressors, which functional form relates the right-hand-side variables to the dependent variable, the existence of conditional effects, and the existence and correct specification of temporal and spatial dependence. This section discusses the resulting specification problems in turn.

Model Uncertainty and Sensitivity Analysis

Model uncertainty has multiple origins; it occurs when researchers do not know the correct set of right-hand-side variables, the correct functional form that relates the dependent to the independent variables, the correct structure of conditionality between exogenous variables (those whose values are independent of the states of other variables in the system under study), and/or the causal dependences between exogenous variables, to mention just a few model uncertainties. Logically, researchers need to start with identifying all exogenous factors that influence the variation of the dependent variable across observations. Both the inclusion of too few or too many regressors may cause bias and will render inferences potentially invalid. Assume, as an example for the “too many” case, that researchers intend to estimate the effect of income on vote choice but include the vote intention into the model. Since vote intentions depend on structural factors such as income and education, the inclusion of the vote intention variable causes right-hand-side endogeneity (i.e., regressors may not be independent of the dependent variable or the error term; see below). If this endogeneity is not correctly modeled, it makes the correct interpretation of results impossible. Indeed, the vote intention variable will capture much of the influence of income and education on vote choice, but researchers cannot separate the two causal mechanisms unless they estimate a simultaneous equation model or unless they drop vote intention from the list of regressors, hoping that income and education fully determine vote intentions.

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading