Skip to main content icon/video/no-internet

Endogenous Variables

Endogenous variables in causal statistical modeling are variables that are hypothesized to have one or more variables at least partially explaining them. Commonly referred to in econometrics and the structural equation modeling family of statistical techniques, endogenous variables may be effect variables that precede other endogenous variables; thus, although some consider endogenous variables to be dependent, such a definition is technically incorrect.

Theoretical considerations must be taken into account when determining whether a variable is endogenous. Endogeneity is a property of the model, not the variable, and will differ among models. For example, if one were to model the effect of income on adoption of environmental behaviors, the behaviors would be endogenous, and income would be exogenous. Another model may consider the effect of education on income; in this case, education would be exogenous and income endogenous.

The Problem of Endogeneity

One of the most commonly used statistical models is ordinary least squares regression (OLS). A variety of assumptions must hold for OLS to be the best unbiased estimator, including the independence of errors. In regression models, problems with endogeneity may arise when an independent variable is correlated with the error term of an endogenous variable. When observational data are used, as is the case with many studies in the social sciences, problems with endogeneity are more prevalent. In cases where randomized, controlled experiments are possible, such problems are often avoided.

Several sources influence problems with endogeneity: when the true value or score of a variable is not actually observed (measurement error), when a variable that affects the dependent variable is not included in the regression, and when recursivity exists between the dependent and independent variables (i.e., there is a feedback loop between the dependent and independent variables). Each of these sources may occur alone or in conjunction with other sources.

Figure 1 Variables B and C Are Endogenous

None

The solution to problems with endogeneity is often to use instrumental variables. Instrumental variables methods include two-stage least squares, limited information maximum likelihood, and jackknife instrumental variable estimators. Advantages of instrumental variables estimation include the transparency of procedures and the ability to test the appropriateness of instruments and the degree of endogeneity. Instrumental variables are beneficial only when they are strongly correlated with the endogenous variable and when they are exogenous to the model.

Endogenous Variables in Structural Equation Modeling

In structural equation modeling, including path analysis, factor analysis, and structural regression models, endogenous variables are said to be “downstream” of either exogenous variables or other endogenous variables. Thus, endogenous variables can be both cause and effect variables.

Consider the simple path model in Figure 1. Variable A is exogenous; it does not have any variables causally prior to it in the model. B is endogenous; it is affected by the exogenous variable A while affecting C. C is also an endogenous variable, directly affected by B and indirectly affected by A.

As error associated with the measurement of endogenous variables can bias standardized direct effects on endogenous variables, structural equation modeling uses multiple measures of latent constructs in order to address measurement error.

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading