Simultaneous Equation Modeling

Bertrand Badie; Dirk Berg-Schlosser; Leonardo Morlino

doi:10.4135/9781412994163

Entry
Reader's guide
Entries A-Z
Subject index

Return to Entries

Simultaneous Equation Modeling

Edited by:
Bertrand Badie
,
Dirk Berg-Schlosser
&
Leonardo Morlino
In:International Encyclopedia of Political Science
Chapter DOI:https://doi.org/10.4135/9781412959636.n556
Subject:General Politics & International Relations, Political Science (general)
Keywords:simultaneous equations

Request Permissions

Show page numbers Hide page numbers

Ordinary least squares (OLS), along with its cousins such as probit, is the workhorse method of empirical political science. Starting with, as usual, the linear model,

OLS is a fine way of estimating β and θ as long as x and z are exogenous. An explanatory variable is exogenous if whatever statistical process determines it does not depend on the statistical process that determines either the dependent variable or the error term. If an explanatory variable is not exogenous (and so is said to be endogenous), then OLS has severe problems. These problems are typically worse than for other forms of misspecification and do not disappear as the sample size grows. Endogeneity issues cannot be fixed by the usual tweaks to OLS (generalized least squares); different estimation methods are required.

Predetermined socioeconomic characteristics are, by definition, exogenous. Beyond a few simple variables (physical characteristics of people or [Page 2408]countries), the argument that a variable is exogenous must be made theoretically and must be made relative to the dependent variable of interest. Thus, for example, if we believe that democracy is exogenous to economic development, a regression of economic development on democracy yields meaningful results; but we need some theory to know that democracy is exogenous—that is, that it is not the case that increased wealth leads to more democracy. Empirical results are conditional on this assumption; the assumption of exogeneity cannot be tested empirically.

Suppose, for example, that we are interested in explaining attitudes toward political candidates. Let y be a measure of how much one likes a political candidate and x be a measure of how close the candidate is to the voter on various issues. It may be the case that voters who like a candidate more are more likely to perceive the candidate as being close to them on issues, so x may not be exogenous. If x is endogenous to y, it is well known that OLS can be badly biased, and these problems persist even with huge sample sizes (i.e., OLS is not even consistent). The solution, worked out in the 1940s by various Nobel Prize-winning econometricians associated with the Cowles Foundation, is to estimate a series of simultaneous equations for both x and y.

Thus, for two equations, if x determines y but y also determines x, we can write

where z and w are exogenous and ε and ζ are error terms that may well be correlated with each other (but separately satisfy the usual Gauss-Markov conditions). One issue is what method is best for estimating the various model parameters; but a more fundamental issue is whether it is even possible to estimate these parameters. The latter is called the identification problem.

Identification

The identification issue is critical, and dealing with it comes before any estimation issue. The basic question is whether more than one set of parameter estimates is equally consistent with the data; almost always, if more than one set of estimates is consistent with the data, an infinite number of estimates are consistent with the data. In such cases, the model is not identified. In this situation, even if we were sure we had ideal estimates of the parameters, someone else could have equally ideal but different parameter estimates. This problem does not arise with models typically estimated via OLS (but only because we assume away the problem by assuming that all explanatory variables are exogenous) but is always a potential issue with simultaneous equation models. A simple example shows the problem. Suppose we have an infinite amount of data, so there is no statistical uncertainty. Consider the model where x determines y and vice versa but nothing else. We run our favorite computer program and obtain,

...

Sign in to access this content

Get a 30 day FREE TRIAL

Watch videos from a variety of sources bringing classroom topics to life
Read modern, diverse business cases
Explore hundreds of books and reference titles

No internet connection.

All search filters on the page have been cleared.

Your search has been saved.

Entry

Reader's guide

Entries A-Z

Subject index

Simultaneous Equation Modeling

Identification

Sign in to access this content

Get a 30 day FREE TRIAL

Read next

More like this

Sage Recommends