
Serial correlation, or autocorrelation, is the correlation of a variable with itself across successive observations. It typically arises when the order of observations matters, most commonly when the same variable is measured on the same participant repeatedly over time. Serial correlation is therefore an important issue to consider in any longitudinal design.

Serial correlation has mainly been considered in multiple regression and time-series models. Multiple regression models are designed for independent observations, where the existence of serial correlation is undesirable, so the main focus in multiple regression is on testing whether serial correlation exists. In contrast, the purpose of time-series analysis is to model the serial correlation in order to understand the nature of time dependence in the data; the pattern of serial correlation is essential for identifying an appropriate model. This entry discusses serial correlation in the contexts of multiple regression and time series.

Multiple Regression Model

Let the multiple regression model be

yi = xiβ + εi, (1)

where yi is the response and xi is a 1 × (k + 1) vector consisting of a 1 and the values of the k predictors in the ith observation. The assumptions of this model are (a) the expectation of the error εi is 0; (b) εi has constant variance σ²; and (c) εi and εj are uncorrelated if i ≠ j.

Let

β̂ = (X′X)⁻¹X′y

be the least squares (LS) estimator of β, where X is the n × (k + 1) matrix whose rows are the xi, and let SSE be the sum of squared errors. When the assumptions are valid, β̂ is the best linear unbiased estimator, and s² = SSE/(n − k − 1) is an unbiased estimator of σ². When the errors are correlated (violating Assumption c), β̂ is still unbiased, but s² and the estimated standard error of β̂ are biased. Consequently, the F or t statistic for testing the significance of β̂ is misleading. Therefore, it is important to test for the presence of serial correlation. The most widely used test is the Durbin–Watson d test, which tests for first-order serial correlation ρ = Corr(εi, εi−1) using the autoregressive model

εi = ηεi−1 + ui, (2)

where the ui are independent errors.
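As a concrete illustration of Equation 1 and the LS quantities above, here is a minimal NumPy sketch using simulated data; the sample size, coefficients, and error scale are illustrative choices, not values from the entry.

```python
import numpy as np

rng = np.random.default_rng(0)

# Simulate n observations from Equation 1 with k = 2 predictors.
# All values here (n, beta, the error scale) are illustrative.
n, k = 200, 2
X = np.column_stack([np.ones(n), rng.normal(size=(n, k))])  # rows are x_i
beta = np.array([1.0, 0.5, -0.3])
eps = rng.normal(size=n)        # independent errors, so Assumption (c) holds
y = X @ beta + eps

# LS estimator beta_hat = (X'X)^{-1} X'y, computed stably via lstsq.
beta_hat, *_ = np.linalg.lstsq(X, y, rcond=None)
resid = y - X @ beta_hat
sse = resid @ resid
s2 = sse / (n - k - 1)          # unbiased estimator of sigma^2 under (a)-(c)
print(beta_hat.round(2), round(s2, 2))
```

With independent errors, β̂ lands near the true β and s² near the true error variance; the residuals `resid` are the inputs to the Durbin–Watson statistic discussed next.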

Obviously, ρ = 0 if η = 0. To test the null hypothesis H0: η = 0, the test statistic d is formulated as

d = [Σi=2..n (ei − ei−1)²] / [Σi=1..n ei²],

where the ei are the LS residuals from fitting Equation 1. Let η̂ be the LS estimate of η in Equation 2. It follows from

Σi=2..n eiei−1 / Σi=1..n ei² ≈ η̂

that d ≈ 2(1 − η̂). Thus, if there is no serial correlation, d ≈ 2; if the serial correlation is close to 1, d ≈ 0; and if the serial correlation is close to −1, d ≈ 4. The critical values of d (denote the lower bound as dL and the upper bound as dU) depend on n, k, and the significance level of the test. Tables of dL and dU can be found in the appendix of Arnold Studenmund's book. The appropriate decision rules for testing for positive serial correlation are
reject H0 (conclude positive serial correlation) if d < dL; do not reject H0 if d > dU; and the test is inconclusive if dL ≤ d ≤ dU.

The decision rules of testing for negative serial correlation are

reject H0 (conclude negative serial correlation) if d > 4 − dL; do not reject H0 if d < 4 − dU; and the test is inconclusive if 4 − dU ≤ d ≤ 4 − dL.
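The d statistic and both decision rules can be coded directly. The following is a minimal sketch with simulated AR(1) errors; the value η = 0.6 is illustrative, and the bounds dL = 1.65 and dU = 1.69 are hypothetical placeholders, not values from the published tables.

```python
import numpy as np

def durbin_watson_d(e):
    """Durbin-Watson d = sum_{i=2..n} (e_i - e_{i-1})^2 / sum_{i=1..n} e_i^2."""
    e = np.asarray(e, dtype=float)
    return np.sum(np.diff(e) ** 2) / np.sum(e ** 2)

def dw_decision(d, d_L, d_U, alternative="positive"):
    """Apply the decision rules; d_L and d_U are the tabulated bounds,
    which depend on n, k, and the significance level."""
    if alternative == "positive":
        if d < d_L:
            return "reject H0: positive serial correlation"
        if d > d_U:
            return "do not reject H0"
        return "inconclusive"
    # Negative serial correlation uses 4 - d_L and 4 - d_U.
    if d > 4 - d_L:
        return "reject H0: negative serial correlation"
    if d < 4 - d_U:
        return "do not reject H0"
    return "inconclusive"

# Simulate AR(1) errors e_i = eta * e_{i-1} + u_i with eta = 0.6 (illustrative).
rng = np.random.default_rng(1)
n, eta = 500, 0.6
e = np.zeros(n)
for i in range(1, n):
    e[i] = eta * e[i - 1] + rng.normal()

d = durbin_watson_d(e)
eta_hat = np.sum(e[1:] * e[:-1]) / np.sum(e[:-1] ** 2)  # LS estimate of eta
print(round(d, 2), round(2 * (1 - eta_hat), 2))  # d is close to 2(1 - eta_hat)

# Hypothetical bounds; real d_L, d_U come from the tables for the given n, k, alpha.
print(dw_decision(d, d_L=1.65, d_U=1.69))
```

With positive serial correlation of this size, d falls well below 2 and below the lower bound, so the rule rejects H0 in favor of positive serial correlation; the printed comparison also illustrates the approximation d ≈ 2(1 − η̂).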

The inconclusive region is one main disadvantage of the Durbin–Watson d test. Moreover, the test ignores serial correlation beyond the first order, and it is not applicable when earlier observed values of y are used as predictors of later y in the regression model.

...
