Skip to main content icon/video/no-internet

Weighted Least Squares

One of the most used statistical techniques is regression analysis. In its simplest (and most used) form, regression analysis is used to describe and analyze the relationship between two continuous variables (e.g., weight and systolic blood pressure). In such instances, each observation (i.e., patient) can be represented with a point in a two-dimensional plot. The analysis is done by assuming that the relationship follows a known mathematical model. The data and the model are displayed in a graph called scatterplot, where x, or explanatory variable, is plotted in the horizontal axis, and y, or response, in the vertical axis. When the model is a straight line (defined by a slope and an intercept) and it is fitted to the data using the method of least squares, it is known as ordinary least squares (OLS), and the analysis is called simple linear regression (SLR). Weighted least squares (WLS) is a modification of OLS.

Least Squares Methods

The OLS method consists of finding the equation (i.e., slope and intercept) of the line that minimizes the sum of the squared vertical distances between each point and the line, hence the name least squares. These distances are called residuals. A picture helps understand these concepts. In 2001, Frank Bengel and colleagues studied the relationship between percent change in ejection fraction from rest to exercise (y variable) and hydroxyephedrine retention (x variable) in heart transplant patients. Each point in Figure 1 represents a patient and the line was computed using the OLS method. As is apparent from the figure, some points are farther from the line than others. The residuals are the distances from the line to each point in the y direction. By convention, points below the line have a negative residual and those above the line, a positive residual. The well-known correlation coefficient is a measure of the strength of the linear association and depends on the magnitude of the residuals. In this case, r = .61, indicating a moderately positive association that is statistically significant (i.e., not due to chance, since p < .001). An equivalent statement is that the slope of the line is significantly different from zero.

Mathematically, the OLS method minimizes the quantity

None

where ei is the residual of the ith point and n is the number of points. In this sum, each residual has the same weight, and thus, each point (or residual) contributes equally to the sum. This is consistent with the assumption that every point contains the same amount of information, which is one of the key statistical assumptions of SLR. The (theoretical) assumption is that for each x measure, there exists a distribution of y measures with a mean that depends on the x measure and a variance that is the same for all values of x.

In statistical terms, the uniformity of variances is called homoscedasticity. Frequently, it is assumed that the distribution is normal. Figure 2a illustrates this assumption. The popularity of OLS is due, in large part, to the fact that, under homoscedasticity (together with the independence of the observations), the fitted line has optimal statistical properties (i.e., the slope and intercept have smallest variance).

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading