Skip to main content icon/video/no-internet

The coefficient of correlation evaluates the similarity of two sets of measurements (i.e., two dependent variables) obtained on the same observations. The coefficient of correlation indicates the amount of information common to two variables. This coefficient takes values between −1 and +1 (inclusive).

A value of 0 indicates that the two series of measurement have nothing in common. A value of +1 says that the two series of measurements are measuring the same thing. A value of −1 says that the two measurements are measuring the same thing, but one measurement varies inversely with the other.

The squared correlation gives the proportion of common variance between two variables and is also called the coefficient of determination. Subtracting the coefficient of determination from the unity gives the proportion of variance not shared between two variables, a quantity also called the coefficient of alienation.

The coefficient of correlation measures only the linear relationship between two variables, and its value is very sensitive to outliers. Its significance can be tested with an F or a t test. The coefficient of correlation always overestimates the intensity of the correlation in the population and needs to be “corrected” in order to provide a better estimation. The corrected value is called “shrunken” or “adjusted.”

Notations and Definition

We have S observations, and for each observation, we have two measurements, denoted W and Y, with respective means MW and MY. For each observation, we define the cross product as the product of the deviations of each variable to its mean. The sum of these cross products, denoted SCPWY, is computed as

None

The sum of the cross products reflects the association between the variables. When the deviations tend to have the same sign, they indicate a positive relationship, and when they tend to have different signs, they indicate a negative relationship. The average value of the SCPWY is called the covariance (cov), and just like the variance, the covariance can be computed by dividing by S or (S– 1):

None

The covariance reflects the association between the variables, but it is expressed in the original units of measurement. In order to eliminate them, the covariance is normalized by division by the standard deviation of each variable (σ). This defines the coefficient of correlation, denoted rW .Y, which is equal to

None

Rewriting the previous formula gives a more practical formula:

None

An Example: Correlation Computation

We illustrate the computation for the coefficient of correlation with the following data, describing the values of W and Y for S = 6 subjects:

None

Step 1: Computing the sum of the cross products

First compute the means of W and Y: and

None

and

None

The sum of the cross products is then equal to

None

The sum of squares of Ws is obtained as

None

The sum of squares of Ys is

None

Step 2: Computing rW.Y

The coefficient of correlation between W and Y is equal to

None

We can interpret this value of r =–.5 as an indication of a negative linear relationship between W and Y.

Some Properties of the Coefficient of Correlation

The coefficient of correlation is a number without a unit. This occurs because of dividing the units of the numerator by the same units in the denominator. Hence, the coefficient of correlation can be used to compare outcomes across different variables. The magnitude of the coefficient of correlation is always smaller than or equal to 1. This happens because the numerator of the coefficient of correlation (see Equation 4) is always smaller than or equal to its denominator (this property follows from the Cauchy-Schwartz inequity). A coefficient of correlation equal to +1 or −1 indicates that a plot of the observations will show that they are positioned on a line.

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading