R-squared (R2) is a statistic that explains the amount of variance accounted for in the relationship between two (or more) variables. Sometime R2 is called the coefficient of determination, and it is given as the square of a correlation coefficient.

Given paired variables (Xi,Yi), a linear model that explains the relationship between the variables is given by

Y=β0+β1X+e,

where e is a mean zero error. The parameters of the linear model can be estimated using the least squares method and denoted by β^0 and β^1, respectively. The parameters are estimated by minimizing the sum of squared residuals between variable Yi and the model β0+β1Xi, that is,

(β^0,β^1)=argminβ0,β1(Yiβ0+β1Xi)2.

It can be shown that the least squares estimations are

β^0=Y¯X¯SxySxxandβ^1=SxySxx,

where the sample cross-covariance Sxy is defined as

Sxy=1ni=1n(XiX¯)(YiY¯)=XY¯X¯Y¯.

Statistical packages such as SAS, SPLUS, ...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles