Bivariate Regression

Neil J.Salkind

doi:10.4135/9781412961288

Entry
Reader's guide
Entries A-Z
Subject index

Return to Entries

Bivariate Regression

Edited by:
Neil J. Salkind
In:Encyclopedia of Research Design
Chapter DOI:https://doi.org/10.4135/9781412961288.n31
Subject:Research Design
Keywords:bivariate regression; multiple regression

Request Permissions

Show page numbers Hide page numbers

Regression is a statistical technique used to help investigate how variation in one or more variables predicts or explains variation in another variable. This popular statistical technique is flexible in that it can be used to analyze experimental or nonexperimental data with multiple categorical and continuous independent variables. If only one variable is used to predict or explain the variation in another variable, the technique is referred to as bivariate regression. When more than one variable is used to predict or explain variation in another variable, the technique is referred to as multiple regression. Bivariate regression is the focus of this entry.

Various terms are used to describe the independent variable in regression, namely, predictor variable, explanatory variable, or presumed cause. The dependent variable is often referred to as an outcome variable, criterion variable, or presumed effect. The choice of independent variable term will likely depend on the preference of the researcher or the purpose of the research. Bivariate regression may be used solely for predictive purposes. For example, do scores on a college entrance exam predict college grade point average? Or it may be used for explanation. Do differences in IQ scores explain differences in achievement scores? It is often the case that although the term predictor is used by researchers, the purpose of the research is, in fact, explanatory.

Suppose a researcher is interested in how well reading in first grade predicts or explains fifth-grade science achievement scores. The researcher hypothesizes that those who read well in first grade will also have high science achievement in fifth grade. An example bivariate regression will be performed to test this hypothesis. The data used in this example are a random sample of students (10%) with first-grade reading and fifth-grade [Page 87]science scores and are taken from the Early Childhood Longitudinal Study public database. Variation in reading scores will be used to explain variation in science achievement scores, so first-grade reading achievement is the explanatory variable and fifth-grade science achievement is the outcome variable. Before the analysis is conducted, however, it should be noted that bivariate regression is rarely used in published research. For example, intelligence is likely an important common cause of both reading and science achievement. If a researcher was interested in explaining fifth-grade science achievement, then potential important common causes, such as intelligence, would need to be included in the research.

Regression Equation

The simple equation for bivariate linear regression is Y = a + bX + e. The science achievement score, Y, for a student equals the intercept or constant (a), plus the slope (b) times the reading score (X) for that student, plus error (e). Error, or the residual component (e), represents the error in prediction, or what is not explained in the outcome variable. The error term is not necessary and may be dropped so that the following equation is used: Y′ = a + bX. Y′ is the expected (or predicted) score. The intercept is the predicted fifth-grade science score for someone whose first-grade reading score is zero. The slope (b, also referred to as the unstandardized regression coefficient) represents the predicted unit increase in science scores associated with a one-unit increase in reading scores. X is the observed score for that person. The two parameters (a and (b) that describe the linear relation between the predictor and outcome are thus the intercept and the regression coefficient. These parameters are often referred to as least squares estimators and will be estimated using the two sets of scores. That is, they represent the optimal estimates that will provide the least error in prediction.

...

Sign in to access this content

Get a 30 day FREE TRIAL

Watch videos from a variety of sources bringing classroom topics to life
Read modern, diverse business cases
Explore hundreds of books and reference titles

No internet connection.

All search filters on the page have been cleared.

Your search has been saved.

Entry

Reader's guide

Entries A-Z

Subject index

Bivariate Regression

Regression Equation

Sign in to access this content

Get a 30 day FREE TRIAL

Read next

More like this

Sage Recommends