Skip to main content icon/video/no-internet

Measures of central tendency provide a single summary number that captures the general location of a set of data points. This measure should be a good representation of the set of data. There are three common measures of central tendency used: the mean, the median, and the mode. Depending on the characteristics of the data, one measure may be more appropriate to use than the others.

The mean and the median are most commonly used to summarize data that can take on many different values (i.e., continuous data); the mode is often used to summarize data that can only take on a finite number of specific values (i.e., categorical data). The construction of each measure is illustrated with the gender and height data in Table 1 collected from 22 subjects.

The Mean

The mean is the most commonly used measure of central tendency. It is often referred to as x-bar, x, and is found by using the formula, x =

None

where

xi represents the individual observation from the ith subject;

P is the summation sign that indicates that you sum over everything that follows it. The limits below and above the sign indicate where you start, and stop, the summation, respectively. As it is written above, it says you should begin summing with x1 and stop with xn; and

n represents the number of observations in your data set.

For illustrative purposes, consider the data in Table 1. To compute the mean height, we do the following:

  • Sum over all the observations (the xi values).
  • Divide the quantity in Step 1 by the total number of observations. 1
Table 1 Heights of 22 Students in an Introductory Statistics Class
x1Female61
x2Female62
x3Female63
x4Female63
x5Female64
x6Female64.5
x7Female65
x8Female65
x9Female65
x10Female65
x11Female66
x12Female66
x13Female66
x14Male67
x15Female67
x16Male67
x17Male68
x18Female68
x19Male69
x20Female69.5
x21Male72
x22Male74

For the data set above, the above formula gives

None

Additional Notes about the Mean

  • The small n represents the sample size when computing the mean for a sample. When computing the mean for a population, a large N is generally used.
  • The sample mean represents the population mean better than any other measure of central tendency.
  • The mean can be thought of as being like a fulcrum that balances the weight of the data.
  • The sum of deviations of each observation from the mean is 0.
  • The mean is in the same units of measurement as the observations in your data set.
  • The mean is very sensitive to outliers, that is, extreme values. An observation that lies far away from the others will pull the mean toward it. For example, if the last observation in Table 1 were 740 instead of 74, the mean would jump to 96.5 (an increase of more than 30 in.).
  • The mean is generally the preferred measure of central tendency for data that are symmetric (evenly distributed about their center), but is not generally recommended to be used to describe data with outliers, or data that are not symmetric.

The Median

The median is used less often than the mean to describe the central tendency of a set of continuous data, but is still a commonly used measure. It is the midpoint of the data, and is often denoted with the letter M: To find the median, the following steps are

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading