Skip to main content icon/video/no-internet

A histogram is a pictorial representation of the information in a frequency distribution or a relative frequency distribution of a data set. Histograms are used to display information about continuous variables and typically require grouping data into classes; this feature distinguishes it from a bar graph, which is primarily used for categorical or discrete data with a limited number of categories that do not require further grouping. A histogram displays the overall pattern and deviation of the data, but due to the necessity of grouping data into classes, it is not an exact representation of the data values.

Histograms are most often created using computer software but may also be created manually. To make a histogram of a data set, proceed as follows:

  • Divide the range of the data into classes of equal width.
  • Count the number of observations in each class. These counts are called frequencies, and a table of frequencies for all classes is a frequency table.
  • Draw the histogram. The width or base of the bar represents the class, and the bar height is the class frequency. The graph is drawn with no horizontal space between the bars (unless a class is empty, so that its bar has zero height).

Example

Table 1 gives the survival times in days of 72 guinea pigs after they were injected with tubercle bacilli in a medical experiment. Figure 1 is a histogram for the same data.

This histogram is skewed to the right; that is, the right side of the histogram contains the larger half of the observations in the data and extends a greater distance than the left side. A histogram is skewed to the left when its left side extends a much larger distance than the right side; a histogram is symmetric if the right and left sides have essentially the same shape. A histogram with one major peak is described as unimodal; when a histogram has two major peaks, it is described as bimodal. If every interval has essentially the same number of observations, the histogram is described as a uniform histogram.

Table 1 Survival Times in Days of 72 Guinea Pigs After They Were Injected With Tubercle Bacilli
45455356565758666773
74798080818181828383
84888991919292979999
100100101102102102103104107108
109113114118121123126128137138
139144145147156162174178179184
191198211214243249329380403511
522598
Source: Adapted from Bjerkedal (1960).

Figure 1 Survival Times in Days of 72 Guinea Pigs After They Were Injected With Tubercle Bacilli

None
Source: Adapted from Bjerkedal (1960).

Note that the selection of the number of bars to be included, which may also be stated as selection of the width of the bars, is an important decision. The same data displayed in histograms using varying bar widths can appear quite different, a fact that may be explored using Azzalini and Bowman's data and West and Ogden's histogram applet cited below.

  • histogram
RenjinTu

Further Readings

Azzalini, A., and Bowman, A. W.A look at some data on the Old Faithful geyser. Applied Statistics39 (1990). 365–365.
Bjerkedal, T.Acquisition of resistance in guinea pigs infected with different doses of virulent tubercle bacilli. American Journal of Hygiene72 (1960). 130–148.
West, R. W., & Ogden, R. T. (1998). Interactive demonstrations for statistics on the

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading