Skip to main content icon/video/no-internet

Statistical graphics provide visual displays of quantitative information. While graphs have been an important tool in the statistical sciences, their role and applications have evolved over time. In the past, graphical displays were generally employed for pedagogical purposes and to present the final results of an analysis. Currently, graphs are being incorporated more directly into data analysis itself, where they are used to provide insights and directions for further investigation. In this entry, the basic elements, major advantages, and recent developments of statistical graphics are presented.

Statistical graphs are used for several purposes. First, they provide a tool for exploring the contents of a data set; in effect, graphical displays enable researchers to interact directly with the “raw material” for their analyses. Second, graphs enable researchers to look for structure in their data; they provide a ready means to identify interesting features such as concentrations of observations at particular data values, covariance across variables, outliers in distributions, and so on. Third, graphical displays are useful for checking the assumptions of statistical models; for example, they can be used to examine distributional shapes of constituent variables or to search for systematic patterns among model residuals. Fourth, graphs remain an excellent vehicle for communicating the results of statistical analyses; they provide fairly intuitive representations of complex models that can be understood by broad audiences.

Of course, all of the preceding objectives could also be achieved through numerical methods, such as equations and tabular representations. But graphs have several advantages. For one thing, an enormous amount of information can be represented directly or summarized in a single well-constructed graph; therefore, graphical displays are useful for depicting large, complex data sets. At the same time, graphs are not as reliant on underlying assumptions as are numerical summaries of data such as descriptive statistics or model coefficients. And graphs almost inherently encourage greater interaction between the researcher and the data; interesting features stand out immediately and graphs provide the means to pursue them to gain further insights.

Historical Evolution

The earliest “modern” graphical displays, generated during the Enlightenment period of the 17th and 18th centuries, downplayed depictions of empirical data in favor of graphing the abstract functions that seemed to describe the dynamics of the observable universe. But a minor scientific revolution occurred at the end of the 18th century, when an iconoclastic Scottish political commentator, William Playfair, published The Commercial and Political Atlas (various editions were produced between 1786 and 1801) and the Statistical Breviary (1801). The substantive objective for each of these books was to provide information about the British economy, relative to other nations. But they are remembered largely because Playfair developed a number of graphical displays to illustrate his data and support his arguments. Playfair's innovations would be recognizable immediately to a modern reader; they include bar charts, pie charts, and line charts.

During the 19th century, graphs were used widely in scientific communications. This period is characterized by the use of creative pictorial devices to convey information as efficiently as possible. For example, a famous map by French engineer Charles Joseph Minard depicts Napoleon's march during the Russian campaign of 1812. It has been hailed as the finest statistical graphic, in large part because it encodes information about six different variables into a single display. Graphical representations of empirical information waned once again during the first half of the 20th century. During this period, statisticians turned their attention to confirmatory methods and hypothesis testing. These are strategies where the characteristics of the data need to be well understood before the tests are applied to gain insight about the values of specific parameters. And since assumptions about the processes that generated the data precede the tests, graphical displays tend to be less useful.

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading