Skip to main content icon/video/no-internet

The statistical language R has proven to be a popular choice among statisticians, researchers, and organizations for statistical analysis and graphs. This is partly due to the free nature of the program and the power of its programming capabilities. Other strengths include its exceptionally good support, its continual improvement (updates occur almost monthly at times), and its very active user community. R is a model example of the benefits of free and open-source software, and a wealth of contributed documentation is freely available. In recent years, the R language has amassed a growing, and very strong, supporting literature, which consists of several introductory texts and companions to existing textbooks and which implements modern statistical methods, regression modeling strategies, and specialized types of models. R integrates very well with LaTeX and also has very powerful graphics capabilities. For some, the lack of a powerful graphical user interface (i.e., a point-and-click operation) is one of R's most glaring weaknesses.

Presently, R is an open-source software for statistical computing and graphics that can be freely downloaded from the R-project website and runs on several operating systems, such as Mac OS X, Microsoft Windows, and several UNIX platforms. This entry describes the history, general characteristics, and strengths and weaknesses of R and compares R with other statistical software.

History

In the early 1990s two professors, Ross Ihaka and Robert Gentleman, at the University of Auckland became interested in developing a statistical software that could be used to teach their introductory data analysis courses and that ran efficiently in their teaching laboratories. Since both authors were familiar with the S language, created in the Bell Laboratories by Rick Becker, John Chambers, and Allan Wilks, R was developed as an implementation of S. It gained its name partially as a play on the name S and partially due to the authors’ first names starting with the letter R. An initial version of R was made available to the public in 1993 at StatLib, a system for electronically distributing statistical software. After much constructive feedback and encouragement, Ihaka and Gentleman made the source code available in 1995. Then 2 years later, as a result of the large volume of suggestions and contributions overwhelming their capacity to make changes to the source code and report back to the community, a core group of developers was established to make changes to the source code, and The R Project was born. As The R Project grew, its goals shifted toward optimizing and expanding R's capabilities while continuing to provide support and maintenance for the software.

General Characteristics

R is a command-based environment that supports a wide variety of statistical and mathematical operations. At its simplest function level, R can be used as a calculator, and at higher levels it can execute complex simulations and statistical analyses.

Data Type and Objects

The items that R operates on include vectors and matrices of numerical or logical values and vectors of character strings. Such items can be assigned to objects that can be saved into files and used in later functions. During any session, the names of all objects in use (the workspace) can be accessed with the command:

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading