Skip to main content icon/video/no-internet

The Bernoulli distribution is a discrete probability distribution for a random variable that takes only two possible values, 0 and 1. Examples of events that lead to such a random variable include coin tossing (head or tail), answers to a test item (correct or incorrect), outcomes of a medical treatment (recovered or not recovered), and so on. Although it is the simplest probability distribution, it provides a basis for other important probability distributions, such as the binomial distribution and the negative binomial distribution.

Definition and Properties

An experiment of chance whose result has only two possibilities is called a Bernoulli trial (or Bernoulli experiment). Let p denote the probability of success in a Bernoulli trial (0 < p < 1). Then, a random variable X that assigns value 1 for a success with probability p and value 0 for a failure with probability 1 – p is called a Bernoulli random variable, and it follows the Bernoulli distribution with probability p, which is denoted by X ∼ Ber(p). The probability mass function of Ber(p) is given by

None

The mean of X is p, and the variance is p(1p). Figure 1 shows the probability mass function of Ber(.7). The horizontal axis represents values of X, and the vertical axis represents the corresponding probabilities. Thus, the height is .7 at X = 1, and .3 for X = 0. The mean of Ber(0.7) is 0.7, and the variance is .21.

Suppose that a Bernoulli trial with probability p is independently repeated for n times, and we obtain a random sample X1, X2, …, Xn. Then, the number of successes Y = X1 + X2 + … + Xn follows the binomial distribution with probability p and the number of trials n, which is denoted by Y Bin(n,p). Stated in the opposite way, the Bernoulli distribution is a special case of the binomial distribution in which the number of trials n is 1. The probability mass function of Bin(n, p) is given by

None

where

None
is the factorial of n, which equals the product n(n − 1) … 2 · 1. The mean of Y is np, and the variance is np(1p). Figure 2 shows the probability mass function of Bin(10,.7), which is obtained as the distribution of the sum of 10 independent random variables, each of which follows Ber(.7). The height of each bar represents the probability that Y takes the corresponding value; for example, the probability of Y = 7 is about .27. The mean is 7 and the variance is 2.1. In general, the distribution is skewed to the right when p < .5, skewed to the left when p > .5, and symmetric when p = .5.

Figure 1 Probability Mass Function of the Bernoulli Distribution With p = .7

None

Relationship to Other Probability Distributions

The Bernoulli distribution is a basis for many probability distributions, as well as for the binomial distribution. The number of failures before observing a success t times in independent Bernoulli trials follows the negative binomial distribution with probability p and the number of successes t. The geometric distribution is a special case of the negative binomial distribution in which the number of failures is counted before observing the first success (i.e., t = 1).

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading