Section 3.4 Normal Approximation to the Binomial Distribution
Approximating the Binomial Distribution.
In this section we will see how the normal distribution can be used to approximate probabilities from the binomial distribution. It may seem strange that we would want to approximate binomial probabilities. After all, we can compute their exact value using the binomial probability formula.
To see why an approximation may be useful, consider the following example.
Example 3.4.1. Recognizing a Complex Binomial Probability Computation.
A recent study has determined that 32.2% of Americans are obese. A research group wishing to study this phenomenon samples \(12,000\) individuals in a large metropolitan area. Describe how to find the probability that no more than \(3750\) of these individuals are obese, but do not perform the actual computation.
In order to find this probability using the binomial probability formula above, we need to find the sum:
\begin{equation*} P(X \leq 3750) = \sum_{k=0}^{3750} C(12000,k)(0.322)^{k}(0.678)^{12000-k}\text{.} \end{equation*}
This involves 3751 instances of the binomial probability formula, and would take a lot of time.
If it were possible, we would certainly be interested in being able to approximate the sum above if it can save us from performing 3751 separate computations. Even using a computer, this process would be time consuming. In this section we will learn when we can use the normal distribution to approximate the binomial distribution, as well as how to carry out that approximation.
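To get a feel for what such a computation involves, here is a minimal sketch in Python that sums all 3751 terms exactly; it is not part of the original example, and it assumes the scipy library is available.

```python
# A minimal sketch (assumes scipy is installed): sum the 3751 binomial terms
# from Example 3.4.1 exactly rather than by hand.
from scipy.stats import binom

n, p = 12000, 0.322
exact = binom.cdf(3750, n, p)   # P(X <= 3750) = P(X=0) + P(X=1) + ... + P(X=3750)
print(round(exact, 4))          # roughly 0.013; compare with the approximation found later
```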
Objectives
After finishing this section you should be able to
-
describe the following terms:
continuity correction
criteria for approximation
normal approximation to the binomial distribution
-
accomplish the following tasks:
Determine if it is appropriate to use the normal approximation
Correctly apply the continuity correction
Use the normal distribution to approximate binomial probabilities
Subsection 3.4.1 Visualizing the Binomial Distribution
Before we even start talking about how we can approximate binomial probabilities using the normal distribution, let's think a little about why we can. Below are three probability histograms for a binomial random variable \(X\) resulting from \(n = 10\) trials. The one on the left shows the distribution of \(X\) when \(p = 0.1\text{,}\) the one in the middle when \(p = 0.5\text{,}\) and the one on the right when \(p = 0.9\text{.}\)
Which of these distributions would we call mound-shaped? The one in the middle appears to be the most mound-shaped of the three. The other two are skewed either to the right or to the left. Note that the one in the middle corresponds to a probability of success of \(p = 0.5\text{.}\) The binomial distribution looks the most like the normal distribution when \(p = 0.5\text{.}\) However, as \(n\) increases, the value of \(p\) becomes less important. Consider the distributions below with the same values of \(p\text{,}\) but with \(n = 80\text{.}\)
Notice that with the larger value of \(n\text{,}\) all three of these probability histograms look pretty mound shaped. Also notice that as \(n\) increases, the number of bars increases as well, and the distribution of probabilities starts to look less stair-stepped, and more like a smooth curve. Try playing with this yourself by performing the following steps.
Open the interactive binomial distribution page.
Change the value of \(p\) (in the bottom right-hand corner) to several different percents to see what happens (for example, try 25, 50, and 75).
Change the value of \(n\) (in the bottom left-hand corner) to several different numbers to see what happens (for example, try \(n=10\text{,}\) \(30\text{,}\) \(50\text{,}\) and so on).
Try different combinations of \(n\) and \(p\) and notice how mound-shaped or skewed the distribution looks.
Finally, click the “Show Normal Curve” button to see how the normal curve “fits” on top of the binomial probability histogram.
Hopefully you have noticed that the larger \(n\) is and the closer \(p\) is to \(0.5\text{,}\) the smaller the “gap” is between the normal curve and the bars. That is, less of each bar sticks up above the normal curve, and less of the curve is left unfilled above the bars. The smaller this “gap” is, the better our approximation will be.
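If you prefer to experiment offline, the following Python sketch mimics the interactive page; it assumes numpy, scipy, and matplotlib are installed, and the variable names are only illustrative.

```python
# Draw a binomial probability histogram and overlay the matching normal curve.
# A rough stand-in for the interactive page; assumes numpy, scipy, matplotlib.
import numpy as np
from scipy.stats import binom, norm
import matplotlib.pyplot as plt

n, p = 80, 0.5                    # try n = 10, 30, 50, ... and p = 0.25, 0.5, 0.75
k = np.arange(n + 1)
plt.bar(k, binom.pmf(k, n, p), width=1.0, alpha=0.5, label="binomial probabilities")

mu, sigma = n * p, np.sqrt(n * p * (1 - p))
x = np.linspace(0, n, 400)
plt.plot(x, norm.pdf(x, mu, sigma), color="red", label="normal curve")
plt.legend()
plt.show()
```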
Checkpoint 3.4.5.
Let \(X\) be a binomial random variable with \(n\) trials and a probability of success \(p\text{.}\)
Question: which values for \(n\) and \(p\) will produce the most mound-shaped probability histogram?
\(n=500, p=0.5\)
\(n=10, p=0.5\)
\(n=100, p=0.2\)
\(n=50, p=0.85\)
(a)
Checkpoint 3.4.6.
The shape of a binomial distribution histogram depends not only on the value of \(p\text{,}\) but also on the size of \(n\text{.}\) In fact, as the size of \(n\) increases, the distribution looks less “stair-stepped” and more like a smooth probability density curve.
Question: is the above statement true or false?
true
Checkpoint 3.4.7.
Let \(X\) be a binomial random variable with \(n\) trials, probability of success \(p\text{,}\) and a probability of failure \(q = 1-p\text{.}\)
Question: which of the following will make the probability histogram for \(X\) more mound–shaped?
Making \(n\) larger
Making \(n\) smaller
Making \(p\) closer to \(1\)
Making \(p\) closer to \(0\)
Making \(q\) closer to \(0.5\)
Making \(q\) closer to \(1\)
(a) and (e)
Subsection 3.4.2 When can We Approximate?
If the binomial distribution can take on so many different shapes depending on the parameters \(n\) and \(p\text{,}\) when is it enough like the mound-shaped normal distribution to allow us to use the normal distribution to approximate probabilities?
Principle 3.4.8. Criteria for Approximation.
A normal distribution can be used to approximate binomial probabilities as long as both \(n\times p\) and \(n\times q\) are greater than \(5\text{.}\)
Of course, the larger \(n\times p\) and \(n\times q\) get, the better the approximation will be. However, the criteria above tell us when an approximation will be “good enough” for us to use. Notice that these criteria have nothing to do with the specific probability that we want to compute. It doesn't matter if we are looking for the probability that \(X\) is greater than a number, less than a number, or between two numbers. What matters is how large \(n\times p\) and \(n\times q\) are. This is because, as we saw in the previous subsection, these two parameters control the shape of the binomial probability histogram, and that shape is what either matches a normal distribution well or does not.
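To see how mechanical this check is, here is a small Python helper; the function name is ours and is only an illustration of the criteria, not standard notation.

```python
# Check the criteria for approximation: both n*p and n*q must be greater than 5.
# The helper name is illustrative only.
def can_use_normal_approximation(n, p):
    q = 1 - p
    return n * p > 5 and n * q > 5

print(can_use_normal_approximation(50, 0.93))   # False; see Example 3.4.9 below
print(can_use_normal_approximation(20, 0.60))   # True; see Checkpoint 3.4.15 below
```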
Example 3.4.9. Determining if We Can Approximate.
A binomial random variable \(X\) is the result of \(50\) trials in which the probability of a success is \(0.93\text{.}\) We wish to approximate the probability that \(X\) is at least \(30\text{.}\) Can we do this using a normal distribution?
To answer this we check both \(n\times p\) and \(n\times q\text{.}\)
\(n\times p = 50(0.93) = 46.5\) which is greater than \(5\text{,}\) so we are okay here.
However, \(n\times q = 50(1-0.93) = 50(0.07) = 3.5\text{,}\) which is not greater than \(5\text{.}\) Therefore, we can not use a normal distribution to approximate this binomial probability.
Example 3.4.10. Determining a Minimum Number of Trials to Approximate.
A factory has determined that its manufacturing process produces bad widgets 0.5% of the time. Suppose that they wish to take a sample of \(n\) widgets to run quality control tests. How many widgets must they sample before they can use a normal approximation to get probabilities?
We must ensure that \(n\times p\) and \(n\times q\) are both greater than \(5\text{.}\)
-
To get \(n\times p > 5\text{,}\) we solve
\begin{equation*} n(0.005) > 5 \Rightarrow n > 1000\text{.} \end{equation*} -
To get \(n\times q > 5\text{,}\) we solve
\begin{equation*} n(0.995) > 5 \Rightarrow n > 5.02\text{.} \end{equation*}
Taking the larger of these two, the factory must sample at least 1001 widgets to be able to use the normal approximation to compute probabilities.
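The same algebra can be checked numerically. The short sketch below, which assumes the example's defect rate of \(p = 0.005\text{,}\) finds the smallest whole number of trials satisfying both inequalities.

```python
# Smallest integer n with n*p > 5 and n*q > 5, for the widget example's p = 0.005.
import math

p = 0.005
q = 1 - p
n_min = math.floor(max(5 / p, 5 / q)) + 1   # smallest integer strictly above both bounds
print(n_min)   # 1001
```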
Checkpoint 3.4.13.
A binomial random variable \(X\) comes from a process with \(n=20\) trials in which the probability of a success is \(p=0.15\text{.}\)
Question: can we use a normal distribution to approximate probabilities for \(X\text{?}\)
No
Checkpoint 3.4.14.
A binomial random variable \(X\) comes from a process with \(n=20\) trials in which the probability of a success is \(p=0.8\text{.}\)
Question: can we use a normal distribution to approximate probabilities for \(X\text{?}\)
No
Checkpoint 3.4.15.
A binomial random variable \(X\) comes from a process with \(20\) trials in which the probability of a success is \(p=0.60\text{.}\)
Question: can we use a normal distribution to approximate probabilities for \(X\text{?}\)
Yes
Subsection 3.4.3 Continuity Correction
There is one final issue to address before we are ready to start approximating binomial probabilities with the normal distribution. This issue has to do with the fact that we are using a continuous probability density curve to approximate a discrete random variable. To see why this may be a problem, consider the following picture. Suppose that the binomial random variable \(X\) (shown by the bar) is being approximated using a normal random variable \(Y\) (shown by the curve).
In the normal distribution, the probability \(P(Y=10)\) is exactly zero because it is a single line. In the binomial distribution, however, the entire green bar represents \(P(X=10)\text{.}\) If we wish to use the normal distribution to find \(P(X=10)\) in the binomial distribution, we need to translate this bar into a range of \(Y\) values. That range will extend from the left edge of the bar, at \(10 - 0.5\text{,}\) to the right edge of the bar, at \(10 + 0.5\text{.}\) Therefore,
\begin{equation*} P(X = 10) \approx P(9.5 \lt Y \lt 10.5)\text{.} \end{equation*}
When we change a whole number into a range like this we are correcting for the fact that we use a continuous random variable in place of a discrete random variable.
Definition 3.4.17.
When we add or subtract \(0.5\) to a whole number as we approximate a binomial probability using a normal probability distribution, we are using a continuity correction.
In the following examples, we will apply the continuity correction to translate a probability statement for a discrete random variable \(X\) into an approximately equivalent statement for a continuous random variable \(Y\text{.}\)
Example 3.4.18. Applying a Continuity Correction.
A binomial random variable \(X\) is to be approximated by a normal random variable \(Y\text{.}\) Convert each of the probability statements about a value or range of values for \(X\) into a statement about an approximately equivalent range of values for \(Y\text{.}\)
\(P(X \gt 26)\)
\(P(X \leq 60)\)
\(P(19 \leq X \lt 24)\)
To help us make the translation, we will draw example pictures for each one of these ranges. We will indicate the associated area under the normal curve using diagonal blue lines.
-
\(P(X \gt 26)\).
We need to take the bar for 26 in the binomial distribution and shade everything to the right of that bar, but not the bar itself. So, using the right edge of the bar, which is at 26.5, we translate this into \(P(Y > 26.5)\text{.}\)
Figure 3.4.19. -
\(P(X \leq 60)\).
In this example, we want to shade everything less than and including the bar for 60. Since we want to include that bar and everything below, we use the range \(P(Y \lt 60.5)\text{.}\)
Figure 3.4.20. -
\(P(19 \leq X \lt 24)\).
For the lower limit, we want to include the 19 bar, so we subtract 0.5 and start with 18.5. For the upper limit we do not want to include the bar for 24, so we only go up to 24 - 0.5, or 23.5. This makes the range \(P(18.5 \lt Y \lt 23.5)\text{.}\)
Figure 3.4.21.
One caution on using the continuity correction: it should only be applied when we are approximating a binomial distribution with a normal distribution. If we start with a normal distribution, then the variable is already continuous, and no correction is needed.
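The translation rules above can also be summarized in a few lines of code. The sketch below is only an illustration of the bookkeeping; the function and its arguments are our own naming and are not used elsewhere in this text.

```python
# Apply the continuity correction: shift each integer bound by 0.5 so that
# whole bars are either fully included or fully excluded. Illustrative only.
def corrected_bounds(lower=None, upper=None, include_lower=True, include_upper=True):
    low = high = None
    if lower is not None:
        low = lower - 0.5 if include_lower else lower + 0.5
    if upper is not None:
        high = upper + 0.5 if include_upper else upper - 0.5
    return low, high

print(corrected_bounds(lower=26, include_lower=False))            # P(X > 26)    -> Y > 26.5
print(corrected_bounds(upper=60))                                 # P(X <= 60)   -> Y < 60.5
print(corrected_bounds(lower=19, upper=24, include_upper=False))  # 19 <= X < 24 -> 18.5 < Y < 23.5
```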
Checkpoint 3.4.24.
A binomial random variable \(X\) is to be approximated by a normal random variable \(Y\) in order to find \(P(12 \lt X \leq 17)\text{.}\)
Question: which of the following is the correct range for \(Y\text{?}\)
\(P(11.5 \lt Y \lt 16.5)\)
\(P(11.5 \lt Y \lt 17.5)\)
\(P(12 \lt Y \lt 17)\)
\(P(12.5 \lt Y \lt 16.5)\)
\(P(12.5 \lt Y \lt 17.5)\)
(e)
Checkpoint 3.4.25.
A normal random variable \(Y\) is to be used to approximate \(P(X > 45)\) for a binomial random variable \(X\text{.}\)
Question: what is the approximately equivalent probability statement for \(Y\text{?}\)
\(P(Y>45.5)\)
Checkpoint 3.4.26.
When using a normal distribution to find \(P(a \lt X \lt b)\text{,}\) we should always use a continuity correction.
Question: is this statement true or false?
False
Subsection 3.4.4 Normal Approximations
We now have all of the tools in place to use a normal distribution to approximate probabilities for a binomial distribution. Recall that a binomial random variable has mean and standard deviation as shown below.
\begin{align*} \mu \amp = n\times p\\ \sigma \amp = \sqrt{n\times p\times q} \end{align*}
Theorem 3.4.27. Normal Approximation to the Binomial Distribution.
If \(X\) is a binomial random variable for a binomial process involving \(n\) trials in which the probability of a success in each trial is \(p\text{,}\) then probabilities for \(X\) can be approximated by a normal distribution with mean and standard deviation
\begin{align*} \mu \amp = n\times p\\ \sigma \amp = \sqrt{n\times p\times q} \end{align*}
provided that \(n\times p\) and \(n\times q\) are both greater than \(5\text{.}\)
So when we use a normal distribution to approximate a binomial probability, the mean of that normal distribution will be \(\mu = n\times p\) and the standard deviation will be \(\sigma = \sqrt{n\times p\times q}\text{.}\) Putting this together with the criteria for approximation and the continuity correction, we can solve examples such as the following.
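Putting the criteria, the continuity correction, and the z-score conversion together, a computation like the one in the next example might be sketched as follows. This is our own illustration, not the text's notation, and it assumes scipy is available.

```python
# Approximate P(X >= k) for a binomial X with n trials and success probability p.
# Assumes scipy; raises an error if the criteria for approximation fail.
from math import sqrt
from scipy.stats import norm

def approx_at_least(k, n, p):
    q = 1 - p
    if not (n * p > 5 and n * q > 5):
        raise ValueError("normal approximation is not appropriate here")
    mu, sigma = n * p, sqrt(n * p * q)
    # continuity correction: "at least k" for X becomes Y > k - 0.5
    return 1 - norm.cdf((k - 0.5 - mu) / sigma)

print(round(approx_at_least(168, 400, 0.35), 4))   # roughly 0.002; see Example 3.4.28 below
```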
Example 3.4.28. Approximating a Binomial Probability Involving “At Least”.
A binomial process involves \(400\) trials in which the probability of a success is \(p = 0.35\text{.}\) What is the probability that there are at least \(168\) successes in this process?
We first must be sure that we can use a normal approximation. To assess this, we check \(n\times p\) and \(n\times q\text{.}\)
\begin{equation*} n\times p = 400(0.35) = 140 \quad \text{ and } \quad n\times q = 400(0.65) = 260\text{.} \end{equation*}
Since both of these are greater than \(5\text{,}\) we can continue with a normal approximation.
Next, we need to compute the mean and standard deviation to use for the normal distribution. We have actually already found the mean above, but we repeat this computation together with that of the standard deviation below.
\begin{align*} \mu \amp = n\times p = 400(0.35) = 140\\ \sigma \amp = \sqrt{n\times p\times q} = \sqrt{400(0.35)(0.65)} \approx 9.5394\text{.} \end{align*}
Our final task is to use the normal distribution given by this mean and standard deviation to find \(P(X \geq 168)\text{.}\) We draw a picture to help us visualize the appropriate continuity correction, zooming in on the right tail of the distribution since 168 is above the mean of 140.
The picture and the z-score formula help us make the following computation.
\begin{align*} P(X \geq 168) \amp = P(Y \gt 167.5)\\ \amp = P\left(\frac{Y-\mu}{\sigma} \gt \frac{167.5 - 140}{9.5394}\right)\\ \amp = P(Z \gt 2.88)\\ \amp = 1 - P(Z \lt 2.88)\\ \amp = 1 - 0.9980\\ \amp = 0.0020\text{.} \end{align*}
Observe that in this question, we are actually using three different distributions. There is the binomial distribution for which we are actually wanting to find probabilities, represented by the variable \(X\text{.}\) Then there is the normal distribution we use to approximate it, represented by the variable \(Y\text{.}\) Finally, there is the standard normal distribution to which we convert in order to use the standard normal distribution table, represented by the variable \(Z\text{.}\) Our next example picks up where Example 3.4.1 left off.
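As a cross-check, again assuming scipy is available, the exact binomial probability can be computed directly and compared with the approximation above.

```python
# Exact value of P(X >= 168) for n = 400, p = 0.35, for comparison with the
# normal approximation of about 0.0020 found above. Assumes scipy.
from scipy.stats import binom

exact = 1 - binom.cdf(167, 400, 0.35)
print(round(exact, 4))   # should be close to the approximation above
```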
Example 3.4.30. Approximating a Binomial Probability Involving “No More Than”.
A recent study has determined that 32.2% of Americans are obese. A research group wishing to study this phenomenon samples \(12,000\) individuals in a large metropolitan area. What is the probability that no more than \(3750\) of these individuals are obese? Use a normal approximation.
Checking our criteria for approximating yields
\begin{equation*} n\times p = 12000(0.322) = 3864 \quad \text{ and } \quad n\times q = 12000(0.678) = 8136\text{.} \end{equation*}
Since both of these are greater than \(5\text{,}\) we may approximate this probability using a normal distribution.
That normal distribution will have a mean and standard deviation of
\begin{align*} \mu \amp = n\times p = 12000(0.322) = 3864\\ \sigma \amp = \sqrt{n\times p\times q} = \sqrt{12000(0.322)(0.678)} \approx 51.1839\text{.} \end{align*}
We note that “no more than 3750” means we want \(X\) to be less than or equal to 3750. So we draw the picture shown below to help us correctly apply the continuity correction, this time zooming in on the left tail since 3750 is less than the mean of 3864.
This gives us the following probability computation.
\begin{align*} P(X \leq 3750) \amp = P(Y \lt 3750.5)\\ \amp = P\left(\frac{Y-\mu}{\sigma} \lt \frac{3750.5 - 3864}{51.1839}\right)\\ \amp = P(Z \lt -2.22)\\ \amp = 0.0132\text{.} \end{align*}
Therefore, the probability of no more than \(3750\) obese individuals is approximately \(0.0132\text{.}\)
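This is the computation promised in Example 3.4.1, and it is also easy to verify by machine. The sketch below, which assumes scipy, compares the continuity-corrected normal approximation with the exact sum of 3751 binomial terms.

```python
# Compare the normal approximation of P(X <= 3750) with the exact binomial value
# for n = 12000, p = 0.322. Assumes scipy.
from math import sqrt
from scipy.stats import binom, norm

n, p = 12000, 0.322
mu, sigma = n * p, sqrt(n * p * (1 - p))
approx = norm.cdf((3750.5 - mu) / sigma)   # continuity-corrected normal approximation
exact = binom.cdf(3750, n, p)              # exact sum of 3751 binomial terms
print(round(approx, 4), round(exact, 4))   # both should be close to 0.0132
```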
Checkpoint 3.4.35.
A certain large hotel knows that about 7% of guests who make reservations on any given night will, for one reason or another, not show up to claim their room. Because of this, the hotel, which has 250 rooms, books a total of 260 reservations. Suppose that each of these 260 reservations can be treated as an independent Bernoulli trial.
Question: what is the probability that the hotel will be over-booked? Use a normal approximation to this binomial probability and give all four decimals of the probability from the standard normal distribution table.
0.0174
Checkpoint 3.4.36.
A binomial distribution has 500 trials and a probability of success \(p = 0.74\text{.}\) You wish to find \(P(X \lt 350)\text{.}\)
Question: what is the probability approximated by a normal distribution?
0.0183
Checkpoint 3.4.37.
A farmer knows that about 19% of his cherry crop will need to be used for juice, jams, or other products because the cherries will have split and be bruised. A large bin contains approximately 12,000 cherries. Suppose that inspecting each cherry can be thought of as an independent Bernoulli trial.
Question: what is the probability that a large bin contains more than 2400 bad cherries? Use a normal approximation to get this probability.
0.0026
Subsection 3.4.5 How Good are These Approximations?
We have made a point of identifying the probabilities we get from a normal distribution as approximations for the probabilities from a binomial distribution. Any time we are approximating, the question naturally arises, how good is that approximation? In the next several examples we will look at both the normal approximation and, with the help of a computer, the binomial probability to see how good these approximations really are.
Example 3.4.38. Approximating with Few Trials.
A baseball player gets a hit 63% of the time that he is at bat. Suppose that in a certain double header this player is at bat 14 times, and that these at-bats can be treated as a binomial process. Find the probability that he gets at least 10 hits using:
the binomial probability formula, and
a normal approximation.
From the problem statement, \(n = 14\text{,}\) \(p = 0.63\text{,}\) and \(q = 1-p=0.37\text{.}\) We want to compute \(P(X \geq 10)\text{.}\)
-
Using the binomial formula 3.2.34 yields:
\begin{align*} P(X \geq 10) \amp= P(X=10) + P(X=11) + P(X=12)\\ \amp\quad + P(X=13) + P(X=14)\\ \amp= C(14,10)(.63)^{10}(.37)^4 + C(14,11)(.63)^{11}(.37)^3\\ \amp\quad + C(14,12)(.63)^{12}(.37)^2+ C(14,13)(.63)^{13}(.37)^1\\ \amp\quad + C(14,14)(.63)^{14}(.37)^0\\ \amp\approx 0.1848 + 0.1144 + 0.0487 + 0.0128 + 0.0016\\ \amp= 0.3622\text{.} \end{align*} -
Now using a normal approximation, we first check \(n\times p\) and \(n\times q\text{.}\)
\begin{equation*} n\times p = 14(0.63) = 8.82 \quad \text{ and } \quad n\times q = 14(0.37) = 5.18\text{.} \end{equation*}Notice that \(n\times q\) in particular is only barely greater than 5. This means we can just barely use a normal approximation. Next, the mean and standard deviation are
\begin{align*} \mu \amp = n\times p = 8.82\\ \sigma \amp = \sqrt{n\times p\times q} = \sqrt{14(0.63)(0.37)} \approx 1.8065\text{.} \end{align*}We wish to know \(P(X \geq 10)\text{.}\) This region, along with the continuity correction, is illustrated below.
Figure 3.4.39. Normal Approximation Completing the computation, we get the following probability.
\begin{align*} P(X \geq 10) \amp = P(Y > 9.5)\\ \amp = P\left(\frac{Y-\mu}{\sigma} \gt \frac{9.5 - 8.82}{1.8065}\right)\\ \amp = P(Z \gt 0.38)\\ \amp = 1 - P(Z \lt 0.38)\\ \amp = 1 - 0.6480\\ \amp = 0.3520\text{.} \end{align*}The approximation of \(0.3520\) is close to the actual probability of \(0.3622\text{,}\) but we are about one one-hundredth off.
Notice that in this case, \(n\times p\) and \(n\times q\) were very close to 5. Therefore, the approximation was acceptable, but not great. In the next example, let's look at what happens when \(n\times p\) and \(n\times q\) are much larger than 5.
Example 3.4.40. Approximating with Many Trials.
Suppose that 13% of people are left-handed. In a school of 200 students, what is the probability that fewer than 20 students are left-handed? Find this probability using both:
the binomial probability formula, and
a normal approximation.
According to the problem, \(n = 200\text{,}\) \(p = 0.13\text{,}\) \(q = 1-0.13 = 0.87\text{,}\) and we want \(P(X \lt 20)\text{.}\)
-
Using the binomial probability formula, this means we need
\begin{align*} P(X=0) \amp + P(X=1) + P(X=2) + P(X=3) + P(X=4)\\ \amp + P(X=5) + P(X=6) + P(X=7) + P(X=8) + P(X=9)\\ \amp + P(X=10) + P(X=11) + P(X=12) + P(X=13) + P(X=14)\\ \amp + P(X=15) + P(X=16) + P(X=17) + P(X=18) + P(X=19)\text{.} \end{align*}In the interest of sanity, we used a computer (spreadsheet programs can perform these computations easily) to get \(P(X \lt 20) \approx 0.0817\text{.}\)
-
We first check that a normal approximation is appropriate.
\begin{equation*} n\times p = 200(0.13) = 26 \quad \text{ and } \quad n\times q = 200(0.87) = 174\text{.} \end{equation*}Both of these are much bigger than 5, so we expect a good approximation. Next the mean and standard deviation for the normal approximation must be computed.
\begin{align*} \mu \amp = n\times p = 200(0.13) = 26\\ \sigma \amp = \sqrt{n\times p\times q} = \sqrt{200(0.13)(0.87)} \approx 4.7560\text{.} \end{align*}Applying the continuity correction, we see that we need the shaded region shown below, which does not include \(X=20\text{.}\)
Figure 3.4.41. Normal Approximation This produces the following computation.
\begin{align*} P(X \lt 20) \amp = P(Y \lt 19.5)\\ \amp = P\left(\frac{Y-\mu}{\sigma} \lt \frac{19.5 - 26}{4.7560}\right)\\ \amp = P(Z \lt -1.37)\\ \amp = 0.0853\text{.} \end{align*}This approximation is accurate to within about four one-thousandths, much better than the one one-hundredth we saw in the previous example.
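The same comparison can be automated. The sketch below (scipy assumed, helper naming ours) reproduces both examples from this subsection side by side.

```python
# Exact binomial probability versus continuity-corrected normal approximation.
# Assumes scipy; the helper handles "at least k" and "fewer than k" tails.
from math import sqrt
from scipy.stats import binom, norm

def exact_and_approx(n, p, k, tail):
    mu, sigma = n * p, sqrt(n * p * (1 - p))
    if tail == "at least":            # P(X >= k) -> P(Y > k - 0.5)
        return 1 - binom.cdf(k - 1, n, p), 1 - norm.cdf((k - 0.5 - mu) / sigma)
    else:                             # "fewer than": P(X < k) -> P(Y < k - 0.5)
        return binom.cdf(k - 1, n, p), norm.cdf((k - 0.5 - mu) / sigma)

print(exact_and_approx(14, 0.63, 10, "at least"))      # Example 3.4.38: exact is about 0.3622
print(exact_and_approx(200, 0.13, 20, "fewer than"))   # Example 3.4.40: exact is about 0.0817
```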
Checkpoint 3.4.44.
The normal approximation to the binomial distribution is always accurate to at least four decimal places.
Question: is the above statement true or false?
False
Checkpoint 3.4.45.
A normal approximation to a binomial probability will be especially good when \(n\times p\) and \(n\times q\) are very close to 5.
Question: is the above statement true or false?
False
Checkpoint 3.4.46.
If it is just as easy to use the binomial probability formula to compute a binomial probability as it would be to use a normal approximation, then we should use the binomial probability formula.
Question: is the above statement true or false?
True