When and how do we use the normal distribution to approximate binomial probabilities?
Use the normal approximation X∼N(np,np(1−p)) to approximate binomial probabilities for large n
A focused answer to the HSC Maths Extension 1 dot point on the normal approximation of the binomial. The rule of thumb np≥5 and n(1−p)≥5, continuity correction, standardising and computing approximate probabilities, with worked examples.
✦ Generated by Claude Opus 4.8·12 min answer·
Reviewed by: AI editorial process; not yet individually human-reviewed
NESA wants you to recognise when the binomial distribution can be approximated by a normal distribution, write down the approximating normal N(np,np(1−p)), apply the continuity correction, and compute approximate probabilities using z-scores.
The answer
The result
If X∼B(n,p) with n "large" (rule of thumb: both np≥5 and n(1−p)≥5), then
X≈N(μ,σ2)withμ=np,σ2=np(1−p).
This is a consequence of the central limit theorem: the binomial is a sum of n independent identically distributed Bernoulli trials, and sums of many i.i.d. random variables tend to a normal distribution.
Watching the bell curve appear, stage by stage
The approximation is not a coincidence: as the number of trials grows, the binomial bars settle into the shape of a bell curve with the same mean and standard deviation. These panels overlay the matching normal N(np,np(1−p)) on the bars of B(n,0.5) for growing n.
Stage 1, a small n is already close. With n=10 the bars are wide and chunky, yet the bell curve N(5,2.5) already passes through their tops. The fit is rough in the tails but the centre is good.
Stage 2, more trials, narrower bars. At n=30 (mean 15, standard deviation 2.74) there are more, thinner bars and they hug the curve much more closely. This is the central limit theorem taking hold: summing more Bernoulli trials pulls the shape toward the normal.
Stage 3, a close fit you can compute with. By n=50 (mean 25, standard deviation 3.54) the bars and the curve are almost indistinguishable, so the area under the normal curve is a reliable stand-in for a sum of binomial probabilities. This is exactly when replacing a long pmf sum with one or two z-lookups is justified.
When the approximation works
The approximation is good when:
n is large.
p is not too close to 0 or 1 (which makes the distribution very skewed).
The HSC rule of thumb is np≥5 and n(1−p)≥5. With n=100, this works for 0.05≤p≤0.95. With n=20, it works for roughly 0.25≤p≤0.75.
When p is very small and n is large, the Poisson approximation is more appropriate, but that is beyond HSC Extension 1.
Continuity correction
The binomial is discrete; the normal is continuous. To improve the approximation, adjust the boundary by ±0.5.
For X∼B(n,p) and k an integer,
P(X≤k)≈P(Z≤σk+0.5−μ),
P(X≥k)≈P(Z≥σk−0.5−μ),
P(X=k)≈P(σk−0.5−μ≤Z≤σk+0.5−μ).
The half-unit shift accounts for the fact that the binomial X=k corresponds to the interval [k−0.5,k+0.5] in the continuous picture. The diagram below zooms in on the bars near k=55: the bar for X=55 stretches across the continuous interval [54.5,55.5], so P(X≤55) keeps the whole of that bar and the boundary the normal curve should use is 55.5, not 55.
The direction is the part to get right: for an upper bound ("at most k", "≤k") push the boundary out to k+0.5; for a lower bound ("at least k", "≥k") push it out to k−0.5. Either way you are growing the interval to include the whole of the boundary bar.
Standardising
For a normally distributed X with mean μ and standard deviation σ, the standardised value is
Z=σX−μ.
Look up the probability in a standard normal table (or use arcsin-style estimates for HSC if a table is not provided).
When to use the approximation
For HSC Extension 1, use it when:
n is large enough that summing pmf values is tedious.
The question asks for a numerical answer (not exact).
The question explicitly says "using the normal approximation".
Otherwise, compute the binomial directly.
How exam questions ask about the normal approximation
These questions are heavily signposted, so the marks are in following the standard sequence without skipping a step:
"Using the normal approximation, estimate P(…)": the explicit instruction to approximate. Quote μ=np and σ=np(1−p), apply the continuity correction, standardise, and look up the z-value.
"Explain why the normal approximation is appropriate / valid": check and state both conditions, np≥5 and n(1−p)≥5. This is usually a 1 mark justification.
"A coin is tossed 100 times ... probability of at least 60 heads": a large-n "at least" or "at most" probability that would need a huge pmf sum, the classic cue to approximate. Use the continuity-corrected boundary (59.5 here for X≥60).
"State the mean and standard deviation of [the count]": just np and np(1−p); sometimes the lead-in to a later approximation part.
A given table value such as "use P(Z≤1.9)≈0.9713": a strong hint about which z-score you should be standardising to; if your z does not match, recheck the continuity correction and the arithmetic.
Exam-style practice questions
Practice questions written in the style of NESA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
HSC-style4 marksA fair coin is tossed 100 times. Using the normal approximation with a continuity correction, estimate the probability of getting at least 60 heads. Use P(Z≤1.9)≈0.9713.
Show worked answer →
Let X∼B(100,0.5). Then μ=np=50 and σ2=np(1−p)=25, so σ=5.
Both np=50≥5 and n(1−p)=50≥5, so the approximation is valid.
For P(X≥60) apply the continuity correction with 60−0.5=59.5:
P(X≥60)≈P(Z≥559.5−50)=P(Z≥1.9).
P(Z≥1.9)=1−P(Z≤1.9)≈1−0.9713=0.0287.
HSC-style3 marksA biased die shows a six with probability 61. It is rolled 180 times. State the mean and standard deviation of the number of sixes, and explain why the normal approximation is appropriate.
Show worked answer →
Let X∼B(180,61) count the sixes.
Mean: μ=np=180×61=30.
Variance: σ2=np(1−p)=180×61×65=25, so σ=5.
The approximation is appropriate because np=30≥5 and n(1−p)=150≥5, so the binomial is well approximated by N(30,25).