Year 12: Statistical Analysis

NSWMaths Standard 2Syllabus dot point

What is the normal distribution, and how does the empirical rule give the percentage of data within 11, 22 and 33 standard deviations?

Recognise the features of the normal distribution and apply the empirical 6868-9595-99.799.7 rule

A focused answer to the HSC Maths Standard 2 dot point on the normal distribution. The bell-shaped curve, the empirical 6868-9595-99.799.7 rule, mean and standard deviation as the two parameters, and worked Australian examples for heights, exam marks and manufacturing quality control.

Generated by Claude OpusReviewed by Better Tuition Academy7 min answer

Have a quick question? Jump to the Q&A page

What this dot point is asking

NESA wants you to recognise the features of the normal distribution (symmetric, bell-shaped, characterised by mean and standard deviation), apply the empirical 6868-9595-99.799.7 rule to find percentages of data within standard-deviation bands, and use this to solve practical problems.

The answer

Normal distribution bell curve with the empirical rule A symmetric bell-shaped curve, drawn from a true exp minus x squared over two function, centred at the mean mu. Vertical lines fall from the curve to the axis at one, two, and three standard deviations either side. About 68 percent of values lie within plus or minus one standard deviation, 95 percent within two, and 99.7 percent within three. Tail percentages 2.35 percent, 13.5 percent, 34 percent are shown under each region. μ−3σ μ−2σ μ−σ μ μ+σ μ+2σ μ+3σ 68% 95% 99.7% 34% 34% 13.5% 13.5% 2.35% 2.35% Curve from f(x) = e^(-x²/2) sampled every 0.5σ; areas labelled use the 68-95-99.7 rule.

The normal distribution

The normal distribution (or bell curve) is a continuous probability distribution that arises naturally in many contexts (heights, exam scores, measurement errors, manufacturing variation). Its shape is fully specified by two parameters:

  • IMATH_6 (mu): the mean.
  • IMATH_7 (sigma): the standard deviation.

Key properties:

  • Symmetric about the mean.
  • Mean = median = mode = μ\mu.
  • Highest at x=μx = \mu, then falls off on both sides.
  • Total area under the curve is 11 (it is a probability density).
  • Extends from -\infty to ++\infty (in principle).

The empirical rule (68-95-99.7)

For any normal distribution:

  • About 68%68\% of values lie within 11 standard deviation of the mean: μ±σ\mu \pm \sigma.
  • About 95%95\% lie within 22 standard deviations: μ±2σ\mu \pm 2\sigma.
  • About 99.7%99.7\% lie within 33 standard deviations: μ±3σ\mu \pm 3\sigma.

By symmetry, the tails (one side) are half the outside-the-band amount:

  • Above μ+σ\mu + \sigma: 100682=16%\frac{100 - 68}{2} = 16\%.
  • Above μ+2σ\mu + 2\sigma: 100952=2.5%\frac{100 - 95}{2} = 2.5\%.
  • Above μ+3σ\mu + 3\sigma: 10099.72=0.15%\frac{100 - 99.7}{2} = 0.15\%.

Common standard-deviation regions

Region Percentage
Between μ1σ\mu - 1\sigma and IMATH_29 IMATH_30
Between μ\mu and IMATH_32 IMATH_33 (half of 68%68\%)
Between μ+1σ\mu + 1\sigma and IMATH_36 IMATH_37
Between μ+2σ\mu + 2\sigma and IMATH_39 IMATH_40
Above IMATH_41 IMATH_42

These add up: 0.15+2.35+13.5+34=50%0.15 + 2.35 + 13.5 + 34 = 50\% from the mean to the right tail.

Applying the rule

To find the percentage of data in some range:

  1. Express each endpoint in terms of standard deviations from the mean.
  2. Use the empirical rule values or the region table.
  3. Sum or subtract regions as needed.

If endpoints are not at exact integer standard deviations from the mean, Standard 2 expects you to use z-scores and a calculator (covered in the next dot point).

When the normal distribution applies

  • Natural variation. Adult heights, weights, exam scores in large populations, IQ scores.
  • Measurement error. Repeated measurements of the same quantity.
  • Manufacturing. Weights of mass-produced items, dimensions of components.
  • Sums and averages. By the Central Limit Theorem (covered in Maths Advanced), means of large samples are approximately normal regardless of the original distribution.

The normal distribution is the default model whenever a quantity is the result of many small, independent influences adding up.

Past exam questions, worked

Real questions from past NESA papers on this dot point, with our answer explainer.

2022 HSC Q153 marksThe heights of Year 12 boys at a Sydney school are normally distributed with mean 175175 cm and standard deviation 77 cm. What percentage are taller than 189189 cm?
Show worked answer →

189189 cm is 1891757=2\frac{189 - 175}{7} = 2 standard deviations above the mean.

By the empirical rule, 95%95\% of values lie within 22 standard deviations of the mean, so 5%5\% lie outside this band.

By symmetry, half of that (2.5%2.5\%) is above 189189 cm.

So about 2.5%2.5\% of Year 12 boys are taller than 189189 cm.

Markers reward the number of standard deviations from the mean, the application of 95%95\% inside, and the halving by symmetry.

2021 HSC Q164 marksA factory produces bags of rice with a mean weight of 11 kg and standard deviation 2020 g. The weights are normally distributed. (a) What percentage of bags weigh between 980980 g and 10201020 g? (b) What percentage weigh between 940940 g and 10601060 g?
Show worked answer →

Convert standard deviation to grams: σ=20\sigma = 20 g.

(a) 980980 g is 11 SD below the mean (10001000 g), and 10201020 g is 11 SD above. By the empirical rule, 68%68\% of bags lie in this range.

(b) 940940 g is 33 SD below, and 10601060 g is 33 SD above the mean. By the empirical rule, 99.7%99.7\% of bags lie in this range.

Markers reward both: the identification of how many SDs each endpoint is from the mean, and the empirical rule percentage.

Related dot points