← Year 12: Statistical Analysis
What is the normal distribution, and how does the empirical rule give the percentage of data within , and standard deviations?
Recognise the features of the normal distribution and apply the empirical -- rule
A focused answer to the HSC Maths Standard 2 dot point on the normal distribution. The bell-shaped curve, the empirical -- rule, mean and standard deviation as the two parameters, and worked Australian examples for heights, exam marks and manufacturing quality control.
Have a quick question? Jump to the Q&A page
What this dot point is asking
NESA wants you to recognise the features of the normal distribution (symmetric, bell-shaped, characterised by mean and standard deviation), apply the empirical -- rule to find percentages of data within standard-deviation bands, and use this to solve practical problems.
The answer
The normal distribution
The normal distribution (or bell curve) is a continuous probability distribution that arises naturally in many contexts (heights, exam scores, measurement errors, manufacturing variation). Its shape is fully specified by two parameters:
- IMATH_6 (mu): the mean.
- IMATH_7 (sigma): the standard deviation.
Key properties:
- Symmetric about the mean.
- Mean = median = mode = .
- Highest at , then falls off on both sides.
- Total area under the curve is (it is a probability density).
- Extends from to (in principle).
The empirical rule (68-95-99.7)
For any normal distribution:
- About of values lie within standard deviation of the mean: .
- About lie within standard deviations: .
- About lie within standard deviations: .
By symmetry, the tails (one side) are half the outside-the-band amount:
- Above : .
- Above : .
- Above : .
Common standard-deviation regions
| Region | Percentage |
|---|---|
| Between and IMATH_29 | IMATH_30 |
| Between and IMATH_32 | IMATH_33 (half of ) |
| Between and IMATH_36 | IMATH_37 |
| Between and IMATH_39 | IMATH_40 |
| Above IMATH_41 | IMATH_42 |
These add up: from the mean to the right tail.
Applying the rule
To find the percentage of data in some range:
- Express each endpoint in terms of standard deviations from the mean.
- Use the empirical rule values or the region table.
- Sum or subtract regions as needed.
If endpoints are not at exact integer standard deviations from the mean, Standard 2 expects you to use z-scores and a calculator (covered in the next dot point).
When the normal distribution applies
- Natural variation. Adult heights, weights, exam scores in large populations, IQ scores.
- Measurement error. Repeated measurements of the same quantity.
- Manufacturing. Weights of mass-produced items, dimensions of components.
- Sums and averages. By the Central Limit Theorem (covered in Maths Advanced), means of large samples are approximately normal regardless of the original distribution.
The normal distribution is the default model whenever a quantity is the result of many small, independent influences adding up.
Past exam questions, worked
Real questions from past NESA papers on this dot point, with our answer explainer.
2022 HSC Q153 marksThe heights of Year 12 boys at a Sydney school are normally distributed with mean cm and standard deviation cm. What percentage are taller than cm?Show worked answer →
cm is standard deviations above the mean.
By the empirical rule, of values lie within standard deviations of the mean, so lie outside this band.
By symmetry, half of that () is above cm.
So about of Year 12 boys are taller than cm.
Markers reward the number of standard deviations from the mean, the application of inside, and the halving by symmetry.
2021 HSC Q164 marksA factory produces bags of rice with a mean weight of kg and standard deviation g. The weights are normally distributed. (a) What percentage of bags weigh between g and g? (b) What percentage weigh between g and g?Show worked answer →
Convert standard deviation to grams: g.
(a) g is SD below the mean ( g), and g is SD above. By the empirical rule, of bags lie in this range.
(b) g is SD below, and g is SD above the mean. By the empirical rule, of bags lie in this range.
Markers reward both: the identification of how many SDs each endpoint is from the mean, and the empirical rule percentage.
Related dot points
- Calculate z-scores and use them to compare values from different normal distributions and find probabilities
A focused answer to the HSC Maths Standard 2 dot point on z-scores. The formula , interpreting z-scores as standard-deviation distances, comparing observations from different normal distributions, and worked examples from exam scores and Australian salary data.
- Find and use the equation of the least-squares regression line to model a linear relationship between two variables
A focused answer to the HSC Maths Standard 2 dot point on the least-squares regression line. The equation , finding the gradient and intercept using calculator statistics functions, interpreting the gradient in context, and worked Australian examples.
- Calculate and interpret Pearson's correlation coefficient using statistical technology, including the sign and magnitude
A focused answer to the HSC Maths Standard 2 dot point on Pearson's correlation coefficient. What measures, how to interpret its sign and magnitude, the limitations of in non-linear relationships, and how to compute it using calculator statistics functions.