← Year 12: Statistical Analysis
How do we use the normal distribution and z-scores to compute probabilities and compare observations?
Use the normal distribution, z-scores, the empirical rule and the standard normal table to find probabilities and percentiles
A focused answer to the HSC Maths Advanced dot point on the normal distribution. Standardising with z-scores, the 68-95-99.7 empirical rule, computing probabilities and inverse-normal percentiles, with worked examples and exam traps.
Have a quick question? Jump to the Q&A page
What this dot point is asking
NESA wants you to standardise normally distributed data using z-scores, apply the -- empirical rule, find probabilities and percentiles using the standard normal, and interpret z-scores when comparing observations from different distributions.
The answer
The normal distribution
A continuous random variable is normally distributed with mean and standard deviation , written , if its pdf is
Key features:
- The graph is a bell curve symmetric about .
- IMATH_11 is the mean, median and mode.
- IMATH_12 controls the spread: larger gives a flatter, wider curve.
- The total area under the curve is .
The case , is the standard normal, denoted .
z-scores
The z-score of a value measures how many standard deviations it is from the mean:
If , then . Standardising turns any normal calculation into one about the standard normal.
z-scores let you compare observations from different distributions on the same scale. A higher z-score is "further above the mean in standard deviation units".
The empirical rule (68-95-99.7)
For any normal distribution,
- about of values lie within standard deviation of the mean (),
- about within standard deviations (),
- about within standard deviations ().
By symmetry, , , .
Tail probabilities are the complement: , , .
Computing probabilities
For and :
In the exam, the empirical rule covers the common endpoints. For other endpoints, use the standard normal table or the calculator's normalcdf function.
Inverse problems (percentiles)
To find the value such that , find the corresponding from a table or invNorm, then transform: . The 90th percentile of is , the 95th is , the 97.5th is .
Worked examples
Direct use of the empirical rule
Heights of adult males in a city are normally distributed with cm and cm. About what percentage of men are taller than cm?
. So we need , or .
Two-sided interval
For the same distribution, what percentage are between and cm?
These are , so by the empirical rule about .
Mixed empirical-rule interval
For (so ), find .
, .
.
Comparing two distributions with z-scores
Two students sit different tests. Alex scores on a test with , . Sam scores on a test with , . Who performed better relative to their cohort?
Alex: . Sam: .
Same z-score, so they performed equally well relative to their cohorts.
Inverse normal
For (so ), find the value below which of the data lies.
The 95th percentile of is , so .
Common traps
Standardising with the wrong sign. . A value below the mean has a negative z-score. Do not drop the sign.
Confusing and . uses the variance, but the empirical rule and z-score use . If a question gives , then .
Forgetting symmetry. . Use the symmetry of the bell curve rather than computing tails twice.
Adding empirical rule pieces incorrectly. is for the full two-sided interval. The one-sided half is . Do not double-count the central area.
Applying the empirical rule to non-normal data. The -- rule is specific to the normal distribution. For other shapes you must use other methods.
In one sentence
For , standardise with to convert to the standard normal, then use the empirical rule, a table, or normalcdf or invNorm on a calculator to find probabilities and percentiles.
Past exam questions, worked
Real questions from past NESA papers on this dot point, with our answer explainer.
2022 HSC Q294 marksTest marks are normally distributed with mean $70$ and standard deviation $8$. Find the probability that a randomly chosen student scores between $62$ and $86$.Show worked answer →
Standardise the endpoints with .
, .
By the empirical rule, , so . Similarly .
.
Markers reward correct standardisation, splitting the interval at , and applying the empirical rule values cleanly. A calculator's normalcdf gives as a precise answer.
2021 HSC Q283 marksA continuous variable is normally distributed with mean $\mu = 100$ and standard deviation $\sigma = 15$. Approximately what percentage of values lie between $85$ and $115$? Between $70$ and $130$?Show worked answer →
and are one standard deviation either side of the mean, so by the empirical rule about of values lie in this range.
and are two standard deviations either side, so about of values lie in this range.
Markers expect explicit identification of how many from the mean each endpoint is, and the corresponding empirical rule percentage.
Related dot points
- Use probability density functions and cumulative distribution functions to find probabilities, medians, modes, means and variances of continuous random variables
A focused answer to the HSC Maths Advanced dot point on continuous random variables. Probability density functions, cumulative distribution functions, computing probabilities by integration, and finding mean, median, mode and variance, with worked examples.
- Define a discrete random variable by its probability distribution, and calculate the expected value, variance and standard deviation
A focused answer to the HSC Maths Advanced dot point on discrete random variables. Probability distributions, expected value, variance, standard deviation, and linear transformations of a discrete random variable, with worked examples.