How do we use the normal distribution and z-scores to compute probabilities and compare observations?
Use the normal distribution, z-scores, the empirical rule and the standard normal table to find probabilities and percentiles
A focused answer to the HSC Maths Advanced dot point on the normal distribution. Standardising with z-scores, the 68-95-99.7 empirical rule, computing probabilities as areas under the curve and inverse-normal percentiles, with worked examples and exam traps.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
What this dot point is asking
NESA wants you to standardise normally distributed data using z-scores, apply the -- empirical rule, find probabilities as areas under the bell curve, and find percentiles using the standard normal. You also need to interpret z-scores when comparing observations from different distributions. The single idea that ties it all together is that a probability for a normal variable is an area under its curve, and standardising lets you read every such area off one fixed curve, the standard normal.
The answer
The normal distribution
A continuous random variable is normally distributed with mean and standard deviation , written , if its probability density function is
You are never asked to integrate this by hand; what matters is what its graph looks like and how to read areas off it. Key features:
- The graph is a bell curve symmetric about .
- is the mean, median and mode (all at the centre by symmetry).
- controls the spread: a larger gives a flatter, wider curve, a smaller a taller, narrower one.
- The total area under the curve is , so the area over any interval is the probability of landing in that interval.
The case , is the standard normal, denoted . Every normal calculation is turned into one about this single curve by standardising.
z-scores
The z-score of a value measures how many standard deviations it is from the mean:
If , then . The two operations in the formula have a clear meaning: subtracting slides the value so the mean sits at , and dividing by rescales so one standard deviation becomes one unit. A value above the mean has a positive z-score, one below has a negative z-score, and the sign must be kept.
z-scores let you compare observations from different distributions on the same scale, because they strip out the original mean and spread. A higher z-score means "further above the mean in standard-deviation units", regardless of what the raw units were.
The empirical rule (68-95-99.7)
For any normal distribution,
- about of values lie within standard deviation of the mean (),
- about within standard deviations (),
- about within standard deviations ().
These are the central two-sided bands shown in the figure above. Because the curve is symmetric, halve them to get one-sided areas from the mean: , , . Tail probabilities are the complement of half the band: , , .
Computing probabilities as areas
For and , the probability of landing between and is the area under the curve over that interval, and standardising turns it into a standard-normal area:
In the exam, the empirical rule covers the common endpoints (whole numbers of standard deviations). For other endpoints, use the standard normal table or the calculator's normalcdf function. The picture is always the same: shade the region you want, then assemble it from the areas you know.
A single cut-off. The cumulative probability is the whole area to the left of . For example the area left of is about (half the curve, , plus the between and ).
Between two cut-offs. When the two endpoints straddle the mean, split the area at and add the two empirical-rule halves. For from to , the area is .
Inverse problems (percentiles)
The reverse question gives you an area and asks for the cut-off value. To find such that , find the corresponding from a table or invNorm, then untransform with . The most-quoted z-scores are the th percentile , the th , and the th . The diagram below shows the th-percentile cut-off: the upper tail beyond has area , so of the area lies to its left.
How exam questions ask about the normal distribution
- "What percentage lie within / between ...?" Express the endpoints as whole numbers of standard deviations and read the empirical rule. If they straddle the mean, split at .
- "Find the probability that is between and ." Standardise both endpoints, sketch and shade the region, then assemble it from empirical-rule halves (or normalcdf for awkward endpoints).
- "Find the probability that exceeds / is less than a value." A one-sided area: use a half plus or minus an empirical-rule piece, or the symmetry .
- "Compare these two scores from different tests." Convert each to a z-score; the larger z-score is the better relative performance.
- "Find the value below which lie" or "the th percentile." This is the inverse: get for that area, then .
Edge cases worth knowing
- The variance versus the standard deviation. states the variance, but z-scores and the empirical rule use . If a question gives , then before you do anything else.
- A value that is not a whole number of standard deviations. The empirical rule only covers , , standard deviations. For or you need the standard normal table or normalcdf, not the rule.
- "At least" and "at most". For a continuous variable and , because a single point has zero probability, so do not fuss over strict versus non-strict inequalities.
- Symmetry shortcuts. and . Use the mirror image rather than computing a tail twice.
Exam-style practice questions
Practice questions written in the style of NESA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
2022 HSC Q294 marksTest marks are normally distributed with mean and standard deviation . Find the probability that a randomly chosen student scores between and .Show worked answer →
Standardise the endpoints with .
, .
By the empirical rule, , so . Similarly .
.
Markers reward correct standardisation, splitting the interval at , and applying the empirical rule values cleanly. A calculator's normalcdf gives as a precise answer.
2021 HSC Q283 marksA continuous variable is normally distributed with mean and standard deviation . Approximately what percentage of values lie between and ? Between and ?Show worked answer →
and are one standard deviation either side of the mean, so by the empirical rule about of values lie in this range.
and are two standard deviations either side, so about of values lie in this range.
Markers expect explicit identification of how many from the mean each endpoint is, and the corresponding empirical rule percentage.
Related dot points
- Use probability density functions and cumulative distribution functions to find probabilities, medians, modes, means and variances of continuous random variables
A focused answer to the HSC Maths Advanced dot point on continuous random variables. Probability density functions, cumulative distribution functions, computing probabilities by integration, and finding mean, median, mode and variance, with worked examples.
- Define a discrete random variable by its probability distribution, and calculate the expected value, variance and standard deviation
A focused answer to the HSC Maths Advanced dot point on discrete random variables. Probability distributions, expected value, variance, standard deviation, and linear transformations of a discrete random variable, with worked examples.
- Construct scatter plots, calculate and interpret Pearson's correlation coefficient, and fit and use the least-squares regression line
A focused answer to the HSC Maths Advanced dot point on bivariate data. Scatter plots, the Pearson correlation coefficient, the least-squares regression line, prediction, and the limits of extrapolation, with worked examples and exam traps.
- Find antiderivatives of standard functions, apply integration by substitution and evaluate definite integrals using the Fundamental Theorem of Calculus
A focused answer to the HSC Maths Advanced dot point on integration. Antiderivatives of standard functions, integration by substitution, definite integrals and the Fundamental Theorem of Calculus, with worked examples.