Skip to main content
NSWMaths AdvancedSyllabus dot point

How do we use the normal distribution and z-scores to compute probabilities and compare observations?

Use the normal distribution, z-scores, the empirical rule and the standard normal table to find probabilities and percentiles

A focused answer to the HSC Maths Advanced dot point on the normal distribution. Standardising with z-scores, the 68-95-99.7 empirical rule, computing probabilities as areas under the curve and inverse-normal percentiles, with worked examples and exam traps.

Generated by Claude Opus 4.814 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

What this dot point is asking

NESA wants you to standardise normally distributed data using z-scores, apply the 6868-9595-99.799.7 empirical rule, find probabilities as areas under the bell curve, and find percentiles using the standard normal. You also need to interpret z-scores when comparing observations from different distributions. The single idea that ties it all together is that a probability for a normal variable is an area under its curve, and standardising lets you read every such area off one fixed curve, the standard normal.

The answer

The standard normal curve with the empirical ruleA symmetric bell curve centred at the mean. The central band within one standard deviation holds about 68 percent of the area, within two standard deviations about 95 percent, within three about 99.7 percent.μ−3σμ−2σμ−σμμ+σμ+2σμ+3σ68%95%99.7%About 68%, 95% and 99.7% of values lie within 1, 2 and 3standard deviations of the mean for any normal distribution.

The normal distribution

A continuous random variable XX is normally distributed with mean μ\mu and standard deviation σ\sigma, written XN(μ,σ2)X \sim N(\mu, \sigma^2), if its probability density function is

f(x)=1σ2πexp((xμ)22σ2).f(x) = \frac{1}{\sigma \sqrt{2 \pi}} \exp\left( -\frac{(x - \mu)^2}{2 \sigma^2} \right).

You are never asked to integrate this by hand; what matters is what its graph looks like and how to read areas off it. Key features:

  • The graph is a bell curve symmetric about x=μx = \mu.
  • μ\mu is the mean, median and mode (all at the centre by symmetry).
  • σ\sigma controls the spread: a larger σ\sigma gives a flatter, wider curve, a smaller σ\sigma a taller, narrower one.
  • The total area under the curve is 11, so the area over any interval is the probability of landing in that interval.

The case μ=0\mu = 0, σ=1\sigma = 1 is the standard normal, denoted ZN(0,1)Z \sim N(0, 1). Every normal calculation is turned into one about this single curve by standardising.

z-scores

The z-score of a value xx measures how many standard deviations it is from the mean:

z=xμσ.z = \frac{x - \mu}{\sigma}.

If XN(μ,σ2)X \sim N(\mu, \sigma^2), then Z=XμσN(0,1)Z = \frac{X - \mu}{\sigma} \sim N(0, 1). The two operations in the formula have a clear meaning: subtracting μ\mu slides the value so the mean sits at 00, and dividing by σ\sigma rescales so one standard deviation becomes one unit. A value above the mean has a positive z-score, one below has a negative z-score, and the sign must be kept.

z-scores let you compare observations from different distributions on the same scale, because they strip out the original mean and spread. A higher z-score means "further above the mean in standard-deviation units", regardless of what the raw units were.

Standardise a value to a z-scoreA mark of 88 on a test with mean 70 and standard deviation 12 maps down to a z-score of 1.5 on the standard scale by subtracting the mean and dividing by the standard deviation.X4658708294μ = 7088z−3−2−10123z = 1.5z = (88 − 70) / 12 = 1.5Subtract the mean, then divide by σ: the mark moves ontothe common z-scale where the empirical rule applies.

The empirical rule (68-95-99.7)

For any normal distribution,

  • about 68%68\% of values lie within 11 standard deviation of the mean (μ±σ\mu \pm \sigma),
  • about 95%95\% within 22 standard deviations (μ±2σ\mu \pm 2 \sigma),
  • about 99.7%99.7\% within 33 standard deviations (μ±3σ\mu \pm 3 \sigma).

These are the central two-sided bands shown in the figure above. Because the curve is symmetric, halve them to get one-sided areas from the mean: P(0Z1)0.34P(0 \le Z \le 1) \approx 0.34, P(0Z2)0.475P(0 \le Z \le 2) \approx 0.475, P(0Z3)0.4985P(0 \le Z \le 3) \approx 0.4985. Tail probabilities are the complement of half the band: P(Z>1)0.16P(Z > 1) \approx 0.16, P(Z>2)0.025P(Z > 2) \approx 0.025, P(Z>3)0.0015P(Z > 3) \approx 0.0015.

Computing probabilities as areas

For XN(μ,σ2)X \sim N(\mu, \sigma^2) and a<ba < b, the probability of landing between aa and bb is the area under the curve over that interval, and standardising turns it into a standard-normal area:

P(aXb)=P(aμσZbμσ).P(a \le X \le b) = P\left( \frac{a - \mu}{\sigma} \le Z \le \frac{b - \mu}{\sigma} \right).

In the exam, the empirical rule covers the common endpoints (whole numbers of standard deviations). For other endpoints, use the standard normal table or the calculator's normalcdf function. The picture is always the same: shade the region you want, then assemble it from the areas you know.

A single cut-off. The cumulative probability P(Zz)P(Z \le z) is the whole area to the left of zz. For example the area left of z=1z = 1 is about 0.840.84 (half the curve, 0.50.5, plus the 0.340.34 between 00 and 11).

A probability is an area under the curveThe bell curve with the entire region to the left of z equals 1 shaded, representing the cumulative probability that Z is at most 1, which is about 0.84.−3−2−10231P(Z ≤ 1)≈ 0.84Step 1The shaded area left of z = 1 is P(Z ≤ 1) ≈ 0.84.Total area under the curve is 1.

Between two cut-offs. When the two endpoints straddle the mean, split the area at 00 and add the two empirical-rule halves. For zz from 1-1 to 22, the area is P(1Z0)+P(0Z2)0.34+0.475=0.815P(-1 \le Z \le 0) + P(0 \le Z \le 2) \approx 0.34 + 0.475 = 0.815.

Probability between two z-scoresThe bell curve with the region between z equals minus 1 and z equals 2 shaded, representing the probability that Z lies between minus 1 and 2, about 0.815.−3−203−12P(−1 ≤ Z ≤ 2)≈ 0.815Step 2Area between z = −1 and z = 2: split at 0 and addthe two empirical-rule halves, 0.34 + 0.475 = 0.815.

Inverse problems (percentiles)

The reverse question gives you an area and asks for the cut-off value. To find xx such that P(Xx)=pP(X \le x) = p, find the corresponding zz from a table or invNorm, then untransform with x=μ+zσx = \mu + z \sigma. The most-quoted z-scores are the 9090th percentile z1.28z \approx 1.28, the 9595th z1.645z \approx 1.645, and the 97.597.5th z1.96z \approx 1.96. The diagram below shows the 9595th-percentile cut-off: the upper tail beyond z=1.645z = 1.645 has area 0.05\approx 0.05, so 95%95\% of the area lies to its left.

A tail probabilityThe bell curve with the upper tail beyond z equals 1.645 shaded, representing the probability that Z exceeds 1.645, about 0.05, the 95th percentile cut-off.−3−2−10123P(Z > 1.645)≈ 0.05z=1.645Step 3The upper tail beyond z = 1.645 has area ≈ 0.05.So z = 1.645 is the 95th percentile of Z.

How exam questions ask about the normal distribution

  • "What percentage lie within / between ...?" Express the endpoints as whole numbers of standard deviations and read the empirical rule. If they straddle the mean, split at 00.
  • "Find the probability that XX is between aa and bb." Standardise both endpoints, sketch and shade the region, then assemble it from empirical-rule halves (or normalcdf for awkward endpoints).
  • "Find the probability that XX exceeds / is less than a value." A one-sided area: use a half plus or minus an empirical-rule piece, or the symmetry P(Zz)=P(Zz)P(Z \ge -z) = P(Z \le z).
  • "Compare these two scores from different tests." Convert each to a z-score; the larger z-score is the better relative performance.
  • "Find the value below which p%p\% lie" or "the ppth percentile." This is the inverse: get zz for that area, then x=μ+zσx = \mu + z\sigma.

Edge cases worth knowing

  • The variance versus the standard deviation. N(μ,σ2)N(\mu, \sigma^2) states the variance, but z-scores and the empirical rule use σ\sigma. If a question gives σ2=25\sigma^2 = 25, then σ=5\sigma = 5 before you do anything else.
  • A value that is not a whole number of standard deviations. The empirical rule only covers 11, 22, 33 standard deviations. For z=1.5z = 1.5 or z=0.7z = 0.7 you need the standard normal table or normalcdf, not the rule.
  • "At least" and "at most". For a continuous variable P(Xa)=P(X>a)P(X \ge a) = P(X > a) and P(Xa)=P(X<a)P(X \le a) = P(X < a), because a single point has zero probability, so do not fuss over strict versus non-strict inequalities.
  • Symmetry shortcuts. P(Z1)=P(Z1)0.84P(Z \ge -1) = P(Z \le 1) \approx 0.84 and P(Z<1)=P(Z>1)0.16P(Z < -1) = P(Z > 1) \approx 0.16. Use the mirror image rather than computing a tail twice.

Exam-style practice questions

Practice questions written in the style of NESA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

2022 HSC Q294 marksTest marks are normally distributed with mean 7070 and standard deviation 88. Find the probability that a randomly chosen student scores between 6262 and 8686.
Show worked answer →

Standardise the endpoints with z=xμσz = \frac{x - \mu}{\sigma}.

z1=62708=1z_1 = \frac{62 - 70}{8} = -1, z2=86708=2z_2 = \frac{86 - 70}{8} = 2.

By the empirical rule, P(1Z1)0.68P(-1 \le Z \le 1) \approx 0.68, so P(0Z1)0.34P(0 \le Z \le 1) \approx 0.34. Similarly P(0Z2)0.475P(0 \le Z \le 2) \approx 0.475.

P(1Z2)=P(1Z0)+P(0Z2)0.34+0.475=0.815P(-1 \le Z \le 2) = P(-1 \le Z \le 0) + P(0 \le Z \le 2) \approx 0.34 + 0.475 = 0.815.

Markers reward correct standardisation, splitting the interval at 00, and applying the empirical rule values cleanly. A calculator's normalcdf gives 0.81860.8186 as a precise answer.

2021 HSC Q283 marksA continuous variable is normally distributed with mean μ=100\mu = 100 and standard deviation σ=15\sigma = 15. Approximately what percentage of values lie between 8585 and 115115? Between 7070 and 130130?
Show worked answer →

8585 and 115115 are one standard deviation either side of the mean, so by the empirical rule about 68%68\% of values lie in this range.

7070 and 130130 are two standard deviations either side, so about 95%95\% of values lie in this range.

Markers expect explicit identification of how many σ\sigma from the mean each endpoint is, and the corresponding empirical rule percentage.

Related dot points