← Maths Advanced guides

NSWMaths Advanced

HSC Mathematics Advanced statistical analysis (2026 guide)

A complete guide to statistical analysis in HSC Mathematics Advanced. Descriptive statistics, probability distributions (discrete and continuous), the normal distribution, sampling, and applications. With worked examples and the patterns that repeat in HSC papers year after year.

Generated by Claude OpusReviewed by Better Tuition Academy10 min readNESA-MATH-ADV-STAT

Why statistics matters in HSC Mathematics Advanced

Statistical analysis is the second-largest topic in HSC Mathematics Advanced after calculus. It is also the topic where students most often lose marks, partly because statistics demands a different style of thinking than the rest of the course (less "solve the equation", more "interpret the situation").

The topic has clean structure: descriptive statistics for summarising data, probability distributions for modelling random outcomes, and the normal distribution as the universal tool for continuous data.

Descriptive statistics

Measures of centre

  • Mean (xΛ‰\bar{x}): the average. xΛ‰=1nβˆ‘ixi\bar{x} = \frac{1}{n}\sum_i x_i.
  • Median: the middle value when data are sorted. Half the data are below, half above.
  • Mode: the most frequent value.

The mean is sensitive to outliers; the median is robust. For skewed data (e.g. income), the median is usually more representative.

Measures of spread

  • Range: maxβ‘βˆ’min⁑\max - \min.
  • Variance: var(x)=1nβˆ‘i(xiβˆ’xΛ‰)2\text{var}(x) = \frac{1}{n}\sum_i (x_i - \bar{x})^2. (HSC uses population variance, dividing by nn.)
  • Standard deviation: Οƒ=var\sigma = \sqrt{\text{var}}. The square root of variance, in the same units as the data.
  • Interquartile range (IQR): Q3βˆ’Q1Q_3 - Q_1, where Q1Q_1 and Q3Q_3 are the first and third quartiles.

Standard deviation is the most-used spread measure in HSC because it pairs with the normal distribution.

Discrete probability distributions

A discrete probability distribution assigns probabilities to a finite or countable set of outcomes.

Expected value (mean)

For a discrete random variable XX:

E(X)=βˆ‘ixiP(X=xi)E(X) = \sum_i x_i P(X = x_i)

Variance and standard deviation

var(X)=E[(Xβˆ’ΞΌ)2]=βˆ‘i(xiβˆ’ΞΌ)2P(X=xi)\text{var}(X) = E[(X - \mu)^2] = \sum_i (x_i - \mu)^2 P(X = x_i)

Worked example

A spinner has three outcomes: 1 (probability 0.5), 2 (probability 0.3), 3 (probability 0.2). Find the expected value and standard deviation.

E(X)=1β‹…0.5+2β‹…0.3+3β‹…0.2=0.5+0.6+0.6=1.7E(X) = 1 \cdot 0.5 + 2 \cdot 0.3 + 3 \cdot 0.2 = 0.5 + 0.6 + 0.6 = 1.7.

IMATH_14
IMATH_15
=0.245+0.027+0.338=0.61= 0.245 + 0.027 + 0.338 = 0.61.

Οƒ=0.61β‰ˆ0.781\sigma = \sqrt{0.61} \approx 0.781.

The normal distribution

The normal (or Gaussian) distribution is a continuous bell-shaped distribution fully specified by its mean ΞΌ\mu and standard deviation Οƒ\sigma. Notation: X∼N(ΞΌ,Οƒ2)X \sim N(\mu, \sigma^2).

Properties

  • Symmetric about the mean.
  • The mean, median, and mode are all equal to ΞΌ\mu.
  • The total area under the curve is 1 (it's a probability density).
  • 68-95-99.7 rule: approximately 68% of values within 1 SD, 95% within 2 SD, 99.7% within 3 SD of the mean.

Standard normal distribution

The standard normal ZZ has ΞΌ=0\mu = 0 and Οƒ=1\sigma = 1. Any normal distribution can be converted to standard normal via the z-score transformation:

Z=Xβˆ’ΞΌΟƒZ = \frac{X - \mu}{\sigma}

A z-score tells you how many standard deviations above (positive) or below (negative) the mean a value is.

Using normal tables

HSC exam papers provide a table of Ξ¦(z)=P(Z≀z)\Phi(z) = P(Z \leq z) for the standard normal. From this you can compute any normal probability.

Worked example. A class's exam scores are normally distributed with mean 70 and standard deviation 10. Find the probability that a randomly chosen student scored above 85.

Step 1: z=85βˆ’7010=1.5z = \frac{85 - 70}{10} = 1.5.

Step 2: P(X>85)=P(Z>1.5)=1βˆ’P(Z≀1.5)=1βˆ’Ξ¦(1.5)P(X > 85) = P(Z > 1.5) = 1 - P(Z \leq 1.5) = 1 - \Phi(1.5).

Step 3: From a standard normal table, Ξ¦(1.5)β‰ˆ0.9332\Phi(1.5) \approx 0.9332.

Step 4: P(X>85)=1βˆ’0.9332=0.0668P(X > 85) = 1 - 0.9332 = 0.0668, or about 6.68%.

Worked example using 68-95-99.7

The heights of Year 12 boys are normally distributed with mean 175 cm and standard deviation 7 cm. What percentage are taller than 189 cm?

189 cm is (189βˆ’175)/7=2(189 - 175) / 7 = 2 standard deviations above the mean.

By the 68-95-99.7 rule, 95% of values are within 2 SD. So 2.5% are above 189 cm and 2.5% are below 161 cm.

Answer: about 2.5%.

Sampling and the Central Limit Theorem

When you take a random sample of size nn from a population with mean μ\mu and standard deviation σ\sigma, the sample mean Xˉ\bar{X} has:

  • Mean: E(XΛ‰)=ΞΌE(\bar{X}) = \mu.
  • Standard deviation: ΟƒXΛ‰=Οƒn\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}} (the standard error of the mean).

The Central Limit Theorem (CLT) says that for large enough nn (usually nβ‰₯30n \geq 30), the sampling distribution of XΛ‰\bar{X} is approximately normal, regardless of the shape of the population distribution.

Worked example

Heights of Year 12 girls have mean 167 cm and standard deviation 8 cm. A random sample of 16 girls is taken. What is the probability the sample mean is above 170 cm?

The sample mean has mean 167 and standard deviation 8/16=28/\sqrt{16} = 2.

z=(170βˆ’167)/2=1.5z = (170 - 167) / 2 = 1.5.

P(XΛ‰>170)=P(Z>1.5)=1βˆ’Ξ¦(1.5)β‰ˆ1βˆ’0.9332=0.0668P(\bar{X} > 170) = P(Z > 1.5) = 1 - \Phi(1.5) \approx 1 - 0.9332 = 0.0668.

About 6.68%.

Common HSC statistics traps

Confusing Οƒ\sigma with Οƒ2\sigma^2. Standard deviation vs variance. Read the question carefully.

Forgetting to convert to z-score. You cannot look up P(X≀85)P(X \leq 85) directly; you must standardise first.

Wrong tail. If the question asks P(X>85)P(X > 85), you need 1βˆ’Ξ¦(z)1 - \Phi(z). If it asks P(X<85)P(X < 85), you need Ξ¦(z)\Phi(z). Visualise the shaded area.

Misapplying the 68-95-99.7 rule. The rule gives the probability inside a band around the mean. If asked for the probability outside, subtract from 1.

Confusing the population SD with the sample mean's SD. The sample mean has SD Οƒ/n\sigma/\sqrt{n}, not Οƒ\sigma.

Forgetting CLT's sample-size requirement. The CLT is approximate; it gets better as nn grows. For small nn from non-normal populations, the sampling distribution may not be normal.

How statistics is examined

In Section II of the HSC Mathematics Advanced paper:

  • Short questions (3-4 marks). Compute mean and SD from a dataset. Apply the 68-95-99.7 rule.
  • Medium questions (5-7 marks). Find probabilities using the standard normal table. Compare two normal distributions. Compute z-scores in a real-world context.
  • Long extended-response questions (8-10 marks). Multi-part problems involving the sampling distribution of the mean, often with interpretation of results in context.

Practice strategy

For HSC Mathematics Advanced statistical analysis:

  • Term 3. Memorise the 68-95-99.7 rule, the z-score formula, and the standard error formula.
  • Term 4. Practice with the standard normal table. Do at least one statistics question per past paper.
  • Final two weeks. Drill the conversion: z-score β†’\rightarrow probability β†’\rightarrow interpretation. This sequence appears in every HSC paper.

In one sentence

HSC Mathematics Advanced statistical analysis rewards confident use of the normal distribution: convert any value to a z-score, read off the probability from the standard normal table, and interpret the result in context. Memorise the 68-95-99.7 rule, the z-score formula, and the standard error formula until they are automatic.

  • statistics
  • probability
  • normal-distribution
  • hsc-maths-advanced
  • year-12
  • 2026