What does statistical analysis cover in HSC Mathematics Advanced?

Statistical analysis in HSC Mathematics Advanced covers descriptive statistics (mean, median, variance, standard deviation), discrete probability distributions (binomial), continuous probability distributions (the normal distribution), z-scores, sampling distributions, and basic statistical inference. It is about 15-20% of the exam paper.

What is the normal distribution?

The normal distribution is a continuous bell-shaped distribution that arises naturally in many real-world contexts (heights, exam scores, measurement errors). It is fully specified by two parameters - its mean ($\\mu$) and its standard deviation ($\\sigma$). The famous 68-95-99.7 rule states that roughly 68% of values fall within 1 SD of the mean, 95% within 2 SD, and 99.7% within 3 SD.

A z-score (standard score) measures how many standard deviations a value is from the mean. For a value $x$ from a distribution with mean $\\mu$ and standard deviation $\\sigma$, the z-score is $z = (x - \\mu) / \\sigma$. Z-scores let you compare values from different normal distributions on a common scale.

How do I find probabilities using the normal distribution?

Convert the value to a z-score, then use a standard normal table or calculator function to find the probability. For example, $P(Z < 1.5)$ from a standard normal table is approximately 0.9332. HSC exam papers provide a table; learn to use it accurately under time pressure.

What is a sampling distribution?

A sampling distribution describes how a sample statistic (e.g. the sample mean) varies across different random samples drawn from a population. For large enough samples, the Central Limit Theorem says the sampling distribution of the sample mean is approximately normal, with mean equal to the population mean and standard deviation equal to $\\sigma/\\sqrt{n}$ where $n$ is the sample size.

How is statistics examined in HSC Mathematics Advanced?

Statistics questions usually appear in Section II. Common patterns - compute descriptive statistics from a dataset, find probabilities using the normal distribution, apply the 68-95-99.7 rule, compute z-scores, compare distributions, and apply the sampling distribution for the mean. Most questions are 4-8 marks each.

← Maths Advanced guides

NSWMaths Advanced

HSC Mathematics Advanced statistical analysis (2026 guide)

Q: What is a sampling distribution?

A sampling distribution describes how a sample statistic (e.g. the sample mean) varies across different random samples drawn from a population. For large enough samples, the Central Limit Theorem says the sampling distribution of the sample mean is approximately normal, with mean equal to the population mean and standard deviation equal to $\\sigma/\\sqrt{n}$ where $n$ is the sample size.

Q: How is statistics examined in HSC Mathematics Advanced?

Statistics questions usually appear in Section II. Common patterns - compute descriptive statistics from a dataset, find probabilities using the normal distribution, apply the 68-95-99.7 rule, compute z-scores, compare distributions, and apply the sampling distribution for the mean. Most questions are 4-8 marks each.

A complete guide to statistical analysis in HSC Mathematics Advanced. Descriptive statistics, probability distributions (discrete and continuous), the normal distribution, sampling, and applications. With worked examples and the patterns that repeat in HSC papers year after year.

Generated by Claude OpusReviewed by Better Tuition Academy10 min readNESA-MATH-ADV-STATUpdated 2026-05-18

Why statistics matters in HSC Mathematics Advanced

Statistical analysis is the second-largest topic in HSC Mathematics Advanced after calculus. It is also the topic where students most often lose marks, partly because statistics demands a different style of thinking than the rest of the course (less "solve the equation", more "interpret the situation").

The topic has clean structure: descriptive statistics for summarising data, probability distributions for modelling random outcomes, and the normal distribution as the universal tool for continuous data.

Descriptive statistics

Measures of centre

Mean ( $\bar{x}$ ): the average. $\bar{x} = \frac{1}{n}\sum_i x_i$ .
Median: the middle value when data are sorted. Half the data are below, half above.
Mode: the most frequent value.

The mean is sensitive to outliers; the median is robust. For skewed data (e.g. income), the median is usually more representative.

Measures of spread

Range: $\max - \min$ .
Variance: $\text{var}(x) = \frac{1}{n}\sum_i (x_i - \bar{x})^2$ . (HSC uses population variance, dividing by $n$ .)
Standard deviation: $\sigma = \sqrt{\text{var}}$ . The square root of variance, in the same units as the data.
Interquartile range (IQR): $Q_3 - Q_1$ , where $Q_1$ and $Q_3$ are the first and third quartiles.

Standard deviation is the most-used spread measure in HSC because it pairs with the normal distribution.

Discrete probability distributions

A discrete probability distribution assigns probabilities to a finite or countable set of outcomes.

Expected value (mean)

For a discrete random variable $X$ :

E(X) = \sum_i x_i P(X = x_i)

Variance and standard deviation

\text{var}(X) = E[(X - \mu)^2] = \sum_i (x_i - \mu)^2 P(X = x_i)

Worked example

A spinner has three outcomes: 1 (probability 0.5), 2 (probability 0.3), 3 (probability 0.2). Find the expected value and standard deviation.

$E(X) = 1 \cdot 0.5 + 2 \cdot 0.3 + 3 \cdot 0.2 = 0.5 + 0.6 + 0.6 = 1.7$ .

IMATH_14
IMATH_15
$= 0.245 + 0.027 + 0.338 = 0.61$ .

$\sigma = \sqrt{0.61} \approx 0.781$ .

The normal distribution

The normal (or Gaussian) distribution is a continuous bell-shaped distribution fully specified by its mean $\mu$ and standard deviation $\sigma$ . Notation: $X \sim N(\mu, \sigma^2)$ .

Properties

Symmetric about the mean.
The mean, median, and mode are all equal to $\mu$ .
The total area under the curve is 1 (it's a probability density).
68-95-99.7 rule: approximately 68% of values within 1 SD, 95% within 2 SD, 99.7% within 3 SD of the mean.

Standard normal distribution

The standard normal $Z$ has $\mu = 0$ and $\sigma = 1$ . Any normal distribution can be converted to standard normal via the z-score transformation:

Z = \frac{X - \mu}{\sigma}

A z-score tells you how many standard deviations above (positive) or below (negative) the mean a value is.

Using normal tables

HSC exam papers provide a table of $\Phi(z) = P(Z \leq z)$ for the standard normal. From this you can compute any normal probability.

Worked example. A class's exam scores are normally distributed with mean 70 and standard deviation 10. Find the probability that a randomly chosen student scored above 85.

Step 1: $z = \frac{85 - 70}{10} = 1.5$ .

Step 2: $P(X > 85) = P(Z > 1.5) = 1 - P(Z \leq 1.5) = 1 - \Phi(1.5)$ .

Step 3: From a standard normal table, $\Phi(1.5) \approx 0.9332$ .

Step 4: $P(X > 85) = 1 - 0.9332 = 0.0668$ , or about 6.68%.

Worked example using 68-95-99.7

The heights of Year 12 boys are normally distributed with mean 175 cm and standard deviation 7 cm. What percentage are taller than 189 cm?

189 cm is $(189 - 175) / 7 = 2$ standard deviations above the mean.

By the 68-95-99.7 rule, 95% of values are within 2 SD. So 2.5% are above 189 cm and 2.5% are below 161 cm.

Answer: about 2.5%.

Sampling and the Central Limit Theorem

When you take a random sample of size $n$ from a population with mean $\mu$ and standard deviation $\sigma$ , the sample mean $\bar{X}$ has:

Mean: $E(\bar{X}) = \mu$ .
Standard deviation: $\sigma_{\bar{X}} = \frac{\sigma}{\sqrt{n}}$ (the standard error of the mean).

The Central Limit Theorem (CLT) says that for large enough $n$ (usually $n \geq 30$ ), the sampling distribution of $\bar{X}$ is approximately normal, regardless of the shape of the population distribution.

Worked example

Heights of Year 12 girls have mean 167 cm and standard deviation 8 cm. A random sample of 16 girls is taken. What is the probability the sample mean is above 170 cm?

The sample mean has mean 167 and standard deviation $8/\sqrt{16} = 2$ .

$z = (170 - 167) / 2 = 1.5$ .

$P(\bar{X} > 170) = P(Z > 1.5) = 1 - \Phi(1.5) \approx 1 - 0.9332 = 0.0668$ .

About 6.68%.

Common HSC statistics traps

Confusing $\sigma$ with $\sigma^2$ . Standard deviation vs variance. Read the question carefully.

Forgetting to convert to z-score. You cannot look up $P(X \leq 85)$ directly; you must standardise first.

Wrong tail. If the question asks $P(X > 85)$ , you need $1 - \Phi(z)$ . If it asks $P(X < 85)$ , you need $\Phi(z)$ . Visualise the shaded area.

Misapplying the 68-95-99.7 rule. The rule gives the probability inside a band around the mean. If asked for the probability outside, subtract from 1.

Confusing the population SD with the sample mean's SD. The sample mean has SD $\sigma/\sqrt{n}$ , not $\sigma$ .

Forgetting CLT's sample-size requirement. The CLT is approximate; it gets better as $n$ grows. For small $n$ from non-normal populations, the sampling distribution may not be normal.

How statistics is examined

In Section II of the HSC Mathematics Advanced paper:

Short questions (3-4 marks). Compute mean and SD from a dataset. Apply the 68-95-99.7 rule.
Medium questions (5-7 marks). Find probabilities using the standard normal table. Compare two normal distributions. Compute z-scores in a real-world context.
Long extended-response questions (8-10 marks). Multi-part problems involving the sampling distribution of the mean, often with interpretation of results in context.

Practice strategy

For HSC Mathematics Advanced statistical analysis:

Term 3. Memorise the 68-95-99.7 rule, the z-score formula, and the standard error formula.
Term 4. Practice with the standard normal table. Do at least one statistics question per past paper.
Final two weeks. Drill the conversion: z-score $\rightarrow$ probability $\rightarrow$ interpretation. This sequence appears in every HSC paper.

In one sentence

HSC Mathematics Advanced statistical analysis rewards confident use of the normal distribution: convert any value to a z-score, read off the probability from the standard normal table, and interpret the result in context. Memorise the 68-95-99.7 rule, the z-score formula, and the standard error formula until they are automatic.