What is the sample mean is random?

If you take a random sample of size n from a population and compute its mean X, a different sample gives a different mean. So X is a random variable. Its distribution is called the sampling distribution of the mean, and it has its own mean and standard deviation.

What are standardising to find probabilities?

To compute a probability for X, standardise using the standard error:

§-Syllabus dot point

QLDSpecialist MathematicsSyllabus dot point

Topic 3: Statistical inference

Understand the distribution of the sample mean, apply the central limit theorem to describe its shape, mean and standard deviation, and use these to compute probabilities for sample means drawn from a population

A focused answer to the QCE Specialist Mathematics Unit 4 dot point on the sampling distribution of the mean. Covers the mean and standard error of the sample mean, the central limit theorem, standardising to compute probabilities, and how sample size affects spread, with a verified worked example and the standard-error mistake QCAA markers watch for.

Generated by Claude Opus 4.88 min answerUpdated 2026-06-02

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

What this dot point is asking

QCAA wants you to understand that the sample mean $\bar{X}$ is itself a random variable with its own distribution, to state and apply the central limit theorem, and to use the resulting normal model to compute probabilities about sample means. This is the foundation of statistical inference, assessed in IA3 and the external assessment, and it precedes confidence-interval work.

The answer

The sample mean is random

If you take a random sample of size $n$ from a population and compute its mean $\bar{X}$ , a different sample gives a different mean. So $\bar{X}$ is a random variable. Its distribution is called the sampling distribution of the mean, and it has its own mean and standard deviation.

Mean and standard error

If the population has mean $\mu$ and standard deviation $\sigma$ , then for a sample of size $n$ :

E(\bar{X}) = \mu, \qquad \text{SD}(\bar{X}) = \frac{\sigma}{\sqrt{n}}.

The standard deviation of the sample mean, $\dfrac{\sigma}{\sqrt{n}}$ , is called the standard error. It is smaller than the population standard deviation, and it shrinks as $n$ grows: larger samples give more reliable estimates of $\mu$ . Crucially the divisor is $\sqrt{n}$ , not $n$ .

The central limit theorem

The central limit theorem states that for a sufficiently large sample size $n$ , the distribution of the sample mean $\bar{X}$ is approximately normal,

\bar{X} \sim N\!\left(\mu, \frac{\sigma^2}{n}\right),

regardless of the shape of the original population. If the population is already normal, $\bar{X}$ is exactly normal for any $n$ . A common rule of thumb takes $n \geq 30$ as large enough for the approximation when the population is not too skewed.

Standardising to find probabilities

To compute a probability for $\bar{X}$ , standardise using the standard error:

Z = \frac{\bar{X} - \mu}{\sigma / \sqrt{n}}.

This $Z$ has the standard normal distribution $N(0,1)$ , so probabilities follow from the normal model. The only change from a single-observation calculation is dividing by $\dfrac{\sigma}{\sqrt{n}}$ rather than $\sigma$ .

Effect of sample size

Because the standard error is $\dfrac{\sigma}{\sqrt{n}}$ , quadrupling the sample size halves the spread of $\bar{X}$ . This is why larger samples produce tighter estimates and narrower confidence intervals: the sampling distribution concentrates around $\mu$ .

Working backwards to find a sample size

A favourite extended-response task gives a probability statement about $\bar{X}$ and asks for the sample size. The method reverses standardising: convert the probability to a critical $z$ -value, then solve $\dfrac{\text{(value} - \mu)}{\sigma/\sqrt{n}} = z$ for $n$ . Because $n$ must be a whole number of observations, round the result, and state explicitly that a sample size is an integer. Reading the correct $z$ from the stated tail probability (lower tail, upper tail, or central region) is where care is needed, and a sketch of the normal curve with the area shaded prevents sign errors.

Normal population versus the central limit theorem

It is worth distinguishing two reasons $\bar{X}$ might be normal. If the population itself is normal, then $\bar{X}$ is exactly normal for every sample size, however small, because a sum of normal variables is normal. If the population is not normal, $\bar{X}$ is only approximately normal, and only for a large enough $n$ , by the central limit theorem. An exam question that asks you to justify normality is testing exactly this distinction: cite the normal population if one is given, and cite the central limit theorem only when the population shape is unknown or non-normal and $n$ is large.

Single observation versus the mean of a sample

The most consequential modelling decision is whether a question concerns one randomly chosen value or the average of a sample. A single observation $X$ has standard deviation $\sigma$ ; the sample mean $\bar{X}$ has the much smaller standard error $\dfrac{\sigma}{\sqrt{n}}$ . Using the wrong one is the difference between a correct and an incorrect probability. Read the wording carefully: phrases like "the mean of the sample" or "the average" signal $\bar{X}$ , while "a randomly selected" individual signals $X$ .

Worked example

A population has mean $\mu = 50$ and standard deviation $\sigma = 12$ . A random sample of $36$ is taken. Find the probability that the sample mean exceeds $54$ .

A question about the sample mean $\bar{X}$ always requires the standard error, not the population standard deviation, in the $z$ -score calculation.

Step 1: State the sampling distribution

With $n = 36$ , the standard error is

\frac{\sigma}{\sqrt{n}} = \frac{12}{\sqrt{36}} = \frac{12}{6} = 2.

By the central limit theorem (and here $n = 36 \geq 30$ ), $\bar{X} \sim N(50,\, 2^2)$ approximately.

Step 2: Standardise to a $z$ -score

Subtract the mean and divide by the standard error to convert $\bar{X} = 54$ to the standard normal scale:

Z = \frac{54 - 50}{2} = \frac{4}{2} = 2.

Step 3: Find the probability from the standard normal table

P(\bar{X} > 54) = P(Z > 2) \approx 0.0228.

Step 4: Interpret the result

There is about a $2.3\%$ chance that a sample of $36$ has a mean above $54$ . For comparison, a single observation would give $Z = \dfrac{54 - 50}{12} \approx 0.33$ , which corresponds to a much higher probability. Averaging over $36$ values reduces the spread dramatically, making a sample mean that far above $\mu$ quite rare.

Final answer: $P(\bar{X} > 54) \approx 0.0228$ .

Exam-style practice questions

Practice questions written in the style of QCAA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

QCAA 20245 marksPaper 2 (complex familiar). Heights of Year 12 students are normal with mean

168.6

cm and standard deviation

12.7

cm. A random sample of

20

is taken. (a) Explain why the sample mean is normally distributed. (b) Determine

P(\bar{X} > 170)

. (c) If there is a

75\%

probability that

\bar{X}

lies within

\pm h

of the mean, determine

P(\bar{X} \geq 168.6 + h)

. (d) Hence determine

h

Show worked answer →

Standard error $= \dfrac{\sigma}{\sqrt{n}} = \dfrac{12.7}{\sqrt{20}} \approx 2.840$ cm; mean $\mu = 168.6$ .

(a) Because the population is itself normal, $\bar{X}$ is normal for any sample size, not relying on a large $n$ .

(b) $z = \dfrac{170 - 168.6}{2.840} \approx 0.493$ , so $P(\bar{X} > 170) = P(Z > 0.493) \approx 0.31.$

(d) The upper critical value for a $0.125$ tail is $z \approx 1.150$ , so $h = 1.150 \times 2.840 \approx 3.27$ cm.

Markers reward the normality reason, the standardised probability, the tail logic, and the value of $h$ .

QCAA 20237 marksPaper 2 (complex unfamiliar). University travel times are normal with mean

25.2

min and standard deviation

4.7

min. A random sample of

120

gives

\bar{X}

. (a) Determine

P(24.5 \leq \bar{X} \leq 25.9)

. (b) Given

P(\bar{X} \leq k) = 0.8

, determine

k

. (c) A second sample has

P(\bar{X} \leq 24.6) = 0.05

; determine its size.

Show worked answer →

First sample: standard error $= \dfrac{4.7}{\sqrt{120}} \approx 0.4291$ min; mean $25.2$ .

(a) $z$ -values $\dfrac{24.5 - 25.2}{0.4291} \approx -1.631$ and $\dfrac{25.9 - 25.2}{0.4291} \approx 1.631$ , so $P(-1.631 \leq Z \leq 1.631) \approx 0.897.$

(b) For a lower area $0.8$ , $z \approx 0.8416$ , so $k = 25.2 + 0.8416 \times 0.4291 \approx 25.56$ min.

(c) $P(\bar{X} \leq 24.6) = 0.05$ puts $24.6$ at $z = -1.645$ : $24.6 = 25.2 - 1.645\cdot\dfrac{4.7}{\sqrt{n}}$ , so $\sqrt{n} = \dfrac{1.645 \times 4.7}{0.6} \approx 12.886$ , giving $n \approx 166.$

Markers reward the standard error, both standardisations, and solving for $n$ with rounding to a whole sample size.