Skip to main content
VICSpecialist MathematicsSyllabus dot point

How do we build an approximate confidence interval for a population mean from a sample, and what does the confidence level actually mean?

Construction and interpretation of approximate confidence intervals for a population mean using the sample mean and standard error, the choice of confidence level and its zz value, the effect of sample size on the interval width, and the correct interpretation of a confidence interval

A focused answer to the VCE Specialist Mathematics Unit 4 key-knowledge point on confidence intervals for a mean. The interval formula, the z value for a confidence level, the effect of sample size, and correct interpretation, with a verified worked example.

Generated by Claude Opus 4.76 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this dot point is asking
  2. The confidence interval formula
  3. Choosing the $z$ value
  4. The effect of sample size
  5. Interpreting a confidence interval correctly
  6. Examples in context
  7. Try this

What this dot point is asking

VCAA wants you to construct an approximate confidence interval for a population mean from a sample, choosing the right zz value for the confidence level, computing the interval, understanding how sample size affects its width, and interpreting it correctly. The interpretation, in particular, is a frequent exam focus.

The confidence interval formula

An approximate C%C\% confidence interval for the population mean μ\mu, based on a sample of size nn with sample mean xˉ\bar{x}, is

xˉ±zσn,\bar{x} \pm z\,\frac{\sigma}{\sqrt{n}},

where σn\frac{\sigma}{\sqrt{n}} is the standard error and zz is the value cutting off the central C%C\% of the standard normal distribution. The quantity zσnz\,\frac{\sigma}{\sqrt{n}} is the margin of error. When the population standard deviation σ\sigma is unknown, the sample standard deviation is used in its place for large samples.

Choosing the zz value

The zz value depends only on the confidence level:

  • 90%90\% confidence: z1.645z \approx 1.645;
  • 95%95\% confidence: z1.96z \approx 1.96;
  • 99%99\% confidence: z2.576z \approx 2.576.

A higher confidence level needs a larger zz, hence a wider interval, since more confidence requires casting a wider net.

The effect of sample size

The interval half-width is zσnz\,\frac{\sigma}{\sqrt{n}}, so it shrinks as 1n\frac{1}{\sqrt{n}}. A larger sample gives a narrower, more precise interval. To halve the width you must quadruple the sample size, the same diminishing return seen with the standard error.

Interpreting a confidence interval correctly

The confidence level describes the procedure, not a single interval. The correct statement is: if many samples were taken and a C%C\% interval computed from each, about C%C\% of those intervals would contain the true mean μ\mu. It is incorrect to say there is a C%C\% probability that μ\mu lies in a particular computed interval, because μ\mu is a fixed (unknown) constant, and the computed interval either contains it or does not.

Examples in context

Example 1. A 90%90\% interval is narrower than a 99%99\% interval from the same data, because 1.645<2.5761.645 < 2.576.

Example 2. Increasing nn from 2525 to 100100 halves the margin of error, since 100=225\sqrt{100} = 2\sqrt{25}.

Try this

Q1. State the zz value for a 99%99\% confidence interval. [1 mark]

  • Cue. 2.5762.576.

Q2. Find the margin of error for σ=15\sigma = 15, n=25n = 25, at 95%95\% confidence. [2 marks]

  • Cue. 1.96×155=1.96×3=5.881.96 \times \frac{15}{5} = 1.96\times 3 = 5.88.

Q3. Give the correct interpretation of a 95%95\% confidence interval. [2 marks]

  • Cue. In repeated sampling, about 95%95\% of such intervals would contain the true mean.

Exam-style practice questions

Practice questions written in the style of VCAA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

2023 VCAA1 marksA random sample of 20 adult male koalas has a sample mean of 11.39 kg. It is known that the mass of adult male koalas in the forest is normally distributed with a standard deviation of 1 kg. Find a 95% confidence interval for the population mean (the mean mass of all adult male koalas in the forest). Give your values correct to two decimal places.
Show worked answer →

The approximate confidence interval for a mean is x-bar +/- z . sigma / sqrt(n).

For 95% confidence the z value is 1.96. Here x-bar = 11.39, sigma = 1 and n = 20, so the standard error is 1 / sqrt(20) = 0.2236.

Margin of error = 1.96 . 0.2236 = 0.4383, which rounds to 0.44.

So the interval is 11.39 +/- 0.44, that is, (10.95, 11.83) kilograms (to two decimal places).

2025 VCAA1 marksThe volume of water dispensed into Wasser bottles is normally distributed with a standard deviation of 5 mL. Engineers take a random sample of 30 bottles; the sample mean is 750 mL. Find a 95% confidence interval for the mean volume of water dispensed into each Wasser bottle. Give your values in millilitres, correct to one decimal place.
Show worked answer →

Use x-bar +/- z . sigma / sqrt(n) with the 95% z value of 1.96.

Here x-bar = 750, sigma = 5 and n = 30, so the standard error is 5 / sqrt(30) = 0.9129.

Margin of error = 1.96 . 0.9129 = 1.789, which rounds to 1.8.

So the 95% confidence interval is 750 +/- 1.8, that is, (748.2, 751.8) mL (to one decimal place).

2023 VCAA1 marksSixty random samples of adult male koalas are taken and their 95% confidence intervals are calculated. In how many of these confidence intervals would the actual mean mass of all adult male koalas in the forest be expected to lie?
Show worked answer →

A 95% confidence level means that, over many repeated samples, about 95% of the constructed intervals will contain the true population mean.

So the expected number is 95% of the 60 intervals: 0.95 . 60 = 57.

Therefore the true mean would be expected to lie in 57 of the 60 confidence intervals. (This is the correct frequency interpretation: it is the intervals that vary from sample to sample, not the fixed population mean.)