β Unit 4: Further calculus and statistical inference
Topic 3: Continuous random variables, the normal distribution, and statistical inference
Apply the sampling distribution of the sample proportion $\hat{p}$ (mean $p$, standard deviation $\sqrt{p(1-p)/n}$) and construct approximate confidence intervals $\hat{p} \pm z^* \sqrt{\hat{p}(1-\hat{p})/n}$ for a population proportion
A focused answer to the QCE Maths Methods Unit 4 dot point on sample proportions and confidence intervals. The sampling distribution of $\hat{p}$, the normal approximation, the CI formula with standard $z^*$ values, and worked Paper 2 / PSMT examples.
Have a quick question? Jump to the Q&A page
What this dot point is asking
QCAA wants you to treat the sample proportion as a random variable, apply the normal approximation to its sampling distribution, construct confidence intervals for a population proportion, and interpret the interval correctly. The dot point bridges Unit 3 binomial probability with Unit 4 statistical inference, and is heavily examined in PSMT and EA.
Sample proportion
If a population has true proportion of "successes" and a random sample of is drawn with successes, the sample proportion is:
Because is random, is a random variable: it varies from sample to sample.
Sampling distribution of IMATH_10
Mean. . The sample proportion is an unbiased estimator of .
Standard deviation. .
Two takeaways:
- SD falls as : quadruple to halve SD.
- SD is maximised at ; minimised at or .
Normal approximation
For large :
Conditions for the approximation (QCAA convention):
- IMATH_20
- IMATH_21
When these conditions hold, is approximately normal with mean and SD . Standardise via to compute probabilities.
Confidence intervals
A confidence interval for a population proportion combines:
- The point estimate (centre).
- The standard error: (using in place of unknown ).
- The critical value for the confidence level.
The formula:
The margin of error is .
Standard values
| Level | IMATH_33 |
|---|---|
| 90% | 1.6449 (round to 1.645) |
| 95% | 1.9600 (round to 1.96) |
| 99% | 2.5758 (round to 2.58) |
Interpretation
A confidence interval has the long-run interpretation:
Approximately of intervals constructed by this procedure across repeated samples would contain the true population proportion.
This is NOT:
- "There is a probability that is in this interval." (Once the interval is constructed, either is or is not in it.)
- " of the population have proportions in this interval." (The interval is about the parameter, not about individuals.)
The correct language refers to the long-run success rate of the procedure.
Sample size design
To achieve a margin of error at most at confidence:
If is unknown in advance, use (worst case) for a conservative design.
Always round up to the next integer (cannot sample a fractional person).
Worked example
A study estimates the proportion of adults who exercise daily. A random sample of adults gives 120 who do.
.
SE: .
For 95 percent: . Margin: .
CI: .
Interpretation: approximately 95 percent of intervals constructed by this procedure would contain the true population proportion of adults who exercise daily.
Trade-offs
Confidence vs precision. Higher confidence (99 percent) requires a wider interval. Lower confidence (90 percent) gives a narrower interval. To improve both, increase .
Sample size economics. Doubling reduces SE by a factor of . Quadrupling halves SE. Diminishing returns above for opinion polling.
When the normal approximation fails
For very small samples or proportions near 0 or 1, the approximation can be poor. QCAA conventions ( and ) ensure validity. Outside these conditions, an exact (binomial-based) interval would be needed, beyond Methods scope.
Common errors
Wrong for the level. 1.96 for 95 percent; 1.645 for 90 percent. Mixing them up gives wrong widths.
Probability misinterpretation. "The probability is in this interval" is wrong. Use long-run-procedure language.
Sample size not rounded up. becomes , not 384.
** vs in formulas.** In the sampling distribution SD: use (when known). In the CI standard error: use (when unknown).
Forgetting worst-case . When no prior estimate of exists for sample-size design, use to maximise for the most conservative .
In one sentence
The sample proportion has mean and SD , and for large (with and ) is approximately normal; an approximate confidence interval for the population proportion is with for the 90, 95, 99 percent levels, interpreted as the long-run procedure containing the true in approximately of repeated samples.
Past exam questions, worked
Real questions from past QCAA papers on this dot point, with our answer explainer.
2024 QCAA-style P25 marksA poll of 600 voters found 354 supported candidate X. (a) Compute a 95 percent confidence interval for the true proportion. (b) Interpret the interval. (c) Find the smallest sample size needed to achieve a 95 percent confidence interval of half-width 0.02, assuming the true proportion is around 0.5.Show worked answer β
(a) Confidence interval.
.
SE: .
for 95 percent.
Margin: .
CI: , approximately .
(b) Interpretation. If many similar samples of 600 voters were drawn and 95 percent confidence intervals were constructed from each, approximately 95 percent of these intervals would contain the true population proportion of voters supporting candidate X. (Avoid: "there is a 95 percent probability the true proportion is in this interval", which is incorrect.)
(c) Sample size. Margin: . With worst-case :
.
Smallest .
Markers reward correct SE formula, correct for 95 percent, the correct interpretation language, and rounding up to integer for sample size.
Related dot points
- Apply the normal distribution $N(\mu, \sigma^2)$ and the standardisation $Z = (X - \mu)/\sigma$ to compute normal probabilities and inverse probabilities, including the empirical 68-95-99.7 rule
A focused answer to the QCE Maths Methods Unit 4 dot point on the normal distribution. Standardisation, the empirical rule, normal probability and inverse-normal calculations, and worked PSMT and EA examples.
- Define a continuous random variable, its probability density function (pdf), cumulative distribution function (cdf), and compute probabilities, expected value (mean), variance and standard deviation as definite integrals
A focused answer to the QCE Maths Methods Unit 4 dot point on continuous random variables. Defines the pdf, cdf, mean, variance and standard deviation as integrals, including the normalisation condition and a worked PSMT-style example.
- Recognise the binomial distribution $X \sim \mathrm{Bin}(n, p)$ as the count of successes in $n$ independent Bernoulli trials, apply the binomial probability formula and use CAS, and use the formulas $E(X) = np$ and $\mathrm{Var}(X) = np(1 - p)$
A focused answer to the QCE Mathematical Methods Unit 3 dot point on the binomial distribution. Defines the binomial conditions (BINS), states the probability formula, gives the mean $np$ and variance $np(1 - p)$, and walks through both by-hand Paper 1 calculations and CAS-supported Paper 2 calculations including $P(X \leq k)$, $P(X \geq k)$ and modelling applications.