← Unit 4

VICMath MethodsSyllabus dot point

How is a confidence interval for a population proportion constructed and interpreted?

Approximate confidence intervals for a population proportion $p$ based on the sample proportion $\hat{p}$, including the standard 90, 95 and 99 percent intervals and their interpretation

A focused answer to the VCE Math Methods Unit 4 key-knowledge point on confidence intervals. The formula, the standard $z^*$ values for 90, 95 and 99 percent intervals, the correct interpretation language, and the relationship between sample size, margin of error and confidence level.

Generated by Claude OpusReviewed by Better Tuition Academy9 min answer

Have a quick question? Jump to the Q&A page

What this dot point is asking

VCAA wants you to construct confidence intervals for a population proportion pp from sample data, choose the correct critical value zβˆ—z^* for the requested confidence level, interpret the interval correctly (avoiding the common probability-misstatement), and solve sample-size design problems based on margin-of-error targets.

The confidence interval formula

Given a sample of size nn with sample proportion p^\hat{p}, an approximate C%C \% confidence interval for the population proportion pp is:

p^Β±zβˆ—p^(1βˆ’p^)n\hat{p} \pm z^* \sqrt{\frac{\hat{p}(1 - \hat{p})}{n}}

The components are:

  • Point estimate. p^\hat{p}, the centre of the interval.
  • Standard error (SE). p^(1βˆ’p^)/n\sqrt{\hat{p}(1 - \hat{p}) / n}, the estimated standard deviation of p^\hat{p}.
  • Critical value zβˆ—z^*. The number of standard errors corresponding to the confidence level.
  • Margin of error (MoE). zβˆ—Γ—SEz^* \times \text{SE}, the half-width of the interval.

The interval is (p^βˆ’MoE,p^+MoE)(\hat{p} - \text{MoE}, \hat{p} + \text{MoE}).

Note. p^\hat{p} vs pp in the standard error

In the sampling-distribution formula, SD(p^)=p(1βˆ’p)/n\text{SD}(\hat{p}) = \sqrt{p(1-p)/n} uses the true (and usually unknown) pp.

In the confidence-interval formula, the standard error uses p^\hat{p} in place of pp because pp is unknown. This is the plug-in standard error. For Methods purposes the substitution is the standard convention.

Standard zβˆ—z^* values

Confidence level IMATH_23
90 percent 1.6449 (often rounded to 1.645)
95 percent 1.9600 (often rounded to 1.96)
99 percent 2.5758 (often rounded to 2.58)

These come from the inverse-normal: zβˆ—=invNorm((1+C/100)/2)z^* = \text{invNorm}((1 + C/100) / 2) for confidence level CC percent.

The 95 percent value of 1.96 is the most commonly used and worth memorising.

Interpretation: what the confidence level means

A C%C \% confidence interval has the following correct interpretation:

If the sampling procedure were repeated many times (each time computing a new p^\hat{p} and a new confidence interval from a fresh sample of size nn), approximately C%C \% of the constructed intervals would contain the true population proportion pp.

The interval describes a procedure that works C%C \% of the time, not a probability statement about a single specific interval.

Common misstatements VCAA marks against

These are widespread but incorrect:

  • "There is a 95 percent probability that the true proportion lies in this interval." (Wrong: once the interval is constructed, pp either is or is not in it; there is no probability.)
  • "95 percent of the population have a proportion in this interval." (Wrong: the interval is about the parameter pp, not about individuals.)
  • "We are 95 percent sure that p^\hat{p} is in this interval." (Wrong: p^\hat{p} is the centre of the interval by construction.)

A correct interpretation refers to the long-run success rate of the procedure.

Worked construction

A sample of n=500n = 500 patients includes 175 with a particular condition. Construct a 95 percent confidence interval for the true population prevalence pp.

Sample proportion. p^=175/500=0.35\hat{p} = 175 / 500 = 0.35.

Standard error. 0.35Γ—0.65/500=0.000455β‰ˆ0.02133\sqrt{0.35 \times 0.65 / 500} = \sqrt{0.000455} \approx 0.02133.

zβˆ—z^* for 95 percent. 1.961.96.

Margin of error. 1.96Γ—0.02133β‰ˆ0.04181.96 \times 0.02133 \approx 0.0418.

Confidence interval. (0.35βˆ’0.0418,0.35+0.0418)=(0.308,0.392)(0.35 - 0.0418, 0.35 + 0.0418) = (0.308, 0.392).

Interpretation: 95 percent of intervals constructed by this procedure would contain the true population prevalence.

Margin of error and sample size

The margin of error decreases as the sample size grows. Specifically:

MoE=zβˆ—p^(1βˆ’p^)n\text{MoE} = z^* \sqrt{\frac{\hat{p}(1 - \hat{p})}{n}}

To halve the margin of error, the sample size must quadruple (because of the n\sqrt{n} in the denominator).

To design a study with a specific margin of error, solve for nn:

nβ‰₯(zβˆ—)2p^(1βˆ’p^)MoE2n \geq \frac{(z^*)^2 \hat{p} (1 - \hat{p})}{\text{MoE}^2}

Always round up to the next integer (you cannot have a fractional person).

Worst-case sample size

If no prior estimate of p^\hat{p} is available, use p^=0.5\hat{p} = 0.5 in the design formula, which maximises p^(1βˆ’p^)=0.25\hat{p}(1 - \hat{p}) = 0.25. This gives the largest required nn and so a conservative design.

For 95 percent confidence with MoE=0.03\text{MoE} = 0.03:

nβ‰₯(1.96)2Γ—0.25/(0.03)2=0.9604/0.0009β‰ˆ1067n \geq (1.96)^2 \times 0.25 / (0.03)^2 = 0.9604 / 0.0009 \approx 1067.

The "1000 sample" convention in public opinion polling comes from approximately this calculation.

Confidence level and margin trade-off

Higher confidence level requires wider interval. For fixed nn and p^\hat{p}:

  • 90 percent CI is narrower than 95 percent CI is narrower than 99 percent CI.
  • The trade-off is between certainty (higher confidence) and precision (narrower interval).

To increase confidence without widening, you must increase nn.

When the normal approximation is invalid

The formula assumes:

  • IMATH_55 and n(1βˆ’p^)β‰₯10n (1 - \hat{p}) \geq 10 (or 5, depending on convention).
  • The sample is random and independent.

For very small samples or proportions very close to 0 or 1, the normal approximation breaks down and an exact binomial-based interval is needed. VCE Methods stays inside the regime where the normal approximation is valid; an out-of-regime question would typically flag the issue.

Calculator workflow (Paper 2)

Most CAS systems have a built-in confidence-interval function for proportions, often as zInterval_1Prop or similar. The inputs are usually xx (number of successes), nn (sample size), and confidence level. The output is the interval.

For Paper 2 questions, the calculator output is acceptable, but always show the formula or set-up: p^\hat{p}, SE, zβˆ—z^* at minimum.

Common errors

Using zβˆ—z^* for the wrong confidence level. 1.961.96 for 95 percent; 1.6451.645 for 90 percent; 2.582.58 for 99 percent. Mixing them up gives an interval that is too wide or too narrow.

Forgetting to multiply by zβˆ—z^*. The margin is zβˆ—Γ—SEz^* \times \text{SE}, not just SE.

Probability misinterpretation. As noted, "the probability pp is in this interval" is wrong. Use the long-run-procedure language.

Sample size not rounded up. n=383.1n = 383.1 becomes n=384n = 384, not 383383.

Worst-case p^\hat{p} ignored. If a study design problem gives no prior estimate of pp, the worst-case p^=0.5\hat{p} = 0.5 should be used to ensure the margin is met regardless.

Standard error with pp instead of p^\hat{p}. Use p^\hat{p} in the CI standard error; pp is unknown.

In one sentence

A C%C \% confidence interval for a population proportion is p^Β±zβˆ—p^(1βˆ’p^)/n\hat{p} \pm z^* \sqrt{\hat{p}(1-\hat{p})/n} where zβˆ—z^* is the standard normal critical value for the level (1.6451.645 for 90 percent, 1.961.96 for 95 percent, 2.582.58 for 99 percent); the correct interpretation is that approximately C%C \% of intervals constructed by this procedure across repeated samples would contain the true population proportion, not that this specific interval has a C%C \% probability of containing it.

Past exam questions, worked

Real questions from past VCAA papers on this dot point, with our answer explainer.

2024 VCAA Paper 24 marksA poll of 800 voters in an electorate found that 432 supported candidate A. (a) Find a 95 percent confidence interval for the true proportion of voters supporting candidate A. (b) State what the 95 percent confidence interval means.
Show worked answer β†’

(a) Confidence interval.

Sample proportion: p^=432/800=0.54\hat{p} = 432 / 800 = 0.54.

Standard error: SE=p^(1βˆ’p^)n=0.54Γ—0.46800=0.2484800β‰ˆ0.01762\text{SE} = \sqrt{\frac{\hat{p}(1 - \hat{p})}{n}} = \sqrt{\frac{0.54 \times 0.46}{800}} = \sqrt{\frac{0.2484}{800}} \approx 0.01762.

For 95 percent confidence, zβˆ—=1.96z^* = 1.96.

Margin of error: 1.96Γ—0.01762β‰ˆ0.03451.96 \times 0.01762 \approx 0.0345.

Interval: (0.54βˆ’0.0345,0.54+0.0345)=(0.5055,0.5745)(0.54 - 0.0345, 0.54 + 0.0345) = (0.5055, 0.5745), or approximately (0.506,0.575)(0.506, 0.575) to 3 decimal places.

(b) Interpretation. If many such samples of 800 voters were drawn and a 95 percent confidence interval constructed from each, approximately 95 percent of those intervals would contain the true population proportion of voters supporting candidate A.

Markers reward correct computation of p^\hat{p}, the SE formula using p^\hat{p}, the right zβˆ—z^* for 95 percent, and an interpretation that does not say "there is a 95 percent probability that the true pp is in this specific interval" (a common but incorrect interpretation).

2023 VCAA Paper 23 marksA consumer organisation tested 200 light bulbs and found that 30 were defective. (a) Construct an approximate 90 percent confidence interval for the true proportion of defective bulbs. (b) State the smallest sample size needed to achieve a margin of error no greater than 0.03 at 90 percent confidence, assuming the true proportion is approximately 0.15.
Show worked answer β†’

(a) Confidence interval.

p^=30/200=0.15\hat{p} = 30 / 200 = 0.15.

SE: 0.15Γ—0.85/200=0.000638β‰ˆ0.0252\sqrt{0.15 \times 0.85 / 200} = \sqrt{0.000638} \approx 0.0252.

For 90 percent confidence, zβˆ—=1.645z^* = 1.645.

Margin: 1.645Γ—0.0252β‰ˆ0.04151.645 \times 0.0252 \approx 0.0415.

Interval: (0.15βˆ’0.0415,0.15+0.0415)=(0.1085,0.1915)(0.15 - 0.0415, 0.15 + 0.0415) = (0.1085, 0.1915), approximately (0.109,0.192)(0.109, 0.192).

(b) Sample size for margin 0.03.

Margin: zβˆ—p(1βˆ’p)/n≀0.03z^* \sqrt{p(1-p)/n} \leq 0.03.

Substitute: 1.6450.15Γ—0.85/n≀0.031.645 \sqrt{0.15 \times 0.85 / n} \leq 0.03.

Solve: 0.1275/n≀0.03/1.645β‰ˆ0.01824\sqrt{0.1275 / n} \leq 0.03 / 1.645 \approx 0.01824.

Square: 0.1275/n≀0.0003330.1275 / n \leq 0.000333.

So nβ‰₯0.1275/0.000333β‰ˆ383.1n \geq 0.1275 / 0.000333 \approx 383.1.

The smallest sample size is n=384n = 384 (round up).

Markers reward correct margin formula, correct zβˆ—z^* for 90 percent, and rounding up to the next integer.

Related dot points