Skip to main content
VICMath MethodsSyllabus dot point

How is a confidence interval for a population proportion constructed and interpreted?

Approximate confidence intervals for a population proportion pp based on the sample proportion p^\hat{p}, including the standard 90, 95 and 99 percent intervals and their interpretation

A focused answer to the VCE Math Methods Unit 4 key-knowledge point on confidence intervals. The formula, the standard zz^* values for 90, 95 and 99 percent intervals, the correct interpretation language, and the relationship between sample size, margin of error and confidence level.

Generated by Claude Opus 4.89 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this dot point is asking
  2. The confidence interval formula
  3. Standard zz^* values
  4. Interpretation: what the confidence level means
  5. Worked construction
  6. Margin of error and sample size
  7. Confidence level and margin trade-off
  8. When the normal approximation is invalid
  9. Calculator workflow (Paper 2)
  10. Examples in context
  11. Try this

What this dot point is asking

VCAA wants you to construct confidence intervals for a population proportion pp from sample data, choose the correct critical value zz^* for the requested confidence level, interpret the interval correctly (avoiding the common probability-misstatement), and solve sample-size design problems based on margin-of-error targets.

The confidence interval formula

Given a sample of size nn with sample proportion p^\hat{p}, an approximate C%C \% confidence interval for the population proportion pp is:

p^±zp^(1p^)n\hat{p} \pm z^* \sqrt{\frac{\hat{p}(1 - \hat{p})}{n}}

The components are:

  • Point estimate. p^\hat{p}, the centre of the interval.
  • Standard error (SE). p^(1p^)/n\sqrt{\hat{p}(1 - \hat{p}) / n}, the estimated standard deviation of p^\hat{p}.
  • Critical value zz^*. The number of standard errors corresponding to the confidence level.
  • Margin of error (MoE). z×SEz^* \times \text{SE}, the half-width of the interval.

The interval is (p^MoE,p^+MoE)(\hat{p} - \text{MoE}, \hat{p} + \text{MoE}).

Note. p^\hat{p} vs pp in the standard error

In the sampling-distribution formula, SD(p^)=p(1p)/n\text{SD}(\hat{p}) = \sqrt{p(1-p)/n} uses the true (and usually unknown) pp.

In the confidence-interval formula, the standard error uses p^\hat{p} in place of pp because pp is unknown. This is the plug-in standard error. For Methods purposes the substitution is the standard convention.

Standard zz^* values

Confidence level zz^*
90 percent 1.6449 (often rounded to 1.645)
95 percent 1.9600 (often rounded to 1.96)
99 percent 2.5758 (often rounded to 2.58)

These come from the inverse-normal: z=invNorm((1+C/100)/2)z^* = \text{invNorm}((1 + C/100) / 2) for confidence level CC percent.

The 95 percent value of 1.96 is the most commonly used and worth memorising.

Interpretation: what the confidence level means

A C%C \% confidence interval has the following correct interpretation:

If the sampling procedure were repeated many times (each time computing a new p^\hat{p} and a new confidence interval from a fresh sample of size nn), approximately C%C \% of the constructed intervals would contain the true population proportion pp.

The interval describes a procedure that works C%C \% of the time, not a probability statement about a single specific interval.

Common misstatements VCAA marks against

These are widespread but incorrect:

  • "There is a 95 percent probability that the true proportion lies in this interval." (Wrong: once the interval is constructed, pp either is or is not in it; there is no probability.)
  • "95 percent of the population have a proportion in this interval." (Wrong: the interval is about the parameter pp, not about individuals.)
  • "We are 95 percent sure that p^\hat{p} is in this interval." (Wrong: p^\hat{p} is the centre of the interval by construction.)

A correct interpretation refers to the long-run success rate of the procedure.

Worked construction

A sample of n=500n = 500 patients includes 175 with a particular condition. Construct a 95 percent confidence interval for the true population prevalence pp.

Sample proportion. p^=175/500=0.35\hat{p} = 175 / 500 = 0.35.

Standard error. 0.35×0.65/500=0.0004550.02133\sqrt{0.35 \times 0.65 / 500} = \sqrt{0.000455} \approx 0.02133.

zz^* for 95 percent. 1.961.96.

Margin of error. 1.96×0.021330.04181.96 \times 0.02133 \approx 0.0418.

Confidence interval. (0.350.0418,0.35+0.0418)=(0.308,0.392)(0.35 - 0.0418, 0.35 + 0.0418) = (0.308, 0.392).

Interpretation: 95 percent of intervals constructed by this procedure would contain the true population prevalence.

Margin of error and sample size

The margin of error decreases as the sample size grows. Specifically:

MoE=zp^(1p^)n\text{MoE} = z^* \sqrt{\frac{\hat{p}(1 - \hat{p})}{n}}

To halve the margin of error, the sample size must quadruple (because of the n\sqrt{n} in the denominator).

To design a study with a specific margin of error, solve for nn:

n(z)2p^(1p^)MoE2n \geq \frac{(z^*)^2 \hat{p} (1 - \hat{p})}{\text{MoE}^2}

Always round up to the next integer (you cannot have a fractional person).

Worst-case sample size

If no prior estimate of p^\hat{p} is available, use p^=0.5\hat{p} = 0.5 in the design formula, which maximises p^(1p^)=0.25\hat{p}(1 - \hat{p}) = 0.25. This gives the largest required nn and so a conservative design.

For 95 percent confidence with MoE=0.03\text{MoE} = 0.03:

n(1.96)2×0.25/(0.03)2=0.9604/0.00091067n \geq (1.96)^2 \times 0.25 / (0.03)^2 = 0.9604 / 0.0009 \approx 1067.

The "1000 sample" convention in public opinion polling comes from approximately this calculation.

Confidence level and margin trade-off

Higher confidence level requires wider interval. For fixed nn and p^\hat{p}:

  • 90 percent CI is narrower than 95 percent CI is narrower than 99 percent CI.
  • The trade-off is between certainty (higher confidence) and precision (narrower interval).

To increase confidence without widening, you must increase nn.

When the normal approximation is invalid

The formula assumes:

  • np^10n \hat{p} \geq 10 and n(1p^)10n (1 - \hat{p}) \geq 10 (or 5, depending on convention).
  • The sample is random and independent.

For very small samples or proportions very close to 0 or 1, the normal approximation breaks down and an exact binomial-based interval is needed. VCE Methods stays inside the regime where the normal approximation is valid; an out-of-regime question would typically flag the issue.

Calculator workflow (Paper 2)

Most CAS systems have a built-in confidence-interval function for proportions, often as zInterval_1Prop or similar. The inputs are usually xx (number of successes), nn (sample size), and confidence level. The output is the interval.

For Paper 2 questions, the calculator output is acceptable, but always show the formula or set-up: p^\hat{p}, SE, zz^* at minimum.

Examples in context

Example 1. Customer satisfaction survey. A business surveys 400400 customers and finds 260260 are satisfied, so p^=260400=0.65\hat{p} = \frac{260}{400} = 0.65. The standard error is 0.65×0.35400=0.000569=0.02385\sqrt{\frac{0.65 \times 0.35}{400}} = \sqrt{0.000569} = 0.02385. For a 95%95\% interval, z=1.96z^* = 1.96, giving margin 1.96×0.02385=0.04671.96 \times 0.02385 = 0.0467. The interval is (0.603,0.697)(0.603, 0.697): across repeated samples, about 95%95\% of such intervals capture the true satisfaction rate.

Example 2. Designing a poll. A pollster wants a 95%95\% margin of error no more than 0.0250.025 and has no prior estimate of pp, so uses the worst case p^=0.5\hat{p} = 0.5. Then n(1.96)2(0.25)(0.025)2=0.96040.000625=1536.6n \ge \frac{(1.96)^2(0.25)}{(0.025)^2} = \frac{0.9604}{0.000625} = 1536.6, so at least 15371537 people must be surveyed.

Try this

Q1. A sample of 250250 has p^=0.4\hat{p} = 0.4. Find the standard error. [2 marks]

  • Cue. 0.4×0.6250=0.00096=0.0310\sqrt{\frac{0.4 \times 0.6}{250}} = \sqrt{0.00096} = 0.0310.

Q2. From 600600 items, 9090 are faulty. Construct a 95%95\% confidence interval for the true fault rate. [3 marks]

  • Cue. p^=0.15\hat{p} = 0.15; SE =0.15×0.85600=0.01458= \sqrt{\frac{0.15 \times 0.85}{600}} = 0.01458; margin 1.96×0.01458=0.02861.96 \times 0.01458 = 0.0286; interval (0.121,0.179)(0.121, 0.179).

Q3. State why "there is a 95% probability the true proportion lies in (0.121, 0.179)" is an incorrect interpretation. [2 marks]

  • Cue. Once computed, the interval either contains pp or not; the 95%95\% refers to the long-run success rate of the procedure across many samples, not this single interval.

Exam-style practice questions

Practice questions written in the style of VCAA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

2024 VCAA Paper 24 marksA poll of 800 voters in an electorate found that 432 supported candidate A. (a) Find a 95 percent confidence interval for the true proportion of voters supporting candidate A. (b) State what the 95 percent confidence interval means.
Show worked answer →

(a) Confidence interval.

Sample proportion: p^=432/800=0.54\hat{p} = 432 / 800 = 0.54.

Standard error: SE=p^(1p^)n=0.54×0.46800=0.24848000.01762\text{SE} = \sqrt{\frac{\hat{p}(1 - \hat{p})}{n}} = \sqrt{\frac{0.54 \times 0.46}{800}} = \sqrt{\frac{0.2484}{800}} \approx 0.01762.

For 95 percent confidence, z=1.96z^* = 1.96.

Margin of error: 1.96×0.017620.03451.96 \times 0.01762 \approx 0.0345.

Interval: (0.540.0345,0.54+0.0345)=(0.5055,0.5745)(0.54 - 0.0345, 0.54 + 0.0345) = (0.5055, 0.5745), or approximately (0.506,0.575)(0.506, 0.575) to 3 decimal places.

(b) Interpretation. If many such samples of 800 voters were drawn and a 95 percent confidence interval constructed from each, approximately 95 percent of those intervals would contain the true population proportion of voters supporting candidate A.

Markers reward correct computation of p^\hat{p}, the SE formula using p^\hat{p}, the right zz^* for 95 percent, and an interpretation that does not say "there is a 95 percent probability that the true pp is in this specific interval" (a common but incorrect interpretation).

2023 VCAA Paper 23 marksA consumer organisation tested 200 light bulbs and found that 30 were defective. (a) Construct an approximate 90 percent confidence interval for the true proportion of defective bulbs. (b) State the smallest sample size needed to achieve a margin of error no greater than 0.03 at 90 percent confidence, assuming the true proportion is approximately 0.15.
Show worked answer →

(a) Confidence interval.

p^=30/200=0.15\hat{p} = 30 / 200 = 0.15.

SE: 0.15×0.85/200=0.0006380.0252\sqrt{0.15 \times 0.85 / 200} = \sqrt{0.000638} \approx 0.0252.

For 90 percent confidence, z=1.645z^* = 1.645.

Margin: 1.645×0.02520.04151.645 \times 0.0252 \approx 0.0415.

Interval: (0.150.0415,0.15+0.0415)=(0.1085,0.1915)(0.15 - 0.0415, 0.15 + 0.0415) = (0.1085, 0.1915), approximately (0.109,0.192)(0.109, 0.192).

(b) Sample size for margin 0.03.

Margin: zp(1p)/n0.03z^* \sqrt{p(1-p)/n} \leq 0.03.

Substitute: 1.6450.15×0.85/n0.031.645 \sqrt{0.15 \times 0.85 / n} \leq 0.03.

Solve: 0.1275/n0.03/1.6450.01824\sqrt{0.1275 / n} \leq 0.03 / 1.645 \approx 0.01824.

Square: 0.1275/n0.0003330.1275 / n \leq 0.000333.

So n0.1275/0.000333383.1n \geq 0.1275 / 0.000333 \approx 383.1.

The smallest sample size is n=384n = 384 (round up).

Markers reward correct margin formula, correct zz^* for 90 percent, and rounding up to the next integer.

Related dot points