How is a confidence interval for a population proportion constructed and interpreted?
Approximate confidence intervals for a population proportion $p$ based on the sample proportion $\hat{p}$, including the standard 90, 95 and 99 percent intervals and their interpretation
A focused answer to the VCE Math Methods Unit 4 key-knowledge point on confidence intervals. The formula, the standard $z^*$ values for 90, 95 and 99 percent intervals, the correct interpretation language, and the relationship between sample size, margin of error and confidence level.
Have a quick question? Jump to the Q&A page
What this dot point is asking
VCAA wants you to construct confidence intervals for a population proportion from sample data, choose the correct critical value for the requested confidence level, interpret the interval correctly (avoiding the common probability-misstatement), and solve sample-size design problems based on margin-of-error targets.
The confidence interval formula
Given a sample of size with sample proportion , an approximate confidence interval for the population proportion is:
The components are:
- Point estimate. , the centre of the interval.
- Standard error (SE). , the estimated standard deviation of .
- Critical value . The number of standard errors corresponding to the confidence level.
- Margin of error (MoE). , the half-width of the interval.
The interval is .
Note. vs in the standard error
In the sampling-distribution formula, uses the true (and usually unknown) .
In the confidence-interval formula, the standard error uses in place of because is unknown. This is the plug-in standard error. For Methods purposes the substitution is the standard convention.
Standard values
| Confidence level | IMATH_23 |
|---|---|
| 90 percent | 1.6449 (often rounded to 1.645) |
| 95 percent | 1.9600 (often rounded to 1.96) |
| 99 percent | 2.5758 (often rounded to 2.58) |
These come from the inverse-normal: for confidence level percent.
The 95 percent value of 1.96 is the most commonly used and worth memorising.
Interpretation: what the confidence level means
A confidence interval has the following correct interpretation:
If the sampling procedure were repeated many times (each time computing a new and a new confidence interval from a fresh sample of size ), approximately of the constructed intervals would contain the true population proportion .
The interval describes a procedure that works of the time, not a probability statement about a single specific interval.
Common misstatements VCAA marks against
These are widespread but incorrect:
- "There is a 95 percent probability that the true proportion lies in this interval." (Wrong: once the interval is constructed, either is or is not in it; there is no probability.)
- "95 percent of the population have a proportion in this interval." (Wrong: the interval is about the parameter , not about individuals.)
- "We are 95 percent sure that is in this interval." (Wrong: is the centre of the interval by construction.)
A correct interpretation refers to the long-run success rate of the procedure.
Worked construction
A sample of patients includes 175 with a particular condition. Construct a 95 percent confidence interval for the true population prevalence .
Sample proportion. .
Standard error. .
for 95 percent. .
Margin of error. .
Confidence interval. .
Interpretation: 95 percent of intervals constructed by this procedure would contain the true population prevalence.
Margin of error and sample size
The margin of error decreases as the sample size grows. Specifically:
To halve the margin of error, the sample size must quadruple (because of the in the denominator).
To design a study with a specific margin of error, solve for :
Always round up to the next integer (you cannot have a fractional person).
Worst-case sample size
If no prior estimate of is available, use in the design formula, which maximises . This gives the largest required and so a conservative design.
For 95 percent confidence with :
.
The "1000 sample" convention in public opinion polling comes from approximately this calculation.
Confidence level and margin trade-off
Higher confidence level requires wider interval. For fixed and :
- 90 percent CI is narrower than 95 percent CI is narrower than 99 percent CI.
- The trade-off is between certainty (higher confidence) and precision (narrower interval).
To increase confidence without widening, you must increase .
When the normal approximation is invalid
The formula assumes:
- IMATH_55 and (or 5, depending on convention).
- The sample is random and independent.
For very small samples or proportions very close to 0 or 1, the normal approximation breaks down and an exact binomial-based interval is needed. VCE Methods stays inside the regime where the normal approximation is valid; an out-of-regime question would typically flag the issue.
Calculator workflow (Paper 2)
Most CAS systems have a built-in confidence-interval function for proportions, often as zInterval_1Prop or similar. The inputs are usually (number of successes), (sample size), and confidence level. The output is the interval.
For Paper 2 questions, the calculator output is acceptable, but always show the formula or set-up: , SE, at minimum.
Common errors
Using for the wrong confidence level. for 95 percent; for 90 percent; for 99 percent. Mixing them up gives an interval that is too wide or too narrow.
Forgetting to multiply by . The margin is , not just SE.
Probability misinterpretation. As noted, "the probability is in this interval" is wrong. Use the long-run-procedure language.
Sample size not rounded up. becomes , not .
Worst-case ignored. If a study design problem gives no prior estimate of , the worst-case should be used to ensure the margin is met regardless.
Standard error with instead of . Use in the CI standard error; is unknown.
In one sentence
A confidence interval for a population proportion is where is the standard normal critical value for the level ( for 90 percent, for 95 percent, for 99 percent); the correct interpretation is that approximately of intervals constructed by this procedure across repeated samples would contain the true population proportion, not that this specific interval has a probability of containing it.
Past exam questions, worked
Real questions from past VCAA papers on this dot point, with our answer explainer.
2024 VCAA Paper 24 marksA poll of 800 voters in an electorate found that 432 supported candidate A. (a) Find a 95 percent confidence interval for the true proportion of voters supporting candidate A. (b) State what the 95 percent confidence interval means.Show worked answer β
(a) Confidence interval.
Sample proportion: .
Standard error: .
For 95 percent confidence, .
Margin of error: .
Interval: , or approximately to 3 decimal places.
(b) Interpretation. If many such samples of 800 voters were drawn and a 95 percent confidence interval constructed from each, approximately 95 percent of those intervals would contain the true population proportion of voters supporting candidate A.
Markers reward correct computation of , the SE formula using , the right for 95 percent, and an interpretation that does not say "there is a 95 percent probability that the true is in this specific interval" (a common but incorrect interpretation).
2023 VCAA Paper 23 marksA consumer organisation tested 200 light bulbs and found that 30 were defective. (a) Construct an approximate 90 percent confidence interval for the true proportion of defective bulbs. (b) State the smallest sample size needed to achieve a margin of error no greater than 0.03 at 90 percent confidence, assuming the true proportion is approximately 0.15.Show worked answer β
(a) Confidence interval.
.
SE: .
For 90 percent confidence, .
Margin: .
Interval: , approximately .
(b) Sample size for margin 0.03.
Margin: .
Substitute: .
Solve: .
Square: .
So .
The smallest sample size is (round up).
Markers reward correct margin formula, correct for 90 percent, and rounding up to the next integer.
Related dot points
- The sample proportion $\hat{p}$ as a random variable, the sampling distribution of $\hat{p}$ for repeated samples of size $n$ from a population with true proportion $p$, and the normal approximation for large $n$
A focused answer to the VCE Math Methods Unit 4 key-knowledge point on the sample proportion. Defines $\hat{p}$ as a random variable, gives its mean and standard deviation, sets out the normal-approximation conditions, and works through a Paper 2 estimation question.
- The normal distribution with mean $\mu$ and standard deviation $\sigma$, the standard normal $Z$, the use of the empirical 68/95/99.7 rule, and computation of normal probabilities and inverse probabilities using technology or standard tables
A focused answer to the VCE Math Methods Unit 4 key-knowledge point on the normal distribution. The pdf, the standardisation transformation $Z = (X - \mu)/\sigma$, the empirical rule, and the inverse-probability technique. Includes worked Paper 2 examples and standard CAS workflows.
- Continuous random variables, their probability density functions, cumulative distribution functions, expected value (mean), variance and standard deviation, and computation of probabilities as definite integrals
A focused answer to the VCE Math Methods Unit 4 key-knowledge point on continuous random variables. Defines the probability density function and cumulative distribution function, computes mean and variance as definite integrals, and works through the conditions a pdf must satisfy and the standard Paper 2 set-up questions.