What is a sample proportion, and what is the sampling distribution of $\hat{p}$ for repeated samples from a population?
The sample proportion $\hat{p}$ as a random variable, the sampling distribution of $\hat{p}$ for repeated samples of size $n$ from a population with true proportion $p$, and the normal approximation for large $n$
A focused answer to the VCE Math Methods Unit 4 key-knowledge point on the sample proportion. Defines $\hat{p}$ as a random variable, gives its mean and standard deviation, sets out the normal-approximation conditions, and works through a Paper 2 estimation question.
Reviewed by: AI editorial process; not yet individually human-reviewed
What this dot point is asking
VCAA wants you to treat the sample proportion as a random variable, identify the mean and standard deviation of its sampling distribution, and apply the normal approximation to compute sample-proportion probabilities. The dot point is the statistical-inference precursor to confidence intervals.
What is a sample proportion
Suppose a population has a true proportion $p$ of "successes" (members with some characteristic: voters for party A, defective items, smokers, opinion-poll affirmatives). A random sample of $n$ items is drawn, and the number of successes in the sample is recorded as $X$.
The sample proportion is:
$$\hat{p} = \frac{X}{n}$$
Because $X$ is random (it depends on which items happen to be sampled), $\hat{p}$ is a random variable. It varies from sample to sample.
The sampling distribution of $\hat{p}$
Repeatedly drawing samples of size $n$ from the same population and computing $\hat{p}$ each time produces a distribution of $\hat{p}$-values: the sampling distribution of $\hat{p}$.
Two facts about this distribution:
Mean of $\hat{p}$
$$E(\hat{p}) = p$$
The expected value of the sample proportion equals the population proportion. The sample proportion is an unbiased estimator of $p$.
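The idea of repeated sampling can be sketched with a short simulation. This is a minimal illustration, not part of the study design: the function name `simulate_sample_proportions` and the values `p=0.3`, `n=50` and `num_samples=5000` are assumptions chosen only for demonstration.

```python
import random

def simulate_sample_proportions(p, n, num_samples, seed=0):
    """Draw num_samples samples of size n and return the p-hat from each."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    p_hats = []
    for _ in range(num_samples):
        # Each item is a success independently with probability p.
        successes = sum(1 for _ in range(n) if rng.random() < p)
        p_hats.append(successes / n)
    return p_hats

# Illustrative values only (assumed, not taken from the text).
p_hats = simulate_sample_proportions(p=0.3, n=50, num_samples=5000)
mean_p_hat = sum(p_hats) / len(p_hats)
print(round(mean_p_hat, 3))  # should land close to the true proportion 0.3
```

The average of the simulated sample proportions sits close to the population proportion, which is what unbiasedness predicts.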
Standard deviation of $\hat{p}$
$$\text{SD}(\hat{p}) = \sqrt{\frac{p(1-p)}{n}}$$
Two interpretations:
- The standard deviation falls as $1/\sqrt{n}$. Quadrupling the sample size halves the standard deviation.
- The standard deviation is largest when $p = 0.5$. At $p = 0$ or $p = 1$, $\text{SD}(\hat{p}) = 0$ (no variability because every sample has the same proportion).
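Both interpretations can be checked numerically. The sketch below assumes a helper named `sd_p_hat` (a hypothetical name) that implements the standard deviation formula; the sample sizes and proportions are illustrative only.

```python
import math

def sd_p_hat(p, n):
    """Standard deviation of the sample proportion: sqrt(p(1-p)/n)."""
    return math.sqrt(p * (1 - p) / n)

# Quadrupling n (100 -> 400) should halve the standard deviation.
ratio = sd_p_hat(0.5, 100) / sd_p_hat(0.5, 400)
print(ratio)

# For fixed n, the standard deviation peaks at p = 0.5 and is 0 at p = 0 or 1.
for p in (0.0, 0.1, 0.5, 0.9, 1.0):
    print(p, sd_p_hat(p, 100))
```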
Conditions for the formula
The formula assumes:
- Independence. Each sample item is drawn independently. In practice, this requires either sampling with replacement, or sampling from a population large enough that each draw does not materially change the remaining proportion (typically, the population should be at least 10 times the sample size).
- Identical distribution. Each sampled item has the same probability of being a success.
These are the conditions of the binomial distribution: $X \sim \text{Bi}(n, p)$.
The normal approximation
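For large $n$, the sampling distribution of $\hat{p}$ is approximately normal:
$$\hat{p} \sim N\left(p, \frac{p(1-p)}{n}\right)$$
(Equivalently, $\hat{p}$ has mean $p$ and standard deviation $\sqrt{\frac{p(1-p)}{n}}$.)
When is "large $n$" large enough?
Standard conditions (VCAA cites both):
$$np \geq 10 \quad \text{and} \quad n(1-p) \geq 10$$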
Some texts use a different value, such as 5, as the threshold; VCAA accepts any reasonable convention. The conditions ensure the binomial is well-approximated by the normal.
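The conditions check is easy to express as a small function. This is a sketch under the assumption that the threshold is 10; the name `normal_approx_ok` and its `threshold` parameter are hypothetical, and the threshold can be changed to match another convention.

```python
def normal_approx_ok(p, n, threshold=10):
    """True when both n*p and n*(1-p) reach the threshold (assumed 10 here)."""
    return n * p >= threshold and n * (1 - p) >= threshold

print(normal_approx_ok(0.4, 200))   # expected successes 80, failures 120
print(normal_approx_ok(0.02, 100))  # expected successes only 2
```

When the check fails, the exact binomial distribution of $X$ is the safer tool.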
Why the normal approximation works
$X$ for large $n$ is approximately $N(np, np(1-p))$ by the central limit theorem. Dividing by $n$ gives $\hat{p}$ approximately $N\left(p, \frac{p(1-p)}{n}\right)$.
Computing sample-proportion probabilities
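To find $\Pr(\hat{p} < a)$, $\Pr(\hat{p} > a)$ or $\Pr(a < \hat{p} < b)$:
- Verify the normal approximation conditions ($np \geq 10$ and $n(1-p) \geq 10$).
- State the approximate distribution: $\hat{p} \sim N\left(p, \frac{p(1-p)}{n}\right)$.
- Standardise: $Z = \frac{\hat{p} - p}{\sqrt{p(1-p)/n}}$.
- Compute the probability using a calculator (normCdf) or a table.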
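The steps above can be sketched in code using Python's standard-library `statistics.NormalDist` in place of a CAS normCdf. The function name `prob_p_hat_above` and the input values are assumptions for illustration, as is the threshold of 10 in the conditions check.

```python
import math
from statistics import NormalDist

def prob_p_hat_above(a, p, n):
    """Approximate Pr(p-hat > a) using the normal approximation."""
    # Step 1: verify the conditions (threshold of 10 assumed).
    if n * p < 10 or n * (1 - p) < 10:
        raise ValueError("normal approximation conditions not met")
    # Step 2: approximate distribution of p-hat has mean p and this SD.
    sd = math.sqrt(p * (1 - p) / n)
    # Step 3: standardise.
    z = (a - p) / sd
    # Step 4: upper-tail probability from the standard normal.
    return 1 - NormalDist().cdf(z)

# Illustrative values only: p = 0.5, n = 100, so SD is about 0.05 and z is about 2.
print(round(prob_p_hat_above(0.6, 0.5, 100), 4))
```

A lower-tail probability follows as one minus this value, and an interval probability as the difference of two such tails.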
Worked example
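At a factory, 60 percent of items produced meet specification. A sample of $n$ items is taken. Find the probability that the sample proportion meeting spec is at least 0.55.
Mean: $E(\hat{p}) = p = 0.6$.
SD: $\text{SD}(\hat{p}) = \sqrt{\frac{0.6 \times 0.4}{n}}$.
Conditions: $np = 0.6n \geq 10$, $n(1-p) = 0.4n \geq 10$. Normal approximation valid.
Standardise: $z = \frac{0.55 - 0.6}{\text{SD}(\hat{p})}$.
$\Pr(\hat{p} \geq 0.55) \approx \Pr(Z \geq z)$.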
Sampling distribution shape and the central limit theorem
For small $n$, the sampling distribution of $\hat{p}$ is discrete (taking only values $0, \frac{1}{n}, \frac{2}{n}, \dots, 1$) and may be skewed if $p$ is far from 0.5. As $n$ grows, the distribution becomes both more concentrated (smaller SD) and more bell-shaped (better normal approximation). This is the central limit theorem in action.
VCE Methods does not require formal statement of the CLT, but the underlying intuition is the reason the normal approximation works.
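The discreteness and skew for small samples can be seen by listing the exact distribution of the sample proportion from the binomial probabilities. This is a minimal sketch; the function name `p_hat_distribution` and the values `p=0.1`, `n=10` are assumptions chosen to make the skew visible.

```python
from math import comb

def p_hat_distribution(p, n):
    """Map each possible value k/n of p-hat to its exact binomial probability."""
    return {k / n: comb(n, k) * p**k * (1 - p)**(n - k) for k in range(n + 1)}

# Illustrative values only: a small sample with p far from 0.5.
dist = p_hat_distribution(p=0.1, n=10)
for value, prob in dist.items():
    print(f"{value:.1f}  {prob:.4f}")
```

Most of the probability piles up at the low end (0.0, 0.1, 0.2) with a long tail to the right, so a symmetric bell curve would be a poor fit here.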
Examples in context
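Example 1. Opinion poll variability. In a region a proportion $p$ of people favour a proposal. A poll of $n$ people gives a sample proportion $\hat{p}$ with mean $p$ and standard deviation $\sqrt{\frac{p(1-p)}{n}}$. To find the chance the poll shows support under some level $a$: standardise $z = \frac{a - p}{\sqrt{p(1-p)/n}}$, so $\Pr(\hat{p} < a) \approx \Pr(Z < z)$.
Example 2. Choosing a sample size for precision. A manufacturer with a return rate $p$ wants $\text{SD}(\hat{p}) \leq s$ for a target value $s$. Solving $\sqrt{\frac{p(1-p)}{n}} \leq s$ gives $n \geq \frac{p(1-p)}{s^2}$. A sample of at least that many products (rounded up to a whole number) achieves the target precision.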
Try this
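Q1. A population has true proportion $p$. For a sample of size $n$, find the mean and standard deviation of $\hat{p}$. [2 marks]
- Cue. $E(\hat{p}) = p$; $\text{SD}(\hat{p}) = \sqrt{\frac{p(1-p)}{n}}$.
Q2. For given $p$ and $n$, find a probability of the form $\Pr(\hat{p} > a)$ using the normal approximation. [3 marks]
- Cue. $\text{SD}(\hat{p}) = \sqrt{\frac{p(1-p)}{n}}$; $z = \frac{a - p}{\text{SD}(\hat{p})}$; $\Pr(\hat{p} > a) \approx \Pr(Z > z)$.
Q3. Find the smallest $n$ so that $\text{SD}(\hat{p}) \leq s$ for a given $p$ and target $s$. [3 marks]
- Cue. $\frac{p(1-p)}{n} \leq s^2$, so $n \geq \frac{p(1-p)}{s^2}$.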
Exam-style practice questions
Practice questions written in the style of VCAA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
2024 VCAA Paper 2, 4 marks
A large city's voter list has 40 percent supporters of party A. A random sample of 200 voters is selected. (a) State the mean and standard deviation of the sample proportion $\hat{p}$ of supporters of party A. (b) Use the normal approximation to estimate the probability that $\hat{p}$ is greater than 0.45.
(a) Mean and standard deviation.
$E(\hat{p}) = p = 0.4$.
$\text{SD}(\hat{p}) = \sqrt{\frac{0.4 \times 0.6}{200}} = \sqrt{0.0012} \approx 0.0346$.
(b) Probability via normal approximation. With $np = 80$ and $n(1-p) = 120$ both larger than 10, the normal approximation is valid.
$\hat{p} \sim N(0.4, 0.0346^2)$ approximately.
Standardise: $z = \frac{0.45 - 0.4}{0.0346} \approx 1.443$.
$\Pr(Z > 1.443) \approx 0.0745$ (from normCdf).
So $\Pr(\hat{p} > 0.45)$ is approximately $0.0745$ or 7.45 percent.
Markers reward the formula for $\text{SD}(\hat{p})$, checking the normal-approximation conditions, the standardisation, and a final probability with sensible decimal places.
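The arithmetic in this answer can be checked with a few lines of Python, using the standard-library `statistics.NormalDist` as a stand-in for normCdf. The variable names are illustrative; the inputs (40 percent support, 200 voters, cut-off 0.45) come from the question.

```python
import math
from statistics import NormalDist

p, n, cutoff = 0.4, 200, 0.45

sd = math.sqrt(p * (1 - p) / n)   # standard deviation of p-hat
z = (cutoff - p) / sd             # standardised cut-off
prob = 1 - NormalDist().cdf(z)    # Pr(Z > z)

# Roughly 0.0346, 1.443 and 0.0745 respectively.
print(round(sd, 4), round(z, 3), round(prob, 4))
```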
2023 VCAA Paper 2, 3 marks
In a population, 30 percent of items are defective. A sample of $n$ items is taken. (a) Find the smallest value of $n$ for which the standard deviation of $\hat{p}$ is less than 0.02. (b) State an assumption needed for the formula for $\text{SD}(\hat{p})$ to apply.
(a) Find $n$.
$\text{SD}(\hat{p}) = \sqrt{\frac{0.3 \times 0.7}{n}} = \sqrt{\frac{0.21}{n}}$.
Set $\sqrt{\frac{0.21}{n}} < 0.02$. Square both sides: $\frac{0.21}{n} < 0.0004$. So $n > 525$.
The smallest integer is $n = 526$.
(b) Assumption. The sample must be a simple random sample (each item chosen independently) from a population large enough that drawing one item does not materially change the proportion remaining. Equivalent assumption: the sample is drawn with replacement, or the sample size is much smaller than the population.
Markers reward the SD formula, the algebraic manipulation to isolate $n$, and an independence / random-sample condition.
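Part (a) can be sketched in code. Because the boundary case lands exactly on 0.02, the sketch uses exact fractions rather than floating point so the strict inequality is handled correctly. The function name `smallest_n_for_sd_below` is hypothetical.

```python
import math
from fractions import Fraction

def smallest_n_for_sd_below(p, target_sd):
    """Smallest integer n with sqrt(p(1-p)/n) strictly below target_sd."""
    p = Fraction(p)
    target_sd = Fraction(target_sd)
    # Squaring the inequality gives n > p(1-p) / target_sd^2 (strict).
    bound = p * (1 - p) / target_sd**2
    return math.floor(bound) + 1

# Values from the question: 30 percent defective, SD below 0.02.
print(smallest_n_for_sd_below("0.3", "0.02"))  # 526
```

Passing the inputs as strings lets `Fraction` read 0.3 and 0.02 exactly, so the bound evaluates to exactly 525 and the next integer up is returned.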
Related dot points
- The normal distribution with mean $\mu$ and standard deviation $\sigma$, the standard normal $Z$, the use of the empirical 68/95/99.7 rule, and computation of normal probabilities and inverse probabilities using technology or standard tables
A focused answer to the VCE Math Methods Unit 4 key-knowledge point on the normal distribution. The pdf, the standardisation transformation $Z = \frac{X - \mu}{\sigma}$, the empirical rule, and the inverse-probability technique. Includes worked Paper 2 examples and standard CAS workflows.
- Approximate confidence intervals for a population proportion $p$ based on the sample proportion $\hat{p}$, including the standard 90, 95 and 99 percent intervals and their interpretation
A focused answer to the VCE Math Methods Unit 4 key-knowledge point on confidence intervals. The formula, the standard $z$ values for 90, 95 and 99 percent intervals, the correct interpretation language, and the relationship between sample size, margin of error and confidence level.
- Continuous random variables, their probability density functions, cumulative distribution functions, expected value (mean), variance and standard deviation, and computation of probabilities as definite integrals
A focused answer to the VCE Math Methods Unit 4 key-knowledge point on continuous random variables. Defines the probability density function and cumulative distribution function, computes mean and variance as definite integrals, and works through the conditions a pdf must satisfy and the standard Paper 2 set-up questions.