How does the sample proportion behave across repeated random samples, and why is it approximately normal?
Describe the sampling distribution of the sample proportion, including its mean, standard error and approximate normality for large samples
WACE Year 12 Mathematics Methods Unit 4 the sample proportion: random sampling, the sampling distribution of p-hat, its mean p, the standard error, and approximate normality for large samples, with worked SCSA-style examples.
✦ Generated by Claude Opus 4.7·6 min answer·
Reviewed by: AI editorial process; not yet individually human-reviewed
SCSA Unit 4 introduces statistical inference by studying how the sample proportion behaves across repeated samples. This dot point asks you to describe its sampling distribution, the mean, the standard error and the approximate normality, which underpins the confidence interval that follows. It is examined in both sections.
Random sampling and the sample proportion
When a random sample of size n is drawn, the number of successes X follows a binomial distribution X∼B(n,p), where p is the unknown population proportion. The sample proportion is
p^​=nX​.
Different samples give different values of p^​, so p^​ is itself a random variable with its own distribution, called the sampling distribution.
Mean and standard error
Because p^​ is n1​ times a binomial variable, its mean and standard deviation follow from the binomial mean np and variance np(1−p).
Since E(p^​)=p, the sample proportion is an unbiased estimator: on average it equals the true proportion. The standard error has n​ in the denominator, so larger samples give less variable, more precise estimates.
Why approximate normality
For large n the binomial distribution of X is well approximated by a normal distribution, so p^​=nX​ is approximately normal too. A common rule of thumb is that both np and n(1−p) should be at least about 10. This normal approximation is what allows the confidence interval to use a z-multiplier.