How do the mean and variance of linear combinations of random variables behave, and how do we use a sample mean to test a hypothesis about a population mean?

Linear combinations of independent random variables and their mean and variance, the distribution of the sample mean $\bar{X}$, the construction of confidence intervals for a population mean, and hypothesis testing for the mean using a $p$ value

A focused answer to the VCE Specialist Mathematics Unit 4 key-knowledge point on linear combinations and statistical inference. Mean and variance of linear combinations, the distribution of the sample mean, confidence intervals, and hypothesis testing with p values, with a verified worked example.

Generated by Claude Opus 4.8. 7 min answer. Updated 2026-05-29.

Reviewed by: AI editorial process; not yet individually human-reviewed

Quick answer

Means add for any random variables, independent or not: $E(aX + bY) = aE(X) + bE(Y)$. For independent variables, variances add after squaring the coefficients: $\mathrm{Var}(aX + bY) = a^2\mathrm{Var}(X) + b^2\mathrm{Var}(Y)$. The sample mean of $n$ independent observations from a population with mean $\mu$ and standard deviation $\sigma$ has $E(\bar{X}) = \mu$ and $\mathrm{Var}(\bar{X}) = \frac{\sigma^2}{n}$, and by the central limit theorem $\bar{X}$ is approximately normal for large $n$. A confidence interval for $\mu$ is $\bar{x} \pm z\frac{\sigma}{\sqrt{n}}$. A hypothesis test compares the observed $\bar{x}$ against a null mean $\mu_0$ via $z = \frac{\bar{x} - \mu_0}{\sigma/\sqrt{n}}$ and rejects $H_0$ when the $p$ value is below the significance level.

Jump to a section

What this dot point is asking
Mean and variance of linear combinations
The distribution of the sample mean
Confidence intervals for a population mean
Hypothesis testing for a mean
Why independence and squaring matter
Examples in context
Try this

What this dot point is asking

VCAA wants you to find the mean and variance of linear combinations of independent random variables, to use the resulting distribution of the sample mean $\bar{X}$ , to construct a confidence interval for a population mean, and to carry out a hypothesis test for a mean by computing a test statistic and a $p$ value. This is the statistical inference strand of Specialist, distinct from the proportion-based inference in Mathematical Methods.

Mean and variance of linear combinations

For any random variables, expectation is linear:

$$E(aX + bY) = aE(X) + bE(Y),$$

whether or not $X$ and $Y$ are independent. Variance, however, behaves differently. For independent $X$ and $Y$ ,

$$\mathrm{Var}(aX + bY) = a^2\mathrm{Var}(X) + b^2\mathrm{Var}(Y).$$

The coefficients are squared, and the cross term vanishes because independence makes the covariance zero. Note that even for a difference, variances add: $\mathrm{Var}(X - Y) = \mathrm{Var}(X) + \mathrm{Var}(Y)$ , since $(-1)^2 = 1$ .
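The two rules above can be sketched as a small Python helper. This is an illustrative sketch only: the function name `linear_combination_moments` and the example means and variances are hypothetical, and the variance line is valid only under the independence assumption stated above.

```python
from math import sqrt


def linear_combination_moments(terms):
    """Mean and variance of sum(a_i * X_i), assuming the X_i are independent.

    `terms` is a list of (coefficient, mean, variance) tuples.
    """
    mean = sum(a * mu for a, mu, _ in terms)            # coefficients enter linearly
    variance = sum(a ** 2 * var for a, _, var in terms)  # coefficients enter squared
    return mean, variance


# Hypothetical values: E(X) = 10, Var(X) = 4, E(Y) = 6, Var(Y) = 9.
# X - Y has coefficients +1 and -1; the variances still add.
mean_diff, var_diff = linear_combination_moments([(1, 10, 4), (-1, 6, 9)])
sd_diff = sqrt(var_diff)
```

Passing the coefficient $-1$ explicitly makes it visible that the sign affects the mean but disappears from the variance once squared.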

The distribution of the sample mean

Take a random sample $X_1, X_2, \dots, X_n$ of independent observations from a population with mean $\mu$ and standard deviation $\sigma$ . The sample mean is $\bar{X} = \frac{1}{n}\sum X_i$ . Applying the linear-combination rules:

$$E(\bar{X}) = \mu, \qquad \mathrm{Var}(\bar{X}) = \frac{\sigma^2}{n}, \qquad \mathrm{sd}(\bar{X}) = \frac{\sigma}{\sqrt{n}}.$$

So the sample mean is unbiased for $\mu$, and its spread shrinks as $n$ grows. If the population itself is normal, $\bar{X}$ is exactly normal for every $n$. Otherwise, by the central limit theorem, for large $n$ the distribution of $\bar{X}$ is approximately normal, $\bar{X} \sim N\!\left(\mu, \frac{\sigma^2}{n}\right)$, regardless of the population's shape. The quantity $\frac{\sigma}{\sqrt{n}}$ is the standard error of the mean.
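For reference, the variance of $\bar{X}$ follows in one line from the linear-combination rule, treating $\bar{X}$ as a sum of $n$ independent terms that each carry the coefficient $\frac{1}{n}$:

$$\mathrm{Var}(\bar{X}) = \mathrm{Var}\!\left(\frac{1}{n}X_1 + \dots + \frac{1}{n}X_n\right) = \frac{1}{n^2}\left(\sigma^2 + \dots + \sigma^2\right) = \frac{n\sigma^2}{n^2} = \frac{\sigma^2}{n}.$$

The mean works the same way without the squaring: $E(\bar{X}) = \frac{1}{n}(\mu + \dots + \mu) = \mu$.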

Confidence intervals for a population mean

An approximate $C\%$ confidence interval for $\mu$ , when $\sigma$ is known and $n$ is large, is

$$\bar{x} \pm z\,\frac{\sigma}{\sqrt{n}},$$

where $z$ is the standard normal value capturing the central $C\%$ . For a 95% interval, $z \approx 1.96$ . The interval is a range of plausible values for $\mu$ ; the confidence level refers to the long-run proportion of such intervals that would contain the true mean, not the probability that $\mu$ lies in this one fixed interval.
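As a rough illustration, the interval can be computed with Python's standard library. The function name `mean_confidence_interval` and the sample figures are hypothetical; the sketch assumes $\sigma$ is known and $n$ is large enough for the normal approximation.

```python
from math import sqrt
from statistics import NormalDist


def mean_confidence_interval(x_bar, sigma, n, level=0.95):
    """Approximate z-interval for a population mean with known sigma."""
    # The central `level` of the standard normal leaves (1 - level) / 2 in each tail.
    z = NormalDist().inv_cdf(0.5 + level / 2)
    margin = z * sigma / sqrt(n)  # z times the standard error
    return x_bar - margin, x_bar + margin


# Hypothetical sample: n = 100, sample mean 20, known sigma = 5.
low, high = mean_confidence_interval(20, 5, 100)
```

Here `inv_cdf(0.975)` returns the unrounded value behind $z \approx 1.96$, so results can differ from hand calculations in the third decimal place.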

Hypothesis testing for a mean

A test of the mean compares a null hypothesis $H_0: \mu = \mu_0$ against an alternative. The steps:

State $H_0: \mu = \mu_0$ and the alternative $H_1$ (one-sided, $\mu > \mu_0$ or $\mu < \mu_0$, or two-sided, $\mu \neq \mu_0$).
Compute the test statistic $z = \dfrac{\bar{x} - \mu_0}{\sigma/\sqrt{n}}$, which measures how many standard errors the observed mean sits from $\mu_0$.
Find the $p$ value: the probability, assuming $H_0$ is true, of observing a sample mean at least as extreme as $\bar{x}$.
Compare the $p$ value with the significance level $\alpha$ (commonly $0.05$). If $p < \alpha$, reject $H_0$; otherwise do not reject it.
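The four steps above can be sketched in Python as follows. The function name `z_test`, the labels for the `alternative` argument, and the sample data are all hypothetical choices for illustration; the sketch assumes $\sigma$ is known.

```python
from math import sqrt
from statistics import NormalDist


def z_test(x_bar, mu_0, sigma, n, alternative="two-sided"):
    """Return (z, p) for a z-test of a mean with known sigma.

    `alternative` is "greater" (mu > mu_0), "less" (mu < mu_0) or "two-sided".
    """
    z = (x_bar - mu_0) / (sigma / sqrt(n))  # distance from mu_0 in standard errors
    std_normal = NormalDist()
    if alternative == "greater":
        p = 1 - std_normal.cdf(z)              # upper tail only
    elif alternative == "less":
        p = std_normal.cdf(z)                  # lower tail only
    elif alternative == "two-sided":
        p = 2 * (1 - std_normal.cdf(abs(z)))   # both tails
    else:
        raise ValueError("alternative must be 'greater', 'less' or 'two-sided'")
    return z, p


# Hypothetical data: x_bar = 103, mu_0 = 100, sigma = 15, n = 36, alpha = 0.05.
z, p = z_test(103, 100, 15, 36, alternative="two-sided")
reject = p < 0.05
```

With these figures the standard error is $2.5$ and $z = 1.2$, which is not extreme enough to reject $H_0$ at the 5% level.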

Worked example

A one-sided test of the mean

A machine is meant to fill bottles to a mean of $\mu_0 = 500$ mL, with known standard deviation $\sigma = 8$ mL. A sample of $n = 16$ bottles has mean $\bar{x} = 504$ mL. Test at the 5% level whether the machine overfills ($H_1: \mu > 500$).

Hypotheses: $H_0: \mu = 500$ versus $H_1: \mu > 500$ (one-sided).
Standard error: $\dfrac{\sigma}{\sqrt{n}} = \dfrac{8}{\sqrt{16}} = \dfrac{8}{4} = 2$ mL.
Test statistic: $z = \dfrac{\bar{x} - \mu_0}{\sigma/\sqrt{n}} = \dfrac{504 - 500}{2} = \dfrac{4}{2} = 2$ .
$p$ value: For a one-sided upper test, $p = P(Z > 2)$ . From the standard normal, $P(Z > 2) \approx 0.0228$ .
Decision: Since $p \approx 0.0228 < 0.05$ , we reject $H_0$ at the 5% level. There is evidence that the machine overfills.
Sense check: The observed mean is two standard errors above the target, which is reasonably extreme, so rejecting $H_0$ is consistent with $z = 2$ sitting beyond the one-sided critical value $z = 1.645$ .

Why independence and squaring matter

The single most error-prone step is the variance rule. Means combine linearly with their coefficients, but variances combine with the squares of the coefficients, and only when the variables are independent. This is why doubling a measurement ( $2X$ ) quadruples its variance, and why averaging $n$ independent values divides the variance by $n$ rather than by $\sqrt{n}$ . Keeping variance and standard deviation distinct (one is the square of the other) prevents most slips in this strand.
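A short simulation can make both claims concrete. This is a sketch under arbitrary assumptions: the population (normal with mean 0 and standard deviation 3), the sample size, the number of trials and the seed are illustrative choices, not values from the study design.

```python
import random
from statistics import fmean, pvariance

random.seed(0)  # fixed seed so repeated runs give the same numbers

SIGMA = 3.0      # hypothetical population sd, so Var(X) = 9
N = 9            # hypothetical sample size
TRIALS = 20_000  # number of simulated values / samples

# Doubling a measurement: Var(2X) should be 4 * Var(X), not 2 * Var(X).
xs = [random.gauss(0, SIGMA) for _ in range(TRIALS)]
var_x = pvariance(xs)
var_2x = pvariance([2 * x for x in xs])

# Averaging N independent values: the variance should be near Var(X) / N.
sample_means = [
    fmean([random.gauss(0, SIGMA) for _ in range(N)]) for _ in range(TRIALS)
]
var_mean = pvariance(sample_means)
```

On a run like this, `var_2x` should come out as four times `var_x`, and `var_mean` should sit close to $9/9 = 1$ rather than $9/\sqrt{9} = 3$.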

Common exam traps

Adding standard deviations: Standard deviations do not add. Work with variances ( $a^2\mathrm{Var}(X) + b^2\mathrm{Var}(Y)$ ), then take the square root at the end if you need a standard deviation.
Subtracting variances for a difference: $\mathrm{Var}(X - Y) = \mathrm{Var}(X) + \mathrm{Var}(Y)$ for independent variables. Variances always add here because the coefficient $-1$ is squared.
Using $\sigma$ instead of the standard error: In a confidence interval or test statistic for the mean, the spread is $\frac{\sigma}{\sqrt{n}}$ , not $\sigma$ .
Wrong tail for the $p$ value: A one-sided test uses one tail; a two-sided test doubles the tail probability. Match the $p$ value to the alternative hypothesis.

Examples in context

Example 1. Variance of a sum. If $X$ and $Y$ are independent with $\mathrm{Var}(X) = 4$ and $\mathrm{Var}(Y) = 9$ , then $\mathrm{Var}(2X + Y) = 4(4) + 1(9) = 25$ , so $\mathrm{sd} = 5$ .

Example 2. Confidence interval. A sample of $n = 25$ with $\bar{x} = 50$ and known $\sigma = 10$ gives standard error $\frac{10}{5} = 2$ , so a 95% interval is $50 \pm 1.96(2) = 50 \pm 3.92$ , that is $(46.08, 53.92)$ .

Try this

Q1. $X$ and $Y$ are independent with $\mathrm{Var}(X) = 5$ , $\mathrm{Var}(Y) = 3$ . Find $\mathrm{Var}(X - Y)$ . [2 marks]

Cue. Variances add: $5 + 3 = 8$ .

Q2. A sample of $n = 36$ from a population with $\sigma = 12$ has mean $\bar{x} = 70$ . State the standard error of the mean. [1 mark]

Cue. $\frac{12}{\sqrt{36}} = \frac{12}{6} = 2$ .

Q3. For a test of $H_0: \mu = 100$ against $H_1: \mu \neq 100$ , the test statistic is $z = 2$ . State the $p$ value. [2 marks]

Cue. Two-sided: $p = 2\,P(Z > 2) \approx 2(0.0228) = 0.0456$ .

Exam-style practice questions

Practice questions written in the style of VCAA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

2023 VCAA (2 marks)

The time, $X_c$ minutes, taken to drive to a station is normal with mean 20 and standard deviation 6. The waiting time, $X_w$ minutes, for a train is normal with mean 8 and standard deviation 3. The time, $X_t$ minutes, on the train is normal with mean 12 and standard deviation 5. The three times are independent. Find the mean and standard deviation of the total time, in minutes, it takes to travel from home to the city.

The total time is the linear combination $T = X_c + X_w + X_t$.

Mean: for a sum, means add (regardless of independence). $E(T) = 20 + 8 + 12 = 40$ minutes.

Variance: because the three times are independent, variances add. $\mathrm{Var}(T) = 6^2 + 3^2 + 5^2 = 36 + 9 + 25 = 70$.

Standard deviation: $\mathrm{sd}(T) = \sqrt{\mathrm{Var}(T)} = \sqrt{70} \approx 8.37$ minutes.

So the total travel time has mean 40 minutes and standard deviation $\sqrt{70}$ minutes (approximately 8.37 minutes). A common error is to add the standard deviations ($6 + 3 + 5 = 14$); only the variances add.
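The arithmetic in this answer can be checked with a few lines of Python. The variable names are hypothetical; the means and standard deviations are the ones given in the question.

```python
from math import sqrt

# (mean, standard deviation) for the drive, wait and train times, in minutes.
stages = [(20, 6), (8, 3), (12, 5)]

total_mean = sum(mean for mean, _ in stages)
total_var = sum(sd ** 2 for _, sd in stages)  # square each sd before adding
total_sd = sqrt(total_var)

wrong_sd = sum(sd for _, sd in stages)        # the common error: adding sds directly
```

Comparing `total_sd` with `wrong_sd` shows how much the incorrect method overstates the spread.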