How do we describe a discrete random variable and summarise its distribution with mean and variance?
Define a discrete random variable by its probability distribution, and calculate the expected value, variance and standard deviation
A focused answer to the HSC Maths Advanced dot point on discrete random variables. Probability distributions, expected value, variance, standard deviation, and linear transformations of a discrete random variable, with worked examples.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
What this dot point is asking
NESA wants you to recognise a discrete random variable, check that its probability distribution is valid, compute the expected value and variance from the distribution, and apply the linear-transformation rules to . Everything starts from the probability distribution, the list of values with their probabilities, so reading and validating that table is the first marked move in almost every question.
The answer
Discrete random variables and their distributions
A discrete random variable takes a countable list of values with probabilities . The list of values with their probabilities is the probability distribution of . For it to be valid, two conditions must hold:
- for every (each is a genuine probability),
- (something must happen).
The spike graph above is the natural picture: each value sits on the horizontal axis and the height of its spike is its probability, so the heights are the and they must add to . The probability that falls in some set is the sum of for the values in that set. For example, if takes integer values from .
Expected value
The expected value (or mean) of is the long-run average value if we repeated the experiment many times. It is the weighted sum
The expected value need not be one of the values can actually take; it is a balance point, not an outcome.
Expected value of a function of
For any function ,
The most common case is , which gives
This is the quantity you build to find the variance, so it is worth setting up as its own column of working.
Variance and standard deviation
The variance of measures spread around the mean. By definition it is the expected squared deviation,
which is algebraically equivalent (and almost always easier to compute) to
The standard deviation is , in the same units as , which is why it is the spread measure you can compare directly against the mean.
Linear transformations
If for constants and ,
Shifting by slides the mean but leaves the spread untouched; scaling by multiplies the mean by and the standard deviation by (and the variance by ). These rules let you find the mean and variance of without rebuilding any sums.
Reading a distribution and finding its mean, stage by stage
The two diagrams here use the distribution , , , .
Stage 1, read the distribution and check it is valid. Whether it arrives as a two-row table or as the spike graph above, the first move is the same: confirm the probabilities are between and and sum to . Here , so the distribution is valid and you can build calculation columns from it. (If a constant were involved, you would solve for it first.)
Stage 2, find the expected value as the balance point. The mean is the weighted sum . Picture the probabilities as weights placed along the axis: is the point where the bar would balance, marked by the fulcrum below. Note that is not one of the values can take, which is exactly what "balance point, not an outcome" means.
Presenting a distribution as a table
In the exam a discrete distribution is usually laid out as a two-row table: the values on top and the probabilities underneath. Reading it correctly is the first marked step. Check the probabilities sum to (solve for any unknown if a constant is involved), then build the calculation columns you need: for the mean and for . Laying the work out in columns keeps the arithmetic tidy and is exactly what markers look for.
Interpreting expected value and variance
The expected value is the balance point of the distribution: if you placed the probabilities as weights along a number line, is where it would balance, as the fulcrum in the diagram shows. The variance measures how widely the values spread around that balance point, in squared units, and the standard deviation brings it back to the original units so it can be compared with the mean. A small standard deviation means the outcomes cluster tightly around the mean; a large one means they are spread out. This interpretation is what justifies the linear-transformation rules: shifting every value left or right slides the balance point but leaves the spread untouched, while stretching the scale stretches both.
Why is the practical formula
The definition is conceptually clear but arithmetically painful because it subtracts inside every term. The equivalent is almost always faster: build one extra column of , sum it, and subtract the square of the mean. The two formulas are algebraically identical, so use the second to compute and quote the first to explain.
How exam questions ask about discrete random variables
- "Show that the table is a valid probability distribution" or "find the value of ." Check and solve for any unknown.
- "Find " or "." Add the relevant ; for "at least" it is often quicker to use .
- "Find the expected value / mean." Compute the weighted sum , showing the products.
- "Find the variance / standard deviation." Build , then , then square-root for .
- "Let . Find and / ." Apply , , .
Edge cases worth knowing
- An unknown probability via the sum. If one entry is missing or given as , find it from before any mean or variance work.
- The mean is not an attainable value. is a balance point, so a fair die has mean even though you can never roll . Do not "round it to a face".
- A symmetric distribution. If the probabilities are symmetric about a central value, that value is the mean immediately, with no weighted sum required.
- Negative-looking variance. Variance is a sum of squared terms times probabilities, so it can never be negative; a negative result signals an arithmetic slip, usually confused with .
Exam-style practice questions
Practice questions written in the style of NESA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
2022 HSC Q244 marksThe discrete random variable has probability distribution , , , . Find and .Show worked answer →
.
For the variance, first compute .
.
Markers reward the explicit weighted sum for , the use of for the variance, and clean arithmetic.
2021 HSC Q253 marksA discrete random variable has and . Let . Find and the standard deviation of .Show worked answer →
Linearity of expectation: .
Variance scales by the square of the coefficient and is unchanged by adding a constant: .
Standard deviation: .
Markers expect explicit use of and , with the standard deviation as the positive square root.
Related dot points
- Use probability density functions and cumulative distribution functions to find probabilities, medians, modes, means and variances of continuous random variables
A focused answer to the HSC Maths Advanced dot point on continuous random variables. Probability density functions, cumulative distribution functions, computing probabilities by integration, and finding mean, median, mode and variance, with worked examples.
- Use the normal distribution, z-scores, the empirical rule and the standard normal table to find probabilities and percentiles
A focused answer to the HSC Maths Advanced dot point on the normal distribution. Standardising with z-scores, the 68-95-99.7 empirical rule, computing probabilities as areas under the curve and inverse-normal percentiles, with worked examples and exam traps.
- Construct scatter plots, calculate and interpret Pearson's correlation coefficient, and fit and use the least-squares regression line
A focused answer to the HSC Maths Advanced dot point on bivariate data. Scatter plots, the Pearson correlation coefficient, the least-squares regression line, prediction, and the limits of extrapolation, with worked examples and exam traps.
- Use simple and compound interest formulas to find future values, present values, interest rates and time periods
A focused answer to the HSC Maths Advanced dot point on simple and compound interest. The two formulas, conversion between annual and per-period rates, present and future value calculations, and the effect of compounding frequency, with worked examples.