Skip to main content
NSWMaths Standard 2Syllabus dot point

How are measures of spread used to describe how widely data is spread out, and how do they let you compare two data sets?

Calculate measures of spread, including the range, quartiles and interquartile range, and the population standard deviation using technology

A focused answer to the HSC Maths Standard 2 dot point on measures of spread. The range, the quartiles and interquartile range, the five-number summary, the population standard deviation from a calculator, and how to compare the spread of two data sets, with worked Australian examples.

Generated by Claude Opus 4.814 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

What this dot point is asking

NESA wants you to measure how spread out a set of data is, not just where its centre sits. Two classes can share the same average mark yet look completely different - one bunched tightly around the mean, the other scattered from fails to top marks. A measure of spread puts a number on that scatter. You need three of them: the range (the crudest), the interquartile range or IQR (the spread of the middle half), and the standard deviation (the average distance of the values from the mean, read straight off a calculator). You also need to assemble the five-number summary and to use these measures to compare two data sets. The arithmetic for range and IQR is light; the marks are won by quoting the right figure for the right purpose and by reading the population standard deviation, σn\sigma_n, correctly from the calculator.

The answer

A measure of spread answers one question: how far apart are the values? There are three you must know, from crudest to most refined.

  • Range == maximum - minimum. One subtraction, but it uses only the two extreme values, so a single odd value distorts it.
  • Interquartile range (IQR) =Q3Q1= Q_3 - Q_1. This is the spread of the middle 50%50\% of the data, so it ignores the lowest and highest quarters and is barely affected by extreme values.
  • Standard deviation (σn)(\sigma_n) == roughly the average distance of the values from the mean. It uses every value, and in this course you read it from your calculator rather than computing it by hand.

The first two are built from the quartiles, so start there.

Quartiles and the five-number summary

The median splits ordered data into a lower half and an upper half. The quartiles split it into quarters:

  • Q1Q_1 (the lower quartile) is the median of the lower half - a quarter of the way through the data.
  • Q2Q_2 is the median itself - halfway through.
  • Q3Q_3 (the upper quartile) is the median of the upper half - three quarters of the way through.

The method this page uses throughout (the standard HSC approach): first find the median. If there is an odd number of values, leave the middle value out of both halves. If there is an even number, split straight down the middle. Then Q1Q_1 is the median of the lower half and Q3Q_3 is the median of the upper half. Used consistently, this gives the quartiles your calculator's statistics mode reports.

Putting the pieces together, the five-number summary lists, in order:

minimum,Q1,median,Q3,maximum.\text{minimum}, \quad Q_1, \quad \text{median}, \quad Q_3, \quad \text{maximum}.

These five numbers carve the data into four quarters and are the exact inputs a box plot is drawn from. The number line below shows them for the ordered set 3,5,6,8,9,11,12,14,15,17,193, 5, 6, 8, 9, 11, 12, 14, 15, 17, 19.

Five-number summary and interquartile range on a number lineA number line from zero to twenty. Five points are marked on the line: the minimum at three, the lower quartile Q1 at six, the median at eleven, the upper quartile Q3 at fifteen, and the maximum at nineteen. A bracket above the line spans from Q1 to Q3 and is labelled interquartile range equals nine. A bracket below the line spans the whole data from minimum to maximum and is labelled range equals sixteen.The five-number summary on a number line02468101214161820min 3Q₁ 6median 11Q₃ 15max 19IQR = 9range = 16

The range

The range is the simplest measure of spread:

range=maximumminimum.\text{range} = \text{maximum} - \text{minimum}.

For the set above, range =193=16= 19 - 3 = 16. It is quick and tells you the full width of the data, but because it relies only on the two extreme values it is very sensitive: one freak value sends it shooting up, even if every other value is tightly packed. Use it for a rough sense of spread, and reach for the IQR when extremes might mislead.

The interquartile range

The interquartile range measures the spread of the central half of the data:

IQR=Q3Q1.\text{IQR} = Q_3 - Q_1.

For the set above, IQR =156=9= 15 - 6 = 9. Because it chops off the bottom quarter and the top quarter, the IQR ignores extreme values almost entirely - which is its great strength. When a data set has an unusually high or low value, the IQR describes the typical spread far better than the range does. It is also the basis of the 1.5×IQR1.5 \times \text{IQR} outlier test you meet on the next dot point.

The standard deviation (population, σn\sigma_n)

The standard deviation measures spread by averaging how far each value sits from the mean. Unlike the range and IQR, it uses every value in the data. A small standard deviation means the values cluster close to the mean; a large one means they are widely scattered. The smallest it can ever be is 00, which happens only when every value is identical.

In Mathematics Standard you are not asked to compute it by hand from the formula - you read it from the calculator's statistics mode. There are two versions, and you must pick the right one:

  • Population standard deviation σn\sigma_n (sometimes shown as σx\sigma x) - this is the HSC convention and the one you quote unless a question says otherwise.
  • Sample standard deviation sn1s_{n-1} (sometimes sxsx) - slightly larger; not the value used in this course.

Finding the standard deviation on a calculator

The exam expects you to drive your calculator's statistics mode confidently. The steps are the same on every approved model, with only the key names differing:

  1. Enter statistics mode (often MODE then STAT or SD), choosing single-variable (1-VAR) data.
  2. Clear any previous data before you start - an uncleared list is the most common cause of a wrong answer.
  3. Type each value, pressing the data-entry key (commonly DT, M+, or = inside a list) after each one. For repeated values, many calculators let you enter the value then a frequency.
  4. Read off the statistics. Find the key for the population standard deviation, labelled σn\sigma_n or σx\sigma x. The mean is labelled xˉ\bar{x}.
  5. Round as asked, usually to two decimal places, and quote σn\sigma_n (not sn1s_{n-1}).

A quick sanity check: the standard deviation is never negative and is usually smaller than the range. If your calculator shows a value larger than the range, you have read the wrong key or mis-entered the data.

Comparing the spread of two data sets

A favourite exam task gives you two data sets, often with a similar mean, and asks which is more spread out or more consistent. The routine:

  1. Compare the centres first (usually the mean or median) so you know whether you are genuinely comparing spread.
  2. Compare a measure of spread - the standard deviation for the overall scatter, or the IQR if extreme values are in play. The data set with the smaller spread is the more consistent one.
  3. Write the comparison in words, quoting the figures: "Class A's standard deviation (5.155.15) is far smaller than Class B's (13.1713.17), so Class A's marks are more consistent and tightly clustered."

The diagram below compares two data sets with the same median but very different spread.

Comparing the spread of two data sets with the same medianTwo number lines drawn one above the other on the same horizontal scale from zero to sixteen. The upper line, set A, has its five marked points crowded together from six to twelve, a small spread. The lower line, set B, has its five marked points stretched from three to fifteen, a large spread. Both sets share a median of nine, marked on each line, showing that two data sets can have the same centre but very different spread.Same centre, different spreadAB0481216median 9small spreadmedian 9large spread

How exam questions ask about spread

The wording tells you which measure to reach for:

  • "Find the range" - subtract the smallest value from the largest.
  • "Find the five-number summary" - list minimum, Q1Q_1, median, Q3Q_3, maximum, in that order.
  • "Find the interquartile range / IQR" - find Q1Q_1 and Q3Q_3, then subtract: Q3Q1Q_3 - Q_1.
  • "Find the standard deviation" (with no other instruction) - read σn\sigma_n from the calculator and round as asked.
  • "Which data set is more consistent / less variable / more spread out?" - compare a measure of spread (smaller spread == more consistent), and quote the figures.
  • "Compare the two data sets" - comment on both centre (mean or median) and spread (IQR or standard deviation), in words, using the numbers.

Exam-style practice questions

Practice questions written in the style of NESA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

2021 HSC-style3 marksThe ages of seven members of a sports club are 1414, 1616, 1919, 2323, 2828, 3535, 4141. (a) Find the median age. (b) Find the interquartile range. (c) Explain why the interquartile range may describe the spread of these ages better than the range.
Show worked answer →

Median: with 77 ordered values the median is the 44th value, 2323.

IQR: leaving out the median, the lower half is 14,16,1914, 16, 19 (median 1616) and the upper half is 28,35,4128, 35, 41 (median 3535), so Q1=16Q_1 = 16, Q3=35Q_3 = 35 and IQR =3516=19= 35 - 16 = 19.

Explanation: the range (4114=2741 - 14 = 27) depends only on the two extreme ages and is dragged up by the oldest member, whereas the IQR describes the spread of the middle half of the ages, so it is more resistant to a single unusually old or young member.

Markers award one mark for the median, one for a correctly worked IQR, and one for a clear statement that the IQR is less affected by extreme values than the range.

2022 HSC-style4 marksTwo baristas record the time (in seconds) to make a coffee over six orders. Barista X: 3838, 4040, 4141, 4343, 4444, 4646. Barista Y: 3030, 3535, 4141, 4343, 5050, 5353. Both have a mean of about 4242 seconds. (a) Find the range for each barista. (b) State, with a reason, which barista is more consistent. (c) Describe what a larger standard deviation would tell you about a barista's times.
Show worked answer →

Ranges: Barista X has 4638=846 - 38 = 8 seconds; Barista Y has 5330=2353 - 30 = 23 seconds.

More consistent: Barista X, because with the same mean (42\approx 42 s) X's times are far less spread out (range 88 s versus 2323 s), so X's service times vary less from order to order.

Larger standard deviation: it would mean that barista's times are spread more widely around the mean, so the service is less predictable (some coffees much faster and some much slower than average).

Markers award marks for both correct ranges, for naming Barista X with a spread-based reason (not a mean-based one), and for linking a larger standard deviation to greater variability about the mean.

2023 HSC-style3 marksA data set of 1010 values, in order, is 55, 88, 99, 1111, 1212, 1414, 1515, 1717, 1919, 2626. (a) Find Q1Q_1, the median and Q3Q_3. (b) An outlier is any value above Q3+1.5×IQRQ_3 + 1.5 \times \text{IQR} or below Q11.5×IQRQ_1 - 1.5 \times \text{IQR}. Determine whether 2626 is an outlier.
Show worked answer →

Quartiles: with n=10n = 10 the median is the average of the 55th and 66th values, (12+14)÷2=13(12 + 14) \div 2 = 13. The lower half 5,8,9,11,125, 8, 9, 11, 12 gives Q1=9Q_1 = 9 and the upper half 14,15,17,19,2614, 15, 17, 19, 26 gives Q3=17Q_3 = 17.

Outlier test: IQR =179=8= 17 - 9 = 8, so the upper fence is Q3+1.5×IQR=17+1.5×8=17+12=29Q_3 + 1.5 \times \text{IQR} = 17 + 1.5 \times 8 = 17 + 12 = 29. Since 26<2926 < 29, the value 2626 is not an outlier.

Markers award one mark for the median, one for both quartiles, and one for a correctly computed upper fence (2929) with the conclusion that 2626 is not an outlier.

Practice questions

Original practice questions graded from foundation to exam level, each with a full worked solution. Try them before revealing the solution.

foundation2 marksThe daily maximum temperatures (in degrees Celsius) for a week are 1212, 1818, 77, 1515, 2222, 99, 1414. (a) Find the range. (b) State what the range tells you about the data.
Show worked solution →

Part (a) - range is largest minus smallest. Reading the list, the largest value is 2222 and the smallest is 77, so

range=227=15 degrees\text{range} = 22 - 7 = 15 \text{ degrees}

Part (b) - what it tells you. The range measures the total spread: it says the hottest and coldest days that week were 1515 degrees apart. (The range uses only the two extreme values, so a single unusually hot or cold day would change it a lot.)

foundation3 marksA set of nine data values, written in order, is 44, 77, 88, 1111, 1313, 1515, 1818, 2121, 2424. Find the five-number summary (minimum, Q1Q_1, median, Q3Q_3, maximum).
Show worked solution →
Find the minimum, maximum and median
The data is already in order. The minimum is 44 and the maximum is 2424. With 99 values the median is the 55th value (the middle one), so the median is 1313.
Split into a lower and an upper half, excluding the median
Removing the middle value 1313 leaves a lower half 4,7,8,114, 7, 8, 11 and an upper half 15,18,21,2415, 18, 21, 24.
Q1Q_1 is the median of the lower half
With four values, Q1Q_1 is the average of the middle two:

Q1=7+82=7.5Q_1 = \frac{7 + 8}{2} = 7.5

Q3Q_3 is the median of the upper half.

Q3=18+212=19.5Q_3 = \frac{18 + 21}{2} = 19.5

State the five-number summary. Minimum 44, Q1=7.5Q_1 = 7.5, median 1313, Q3=19.5Q_3 = 19.5, maximum 2424.

core3 marksThe number of goals scored by a netball team in eight games was 33, 55, 88, 99, 1212, 1414, 1717, 2020. (a) Find Q1Q_1 and Q3Q_3. (b) Hence find the interquartile range.
Show worked solution →

Part (a) - split the ordered data in half. There are 88 values (an even number), so the data splits cleanly into a lower half 3,5,8,93, 5, 8, 9 and an upper half 12,14,17,2012, 14, 17, 20.

Q1Q_1 is the median of the lower half:

Q1=5+82=6.5Q_1 = \frac{5 + 8}{2} = 6.5

Q3Q_3 is the median of the upper half:

Q3=14+172=15.5Q_3 = \frac{14 + 17}{2} = 15.5

Part (b) - the IQR is Q3Q1Q_3 - Q_1.

IQR=15.56.5=9\text{IQR} = 15.5 - 6.5 = 9

so the interquartile range is 99 goals. (The IQR is the spread of the middle 50%50\% of the games, ignoring the lowest and highest quarters.)

core3 marksThe reaction times (in hundredths of a second) of eleven people, in order, are 22, 44, 55, 77, 88, 1010, 1212, 1313, 1515, 1818, 2020. Find the interquartile range.
Show worked solution →

Find the median first. With 1111 values the median is the 66th value (the middle one):

median=10\text{median} = 10

Split into halves, leaving out the median. Below the median sit 2,4,5,7,82, 4, 5, 7, 8 and above it sit 12,13,15,18,2012, 13, 15, 18, 20.

Find Q1Q_1 and Q3Q_3 as the medians of those halves. Each half has 55 values, so each quartile is the middle one of its half:

Q1=5,Q3=15Q_1 = 5, \qquad Q_3 = 15

Compute the IQR.

IQR=Q3Q1=155=10\text{IQR} = Q_3 - Q_1 = 15 - 5 = 10

so the interquartile range is 1010. (Because the IQR throws away the bottom and top quarters, it is unaffected by the extreme value 2020, unlike the range.)

core3 marksA small data set is 44, 66, 88, 1010, 1212. (a) Find the mean. (b) Using your calculator's statistics mode, find the population standard deviation σn\sigma_n, correct to two decimal places.
Show worked solution →

Part (a) - the mean. Add the values and divide by how many there are:

xˉ=4+6+8+10+125=405=8\bar{x} = \frac{4 + 6 + 8 + 10 + 12}{5} = \frac{40}{5} = 8

Part (b) - the standard deviation from the calculator. Put the calculator into statistics (STAT / SD) mode, clear any old data, type each value pressing the data-entry key after each (44, 66, 88, 1010, 1212), then read off the population standard deviation, the key labelled σn\sigma_n (or σx\sigma x):

σn2.83\sigma_n \approx 2.83

so the population standard deviation is 2.832.83 (correct to two decimal places). (Be sure to read σn\sigma_n, not the sample value sn1s_{n-1}, which is slightly larger; σn\sigma_n is the HSC convention.)

exam5 marksTwo Year 11 classes sat the same test (marks out of 8080). Class A scored 5252, 5555, 5858, 6060, 6262, 6565, 6868. Class B scored 4040, 4848, 5555, 6060, 6666, 7373, 8181. (a) Find the range of each class. (b) Using a calculator, find the population standard deviation σn\sigma_n of each class, correct to two decimal places. (c) Both classes have a mean of about 6060. Compare the two classes' spread, referring to your figures.
Show worked solution →

Part (a) - ranges. Range is largest minus smallest:

Class A: 6852=16,Class B: 8140=41\text{Class A: } 68 - 52 = 16, \qquad \text{Class B: } 81 - 40 = 41

Part (b) - standard deviations from the calculator. Enter each class's marks in statistics mode and read σn\sigma_n:

σA5.15,σB13.17\sigma_A \approx 5.15, \qquad \sigma_B \approx 13.17

Part (c) - compare the spread. Both classes centre on a mean of about 6060, so the comparison is about spread, not centre. Every measure of spread is much larger for Class B: its range (4141) is more than double Class A's (1616), and its standard deviation (13.1713.17) is more than double Class A's (5.155.15). So Class B's marks are far more spread out, while Class A's marks are tightly bunched near the mean. In plain terms, Class A performed consistently and Class B's results were much more variable. (Because the means are equal, the smaller standard deviation - Class A - identifies the more consistent class.)

exam6 marksThe masses (in kg) of ten parcels are 1212, 1515, 1515, 1818, 2020, 2222, 2525, 2727, 3030, 3636. (a) Find the five-number summary. (b) Find the range and the interquartile range. (c) Using a calculator, find the mean and the population standard deviation σn\sigma_n, correct to two decimal places. (d) The courier flags any parcel heavier than Q3+1.5×IQRQ_3 + 1.5 \times \text{IQR} as oversized. Find this cut-off mass and state whether the 3636 kg parcel is flagged.
Show worked solution →

Part (a) - five-number summary. The data is already in order; n=10n = 10. Minimum 1212, maximum 3636. The median is the average of the 55th and 66th values:

median=20+222=21\text{median} = \frac{20 + 22}{2} = 21

The lower half is 12,15,15,18,2012, 15, 15, 18, 20 and the upper half is 22,25,27,30,3622, 25, 27, 30, 36, so

Q1=15,Q3=27Q_1 = 15, \qquad Q_3 = 27

The five-number summary is: minimum 1212, Q1=15Q_1 = 15, median 2121, Q3=27Q_3 = 27, maximum 3636.

Part (b) - range and IQR.

range=3612=24,IQR=2715=12\text{range} = 36 - 12 = 24, \qquad \text{IQR} = 27 - 15 = 12

Part (c) - mean and standard deviation. Adding the ten masses gives 220220, so the mean is

xˉ=22010=22 kg\bar{x} = \frac{220}{10} = 22 \text{ kg}

Entering the data in statistics mode and reading the population standard deviation,

σn7.16 kg\sigma_n \approx 7.16 \text{ kg}

Part (d) - the oversized cut-off. Apply the upper-fence formula with Q3=27Q_3 = 27 and IQR =12= 12:

Q3+1.5×IQR=27+1.5×12=27+18=45 kgQ_3 + 1.5 \times \text{IQR} = 27 + 1.5 \times 12 = 27 + 18 = 45 \text{ kg}

The 3636 kg parcel is below 4545 kg, so it is not flagged as oversized. (Even the heaviest parcel sits inside the fence, so this data set has no high outlier by the 1.5×1.5 \times IQR rule.)

Related dot points