Skip to main content
NSWMaths Standard 2Syllabus dot point

How is a five-number summary turned into a box plot, and how do parallel box plots let you compare two groups by centre, spread, skew and outliers?

Construct and interpret box-and-whisker plots and use them, including parallel (side-by-side) box plots, to compare data sets in terms of centre, spread, skewness and outliers

A focused answer to the HSC Maths Standard 2 dot point on box-and-whisker plots. Building a box plot from the five-number summary, flagging an outlier with the 1.5 times IQR rule, drawing parallel box plots, and comparing two groups by centre, spread, skew and outliers, with worked Australian examples.

Generated by Claude Opus 4.815 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

What this dot point is asking

NESA wants you to turn a five-number summary into a box-and-whisker plot, and to read one the other way, off the page. You need to draw the box from the quartiles, mark the median, run the whiskers out to the extreme values, and flag any outlier as a separate dot using the 1.5×IQR1.5 \times \text{IQR} rule. Then you need to put two box plots on the same scale (parallel box plots) and compare the groups by centre, spread, skew and outliers. The skill being tested is reading a distribution at a glance: a box plot is a five-number summary made visual, and comparing two of them is the quickest fair way to say which group is higher, which is more spread out, and which is lopsided.

The answer

A box-and-whisker plot is a picture of the five-number summary drawn against a number line. The box spans the middle half of the data, from the lower quartile Q1Q_1 to the upper quartile Q3Q_3, with a line inside it at the median. From each end of the box a whisker reaches out to the smallest and largest values. Because the box covers Q1Q_1 to Q3Q_3, its width is the interquartile range, and the whole picture lets you read centre, spread and shape without seeing a single raw number twice.

A box plot built from a five-number summaryA number line from zero to thirty. A box runs from the lower quartile eight to the upper quartile twenty, with a vertical median line at fourteen. A left whisker runs from the box edge at eight out to the minimum three, and a right whisker runs from twenty out to the maximum twenty-seven. The five values are labelled below the axis.Box = middle half (Q₁ to Q₃); whiskers reach min and max051015202530min 3Q₁ 8median 14Q₃ 20max 27IQR = 20 − 8 = 12

What the five pieces mean

Read the plot from left to right and each piece has a fixed meaning:

  • the left whisker end is the minimum, the smallest value,
  • the left edge of the box is the lower quartile Q1Q_1, one quarter of the way through the sorted data,
  • the line inside the box is the median, the middle value,
  • the right edge of the box is the upper quartile Q3Q_3, three quarters of the way through,
  • the right whisker end is the maximum, the largest value.

Each section of the plot holds about a quarter of the data: from the minimum to Q1Q_1, from Q1Q_1 to the median, from the median to Q3Q_3, and from Q3Q_3 to the maximum. A short section means the data is bunched there; a long section means it is spread out there. That is why an uneven plot tells you the shape at a glance.

Constructing a box plot from a five-number summary

The five-number summary comes first; the picture is just a careful plot of it. The routine is always the same:

  1. Draw and scale a number line that comfortably covers the minimum and maximum, with even gaps.
  2. Draw the box from Q1Q_1 to Q3Q_3 above the line.
  3. Draw the median line inside the box, at the median value.
  4. Draw the whiskers from each end of the box out to the minimum and the maximum (unless there is an outlier; see below).

The single thing that earns or loses marks is alignment: every mark must sit at its correct value on the scale. If the scale runs 00 to 3030 and the box should reach 2020, its right edge must line up exactly with 2020 on the axis, not "about there".

Flagging an outlier

When a data set has a value that sits far from the rest, you do not let the whisker stretch all the way out to it. Instead you test it with the 1.5×IQR1.5 \times \text{IQR} rule and, if it fails, draw it as a separate dot.

The two fences are

lower fence=Q11.5×IQR,upper fence=Q3+1.5×IQR.\text{lower fence} = Q_1 - 1.5 \times \text{IQR}, \qquad \text{upper fence} = Q_3 + 1.5 \times \text{IQR}.

Any value below the lower fence or above the upper fence is an outlier. On the plot you mark each outlier as a dot at its value, and the whisker on that side stops at the most extreme value that is inside the fence. For example, with Q1=23Q_1 = 23, Q3=29Q_3 = 29 and IQR=6\text{IQR} = 6, the upper fence is 29+1.5×6=3829 + 1.5 \times 6 = 38, so a value of 4848 is an outlier: it becomes a dot at 4848 and the right whisker stops at 3030 (the largest value not over the fence).

Parallel box plots: comparing two groups

A parallel box plot (also called a side-by-side box plot) draws two or more box plots one above the other against the same number line. Sharing one scale is what makes the comparison fair and instant: you can see at a glance which group is centred higher and which is more spread out, because the boxes are measured against identical axis marks.

Parallel box plots comparing two classes on a shared scaleTwo box plots drawn one above the other against the same axis from thirty to one hundred. The upper plot, Class A, has minimum forty, lower quartile fifty, median sixty, upper quartile seventy and maximum eighty-four. The lower plot, Class B, has minimum fifty-five, lower quartile sixty-five, median seventy-five, upper quartile eighty-five and maximum ninety-five. Class B sits to the right of Class A, showing a higher centre.Same axis = a fair comparison of centre and spread30405060708090100trial markClass AClass BA median 60B median 75

When you compare parallel box plots, always cover the same three things, in this order:

  • Centre. Compare the medians. The group with the higher median line is centred higher. (Here Class B's median of 7575 is well to the right of Class A's 6060.)
  • Spread. Compare the IQRs (box widths) and the ranges (whisker to whisker). A wider box or longer whiskers means a more spread out, less consistent group.
  • Skew (shape). Look at where the median sits in the box and how the whiskers compare. A median near the left of the box with a long right whisker is positive skew; near the right with a long left whisker is negative skew; central with even whiskers is symmetric.

Then mention any outliers shown as separate dots. The exam wants a genuine comparison ("Class B is centred about 1515 marks higher than Class A"), not two separate descriptions.

Reading shape from a box plot

The position of the median inside the box is the quickest read on shape:

  • Symmetric: the median sits near the middle of the box and the two whiskers are about equal in length.
  • Positively skewed: the median sits toward the left (Q1Q_1) side, and the right whisker is longer; the long tail is at the high values.
  • Negatively skewed: the median sits toward the right (Q3Q_3) side, and the left whisker is longer; the long tail is at the low values.

The whisker that is longer points the way the data is skewed.

How exam questions ask about box plots

The wording varies, but each version maps to one of the methods above:

  • "Construct / draw a box plot for this data." Find the five-number summary, scale a number line, draw the box from Q1Q_1 to Q3Q_3, mark the median, and run the whiskers to the min and max (stopping at the fence if there is an outlier).
  • "Show that ... is an outlier" or "Are there any outliers?" Compute the IQR and both fences, compare the suspect value, and state the conclusion. The marks are in the fence calculation, not the yes/no.
  • "Describe the shape of the distribution." Read the median's position in the box and the whisker lengths: symmetric, positively skewed or negatively skewed.
  • "Compare the two data sets" or "... using the parallel box plots." Compare centre (medians), spread (IQR and range) and skew, with figures, then note outliers. Compare, do not just describe each.
  • "What does the box plot tell you about the middle 50%50\%?" That is the box, from Q1Q_1 to Q3Q_3, a span equal to the IQR.

Exam-style practice questions

Practice questions written in the style of NESA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

2022 HSC-style3 marksThe heights of 1111 plants (in cm) are 12,15,16,18,19,20,21,22,23,24,4012, 15, 16, 18, 19, 20, 21, 22, 23, 24, 40. (a) Find the five-number summary. (b) Show that 4040 is an outlier using the 1.5×IQR1.5 \times \text{IQR} rule.
Show worked answer →

Five-number summary: median is the 66th value =20= 20; lower half 12,15,16,18,1912, 15, 16, 18, 19 gives Q1=16Q_1 = 16; upper half 21,22,23,24,4021, 22, 23, 24, 40 gives Q3=23Q_3 = 23. So min =12= 12, Q1=16Q_1 = 16, median =20= 20, Q3=23Q_3 = 23, max =40= 40.

Outlier test: IQR =2316=7= 23 - 16 = 7; upper fence =23+1.5×7=23+10.5=33.5= 23 + 1.5 \times 7 = 23 + 10.5 = 33.5. Since 40>33.540 > 33.5, the value 4040 is an outlier.

Markers award one mark for a correct median and quartiles, one for the full five-number summary, and one for the fence calculation with the comparison and conclusion. A bare "yes it is an outlier" with no fence shown does not earn the outlier mark.

2021 HSC-style4 marksParallel box plots show the daily maximum temperature (in degrees Celsius) at two towns over summer. Town A: min 1414, Q1=22Q_1 = 22, median =26= 26, Q3=30Q_3 = 30, max =38= 38. Town B: min 2020, Q1=24Q_1 = 24, median =25= 25, Q3=27Q_3 = 27, max =32= 32. Compare the temperatures at the two towns, referring to centre, spread and skew.
Show worked answer →

Centre: the medians are close (2626 for Town A, 2525 for Town B), so a typical day is about the same temperature in both towns.

Spread: Town A has range 3814=2438 - 14 = 24 and IQR 3022=830 - 22 = 8; Town B has range 3220=1232 - 20 = 12 and IQR 2724=327 - 24 = 3. Town A's temperatures are much more spread out, so Town B is far more consistent.

Skew: Town A is roughly symmetric (median near the middle of the box). Town B is positively skewed (the median 2525 sits close to Q1Q_1, with a longer upper whisker to 3232).

Markers reward a comparison of medians, a comparison of spread using range or IQR with numbers, and a comment on shape. The strongest answers compare (not just describe each town separately) and quote figures from the plots.

2023 HSC-style3 marksA box plot of finishing times for a charity fun run has its median line very close to the left edge of the box, with a short left whisker and a long right whisker. Describe the shape of the distribution and explain, in context, what this tells you about the finishing times.
Show worked answer →

Shape: the distribution is positively skewed, because the long tail (whisker) is on the higher (right) side and the median sits near the lower quartile.

Interpretation: most runners finished in a fairly tight band of faster times (the compact left side and box), but a smaller number took much longer to finish, stretching the upper tail. So the typical (median) time is on the faster side, while a few slow finishers pull the maximum well to the right.

Markers award one mark for naming positive skew, one for linking it to the long upper whisker / median near Q1Q_1, and one for a sensible context interpretation (a cluster of faster times with a few slow finishers).

Practice questions

Original practice questions graded from foundation to exam level, each with a full worked solution. Try them before revealing the solution.

foundation2 marksA data set has the five-number summary minimum =4= 4, Q1=9Q_1 = 9, median =15= 15, Q3=22Q_3 = 22, maximum =28= 28. Describe, in order, the five marks you would draw on a box plot for these values.
Show worked solution →

Recall what a box plot shows. A box-and-whisker plot marks the five-number summary against a number-line scale: the two whisker ends, the two ends of the box, and the median line inside the box.

List the five marks in order from left to right.

  • left whisker end at the minimum, 44,
  • left edge of the box at the lower quartile Q1Q_1, 99,
  • the median line inside the box, 1515,
  • right edge of the box at the upper quartile Q3Q_3, 2222,
  • right whisker end at the maximum, 2828.

The box runs from 99 to 2222 and holds the middle half of the data, with the median line drawn at 1515. The left whisker runs from 44 to 99 and the right whisker from 2222 to 2828. (Check: the five numbers increase from left to right, 4<9<15<22<284 < 9 < 15 < 22 < 28, as they must on a box plot.)

foundation3 marksSeven players scored these goals over a season: 5,8,9,12,15,18,215, 8, 9, 12, 15, 18, 21. (a) Find the five-number summary. (b) Find the interquartile range.
Show worked solution →

The list is already in order, so read off the median first. With 77 values the median is the 44th value:

median=12\text{median} = 12

Find the quartiles from the two halves. Splitting either side of the median, the lower half is 5,8,95, 8, 9 and the upper half is 15,18,2115, 18, 21. The quartile is the middle of each half:

Q1=8,Q3=18Q_1 = 8, \qquad Q_3 = 18

Part (a) five-number summary. The smallest value is 55 and the largest is 2121, so the summary is

min=5,Q1=8,median=12,Q3=18,max=21.\text{min} = 5, \quad Q_1 = 8, \quad \text{median} = 12, \quad Q_3 = 18, \quad \text{max} = 21.

Part (b) interquartile range. The IQR is the spread of the middle box:

IQR=Q3Q1=188=10.\text{IQR} = Q_3 - Q_1 = 18 - 8 = 10.

(Check: the five numbers increase in order and Q1Q_1 and Q3Q_3 sit either side of the median, as required.)

foundation2 marksA box plot is drawn on a scale from 00 to 5050. Its left whisker starts at 1010, the box runs from 1818 to 3434 with the median line at 2424, and the right whisker ends at 4646. (a) Write down the five-number summary. (b) State the range and the interquartile range.
Show worked solution →

Read each feature off the plot. The whisker ends give the minimum and maximum, the box edges give the quartiles, and the line inside the box is the median.

Part (a) five-number summary.

min=10,Q1=18,median=24,Q3=34,max=46.\text{min} = 10, \quad Q_1 = 18, \quad \text{median} = 24, \quad Q_3 = 34, \quad \text{max} = 46.

Part (b) range and IQR. The range is the full whisker-to-whisker spread and the IQR is the width of the box:

range=4610=36,IQR=3418=16.\text{range} = 46 - 10 = 36, \qquad \text{IQR} = 34 - 18 = 16.

(Check: the box width 1616 is smaller than the full range 3636, which is always true because the box holds only the middle half.)

core4 marksThe masses (in kg) of 1111 parcels are 10,12,13,15,16,18,20,21,23,24,2610, 12, 13, 15, 16, 18, 20, 21, 23, 24, 26. (a) Find the five-number summary. (b) Find the interquartile range. (c) Describe the steps you would take to draw the box plot on a scale from 00 to 3030.
Show worked solution →

The data is already sorted, so find the median. With 1111 values the median is the 66th value:

median=18 kg\text{median} = 18 \text{ kg}

Find the quartiles. Excluding the median, the lower half is 10,12,13,15,1610, 12, 13, 15, 16 and the upper half is 20,21,23,24,2620, 21, 23, 24, 26. The middle of each half is

Q1=13 kg,Q3=23 kg.Q_1 = 13 \text{ kg}, \qquad Q_3 = 23 \text{ kg}.

Part (a) five-number summary.

min=10,Q1=13,median=18,Q3=23,max=26.\text{min} = 10, \quad Q_1 = 13, \quad \text{median} = 18, \quad Q_3 = 23, \quad \text{max} = 26.

Part (b) interquartile range.

IQR=2313=10 kg.\text{IQR} = 23 - 13 = 10 \text{ kg}.

Part (c) drawing it. Draw a number line from 00 to 3030 with even gaps. Above the line, draw a box from 1313 to 2323 and a vertical line inside it at the median 1818. From the left edge of the box draw a whisker out to 1010, and from the right edge draw a whisker out to 2626. Label the scale. (Check: 13<18<2313 < 18 < 23 confirms the median sits inside the box, and the whiskers reach the smallest and largest masses.)

core5 marksThe reaction times (in hundredths of a second) of 1111 people are 20,22,23,24,25,26,27,28,29,30,4820, 22, 23, 24, 25, 26, 27, 28, 29, 30, 48. (a) Find the five-number summary. (b) Use the 1.5×IQR1.5 \times \text{IQR} rule to test the largest value for being an outlier. (c) Explain how the outlier is shown on a box plot.
Show worked solution →

Find the median of the 1111 sorted values. The median is the 66th value:

median=26\text{median} = 26

Find the quartiles. The lower half (excluding the median) is 20,22,23,24,2520, 22, 23, 24, 25 and the upper half is 27,28,29,30,4827, 28, 29, 30, 48, so

Q1=23,Q3=29,IQR=2923=6.Q_1 = 23, \qquad Q_3 = 29, \qquad \text{IQR} = 29 - 23 = 6.

Part (a) five-number summary.

min=20,Q1=23,median=26,Q3=29,max=48.\text{min} = 20, \quad Q_1 = 23, \quad \text{median} = 26, \quad Q_3 = 29, \quad \text{max} = 48.

Part (b) outlier test. An upper outlier lies above the upper fence Q3+1.5×IQRQ_3 + 1.5 \times \text{IQR}:

29+1.5×6=29+9=38.29 + 1.5 \times 6 = 29 + 9 = 38.

Since 48>3848 > 38, the value 4848 is an outlier.

Part (c) showing it. On the box plot the outlier 4848 is drawn as a separate dot beyond the right whisker. The right whisker then stops at the largest value that is not an outlier, which is 3030, rather than reaching 4848. (Check: the lower fence is 239=1423 - 9 = 14 and the smallest value 2020 is above it, so there is no lower outlier.)

exam5 marksParallel box plots compare the trial marks of two classes, North and South. North has five-number summary 42,52,60,70,8042, 52, 60, 70, 80. South has five-number summary 50,60,70,81,9250, 60, 70, 81, 92. (a) Compare the two classes by median. (b) Compare them by spread (range and IQR). (c) State, with a reason, which class performed better overall.
Show worked solution →

Compare the centres first. The median is the fair measure of a typical mark.

North median=60,South median=70.\text{North median} = 60, \qquad \text{South median} = 70.

South's median is 1010 marks higher, so a typical South student scored higher than a typical North student.

Part (a) median comparison. South has the higher median (7070 versus 6060), so South is centred about 1010 marks above North.

Part (b) spread comparison. Compare both the full range and the middle-half IQR.

North range=8042=38,South range=9250=42.\text{North range} = 80 - 42 = 38, \qquad \text{South range} = 92 - 50 = 42.

North IQR=7052=18,South IQR=8160=21.\text{North IQR} = 70 - 52 = 18, \qquad \text{South IQR} = 81 - 60 = 21.

South has a slightly larger range and IQR, so South's marks are a little more spread out (less consistent) than North's.

Part (c) overall judgement. South performed better overall: its median is 1010 marks higher and its whole box sits to the right of North's, so South scored higher across the board. The trade-off is that South's marks were slightly less consistent (larger IQR), but the clear lift in the centre outweighs the small extra spread. (Check: both medians lie inside their own boxes, 52<60<7052 < 60 < 70 and 60<70<8160 < 70 < 81, as required.)

exam6 marksA cafe records the number of customers in the hour after opening on 1111 weekdays and 1111 weekend days. Weekdays: 4,6,7,8,9,10,11,12,13,14,164, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16. Weekends: 8,10,12,14,16,18,20,22,24,26,458, 10, 12, 14, 16, 18, 20, 22, 24, 26, 45. (a) Find the five-number summary for each group. (b) Test the largest weekend value for being an outlier. (c) Compare the two groups by centre, spread and skew, ready to draw as parallel box plots.
Show worked solution →
Weekdays: find the median and quartiles
With 1111 sorted values the median is the 66th value, 1010. The lower half is 4,6,7,8,94, 6, 7, 8, 9 and the upper half is 11,12,13,14,1611, 12, 13, 14, 16, so Q1=7Q_1 = 7 and Q3=13Q_3 = 13.
Weekends: find the median and quartiles
The median is the 66th value, 1818. The lower half is 8,10,12,14,168, 10, 12, 14, 16 and the upper half is 20,22,24,26,4520, 22, 24, 26, 45, so Q1=12Q_1 = 12 and Q3=24Q_3 = 24.
Part (a) five-number summaries

Weekday: 4,  7,  10,  13,  16.\text{Weekday: } 4, \; 7, \; 10, \; 13, \; 16.

Weekend: 8,  12,  18,  24,  45.\text{Weekend: } 8, \; 12, \; 18, \; 24, \; 45.

Part (b) outlier test on the weekend maximum. The weekend IQR is 2412=1224 - 12 = 12, so the upper fence is

24+1.5×12=24+18=42.24 + 1.5 \times 12 = 24 + 18 = 42.

Since 45>4245 > 42, the value 4545 is an outlier and is drawn as a separate dot; the weekend right whisker then stops at 2626.

Part (c) compare by centre, spread and skew.

  • Centre: the weekend median (1818) is well above the weekday median (1010), so weekends are busier on a typical day.
  • Spread: the weekday IQR is 137=613 - 7 = 6 and the weekend IQR is 1212, so weekend counts are roughly twice as spread out; weekends are less predictable.
  • Skew: the weekday plot is fairly symmetric (the median 1010 sits near the middle of the box, with similar whiskers). The weekend plot is positively skewed: the upper whisker and the outlier 4545 stretch far to the right while the lower part is compact.

(Check: each median lies inside its own box, 7<10<137 < 10 < 13 and 12<18<2412 < 18 < 24, and the weekend lower fence 1218=612 - 18 = -6 means no value can be a low outlier, consistent with the compact left tail.)

Related dot points