How is a five-number summary turned into a box plot, and how do parallel box plots let you compare two groups by centre, spread, skew and outliers?
Construct and interpret box-and-whisker plots and use them, including parallel (side-by-side) box plots, to compare data sets in terms of centre, spread, skewness and outliers
A focused answer to the HSC Maths Standard 2 dot point on box-and-whisker plots. Building a box plot from the five-number summary, flagging an outlier with the 1.5 times IQR rule, drawing parallel box plots, and comparing two groups by centre, spread, skew and outliers, with worked Australian examples.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
What this dot point is asking
NESA wants you to turn a five-number summary into a box-and-whisker plot, and to read one the other way, off the page. You need to draw the box from the quartiles, mark the median, run the whiskers out to the extreme values, and flag any outlier as a separate dot using the rule. Then you need to put two box plots on the same scale (parallel box plots) and compare the groups by centre, spread, skew and outliers. The skill being tested is reading a distribution at a glance: a box plot is a five-number summary made visual, and comparing two of them is the quickest fair way to say which group is higher, which is more spread out, and which is lopsided.
The answer
A box-and-whisker plot is a picture of the five-number summary drawn against a number line. The box spans the middle half of the data, from the lower quartile to the upper quartile , with a line inside it at the median. From each end of the box a whisker reaches out to the smallest and largest values. Because the box covers to , its width is the interquartile range, and the whole picture lets you read centre, spread and shape without seeing a single raw number twice.
What the five pieces mean
Read the plot from left to right and each piece has a fixed meaning:
- the left whisker end is the minimum, the smallest value,
- the left edge of the box is the lower quartile , one quarter of the way through the sorted data,
- the line inside the box is the median, the middle value,
- the right edge of the box is the upper quartile , three quarters of the way through,
- the right whisker end is the maximum, the largest value.
Each section of the plot holds about a quarter of the data: from the minimum to , from to the median, from the median to , and from to the maximum. A short section means the data is bunched there; a long section means it is spread out there. That is why an uneven plot tells you the shape at a glance.
Constructing a box plot from a five-number summary
The five-number summary comes first; the picture is just a careful plot of it. The routine is always the same:
- Draw and scale a number line that comfortably covers the minimum and maximum, with even gaps.
- Draw the box from to above the line.
- Draw the median line inside the box, at the median value.
- Draw the whiskers from each end of the box out to the minimum and the maximum (unless there is an outlier; see below).
The single thing that earns or loses marks is alignment: every mark must sit at its correct value on the scale. If the scale runs to and the box should reach , its right edge must line up exactly with on the axis, not "about there".
Flagging an outlier
When a data set has a value that sits far from the rest, you do not let the whisker stretch all the way out to it. Instead you test it with the rule and, if it fails, draw it as a separate dot.
The two fences are
Any value below the lower fence or above the upper fence is an outlier. On the plot you mark each outlier as a dot at its value, and the whisker on that side stops at the most extreme value that is inside the fence. For example, with , and , the upper fence is , so a value of is an outlier: it becomes a dot at and the right whisker stops at (the largest value not over the fence).
Parallel box plots: comparing two groups
A parallel box plot (also called a side-by-side box plot) draws two or more box plots one above the other against the same number line. Sharing one scale is what makes the comparison fair and instant: you can see at a glance which group is centred higher and which is more spread out, because the boxes are measured against identical axis marks.
When you compare parallel box plots, always cover the same three things, in this order:
- Centre. Compare the medians. The group with the higher median line is centred higher. (Here Class B's median of is well to the right of Class A's .)
- Spread. Compare the IQRs (box widths) and the ranges (whisker to whisker). A wider box or longer whiskers means a more spread out, less consistent group.
- Skew (shape). Look at where the median sits in the box and how the whiskers compare. A median near the left of the box with a long right whisker is positive skew; near the right with a long left whisker is negative skew; central with even whiskers is symmetric.
Then mention any outliers shown as separate dots. The exam wants a genuine comparison ("Class B is centred about marks higher than Class A"), not two separate descriptions.
Reading shape from a box plot
The position of the median inside the box is the quickest read on shape:
- Symmetric: the median sits near the middle of the box and the two whiskers are about equal in length.
- Positively skewed: the median sits toward the left () side, and the right whisker is longer; the long tail is at the high values.
- Negatively skewed: the median sits toward the right () side, and the left whisker is longer; the long tail is at the low values.
The whisker that is longer points the way the data is skewed.
How exam questions ask about box plots
The wording varies, but each version maps to one of the methods above:
- "Construct / draw a box plot for this data." Find the five-number summary, scale a number line, draw the box from to , mark the median, and run the whiskers to the min and max (stopping at the fence if there is an outlier).
- "Show that ... is an outlier" or "Are there any outliers?" Compute the IQR and both fences, compare the suspect value, and state the conclusion. The marks are in the fence calculation, not the yes/no.
- "Describe the shape of the distribution." Read the median's position in the box and the whisker lengths: symmetric, positively skewed or negatively skewed.
- "Compare the two data sets" or "... using the parallel box plots." Compare centre (medians), spread (IQR and range) and skew, with figures, then note outliers. Compare, do not just describe each.
- "What does the box plot tell you about the middle ?" That is the box, from to , a span equal to the IQR.
Exam-style practice questions
Practice questions written in the style of NESA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
2022 HSC-style3 marksThe heights of plants (in cm) are . (a) Find the five-number summary. (b) Show that is an outlier using the rule.Show worked answer →
Five-number summary: median is the th value ; lower half gives ; upper half gives . So min , , median , , max .
Outlier test: IQR ; upper fence . Since , the value is an outlier.
Markers award one mark for a correct median and quartiles, one for the full five-number summary, and one for the fence calculation with the comparison and conclusion. A bare "yes it is an outlier" with no fence shown does not earn the outlier mark.
2021 HSC-style4 marksParallel box plots show the daily maximum temperature (in degrees Celsius) at two towns over summer. Town A: min , , median , , max . Town B: min , , median , , max . Compare the temperatures at the two towns, referring to centre, spread and skew.Show worked answer →
Centre: the medians are close ( for Town A, for Town B), so a typical day is about the same temperature in both towns.
Spread: Town A has range and IQR ; Town B has range and IQR . Town A's temperatures are much more spread out, so Town B is far more consistent.
Skew: Town A is roughly symmetric (median near the middle of the box). Town B is positively skewed (the median sits close to , with a longer upper whisker to ).
Markers reward a comparison of medians, a comparison of spread using range or IQR with numbers, and a comment on shape. The strongest answers compare (not just describe each town separately) and quote figures from the plots.
2023 HSC-style3 marksA box plot of finishing times for a charity fun run has its median line very close to the left edge of the box, with a short left whisker and a long right whisker. Describe the shape of the distribution and explain, in context, what this tells you about the finishing times.Show worked answer →
Shape: the distribution is positively skewed, because the long tail (whisker) is on the higher (right) side and the median sits near the lower quartile.
Interpretation: most runners finished in a fairly tight band of faster times (the compact left side and box), but a smaller number took much longer to finish, stretching the upper tail. So the typical (median) time is on the faster side, while a few slow finishers pull the maximum well to the right.
Markers award one mark for naming positive skew, one for linking it to the long upper whisker / median near , and one for a sensible context interpretation (a cluster of faster times with a few slow finishers).
Practice questions
Original practice questions graded from foundation to exam level, each with a full worked solution. Try them before revealing the solution.
foundation2 marksA data set has the five-number summary minimum , , median , , maximum . Describe, in order, the five marks you would draw on a box plot for these values.Show worked solution →
Recall what a box plot shows. A box-and-whisker plot marks the five-number summary against a number-line scale: the two whisker ends, the two ends of the box, and the median line inside the box.
List the five marks in order from left to right.
- left whisker end at the minimum, ,
- left edge of the box at the lower quartile , ,
- the median line inside the box, ,
- right edge of the box at the upper quartile , ,
- right whisker end at the maximum, .
The box runs from to and holds the middle half of the data, with the median line drawn at . The left whisker runs from to and the right whisker from to . (Check: the five numbers increase from left to right, , as they must on a box plot.)
foundation3 marksSeven players scored these goals over a season: . (a) Find the five-number summary. (b) Find the interquartile range.Show worked solution →
The list is already in order, so read off the median first. With values the median is the th value:
Find the quartiles from the two halves. Splitting either side of the median, the lower half is and the upper half is . The quartile is the middle of each half:
Part (a) five-number summary. The smallest value is and the largest is , so the summary is
Part (b) interquartile range. The IQR is the spread of the middle box:
(Check: the five numbers increase in order and and sit either side of the median, as required.)
foundation2 marksA box plot is drawn on a scale from to . Its left whisker starts at , the box runs from to with the median line at , and the right whisker ends at . (a) Write down the five-number summary. (b) State the range and the interquartile range.Show worked solution →
Read each feature off the plot. The whisker ends give the minimum and maximum, the box edges give the quartiles, and the line inside the box is the median.
Part (a) five-number summary.
Part (b) range and IQR. The range is the full whisker-to-whisker spread and the IQR is the width of the box:
(Check: the box width is smaller than the full range , which is always true because the box holds only the middle half.)
core4 marksThe masses (in kg) of parcels are . (a) Find the five-number summary. (b) Find the interquartile range. (c) Describe the steps you would take to draw the box plot on a scale from to .Show worked solution →
The data is already sorted, so find the median. With values the median is the th value:
Find the quartiles. Excluding the median, the lower half is and the upper half is . The middle of each half is
Part (a) five-number summary.
Part (b) interquartile range.
Part (c) drawing it. Draw a number line from to with even gaps. Above the line, draw a box from to and a vertical line inside it at the median . From the left edge of the box draw a whisker out to , and from the right edge draw a whisker out to . Label the scale. (Check: confirms the median sits inside the box, and the whiskers reach the smallest and largest masses.)
core5 marksThe reaction times (in hundredths of a second) of people are . (a) Find the five-number summary. (b) Use the rule to test the largest value for being an outlier. (c) Explain how the outlier is shown on a box plot.Show worked solution →
Find the median of the sorted values. The median is the th value:
Find the quartiles. The lower half (excluding the median) is and the upper half is , so
Part (a) five-number summary.
Part (b) outlier test. An upper outlier lies above the upper fence :
Since , the value is an outlier.
Part (c) showing it. On the box plot the outlier is drawn as a separate dot beyond the right whisker. The right whisker then stops at the largest value that is not an outlier, which is , rather than reaching . (Check: the lower fence is and the smallest value is above it, so there is no lower outlier.)
exam5 marksParallel box plots compare the trial marks of two classes, North and South. North has five-number summary . South has five-number summary . (a) Compare the two classes by median. (b) Compare them by spread (range and IQR). (c) State, with a reason, which class performed better overall.Show worked solution →
Compare the centres first. The median is the fair measure of a typical mark.
South's median is marks higher, so a typical South student scored higher than a typical North student.
Part (a) median comparison. South has the higher median ( versus ), so South is centred about marks above North.
Part (b) spread comparison. Compare both the full range and the middle-half IQR.
South has a slightly larger range and IQR, so South's marks are a little more spread out (less consistent) than North's.
Part (c) overall judgement. South performed better overall: its median is marks higher and its whole box sits to the right of North's, so South scored higher across the board. The trade-off is that South's marks were slightly less consistent (larger IQR), but the clear lift in the centre outweighs the small extra spread. (Check: both medians lie inside their own boxes, and , as required.)
exam6 marksA cafe records the number of customers in the hour after opening on weekdays and weekend days. Weekdays: . Weekends: . (a) Find the five-number summary for each group. (b) Test the largest weekend value for being an outlier. (c) Compare the two groups by centre, spread and skew, ready to draw as parallel box plots.Show worked solution →
- Weekdays: find the median and quartiles
- With sorted values the median is the th value, . The lower half is and the upper half is , so and .
- Weekends: find the median and quartiles
- The median is the th value, . The lower half is and the upper half is , so and .
- Part (a) five-number summaries
Part (b) outlier test on the weekend maximum. The weekend IQR is , so the upper fence is
Since , the value is an outlier and is drawn as a separate dot; the weekend right whisker then stops at .
Part (c) compare by centre, spread and skew.
- Centre: the weekend median () is well above the weekday median (), so weekends are busier on a typical day.
- Spread: the weekday IQR is and the weekend IQR is , so weekend counts are roughly twice as spread out; weekends are less predictable.
- Skew: the weekday plot is fairly symmetric (the median sits near the middle of the box, with similar whiskers). The weekend plot is positively skewed: the upper whisker and the outlier stretch far to the right while the lower part is compact.
(Check: each median lies inside its own box, and , and the weekend lower fence means no value can be a low outlier, consistent with the compact left tail.)
Related dot points
- Calculate measures of spread, including the range, quartiles and interquartile range, and the population standard deviation using technology
A focused answer to the HSC Maths Standard 2 dot point on measures of spread. The range, the quartiles and interquartile range, the five-number summary, the population standard deviation from a calculator, and how to compare the spread of two data sets, with worked Australian examples.
- Determine outliers using the interquartile range, describe and interpret the shape and features of a distribution (symmetry, skewness, modality, centre, spread and outliers) and compare data displays using these features
A focused answer to the HSC Maths Standard 2 dot point on outliers and describing distributions. The 1.5 times IQR outlier test with lower and upper fences, telling symmetric from positively and negatively skewed data, unimodal versus bimodal shape, and writing a full describe-the-distribution answer covering shape, centre, spread and outliers, with worked Australian examples.
- Calculate measures of central tendency, including the mean, median and mode, for both raw data and data presented in a frequency table
A focused answer to the HSC Maths Standard 2 dot point on the mean, median and mode. Finding all three from a raw list, the mean and mode from a frequency table, the mean from grouped data using class centres, and choosing the most appropriate measure when the data is skewed or has an outlier, with worked Australian examples.
- Construct and interpret frequency histograms and polygons, and cumulative frequency graphs (ogives), including using an ogive to estimate the median and quartiles of a data set
A focused answer to the HSC Maths Standard 2 dot point on frequency graphs. Drawing a frequency histogram with a polygon overlaid through the bar-top midpoints, building a cumulative frequency graph (the ogive) from the running totals, and reading the median, the quartiles and the interquartile range off the ogive, with a fully worked grouped-marks example.