What is only testing one fence?

Always check both the lower and the upper fence. A question often hides a low outlier while you focus on an obvious high one.

NSWMaths Standard 2Syllabus dot point

How do you test whether a value is an outlier, name the shape of a data set, and write a full description of a distribution?

Determine outliers using the interquartile range, describe and interpret the shape and features of a distribution (symmetry, skewness, modality, centre, spread and outliers) and compare data displays using these features

A focused answer to the HSC Maths Standard 2 dot point on outliers and describing distributions. The 1.5 times IQR outlier test with lower and upper fences, telling symmetric from positively and negatively skewed data, unimodal versus bimodal shape, and writing a full describe-the-distribution answer covering shape, centre, spread and outliers, with worked Australian examples.

Generated by Claude Opus 4.814 min answerUpdated 2026-06-21

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

What this dot point is asking

NESA wants you to do two linked jobs. First, apply a definite rule to decide whether an extreme value is an outlier: the $1.5 \times \text{IQR}$ test, which builds a lower and an upper "fence" from the quartiles and flags anything beyond them. Second, describe a distribution in words, naming its shape (symmetric, or skewed left or right), its modality (one peak or two), its centre, its spread, and any outliers. Almost every Data Analysis question that shows a graph or a data set ends with "describe the distribution" or "is this an outlier", so these are among the most reliable marks in the module. The arithmetic is light; the marks are won by stating the fence you test against, showing the comparison, and using the right vocabulary for the shape.

The answer

There are two skills here, and they share one idea: the middle of the data is steady, and you measure everything relative to it. Outliers are found by stepping a fixed distance ( $1.5 \times \text{IQR}$ ) out from the quartiles. Shape is read from how the data sits around its centre - balanced (symmetric) or lopsided (skewed).

The interquartile range, quickly

The interquartile range is the spread of the middle half of the data:

\text{IQR} = Q_3 - Q_1,

where $Q_1$ is the lower quartile (a quarter of the way through the ordered data) and $Q_3$ is the upper quartile (three quarters of the way through). The IQR ignores the extreme top and bottom quarters, so it is not distorted by a single wild value - which is exactly why the outlier test is built on it.

The $1.5 \times \text{IQR}$ outlier test

An outlier is a value that lies unusually far from the rest of the data. The standard test draws two fences:

lower fence $= Q_1 - 1.5 \times \text{IQR}$ ,
upper fence $= Q_3 + 1.5 \times \text{IQR}$ .

Any value below the lower fence or above the upper fence is an outlier. Everything between the fences is treated as ordinary. The number line below shows the fences built out from the quartiles, with two values flagged because they fall beyond them.

The two outliers in the diagram, at $3$ and $46$ , are labelled with their values, not just coloured, so they are identifiable even in black and white. Notice the test treats high and low extremes the same way: always check both fences, because a question may hide a low outlier while you stare at an obvious high one.

Shape: symmetry and skew

The shape of a distribution is how the data sits around its centre. There are three shapes you must name on sight:

Symmetric: the data is balanced about the centre, so the left and right halves are near mirror images. The mean and median are roughly equal.
Positively skewed (skewed to the right): most data is bunched at the low end with a long tail stretching to the right. The few large values pull the mean above the median.
Negatively skewed (skewed to the left): most data is bunched at the high end with a long tail stretching to the left. The few small values pull the mean below the median.

The skew is named for the direction the tail points, which trips up many students: a right-pointing tail is positive skew even though the bulk of the data is on the left. The three smooth curves below show the shapes side by side, with the mean and median marked so you can see how skew pulls them apart.

Modality: how many peaks

Modality counts the clear peaks in the data:

Unimodal: one clear peak (one mode). Most single-group data is unimodal.
Bimodal: two clear, separate peaks. Two peaks almost always means two groups have been combined - for example heights of male and female students, or sales on weekdays versus weekends. When you see bimodal data, the useful comment is that the data may be better split and described as two groups.

A set with no clear peak (all bars about level) is sometimes called uniform, but unimodal and bimodal are the two you will name most.

Writing a full "describe the distribution" answer

When a question says "describe the distribution", markers expect a checklist, not a vibe. Cover four features, in this order:

Shape - symmetric, positively skewed, or negatively skewed (and mention bimodal if there are two peaks).
Centre - quote the median (preferred when the data is skewed or has an outlier) or the mean, with its value.
Spread - quote the IQR (preferred when skewed) or the range, with its value.
Outliers - state any outliers (ideally justified by the $1.5 \times \text{IQR}$ test) and whether they are kept.

A reliable sentence frame is: "The distribution is [shape], centred at [median] with a spread (IQR) of [value], and [has one outlier at .../ has no outliers]." Pairing median with IQR is the safe choice, because both resist outliers; pair mean with standard deviation only when the data is roughly symmetric.

How exam questions ask about outliers and shape

The wording maps straight onto a method:

"Is [value] an outlier?" or "Determine whether ... is an outlier" - run the $1.5 \times \text{IQR}$ test: find the IQR, find the relevant fence, then state the comparison and conclusion.
"Show that [value] is an outlier" - the answer is already known, so the marks are entirely in the working: fence calculation plus the comparison.
"Describe the shape" or "What is the shape of the distribution?" - name symmetric, positive skew or negative skew (the tail names the skew), and add modality if there are two peaks.
"Describe the distribution" - the full four-part answer: shape, centre, spread, outliers.
"Which measure of centre is more appropriate?" - the median if the data is skewed or has an outlier, because it resists extreme values; otherwise the mean.
"Compare the two distributions" - compare like with like: centre against centre and spread against spread, using the median and IQR, then note shape and outliers.

Testing for outliers and describing shape

Three worked examples: a clean upper-fence outlier test, the same data with the extreme value reduced so there is no outlier, and a full describe-the-distribution answer.

Test a high value with the $1.5 \times \text{IQR}$ rule

A netball team's goals across $11$ games, in order, are $8, 9, 11, 12, 13, 14, 15, 16, 17, 18, 40$ . The quartiles are $Q_1 = 11$ and $Q_3 = 17$ . Is the score of $40$ an outlier?

Find the IQR. Subtract the quartiles:

\text{IQR} = Q_3 - Q_1 = 17 - 11 = 6.

Build the upper fence. A high value is tested against $Q_3 + 1.5 \times \text{IQR}$ , and $1.5 \times 6 = 9$ , so

\text{upper fence} = 17 + 9 = 26.

Compare and conclude. Since $40 > 26$ , the score of $40$ is an outlier. (The lower fence is $11 - 9 = 2$ ; the smallest value $8$ is above it, so there is no low outlier.)

Same data, no outlier

Now suppose the largest score had instead been $20$ , giving $8, 9, 11, 12, 13, 14, 15, 16, 17, 18, 20$ . The quartiles are unchanged at $Q_1 = 11$ and $Q_3 = 17$ , so the IQR is still $6$ and the upper fence is still $26$ .

Compare the largest value. The largest value is now $20$ , and

20 < 26,

so it lies inside the upper fence. There is no outlier. This shows the point of the test: it is the fence, not a gut feeling, that decides. A value of $20$ "looks" high but is ordinary; a value of $40$ is genuinely beyond the fence.

A full describe-the-distribution answer

The waiting times (in minutes) for $11$ patients are $3, 20, 22, 23, 24, 25, 26, 27, 28, 29, 46$ , with $Q_1 = 22$ , median $25$ and $Q_3 = 28$ . Describe the distribution.

Test for outliers first. The IQR is $28 - 22 = 6$ , so $1.5 \times \text{IQR} = 9$ and the fences are

\text{lower fence} = 22 - 9 = 13, \qquad \text{upper fence} = 28 + 9 = 37.

Since $3 < 13$ and $46 > 37$ , both $3$ and $46$ are outliers.

Now write the four features.

Shape: apart from the two extremes the middle values are tightly and evenly packed around $25$ , so the central data is roughly symmetric, with one outlier at each end.
Centre: the median is $25$ minutes, preferred here because the two outliers would distort the mean.
Spread: the IQR is $6$ minutes for the middle half; the full range of $46 - 3 = 43$ minutes is badly inflated by the outliers.
Outliers: there are two, at $3$ and $46$ minutes, both beyond the fences; they should be checked (a value of $3$ minutes may be a recording error, while $46$ may be a genuinely long wait).

So the waiting times are roughly symmetric in the middle, centred at a median of $25$ minutes with an IQR of $6$ minutes, but with one low outlier at $3$ and one high outlier at $46$ .

Common traps

Naming the skew backwards: The skew is named for the tail, not the bulk. A long tail to the right is positive (right) skew, even though most of the data sits on the left. A long tail to the left is negative skew.
Forgetting the $1.5$: The fences use $Q_1 - 1.5 \times \text{IQR}$ and $Q_3 + 1.5 \times \text{IQR}$ , not $Q_1 - \text{IQR}$ . Leaving out the $1.5$ makes the fences far too tight and flags ordinary values.
Only testing one fence: Always check both the lower and the upper fence. A question often hides a low outlier while you focus on an obvious high one.
Quoting the mean for a skewed set: When data is skewed or has an outlier, the median (with the IQR) describes the centre and spread better, because the mean is dragged towards the tail.
Giving an incomplete description: "Describe the distribution" needs all four of shape, centre, spread and outliers. Missing one loses a mark.

In plain English

Picture the marks of a class lined up shortest to tallest. Most people sit in a sensible middle band, and you can draw two "fences" a fixed step out on each side; anyone standing past a fence is an outlier, a genuine odd-one-out rather than just a bit different. The shape is about whether the crowd is balanced or lopsided: if there is a long thin straggle of people off to one side, the data is "skewed" towards that side, and the skew is named for where the straggle points, not where the crowd is. A single bump in the crowd means one main group; two separate bumps usually means two different groups got mixed together. When someone asks you to describe the data, you just answer four quick questions: what shape is the crowd, where is its middle, how spread out is it, and is anyone standing out past the fences.

Exam-style practice questions

Practice questions written in the style of NESA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

2022 HSC-style3 marksA data set has a lower quartile of

Q_1 = 18

and an upper quartile of

Q_3 = 26

. The largest value is

44

. Determine whether

44

is an outlier, showing your working.

Show worked answer →

A full-mark response computes the IQR, $26 - 18 = 8$ , then the upper fence, $Q_3 + 1.5 \times \text{IQR} = 26 + 1.5 \times 8 = 26 + 12 = 38$ .

It then states the comparison explicitly: $44 > 38$ , therefore $44$ is an outlier.

Markers award one mark for the IQR, one for correctly applying the $1.5 \times \text{IQR}$ rule to get the fence $38$ , and one for the comparison and conclusion. A bare "yes" with no fence shown scores poorly, even if the answer is correct.

2021 HSC-style4 marksThe histogram of weekly earnings for a group of workers is bunched at the lower end with a long tail stretching to the right, where a few workers earn much more. (a) Name the shape of the distribution. (b) State whether the mean or the median is the larger measure of centre, and explain why. (c) State which measure better represents a typical worker, with a reason.

Show worked answer →

Part (a): the distribution is positively skewed (skewed to the right) - the long tail points to the right.

Part (b): the mean is larger than the median, because the small number of very high earners in the right tail pull the mean up, while the median (the middle position) is barely affected.

Part (c): the median better represents a typical worker, because it is resistant to the few extreme high incomes that distort the mean.

Markers reward the correct shape name, the correct mean-versus-median direction WITH the tail reasoning, and a justified choice of the median for a typical value. Naming the skew the wrong way (a common slip) loses the part (a) and part (b) marks together.

2023 HSC-style4 marksA set of

12

daily maximum temperatures has a five-number summary with minimum

19

Q_1 = 24

, median

27

Q_3 = 30

and maximum

41

. (a) Show that

41

is an outlier. (b) Describe the distribution, referring to shape, centre and spread.

Show worked answer →

Part (a): IQR $= 30 - 24 = 6$ ; upper fence $= 30 + 1.5 \times 6 = 30 + 9 = 39$ ; since $41 > 39$ , the value $41$ is an outlier.

Part (b): shape is positively skewed (the upper tail is stretched by the high value); centre is a median of $27$ degrees (preferred over the mean because of the outlier); spread is an IQR of $6$ degrees for the middle half, with a full range of $41 - 19 = 22$ degrees inflated by the outlier.

Markers award the outlier test (fence plus comparison), then one mark each for a correctly justified shape, centre and spread. Quoting the median rather than the mean for the centre of a skewed set is part of what is rewarded here.

Practice questions

Original practice questions graded from foundation to exam level, each with a full worked solution. Try them before revealing the solution.

foundation2 marksFor a data set the lower quartile is

Q_1 = 13

and the upper quartile is

Q_3 = 17

. (a) Find the interquartile range. (b) Find the lower and upper outlier fences using the

1.5 \times \text{IQR}

rule.

Show worked solution →

Part (a) - interquartile range. The IQR is the upper quartile minus the lower quartile:

\text{IQR} = Q_3 - Q_1 = 17 - 13 = 4

Part (b) - the two fences. The lower fence sits $1.5 \times \text{IQR}$ below $Q_1$ and the upper fence sits $1.5 \times \text{IQR}$ above $Q_3$ . Here $1.5 \times 4 = 6$ , so

\text{lower fence} = Q_1 - 1.5 \times \text{IQR} = 13 - 6 = 7

\text{upper fence} = Q_3 + 1.5 \times \text{IQR} = 17 + 6 = 23

Any value below $7$ or above $23$ would be flagged as an outlier. (Check: the fences sit one full $1.5 \times \text{IQR}$ step outside each quartile, so they should straddle the quartiles symmetrically, and $7$ and $23$ are each $6$ away from $13$ and $17$ .)

foundation2 marksA data set is positively skewed (skewed to the right). (a) State which is larger, the mean or the median. (b) State on which side the long tail of the data lies.

Show worked solution →

Part (a) - mean versus median. In a positively skewed set the few large values in the tail pull the mean upward, while the median (a position, not a total) barely moves. So the mean is greater than the median.

Part (b) - the tail. Positive skew means the data is stretched out towards the high (positive) end, so the long tail points to the right. The bulk of the data is bunched on the left with a few large values trailing off to the right. (Memory hook: the skew is named for the direction the tail points, so "positive / right skew" has its tail on the right.)

foundation1 marksA histogram of student heights shows two clear, separate peaks. State the modality of the distribution and suggest what the two peaks might represent.

Show worked solution →

Count the peaks. Two clear, separate peaks means the distribution is bimodal.

Interpret the peaks. Two peaks usually signals two groups combined into one data set. For heights, a plausible explanation is that the data mixes two subgroups, for example male and female students, each clustering around its own typical height. (A single-peak set is unimodal; no clear peak is sometimes called uniform.)

core3 marksThe number of goals scored by a netball team across

11

games, in order, is

8, 9, 11, 12, 13, 14, 15, 16, 17, 18, 40

. The five-number summary gives

Q_1 = 11

and

Q_3 = 17

. Use the

1.5 \times \text{IQR}

rule to test whether

40

is an outlier.

Show worked solution →

Find the IQR. Subtract the quartiles:

\text{IQR} = Q_3 - Q_1 = 17 - 11 = 6

Find the upper fence. A high value is tested against the upper fence, $Q_3 + 1.5 \times \text{IQR}$ . Here $1.5 \times 6 = 9$ , so

\text{upper fence} = 17 + 9 = 26

Compare and conclude. The value $40$ is greater than the upper fence $26$ , so

40 > 26

means $40$ is an outlier by the $1.5 \times \text{IQR}$ rule. (For completeness the lower fence is $11 - 9 = 2$ , and the smallest value $8$ is above $2$ , so there is no low outlier. State the fence you cross, then the comparison: that is the line markers reward.)

core3 marksThe waiting times (in minutes) at a clinic for

11

patients, in order, are

3, 20, 22, 23, 24, 25, 26, 27, 28, 29, 46

, with

Q_1 = 22

and

Q_3 = 28

. Test both ends for outliers using the

1.5 \times \text{IQR}

rule and list any outliers.

Show worked solution →

Find the IQR. Subtract the quartiles:

\text{IQR} = Q_3 - Q_1 = 28 - 22 = 6

Find both fences. With $1.5 \times 6 = 9$ :

\text{lower fence} = Q_1 - 1.5 \times \text{IQR} = 22 - 9 = 13

\text{upper fence} = Q_3 + 1.5 \times \text{IQR} = 28 + 9 = 37

Compare each extreme value. The smallest value is $3$ and the largest is $46$ :

3 < 13 \quad \text{and} \quad 46 > 37

so both fall outside their fences. The outliers are $3$ minutes and $46$ minutes. (Check: every other value lies between $13$ and $37$ , so exactly two points are flagged, one at each end. Always test both fences, not just the obvious big value.)

exam5 marksThe times (in minutes) for

15

commuters to travel to work, in order, are

22, 24, 25, 26, 28, 29, 30, 31, 32, 33, 35, 36, 38, 40, 72

. For this data

Q_1 = 26

, the median is

31

and

Q_3 = 36

. (a) Find the IQR. (b) Use the

1.5 \times \text{IQR}

rule to test whether

72

is an outlier. (c) Describe the distribution, commenting on shape, centre, spread and outliers.

Show worked solution →

Part (a) - the IQR. Subtract the quartiles:

\text{IQR} = Q_3 - Q_1 = 36 - 26 = 10

Part (b) - test the value $72$ . A high value is tested against the upper fence. With $1.5 \times 10 = 15$ :

\text{upper fence} = Q_3 + 1.5 \times \text{IQR} = 36 + 15 = 51

Since

72 > 51

the value $72$ is an outlier.

Part (c) - describe the distribution. Work through the four features in order:

Shape: ignoring the outlier the values rise fairly evenly, but the long upper tail (one value far above the rest) makes the data positively skewed (skewed to the right).
Centre: the median is $31$ minutes; the median is the better measure of centre here because the outlier would inflate the mean.
Spread: the IQR is $10$ minutes (the middle half of commuters are within a $10$ minute band); the full range is $72 - 22 = 50$ minutes, stretched by the outlier.
Outliers: there is one outlier at $72$ minutes, well above the upper fence of $51$ ; this is a genuine value (a very long commute, perhaps a transport delay) rather than an error, so it should be kept but noted.

So the travel times are positively skewed with a median of $31$ minutes, an IQR of $10$ minutes, and one high outlier at $72$ minutes. (Check: the median $31$ sits inside the quartiles $26$ and $36$ as it must, and only the single value $72$ crosses a fence.)

exam6 marksA teacher records two class quiz results out of

20

. Class A, in order, is

4, 6, 7, 8, 9, 10, 11, 12, 13, 14, 20

with

Q_1 = 7

, median

10

Q_3 = 13

. Class B, in order, is

12, 14, 15, 16, 16, 17, 17, 18, 18, 19, 20

with

Q_1 = 15

, median

17

Q_3 = 18

. (a) Test Class A for outliers using the

1.5 \times \text{IQR}

rule. (b) Describe the shape of each class. (c) Write one or two sentences comparing the two classes' centre and spread.

Show worked solution →

Part (a) - test Class A for outliers. First the IQR:

\text{IQR}_A = Q_3 - Q_1 = 13 - 7 = 6

With $1.5 \times 6 = 9$ , the fences are

\text{lower fence} = 7 - 9 = -2, \qquad \text{upper fence} = 13 + 9 = 22

The smallest value is $4$ and the largest is $20$ , and

4 > -2 \quad \text{and} \quad 20 < 22

so both extremes lie inside the fences: Class A has no outliers.

Part (b) - shape of each class. For Class A the values are spread fairly evenly and the mean and median are close, so the shape is roughly symmetric. For Class B the marks bunch up near the top (the maximum is $20$ ) with a tail of lower marks trailing down to $12$ , so Class B is negatively skewed (skewed to the left).

Part (c) - compare centre and spread. Class B has the higher centre (median $17$ versus $10$ ), so Class B performed better overall. Class B is also more consistent: its IQR is $Q_3 - Q_1 = 18 - 15 = 3$ , smaller than Class A's IQR of $6$ , so Class B's middle marks are more tightly clustered. (Check: comparing like with like, both statements use the median for centre and the IQR for spread, which is the safe pairing when a set may be skewed.)

What this dot point is asking

The answer

The interquartile range, quickly

The 1.5×IQR1.5 \times \text{IQR}1.5×IQR outlier test

Shape: symmetry and skew

Modality: how many peaks

Writing a full "describe the distribution" answer

How exam questions ask about outliers and shape

Exam-style practice questions

Practice questions

Related dot points

The $1.5 \times \text{IQR}$ outlier test