
How are the mean, median and mode used to summarise the centre of a data set, and which is the most appropriate measure?

Calculate measures of central tendency, including the mean, median and mode, for both raw data and data presented in a frequency table

A focused answer to the HSC Maths Standard 2 dot point on the mean, median and mode. It covers finding all three from a raw list, the mean and mode from a frequency table, and the mean from grouped data using class centres, as well as choosing the most appropriate measure when the data is skewed or has an outlier, with worked Australian examples.

Generated by Claude Opus 4.8 · 14 min answer · Updated 2026-06-21

Reviewed by: AI editorial process; not yet individually human-reviewed


What this dot point is asking

NESA wants you to summarise a whole data set with a single "centre" value, and to know there are three of them: the mean, the median and the mode. You need to find all three from a raw list, find the mean and the mode from a frequency table, and estimate the mean from grouped data using class centres. Just as important is the judgement call NESA tests right beside the arithmetic: deciding which measure best describes a particular data set, because an outlier or a skewed shape can make the mean misleading. The calculations are short. The marks are won by setting out a clear $fx$ table and by choosing the right measure with a one-line reason.

The answer

A measure of central tendency is a single number that stands for the middle, or typical value, of a data set. The three you must know are:

Mean ( $\bar{x}$ ): the arithmetic average, the sum of all values divided by how many there are.
Median: the middle value once the data is put in order.
Mode: the value that occurs most often.

Each answers "what is typical?" in a slightly different way, and they can give different numbers for the same data. The diagram below shows why that happens: when a data set has a long tail on one side (it is skewed), the mean is dragged toward the tail while the median stays near the bulk of the data.

The mean

The mean of a list is the total of all the values divided by the number of values:

$$\bar{x} = \frac{\text{sum of all values}}{\text{number of values}} = \frac{\sum x}{n}.$$

For the marks $4, 7, 7, 9, 13$ there are $n = 5$ values totalling $40$ , so $\bar{x} = 40 \div 5 = 8$ . The symbol $\bar{x}$ (read "x-bar") is the standard name for the mean. The mean uses every value, which makes it the most informative measure when the data has no extreme values - but it is exactly that sensitivity that makes a single outlier able to distort it.
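The arithmetic above can be checked with a few lines of Python. This is only a sketch for self-checking (Python is not part of the syllabus, and the variable names are illustrative): it computes the mean of the same five marks from the definition and compares it with the standard library's `statistics.mean`.

```python
from statistics import mean

marks = [4, 7, 7, 9, 13]

# Mean from the definition: sum of all values divided by how many there are.
total = sum(marks)   # sum of x
n = len(marks)       # number of values
x_bar = total / n

print(total, n, x_bar)   # 40 5 8.0

# The standard library gives the same value.
assert mean(marks) == x_bar
```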

The median

The median is the middle value of the ordered data. Always sort the data first. Then:

if there is an odd number of values, the median is the single middle one (for $n$ values it is in position $\tfrac{n+1}{2}$ ),
if there is an even number of values, the median is the average of the two middle values.

For $4, 7, 7, 9, 13$ (already ordered, $n = 5$ ) the middle is the $3$ rd value, so the median is $7$ . For $3, 5, 5, 6, 8, 9$ ( $n = 6$ ) the two middle values are the $3$ rd and $4$ th, $5$ and $6$ , so the median is $\tfrac{5+6}{2} = 5.5$ . Because the median only cares about position, an outlier hardly moves it - which is its great strength.
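The position rule can be written out as a short Python sketch. The helper name `median_by_position` is hypothetical, chosen for this illustration; it sorts first, then takes the single middle value for odd $n$ or averages the two middle values for even $n$.

```python
def median_by_position(values):
    """Median by the position rule: sort first, then take the middle."""
    ordered = sorted(values)      # always order the data first
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        # Odd n: the single middle value, in position (n + 1) / 2 counting from 1.
        return ordered[mid]
    # Even n: the average of the two middle values.
    return (ordered[mid - 1] + ordered[mid]) / 2

print(median_by_position([4, 7, 7, 9, 13]))     # 7
print(median_by_position([3, 5, 5, 6, 8, 9]))   # 5.5
print(median_by_position([13, 7, 4, 9, 7]))     # 7, the same data given unsorted
```

Because the function sorts before it looks for the middle, the unsorted list gives the same answer as the sorted one.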

The mode

The mode is the value (or category) that occurs most often. For $36, 37, 38, 38, 38, 39, 40, 40, 42$ the value $38$ occurs three times, more than any other, so the mode is $38$ . A data set can have:

no mode, if every value occurs once,
two modes (bimodal) or more, if several values tie for the highest frequency.

The mode is the only measure of centre you can use for categorical data with no natural order, such as the most popular ice-cream flavour, because you cannot add up or order such categories.
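The three possibilities (one mode, several modes, no mode) can be sketched in Python by counting frequencies. The helper name `modes` is hypothetical, and the second, third and fourth lists below are made-up illustrations rather than data from this page.

```python
from collections import Counter

def modes(values):
    """Return every value tied for the highest frequency ([] if nothing repeats)."""
    counts = Counter(values)
    if not counts:
        return []
    highest = max(counts.values())
    if highest == 1:
        return []   # every value occurs once, so there is no mode
    return sorted(v for v, c in counts.items() if c == highest)

print(modes([36, 37, 38, 38, 38, 39, 40, 40, 42]))    # [38]
print(modes([3, 5, 5, 6, 8, 8]))                      # [5, 8], bimodal (made-up list)
print(modes([4, 7, 9, 13]))                           # [], no mode (made-up list)
print(modes(["vanilla", "chocolate", "vanilla"]))     # ['vanilla'], categorical data
```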

The mean and mode from a frequency table

When data is given in a frequency table you do not write out every value. Instead you add an $fx$ column - each value $x$ multiplied by its frequency $f$ - and use:

$$\bar{x} = \frac{\sum fx}{\sum f}$$

where $\sum f$ is the total frequency (how many data values there are altogether) and $\sum fx$ is the total of the $fx$ column. The mode is simply the value with the highest frequency, and the median is the value at the middle position, found by counting down the frequencies. The worked set below shows the full $fx$ layout.
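To see that the frequency-table formula is the ordinary mean written more compactly, take the list $3, 5, 5, 6, 8, 9$ used earlier for the median. As a list, its mean is

$$\bar{x} = \frac{\sum x}{n} = \frac{3 + 5 + 5 + 6 + 8 + 9}{6} = \frac{36}{6} = 6$$

As a frequency table, the value $5$ has $f = 2$ and every other value has $f = 1$, which gives the same total and the same count:

$$\bar{x} = \frac{\sum fx}{\sum f} = \frac{3(1) + 5(2) + 6(1) + 8(1) + 9(1)}{1 + 2 + 1 + 1 + 1} = \frac{36}{6} = 6$$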

Estimating the mean from grouped data

Sometimes data is grouped into class intervals (for example $0\text{ to }10$ , $10\text{ to }20$ ) and the individual values are lost. To estimate the mean you replace each class by its class centre, the midpoint of the interval:

$$\text{class centre} = \frac{\text{lower end} + \text{upper end}}{2}.$$

Then you treat each class centre as the value $x$ and use the same frequency-table formula, $\bar{x} = \dfrac{\sum fx}{\sum f}$ . The answer is an estimate, because every value in a class is assumed to sit at the centre.

Choosing the most appropriate measure

Different data sets call for different measures. The deciding factors are outliers and skew:

Use the mean when the data is roughly symmetric with no outliers, because it uses all the information.
Use the median when there is an outlier or the data is skewed, because the median is not dragged toward extreme values. House prices and incomes are the classic examples: a few very high values inflate the mean, so the median is reported instead.
Use the mode for categorical data, or when the most common value is what matters (a shoe shop cares about the most-sold size, not the "average" size).

The dot plot below makes the effect of an outlier visible. Most values cluster between $2$ and $7$ , but a single value of $20$ pulls the mean ( $5.8$ ) to the right, well above the median and mode (both $4$ ), which stay with the cluster.
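The size of the effect can also be checked numerically. The sketch below uses a small made-up data set (an assumption for illustration only, not the values shown in the dot plot) and compares the mean and median before and after a single high value is added.

```python
from statistics import mean, median

# Hypothetical values, chosen only to illustrate the effect of one outlier.
cluster = [3, 4, 4, 5, 6]
with_outlier = cluster + [20]

print(mean(cluster), median(cluster))             # mean 4.4, median 4
print(mean(with_outlier), median(with_outlier))   # mean 7, median 4.5

# How far each measure moved when the outlier was added.
mean_shift = mean(with_outlier) - mean(cluster)
median_shift = median(with_outlier) - median(cluster)
print(round(mean_shift, 1), median_shift)         # 2.6 0.5
```

In this made-up example the one extra value moves the mean by $2.6$ but the median by only $0.5$, which is the pattern to look for when justifying the median.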

How exam questions ask about central tendency

The wording maps straight to a method:

"Find the mean / average" means $\dfrac{\sum x}{n}$ for a list, or $\dfrac{\sum fx}{\sum f}$ for a frequency table.
"Find the median" means order the data first, then take the middle (average the middle two if $n$ is even). Forgetting to order is the classic lost mark.
"Find the mode / the most common ..." means the value with the highest frequency; be ready to answer "no mode" or "bimodal".
"Estimate the mean" from a grouped table signals class centres, and the word "estimate" is your cue to say the answer is approximate.
"Which measure is most appropriate?" or "why is the mean misleading?" is asking about outliers and skew: name the outlier, and choose the median with a reason.
"The mean is larger than the median - explain" wants you to identify a high outlier or a right (positive) skew pulling the mean up.

Finding the mean, median and mode four ways

Four standard situations: the mean and median of a raw list, the mode of a list, the mean from a frequency table, and the mean from grouped data via class centres. Each uses the matching formula and shows the totals in full.

Mean and median of a list

A netball player scores the following goals across $9$ games: $14, 15, 15, 16, 17, 18, 19, 19, 20$ . Find the mean and the median.

Mean - sum over count. There are $n = 9$ values. Add them:

$$\sum x = 14 + 15 + 15 + 16 + 17 + 18 + 19 + 19 + 20 = 153$$

$$\bar{x} = \frac{153}{9} = 17.$$

Median - the middle of the ordered list. The data is already in order and $n = 9$ is odd, so the median is the value in position $\tfrac{9+1}{2} = 5$ , the $5$ th value:

$$14, 15, 15, 16, \mathbf{17}, 18, 19, 19, 20$$

The median is $17$ . Here the mean and median agree, which is what you expect when the data is fairly symmetric with no outlier.

The mode of a list

The shoe sizes sold in one hour at a store are $36, 37, 38, 38, 38, 39, 40, 40, 42$ . Find the mode and explain why it is the useful measure here.

Count the frequencies: Size $38$ occurs three times, size $40$ occurs twice, and the rest occur once.
Identify the most frequent: The highest frequency is $3$ , for size $38$ , so the mode is size $38$ .
Why the mode: A shoe shop wants the most commonly bought size for stock, not an "average" size, so the mode is the meaningful measure for this categorical-style data.

Mean from a frequency table

A survey of $20$ households records the number of cars owned. Build the $fx$ column and find the mean number of cars.

Cars ( $x$ )	Frequency ( $f$ )	$fx$
$0$	$2$	$0$
$1$	$6$	$6$
$2$	$7$	$14$
$3$	$4$	$12$
$4$	$1$	$4$
Total	$\sum f = 20$	$\sum fx = 36$

Read the totals from the table. The total frequency is $\sum f = 2+6+7+4+1 = 20$ households, and the $fx$ column totals $\sum fx = 0+6+14+12+4 = 36$ cars.

Apply the formula.

$$\bar{x} = \frac{\sum fx}{\sum f} = \frac{36}{20} = 1.8$$

so the mean is $1.8$ cars per household. (The mode here is $2$ cars, the value with the highest frequency, $7$ .)
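The same table can be checked with a short Python sketch. The dictionary layout and the helper name `value_at` are assumptions made for this illustration. As an extra, it also finds the median by counting down the frequencies to the middle positions.

```python
# Cars-per-household table from the worked example: value x -> frequency f.
table = {0: 2, 1: 6, 2: 7, 3: 4, 4: 1}

sum_f = sum(table.values())                     # total frequency
sum_fx = sum(x * f for x, f in table.items())   # total of the fx column
mean_cars = sum_fx / sum_f

# Mode: the value with the highest frequency.
mode_cars = max(table, key=table.get)

def value_at(position):
    """Count down the frequencies until the given position (from 1) is reached."""
    running = 0
    for x in sorted(table):
        running += table[x]
        if running >= position:
            return x

# Median: the middle position for odd n, or the average of the two middle ones.
n = sum_f
if n % 2 == 1:
    median_cars = value_at((n + 1) // 2)
else:
    median_cars = (value_at(n // 2) + value_at(n // 2 + 1)) / 2

print(sum_f, sum_fx, mean_cars)   # 20 36 1.8
print(mode_cars, median_cars)     # 2 2.0
```

With $20$ households the middle positions are the $10$th and $11$th, and both fall in the row for $2$ cars, so the median is $2$.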

Mean from grouped data using class centres

The grouped table shows the marks of $40$ students in a test. Estimate the mean using class centres.

Mark	Class centre ( $x$ )	Frequency ( $f$ )	$fx$
$0\text{ to }20$	$10$	$3$	$30$
$20\text{ to }40$	$30$	$8$	$240$
$40\text{ to }60$	$50$	$14$	$700$
$60\text{ to }80$	$70$	$10$	$700$
$80\text{ to }100$	$90$	$5$	$450$
Total		$\sum f = 40$	$\sum fx = 2120$

Find each class centre. Each centre is the midpoint of the interval, for example $\tfrac{0+20}{2} = 10$ and $\tfrac{40+60}{2} = 50$ .

Build and total the $fx$ column using the class centre as $x$ : $\sum fx = 30 + 240 + 700 + 700 + 450 = 2120$ , with $\sum f = 40$ .

Divide for the estimate.

$$\bar{x} = \frac{\sum fx}{\sum f} = \frac{2120}{40} = 53$$

so the estimated mean mark is $53$ . It is an estimate because every student in a class is treated as if they scored the class centre.
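The class-centre method can be sketched in Python as follows. The tuple layout is an assumption made for this illustration; each class from the marks table is stored as its lower end, upper end and frequency.

```python
# (lower end, upper end, frequency) for each class in the marks table.
classes = [(0, 20, 3), (20, 40, 8), (40, 60, 14), (60, 80, 10), (80, 100, 5)]

sum_f = 0
sum_fx = 0
for lower, upper, f in classes:
    centre = (lower + upper) / 2   # class centre = midpoint of the interval
    sum_f += f
    sum_fx += f * centre           # the class centre stands in for x

estimated_mean = sum_fx / sum_f
print(sum_f, sum_fx, estimated_mean)   # 40 2120.0 53.0
```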

Final answers: the netball mean and median are both $17$ ; the shoe mode is size $38$ ; the mean number of cars is $1.8$ ; and the estimated mean mark is $53$ .

Common traps

Not ordering the data before finding the median: The median is the middle of the sorted list. Reading the middle of the unsorted list is the single most common error.
Mishandling an even number of values: With an even $n$ there is no single middle value; average the two middle ones. The median can then be a value not in the data.
Forgetting the $fx$ column: For a frequency table the mean is $\dfrac{\sum fx}{\sum f}$ , not the average of the $x$ values. Dividing $\sum fx$ by the number of rows instead of by $\sum f$ is a frequent slip.
Using the upper end of a class instead of the class centre: Grouped-data means use the midpoint of each interval. Using the upper (or lower) end skews every term.
Letting an outlier stand unquestioned: If one value is far from the rest, the mean is distorted; say so and prefer the median, rather than quoting the mean as "typical".
Calling a value the mode just because it looks central: The mode is the most frequent value, which need not be near the middle (and there may be none, or several).

Exam-style practice questions

Practice questions written in the style of NESA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

2021 HSC-style · 3 marks

The dot plot below records the number of siblings of $20$ students. The values are $0, 0, 0, 1, 1, 1, 1, 1, 2, 2, 2, 2, 3, 3, 3, 4, 4, 5, 6, 9$. (a) Find the mode. (b) Find the median. (c) Find the mean, correct to one decimal place.

Worked answer

Mode: $1$ (it occurs five times, more than any other value).

Median: with $20$ values, average the $10$ th and $11$ th ordered values, which are $2$ and $2$ , so the median is $2$ .

Mean: the total is $0(3)+1(5)+2(4)+3(3)+4(2)+5+6+9 = 0+5+8+9+8+5+6+9 = 50$ , and $50 \div 20 = 2.5$ .

Markers award one mark for each correct measure. A common error is reading the median off the dot plot as the tallest column ( $1$ , the mode) instead of locating the middle position. The mean is higher than the median here because of the values $6$ and $9$ in the right tail, a point worth a one-line comment if the question asks about skew.

2022 HSC-style · 4 marks

The grouped frequency table shows the masses, in kilograms, of $50$ parcels.

| Mass (kg) | $0\text{ to }4$ | $4\text{ to }8$ | $8\text{ to }12$ | $12\text{ to }16$ |
| Frequency | $14$ | $20$ | $11$ | $5$ |

(a) Write down the class centres. (b) Estimate the mean mass, correct to one decimal place. (c) Explain why your answer is only an estimate.

Worked answer

Class centres: $2, 6, 10, 14$ (each is the midpoint of its interval).

Mean: $\sum fx = 2(14)+6(20)+10(11)+14(5) = 28+120+110+70 = 328$ , and $\sum f = 50$ , so $\bar{x} = \frac{328}{50} = 6.56 \approx 6.6$ kg.

Why an estimate: grouping discards the exact values and replaces every parcel in a class with the class centre, so the calculation assumes the data is evenly spread within each interval, which is generally not exactly true.

Markers award a mark for the class centres, a mark for a correct $\sum fx$ , a mark for the division, and a mark for a clear reason that the class centre stands in for the real values. Using the upper end of each interval instead of the centre is the most common error and loses the accuracy marks.

2020 HSC-style · 3 marks

A real-estate agent says the mean sale price of the seven houses she sold last month was $\$1.2$ million, but the median sale price was only $\$680\,000$. (a) Explain how the mean can be so much larger than the median. (b) State which measure better represents a typical house price, with a reason.

Worked answer

Part (a): the data must contain one or more very expensive sales (high outliers) that lift the mean. The mean adds in the actual dollar value of every sale, so a single multi-million-dollar property pulls it well above the bulk of the data, whereas the median only depends on the middle sale's position and is barely affected.

Part (b): the median of $\$680\,000$ better represents a typical house price, because it is not distorted by the extreme high sales; the mean of $\$1.2$ million overstates what most buyers actually paid.

Markers reward the identification of outlier/skew as the cause and a justified choice of the median, using the idea that the mean is sensitive to extreme values while the median is resistant. A bare answer with no reference to outliers earns little.

Practice questions

Original practice questions graded from foundation to exam level, each with a full worked solution. Try them before revealing the solution.

foundation · 2 marks

For the data set $4, 7, 7, 9, 13$, find (a) the mean and (b) the median.

Worked solution

Part (a) - mean is the sum divided by how many. There are $5$ values, so add them and divide by $5$ :

$$\bar{x} = \frac{4 + 7 + 7 + 9 + 13}{5} = \frac{40}{5} = 8$$

so the mean is $8$ .

Part (b) - median is the middle of the ordered list. The data is already in order. With $5$ values the middle one is the $3$ rd: $4, 7, \mathbf{7}, 9, 13$ , so the median is $7$ . (Check: the median $7$ sits inside the data range $4$ to $13$ , as it must.)

foundation · 2 marks

For the data set $3, 5, 5, 6, 8, 9$, find (a) the mode and (b) the median.

Worked solution

Part (a) - mode is the most common value. The value $5$ appears twice and every other value appears once, so the mode is $5$ .

Part (b) - median with an even number of values is the average of the middle two. The list is already ordered and has $6$ values, so the two middle ones are the $3$ rd and $4$ th, namely $5$ and $6$ :

$$\text{median} = \frac{5 + 6}{2} = \frac{11}{2} = 5.5$$

so the median is $5.5$ . (Note the median need not be one of the data values when $n$ is even.)

foundation · 2 marks

The table shows the number of pets owned by $20$ students. Find the mean number of pets, correct to two decimal places.

| Pets ($x$) | $0$ | $1$ | $2$ | $3$ |
| Frequency ($f$) | $5$ | $8$ | $4$ | $3$ |

Worked solution

Build the $fx$ column. Multiply each value by its frequency:

$$0 \times 5 = 0, \quad 1 \times 8 = 8, \quad 2 \times 4 = 8, \quad 3 \times 3 = 9$$

Add the columns. The total frequency is $\sum f = 5 + 8 + 4 + 3 = 20$ and the total of the $fx$ column is

$$\sum fx = 0 + 8 + 8 + 9 = 25$$

Divide. The mean of a frequency table is

$$\bar{x} = \frac{\sum fx}{\sum f} = \frac{25}{20} = 1.25$$

so the mean is $1.25$ pets. (Check: $\sum f = 20$ matches the $20$ students given, so no row was missed.)

core · 4 marks

Seven small businesses report their annual profit, in thousands of dollars: $48, 52, 55, 55, 58, 61, 210$. (a) Find the mean. (b) Find the median. (c) Find the mode. (d) State, with a reason, which of the three is the most appropriate measure of the typical profit.

Worked solution

Part (a) - mean. Add the seven values and divide by $7$ :

$$\bar{x} = \frac{48 + 52 + 55 + 55 + 58 + 61 + 210}{7} = \frac{539}{7} = 77$$

so the mean profit is $\$77\,000$.

Part (b) - median: The list is ordered and has $7$ values, so the median is the $4$th value: $48, 52, 55, \mathbf{55}, 58, 61, 210$, giving a median of $\$55\,000$.
Part (c) - mode: The only repeated value is $55$, so the mode is $\$55\,000$.
Part (d) - most appropriate measure: The value $210$ is an outlier - it sits far above the others and drags the mean up to $77$, which is larger than six of the seven businesses actually earned. The median of $\$55\,000$ is the most appropriate measure of the typical profit, because it is not distorted by the one unusually high value. (Check: six of the seven profits are between $48$ and $61$, so a "typical" figure near $55$ is far more representative than $77$.)

core · 3 marks

Over a season a netball player records the number of goals she scores each game in a frequency table. Find the mean number of goals per game.

| Goals ($x$) | $0$ | $1$ | $2$ | $3$ | $4$ | $5$ |
| Frequency ($f$) | $3$ | $6$ | $8$ | $5$ | $2$ | $1$ |

Worked solution

Form the $fx$ column. Multiply each goal value by its frequency:

$$0\times3=0,\quad 1\times6=6,\quad 2\times8=16,\quad 3\times5=15,\quad 4\times2=8,\quad 5\times1=5$$

Total the two columns. The number of games is

$$\sum f = 3 + 6 + 8 + 5 + 2 + 1 = 25$$

and the total number of goals is

$$\sum fx = 0 + 6 + 16 + 15 + 8 + 5 = 50$$

Divide. The mean is

$$\bar{x} = \frac{\sum fx}{\sum f} = \frac{50}{25} = 2$$

so she averages $2$ goals per game. (Check: $\sum fx = 50$ is the total goals over $25$ games, and $50 \div 25 = 2$ is a sensible game average.)

core · 4 marks

The grouped frequency table shows the time, in minutes, that $40$ customers waited to be served. Using class centres, estimate the mean waiting time.

| Time (min) | $0\text{ to }10$ | $10\text{ to }20$ | $20\text{ to }30$ | $30\text{ to }40$ | $40\text{ to }50$ |
| Frequency ($f$) | $6$ | $10$ | $12$ | $8$ | $4$ |

Worked solution

Find each class centre. The class centre is the midpoint of the interval, the average of its lower and upper ends:

$$\tfrac{0+10}{2}=5,\quad \tfrac{10+20}{2}=15,\quad \tfrac{20+30}{2}=25,\quad \tfrac{30+40}{2}=35,\quad \tfrac{40+50}{2}=45$$

Build the $fx$ column using the class centre as $x$ :

$$5\times6=30,\ \ 15\times10=150,\ \ 25\times12=300,\ \ 35\times8=280,\ \ 45\times4=180$$

Total the columns.

$$\sum f = 6+10+12+8+4 = 40, \qquad \sum fx = 30+150+300+280+180 = 940$$

Divide. The estimated mean is

$$\bar{x} = \frac{\sum fx}{\sum f} = \frac{940}{40} = 23.5$$

so the mean waiting time is about $23.5$ minutes. (This is an estimate, because grouping replaces every value in a class with its centre. Check: $23.5$ falls in the $20\text{ to }30$ class, which holds the most customers, so it is sensible.)

exam · 6 marks

A cafe records the number of cups of coffee sold each hour during a $7$-hour morning shift: $12, 13, 15, 15, 16, 17, 17$. (a) Find the mean, median and mode. (b) The owner later realises one hour was mis-recorded: the busiest hour actually sold $50$ cups, not the $17$ that was the largest value. Replace that value with $50$ and recalculate the mean and the median. (c) Explain which measure changed more, and which measure better describes a typical hour's sales after the correction.

Worked solution

Part (a) - the three measures for the original data. Add the seven values:

$$\bar{x} = \frac{12 + 13 + 15 + 15 + 16 + 17 + 17}{7} = \frac{105}{7} = 15$$

The list is ordered with $7$ values, so the median is the $4$th value: $12, 13, 15, \mathbf{15}, 16, 17, 17$, giving a median of $15$. No single value is most common: $15$ and $17$ each appear twice, so the data is bimodal, with modes $15$ and $17$.

Part (b) - replace the largest $17$ with $50$ . The new data set is $12, 13, 15, 15, 16, 17, 50$ . The new mean is

$$\bar{x} = \frac{12 + 13 + 15 + 15 + 16 + 17 + 50}{7} = \frac{138}{7} \approx 19.71$$

The new ordered list is $12, 13, 15, \mathbf{15}, 16, 17, 50$ , so the median is still the $4$ th value, $15$ .

Part (c) - which changed, and which is better. The mean jumped from $15$ to about $19.71$ , a rise of nearly $5$ , while the median did not change at all (it stayed $15$ ). The mean changed far more because it adds in the actual size of every value, so one large outlier ( $50$ ) pulls it upward; the median only depends on the position of the middle value, so a single extreme value barely moves it. After the correction the median of $15$ better describes a typical hour, since six of the seven hours sold between $12$ and $17$ cups. (Check: the new mean $19.71$ is larger than $6$ of the $7$ data values, the tell-tale sign that an outlier has distorted it.)
