How do you organise raw data into a frequency table, group it into class intervals with class centres, and build a cumulative frequency column?
Organise, interpret and display data into appropriate tabular and graphical representations including frequency distribution tables, both ungrouped and grouped using class intervals and class centres, and cumulative frequency
A focused answer to the HSC Maths Standard 2 dot point on frequency tables. Tallying raw data into a frequency table, grouping data into class intervals, finding the class centre, and building the cumulative frequency column, with worked Australian examples and the totals checked so the cumulative frequency ends at the sample size.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
What this dot point is asking
NESA wants you to take a list of raw data and organise it into a frequency table. For a small spread of values you tally each value and record how often it occurs. For data that is spread over a wide range you first group it into class intervals and work with the class centre (the middle value of each interval) as the representative figure for the class. On top of either kind of table you add a cumulative frequency column, a running total that answers "how many values are this size or smaller". None of the arithmetic is hard. The marks are won and lost on three habits: tallying without missing or double-counting a value, choosing sensible class intervals, and getting the cumulative column to end exactly at the sample size . Get those right and every table in the Data Analysis module becomes routine, because the histogram, the polygon and the ogive that follow are all just pictures of these columns.
The answer
A frequency table has one row per value (or per class) and a frequency column that counts how often each appears. The first habit is reliable tallying: go through the raw data once, put one stroke in the right row for each value, group the strokes in fives (four strokes with a fifth struck through them, a "gate"), then count the strokes to get each frequency. Always finish by adding the frequency column and checking it equals the number of data values , because a total that is off by one is the signature of a missed or double-counted value.
Building a frequency table with tallies (ungrouped data)
When the data takes only a few distinct values - shoe sizes, goals per game, number of pets - you do not need to group anything. List each distinct value in its own row from smallest to largest, tally the raw data into those rows, and write the frequency beside each. The tally column is not decoration: tallying in one careful pass is what stops you losing or repeating a value, and the gate-of-five grouping makes the strokes quick to count.
For example, suppose players report shoe sizes and the tallies come out as below. The frequency is just the number of strokes, and the column total confirms all players are accounted for.
| Shoe size | Tally | Frequency |
|---|---|---|
The frequencies add to , which matches the number of players, so nothing was missed.
Grouping raw data into class intervals
When the data is spread over a wide range - test marks out of , masses in grams, commute times - a row for every distinct value would be unreadable. Instead you group the data into class intervals (also called classes), each covering an equal-width band of values, and count how many data values fall in each band.
Two choices set up a good grouped table:
- Number of classes. Aim for roughly to classes. Too few hides the shape; too many leaves the table almost as long as the raw list.
- Class width. Pick a "nice" equal width (often , or ) so every value lands in exactly one class. Equal widths matter: they keep the class centres evenly spaced and make the later histogram fair.
Set the classes so they do not overlap and leave no gaps, for example , , and so on. Then tally each raw value into the one class that contains it.
The class centre
Once data is grouped you no longer know each exact value, only the class it sits in. The class centre is the value used to stand in for everything in that class: it is the midpoint of the interval, found by averaging the lower and upper limit.
So the interval has class centre , and the interval has class centre . Because the classes have equal width, the class centres are evenly spaced by exactly the class width, which is a fast way to check them. The diagram below shows what the class centre means on a number line.
The cumulative frequency column
The cumulative frequency of a row is the total of that row's frequency and all the frequencies above it - a running total down the table. It answers "how many data values are at most this size". You build it by carrying a running sum: start with the first frequency, then add each next frequency to the total so far.
The single most important check in this whole topic: the last cumulative frequency must equal , the number of data values, because by the bottom row you have counted everything. If it does not, a frequency is wrong or has been added twice.
Cumulative frequency is the bridge to the rest of the module. Reading "how many scored or less" off the cumulative column is exactly what the cumulative frequency graph (the ogive) does with a curve, and it is how the median and quartiles get located later.
Putting it together: raw to grouped to cumulative
The whole method is one table grown in three steps. Take the raw data, tally it into class intervals to get the frequency column, then run a total down the side to get the cumulative column, adding a class-centre column so the table is ready for the mean and the graphs that follow. The worked set below does exactly this, and every frequency, class centre and cumulative total has been checked so the cumulative column ends at .
How exam questions ask about frequency tables
The wording varies, but each version points straight at one of the columns:
- "Construct / complete a frequency table" with a tally column means tally the raw data into rows and count, then check the total equals .
- "Group the data into the class intervals ..." or "using a class width of ..." means build a grouped table: tally each value into its class and (usually) add a class-centre column.
- "Find the class centre of ..." is just the average of the two class limits.
- "State the modal class" asks for the class with the highest frequency (the grouped version of the mode).
- "Complete the cumulative frequency column" means run a total down the frequency column; expect a check that it ends at .
- "How many scored at most / less than / no more than ..." is a cumulative-frequency read. "At most" and "no more than" include the boundary value; "less than" stops below it - read the wording carefully.
- "Find the value of the missing frequency" uses the total: subtract the known frequencies from .
- "Estimate the mean" from a grouped table means use the class centres in ; the word "estimate" is the clue that grouping has hidden the exact values.
Exam-style practice questions
Practice questions written in the style of NESA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
2021 HSC-style4 marksThe number of children per household in a survey of households gave the frequency table below, where one frequency is missing.
| Children | Frequency |
| --- | --- |
| | |
| | |
| | |
| | |
| | |
(a) Find the value of . (b) Construct a cumulative frequency column. (c) Find the number of households with fewer than children.Show worked answer →
Part (a): the frequencies sum to the households, so . One mark for .
Part (b): cumulative frequencies are the running totals . One mark, and markers check the final cumulative frequency equals .
Part (c): "fewer than children" means , or , the cumulative frequency at the row, which is households. One mark for the answer and one for correctly reading the cumulative column (not including the -children row).
Markers award the value of , a correct cumulative column ending at , and the correct cumulative read for "fewer than ".
2022 HSC-style5 marksThe examination marks of students are grouped into the class intervals , , , , , with frequencies and . (a) State the class centre of the interval. (b) State the modal class. (c) Construct a cumulative frequency column. (d) A student passes by scoring more than . Find how many students passed.Show worked answer →
Part (a): the class centre is . One mark.
Part (b): the modal class is because it has the highest frequency, . One mark.
Part (c): the cumulative frequencies are , ending at the total . One mark, with the final value checked against .
Part (d): "more than " means the and classes, so students passed. Equivalently, the cumulative frequency up to is , and . One mark for the method, one for .
Markers reward the class centre, the modal class identified by frequency, a cumulative column ending at , and the correct "more than " count.
2023 HSC-style3 marksA grouped frequency table records the masses of parcels in classes of width kg starting at . The frequencies of the first four classes are and . (a) State the upper class limit and the class centre of the second class (). (b) Show that these four classes account for all parcels and write the cumulative frequency of the fourth class.Show worked answer →
Part (a): the second class is , so its upper class limit is and its class centre is . One mark for both.
Part (b): the four frequencies sum to , the total number of parcels, so the four classes account for every parcel. The cumulative frequency of the fourth class is this running total, . One mark for the sum equalling and one for stating the cumulative frequency is (the total).
Markers reward the correct upper limit and class centre, the demonstration that the frequencies total , and the cumulative frequency stated as .
Practice questions
Original practice questions graded from foundation to exam level, each with a full worked solution. Try them before revealing the solution.
foundation2 marksA frequency table records the number of pets owned by each of students. The frequencies for and pets are and . (a) Confirm the frequencies add to the number of students. (b) State how many students own at least pets.Show worked solution →
Part (a) - add the frequency column. The frequency column should total the number of students surveyed:
This matches the students, so the table is complete (every student is counted exactly once).
Part (b) - "at least " means or more. Add the frequencies for , and pets:
so students own at least pets. (Sanity check: is less than the total , as it must be, and the students with or pet make up the rest, since and .)
foundation3 marksTwenty netball players are asked their shoe size. The results are . Construct a frequency table with a tally column for the shoe sizes.Show worked solution →
Set up the rows. List each distinct shoe size from smallest to largest in its own row: .
Tally each value, then count. Work through the list once, adding one stroke per value (grouping in fives with a gate stroke), then write the frequency as the number of strokes:
| Shoe size | Tally | Frequency |
|---|---|---|
Check the total. The frequency column must add to the players:
which matches, so no value has been missed or double-counted.
foundation2 marksA grouped frequency table uses the class intervals , , and . (a) Find the class centre of each interval. (b) State which interval the value falls into.Show worked solution →
Part (a) - the class centre is the average of the two endpoints. Add the lower and upper limit of each interval and halve:
so the class centres are and .
Part (b) - locate the value. Since , the value falls in the interval . (Note the class centres are evenly spaced apart, which is the class width - a quick check that they are right.)
core4 marksThe heights of tomato seedlings, in centimetres, are recorded: . Group the data into the class intervals , , and , and build a grouped frequency table with a class-centre column and a frequency column.Show worked solution →
Decide the rows from the given intervals. The four classes are , , and , each wide.
Tally each value into its class, then count the frequency. Going through the list once and dropping each height into the interval that contains it gives the frequencies below. The class centre of each interval is the average of its endpoints, for example .
| Height (cm) | Class centre | Frequency |
|---|---|---|
Check the total. The frequencies must add to the seedlings:
which matches, so every seedling has been placed in exactly one class. (The class centres rise by , the class width, confirming they are evenly spaced.)
core4 marksThe number of goals scored by a hockey team in matches gave the frequencies below. Add a cumulative frequency column, and use it to find how many matches the team scored at most goals in.
| Goals | Frequency |
| --- | --- |
| | |
| | |
| | |
| | |
| | |
| | |Show worked solution →
Build the cumulative frequency by running totals. Each cumulative frequency is the sum of all frequencies up to and including that row, so add each new frequency to the running total:
| Goals | Frequency | Cumulative frequency |
|---|---|---|
Check the final cumulative total. The last cumulative frequency must equal the number of matches, , and it does (), so the column is correct.
Read off "at most goals". "At most " means , or goals, which is exactly the cumulative frequency in the row:
so the team scored at most goals in matches.
exam6 marksThe daily commuting times, in minutes, of workers are recorded: . (a) Group the data into class intervals , , , , and build a grouped frequency table with class-centre, frequency and cumulative frequency columns. (b) State the modal class. (c) Find how many workers commute less than minutes. (d) Estimate the mean commuting time using the class centres.Show worked solution →
Part (a) - tally into classes, then add the class-centre and cumulative columns. Each value is placed in the one class that contains it; the class centre is the average of the endpoints (for example ); and each cumulative frequency is the running total of the frequencies.
| Time (min) | Class centre | Frequency | Cumulative frequency |
|---|---|---|---|
The frequencies add to , and the final cumulative frequency is , so the table is consistent.
Part (b) - modal class. The highest frequency is , so the modal class is minutes.
Part (c) - "less than minutes". Times under fall in the classes , and , which is the cumulative frequency at the row:
so workers commute less than minutes.
Part (d) - estimate the mean from class centres. With grouped data the mean is estimated by treating every value in a class as its class centre, so multiply each class centre by its frequency, add, and divide by :
so the estimated mean commuting time is about minutes. (It is an estimate, not the exact mean, because the class centres stand in for the real values once the data is grouped.)
exam5 marksThe masses, in grams, of apples are grouped into the class intervals , , , and , giving frequencies and . (a) Find the class centre of each interval. (b) Build a cumulative frequency column. (c) An apple is graded 'large' if its mass is g or more. Estimate how many of the apples are large, and express this as a percentage of the sample.Show worked solution →
Part (a) - class centres. The class centre is the average of the lower and upper limit of each interval:
so the class centres are and grams.
Part (b) - cumulative frequency column. Add the frequencies as a running total:
| Mass (g) | Class centre | Frequency | Cumulative frequency |
|---|---|---|---|
The final cumulative frequency is , confirming the column.
Part (c) - "large" apples are g or more. These are the two top classes, and :
As a percentage of the apples:
so about apples (roughly ) are large. (Cross-check with the cumulative column: apples are under g, and are g or more, which agrees.)
Related dot points
- Display and interpret numerical data using dot plots and stem-and-leaf plots, including back-to-back stem-and-leaf plots, and describe the clusters, gaps, outliers and shape of the data
A focused answer to the HSC Maths Standard 2 dot point on dot plots and stem-and-leaf plots. How to construct and read each display, how to build a back-to-back stem-and-leaf plot to compare two groups, and how to describe clusters, gaps, outliers and the shape of a distribution, with worked Australian examples.
- Construct and interpret frequency histograms and polygons, and cumulative frequency graphs (ogives), including using an ogive to estimate the median and quartiles of a data set
A focused answer to the HSC Maths Standard 2 dot point on frequency graphs. Drawing a frequency histogram with a polygon overlaid through the bar-top midpoints, building a cumulative frequency graph (the ogive) from the running totals, and reading the median, the quartiles and the interquartile range off the ogive, with a fully worked grouped-marks example.
- Calculate measures of central tendency, including the mean, median and mode, for both raw data and data presented in a frequency table
A focused answer to the HSC Maths Standard 2 dot point on the mean, median and mode. Finding all three from a raw list, the mean and mode from a frequency table, the mean from grouped data using class centres, and choosing the most appropriate measure when the data is skewed or has an outlier, with worked Australian examples.
- Calculate measures of spread, including the range, quartiles and interquartile range, and the population standard deviation using technology
A focused answer to the HSC Maths Standard 2 dot point on measures of spread. The range, the quartiles and interquartile range, the five-number summary, the population standard deviation from a calculator, and how to compare the spread of two data sets, with worked Australian examples.