How do you display grouped data as a frequency histogram and polygon, and read the median and quartiles off a cumulative frequency graph (ogive)?
Construct and interpret frequency histograms and polygons, and cumulative frequency graphs (ogives), including using an ogive to estimate the median and quartiles of a data set
A focused answer to the HSC Maths Standard 2 dot point on frequency graphs. Drawing a frequency histogram with a polygon overlaid through the bar-top midpoints, building a cumulative frequency graph (the ogive) from the running totals, and reading the median, the quartiles and the interquartile range off the ogive, with a fully worked grouped-marks example.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
What this dot point is asking
NESA wants you to turn a grouped frequency table into a picture and then read summary numbers back off that picture. There are three displays to master. The frequency histogram is a column graph of grouped numerical data, with the bars touching. The frequency polygon is a line drawn over the histogram through the top-of-bar midpoints. The cumulative frequency graph, called the ogive, plots the running totals and lets you estimate the median and the quartiles by reading off the graph. Almost every Data Analysis paper expects you to either construct one of these or read values from one, so the two skills that earn marks are drawing each graph in the right place and reading the median and quartiles off an ogive at the correct heights.
The answer
Start from a grouped frequency table: classes, their frequencies, and a cumulative (running-total) column. The histogram and polygon are drawn from the frequencies; the ogive is drawn from the cumulative frequencies. Get the table right first and all three graphs follow.
The frequency histogram
A frequency histogram is a column graph for grouped numerical data. Two features separate it from an ordinary bar chart:
- the bars touch with no gaps, because the horizontal axis is a continuous number line (the classes join end to end), and
- the height of each bar is the frequency of that class.
By convention each bar is centred on its class centre (the middle value of the interval), so a class from to has its bar centred at . The vertical axis is frequency and starts at .
The frequency polygon
A frequency polygon is a line graph drawn over the same data. You join the midpoint of the top of each bar with straight line segments. That midpoint sits at the class centre across and the frequency up, so each vertex is the point . To finish the polygon you bring each end down to the horizontal axis at the next class centre beyond the data, where the frequency is , so the line closes onto the axis. A polygon is handy for comparing the shapes of two distributions on the one set of axes without the bars getting in the way.
The diagram below shows the histogram for the worked data set (the test marks of students) with the polygon drawn over it. Notice every polygon vertex lands exactly on the middle of a bar top.
The cumulative frequency graph (the ogive)
The cumulative frequency of a class is the running total: how many values fall at or below the upper end of that class. The ogive is the graph of cumulative frequency (vertical axis) against the upper class boundary (horizontal axis), with the points joined by a smooth rising curve. Three things are always true of an ogive:
- it only ever rises (a running total cannot go down),
- it starts at the lower boundary of the first class at a cumulative frequency of , and
- it ends at the top boundary of the last class at a cumulative frequency equal to , the total number of values.
Its typical stretched-S shape - shallow at the bottom, steep through the busy middle classes, shallow again at the top - is exactly what the running totals produce.
Reading the median and quartiles off the ogive
This is the highest-value skill on the page. Because the ogive shows "how many values are at or below each mark", you can find any quantile by reading across from the right height, then down to the axis. For values:
- median: read across from a cumulative frequency of ,
- lower quartile : read across from ,
- upper quartile : read across from .
Hit the curve, drop straight down, and read the value on the horizontal axis. The interquartile range is then . The diagram below does this for the marks: the dashed lines read across from , and to give , a median and .
How exam questions ask about frequency graphs
The wording tells you which graph and which skill is being tested:
- "Construct / draw a frequency histogram" wants touching bars centred on the class centres, heights equal to the frequencies, axes labelled (frequency up, the variable across).
- "Draw a frequency polygon" wants the line through the bar-top midpoints, , closed onto the axis at the empty class centres on each side.
- "Construct a cumulative frequency graph / ogive" wants cumulative frequency plotted against the upper class boundary, joined by a smooth rising curve from up to .
- "Use the ogive to estimate the median / quartiles" wants you to read across from , or and drop to the axis; an answer within a small reading tolerance scores.
- "How many values are less than / at most ?" is a direct read up from on the axis to the curve and across to the cumulative frequency.
- "Find the interquartile range from the graph" means read off and first, then subtract.
Exam-style practice questions
Practice questions written in the style of NESA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
2021 HSC-style4 marksThe marks of students in a test are grouped into classes, giving cumulative frequencies of , , , , , at the upper class boundaries , , , , , . By drawing a cumulative frequency graph (ogive), estimate the median and the interquartile range of the marks.Show worked answer →
Plot cumulative frequency (vertical) against the upper class boundary (horizontal): the points , , , , , , joined by a smooth rising curve, starting from .
Median: read across from a cumulative frequency of , then down to the axis, giving a median of about marks.
Quartiles: read across from for and from for , so the interquartile range is about marks.
Markers reward correct plotting (cumulative frequency against the upper boundary), reading across from , and , and an answer within a small reading tolerance of the true values. Reading from a frequency (not cumulative frequency) loses marks.
2023 HSC-style3 marksA frequency table groups the heights of plants into classes with centres , , , cm and frequencies , , , . (a) State the heights and frequencies of the points used to draw the frequency polygon. (b) Explain how the polygon should be closed onto the horizontal axis.Show worked answer →
Part (a): the polygon joins the top-of-bar midpoints, at (class centre, frequency): , , , .
Part (b): the polygon is closed by joining each end down to the horizontal axis at the next class centre beyond the data with frequency , that is to on the left and on the right, so the polygon meets the axis.
Markers reward the four (centre, frequency) coordinates and the idea that the ends are taken down to zero at the adjacent empty class centres. Plotting at the class boundaries instead of the class centres is the common error.
Practice questions
Original practice questions graded from foundation to exam level, each with a full worked solution. Try them before revealing the solution.
foundation2 marksA frequency histogram is drawn for the grouped data with class centres , , , and frequencies , , , . (a) How wide is the gap between neighbouring bars? (b) State the height of the tallest bar.Show worked solution →
Part (a) - the gaps in a frequency histogram. A frequency histogram displays grouped numerical data, so the bars are drawn touching with no gaps between them. The gap is therefore .
Part (b) - the tallest bar. Bar height equals the class frequency, so the tallest bar is the one with the largest frequency:
so the tallest bar has height . (Check: the bars sit at the class centres , , , and are each wide, so they meet edge to edge, confirming the zero gap.)
foundation3 marksA set of grouped data has the frequencies below. Class to : ; class to : ; class to : ; class to : . Build the cumulative frequency column (the running total).Show worked solution →
- Add a running total down the table
- The cumulative frequency of a class is the number of values at or below the upper end of that class, so you keep a running sum:
- First class
- .
- Up to
- .
- Up to
- .
- Up to
- .
So the cumulative frequencies are , , , . (Check: the final cumulative frequency must equal the total number of values, and , which matches the last running total.)
foundation2 marksAn ogive is plotted for a data set of values. (a) What cumulative frequency do you read across from to estimate the median? (b) What cumulative frequency gives the lower quartile ?Show worked solution →
Part (a) - the median position. The median sits at the halfway point of the data, so you read across from a cumulative frequency of
then drop down to the horizontal axis to read the median value.
Part (b) - the lower quartile position. The lower quartile is one quarter of the way up, so read across from
then drop to the axis for . (Check: would use , which sits above the median position , as it should.)
core4 marksThe grouped data below records the time (in minutes) commuters spent waiting for a train. Class to : frequency ; class to : frequency ; class to : frequency ; class to : frequency . (a) Find the cumulative frequencies. (b) State the cumulative frequencies you would read across from to find the median, and on an ogive.Show worked solution →
Part (a) - cumulative frequencies. Keep a running total down the classes:
so the cumulative frequencies are , , , .
Part (b) - the read-off heights. With values:
- median at ,
- lower quartile at ,
- upper quartile at .
So you read across the ogive from cumulative frequencies , and respectively, then drop to the time axis. (Check: the final cumulative frequency equals the number of commuters, confirming the running total is complete.)
core4 marksA frequency histogram is to be drawn for marks grouped into classes with centres , , , , and frequencies , , , , . (a) State where the leftmost and rightmost bars are centred. (b) A frequency polygon is overlaid. Give the coordinates of the polygon vertices that sit on top of the bars.Show worked solution →
Part (a) - the end bars. Each bar is centred on its class centre, so the leftmost bar is centred at a mark of and the rightmost bar at a mark of .
Part (b) - the polygon vertices on the bars. A frequency polygon joins the midpoint of the top of each bar, and the midpoint of a bar's top sits at the class centre (horizontal) and the frequency (vertical). So the vertices are
reading (class centre, frequency) off each bar in turn. (To close the polygon onto the axis you would also add a point at the next empty class centre on each side, and , both at frequency .)
exam6 marksThe grouped frequency table below shows the test marks (out of ) of students. Class to : frequency ; class to : frequency ; class to : frequency ; class to : frequency ; class to : frequency ; class to : frequency . (a) Find the cumulative frequency for each class. (b) Using the cumulative frequencies, estimate the median mark from an ogive. (c) Estimate the lower and upper quartiles. (d) Hence find the interquartile range.Show worked solution →
Part (a) - cumulative frequencies. Run a total down the upper class boundaries ():
so the cumulative frequencies are , , , , , . (The last value equals , the number of students, as a check.)
Part (b) - the median. With , read across from a cumulative frequency of . On the ogive this lands inside the to class, where the cumulative frequency climbs from (at ) to (at ). Reading down to the mark axis gives a median of about
so the median is approximately marks.
Part (c) - the quartiles. The lower quartile is at , which sits in the to class (cumulative frequency rising to ):
The upper quartile is at , in the to class (cumulative frequency rising to ):
so and marks.
Part (d) - the interquartile range. The IQR is the spread of the middle half:
so the interquartile range is about marks. (Check: , that is , exactly as the order of the three quartiles requires.)
exam5 marksAn ogive is drawn for the daily rainfall (in mm) recorded on days. Reading off the graph gives a median of mm, a lower quartile of mm and an upper quartile of mm. (a) Find the interquartile range. (b) Estimate how many days had a rainfall of mm or less. (c) Estimate how many days had a rainfall greater than mm. (d) Estimate the number of days with rainfall between mm and mm.Show worked solution →
Part (a) - interquartile range. The IQR is the upper quartile minus the lower quartile:
Part (b) - days at or below mm. The lower quartile mm is the value with one quarter of the data at or below it, so
had mm or less (read off the ogive, an estimate, so rounding to a whole day is expected).
Part (c) - days above mm. The upper quartile mm has three quarters of the data at or below it, so one quarter lies above it:
had more than mm.
Part (d) - days between the quartiles. Between and lies the middle half of the data:
had rainfall between mm and mm. (Check: the three regions , which rounds back to within the rounding of half-days, confirming the split is consistent.)
Related dot points
- Organise, interpret and display data into appropriate tabular and graphical representations including frequency distribution tables, both ungrouped and grouped using class intervals and class centres, and cumulative frequency
A focused answer to the HSC Maths Standard 2 dot point on frequency tables. Tallying raw data into a frequency table, grouping data into class intervals, finding the class centre, and building the cumulative frequency column, with worked Australian examples and the totals checked so the cumulative frequency ends at the sample size.
- Display categorical and numerical data using a range of statistical graphs, including column graphs, sector graphs, line graphs, divided bar graphs and Pareto charts, and interpret the displays
A focused answer to the HSC Maths Standard 2 dot point on statistical graphs. Column and bar graphs, sector (pie) graphs with each angle computed as a fraction of 360 degrees, line graphs, divided bar graphs, and the Pareto chart with bars sorted descending and a cumulative percentage line for the 80/20 read, with worked Australian examples.
- Calculate measures of central tendency, including the mean, median and mode, for both raw data and data presented in a frequency table
A focused answer to the HSC Maths Standard 2 dot point on the mean, median and mode. Finding all three from a raw list, the mean and mode from a frequency table, the mean from grouped data using class centres, and choosing the most appropriate measure when the data is skewed or has an outlier, with worked Australian examples.
- Calculate measures of spread, including the range, quartiles and interquartile range, and the population standard deviation using technology
A focused answer to the HSC Maths Standard 2 dot point on measures of spread. The range, the quartiles and interquartile range, the five-number summary, the population standard deviation from a calculator, and how to compare the spread of two data sets, with worked Australian examples.
- Construct and interpret box-and-whisker plots and use them, including parallel (side-by-side) box plots, to compare data sets in terms of centre, spread, skewness and outliers
A focused answer to the HSC Maths Standard 2 dot point on box-and-whisker plots. Building a box plot from the five-number summary, flagging an outlier with the 1.5 times IQR rule, drawing parallel box plots, and comparing two groups by centre, spread, skew and outliers, with worked Australian examples.