Skip to main content
NSWMaths Standard 2Syllabus dot point

How do you display grouped data as a frequency histogram and polygon, and read the median and quartiles off a cumulative frequency graph (ogive)?

Construct and interpret frequency histograms and polygons, and cumulative frequency graphs (ogives), including using an ogive to estimate the median and quartiles of a data set

A focused answer to the HSC Maths Standard 2 dot point on frequency graphs. Drawing a frequency histogram with a polygon overlaid through the bar-top midpoints, building a cumulative frequency graph (the ogive) from the running totals, and reading the median, the quartiles and the interquartile range off the ogive, with a fully worked grouped-marks example.

Generated by Claude Opus 4.815 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

What this dot point is asking

NESA wants you to turn a grouped frequency table into a picture and then read summary numbers back off that picture. There are three displays to master. The frequency histogram is a column graph of grouped numerical data, with the bars touching. The frequency polygon is a line drawn over the histogram through the top-of-bar midpoints. The cumulative frequency graph, called the ogive, plots the running totals and lets you estimate the median and the quartiles by reading off the graph. Almost every Data Analysis paper expects you to either construct one of these or read values from one, so the two skills that earn marks are drawing each graph in the right place and reading the median and quartiles off an ogive at the correct heights.

The answer

Start from a grouped frequency table: classes, their frequencies, and a cumulative (running-total) column. The histogram and polygon are drawn from the frequencies; the ogive is drawn from the cumulative frequencies. Get the table right first and all three graphs follow.

The frequency histogram

A frequency histogram is a column graph for grouped numerical data. Two features separate it from an ordinary bar chart:

  • the bars touch with no gaps, because the horizontal axis is a continuous number line (the classes join end to end), and
  • the height of each bar is the frequency of that class.

By convention each bar is centred on its class centre (the middle value of the interval), so a class from 6060 to 7070 has its bar centred at 6565. The vertical axis is frequency and starts at 00.

The frequency polygon

A frequency polygon is a line graph drawn over the same data. You join the midpoint of the top of each bar with straight line segments. That midpoint sits at the class centre across and the frequency up, so each vertex is the point (class centre,frequency)(\text{class centre}, \text{frequency}). To finish the polygon you bring each end down to the horizontal axis at the next class centre beyond the data, where the frequency is 00, so the line closes onto the axis. A polygon is handy for comparing the shapes of two distributions on the one set of axes without the bars getting in the way.

The diagram below shows the histogram for the worked data set (the test marks of 4040 students) with the polygon drawn over it. Notice every polygon vertex lands exactly on the middle of a bar top.

Frequency histogram of test marks with a frequency polygon overlaidSix touching bars for the mark classes thirty to ninety, with heights two, four, eight, twelve, eight and six. A frequency polygon is drawn over the bars, joining the midpoint of the top of each bar, and is brought down to the horizontal axis at the empty class centres twenty-five and ninety-five on each side.Polygon joins the midpoint of each bar topmarkfrequency02468101230405060708090

The cumulative frequency graph (the ogive)

The cumulative frequency of a class is the running total: how many values fall at or below the upper end of that class. The ogive is the graph of cumulative frequency (vertical axis) against the upper class boundary (horizontal axis), with the points joined by a smooth rising curve. Three things are always true of an ogive:

  • it only ever rises (a running total cannot go down),
  • it starts at the lower boundary of the first class at a cumulative frequency of 00, and
  • it ends at the top boundary of the last class at a cumulative frequency equal to nn, the total number of values.

Its typical stretched-S shape - shallow at the bottom, steep through the busy middle classes, shallow again at the top - is exactly what the running totals produce.

Reading the median and quartiles off the ogive

This is the highest-value skill on the page. Because the ogive shows "how many values are at or below each mark", you can find any quantile by reading across from the right height, then down to the axis. For nn values:

  • median: read across from a cumulative frequency of n2\dfrac{n}{2},
  • lower quartile Q1Q_1: read across from n4\dfrac{n}{4},
  • upper quartile Q3Q_3: read across from 3n4\dfrac{3n}{4}.

Hit the curve, drop straight down, and read the value on the horizontal axis. The interquartile range is then IQR=Q3Q1\text{IQR} = Q_3 - Q_1. The diagram below does this for the 4040 marks: the dashed lines read across from 1010, 2020 and 3030 to give Q155Q_1 \approx 55, a median 65\approx 65 and Q375Q_3 \approx 75.

Ogive of the test marks with the median and quartiles read offA cumulative frequency curve rising from mark thirty at zero to mark ninety at forty. Dashed lines read across from cumulative frequencies ten, twenty and thirty to the curve and down to the mark axis, giving a lower quartile of about fifty-five, a median of about sixty-five and an upper quartile of about seventy-five.Read across, then down to the mark axismarkcum. freq.051015202530354030405060708090Q₁medQ₃

How exam questions ask about frequency graphs

The wording tells you which graph and which skill is being tested:

  • "Construct / draw a frequency histogram" wants touching bars centred on the class centres, heights equal to the frequencies, axes labelled (frequency up, the variable across).
  • "Draw a frequency polygon" wants the line through the bar-top midpoints, (class centre,frequency)(\text{class centre}, \text{frequency}), closed onto the axis at the empty class centres on each side.
  • "Construct a cumulative frequency graph / ogive" wants cumulative frequency plotted against the upper class boundary, joined by a smooth rising curve from 00 up to nn.
  • "Use the ogive to estimate the median / quartiles" wants you to read across from n2\tfrac{n}{2}, n4\tfrac{n}{4} or 3n4\tfrac{3n}{4} and drop to the axis; an answer within a small reading tolerance scores.
  • "How many values are less than / at most kk?" is a direct read up from kk on the axis to the curve and across to the cumulative frequency.
  • "Find the interquartile range from the graph" means read off Q1Q_1 and Q3Q_3 first, then subtract.

Exam-style practice questions

Practice questions written in the style of NESA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

2021 HSC-style4 marksThe marks of 4040 students in a test are grouped into classes, giving cumulative frequencies of 22, 66, 1414, 2626, 3434, 4040 at the upper class boundaries 4040, 5050, 6060, 7070, 8080, 9090. By drawing a cumulative frequency graph (ogive), estimate the median and the interquartile range of the marks.
Show worked answer →

Plot cumulative frequency (vertical) against the upper class boundary (horizontal): the points (40,2)(40, 2), (50,6)(50, 6), (60,14)(60, 14), (70,26)(70, 26), (80,34)(80, 34), (90,40)(90, 40), joined by a smooth rising curve, starting from (30,0)(30, 0).

Median: read across from a cumulative frequency of n2=20\tfrac{n}{2} = 20, then down to the axis, giving a median of about 6565 marks.

Quartiles: read across from n4=10\tfrac{n}{4} = 10 for Q155Q_1 \approx 55 and from 3n4=30\tfrac{3n}{4} = 30 for Q375Q_3 \approx 75, so the interquartile range is about 7555=2075 - 55 = 20 marks.

Markers reward correct plotting (cumulative frequency against the upper boundary), reading across from 2020, 1010 and 3030, and an answer within a small reading tolerance of the true values. Reading from a frequency (not cumulative frequency) loses marks.

2023 HSC-style3 marksA frequency table groups the heights of 3030 plants into classes with centres 1010, 2020, 3030, 4040 cm and frequencies 44, 1111, 99, 66. (a) State the heights and frequencies of the points used to draw the frequency polygon. (b) Explain how the polygon should be closed onto the horizontal axis.
Show worked answer →

Part (a): the polygon joins the top-of-bar midpoints, at (class centre, frequency): (10,4)(10, 4), (20,11)(20, 11), (30,9)(30, 9), (40,6)(40, 6).

Part (b): the polygon is closed by joining each end down to the horizontal axis at the next class centre beyond the data with frequency 00, that is to (0,0)(0, 0) on the left and (50,0)(50, 0) on the right, so the polygon meets the axis.

Markers reward the four (centre, frequency) coordinates and the idea that the ends are taken down to zero at the adjacent empty class centres. Plotting at the class boundaries instead of the class centres is the common error.

Practice questions

Original practice questions graded from foundation to exam level, each with a full worked solution. Try them before revealing the solution.

foundation2 marksA frequency histogram is drawn for the grouped data with class centres 55, 1010, 1515, 2020 and frequencies 33, 77, 66, 44. (a) How wide is the gap between neighbouring bars? (b) State the height of the tallest bar.
Show worked solution →

Part (a) - the gaps in a frequency histogram. A frequency histogram displays grouped numerical data, so the bars are drawn touching with no gaps between them. The gap is therefore 00.

Part (b) - the tallest bar. Bar height equals the class frequency, so the tallest bar is the one with the largest frequency:

max(3,7,6,4)=7\max(3, 7, 6, 4) = 7

so the tallest bar has height 77. (Check: the bars sit at the class centres 55, 1010, 1515, 2020 and are each 55 wide, so they meet edge to edge, confirming the zero gap.)

foundation3 marksA set of grouped data has the frequencies below. Class 00 to 1010: 44; class 1010 to 2020: 99; class 2020 to 3030: 77; class 3030 to 4040: 55. Build the cumulative frequency column (the running total).
Show worked solution →
Add a running total down the table
The cumulative frequency of a class is the number of values at or below the upper end of that class, so you keep a running sum:
First class
44.
Up to 2020
4+9=134 + 9 = 13.
Up to 3030
13+7=2013 + 7 = 20.
Up to 4040
20+5=2520 + 5 = 25.

So the cumulative frequencies are 44, 1313, 2020, 2525. (Check: the final cumulative frequency must equal the total number of values, and 4+9+7+5=254 + 9 + 7 + 5 = 25, which matches the last running total.)

foundation2 marksAn ogive is plotted for a data set of 6060 values. (a) What cumulative frequency do you read across from to estimate the median? (b) What cumulative frequency gives the lower quartile Q1Q_1?
Show worked solution →

Part (a) - the median position. The median sits at the halfway point of the data, so you read across from a cumulative frequency of

n2=602=30\frac{n}{2} = \frac{60}{2} = 30

then drop down to the horizontal axis to read the median value.

Part (b) - the lower quartile position. The lower quartile is one quarter of the way up, so read across from

n4=604=15\frac{n}{4} = \frac{60}{4} = 15

then drop to the axis for Q1Q_1. (Check: Q3Q_3 would use 3n4=45\tfrac{3n}{4} = 45, which sits above the median position 3030, as it should.)

core4 marksThe grouped data below records the time (in minutes) 4040 commuters spent waiting for a train. Class 00 to 55: frequency 66; class 55 to 1010: frequency 1414; class 1010 to 1515: frequency 1212; class 1515 to 2020: frequency 88. (a) Find the cumulative frequencies. (b) State the cumulative frequencies you would read across from to find the median, Q1Q_1 and Q3Q_3 on an ogive.
Show worked solution →

Part (a) - cumulative frequencies. Keep a running total down the classes:

6,6+14=20,20+12=32,32+8=406, \quad 6 + 14 = 20, \quad 20 + 12 = 32, \quad 32 + 8 = 40

so the cumulative frequencies are 66, 2020, 3232, 4040.

Part (b) - the read-off heights. With n=40n = 40 values:

  • median at n2=402=20\dfrac{n}{2} = \dfrac{40}{2} = 20,
  • lower quartile at n4=404=10\dfrac{n}{4} = \dfrac{40}{4} = 10,
  • upper quartile at 3n4=3×404=30\dfrac{3n}{4} = \dfrac{3 \times 40}{4} = 30.

So you read across the ogive from cumulative frequencies 2020, 1010 and 3030 respectively, then drop to the time axis. (Check: the final cumulative frequency 4040 equals the number of commuters, confirming the running total is complete.)

core4 marksA frequency histogram is to be drawn for marks grouped into classes with centres 4545, 5555, 6565, 7575, 8585 and frequencies 33, 99, 1414, 88, 66. (a) State where the leftmost and rightmost bars are centred. (b) A frequency polygon is overlaid. Give the coordinates of the polygon vertices that sit on top of the bars.
Show worked solution →

Part (a) - the end bars. Each bar is centred on its class centre, so the leftmost bar is centred at a mark of 4545 and the rightmost bar at a mark of 8585.

Part (b) - the polygon vertices on the bars. A frequency polygon joins the midpoint of the top of each bar, and the midpoint of a bar's top sits at the class centre (horizontal) and the frequency (vertical). So the vertices are

(45,3), (55,9), (65,14), (75,8), (85,6)(45, 3), \ (55, 9), \ (65, 14), \ (75, 8), \ (85, 6)

reading (class centre, frequency) off each bar in turn. (To close the polygon onto the axis you would also add a point at the next empty class centre on each side, (35,0)(35, 0) and (95,0)(95, 0), both at frequency 00.)

exam6 marksThe grouped frequency table below shows the test marks (out of 100100) of 4040 students. Class 3030 to 4040: frequency 22; class 4040 to 5050: frequency 44; class 5050 to 6060: frequency 88; class 6060 to 7070: frequency 1212; class 7070 to 8080: frequency 88; class 8080 to 9090: frequency 66. (a) Find the cumulative frequency for each class. (b) Using the cumulative frequencies, estimate the median mark from an ogive. (c) Estimate the lower and upper quartiles. (d) Hence find the interquartile range.
Show worked solution →

Part (a) - cumulative frequencies. Run a total down the upper class boundaries (40,50,60,70,80,9040, 50, 60, 70, 80, 90):

2,2+4=6,6+8=14,14+12=26,26+8=34,34+6=402, \quad 2 + 4 = 6, \quad 6 + 8 = 14, \quad 14 + 12 = 26, \quad 26 + 8 = 34, \quad 34 + 6 = 40

so the cumulative frequencies are 22, 66, 1414, 2626, 3434, 4040. (The last value equals n=40n = 40, the number of students, as a check.)

Part (b) - the median. With n=40n = 40, read across from a cumulative frequency of n2=20\tfrac{n}{2} = 20. On the ogive this lands inside the 6060 to 7070 class, where the cumulative frequency climbs from 1414 (at 6060) to 2626 (at 7070). Reading down to the mark axis gives a median of about

60+201412×10=60+612×10=6560 + \frac{20 - 14}{12} \times 10 = 60 + \frac{6}{12} \times 10 = 65

so the median is approximately 6565 marks.

Part (c) - the quartiles. The lower quartile is at n4=10\tfrac{n}{4} = 10, which sits in the 5050 to 6060 class (cumulative frequency 66 rising to 1414):

Q150+1068×10=50+5=55Q_1 \approx 50 + \frac{10 - 6}{8} \times 10 = 50 + 5 = 55

The upper quartile is at 3n4=30\tfrac{3n}{4} = 30, in the 7070 to 8080 class (cumulative frequency 2626 rising to 3434):

Q370+30268×10=70+5=75Q_3 \approx 70 + \frac{30 - 26}{8} \times 10 = 70 + 5 = 75

so Q155Q_1 \approx 55 and Q375Q_3 \approx 75 marks.

Part (d) - the interquartile range. The IQR is the spread of the middle half:

IQR=Q3Q1=7555=20\text{IQR} = Q_3 - Q_1 = 75 - 55 = 20

so the interquartile range is about 2020 marks. (Check: Q1<median<Q3Q_1 < \text{median} < Q_3, that is 55<65<7555 < 65 < 75, exactly as the order of the three quartiles requires.)

exam5 marksAn ogive is drawn for the daily rainfall (in mm) recorded on 5050 days. Reading off the graph gives a median of 1212 mm, a lower quartile of 77 mm and an upper quartile of 1919 mm. (a) Find the interquartile range. (b) Estimate how many days had a rainfall of 77 mm or less. (c) Estimate how many days had a rainfall greater than 1919 mm. (d) Estimate the number of days with rainfall between 77 mm and 1919 mm.
Show worked solution →

Part (a) - interquartile range. The IQR is the upper quartile minus the lower quartile:

IQR=Q3Q1=197=12 mm\text{IQR} = Q_3 - Q_1 = 19 - 7 = 12 \text{ mm}

Part (b) - days at or below 77 mm. The lower quartile Q1=7Q_1 = 7 mm is the value with one quarter of the data at or below it, so

14×50=12.513 days\frac{1}{4} \times 50 = 12.5 \approx 13 \text{ days}

had 77 mm or less (read off the ogive, an estimate, so rounding to a whole day is expected).

Part (c) - days above 1919 mm. The upper quartile Q3=19Q_3 = 19 mm has three quarters of the data at or below it, so one quarter lies above it:

14×50=12.513 days\frac{1}{4} \times 50 = 12.5 \approx 13 \text{ days}

had more than 1919 mm.

Part (d) - days between the quartiles. Between Q1Q_1 and Q3Q_3 lies the middle half of the data:

12×50=25 days\frac{1}{2} \times 50 = 25 \text{ days}

had rainfall between 77 mm and 1919 mm. (Check: the three regions 13+25+13=5113 + 25 + 13 = 51, which rounds back to 5050 within the rounding of half-days, confirming the split is consistent.)

Related dot points