Skip to main content
TASMathematics ApplicationsSyllabus dot point

How do we detect and describe association between two categorical variables?

Construct and interpret two-way frequency tables to investigate association between categorical variables.

Constructing two-way frequency tables, calculating row and column percentages, and judging association between categorical variables in TCE Mathematics Applications.

Generated by Claude Opus 4.77 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

What this dot point is asking

Bivariate data is not always numerical. When both variables are categorical, such as gender and preferred transport, you cannot draw a scatterplot or compute a correlation coefficient. Instead you organise the data in a two-way frequency table and look for patterns in the percentages.

Reading the table

The cells inside the table are the joint frequencies. The totals down the right edge and along the bottom are the marginal frequencies, and the single number in the bottom-right corner is the grand total. Always check that the row totals and the column totals each add up to the grand total before going further.

Judging association

To decide whether two categorical variables are associated, you compare the percentage breakdown of the response variable across each category of the explanatory variable. If those percentage breakdowns are noticeably different, the variables are associated. If they are about the same, there is no association.

Choosing the right percentages

The percentage direction must match the question. If sex is the explanatory variable and you want to know how drink preference depends on sex, calculate the percentages within each sex (so each sex column adds to 100 percent). Calculating the percentages the other way around answers a different question and usually hides the pattern you were asked about.

Describing the strength

You are not expected to run a formal significance test in this course. Instead, describe the association in words and quantify the gap. Phrases like a higher proportion of, compared with, and a difference of so many percentage points show the marker you have read the table correctly. A larger gap between the comparison groups suggests a stronger association.

When you report, state which variable you treated as explanatory, quote the comparison percentages, and finish with a plain-language conclusion about whether an association exists.

A complete answer converts the relevant cells to percentages on a common base, compares them across the explanatory categories, and concludes clearly about association rather than restating the original counts.

Exam-style practice questions

Practice questions written in the style of TASC exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

2024 TASC General Mathematics5 marksTable 2 shows the results from two Australian territories (ACT and NT) in a national vote, with Yes/No counts and totals (ACT: Yes 176000, No 111000; NT: Yes 43000, No 65000). a) Complete a percentage table. b) Complete a bar chart to represent your percentages. c) What is meant by the statement: there appears to be some association between these two variables?
Show worked answer →

a) (1 mark) Convert each count to a percentage of its own column (territory) total. ACT: Yes = 176000/287000 = 61.3%, No = 111000/287000 = 38.7%. NT: Yes = 43000/108000 = 39.8%, No = 65000/108000 = 60.2%. Each column totals 100%.

b) (2 marks) Draw a segmented or side-by-side bar chart with one bar per territory, heights showing the Yes and No percentages, the y-axis clearly labelled as a percentage and scaled correctly.

c) (2 marks) Association means the distribution of one variable (the vote) changes depending on the value of the other variable (the territory). Here the Yes percentage is much higher in the ACT (61.3%) than in the NT (39.8%), so knowing the territory changes the likely vote. Because the column percentages differ noticeably, the two categorical variables appear to be associated rather than independent.