What is 1. Confounding?

A third variable causes both. Ice cream sales and drowning deaths are correlated because both are caused by hot weather. Eating ice cream does not cause drowning.

What is 2. Reverse causation?

Y causes X, not X causes Y. People with depression sometimes use cannabis to self-medicate. A correlation between cannabis use and depression might reflect depression leading to cannabis use, rather than cannabis causing depression.

Random fluctuations produce statistical associations in large datasets. With 100 random tests, on average 5 will appear "significant" at p < 0.05 by chance alone.

§-Syllabus dot point

NSWInvestigating ScienceSyllabus dot point

Inquiry Question 2: How do scientific claims become misinterpreted and how can scientific evidence be evaluated?

Distinguish correlation from causation, identifying confounding variables and the criteria for establishing causation

A focused answer to the HSC Investigating Science Module 7 dot point on correlation and causation. Covers the difference, the Bradford Hill criteria, named examples like smoking and lung cancer, and worked HSC past exam questions.

Generated by Claude Opus 4.87 min answerUpdated 2026-06-22

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section

What this dot point is asking
The answer
Examples in context
Try this

What this dot point is asking

NESA wants you to distinguish correlation from causation, identify confounding variables, and explain how science establishes causation when randomised trials are not possible. This dot point appears in 4-7 mark questions in every recent paper.

The answer

A correlation is a statistical association. Causation is a directed relationship where one variable produces changes in another. Mistaking correlation for causation is one of the most common errors in scientific reasoning and media reporting.

Correlation

A statistical association between two variables. As variable X changes, variable Y tends to change in a related way.

Positive correlation. Both variables increase together. Ice cream sales and shark attacks (both higher in summer).
Negative correlation. One variable increases as the other decreases. Coffee consumption and sleep duration.
Zero correlation. No statistical association.

Correlation is measured by the correlation coefficient (r), ranging from -1 (perfect negative) through 0 (no correlation) to +1 (perfect positive).

Causation

A directed relationship where changes in X produce changes in Y. Three minimum conditions:

X and Y are correlated.
X precedes Y in time.
No alternative explanation (confounder) accounts for the correlation.

Why correlation does not imply causation

Three common reasons a correlation can exist without causation.

1. Confounding: A third variable causes both. Ice cream sales and drowning deaths are correlated because both are caused by hot weather. Eating ice cream does not cause drowning.
2. Reverse causation: Y causes X, not X causes Y. People with depression sometimes use cannabis to self-medicate. A correlation between cannabis use and depression might reflect depression leading to cannabis use, rather than cannabis causing depression.
3. Chance: Random fluctuations produce statistical associations in large datasets. With 100 random tests, on average 5 will appear "significant" at p < 0.05 by chance alone.

The Bradford Hill criteria

In 1965 the epidemiologist Sir Austin Bradford Hill proposed nine criteria for establishing causation from observational evidence. The criteria are not a checklist but a set of considerations to weigh.

Strength of association. Strong associations (relative risk over 5) are less easily explained by confounders.
Consistency. The association is reproduced across different populations, places and times.
Specificity. The cause is associated with a specific outcome, not a wide range.
Temporal sequence. The exposure precedes the outcome.
Biological gradient (dose-response). More exposure produces more outcome.
Plausibility. A biological mechanism is consistent with current scientific knowledge.
Coherence. The relationship fits known facts about the natural history of the disease.
Experimental evidence. Where possible, intervention reduces the outcome.
Analogy. Similar cause-effect relationships exist elsewhere.

Worked example: smoking and lung cancer

In the 1950s, observational studies (Doll and Hill in the UK, Wynder and Graham in the US) found that smokers had much higher lung cancer rates than non-smokers. Tobacco companies and some scientists argued correlation was not causation.

Application of Bradford Hill criteria:

Strength. Smokers had over 10 times the lung cancer risk of non-smokers.
Consistency. Replicated in dozens of studies across many countries.
Specificity. Smoking strongly linked to lung cancer specifically.
Temporal sequence. Smoking decades before cancer onset.
Dose-response. Pack-years strongly predicted risk.
Plausibility. Tobacco smoke contains over 60 carcinogens that damage DNA.
Coherence. Lung pathology consistent with chemical insult.
Experimental. Animals exposed to tar developed tumours.
Analogy. Other chemical carcinogens behaved similarly.

All nine criteria supported causation. The medical consensus shifted by the late 1960s. Australia's National Tobacco Strategy and the world-first plain-packaging legislation followed.

When randomised trials are not possible

For many important questions (smoking, climate, diet over decades), randomised trials are unethical or impractical. Causal inference relies on:

Multiple independent observational studies.
Biological mechanism.
Dose-response evidence.
Animal models and in vitro studies.
Natural experiments. When policy changes (e.g. smoking bans) act like randomisation, the before-and-after comparison strengthens inference.
Mendelian randomisation. Using genetic variants as natural randomisation for a risk factor (e.g. genetic variants for high cholesterol show that cholesterol causes heart disease).

Worked example: vaping and cardiovascular risk

In 2025, multiple Australian cohorts (45 and Up, Australian Longitudinal Study) reported associations between vaping and elevated heart rate and blood pressure in young adults. The TGA's review applied Bradford Hill criteria:

Consistency moderate (multiple studies agree).
Strength moderate (relative risk 1.5 to 2).
Biological plausibility high (nicotine raises heart rate).
Dose-response observed.
Temporal sequence established in longitudinal studies.
No RCT, but Mendelian randomisation supportive.

The conclusion is that vaping likely causes short-term cardiovascular changes, but long-term cancer or chronic disease causation requires longer observation. The case study illustrates how causal inference is built incrementally.

Spurious correlations

The website Spurious Correlations lists hundreds of statistically significant but absurd correlations:

US cheese consumption per capita and number of people who died from being entangled in their bedsheets (r = 0.95).
Maine divorce rate and margarine consumption (r = 0.99).

These illustrate that without mechanism, plausibility and the other Bradford Hill criteria, correlations alone are meaningless. They are useful pedagogical reminders.

Worked example

Apply correlation versus causation reasoning to: "Vaccinated children have more allergies than unvaccinated children, so vaccines cause allergies."

Is there a correlation? Some observational studies report this; others do not.
Is there reverse causation? Possible: families with autoimmune or allergy history may vaccinate more proactively.
Are there confounders? Many. Vaccinated families differ in socioeconomic status, healthcare access, diet, hygiene practices and breastfeeding rates. The hygiene hypothesis suggests these confounders explain allergy patterns.
Biological plausibility? Low. Vaccines do not contain allergens commonly involved in childhood allergies.
Dose-response? Not observed.
Independent replication? Mixed; well-controlled studies show no causal link.

The claim does not survive scrutiny: the correlation is real in some datasets but is explained by confounders, lacks mechanism, and lacks dose-response. This is the standard pattern of mistaking correlation for causation.

Examples in context

Example 1. Sun exposure, vitamin D and melanoma risk. Australian epidemiological data shows a population-level correlation: regions with more sun exposure (Queensland, northern NSW) have higher melanoma rates. The simple inference "more sun causes more melanoma" is partly true but oversimplifies. Bradford Hill analysis shows strength of association is real, temporal sequence holds (exposure precedes diagnosis), dose-response is observed (cumulative UV exposure correlates with risk), plausible mechanism exists (UV DNA damage). However, individual-level confounders include skin type (fair-skinned migrants from northern Europe contribute disproportionately to Australian melanoma rates), childhood sunburn frequency and recreational vs occupational sun exposure. The Cancer Council Australia uses the full Bradford Hill framework to communicate that sun causes melanoma but only in a probabilistic, dose-response way.

Example 2. Fluoride in NSW water and dental caries reduction. Town water fluoridation in NSW began in the 1960s. A clear correlation emerged between fluoridation and reduced childhood caries rates, with non-fluoridated regional towns showing 30 to 50 per cent higher caries rates than fluoridated ones. To establish causation rather than correlation, NHMRC and Cancer Council reviews applied Bradford Hill criteria: strength (large effect), consistency (replicated worldwide), temporal sequence (caries fell after fluoridation began), dose-response (optimal at about 1 ppm), plausible mechanism (fluorapatite formation in enamel), coherence with biochemistry. Confounders such as diet and dental-service availability were controlled by comparing matched towns. Australia's evidence-based policy was that fluoridation causes caries reduction, supporting expansion to over 90 per cent of Australians by the 2010s.

Try this

Q1. Distinguish correlation from causation with one example of a correlation that is not causal. [3 marks]

Cue. Ice-cream sales correlate with drownings; both driven by hot weather (confounder), not causal.

Q2. A study reports that NSW students who eat breakfast have higher HSC marks. Outline two confounders and one method to control for them. [3 marks]

Cue. Confounders: family socioeconomic status, sleep quality, school resources. Control via multiple regression or stratified analysis; ideally a randomised intervention trial of free breakfasts.

Q3. A vaccine-injury claim notes that some children develop autism diagnoses within months of MMR vaccination. (a) Identify the logical fallacy. (b) Explain the correct interpretation given that autism is usually diagnosed at the same age as MMR is given. (c) Identify one large study that has refuted the claim. [2+2+2 marks]

Cue. (a) Post hoc ergo propter hoc. (b) Temporal coincidence due to typical age of diagnosis, not causation. (c) Madsen et al 2002 (537,303 Danish children); subsequent meta-analyses of over 1 million children.

Exam-style practice questions

Practice questions written in the style of NESA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

2024 HSC5 marksDistinguish between correlation and causation, using examples to illustrate the difference.

Show worked answer →

A 5-mark answer needs both definitions, the relationship, two examples and the criteria for establishing causation.

Correlation: A statistical association between two variables. As one changes, the other tends to change in a related way. Correlations can be positive (both increase together), negative (one increases as the other decreases) or zero.
Causation: A relationship where changes in one variable produce changes in the other. The cause precedes the effect, and changing the cause changes the effect.
All causation produces correlation; not all correlation indicates causation
Example of correlation without causation: Ice cream sales and drowning deaths are both higher in summer. Both are correlated, but eating ice cream does not cause drowning. The confounder is temperature: hot weather increases both ice cream consumption and swimming.
Example of correlation indicating causation: Smoking and lung cancer. Initial correlation observed by Doll and Hill in the 1950s, mechanism (carcinogens damaging DNA), dose-response (more smoking, more cancer), animal experiments, and the temporal sequence (smoking precedes cancer onset) together establish causation.
Criteria to establish causation: Strength of association, dose-response, temporal sequence, biological plausibility, consistency across studies, experimental confirmation when possible. These are known as the Bradford Hill criteria.

Markers reward both definitions, the asymmetric relationship, an example for each and at least two Bradford Hill criteria.

2022 HSC4 marksExplain why establishing causation in observational studies is more difficult than in randomised controlled trials.

Show worked answer →

A 4-mark answer needs the difference in study designs, the role of confounders and the consequence.

Randomised controlled trials: Participants are randomly assigned to treatment or control groups. Randomisation distributes confounders (known and unknown) evenly across groups, so any difference in outcome can be attributed to the treatment.
Observational studies: Participants choose their own exposures. Smokers differ from non-smokers in many ways: socioeconomic status, occupation, diet, exercise, stress. Any observed difference in outcome may be due to the exposure, the confounders or a mixture.
The confounder problem: A confounder is associated with both the exposure and the outcome. In observational data, statistical adjustment can reduce confounding but cannot eliminate it because some confounders are unknown or imperfectly measured.
Consequence: Observational studies can suggest causal hypotheses but cannot reliably establish them on their own. Multiple independent observational studies, biological mechanism, dose-response and (where possible) randomised trials are needed.
When RCTs are not possible: Ethics often prevent RCTs: we cannot randomly assign people to smoke. In such cases, the Bradford Hill criteria applied to observational data are the best route to causal inference.

Markers reward the design difference, the role of confounders, an explicit consequence and the recognition that RCTs are not always possible.