Inquiry Question 2: How do scientific claims become misinterpreted and how can scientific evidence be evaluated?
Distinguish correlation from causation, identifying confounding variables and the criteria for establishing causation
A focused answer to the HSC Investigating Science Module 7 dot point on correlation and causation. Covers the difference, the Bradford Hill criteria, named examples like smoking and lung cancer, and worked HSC past exam questions.
Have a quick question? Jump to the Q&A page
What this dot point is asking
NESA wants you to distinguish correlation from causation, identify confounding variables, and explain how science establishes causation when randomised trials are not possible. This dot point appears in 4-7 mark questions in every recent paper.
The answer
A correlation is a statistical association. Causation is a directed relationship where one variable produces changes in another. Mistaking correlation for causation is one of the most common errors in scientific reasoning and media reporting.
Correlation
A statistical association between two variables. As variable X changes, variable Y tends to change in a related way.
- Positive correlation. Both variables increase together. Ice cream sales and shark attacks (both higher in summer).
- Negative correlation. One variable increases as the other decreases. Coffee consumption and sleep duration.
- Zero correlation. No statistical association.
Correlation is measured by the correlation coefficient (r), ranging from -1 (perfect negative) through 0 (no correlation) to +1 (perfect positive).
Causation
A directed relationship where changes in X produce changes in Y. Three minimum conditions:
- X and Y are correlated.
- X precedes Y in time.
- No alternative explanation (confounder) accounts for the correlation.
Why correlation does not imply causation
Three common reasons a correlation can exist without causation.
- 1. Confounding
- A third variable causes both. Ice cream sales and drowning deaths are correlated because both are caused by hot weather. Eating ice cream does not cause drowning.
- 2. Reverse causation
- Y causes X, not X causes Y. People with depression sometimes use cannabis to self-medicate. A correlation between cannabis use and depression might reflect depression leading to cannabis use, rather than cannabis causing depression.
- 3. Chance
- Random fluctuations produce statistical associations in large datasets. With 100 random tests, on average 5 will appear "significant" at p < 0.05 by chance alone.
The Bradford Hill criteria
In 1965 the epidemiologist Sir Austin Bradford Hill proposed nine criteria for establishing causation from observational evidence. The criteria are not a checklist but a set of considerations to weigh.
- Strength of association. Strong associations (relative risk over 5) are less easily explained by confounders.
- Consistency. The association is reproduced across different populations, places and times.
- Specificity. The cause is associated with a specific outcome, not a wide range.
- Temporal sequence. The exposure precedes the outcome.
- Biological gradient (dose-response). More exposure produces more outcome.
- Plausibility. A biological mechanism is consistent with current scientific knowledge.
- Coherence. The relationship fits known facts about the natural history of the disease.
- Experimental evidence. Where possible, intervention reduces the outcome.
- Analogy. Similar cause-effect relationships exist elsewhere.
Worked example: smoking and lung cancer
In the 1950s, observational studies (Doll and Hill in the UK, Wynder and Graham in the US) found that smokers had much higher lung cancer rates than non-smokers. Tobacco companies and some scientists argued correlation was not causation.
Application of Bradford Hill criteria:
- Strength. Smokers had over 10 times the lung cancer risk of non-smokers.
- Consistency. Replicated in dozens of studies across many countries.
- Specificity. Smoking strongly linked to lung cancer specifically.
- Temporal sequence. Smoking decades before cancer onset.
- Dose-response. Pack-years strongly predicted risk.
- Plausibility. Tobacco smoke contains over 60 carcinogens that damage DNA.
- Coherence. Lung pathology consistent with chemical insult.
- Experimental. Animals exposed to tar developed tumours.
- Analogy. Other chemical carcinogens behaved similarly.
All nine criteria supported causation. The medical consensus shifted by the late 1960s. Australia's National Tobacco Strategy and the world-first plain-packaging legislation followed.
When randomised trials are not possible
For many important questions (smoking, climate, diet over decades), randomised trials are unethical or impractical. Causal inference relies on:
- Multiple independent observational studies.
- Biological mechanism.
- Dose-response evidence.
- Animal models and in vitro studies.
- Natural experiments. When policy changes (e.g. smoking bans) act like randomisation, the before-and-after comparison strengthens inference.
- Mendelian randomisation. Using genetic variants as natural randomisation for a risk factor (e.g. genetic variants for high cholesterol show that cholesterol causes heart disease).
Worked example: vaping and cardiovascular risk
In 2025, multiple Australian cohorts (45 and Up, Australian Longitudinal Study) reported associations between vaping and elevated heart rate and blood pressure in young adults. The TGA's review applied Bradford Hill criteria:
- Consistency moderate (multiple studies agree).
- Strength moderate (relative risk 1.5 to 2).
- Biological plausibility high (nicotine raises heart rate).
- Dose-response observed.
- Temporal sequence established in longitudinal studies.
- No RCT, but Mendelian randomisation supportive.
The conclusion is that vaping likely causes short-term cardiovascular changes, but long-term cancer or chronic disease causation requires longer observation. The case study illustrates how causal inference is built incrementally.
Spurious correlations
The website Spurious Correlations lists hundreds of statistically significant but absurd correlations:
- US cheese consumption per capita and number of people who died from being entangled in their bedsheets (r = 0.95).
- Maine divorce rate and margarine consumption (r = 0.99).
These illustrate that without mechanism, plausibility and the other Bradford Hill criteria, correlations alone are meaningless. They are useful pedagogical reminders.
Past exam questions, worked
Real questions from past NESA papers on this dot point, with our answer explainer.
2024 HSC5 marksDistinguish between correlation and causation, using examples to illustrate the difference.Show worked answer →
A 5-mark answer needs both definitions, the relationship, two examples and the criteria for establishing causation.
- Correlation
- A statistical association between two variables. As one changes, the other tends to change in a related way. Correlations can be positive (both increase together), negative (one increases as the other decreases) or zero.
- Causation
- A relationship where changes in one variable produce changes in the other. The cause precedes the effect, and changing the cause changes the effect.
- All causation produces correlation; not all correlation indicates causation
- Example of correlation without causation
- Ice cream sales and drowning deaths are both higher in summer. Both are correlated, but eating ice cream does not cause drowning. The confounder is temperature: hot weather increases both ice cream consumption and swimming.
- Example of correlation indicating causation
- Smoking and lung cancer. Initial correlation observed by Doll and Hill in the 1950s, mechanism (carcinogens damaging DNA), dose-response (more smoking, more cancer), animal experiments, and the temporal sequence (smoking precedes cancer onset) together establish causation.
- Criteria to establish causation
- Strength of association, dose-response, temporal sequence, biological plausibility, consistency across studies, experimental confirmation when possible. These are known as the Bradford Hill criteria.
Markers reward both definitions, the asymmetric relationship, an example for each and at least two Bradford Hill criteria.
2022 HSC4 marksExplain why establishing causation in observational studies is more difficult than in randomised controlled trials.Show worked answer →
A 4-mark answer needs the difference in study designs, the role of confounders and the consequence.
- Randomised controlled trials
- Participants are randomly assigned to treatment or control groups. Randomisation distributes confounders (known and unknown) evenly across groups, so any difference in outcome can be attributed to the treatment.
- Observational studies
- Participants choose their own exposures. Smokers differ from non-smokers in many ways: socioeconomic status, occupation, diet, exercise, stress. Any observed difference in outcome may be due to the exposure, the confounders or a mixture.
- The confounder problem
- A confounder is associated with both the exposure and the outcome. In observational data, statistical adjustment can reduce confounding but cannot eliminate it because some confounders are unknown or imperfectly measured.
- Consequence
- Observational studies can suggest causal hypotheses but cannot reliably establish them on their own. Multiple independent observational studies, biological mechanism, dose-response and (where possible) randomised trials are needed.
- When RCTs are not possible
- Ethics often prevent RCTs: we cannot randomly assign people to smoke. In such cases, the Bradford Hill criteria applied to observational data are the best route to causal inference.
Markers reward the design difference, the role of confounders, an explicit consequence and the recognition that RCTs are not always possible.
Related dot points
- Evaluate the validity, reliability and accuracy of scientific evidence presented in claims, considering the hierarchy of evidence in medical research
A focused answer to the HSC Investigating Science Module 7 dot point on evaluating evidence. Covers the hierarchy of evidence, what each level contributes, how to identify weak claims, and worked HSC past exam questions on medical and scientific reporting.
- Identify common logical fallacies and cognitive biases that distort scientific claims, including ad hominem, appeals to authority and confirmation bias
A focused answer to the HSC Investigating Science Module 7 dot point on logical fallacies and cognitive bias. Covers ad hominem, appeal to authority, false dichotomy, confirmation bias, the Dunning-Kruger effect, and worked HSC past exam questions.
- Investigate how scientific consensus is established and how it has been challenged, using climate change as a case study
A focused answer to the HSC Investigating Science Module 7 dot point on climate consensus. Covers how the IPCC consensus is built, the strength of evidence, common denial tactics, the role of funded misinformation, and worked HSC past exam questions.