What are study designs?

Cross-sectional study. Measures prevalence and risk factors in a population at a single point in time. Useful for snapshots but cannot establish temporal sequence.

New cases per population per time. Formula: incidence rate = new casespopulation at risk × time. Reported per 100 000 per year.

Existing cases at a point in time. Formula: prevalence = existing casestotal population. Reported as a percentage.

Deaths per population per time. Crude mortality counts all deaths; cause-specific mortality counts deaths from a specific disease. Reported per 100 000 per year.

What is case fatality rate?

Deaths divided by diagnosed cases. Measures how deadly a disease is once contracted.

Total illness in a population, including non-fatal disease burden (often measured as DALYs, disability-adjusted life years).

What is cross-sectional study?

Measures prevalence and risk factors in a population at a single point in time. Useful for snapshots but cannot establish temporal sequence.

What is cohort study?

Follows a group of healthy people forward in time, recording exposures and waiting for disease to develop. Strong for establishing temporal sequence and calculating incidence and relative risk. Example: the Framingham Heart Study (1948 onwards) identified cholesterol, smoking and hypertension as cardiovascular risk factors.

What is case-control study?

Compares people with the disease (cases) to matched people without (controls), looking backward at exposures. Efficient for rare diseases. Vulnerable to recall and selection bias.

What is randomised controlled trial?

Participants are randomly assigned to intervention or control groups. The gold standard for testing whether an intervention causes an outcome. Used for treatment trials, less often for risk factor studies (cannot ethically assign people to smoke).

What is ecological study?

Compares disease rates across populations (e.g. fluoride in water versus dental caries). Cannot make individual-level claims (ecological fallacy).

NESA 2020 HSC-style practice: [An 11-year study of 58 406 young adults related drinking-water arsenic exposure to mortality; survival graphs are shown for three exposure bands ( 223 µg/L) in males and females.] The hypothesis was that exposure to arsenic in drinking water increases mortality in young adults. Discuss the data presented in the graphs in relation to this hypothesis.

Marks come from making **points for and/or against the hypothesis** and **relating each to the data**. **Support for the hypothesis:** - In **both sexes, increasing arsenic dose led to decreased survival**, suggesting arsenic causes the decline; the dose–response was clearest in **males**. - Survival **declined progressively over the 11 years**, consistent with cumulative exposure reducing survival. **Reference the control/qualifications:** - The ** 90 µg/L gave a **similar** decrease, hinting at **other interacting factors** (e.g. nutrition, genes). - Note the magnitude caveat: despite the large sample, **survival only dropped by ~0.1% or less**, so the effect, while present, is small.

NSWBiologySyllabus dot point

Inquiry Question 2: Do non-infectious diseases cause more deaths than infectious diseases?

Collect and represent data from secondary sources to evaluate the method used in an example of an epidemiological study, including incidence, prevalence, mortality, and the methods and benefits of epidemiology

A focused answer to the HSC Biology Module 8 dot point on epidemiology. Defines incidence, prevalence and mortality, compares cohort, case-control and cross-sectional study designs, and applies them to the Doll and Hill lung cancer studies.

Generated by Claude Opus 4.810 min answerUpdated 2026-06-21

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section

What this dot point is asking
The answer
Examples in context
Try this

What this dot point is asking

NESA wants you to define the core epidemiological measures, describe the main study designs, evaluate a real epidemiological study, and explain how epidemiology informs public health.

The answer

Epidemiology is the study of the distribution, causes and control of disease in populations. It uses observational and experimental study designs to identify risk factors, estimate disease burden, and evaluate interventions.

Core measures

Incidence: New cases per population per time. Formula: $\text{incidence rate} = \frac{\text{new cases}}{\text{population at risk} \times \text{time}}$ . Reported per 100 000 per year. Tracks how fast a disease is emerging.
Prevalence: Existing cases at a point in time. Formula: $\text{prevalence} = \frac{\text{existing cases}}{\text{total population}}$ . Reported as a percentage. Tracks total disease burden.
Mortality: Deaths per population per time. Crude mortality counts all deaths; cause-specific mortality counts deaths from a specific disease. Reported per 100 000 per year. Tracks lethality.
Case fatality rate: Deaths divided by diagnosed cases. Measures how deadly a disease is once contracted.
Morbidity: Total illness in a population, including non-fatal disease burden (often measured as DALYs, disability-adjusted life years).

Study designs

Cross-sectional study: Measures prevalence and risk factors in a population at a single point in time. Useful for snapshots but cannot establish temporal sequence.
Cohort study (prospective): Follows a group of healthy people forward in time, recording exposures and waiting for disease to develop. Strong for establishing temporal sequence and calculating incidence and relative risk. Example: the Framingham Heart Study (1948 onwards) identified cholesterol, smoking and hypertension as cardiovascular risk factors.
Case-control study (retrospective): Compares people with the disease (cases) to matched people without (controls), looking backward at exposures. Efficient for rare diseases. Vulnerable to recall and selection bias.
Randomised controlled trial (RCT): Participants are randomly assigned to intervention or control groups. The gold standard for testing whether an intervention causes an outcome. Used for treatment trials, less often for risk factor studies (cannot ethically assign people to smoke).
Ecological study: Compares disease rates across populations (e.g. fluoride in water versus dental caries). Cannot make individual-level claims (ecological fallacy).

Worked example: Doll and Hill and lung cancer

In 1950, Richard Doll and Austin Bradford Hill published a case-control study of 1298 patients in London hospitals. Cases were lung cancer patients; controls were matched patients without lung cancer. Smoking history was recorded by interview.

Result. Smokers had a much higher rate of lung cancer than non-smokers, with a dose-response gradient: more cigarettes per day, higher cancer risk.

Follow-up. The British Doctors Study (1951 onwards) followed 40 000 male doctors prospectively for over 50 years. It confirmed:

Lung cancer mortality 25 times higher in heavy smokers than non-smokers.
Half of long-term smokers die from a smoking-related disease.
Quitting at any age reduces risk.

Bradford Hill criteria. Hill later proposed nine criteria for inferring causation from observation: strength of association, consistency, specificity, temporality, biological gradient, plausibility, coherence, experiment and analogy. Smoking and lung cancer satisfied all nine.

Impact. The studies led to public health warnings, advertising restrictions, taxation, plain-packaging laws (in Australia from 2012), and a roughly two-thirds reduction in adult smoking rates in developed countries.

Benefits of epidemiology

Identifies causes. Smoking and lung cancer, asbestos and mesothelioma, HPV and cervical cancer.
Targets prevention. Identifies high-risk groups for screening (e.g. women over 50 for breast cancer).
Evaluates interventions. Did the cervical cancer vaccine reduce incidence? (Yes, by over 50 percent in vaccinated cohorts.)
Tracks emerging disease. Surveillance systems detect new outbreaks early (COVID-19, HIV).
Allocates resources. Prevalence data informs hospital capacity, drug stockpiles and staffing.

Limitations of epidemiology

Cannot prove causation in observational studies. Only RCTs can do that directly; observational studies use Bradford Hill criteria.
Confounding. Hidden variables may explain associations.
Bias. Selection bias, recall bias, reporting bias.
Generalisability. A study in one population may not apply elsewhere.

Examples in context

Example 1. NSW BreastScreen and population-level mortality reduction. BreastScreen NSW invites women aged 50 to 74 for free biennial mammographic screening, with participation reaching about 55 percent of eligible women. Epidemiological evaluation by the Cancer Institute NSW compares age-adjusted breast cancer mortality between regular screeners and non-screeners, controlling for confounders such as socioeconomic status. Modelled data show a roughly 25 percent reduction in breast cancer mortality among regularly screened women. This is a population-level intervention evaluation using cohort study design, and it illustrates how prevalence (women living with breast cancer), incidence (new diagnoses) and mortality (deaths) move differently when screening is introduced: incidence rises initially due to detection, then mortality falls.

Example 2. Mater Mothers' Hospital and the COVID-19 maternal cohort study. During 2020-2022, the Mater Mothers' Hospital in Brisbane led an Australian prospective cohort study following 8500 pregnant women, tracking SARS-CoV-2 infection, vaccination and obstetric outcomes. Results published in The Lancet Regional Health showed vaccinated women had no increase in adverse pregnancy outcomes, while unvaccinated infected women had a 4.5-fold higher risk of preterm birth. The cohort design (following exposed and unexposed groups forward in time) was essential to establish temporal causation, which a cross-sectional snapshot could not have shown. The findings directly informed Royal Australian and New Zealand College of Obstetricians and Gynaecologists guidelines for COVID-19 vaccination in pregnancy.

Try this

Q1. Define incidence, prevalence and mortality, and explain how each is calculated for a population. [3 marks]

Cue. Incidence = new cases per population per time. Prevalence = total cases at one point in time per population. Mortality = deaths from the disease per population per time.

Q2. In a NSW town of 50 000 people, 200 new cases of type 2 diabetes were diagnosed in 2025 and 3200 people were living with diabetes at year end. Calculate (a) the annual incidence rate per 100 000, (b) the point prevalence per 1000. [3 marks]

Cue. (a) 200 / 50000 = 400 per 100 000 per year. (b) 3200 / 50000 = 64 per 1000 (6.4 percent).

Q3. Compare cohort and case-control study designs in epidemiology. (a) Describe each design. (b) Identify one advantage and one disadvantage of each. (c) Justify which is more suitable for studying a rare disease such as mesothelioma. [2+3+2 marks]

Cue. (a) Cohort: follow exposed/unexposed forward; case-control: compare diseased and non-diseased retrospectively. (b) Cohort allows incidence calculation but is slow and expensive; case-control is fast but susceptible to recall bias. (c) Case-control better for rare disease due to efficiency.

Exam-style practice questions

Practice questions written in the style of NESA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

2025 HSC7 marks[A population lives across regions A, B and C; A and B are linked by a road bridge while C is isolated. A graph shows the risk (%) of developing an environmental disease according to age at exposure (10, 20, 30 years) over up to 55 years after exposure.] Design an epidemiological study that could be used to produce the results shown in the graph. Justify the features of your design.

Show worked answer →

Band-marked: top marks design a study producing most features of the graph AND justify features with reference to the stimulus.

Sample/cohort: survey individuals from age groups 10, 20 and 30 (matching the graph's exposure ages) across areas A, B and C, followed over ~55 years (the graph's time span).
Use of a control: A and B mix freely via the bridge, but C is isolated so it can serve as a control group — justify this with the map.
Validity features: include equal numbers of males and females; large sample size; and collect data on confounding factors (diet, exercise, exposure to disease-causing agents, lifestyle, general health) so risk can be attributed to age at exposure rather than other variables.
Analysis: correlate the data to find common features/activities, linking exposure age to disease risk. Examiners reward designs that explicitly use the stimulus and show sound grasp of reliability and validity.

2023 HSC7 marks[Air pollution has been linked to non-infectious neurological disorders. 500 people from each of three major cities, males and females aged 20–50, were monitored for 12 months; results gave the % of each sample with symptoms.] Evaluate the method used in this epidemiological study in determining a link between air pollution and the symptoms.

Show worked answer →

An evaluate question — top marks need a thorough grasp of what makes an epidemiological study valid, a comprehensive analysis of THIS study, and an informed judgement.

Judgement: The study is not valid and does not establish a statistically significant cause–effect link.

Weaknesses to analyse (each earns credit):

Confounding risk factors not controlled — age, sex, ethnic group and especially occupation (which could cause similar symptoms) are not accounted for; different ethnic groups are not indicated.
Exposure not measured — no indication of locality within each city or proximity to industry; subjects should have varying, measured levels of exposure so greater exposure can be matched to greater symptom incidence.
Time too short — 12 months may be insufficient for symptoms to develop.
No measure of symptom severity, and the sample/geographic and socioeconomic range may be too narrow for reliable trends.

Conclude with a clear judgement that the design flaws mean any apparent link is unreliable.

2022 HSC4 marks[A historical study followed non-smoking married women aged 40+ across 29 Japanese health districts for 14 years; lung-cancer mortality was assessed by husbands' smoking. Non-smoker women with non-smoker husbands: 8.7 per 100 000; with smoker husbands: 15.5; women who smoke: 32.8.] Evaluate the method used in this epidemiological study.

Show worked answer →

Top marks (4) need a thorough understanding of the methodology plus a suitable judgement.

Strengths: The study used large numbers of women matched into three exposure categories and followed them for a long period (14 years). These features make the sample size and duration adequate for a valid study.
Limitations: The categories assume each woman spends similar time with a smoker and that each smoker smokes a similar amount; in reality exposure time and smoke volume vary greatly, which could compromise the data. (Longer follow-up would give even more definitive data.)
Judgement: Because the large cohort should average out the variation in individual exposure, the method is overall valid despite these limitations.

2020 HSC4 marks[An 11-year study of 58 406 young adults related drinking-water arsenic exposure to mortality; survival graphs are shown for three exposure bands (<90, 90–223, >223 µg/L) in males and females.] The hypothesis was that exposure to arsenic in drinking water increases mortality in young adults. Discuss the data presented in the graphs in relation to this hypothesis.

Show worked answer →

Marks come from making points for and/or against the hypothesis and relating each to the data.

Support for the hypothesis:

In both sexes, increasing arsenic dose led to decreased survival, suggesting arsenic causes the decline; the dose–response was clearest in males.
Survival declined progressively over the 11 years, consistent with cumulative exposure reducing survival.

Reference the control/qualifications:

The <90 µg/L group had the highest survival and acts as a near-control; survival there was high even though this is above the WHO limit.
In females, all doses >90 µg/L gave a similar decrease, hinting at other interacting factors (e.g. nutrition, genes).
Note the magnitude caveat: despite the large sample, survival only dropped by ~0.1% or less, so the effect, while present, is small.

What this dot point is asking

The answer

Core measures

Study designs

Worked example: Doll and Hill and lung cancer

Benefits of epidemiology

Limitations of epidemiology

Examples in context

Try this

Exam-style practice questions

Weaknesses to analyse (each earns credit):

Support for the hypothesis:

Reference the control/qualifications:

Related dot points