When does evidence from a sample justify a conclusion about a whole population, and what makes a generalisation hasty or biased?
evaluate inductive generalisations by assessing sample size, representativeness and the dangers of hasty generalisation and biased sampling
A focused QCE Unit 3 answer on inductive generalisation. Covers the structure of generalising from a sample to a population, the criteria of sufficient size and representativeness, the fallacies of hasty generalisation and biased sampling, and why anecdotes and self-selected samples mislead.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this dot point is asking
QCAA wants you to evaluate the inductive move from a sample to a population: from "most people I surveyed prefer X" to "most people prefer X." You need the structure of an inductive generalisation, the two criteria that make it strong (adequate size and representativeness), and the fallacies that wreck it (hasty generalisation and biased sampling). This is essential for analysing scientific, statistical and everyday claims.
The answer
The structure of a generalisation
An inductive generalisation infers a claim about a whole population from observations of a sample of it:
- In the observed sample, proportion p of members have feature F.
- Therefore, in the whole population, roughly proportion p have feature F.
The conclusion goes beyond the evidence (we have not checked every member), so the argument is inductive and assessed for strength, not validity. Polling, quality control and most scientific measurement rest on this pattern.
Criterion one: sufficient sample size
A generalisation is stronger when the sample is large enough. Inferring the eating habits of a nation from three friends is weak; the small sample could easily be unrepresentative by chance. Larger samples reduce the role of luck. But size alone is not enough, which is why the second criterion matters more.
Criterion two: representativeness
A sample is representative when it resembles the population in the features relevant to the conclusion. A sample can be huge yet biased. The classic case is the 1936 Literary Digest poll, which surveyed millions of Americans drawn from telephone and car-registration lists and wrongly predicted the presidential result, because in the Depression those lists over-represented wealthier voters. A smaller but properly random sample beat it. Representativeness usually requires random selection so that every member of the population has an equal chance of being included.
Hasty generalisation
The hasty generalisation (or fallacy of insufficient sample) draws a broad conclusion from a sample that is too small or atypical. "My grandfather smoked and lived to ninety, so smoking is harmless" generalises from a single anomalous case. Anecdotes are the most common form: one vivid story is treated as evidence about everyone.
Biased sampling
Biased sampling uses a sample that is systematically unrepresentative. Two frequent forms:
- Self-selection: people who choose to respond differ from those who do not. Online reviews over-represent the very pleased and the very angry.
- Convenience sampling: surveying whoever is easiest to reach (your own friends, one suburb) rather than a random cross-section.
A famous related error is survivorship bias: studying only the cases that "survived" a process. In World War II, examining only returning aircraft for bullet holes ignored the planes that did not return, which is where the fatal damage actually was.
Evaluating a generalisation in a response
To assess a generalisation in QCAA style: (1) identify the sample and the target population; (2) ask whether the sample is large enough; (3) ask, more importantly, whether it is representative, looking for self-selection, convenience and survivorship bias; (4) judge the strength accordingly. Naming "hasty generalisation" earns little; you must explain why the sample fails to support the population claim.
Where generalisation sits in philosophy
Inductive generalisation is the engine of empirical science and the target of Hume's problem of induction: even a perfectly representative past sample cannot logically guarantee the future. It also connects to the informal fallacies strand, since hasty generalisation is a fallacy of presumption. Evaluating it well is the difference between evidence-based reasoning and prejudice dressed up as data.
Try this
Q1. State the two main criteria for a strong inductive generalisation. [2 marks]
- Cue. Sufficient sample size and representativeness of the sample.
Q2. Explain why a large sample can still produce a weak generalisation. [3 marks]
- Cue. If the sampling method is biased (self-selection, convenience), size does not fix the unrepresentativeness.
Q3. Identify the fallacy: "Two players from that school cheated, so the whole school is dishonest." [2 marks]
- Cue. Hasty generalisation from too small and atypical a sample.
Related dot points
- distinguish inductive from deductive reasoning and evaluate inductive arguments for strength and cogency rather than validity
A focused QCE Unit 3 answer on inductive reasoning. Covers the difference between deduction and induction, why inductive arguments are assessed for strength and cogency rather than validity, the role of probability, and how added evidence can defeat an otherwise strong inference.
- analyse and evaluate arguments from analogy, assessing the relevance and number of similarities and the presence of relevant disanalogies
A focused QCE Unit 3 answer on analogical reasoning. Covers the structure of an argument from analogy, the criteria that make one strong (relevance, number and variety of similarities), how relevant disanalogies weaken it, and famous philosophical analogies such as Paley's watch and Thomson's violinist.
- apply basic probabilistic reasoning to evaluate arguments, including conditional probability, base rates and common statistical fallacies
A focused QCE Unit 3 answer on probabilistic reasoning. Covers conditional probability, the role of base rates, Bayesian updating in plain terms, and common statistical fallacies including the base-rate fallacy, the conjunction fallacy and the gambler's fallacy.