β Module 5: Scientific Investigations
Inquiry Question 2: How does the design of a valid experimental investigation allow for the analysis of first-hand data?
Process, analyse and interpret quantitative and qualitative data, including identifying and accounting for sources of error and uncertainty
A focused answer to the HSC Investigating Science Module 5 dot point on data analysis. Covers means and ranges, error bars, significant figures, random vs systematic error, outliers, and worked HSC past exam questions.
Have a quick question? Jump to the Q&A page
What this dot point is asking
NESA wants you to process raw quantitative data into summary statistics, represent data with appropriate graphs and tables, account for measurement error and uncertainty, and identify outliers. Quantitative data analysis is examined in nearly every Investigating Science paper.
The answer
Processing data turns raw measurements into evidence that can be interpreted against a hypothesis. The standard steps:
- Tabulate raw data.
- Calculate summary statistics (mean, median, range, standard deviation).
- Identify outliers and decide whether to exclude.
- Estimate uncertainty.
- Graph the result with error bars.
- Interpret in light of the hypothesis.
Summary statistics
Mean. Sum of values divided by count. The most common measure of central tendency for normally distributed data.
- Median
- Middle value when data is ordered. Less affected by outliers than the mean.
- Range
- Difference between maximum and minimum. A simple measure of spread.
- Standard deviation
- A more rigorous measure of spread that quantifies how tightly values cluster around the mean.
Uncertainty
Every measurement has uncertainty arising from instrument resolution and natural variation. Standard ways to report:
- Absolute uncertainty. cm.
- Percentage uncertainty. .
For multiple measurements, the uncertainty is usually estimated as half the range or as the standard deviation of the mean.
Significant figures
Report data with significant figures appropriate to the precision of the instrument.
- A ruler graduated to 1 mm reads to mm; report measurements to 1 decimal place in cm.
- A digital thermometer reading to 0.1 degrees Celsius reports to that resolution; report no extra digits.
When calculating, the answer cannot be more precise than the least precise input. cm cm rounds to cm (two significant figures, matching the ).
Random and systematic error
Random error. Unpredictable variation between repeated measurements, caused by chance fluctuations. Magnitude varies; direction varies. Random error reduces with averaging.
Systematic error. Consistent bias in one direction caused by miscalibration, methodological flaw or biased observer. Systematic error does not reduce with averaging.
| Property | Random error | Systematic error |
|---|---|---|
| Direction | Random | Consistent |
| Effect | Reduces precision | Reduces accuracy |
| Reduction | Replication, averaging | Calibration, instrument correction |
| Example | Stopwatch reading by hand | Balance not zeroed |
Outliers
A data point well outside the cluster of others. A common rule is values more than 2 to 3 standard deviations from the mean. Options:
- Investigate. Check whether the value is a transcription error, a faulty instrument or a real but rare observation.
- Repeat the measurement if possible.
- Exclude with justification. Document the reason for exclusion. Do not silently drop outliers; that is selective reporting.
Graphing
Choice of graph.
- Line graph. Continuous independent variable (time, temperature).
- Bar graph. Categorical independent variable (treatment groups).
- Scatter plot. Investigating correlation between two continuous variables.
Error bars. Vertical lines showing the range or uncertainty around each data point. Mandatory for any quantitative graph in Investigating Science.
Axes. Labelled with quantity and unit. Independent variable on the x-axis, dependent variable on the y-axis. Origin clearly marked.
Interpreting in light of the hypothesis
A finding is meaningful when:
- The treatment effect is larger than the uncertainty in the measurement.
- The result is reproducible across replicates.
- Alternative explanations (confounders, instrument bias) can be ruled out.
A treatment difference smaller than the error bars is not evidence of effect.
Past exam questions, worked
Real questions from past NESA papers on this dot point, with our answer explainer.
2024 HSC5 marksA student measured the length of a metal rod five times and obtained: 24.6, 24.8, 24.7, 25.4, 24.7 cm. Process this data set and discuss sources of error.Show worked answer β
A 5-mark answer needs identification of an outlier, calculation of mean and uncertainty, and analysis of sources of error.
- Inspection
- The value 25.4 cm is more than 3 standard deviations from the others and is a probable outlier. Confirm by repeating the measurement or excluding from the mean.
- Excluding the outlier
- values 24.6, 24.8, 24.7, 24.7 cm.
- Mean
- IMATH_0 cm.
- Range
- IMATH_1 cm.
- Uncertainty
- IMATH_2 cm (half the range, or the resolution of the ruler).
- Reported value
- IMATH_3 cm.
Sources of error.
- Random error. Slight variations in ruler alignment, parallax, ruler reading. Reduced by replication.
- Systematic error. Ruler calibration drift, room temperature affecting metal length, observer bias. Reduced by calibration.
- Outlier (25.4 cm) likely reflects a misread or transcription error.
Markers reward identification of the outlier, mean, range, uncertainty and a distinction between random and systematic error.
2022 HSC3 marksExplain the difference between random and systematic error, with an example of each.Show worked answer β
A 3-mark answer needs both definitions and clear examples.
- Random error
- Unpredictable variation in measurements caused by chance fluctuations in the measurement process. Magnitude and direction vary between measurements. Random error affects precision and is reduced by averaging multiple repeats.
- Example
- Reading a thermometer with mercury that is settling, giving slightly different values each time you check. Repeats give 22.1, 22.3, 21.9, 22.2 degrees Celsius. The mean is reliable, individual readings vary.
- Systematic error
- Consistent bias in measurements that shifts all readings in the same direction. Caused by miscalibration, faulty equipment or methodological flaws. Systematic error affects accuracy and is not reduced by averaging.
- Example
- A balance reads 0.1 g too high on every measurement because it was not zeroed. Every recorded mass is 0.1 g greater than the true value, regardless of how many replicates are taken.
- Reduction
- Random error: average multiple measurements. Systematic error: calibrate or correct the bias.
Markers reward both definitions, examples that show the direction of error, and the reduction method for each.
Related dot points
- Plan investigations to ensure that they are valid and reliable, including the use of an appropriate experimental design with consideration of independent, dependent and controlled variables
A focused answer to the HSC Investigating Science Module 5 dot point on variables and experimental design. Covers independent, dependent and controlled variables, control groups, sample size, and worked HSC past exam questions.
- Evaluate scientific investigations and findings in terms of reliability, validity, accuracy and precision of data
A focused answer to the HSC Investigating Science Module 5 dot point on reliability, validity, accuracy and precision. The four concepts every Investigating Science student must distinguish, with worked HSC past exam questions.
- Communicate scientific understanding using suitable language and terminology, including the role of peer review and replication in confirming scientific findings
A focused answer to the HSC Investigating Science Module 5 dot point on peer review and replication. Covers what peer review does, why it matters, the reproducibility crisis, and worked HSC past exam questions on confirming scientific findings.