← Module 5: Scientific Investigations

NSWInvestigating ScienceSyllabus dot point

Inquiry Question 2: How does the design of a valid experimental investigation allow for the analysis of first-hand data?

Process, analyse and interpret quantitative and qualitative data, including identifying and accounting for sources of error and uncertainty

A focused answer to the HSC Investigating Science Module 5 dot point on data analysis. Covers means and ranges, error bars, significant figures, random vs systematic error, outliers, and worked HSC past exam questions.

Generated by Claude OpusReviewed by Better Tuition Academy6 min answer

Have a quick question? Jump to the Q&A page

What this dot point is asking

NESA wants you to process raw quantitative data into summary statistics, represent data with appropriate graphs and tables, account for measurement error and uncertainty, and identify outliers. Quantitative data analysis is examined in nearly every Investigating Science paper.

The answer

Processing data turns raw measurements into evidence that can be interpreted against a hypothesis. The standard steps:

  1. Tabulate raw data.
  2. Calculate summary statistics (mean, median, range, standard deviation).
  3. Identify outliers and decide whether to exclude.
  4. Estimate uncertainty.
  5. Graph the result with error bars.
  6. Interpret in light of the hypothesis.

Summary statistics

Mean. Sum of values divided by count. The most common measure of central tendency for normally distributed data.

xΛ‰=βˆ‘xin\bar{x} = \frac{\sum x_i}{n}

Median
Middle value when data is ordered. Less affected by outliers than the mean.
Range
Difference between maximum and minimum. A simple measure of spread.
Standard deviation
A more rigorous measure of spread that quantifies how tightly values cluster around the mean.

s=βˆ‘(xiβˆ’xΛ‰)2nβˆ’1s = \sqrt{\frac{\sum (x_i - \bar{x})^2}{n - 1}}

Uncertainty

Every measurement has uncertainty arising from instrument resolution and natural variation. Standard ways to report:

  • Absolute uncertainty. 24.7Β±0.124.7 \pm 0.1 cm.
  • Percentage uncertainty. 0.124.7Γ—100=0.4%\frac{0.1}{24.7} \times 100 = 0.4 \%.

For multiple measurements, the uncertainty is usually estimated as half the range or as the standard deviation of the mean.

Significant figures

Report data with significant figures appropriate to the precision of the instrument.

  • A ruler graduated to 1 mm reads to Β±0.5\pm 0.5 mm; report measurements to 1 decimal place in cm.
  • A digital thermometer reading to 0.1 degrees Celsius reports to that resolution; report no extra digits.

When calculating, the answer cannot be more precise than the least precise input. 24.724.7 cm Γ—3.1\times 3.1 cm rounds to 7777 cm2^2 (two significant figures, matching the 3.13.1).

Random and systematic error

Random error. Unpredictable variation between repeated measurements, caused by chance fluctuations. Magnitude varies; direction varies. Random error reduces with averaging.

Systematic error. Consistent bias in one direction caused by miscalibration, methodological flaw or biased observer. Systematic error does not reduce with averaging.

Property Random error Systematic error
Direction Random Consistent
Effect Reduces precision Reduces accuracy
Reduction Replication, averaging Calibration, instrument correction
Example Stopwatch reading by hand Balance not zeroed

Outliers

A data point well outside the cluster of others. A common rule is values more than 2 to 3 standard deviations from the mean. Options:

  1. Investigate. Check whether the value is a transcription error, a faulty instrument or a real but rare observation.
  2. Repeat the measurement if possible.
  3. Exclude with justification. Document the reason for exclusion. Do not silently drop outliers; that is selective reporting.

Graphing

Choice of graph.

  • Line graph. Continuous independent variable (time, temperature).
  • Bar graph. Categorical independent variable (treatment groups).
  • Scatter plot. Investigating correlation between two continuous variables.

Error bars. Vertical lines showing the range or uncertainty around each data point. Mandatory for any quantitative graph in Investigating Science.

Axes. Labelled with quantity and unit. Independent variable on the x-axis, dependent variable on the y-axis. Origin clearly marked.

Interpreting in light of the hypothesis

A finding is meaningful when:

  1. The treatment effect is larger than the uncertainty in the measurement.
  2. The result is reproducible across replicates.
  3. Alternative explanations (confounders, instrument bias) can be ruled out.

A treatment difference smaller than the error bars is not evidence of effect.

Past exam questions, worked

Real questions from past NESA papers on this dot point, with our answer explainer.

2024 HSC5 marksA student measured the length of a metal rod five times and obtained: 24.6, 24.8, 24.7, 25.4, 24.7 cm. Process this data set and discuss sources of error.
Show worked answer β†’

A 5-mark answer needs identification of an outlier, calculation of mean and uncertainty, and analysis of sources of error.

Inspection
The value 25.4 cm is more than 3 standard deviations from the others and is a probable outlier. Confirm by repeating the measurement or excluding from the mean.
Excluding the outlier
values 24.6, 24.8, 24.7, 24.7 cm.
Mean
IMATH_0 cm.
Range
IMATH_1 cm.
Uncertainty
IMATH_2 cm (half the range, or the resolution of the ruler).
Reported value
IMATH_3 cm.

Sources of error.

  • Random error. Slight variations in ruler alignment, parallax, ruler reading. Reduced by replication.
  • Systematic error. Ruler calibration drift, room temperature affecting metal length, observer bias. Reduced by calibration.
  • Outlier (25.4 cm) likely reflects a misread or transcription error.

Markers reward identification of the outlier, mean, range, uncertainty and a distinction between random and systematic error.

2022 HSC3 marksExplain the difference between random and systematic error, with an example of each.
Show worked answer β†’

A 3-mark answer needs both definitions and clear examples.

Random error
Unpredictable variation in measurements caused by chance fluctuations in the measurement process. Magnitude and direction vary between measurements. Random error affects precision and is reduced by averaging multiple repeats.
Example
Reading a thermometer with mercury that is settling, giving slightly different values each time you check. Repeats give 22.1, 22.3, 21.9, 22.2 degrees Celsius. The mean is reliable, individual readings vary.
Systematic error
Consistent bias in measurements that shifts all readings in the same direction. Caused by miscalibration, faulty equipment or methodological flaws. Systematic error affects accuracy and is not reduced by averaging.
Example
A balance reads 0.1 g too high on every measurement because it was not zeroed. Every recorded mass is 0.1 g greater than the true value, regardless of how many replicates are taken.
Reduction
Random error: average multiple measurements. Systematic error: calibrate or correct the bias.

Markers reward both definitions, examples that show the direction of error, and the reduction method for each.

Related dot points