Skip to main content
WAMathematics ApplicationsSyllabus dot point

How can we measure and use the relationship between two numerical variables?

Construct and interpret scatterplots, calculate the correlation coefficient and least-squares regression line, and use the line to make predictions.

How to read scatterplots, measure linear association with Pearson's r and the coefficient of determination, fit the least-squares line, and predict while judging interpolation versus extrapolation.

Generated by Claude Opus 4.78 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this dot point is asking
  2. Describing a scatterplot
  3. Correlation coefficient
  4. Least-squares regression line
  5. Interpolation and extrapolation
  6. Correlation is not causation

What this dot point is asking

You must identify the explanatory and response variables, describe the scatterplot, find and interpret rr and r2r^2, write the least-squares equation, and make and judge predictions.

Describing a scatterplot

Plot the explanatory variable xx on the horizontal axis and the response variable yy on the vertical axis. Then describe:

  • Form: linear or non-linear.
  • Direction: positive (rises) or negative (falls).
  • Strength: how closely points cluster about a line.
  • Outliers: any points well away from the pattern.

Correlation coefficient

Pearson's correlation coefficient rr measures the strength and direction of a linear association. It satisfies 1r1-1 \le r \le 1: r=+1r = +1 is a perfect positive line, r=1r = -1 a perfect negative line, and r=0r = 0 no linear association.

Least-squares regression line

The least-squares line minimises the sum of the squared vertical distances from the points to the line. Write it as y=a+bxy = a + bx, where bb is the slope and aa is the yy-intercept.

The slope bb is the predicted change in yy for each one-unit increase in xx. The intercept aa is the predicted yy when x=0x = 0 (meaningful only if x=0x = 0 is sensible in context).

Interpolation and extrapolation

Interpolation predicts within the range of the data and is usually reliable. Extrapolation predicts outside the range and is risky, because the linear pattern may not continue.

Correlation is not causation

A strong rr shows the variables move together; it does not prove that one causes the other. A lurking third variable, or coincidence, can produce correlation without any causal link.