Skip to main content
WAMathematics ApplicationsSyllabus dot point

How do we put a number on the strength and direction of a linear association?

Calculate and interpret Pearson's correlation coefficient r and the coefficient of determination r squared, and state their limitations.

How to calculate Pearson's r with technology, read its sign and size, convert to the coefficient of determination, interpret the proportion of variation explained, and respect the limits of both measures.

Generated by Claude Opus 4.76 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this dot point is asking
  2. Reading Pearson's r
  3. The coefficient of determination
  4. Limitations of both measures

What this dot point is asking

You must compute rr (almost always with technology), describe what its sign and size mean, find and interpret r2r^2, and know when neither number should be trusted.

Reading Pearson's r

The correlation coefficient rr satisfies 1r1-1 \le r \le 1.

  • The sign matches the direction: positive rr for a positive association, negative for a negative one.
  • The size (distance from zero) matches the strength: values near ±1\pm 1 are strong, near 00 are weak.

In practice you read rr from your calculator's regression output after entering the two lists; you are rarely asked to compute it by hand.

The coefficient of determination

The coefficient of determination is r2r^2. It is the fraction (or percentage) of the variation in the response variable that is explained by the linear relationship with the explanatory variable.

Because it is squared, r2r^2 is always between 00 and 11 and loses the sign. You quote the direction from rr and the explained proportion from r2r^2.

Limitations of both measures

Both rr and r2r^2 describe only linear association and only over the range of the data.

  • A curved relationship can give a small rr even though the variables are strongly related; the relationship is just not linear.
  • An outlier can pull rr towards or away from zero, so always check the scatterplot.
  • Neither number proves causation. A strong rr between two variables can be produced by a lurking third variable.