When a scatterplot is curved, how do the squared, log and reciprocal transformations straighten the data so a least-squares line can be fitted?

Recognise non-linear association from a scatterplot and residual plot, apply the squared, logarithmic or reciprocal transformation to the explanatory or response variable to linearise the data, fit a least-squares line to the transformed data, and use it to predict.

A focused answer to the VCE General Mathematics Unit 3 Data analysis key-knowledge point on data transformation. It covers spotting curvature, the circle-of-transformations idea, applying the squared, log and reciprocal transformations, fitting a line to transformed data, and predicting back on the original scale.

Generated by Claude Opus 4.8 | 7 min answer | Updated 2026-06-02

Reviewed by: AI editorial process; not yet individually human-reviewed

What this dot point is asking

VCAA wants you to handle bivariate data whose scatterplot is curved rather than linear. A least-squares line should only be fitted to a linear relationship, so when the residual plot shows a clear pattern you first transform one of the variables, using a squared, logarithmic or reciprocal transformation, to straighten the data. You then fit the least-squares line to the transformed data and use it to predict, remembering to undo the transformation at the end. This is the natural follow-on from correlation and regression.

Spotting that a transformation is needed

A curved scatterplot, or a residual plot with a clear arch or U-shape rather than random scatter, signals that a straight line is the wrong model. Rather than abandon regression, you re-express one variable so that the relationship becomes linear.
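To see what a patterned residual plot looks like in numbers, here is a minimal Python sketch. The data are made up for illustration (they follow $y = x^2$ and are not from any exam), and `least_squares` is a helper written just for this example.

```python
# Hypothetical curved data (y = x^2); not taken from any exam question.
xs = [1, 2, 3, 4, 5, 6]
ys = [1, 4, 9, 16, 25, 36]

def least_squares(xs, ys):
    """Return (intercept, slope) of the least-squares line of ys on xs."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    return y_bar - slope * x_bar, slope

a, b = least_squares(xs, ys)

# residual = actual value - predicted value
residuals = [y - (a + b * x) for x, y in zip(xs, ys)]

# Record only the sign of each residual, in order of increasing x.
signs = "".join("+" if r > 0 else "-" for r in residuals)
print(signs)  # +----+
```

The residuals are positive at both ends and negative in the middle. That U-shape, rather than a random mix of signs, is the signal that a straight line does not suit the raw data.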

Choosing the transformation

The direction in which the curve bends tells you which transformation to apply. Stretching the high end of the $x$-axis (squaring $x$) or compressing it (taking the log or reciprocal of $x$) moves the points so that the bulge straightens. In the exam you are usually told which transformation to apply; otherwise, pick the one that gives the higher $r^2$ on the transformed data.

Fitting and predicting with a transformed model

Once a variable is transformed, treat the transformed quantity as a new variable and fit the least-squares line as normal.
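In practice your CAS calculator does the fitting, but the idea can be sketched in a few lines of Python. The data below are hypothetical, generated from $y = 5 + 0.4x^2$ purely so the result is easy to check, and `least_squares` is a helper written for this sketch.

```python
# Hypothetical data generated from y = 5 + 0.4 x^2, for illustration only.
xs = [1, 2, 3, 4, 5, 6]
ys = [5.4, 6.6, 8.6, 11.4, 15.0, 19.4]

def least_squares(xs, ys):
    """Return (intercept, slope) of the least-squares line of ys on xs."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    return y_bar - slope * x_bar, slope

# Create the new variable X = x^2, then fit y on X exactly as for any line.
X = [x ** 2 for x in xs]
a, b = least_squares(X, ys)
print(round(a, 2), round(b, 2))  # 5.0 0.4
```

Nothing about the fitting step changes. The only difference is that the explanatory column handed to the regression is $x^2$ rather than $x$.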

Fitting a squared transformation and predicting

A relationship between $x$ and $y$ is curved, and a squared transformation on $x$ straightens it. After computing $X = x^2$ for each point, the least-squares line of $y$ on $X$ is

$$y = 5 + 0.4 X, \qquad \text{that is} \qquad y = 5 + 0.4\,x^2.$$

Predict $y$ when $x = 6$.

Step 1: Identify the transformed variable.

Because the scatterplot showed curvature and a squared transformation was applied to the explanatory variable $x$, the new variable is $X = x^2$. The fitted line was found using $X$ values, so any prediction must use the same transformed quantity rather than the raw $x$.

Step 2: Transform the input value.

Before substituting into the equation, square the given $x$ value to obtain the corresponding $X$ value. Skipping this step and substituting $x$ directly would treat the slope as though it multiplies an untransformed value, giving the wrong prediction.

$$X = 6^2 = 36$$

Step 3: Substitute and evaluate.

With $X = 36$, substitute into the fitted equation to find the predicted $y$. Because the transformation was on $x$ only, the predicted $y$ comes straight out of the equation with no further undoing required.

$$y = 5 + 0.4 \times 36 = 5 + 14.4 = 19.4$$

Step 4: Repeat the logic for a log-transformed model.

The same two-step approach applies to any transformation: transform the input, then substitute. For a log model $y = 2 + 3\log_{10} x$, suppose you need the predicted $y$ at $x = 100$. Since $10^2 = 100$, $\log_{10} 100 = 2$, and this value goes straight into the equation.

$$y = 2 + 3\log_{10}(100) = 2 + 3 \times 2 = 8$$

Final answer: Using the squared model, the predicted $y$ at $x = 6$ is $19.4$. Using the log model, the predicted $y$ at $x = 100$ is $8$.
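The two predictions above can be checked with a short Python sketch. The function names `predict_squared` and `predict_log` are made up for this example; the coefficients are the ones from the worked example.

```python
import math

def predict_squared(x):
    """Prediction from y = 5 + 0.4 x^2: transform the input, then substitute."""
    X = x ** 2           # Step 2: transform the input
    return 5 + 0.4 * X   # Step 3: substitute into the fitted line

def predict_log(x):
    """Prediction from y = 2 + 3 log10(x)."""
    X = math.log10(x)    # transform the input
    return 2 + 3 * X

print(round(predict_squared(6), 1))  # 19.4
print(round(predict_log(100), 1))    # 8.0

# The trap: substituting the raw x = 6 gives 5 + 0.4 * 6 = 7.4, which is wrong.
```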

Reading the transformed equation back

The fitted equation already contains the transformation, so prediction is just careful substitution. If the transformation was on the response variable, for example $\sqrt{y} = a + bx$ (that is, $y^{1/2} = a + bx$) or $\log_{10} y = a + bx$, you must undo it at the end: square both sides in the first case, or raise $10$ to the power of the right-hand side in the second. Always check whether the transformation sits on $x$ or on $y$ before predicting.
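As a rough illustration of undoing a transformation on the response variable, the Python sketch below uses hypothetical coefficients (they are not from any exam question) chosen so the arithmetic is easy to follow by hand.

```python
def predict_from_log_y(a, b, x):
    """Model: log10(y) = a + b x. The line predicts log10(y), not y."""
    log_y = a + b * x
    return 10 ** log_y   # undo the log by raising 10 to that power

def predict_from_sqrt_y(a, b, x):
    """Model: sqrt(y) = a + b x. The line predicts sqrt(y), not y."""
    root_y = a + b * x
    return root_y ** 2   # undo the square root by squaring

# Hypothetical model log10(y) = 0.5 + 0.25 x at x = 6: log10(y) = 2, so y = 100.
print(predict_from_log_y(0.5, 0.25, 6))   # 100.0

# Hypothetical model sqrt(y) = 1 + 0.5 x at x = 6: sqrt(y) = 4, so y = 16.
print(predict_from_sqrt_y(1, 0.5, 6))     # 16.0
```

Stopping at the intermediate value (2 or 4 here) is the most common way to lose the mark, because that number is the transformed response rather than $y$ itself.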

Common traps

Predicting without transforming the input: If the model is $y = 5 + 0.4 x^2$, you must square the given $x$ before substituting. Multiplying the slope by the raw $x$ ignores the transformation.
Forgetting to undo a transformation on $y$: When $y$ itself was transformed (for example $\log_{10} y = a + bx$), evaluate the right-hand side to get $\log_{10} y$ first, then raise $10$ to that power to recover $y$.
Fitting a line to curved raw data: The whole point is that the raw relationship is non-linear; fitting a line to it gives a poor model and a patterned residual plot.
Choosing the transformation at random: Pick the transformation that straightens the plot and raises $r^2$, or use the one the question specifies. Different transformations give different equations.

The circle of transformations idea

A helpful way to choose a transformation is to picture the curve's bulge. If the curve rises steeply then flattens (concave down), taking the log or reciprocal of $x$, or squaring $y$, tends to straighten it. If the curve rises gently then steepens (concave up), squaring $x$, or taking the log or reciprocal of $y$, often works. The six VCE moves are $x^2$, $\log_{10}x$, $\tfrac{1}{x}$ and the same three on $y$. Each shifts points along one axis: squaring stretches the high end, while log and reciprocal compress it. You rarely need to derive the choice from scratch in the exam, because the question usually names the transformation, but understanding why it works helps you avoid applying it to the wrong variable.
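To make "stretching" and "compressing" concrete, this small Python sketch applies the three moves to three hypothetical $x$ values and measures the gaps between neighbouring points. The values and the dictionary name `gaps_by_move` are made up for illustration.

```python
import math

xs = [1, 10, 100]  # hypothetical x values; raw gaps are 9 and 90

moves = {
    "x^2": lambda x: x ** 2,
    "log10(x)": math.log10,
    "1/x": lambda x: 1 / x,
}

gaps_by_move = {}
for name, move in moves.items():
    t = [move(x) for x in xs]
    # Distance between neighbouring transformed values, low end first.
    gaps_by_move[name] = [round(abs(t[i + 1] - t[i]), 4) for i in range(len(t) - 1)]
    print(name, gaps_by_move[name])
```

Squaring makes the upper gap far larger than the lower one, so the high end is stretched. The log makes the two gaps equal, and the reciprocal makes the upper gap smaller than the lower one, so both compress the high end. Note that the reciprocal also reverses the order of the values.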

Comparing $r^2$ before and after

The justification for a transformation is that it improves the linearity of the relationship, which shows up as a higher coefficient of determination $r^2$ and a residual plot that becomes random scatter. A standard exam task is to fit the line to the raw data, note a patterned residual plot and a modest $r^2$, then apply the named transformation, refit, and report the new $r^2$ as evidence that the transformed model is better. Always state the improvement with figures, for example that $r^2$ rises from $0.78$ to $0.96$, and confirm that the residual plot of the transformed model shows no remaining pattern. That combination of a higher $r^2$ and a structureless residual plot is the full justification markers want.
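The before-and-after comparison can be sketched in Python as follows. The data are hypothetical (a curve that rises steeply then flattens), and `r_squared` is a helper written for this example, computing $r^2$ as the square of the correlation coefficient.

```python
import math

def r_squared(xs, ys):
    """Coefficient of determination for the least-squares line of ys on xs."""
    n = len(xs)
    x_bar, y_bar = sum(xs) / n, sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    return sxy ** 2 / (sxx * syy)

# Hypothetical data that rises steeply and then flattens.
xs = [1, 10, 100, 1000]
ys = [2.1, 4.8, 8.2, 10.9]

r2_raw = r_squared(xs, ys)                             # line fitted to raw x
r2_log = r_squared([math.log10(x) for x in xs], ys)    # line fitted to log10(x)

print(f"raw r^2 = {r2_raw:.2f}, log-transformed r^2 = {r2_log:.2f}")
```

For these made-up values the log-transformed $r^2$ is much closer to $1$ than the raw one. A higher $r^2$ alone is only half the argument: you would still check that the residual plot of the transformed model has no pattern.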

Predicting and the limits of the model

Once you have the transformed equation, prediction is careful substitution: transform the input if $x$ was transformed, substitute, then undo any transformation on $y$ at the very end. The same warnings as ordinary regression still apply, only more sharply. Extrapolating a transformed model far beyond the data range is even more unreliable than for a linear model, because the chosen transformation was only justified over the observed range. State whether a prediction is an interpolation (reasonably reliable) or an extrapolation (treat with caution), and never read a transformation as proof that one variable causes the other.
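The interpolation check is simple enough to write as a one-line rule. The Python sketch below assumes the observed $x$ values are available as a list; the function name `prediction_type` and the data are hypothetical.

```python
def prediction_type(x_new, observed_xs):
    """Classify a prediction by whether x_new lies inside the observed x range."""
    if min(observed_xs) <= x_new <= max(observed_xs):
        return "interpolation"
    return "extrapolation"

observed_xs = [1, 2, 3, 4, 5, 6]  # hypothetical raw x values used to fit the model

print(prediction_type(4.5, observed_xs))  # interpolation
print(prediction_type(20, observed_xs))   # extrapolation
```

The comparison uses the raw $x$ values, not the transformed ones, because the question asks about the range over which data were actually collected.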

Why this matters for the exams

Transformation questions appear most years and reward students who keep track of which variable was transformed and who undo it correctly when predicting. They build directly on correlation and least-squares regression: the residual plot is the trigger, the transformation is the fix, and the prediction is the payoff. Show the transformed value explicitly in your working so a marker can follow each step.

Exam-style practice questions

Practice questions written in the style of VCAA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

2023 VCAA, 1 mark

A scatterplot of tree height (m) against age (years) is linearised using a logarithm (base 10) transformation applied to the variable age. The equation of the least squares line is height = -3.8 + 12.6 × log10(age). Using this equation, the age, in years, of a tree with a height of 8.52 m is closest to

A. 7.9
B. 8.9
C. 9.1
D. 9.5
E. 9.9

Substitute height = 8.52 into the transformed equation and solve for age.

8.52 = -3.8 + 12.6 × log10(age).

12.6 × log10(age) = 8.52 + 3.8 = 12.32, so log10(age) = 12.32 / 12.6 = 0.97778.

age = 10^0.97778 = 9.50 years.

This is closest to 9.5, so the answer is D. Remember to undo the log by raising 10 to the power of each side of the equation.
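The rearrangement above can be checked with a short Python sketch. The function name `age_from_height` is made up for this example; the coefficients and answer options come from the question.

```python
def age_from_height(height):
    """Invert height = -3.8 + 12.6 * log10(age) to recover age."""
    log_age = (height + 3.8) / 12.6   # rearrange for log10(age)
    return 10 ** log_age              # undo the log

age = age_from_height(8.52)

options = {"A": 7.9, "B": 8.9, "C": 9.1, "D": 9.5, "E": 9.9}
closest = min(options, key=lambda k: abs(options[k] - age))

print(round(age, 2), closest)  # 9.5 D
```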

2025 VCAA, 1 mark

A squared transformation is applied to the variable doctors (number per 1000 people) when modelling life expectancy in years, life. The equation of the least squares line fitted to this transformed data is of the form life = a + b × (doctors)^2. Using this equation, the predicted life, in years, for a country with two doctors per 1000 people is closest to

A. 73.6
B. 74.0
C. 74.5
D. 74.9

Using the data table, the squared transformation creates a new explanatory variable (doctors)^2. Fitting a least squares line of life on (doctors)^2 with a calculator gives, to four significant figures, life = 63.12 + 2.842 × (doctors)^2.

To predict for two doctors per 1000 people, substitute doctors = 2, so (doctors)^2 = 4.

life = 63.12 + 2.842 × 4 = 63.12 + 11.37 = 74.5 years.

This is closest to 74.5, so the answer is C. The key step is squaring the value before multiplying by the slope.