How does operant conditioning explain voluntary behaviour through its consequences?

Explain operant conditioning, including reinforcement, punishment, shaping and schedules of reinforcement, with reference to Skinner

WACE Year 12 Psychology Unit 3: operant conditioning, positive and negative reinforcement, positive and negative punishment, shaping, and continuous versus partial schedules of reinforcement, with reference to Skinner and Thorndike.

Generated by Claude Opus 4.8 | 6 min answer | Updated 2026-06-02

Reviewed by: AI editorial process; not yet individually human-reviewed

What this dot point is asking

SCSA asks you to distinguish the four consequences, define reinforcement versus punishment, explain shaping, and compare schedules of reinforcement. The marked skill is correctly classifying a real consequence and predicting its effect.

The law of effect and the Skinner box

Edward Thorndike's law of effect states that behaviours followed by satisfying consequences are more likely to be repeated, while those followed by unpleasant consequences are less likely. B. F. Skinner developed this into operant conditioning, studying it with the operant chamber (the Skinner box), in which an animal learns that pressing a lever delivers food or stops a shock.

The key idea is that the consequence of a behaviour changes the probability that the behaviour will recur.

Reinforcement and punishment

There are four consequences, defined by whether a stimulus is added or removed and whether behaviour increases or decreases.

Positive reinforcement: adding a pleasant stimulus to increase a behaviour (giving praise or a reward).
Negative reinforcement: removing an unpleasant stimulus to increase a behaviour (taking a painkiller removes a headache, so taking the painkiller becomes more likely in future).
Positive punishment: adding an unpleasant stimulus to decrease a behaviour (a fine, extra chores).
Negative punishment: removing a pleasant stimulus to decrease a behaviour (losing phone privileges, response cost).
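
The two-question rule above (is a stimulus added or removed, and does the behaviour increase or decrease?) can be written out as a small lookup. This is only an illustrative sketch: the function name `classify_consequence` and its string arguments are made up for this example and are not part of the syllabus.

```python
def classify_consequence(stimulus: str, behaviour: str) -> str:
    """Name the operant consequence from two observations.

    stimulus:  "added" or "removed"        (hypothetical labels for this sketch)
    behaviour: "increases" or "decreases"  (the effect on the behaviour)
    """
    if stimulus not in ("added", "removed"):
        raise ValueError("stimulus must be 'added' or 'removed'")
    if behaviour not in ("increases", "decreases"):
        raise ValueError("behaviour must be 'increases' or 'decreases'")

    # Positive/negative refers only to adding or removing a stimulus.
    sign = "positive" if stimulus == "added" else "negative"
    # Reinforcement increases behaviour; punishment decreases it.
    kind = "reinforcement" if behaviour == "increases" else "punishment"
    return f"{sign} {kind}"


# A fine is an added stimulus that decreases the behaviour.
print(classify_consequence("added", "decreases"))
# A painkiller removes a headache, so taking it increases.
print(classify_consequence("removed", "increases"))
```

Answering the two questions separately, rather than asking whether the consequence feels "good" or "bad", is what stops negative reinforcement being confused with punishment.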

Shaping

Complex behaviours rarely appear all at once. Shaping is the reinforcement of successive approximations: rewarding behaviours that get progressively closer to the desired target. A rat is first rewarded for facing the lever, then for moving toward it, then for touching it, then for pressing it. Animal trainers and teachers use shaping to build skills step by step.
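
The rat example can be sketched as a rising criterion: a behaviour is rewarded only if it is at least as close to the target as the current criterion, and each reward raises the criterion by one step. This is a simplified illustration, not a model of real training, where each step is usually reinforced several times before the criterion moves. The names `STEPS` and `shape` are invented for this sketch.

```python
# Successive approximations from the rat example, ordered from furthest
# to closest to the target behaviour.
STEPS = [
    "facing the lever",
    "moving toward the lever",
    "touching the lever",
    "pressing the lever",
]


def shape(observed, steps=STEPS):
    """Return the observed behaviours that would be rewarded.

    Simplifying assumption: one reward is enough to raise the criterion
    to the next approximation.
    """
    criterion = 0  # index of the least-advanced behaviour still rewarded
    rewarded = []
    for behaviour in observed:
        level = steps.index(behaviour) if behaviour in steps else -1
        if level >= criterion:
            rewarded.append(behaviour)
            # Raise the criterion, but never past the target itself.
            criterion = min(level + 1, len(steps) - 1)
    return rewarded


session = [
    "facing the lever",
    "facing the lever",         # no longer rewarded: the criterion has moved on
    "moving toward the lever",
    "touching the lever",
    "pressing the lever",
    "pressing the lever",       # the target keeps being rewarded
]
print(shape(session))
```

The second "facing the lever" earns nothing because earlier approximations stop being reinforced once a closer one has been rewarded.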

Schedules of reinforcement

How often a behaviour is reinforced changes how strongly it is learned and how resistant it is to extinction.

Continuous reinforcement rewards every correct response. Learning is fast, but extinction is also fast once rewards stop.
Partial (intermittent) reinforcement rewards only some responses. Learning is slower but far more resistant to extinction.

Partial schedules vary by ratio (number of responses) or interval (time), and by whether the requirement is fixed or variable. Variable-ratio schedules, which reward after an unpredictable number of responses, produce the highest and most persistent response rates, which helps explain why gambling on poker machines is so persistent.
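
The difference between a fixed and a variable ratio requirement can be illustrated with a toy simulation. This is a sketch under stated assumptions, not a model of any real experiment or machine: the function names, the seed, and the choice to draw each gap uniformly from 1 to `2 * mean_ratio - 1` (so that it averages `mean_ratio`) are all assumptions made for the example.

```python
import random


def fixed_ratio(n_responses, ratio):
    """Response numbers rewarded when every `ratio`-th response pays off.

    With ratio=1 this is continuous reinforcement: every response is rewarded.
    """
    return [r for r in range(1, n_responses + 1) if r % ratio == 0]


def variable_ratio(n_responses, mean_ratio, seed=0):
    """Response numbers rewarded when the gap between rewards is unpredictable.

    Assumption for this sketch: each gap is drawn uniformly from
    1 .. 2 * mean_ratio - 1, so gaps average mean_ratio.
    """
    rng = random.Random(seed)  # seeded so the run is repeatable
    rewarded = []
    next_reward = rng.randint(1, 2 * mean_ratio - 1)
    for r in range(1, n_responses + 1):
        if r == next_reward:
            rewarded.append(r)
            next_reward = r + rng.randint(1, 2 * mean_ratio - 1)
    return rewarded


print(fixed_ratio(20, 5))      # predictable: every fifth response
print(variable_ratio(20, 5))   # unpredictable gaps between rewards
```

On the fixed schedule the learner can tell exactly when a reward is due. On the variable schedule any response might be the one that pays, which is the feature the text links to persistent responding.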

Why operant conditioning matters

Operant conditioning underpins behaviour management, token economies, education, animal training and habit formation. It also explains the persistence of gambling and the difficulty of breaking habits maintained on variable schedules. Unlike classical conditioning, which involves involuntary, reflexive responses, it governs voluntary, goal-directed behaviour.

Exam-style practice questions

Practice questions written in the style of SCSA exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

WACE 2021, 6 marks. For each of the following, identify whether it is positive reinforcement, negative reinforcement, positive punishment or negative punishment, and justify your choice: (a) a student is given a detention for being late; (b) a driver buckles a seatbelt to stop the warning beep; (c) a child is praised for tidying their room; (d) a teenager loses screen time for rudeness.

A 6 mark classification answer needs each item labelled and justified by add/remove and increase/decrease.

(a) Positive punishment: An unpleasant stimulus (detention) is added to decrease lateness.
(b) Negative reinforcement: An unpleasant stimulus (the beep) is removed, increasing the seatbelt behaviour.
(c) Positive reinforcement: A pleasant stimulus (praise) is added to increase tidying.
(d) Negative punishment: A pleasant stimulus (screen time) is removed to decrease rudeness.

Markers reward correct labels and justification using two rules: reinforcement increases and punishment decreases behaviour, while positive and negative refer only to adding or removing a stimulus.

WACE 2023, 7 marks. Explain shaping and the schedules of reinforcement, and use them to explain why gambling on poker machines is so persistent.

A 7 mark extended response needs shaping, the schedules, and the gambling application.

Shaping: The reinforcement of successive approximations: rewarding behaviours that get progressively closer to a target, used to build complex behaviour step by step.
Schedules: Continuous reinforcement rewards every response, giving fast learning but fast extinction. Partial schedules reward only some responses and are far more resistant to extinction. Variable-ratio schedules reward an unpredictable number of responses and produce the highest, most persistent response rates.
Gambling: Poker machines pay out on a variable-ratio schedule: wins are unpredictable, so the player keeps responding in case the next press wins. This produces a high, extinction-resistant response rate, which is why the behaviour persists even through long losing runs.
Conclusion: Markers reward defining shaping, distinguishing the schedules, and explaining gambling through the variable-ratio schedule.