Skip to main content
WAPsychologySyllabus dot point

How does operant conditioning explain voluntary behaviour through its consequences?

Explain operant conditioning, including reinforcement, punishment, shaping and schedules of reinforcement, with reference to Skinner

WACE Year 12 Psychology Unit 3: operant conditioning, positive and negative reinforcement, positive and negative punishment, shaping, and continuous versus partial schedules of reinforcement, with reference to Skinner and Thorndike.

Generated by Claude Opus 4.76 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this dot point is asking
  2. The law of effect and the Skinner box
  3. Reinforcement and punishment
  4. Shaping
  5. Schedules of reinforcement
  6. Why operant conditioning matters

What this dot point is asking

SCSA asks you to distinguish the four consequences, define reinforcement versus punishment, explain shaping, and compare schedules of reinforcement. The marked skill is correctly classifying a real consequence and predicting its effect.

The law of effect and the Skinner box

Edward Thorndike's law of effect states that behaviours followed by satisfying consequences are more likely to be repeated, while those followed by unpleasant consequences are less likely. B. F. Skinner developed this into operant conditioning, studying it with the operant chamber (the Skinner box), in which an animal learns that pressing a lever delivers food or stops a shock.

The key idea is that the consequence of a behaviour changes the probability that the behaviour will recur.

Reinforcement and punishment

There are four consequences, defined by whether a stimulus is added or removed and whether behaviour increases or decreases.

  • Positive reinforcement: adding a pleasant stimulus to increase a behaviour (giving praise or a reward).
  • Negative reinforcement: removing an unpleasant stimulus to increase a behaviour (taking a painkiller to remove a headache, so taking the painkiller increases).
  • Positive punishment: adding an unpleasant stimulus to decrease a behaviour (a fine, extra chores).
  • Negative punishment: removing a pleasant stimulus to decrease a behaviour (losing phone privileges, response cost).

Shaping

Complex behaviours rarely appear all at once. Shaping is the reinforcement of successive approximations: rewarding behaviours that get progressively closer to the desired target. A rat is first rewarded for facing the lever, then for moving toward it, then for touching it, then for pressing it. Animal trainers and teachers use shaping to build skills step by step.

Schedules of reinforcement

How often a behaviour is reinforced changes how strongly it is learned and how resistant it is to extinction.

  • Continuous reinforcement rewards every correct response. Learning is fast, but extinction is also fast once rewards stop.
  • Partial (intermittent) reinforcement rewards only some responses. Learning is slower but far more resistant to extinction.

Partial schedules vary by ratio (number of responses) or interval (time), and by whether the requirement is fixed or variable. Variable-ratio schedules, which reward an unpredictable number of responses, produce the highest and most persistent response rates, which is why poker machines and gambling are so addictive.

Why operant conditioning matters

Operant conditioning underpins behaviour management, token economies, education, animal training and habit formation. It also explains the persistence of gambling and the difficulty of breaking habits maintained on variable schedules. Unlike classical conditioning, it governs voluntary, goal-directed behaviour.