You are viewing a free preview of this lesson.
Subscribe to unlock all 10 lessons in this course and every other course on LearningBro.
Where the biological model explains addiction through brain mechanisms and genes, learning and cognitive models explain it through experience — how addictive behaviour is acquired and maintained by conditioning and social learning, and how expectancies, beliefs and cognitive biases sustain it. The learning approach treats addiction as a set of learned associations and reinforced habits; the cognitive approach treats it as the product of the addict's thinking about substances and about themselves. These psychological models are essential complements to biology: they explain how the same neural reward described biologically becomes attached to particular cues and situations, why relapse is so cue-driven, why purely behavioural and cognitive-behavioural treatments work, and how behavioural addictions such as gambling develop without any ingested substance. This lesson applies the models to the AQA examples of nicotine and gambling, and is written in the standard clinical, objective register of A-Level teaching.
Key Definition: Cue reactivity is the phenomenon whereby environmental cues repeatedly associated with substance use (through classical conditioning) come to trigger craving and physiological responses even when the substance itself is absent.
This lesson addresses the following points from the AQA A-Level Psychology (7182) specification, Paper 3, Section D — Addiction:
It develops classical conditioning (cue reactivity), operant conditioning (positive and negative reinforcement; partial reinforcement), social learning theory, and the cognitive approach (expectancy theory; cognitive distortions and biases applied especially to gambling), preparing you to describe (AO1) and evaluate (AO3) the learning and cognitive models. Questions are usually split AO1/AO3 only, with no AO2 unless a scenario stem is provided — this lesson flags that distinction. The content complements the biological model and underpins the behavioural and cognitive-behavioural treatments examined in the "reducing addiction" lesson.
The learning model's first mechanism is classical (Pavlovian) conditioning. Through repeated pairing with the substance, previously neutral stimuli in the environment become conditioned stimuli that, by themselves, trigger craving and physiological arousal:
| Classical conditioning term | Application to addiction |
|---|---|
| Unconditioned stimulus (UCS) | The drug itself (e.g., nicotine) |
| Unconditioned response (UCR) | The substance's effects (dopamine release, relaxation/arousal) |
| Neutral stimulus (NS) → conditioned stimulus (CS) | Cues regularly present at use: a pub, the smell of smoke, a lighter, a betting shop, specific people, times of day, emotional states |
| Conditioned response (CR) | Craving and physiological arousal (raised heart rate, changes in skin conductance) triggered by the cue alone |
For example, a smoker who always smokes with a morning coffee comes to find that the smell and taste of coffee alone triggers a strong craving, with no nicotine present. The same applies to gambling: the sights and sounds of a betting environment, or even a payday, can become conditioned cues that elicit the urge to gamble. This is cue reactivity, and it is the learning model's most powerful explanation of relapse — long after withdrawal has passed, everyday cues can re-trigger craving.
Carter and Tiffany (1999) conducted a meta-analysis of cue-reactivity studies. Their findings were that substance-related cues reliably elicited self-reported craving and produced measurable physiological responses (e.g., heart-rate and skin-conductance changes), and that this pattern held across different substances (alcohol, nicotine, opioids, cocaine). The conclusion is that conditioned cues genuinely acquire the power to trigger craving and arousal, supporting the classical-conditioning account and explaining why relapse is so often cue-driven. This evidence also directly justifies cue-exposure treatment, which aims to extinguish the conditioned response by repeatedly presenting cues without the substance.
The learning model's second mechanism is operant conditioning — addiction as behaviour strengthened by its consequences.
graph LR
A[Early use] --> B[Substance produces pleasure / arousal]
B --> C[Positive reinforcement: behaviour strengthened]
A2[Dependence develops] --> D[Substance relieves withdrawal / distress]
D --> E[Negative reinforcement: behaviour strengthened]
F[Gambling] --> G[Unpredictable wins]
G --> H[Partial reinforcement: highly persistent behaviour]
Positive reinforcement. In the early stages, the substance's pleasurable effects (the nicotine "lift", the excitement of a win) act as positive reinforcers, making use more likely to recur.
Negative reinforcement. As dependence develops, the individual increasingly uses the substance to escape or avoid an aversive state — withdrawal symptoms, stress, or low mood. Because the behaviour removes something unpleasant, it is strengthened through negative reinforcement. This is the behavioural counterpart of the biological A-/B-process shift: use is now driven by relief rather than pleasure.
Partial (intermittent) reinforcement and gambling. Gambling is the textbook case of partial reinforcement: wins arrive unpredictably, on a variable schedule, which produces behaviour that is exceptionally persistent and resistant to extinction — the gambler keeps playing through long losing runs because, historically, reward has come eventually and unpredictably. This is why gambling, in operant terms, is so hard to stop. The contrast with continuous reinforcement is instructive: a behaviour rewarded every single time extinguishes quickly once the reward stops (the absence of reward is immediately obvious), whereas a behaviour rewarded only intermittently is highly resistant to extinction, because a run without reward is indistinguishable from the normal pattern of play. Gambling machines and games are, in effect, built around the most powerful reinforcement schedule known to learning theory, which helps explain why problem gambling can persist despite very heavy and obvious losses.
A further operant point is that gambling delivers not only the financial win but immediate secondary reinforcers — flashing lights, celebratory sounds, the announcement of a win — that arrive instantly and reliably, even when the monetary outcome is poor. These immediate reinforcers can strengthen continued play in their own right, and the contrast between immediate reward signals and delayed financial loss further entrenches the behaviour, since immediate consequences shape behaviour more powerfully than delayed ones.
| Type of reinforcement | Application | Stage |
|---|---|---|
| Positive reinforcement | Smoking produces a pleasurable lift; a win produces excitement | Early use / initiation |
| Negative reinforcement | Smoking to relieve craving/irritability; gambling to escape low mood | Maintenance / dependence |
| Partial (intermittent) reinforcement | Unpredictable gambling wins create extinction-resistant behaviour | Gambling addiction |
Key Definition: Negative reinforcement strengthens a behaviour by removing or reducing an unpleasant stimulus. In addiction it typically involves using a substance, or gambling, to relieve withdrawal, stress or distress.
Social learning theory (Bandura) explains how addiction can be initiated through observation, imitation and vicarious reinforcement:
The AQA specification asks specifically for learning theory applied to nicotine, and the three mechanisms combine neatly to give a full account of a smoking career. Social learning typically explains initiation: an adolescent observes admired peers, family members or media figures smoking, sees them apparently rewarded (looking relaxed, sociable or grown-up), and imitates through vicarious reinforcement. Operant conditioning then explains maintenance: early cigarettes are positively reinforced by nicotine's pleasant effects, but as dependence develops, smoking is increasingly negatively reinforced by the relief of craving and irritability between cigarettes. Classical conditioning explains the cue-bound, relapse-prone character of the habit: the situations in which smoking regularly occurs (coffee, breaks, alcohol, stress) become conditioned cues that trigger craving, so that even an ex-smoker can be tipped into relapse by an everyday cue years later. The strength of the learning model is precisely this ability to assemble initiation, maintenance and relapse from ordinary conditioning principles.
The cognitive approach shifts the focus from external contingencies to the addict's thinking. For the AQA specification, cognitive theory is applied especially to gambling.
Expectancy theory holds that expectations about the effects of a substance or behaviour strongly influence use. Expectancies are learned through personal experience and social learning, and they can be positive or negative:
Individuals with stronger positive expectancies are more likely to use and to develop dependence, and expectancies predict use even in young people who have not yet tried the substance — evidence that they are absorbed from the social environment before any direct experience. Expectancies and behaviour are also reciprocal: positive expectancies encourage use, and the experience of use (especially when it relieves stress or boredom) strengthens the expectancies, helping to entrench the addiction. For gambling, the powerful positive expectancy — "I could win big" — is sustained partly because the rare, vivid experience of winning outweighs, in memory, the far more frequent losses, linking expectancy theory to the recall bias described below.
Cognitive theory is especially powerful for gambling, where characteristic cognitive distortions sustain the behaviour despite consistent losses:
These distortions explain how gamblers maintain a belief that they can win despite mounting evidence to the contrary, and they are a direct target of cognitive-behavioural treatment. They also dovetail with the biological account: the near miss, treated cognitively as "almost winning", is the same event that engages the dopamine reward system almost as strongly as a real win — so a single feature of gambling products is simultaneously a cognitive distortion and a biological reward trigger, illustrating how the levels of explanation interlock rather than compete. A further distortion, the illusion of control, is reinforced by the design of many games (choosing numbers, pressing "stop" on a reel), which give the player an impression of skill where none exists.
A complementary cognitive account describes layered maladaptive beliefs that maintain addiction: negative core beliefs about the self ("I can't cope"), facilitating (permissive) beliefs that grant permission to use ("just one won't hurt", "I deserve a treat"), and automatic thoughts triggered by cues ("I need one now"). Together these maintain a self-perpetuating cycle, in which distress prompts use, use brings short-term relief, and the subsequent guilt reinforces the negative core beliefs.
graph TD
A[Trigger / Cue or Stress] --> B[Automatic Thoughts]
B --> C[Facilitating Beliefs: 'just one won't hurt']
C --> D[Craving Intensifies]
D --> E[Substance Use / Gambling]
E --> F[Short-term Relief]
F --> G[Guilt and Reinforcement of Negative Core Beliefs]
G --> A
Beyond gambling-specific distortions, addiction more generally is associated with measurable cognitive biases:
These biases are not merely symptoms; they help maintain addiction in a self-reinforcing loop. Attentional bias means the addicted individual repeatedly notices substance- or gambling-related cues, which (through cue reactivity) trigger craving, which in turn sharpens attention to those cues still further — so the bias and the craving feed one another. Recall bias keeps the perceived "balance sheet" of use favourable by foregrounding the rewards and discounting the harms, sustaining the positive expectancies that drive continued use. Because these are ordinary cognitive processes operating in a distorted way, they can in principle be retrained, which is the rationale behind attentional-bias-modification techniques and the cognitive-restructuring components of CBT.
Subscribe to continue reading
Get full access to this lesson and all 10 lessons in this course.