Learning & Cognitive Models of Addiction

Where the biological model explains addiction through brain mechanisms and genes, learning and cognitive models explain it through experience — how addictive behaviour is acquired and maintained by conditioning and social learning, and how expectancies, beliefs and cognitive biases sustain it. The learning approach treats addiction as a set of learned associations and reinforced habits; the cognitive approach treats it as the product of the addict's thinking about substances and about themselves. These psychological models are essential complements to biology: they explain how the same neural reward described biologically becomes attached to particular cues and situations, why relapse is so cue-driven, why purely behavioural and cognitive-behavioural treatments work, and how behavioural addictions such as gambling develop without any ingested substance. This lesson applies the models to the AQA examples of nicotine and gambling, and is written in the standard clinical, objective register of A-Level teaching.

Key Definition: Cue reactivity is the phenomenon whereby environmental cues repeatedly associated with substance use (through classical conditioning) come to trigger craving and physiological responses even when the substance itself is absent.

Spec Mapping

This lesson addresses the following points from the AQA A-Level Psychology (7182) specification, Paper 3, Section D — Addiction:

Learning theory as applied to nicotine addiction and gambling addiction.
Cognitive theory as applied to gambling addiction.

It develops classical conditioning (cue reactivity), operant conditioning (positive and negative reinforcement; partial reinforcement), social learning theory, and the cognitive approach (expectancy theory; cognitive distortions and biases applied especially to gambling), preparing you to describe (AO1) and evaluate (AO3) the learning and cognitive models. Questions are usually split AO1/AO3 only, with no AO2 unless a scenario stem is provided — this lesson flags that distinction. The content complements the biological model and underpins the behavioural and cognitive-behavioural treatments examined in the "reducing addiction" lesson.

Classical Conditioning and Cue Reactivity

The learning model's first mechanism is classical (Pavlovian) conditioning. Through repeated pairing with the substance, previously neutral stimuli in the environment become conditioned stimuli that, by themselves, trigger craving and physiological arousal:

Classical conditioning term	Application to addiction
Unconditioned stimulus (UCS)	The drug itself (e.g., nicotine)
Unconditioned response (UCR)	The substance's effects (dopamine release, relaxation/arousal)
Neutral stimulus (NS) → conditioned stimulus (CS)	Cues regularly present at use: a pub, the smell of smoke, a lighter, a betting shop, specific people, times of day, emotional states
Conditioned response (CR)	Craving and physiological arousal (raised heart rate, changes in skin conductance) triggered by the cue alone

For example, a smoker who always smokes with a morning coffee comes to find that the smell and taste of coffee alone triggers a strong craving, with no nicotine present. The same applies to gambling: the sights and sounds of a betting environment, or even a payday, can become conditioned cues that elicit the urge to gamble. This is cue reactivity, and it is the learning model's most powerful explanation of relapse — long after withdrawal has passed, everyday cues can re-trigger craving.

Evidence — Carter and Tiffany (1999)

Carter and Tiffany (1999) conducted a meta-analysis of cue-reactivity studies. Their findings were that substance-related cues reliably elicited self-reported craving and produced measurable physiological responses (e.g., heart-rate and skin-conductance changes), and that this pattern held across different substances (alcohol, nicotine, opioids, cocaine). The conclusion is that conditioned cues genuinely acquire the power to trigger craving and arousal, supporting the classical-conditioning account and explaining why relapse is so often cue-driven. This evidence also directly justifies cue-exposure treatment, which aims to extinguish the conditioned response by repeatedly presenting cues without the substance.

Operant Conditioning

The learning model's second mechanism is operant conditioning — addiction as behaviour strengthened by its consequences.

graph LR
    A[Early use] --> B[Substance produces pleasure / arousal]
    B --> C[Positive reinforcement: behaviour strengthened]
    A2[Dependence develops] --> D[Substance relieves withdrawal / distress]
    D --> E[Negative reinforcement: behaviour strengthened]
    F[Gambling] --> G[Unpredictable wins]
    G --> H[Partial reinforcement: highly persistent behaviour]

Positive reinforcement. In the early stages, the substance's pleasurable effects (the nicotine "lift", the excitement of a win) act as positive reinforcers, making use more likely to recur.

Negative reinforcement. As dependence develops, the individual increasingly uses the substance to escape or avoid an aversive state — withdrawal symptoms, stress, or low mood. Because the behaviour removes something unpleasant, it is strengthened through negative reinforcement. This is the behavioural counterpart of the biological A-/B-process shift: use is now driven by relief rather than pleasure.

Partial (intermittent) reinforcement and gambling. Gambling is the textbook case of partial reinforcement: wins arrive unpredictably, on a variable schedule, which produces behaviour that is exceptionally persistent and resistant to extinction — the gambler keeps playing through long losing runs because, historically, reward has come eventually and unpredictably. This is why gambling, in operant terms, is so hard to stop. The contrast with continuous reinforcement is instructive: a behaviour rewarded every single time extinguishes quickly once the reward stops (the absence of reward is immediately obvious), whereas a behaviour rewarded only intermittently is highly resistant to extinction, because a run without reward is indistinguishable from the normal pattern of play. Gambling machines and games are, in effect, built around the most powerful reinforcement schedule known to learning theory, which helps explain why problem gambling can persist despite very heavy and obvious losses.

A further operant point is that gambling delivers not only the financial win but immediate secondary reinforcers — flashing lights, celebratory sounds, the announcement of a win — that arrive instantly and reliably, even when the monetary outcome is poor. These immediate reinforcers can strengthen continued play in their own right, and the contrast between immediate reward signals and delayed financial loss further entrenches the behaviour, since immediate consequences shape behaviour more powerfully than delayed ones.

Type of reinforcement	Application	Stage
Positive reinforcement	Smoking produces a pleasurable lift; a win produces excitement	Early use / initiation
Negative reinforcement	Smoking to relieve craving/irritability; gambling to escape low mood	Maintenance / dependence
Partial (intermittent) reinforcement	Unpredictable gambling wins create extinction-resistant behaviour	Gambling addiction

Key Definition: Negative reinforcement strengthens a behaviour by removing or reducing an unpleasant stimulus. In addiction it typically involves using a substance, or gambling, to relieve withdrawal, stress or distress.

Social Learning Theory (SLT)

Social learning theory (Bandura) explains how addiction can be initiated through observation, imitation and vicarious reinforcement:

Children and adolescents observe parents, peers and media figures using substances or gambling.
If the model appears to gain positive consequences (enjoyment, social acceptance, stress relief, winning), the observer is vicariously reinforced and more likely to imitate.
Models who are attractive, similar or high-status (admired peers, celebrities) are especially influential, because observers identify with them and expect similar rewards.
The behaviour is more likely to be imitated when the observer expects to be reinforced for it and believes they can perform it (the role of outcome expectancies and self-efficacy in Bandura's account), which links social learning to the cognitive model below.
Peer influence peaks in adolescence, when the need for social acceptance is strong and susceptibility to modelling is heightened.

Application to the AQA Examples

Nicotine. Adolescents whose parents or close friends smoke are substantially more likely to begin smoking; media portrayals of smoking provide further modelling, and peer norms shape both initiation and continuation.
Gambling. Gambling advertising, the normalisation of betting in sport and media, and observing others' (publicised) wins while their losses remain hidden all model gambling as rewarding and low-risk, encouraging uptake — a particularly clear case of how the presentation of a behaviour shapes social learning.

Putting the Learning Model Together: Nicotine

The AQA specification asks specifically for learning theory applied to nicotine, and the three mechanisms combine neatly to give a full account of a smoking career. Social learning typically explains initiation: an adolescent observes admired peers, family members or media figures smoking, sees them apparently rewarded (looking relaxed, sociable or grown-up), and imitates through vicarious reinforcement. Operant conditioning then explains maintenance: early cigarettes are positively reinforced by nicotine's pleasant effects, but as dependence develops, smoking is increasingly negatively reinforced by the relief of craving and irritability between cigarettes. Classical conditioning explains the cue-bound, relapse-prone character of the habit: the situations in which smoking regularly occurs (coffee, breaks, alcohol, stress) become conditioned cues that trigger craving, so that even an ex-smoker can be tipped into relapse by an everyday cue years later. The strength of the learning model is precisely this ability to assemble initiation, maintenance and relapse from ordinary conditioning principles.

Cognitive Models of Addiction

The cognitive approach shifts the focus from external contingencies to the addict's thinking. For the AQA specification, cognitive theory is applied especially to gambling.

Expectancy Theory

Expectancy theory holds that expectations about the effects of a substance or behaviour strongly influence use. Expectancies are learned through personal experience and social learning, and they can be positive or negative:

Positive outcome expectancies: "Alcohol makes me more confident", "Smoking helps me relax", "Gambling is exciting and I could win big".
Negative outcome expectancies: "Drinking makes me ill", "Smoking causes cancer", "Gambling just loses money".

Individuals with stronger positive expectancies are more likely to use and to develop dependence, and expectancies predict use even in young people who have not yet tried the substance — evidence that they are absorbed from the social environment before any direct experience. Expectancies and behaviour are also reciprocal: positive expectancies encourage use, and the experience of use (especially when it relieves stress or boredom) strengthens the expectancies, helping to entrench the addiction. For gambling, the powerful positive expectancy — "I could win big" — is sustained partly because the rare, vivid experience of winning outweighs, in memory, the far more frequent losses, linking expectancy theory to the recall bias described below.

Cognitive Distortions in Gambling

Cognitive theory is especially powerful for gambling, where characteristic cognitive distortions sustain the behaviour despite consistent losses:

The gambler's fallacy — the false belief that a run of losses makes a win "due" (e.g., "red must come up soon"), although successive outcomes are independent.
The illusion of control — overestimating one's influence over chance outcomes (e.g., throwing dice harder for a high number, choosing "lucky" numbers).
Near-miss interpretation — treating an outcome that fell just short as "almost winning", which sustains motivation even though a near miss is simply a loss.
Recall bias — vividly remembering wins while discounting or forgetting losses, which distorts the perceived odds of winning.

These distortions explain how gamblers maintain a belief that they can win despite mounting evidence to the contrary, and they are a direct target of cognitive-behavioural treatment. They also dovetail with the biological account: the near miss, treated cognitively as "almost winning", is the same event that engages the dopamine reward system almost as strongly as a real win — so a single feature of gambling products is simultaneously a cognitive distortion and a biological reward trigger, illustrating how the levels of explanation interlock rather than compete. A further distortion, the illusion of control, is reinforced by the design of many games (choosing numbers, pressing "stop" on a reel), which give the player an impression of skill where none exists.

Self-Schema and Maladaptive Beliefs

A complementary cognitive account describes layered maladaptive beliefs that maintain addiction: negative core beliefs about the self ("I can't cope"), facilitating (permissive) beliefs that grant permission to use ("just one won't hurt", "I deserve a treat"), and automatic thoughts triggered by cues ("I need one now"). Together these maintain a self-perpetuating cycle, in which distress prompts use, use brings short-term relief, and the subsequent guilt reinforces the negative core beliefs.

graph TD
    A[Trigger / Cue or Stress] --> B[Automatic Thoughts]
    B --> C[Facilitating Beliefs: 'just one won't hurt']
    C --> D[Craving Intensifies]
    D --> E[Substance Use / Gambling]
    E --> F[Short-term Relief]
    F --> G[Guilt and Reinforcement of Negative Core Beliefs]
    G --> A

Cognitive Biases in Addiction

Beyond gambling-specific distortions, addiction more generally is associated with measurable cognitive biases:

Attentional bias — addicted individuals automatically prioritise substance-related stimuli. A smoker trying to quit notices cigarettes and people smoking far more than a non-smoker. This is often demonstrated using a modified Stroop task, in which addicted participants are slower to name the colour of substance-related words (because the word's meaning captures attention), indexing the bias.
Memory (recall) bias — selectively recalling the positive effects of use while minimising the negative, which sustains motivation to continue.

These biases are not merely symptoms; they help maintain addiction in a self-reinforcing loop. Attentional bias means the addicted individual repeatedly notices substance- or gambling-related cues, which (through cue reactivity) trigger craving, which in turn sharpens attention to those cues still further — so the bias and the craving feed one another. Recall bias keeps the perceived "balance sheet" of use favourable by foregrounding the rewards and discounting the harms, sustaining the positive expectancies that drive continued use. Because these are ordinary cognitive processes operating in a distorted way, they can in principle be retrained, which is the rationale behind attentional-bias-modification techniques and the cognitive-restructuring components of CBT.

Learning & Cognitive Models of Addiction

Learning & Cognitive Models of Addiction

Spec Mapping

Classical Conditioning and Cue Reactivity

Evidence — Carter and Tiffany (1999)

Operant Conditioning

Social Learning Theory (SLT)

Application to the AQA Examples

Putting the Learning Model Together: Nicotine

Cognitive Models of Addiction

Expectancy Theory

Cognitive Distortions in Gambling

Self-Schema and Maladaptive Beliefs

Cognitive Biases in Addiction

Synoptic Links

More in Psychology