You are viewing a free preview of this lesson.
Subscribe to unlock all 4 lessons in this course and every other course on LearningBro.
Essay questions carry the most marks on AQA A-Level Psychology papers: a single 16-mark essay is worth one-sixth of an entire paper, and there are several across the three exams. More importantly, essays are where the gap between grades is decided, because they reward a skill — sustained, evidenced evaluation — that cannot be bluffed. A student can scrape AO1 marks by reciting facts, but the leap from a Level 2 to a Level 4 essay is almost entirely a leap in the quality of evaluation and the coherence of the argument. This lesson teaches that skill explicitly: how to ration AO1 against AO3, how to build an evaluation paragraph that actually develops rather than merely asserts, how to use issues and debates as ready-made AO3, how to apply theory to a scenario for AO2, and — through banded worked extracts — exactly what the difference between a mid-band and a top-band paragraph looks like on the page.
This is a synthesis lesson serving the extended-response mark schemes that sit behind every essay across the qualification.
| Technique taught | Assessment objective served | Where it appears |
|---|---|---|
| AO1/AO3 balance and the 6-mark cap | AO1 + AO3 | Every 12- and 16-mark essay (Papers 1, 2, 3) |
| PEEL / PETAL evaluation structure | AO3 | Every essay; 6-mark "outline and evaluate" items |
| Using issues and debates as evaluation | AO3 | Especially Paper 3 Section A, but valid everywhere |
| Signposting and conclusions | AO3 (argument coherence) | Level 4 descriptor on all essays |
| Application to a scenario | AO2 | Application items on all papers; scenario essays on Paper 3 |
The numbers to hold in mind: a 12-mark essay is 6 AO1 + 6 AO3; a 16-mark essay is 6 AO1 + 10 AO3 (or 6 AO1 + 4 AO2 + 6 AO3 when a scenario is attached). The asymmetry of the 16-marker — ten evaluation marks against six description marks — is the single most important strategic fact in this lesson.
| Type | Marks | Typical command | AO1 | AO2 | AO3 |
|---|---|---|---|---|---|
| 6-mark mini-essay | 6 | "Outline and evaluate..." | 3 | – | 3 |
| 12-mark essay | 12 | "Outline and evaluate..." / "Discuss..." | 6 | – | 6 |
| 16-mark essay | 16 | "Discuss..." | 6 | – | 10 |
| 16-mark scenario essay (Paper 3) | 16 | "Discuss... Refer to the item..." | 6 | 4 | 6 |
Always read the stem to check whether stimulus material is present, because it changes your target balance.
Both 12- and 16-mark essays award a maximum of 6 marks for AO1. This single fact reshapes how you should write:
Common Mistake: Writing three paragraphs of description and one of evaluation. This produces an essay that maxes AO1 (6 marks) but strands AO3 in Level 2 — and since AO3 is worth up to 10 of the 16 marks, the overall grade is capped low. The AO1 you write beyond the sixth mark earns literally nothing.
Evaluation is where most marks live (10 of 16). The reliable way to structure each evaluation point is PEEL, or its extended cousin PETAL.
| Step | Meaning | Example |
|---|---|---|
| P — Point | A clear evaluative claim | "A strength of the cognitive approach is that it has practical applications." |
| E — Evidence | Support with research/example | "Beck's cognitive triad and Ellis's ABC model led directly to CBT." |
| E — Explain | Why this supports/undermines the theory | "Because CBT is an effective, recommended treatment for depression, the cognitive emphasis on faulty thinking has demonstrable real-world value." |
| L — Link | Tie back to the question / mini-conclusion | "Therefore the approach can be judged useful, since it generates evidence-based interventions." |
PETAL inserts a T = This/However (counter-argument) step before the Link, which is what genuinely lifts an essay into Level 4:
graph LR
P[Point<br/>evaluative claim] --> E1[Evidence<br/>study/example]
E1 --> E2[Explain<br/>why it matters]
E2 --> T[This/However<br/>counter-argument]
T --> L[Link<br/>balanced mini-conclusion]
The "However..." move signals to the examiner that you are weighing evidence rather than listing it — the hallmark of effective evaluation in the level descriptors.
Point: A limitation of the behaviourist approach is that it is environmentally deterministic.
Evidence: The approach holds that all behaviour is shaped by classical and operant conditioning; Skinner argued free will is an illusion and behaviour is entirely a product of reinforcement contingencies.
Explain: This is problematic because it ignores the documented role of cognition. Bandura's work on self-efficacy shows that beliefs about one's own ability influence persistence — something stimulus-response associations alone cannot explain.
Link: This suggests the behaviourist account is incomplete, since it neglects the cognitive and personal factors that other approaches show to be influential.
| Essay type | AO3 marks | Recommended PEEL paragraphs |
|---|---|---|
| 12-mark | 6 | 3 well-developed |
| 16-mark | 10 | 4-5 well-developed |
It is far better to have fewer, deeper points than many shallow ones. The "shopping list" of six one-line criticisms is a Level 2 staple precisely because none of the points is developed or evidenced.
Variety signals breadth. Aim to draw on several of the following in any essay.
Name the researcher and date, summarise the key finding, explain how it supports the theory, and (for extra credit) comment on the methodology of that supporting study.
Name a study that challenges the theory, explain how its findings undermine it, then consider whether the contradicting study itself has weaknesses.
| Issue | Description | Example |
|---|---|---|
| Lab experiment | High control, low ecological validity | Milgram's setting was artificial; behaviour may differ in real life |
| Field experiment | More ecological validity, less control | Hofling et al. (1966) nurse study — realistic but raises ethics |
| Sample bias | Unrepresentative sample limits generalisability | Milgram's sample was all-male, all-American |
| Demand characteristics | Participants guess the aim and change behaviour | Orne (1962) on the "good participant" effect |
| Social desirability | Participants give acceptable answers | Self-report of prejudice may understate it |
| Ethical issues | Harm, deception, lack of consent | Distress in Milgram's and Zimbardo's studies |
Theories that generate effective treatments (CBT, systematic desensitisation, drug therapy) gain credibility from their utility. A theory with no application can be framed as limited.
| Debate | How to deploy it |
|---|---|
| Nature vs Nurture | Does the theory over-weight biology or environment? Is the balance defensible? |
| Reductionism vs Holism | Does it oversimplify complex behaviour into single components? |
| Determinism vs Free Will | Does it deny personal agency (biological/environmental/psychic determinism)? |
| Cultural Bias | Were key studies Western-only? Is generalisation safe (ethnocentrism)? |
| Gender Bias | One-gender samples; assuming male behaviour is the norm (androcentrism); alpha/beta bias |
| Idiographic vs Nomothetic | General laws vs individual experience |
| Ethical implications | Could the theory be used to justify control or discrimination? |
Exam Tip: Issues and debates are valid AO3 on any essay, not just Paper 3. A debate point, fully developed, is one of the most reliable ways to add a Level-4-quality paragraph because it demonstrates the "wider issues" engagement the top band rewards.
Two features distinguish a coherent Level 4 essay from a competent Level 3 one.
Signposting uses explicit discourse markers so the examiner never has to hunt for your point: begin evaluation paragraphs with "One strength of...", "A limitation of...", "However, this can be challenged because...", "In contrast...". These flags do free work — they make the structure of your argument visible.
Conclusions are expected for the top band. A conclusion is not a summary ("In conclusion, there are strengths and weaknesses"); it is a judgement that weighs the evidence: "Overall, although the model offers a useful first framework, the weight of contradictory evidence — particularly case studies showing dissociable memory stores — suggests it is too simplistic to stand as a complete account." The conclusion is where you convert a pile of points into a line of argument.
The single subtlest reason essays stall in the middle band is that students believe they are evaluating when they are in fact still describing. Compare:
| Disguised description (low AO3) | Genuine evaluation (high AO3) |
|---|---|
| "Another study is Loftus and Palmer, who changed the verb in a question and found different speed estimates." | "Loftus and Palmer's finding supports the reconstructive view of memory because it shows post-event information altering the memory itself, which a passive-storage model cannot explain." |
| "Bowlby also talked about the critical period." | "The notion of a fixed critical period is undermined by Rutter's Romanian orphan data, which show recovery beyond it, implying a sensitive period instead." |
The diagnostic test is simple: an evaluative sentence contains a judgement word ("supports", "undermines", "challenges", "is a strength because", "is limited by") and a reason ("because...", "which means...", "implying..."). If a sentence merely reports what a researcher did or claimed, it is AO1 no matter where it sits in the essay. Train yourself to scan a draft and underline every judgement-plus-reason; their density is your real AO3 score.
The level descriptors reward evaluation that is effective and thorough, not evaluation that is long. An effective point does three things the disguised version omits: it states the direction of the judgement, it explains the mechanism by which the evidence bears on the theory, and it (ideally) acknowledges a counter. This is why the banded extracts later in this lesson differ chiefly in reasoning depth, not length — and why padding an essay with extra studies, each merely named, does nothing to raise the band.
Examiners can tell a planned essay from an unplanned one within a paragraph, because the planned essay does not repeat itself, does not run out of evaluation, and reaches a conclusion. Yet a plan need cost only two minutes. The efficient method is a margin skeleton: down the side of the page, jot one line per paragraph —
AO1: components + key claim · S1: dual-task · S2: KF case (+caution) · L1: central executive vague · D: limited scope/reductionism · CONC: advance but partial
Six abbreviations, two minutes, and the essay's balance is locked in before a sentence is written. The skeleton also acts as a time governor: if you have planned five evaluation slots for a 20-minute essay, you know each gets roughly three minutes, and you can pace accordingly. Crucially, planning prevents the most damaging essay profile examiners report — a luxurious description followed by two thin, hurried criticisms once the candidate notices the clock.
Application questions require you to use knowledge to explain a described situation. They appear on all three papers.
Scenario: "Tom tries to read a book while his flatmate talks to him about work. He finds it difficult to follow both."
A weak answer simply lists the WMM components. A strong answer ties them to Tom: both reading (subvocal rehearsal via the articulatory process) and listening to speech draw on the phonological loop; because both tasks compete for the same limited-capacity component, performance suffers — whereas a visual task plus listening would load different components (visuo-spatial sketchpad versus phonological loop) and interfere less.
Subscribe to continue reading
Get full access to this lesson and all 4 lessons in this course.