You are viewing a free preview of this lesson.
Subscribe to unlock all 10 lessons in this course and every other course on LearningBro.
The International Phonetic Alphabet (IPA) is the standard system used by linguists worldwide to represent the sounds of spoken language. Developed by the International Phonetic Association (founded in Paris in 1886), the IPA provides a one-to-one correspondence between symbols and sounds: each symbol represents exactly one distinct speech sound, and each sound is represented by exactly one symbol. For AQA A-Level English Language (7702), a confident command of the IPA is not an optional extra — it is the foundational tool that lets you write about pronunciation with the precision examiners reward. Phonetics and phonology are not a free-standing topic on the specification; they are a method of language analysis integrated into every component — Paper 1 (Language, the individual and society), Paper 2 (Language diversity and change) and the Non-Examined Assessment (NEA). Whenever you analyse spoken data, accent, regional variation, child language or the representation of speech in a text, the IPA is the apparatus that turns a vague impression into an evidenced observation. This lesson builds that apparatus from the ground up.
It is worth being clear from the outset about why you are learning this. On AQA 7702, phonological knowledge primarily serves AO1 — the assessment objective that rewards the systematic application of appropriate methods and terminology to the analysis of language. A candidate who writes "she drops her aitches" is making a lay observation; a candidate who writes "the speaker deletes word-initial /h/ in lexical words, a variable feature associated with non-standard accents" is demonstrating AO1 through accurate metalanguage and the IPA. Phonological analysis also feeds AO3 — the contextual factors that shape language production and reception — because accent is a powerful index of region, social class, age and identity, and listeners' attitudes to accent are themselves a contextual phenomenon. The full set of assessment objectives for the A-Level is AO1 26 · AO2 26 · AO3 23 · AO4 15 · AO5 10, and phonetics threads chiefly through AO1 and AO3.
Because the topic is integrated rather than self-contained, you will rarely be asked to "transcribe this passage" in isolation. Instead you will be given spoken data — a transcript, perhaps with prosodic notation — and asked to analyse it. The IPA is the tool that lets you pinpoint exactly which sounds a speaker uses, compare them with a reference accent, and build an argument about variation and its social meaning.
English spelling (its orthography) is notoriously inconsistent. The same letter or combination of letters can represent different sounds in different words, and the same sound can be spelled in many different ways. This mismatch is a historical accident: spelling was largely fixed in the late-medieval and early-modern period, while pronunciation has gone on changing, so the two have drifted apart.
| Spelling Issue | Examples |
|---|---|
| One letter, multiple sounds | The letter "c" represents /k/ in "cat" but /s/ in "city" |
| One sound, multiple spellings | The sound /iː/ is spelled "ee" in "feet," "ea" in "beat," "ie" in "piece," "ey" in "key," and "e" in "me" |
| Silent letters | The "k" in "knee," the "b" in "lamb," the "w" in "write" |
| Same spelling, different sounds | "ough" is pronounced differently in "though," "through," "thought," "tough," "cough," and "bough" |
| Letters that do not match sounds at all | "one" begins with a /w/ sound; "colonel" contains an /r/ sound spelled "l" |
This inconsistency means ordinary English spelling is an unreliable guide to pronunciation. The IPA solves the problem by providing a phonetically transparent writing system — what you see is exactly what you say. Crucially, the IPA also lets you write about a sound that has no straightforward letter (the final consonant of "sing", for instance, which is a single sound /ŋ/ but spelled with two letters), and to distinguish sounds that English spelling conflates (the two different "th" sounds in "think" and "this").
Key Definition: International Phonetic Alphabet (IPA) — a standardised system of phonetic notation in which each symbol represents one and only one speech sound, enabling precise and unambiguous transcription of any language.
A note on the reference accent: throughout this course, unless stated otherwise, the model accent is Received Pronunciation (RP), also increasingly called Standard Southern British English (SSBE). This is the accent described in the major English pronunciation dictionaries and the conventional baseline for describing other accents. It is chosen as a reference because it is well documented, not because it is "correct" — linguistically, no accent is superior to any other.
Before learning the symbols, it is essential to understand the distinction between phonemes and allophones. This is arguably the single most important concept in the whole topic, and candidates who grasp it can write with real authority.
A phoneme is the smallest unit of sound in a language that can distinguish one word from another. Phonemes are abstract, mental categories — the sounds that speakers of a language store as meaningfully "different". In English, /p/ and /b/ are separate phonemes because swapping one for the other changes a word's meaning: "pat" /pæt/ versus "bat" /bæt/. The standard test for phonemic status is the minimal pair: two words that differ in only one sound in the same position, where that one difference produces a difference in meaning. "Pin" /pɪn/ and "bin" /bɪn/ form a minimal pair, proving /p/ and /b/ are distinct phonemes of English.
An allophone is a specific physical realisation of a phoneme. Allophones are the actual sounds produced in speech, and a single phoneme may have several allophones depending on the phonetic environment. The choice between them is predictable from context and never changes meaning. For example:
Key Definition: Phoneme — an abstract, contrastive unit of sound that can distinguish meaning; written between forward slashes, /p/, /b/, /æ/. Allophone — a concrete, predictable realisation of a phoneme in a particular environment; written in square brackets, [pʰ], [p], [ɫ].
The crucial point is that allophones of the same phoneme never create a difference in meaning. English speakers do not perceive aspirated [pʰ] and unaspirated [p] as different sounds; they are simply context-dependent versions of "the same sound" /p/. Two further technical terms are useful here. Allophones are typically in complementary distribution — each occurs in environments where the other cannot, so they never compete in the same slot. When two sounds can appear in the same slot without changing meaning (a speaker might end "cat" with either [t] or [ʔ]), they are said to be in free variation.
The IPA can be used at two levels of detail, and knowing which to deploy is itself an analytical skill.
| Type | Notation | Detail Level | Example ("pin") | Purpose |
|---|---|---|---|---|
| Broad (phonemic) | Forward slashes / / | Records only the distinctive phonemes | /pɪn/ | Showing which phonemes are used; sufficient for most A-Level analysis |
| Narrow (phonetic) | Square brackets [ ] | Records fine phonetic detail including allophones | [pʰɪn] | Showing exactly how sounds are produced; useful for detailed accent work |
For AQA A-Level, broad transcription is the standard expectation, and it is more than adequate for most questions. The slashes signal that you are recording the phoneme inventory — the meaningful sound categories — of an utterance. However, when you want to capture a specific accent feature such as aspiration, glottalisation, l-vocalisation or a dark /l/, you switch to narrow transcription in square brackets and add diacritics (small modifying marks) where needed: aspiration [ʰ], nasalisation, dentalisation, and so on. Mixing the conventions up — putting allophonic detail inside slashes, or treating square brackets as merely decorative — signals to an examiner that the underlying concept has not landed.
Key Definition: Broad transcription — phonemic transcription using / / that records only the contrastive sound units of a language. Narrow transcription — phonetic transcription using [ ] that captures fine-grained articulatory detail, including allophonic variation, often with diacritics.
English has 24 consonant phonemes (the exact count varies slightly between analyses). They are set out in the chart below, organised by place of articulation (columns — where the obstruction is made) and manner of articulation (rows — how the airstream is modified). Where a cell holds a voicing pair, the voiceless sound is on the left and the voiced sound on the right. This chart is a markdown table, and it is the standard way to present the consonant system — you should be able to read it fluently and locate any sound by its three-term label.
| Manner | Bilabial | Labiodental | Dental | Alveolar | Post-alveolar | Palatal | Velar | Glottal |
|---|---|---|---|---|---|---|---|---|
| Plosive | p · b | t · d | k · g | |||||
| Nasal | m | n | ŋ | |||||
| Fricative | f · v | θ · ð | s · z | ʃ · ʒ | h | |||
| Affricate | tʃ · dʒ | |||||||
| Approximant | w | r | j | (w) | ||||
| Lateral | l |
(/w/ appears twice because it is a labial-velar approximant — produced with simultaneous lip rounding and a raised back of the tongue.)
| Symbol | Example | Three-term description |
|---|---|---|
| /p/ | pat | voiceless bilabial plosive |
| /b/ | bat | voiced bilabial plosive |
| /t/ | tap | voiceless alveolar plosive |
| /d/ | dog | voiced alveolar plosive |
| /k/ | cat | voiceless velar plosive |
| /g/ | get | voiced velar plosive |
| /f/ | fat | voiceless labiodental fricative |
| /v/ | vat | voiced labiodental fricative |
| /θ/ | think | voiceless dental fricative |
| /ð/ | this | voiced dental fricative |
| /s/ | sit | voiceless alveolar fricative |
| /z/ | zoo | voiced alveolar fricative |
| /ʃ/ | ship | voiceless post-alveolar fricative |
| /ʒ/ | pleasure | voiced post-alveolar fricative |
| /h/ | hat | voiceless glottal fricative |
| /tʃ/ | chip | voiceless post-alveolar affricate |
| /dʒ/ | jug | voiced post-alveolar affricate |
| /m/ | mat | voiced bilabial nasal |
| /n/ | net | voiced alveolar nasal |
| /ŋ/ | sing | voiced velar nasal |
| /l/ | lot | voiced alveolar lateral approximant |
| /r/ | red | voiced post-alveolar approximant |
| /j/ | yes | voiced palatal approximant |
| /w/ | wet | voiced labial-velar approximant |
The full mechanics of place, manner and voicing are developed in Lesson 3; for now the aim is to recognise each symbol and read its label.
English vowels are more complex than consonants because they vary far more between accents. In RP there are roughly 20 vowel phonemes — 12 monophthongs (pure vowels) and 8 diphthongs (gliding vowels). Vowels are classified by tongue height (close to open), tongue backness (front to back) and lip rounding (rounded or unrounded) — a system developed from Daniel Jones's cardinal vowel scheme, explored in Lesson 2.
| Symbol | Key Word | Description |
|---|---|---|
| /iː/ | fleece | close front unrounded (long) |
| /ɪ/ | kit | near-close near-front unrounded (short) |
| /e/ | dress | open-mid front unrounded (short) |
| /æ/ | trap | near-open front unrounded (short) |
| /ɑː/ | bath, father | open back unrounded (long) |
| /ɒ/ | lot | open back rounded (short) |
| /ɔː/ | thought | open-mid back rounded (long) |
| /ʊ/ | foot | near-close near-back rounded (short) |
| /uː/ | goose | close back rounded (long) |
| /ʌ/ | strut | open-mid back unrounded (short) |
| /ɜː/ | nurse | open-mid central unrounded (long) |
| /ə/ | about, comma | mid central unrounded (short, unstressed only) |
| Symbol | Key Word | Movement |
|---|---|---|
| /eɪ/ | face | mid front → close front |
| /aɪ/ | price | open central → close front |
| /ɔɪ/ | choice | open-mid back → close front |
| /əʊ/ | goat | mid central → close back |
| /aʊ/ | mouth | open central → close back |
| /ɪə/ | near | close front → mid central |
| /eə/ | square | mid front → mid central |
| /ʊə/ | cure | close back → mid central |
The schwa deserves special attention because it is the most common vowel sound in English. It is a short, unstressed, mid-central vowel — the "default" sound that appears in virtually every unstressed syllable.
Examples of schwa in common words:
The schwa is central to connected speech (Lesson 4) because full vowels in unstressed positions are constantly reduced to schwa in natural talk. Roughly one in three vowel sounds in fluent English is a schwa, which is why being able to spot and transcribe it is such a strong discriminator in analysis.
The terms phonetics and phonology are often used together, but they name two distinct (if overlapping) enterprises, and showing that you understand the difference is itself a mark of AO1 sophistication.
A simple way to hold the distinction: phonetics tells you that English [pʰ] and [p] are physically different sounds; phonology tells you that, in English, that physical difference is not used to distinguish words, so both belong to one phoneme /p/. The very same two phones might be separate phonemes in another language. For A-Level you operate at both levels — using phonetic detail (the substance) to support phonological argument (the system).
Key Definition: Phonetics — the study of the physical production, transmission and perception of speech sounds, dealing in phones [ ]. Phonology — the study of how sounds function within a language's system, dealing in phonemes / /.
The official IPA chart is not an arbitrary list; it is a grid that encodes the very parameters of articulation. Understanding its logic makes the symbols far easier to learn and lets you reason your way to a sound's identity rather than memorising by rote.
The pay-off for analysis is that any sound's three-term label can be read off its position: a symbol sitting in the "plosive" row and "bilabial" column, on the voiced (right) side, must be the voiced bilabial plosive /b/. Lessons 2 and 3 work through the vowel quadrilateral and the consonant grid in full.
Sounds do not occur in a random string; they are organised into syllables, and a grasp of syllable structure underpins much phonological analysis (especially the connected-speech processes of Lesson 4). A syllable has up to three parts:
| Part | Definition | Example in "strand" /strænd/ |
|---|---|---|
| Onset | The consonant(s) before the vowel | /str/ |
| Nucleus | The vowel (or syllabic consonant) at the core — the only obligatory part | /æ/ |
| Coda | The consonant(s) after the vowel | /nd/ |
The nucleus plus coda together form the rhyme (the part that matters for poetic rhyme). Crucially, a nucleus need not be a vowel: English permits syllabic consonants, where /l/, /n/ or /m/ themselves form the core of an unstressed syllable, marked with a small vertical stroke — "bottle" /ˈbɒtl̩/, "button" /ˈbʌtn̩/, "rhythm" /ˈrɪðm̩/. Recognising syllabic consonants is a precise observation that often goes hand in hand with schwa elision (the schwa drops and the following consonant takes over the syllable).
Key Definition: Syllable — a phonological unit organised around an obligatory nucleus (usually a vowel), optionally preceded by an onset and followed by a coda; nucleus + coda form the rhyme. A syllabic consonant is a consonant functioning as a syllable nucleus, e.g. /l̩/ in "bottle".
Having grasped the phoneme/allophone distinction in the abstract, it is worth cataloguing the most important allophonic processes of English, because being able to name them — and to know they are predictable and meaning-preserving — is exactly what separates a confident answer from a vague one. Each of the following is a context-conditioned variant of a single phoneme, not a separate sound.
| Allophonic process | Phoneme | Conditioning environment | Realisation |
|---|---|---|---|
| Aspiration | /p, t, k/ | Start of a stressed syllable | [pʰ, tʰ, kʰ] — "pin" [pʰɪn]; absent after /s/ ("spin" [spɪn]) |
| Clear vs dark /l/ | /l/ | Clear before vowels, dark after vowels/before consonants | [l] in "leaf"; [ɫ] in "feel" |
| Vowel nasalisation | any vowel | Adjacent to a nasal consonant | "man" [mæ̃n] — the vowel picks up nasal resonance |
| Pre-fortis clipping | long vowels/diphthongs | Before a voiceless ("fortis") consonant | shorter vowel in "seat" than "seed", same quality |
| Glottal reinforcement | /p, t, k/ | Often syllable-finally | a glottal closure accompanies the plosive, e.g. "kit" [kɪʔt] |
The unifying principle is complementary distribution: each variant appears precisely where its partner cannot, so the two never contrast and a speaker could not use the difference to signal a change of meaning. This is why, when you spot one of these in narrow transcription, the correct framing is "the /t/ phoneme is realised as the allophone [ʔ] in this environment", not "the speaker uses a different sound". That phrasing keeps the abstract phoneme and its concrete realisation properly distinct — the conceptual move at the heart of the whole topic.
A short worked example shows the analytical pay-off. Suppose a transcript of an Estuary English speaker gives "quite a lot of water" with glottal stops.
Model answer (extract): "The speaker realises intervocalic and pre-consonantal /t/ as the glottal-stop allophone [ʔ] — 'quite a' [kwaɪʔ ə], 'lot of' [lɒʔ ə], 'water' [ˈwɔːʔə]. Phonemically the words still contain /t/; phonetically that /t/ surfaces as [ʔ], a context-governed allophone, which is why the transcription is given narrowly in square brackets. Glottal replacement of /t/ has spread well beyond its traditional Cockney associations through Estuary English and is now common even among younger middle-class speakers, so here it indexes a broadly south-eastern, informal identity rather than a narrowly working-class one (AO3)."
Examiner-style commentary: A Mid-band answer would say the speaker "uses glottal stops instead of t". A Stronger answer transcribes the feature narrowly and names glottal replacement. This Top-band extract is precise about the phoneme/allophone relationship (the /t/ is still there underlyingly; [ʔ] is its realisation), justifies the square brackets, and handles the social meaning with nuance for AO3. Once again, the discriminator is conceptual control, not symbol-count.
Here are some example words transcribed in broad IPA (RP):
| Word | Transcription |
|---|---|
| cat | /kæt/ |
| think | /θɪŋk/ |
| church | /tʃɜːtʃ/ |
| pleasure | /ˈpleʒə/ |
| language | /ˈlæŋgwɪdʒ/ |
| phonetics | /fəˈnetɪks/ |
| international | /ˌɪntəˈnæʃənəl/ |
| alphabet | /ˈælfəbet/ |
Note the stress marks: the primary stress mark ˈ is placed before the stressed syllable, and the secondary stress mark ˌ before a syllable with secondary stress. A reliable working method for transcribing an unfamiliar word is: (1) say it slowly and naturally, ignoring the spelling; (2) segment it into syllables; (3) identify each consonant and vowel sound in turn, checking each vowel against a known key word; (4) mark the stress; (5) read your transcription back aloud and confirm it produces the word you intended. Step (1) is the one most candidates skip — and it is the one that prevents the single commonest error: transcribing the spelling rather than the sound (writing /k/ for the silent "k" of "know", for example).
A common Paper 2 task gives a short transcript and asks you to analyse the phonological features of a speaker's accent. Suppose a Northern English speaker says "I had a bath last night" and the data shows the BATH word with a short front vowel and the final cluster simplified.
Model answer (extract): "The speaker realises the BATH vowel as /æ/ rather than the RP /ɑː/, producing /bæθ/. This is a diagnostic feature of most Northern English accents, where the TRAP–BATH split has not operated, and it functions here as a clear index of regional identity (AO3). At the phrase boundary, the speaker also simplifies the cluster in 'last night', eliding word-final /t/ to give /lɑːs naɪt/ — a connected-speech process rather than a feature of the accent system as such, and one which would be shown narrowly in square brackets. Recording these features with the IPA, rather than describing them impressionistically, allows precise comparison with the reference accent."
Examiner-style commentary: A Mid-band response would correctly name the BATH vowel difference but might rely on loose phrasing ("they say a flat a") and offer no transcription. A Stronger response uses accurate IPA for the diagnostic word and the correct lexical-set label (BATH). This Top-band extract goes further: it transcribes the feature precisely, distinguishes an accent-system feature (the vowel) from a connected-speech process (the elision), signals the broad/narrow distinction, and links the variation to regional identity for AO3. The discriminator at the top is not more symbols but sharper categorisation and an explicit move from description to social meaning.
Phonological analysis is not confined to live spoken data. AQA Paper 1 and the NEA frequently involve texts that represent speech — dialogue in fiction, eye-dialect in a novel, the scripted "spontaneity" of an advertisement, or the phonetic respelling writers use to evoke an accent (e.g. "wiv", "innit", "'ello"). The IPA lets you analyse such representation precisely: you can show exactly which phonological features a writer has chosen to mark in spelling (and which they have left standard), and discuss the effect.
Consider a novelist rendering a London character's speech as "I fink we should 'ave a chat abaht it." A strong analysis would observe that the writer signals TH-fronting (/θ/ → /f/ in "fink"), H-dropping in the stressed lexical word "have" ('ave), and a fronted, lowered MOUTH vowel in "about" (spelled "abaht" to suggest a quality nearer [a] than RP /aʊ/). You would then ask the AO3 question: what does this representation do? It rapidly indexes region and social class, and — depending on the surrounding text — may build sympathy, comedy or, more troublingly, a stereotype. The same IPA toolkit that describes a real accent equips you to analyse its fictional construction.
Because phonetics is a method integrated into every component, the return on mastering the IPA is unusually high. In Paper 1 it sharpens analysis of spoken transcripts and of speech represented in texts, and it feeds discussion of how individuals and social groups index identity through pronunciation. In Paper 2 it is central to accent and dialect variation and to language change over time (the centring-diphthong loss, the spread of glottalling and TH-fronting). In the NEA it underwrites any investigation touching on accent, regional variation, child phonological development, or the representation of speech. In every case the move is the same: from an impression ("sounds posh", "sounds Northern", "drops sounds") to an evidenced, terminologically precise claim that an examiner can credit under AO1 and that you can then connect to context under AO3.