You are viewing a free preview of this lesson.
Subscribe to unlock all 10 lessons in this course and every other course on LearningBro.
How do children acquire language? This deceptively simple question has generated one of the longest-running and most productive debates in linguistics and psychology. In the space of roughly five years a human infant moves from reflexive crying to producing grammatically structured, context-sensitive, creative utterances — an achievement no other species and no machine has matched without explicit instruction. The central puzzle is how this happens so fast, so reliably, and in broadly the same sequence across radically different languages and cultures. Four major theoretical traditions have grown up around this puzzle — nativism, behaviourism, the cognitive approach and social interactionism — and for AQA A-Level English Language (Paper 1, Section B: Children's Language Development, 0–11 years) these theories are not optional extras. They are the lens through which you read the data. A strong essay does not list theories; it deploys them to explain features in a transcript and then weighs how well each one accounts for what the child actually does. Section B rewards AO2 (critical understanding of concepts and theories) and AO1 (accurate, methodical analysis using linguistic terminology) especially heavily, so genuine command of these positions is the spine of a high-band answer.
A useful framing is the nature–nurture continuum. At the "nature" pole sits Chomsky, for whom the environment merely triggers a richly specified innate endowment. At the "nurture" pole sits Skinner, for whom the child is shaped almost entirely by environmental consequences. Piaget and the interactionists occupy the middle, but in different ways: Piaget makes language a by-product of general cognition, while Bruner and Vygotsky make it a product of social collaboration. Keep this map in mind, because examiners reward candidates who can locate a theory on the continuum and explain why a particular data feature pulls towards one pole or the other.
The nativist perspective argues that the capacity to acquire language is innate — humans are born biologically primed for it, much as we are biologically primed to walk. The most influential nativist is Noam Chomsky, whose work from Syntactic Structures (1957) and Aspects of the Theory of Syntax (1965) onwards reframed the whole field. Chomsky's starting observation is that human language is not just a vocabulary but a generative system: a finite set of rules that can produce an infinite number of grammatical sentences, including sentences never previously uttered. No child could store every possible sentence; therefore the child must internalise rules. The question is where those rules come from.
Chomsky proposed that every child is born with a Language Acquisition Device (LAD) — an innate mental faculty dedicated to language. The LAD is not a vocabulary store; it is a processing mechanism that allows the child to:
Key Definition: Language Acquisition Device (LAD) — a hypothesised innate mental faculty proposed by Chomsky that enables a child to derive the grammatical rules of whatever human language they are exposed to from limited and imperfect input.
Chomsky argued that beneath the surface diversity of the world's languages lies a shared structural blueprint he called Universal Grammar (UG) — a set of abstract principles common to all human languages (for example, that sentences are built hierarchically out of phrases, not merely strung together as flat sequences of words). On this view the child is not learning grammar from scratch; the child already "knows" the universal principles and only has to discover, from the input, how their particular language is configured. Chomsky frames this as parameter setting: think of each language-specific choice (for instance, whether the verb precedes or follows its object) as a switch that the input flips one way or the other. This elegantly explains how children acquire whichever of the world's languages surrounds them, while still doing so along a common developmental path.
| Concept | Explanation |
|---|---|
| LAD | An innate biological faculty that processes input and derives grammatical rules |
| Universal Grammar (UG) | A set of abstract structural principles shared by all human languages |
| Parameter setting | Exposure to a specific language "flips the switches" to its particular configuration |
| Poverty of the stimulus | The input is too sparse and degraded to account, by itself, for the rich grammar the child ends up with |
The poverty of the stimulus argument is the logical engine of nativism. Chomsky argued that the language a child hears is degenerate: it is full of hesitations, false starts, slips, incomplete sentences and background noise; it contains essentially no information about which sentences are ungrammatical (children rarely hear "Don't say it that way"); and it is not accompanied by explicit grammar teaching. Despite this impoverished, error-strewn evidence, every normally developing child arrives at a complex, abstract grammar — and crucially they converge on roughly the same grammar as everyone else in their speech community. Chomsky's inference is that the gap between the thinness of the input and the richness of the outcome must be bridged by something the child brings to the task: innate knowledge.
The behaviourist approach, most closely associated with B. F. Skinner (Verbal Behavior, 1957), treats language as learned behaviour acquired through the same general mechanisms as any other behaviour: imitation, reinforcement and conditioning. For a strict behaviourist, the mind is a "black box"; what matters is the observable relationship between a stimulus, a response and its consequence.
In Verbal Behavior Skinner applied operant conditioning — strengthening or weakening behaviour through its consequences — to verbal behaviour:
Key Definition: Operant conditioning — a learning process in which a behaviour becomes more or less frequent according to its consequences (reinforcement or punishment). Skinner argued that children's verbal behaviour is selected and shaped by caregiver responses in exactly this way.
| Mechanism | Example |
|---|---|
| Imitation | Child hears doggy and repeats doggy |
| Positive reinforcement | Child says milk, the caregiver gives milk, strengthening the word |
| Correction | Child says I goed; parent replies No, you went |
| Shaping | Babble ma-ma-ma meets an enthusiastic response, nudging the child toward Mama |
Behaviourism is not worthless: imitation and reinforcement plainly do operate, most obviously in the acquisition of vocabulary, accent and socially-rewarded formulae (please, ta, bye-bye). A child learns the word dog, not chien, precisely because of the input around them — environment clearly contributes. The theory's fatal weakness is grammar. Chomsky's (1959) review of Verbal Behavior is one of the most famous demolitions in the discipline, and its objections remain the standard exam ammunition:
The cognitive approach, associated above all with Jean Piaget (1926, 1952), holds that language is not a self-contained innate module but one expression of the child's general intellectual development. On this account, children can only talk about what they can first understand: linguistic milestones are downstream of cognitive milestones. Language is a tool that develops as cognition develops, not a faculty that runs ahead of it.
| Piaget's Stage | Age (approx.) | Cognitive Achievement | Linguistic Consequence |
|---|---|---|---|
| Sensorimotor | 0–2 years | Object permanence; knowing the world through the senses and actions | First words emerge once the child grasps that objects persist; early speech is tied to the here-and-now |
| Pre-operational | 2–7 years | Symbolic thought; egocentrism; animism | Vocabulary explosion; words used as symbols; speech is often egocentric (not adapted to a listener) |
| Concrete operational | 7–11 years | Conservation; logical thinking about concrete situations | Language becomes more logical, decentred and listener-aware; metalinguistic awareness grows |
| Formal operational | 11+ years | Abstract and hypothetical reasoning | Capacity for abstract, conditional and hypothetical language |
Piaget's most cited claim for child language is that object permanence — the realisation that things continue to exist when out of sight, achieved around 8–12 months — must precede the use of words to name absent objects. A word is a symbol that stands in for a thing; a child who does not yet believe the hidden ball still exists has nothing for the symbol to stand for. The fact that first words tend to appear shortly after object permanence is established is taken as supporting evidence.
Key Definition: Object permanence — the understanding that objects continue to exist even when they cannot be perceived. Piaget treated this cognitive milestone as a precondition for naming, since words are symbols for things that need not be present.
Piaget interpreted young children's running self-commentary during play as egocentric speech: talk that is not genuinely communicative because the pre-operational child cannot yet take another's perspective, and which simply fades as the child decentres. This same phenomenon becomes a battleground between Piaget and Vygotsky (below), who reads it in the opposite direction — as the origin of inner thought. Holding both readings in mind is exactly the kind of theoretical evaluation Section B rewards.
Interactionism argues that language is acquired through social interaction with more competent speakers. It does not necessarily deny innate capacity; rather, it insists that capacity must be activated and shaped in collaboration. The two pillars are Jerome Bruner (1983) and Lev Vygotsky (1978).
In deliberate counterpoint to Chomsky's LAD, Bruner (1983) proposed the Language Acquisition Support System (LASS) — the structured, supportive environment caregivers provide that allows a child's language to develop. Bruner's slogan, in effect, is that the LAD needs a LASS: innate potential is necessary but insufficient without social scaffolding. Key features include:
Key Definition: Language Acquisition Support System (LASS) — Bruner's term for the framework of structured social support — scaffolding, formats, joint attention — through which caregivers help a child acquire language in meaningful contexts.
Vygotsky (1978) placed social interaction at the very centre of cognitive and linguistic growth. His key construct is the Zone of Proximal Development (ZPD) — the gap between what a child can achieve independently and what they can achieve with help from a more knowledgeable other (MKO). Learning happens inside this zone: the MKO models language slightly above the child's solo level, the child performs it with support, and over time the supported performance is internalised as independent ability. Vygotsky also reinterpreted the very self-talk Piaget had dismissed: for Vygotsky, private (egocentric) speech is not a deficiency but a functional tool that children use to regulate their own behaviour and that gradually goes "underground" to become silent inner speech — the medium of verbal thought. Where Piaget saw egocentric speech withering away, Vygotsky saw it transforming into thinking itself.
| Concept | Theorist | Explanation |
|---|---|---|
| LASS | Bruner (1983) | The caregiver-provided support structure that complements innate ability |
| Scaffolding | Bruner (1983) | Graduated support, withdrawn as competence grows |
| Formats / routines | Bruner (1983) | Predictable repeated interactions that frame language learning |
| ZPD | Vygotsky (1978) | The gap between solo ability and supported ability |
| MKO | Vygotsky (1978) | A more competent partner who supports development |
| Private / inner speech | Vygotsky (1978) | Self-directed speech that is internalised into verbal thought |
Because the four theories make competing claims, examiners reward candidates who can marshal evidence for and against, not just describe positions. The following lines of evidence recur and are worth holding ready.
The idea of a critical period — a developmental window during which language is acquired easily and after which it becomes very difficult — is central to nativism, since a biologically timed window implies a biological mechanism. The clearest support comes, unfortunately, from cases of deprivation. Genie (Curtiss, 1977), discovered at 13 after extreme isolation, acquired a sizeable vocabulary but never developed normal syntax — she could not reliably form questions, use auxiliaries or build complex clauses. The much earlier case of Victor, the "Wild Boy of Aveyron", is similar. The difficulty is that such cases are hopelessly confounded: abuse, malnutrition, possible pre-existing impairment and the sheer absence of any interaction are tangled together, so they cannot cleanly isolate the role of a maturational window. They are suggestive of a critical period, not proof, and you should present them with that caution. Less ethically fraught support comes from second-language acquisition, where ultimate attainment of native-like grammar tends to decline with the age of first exposure.
A striking nativist argument comes from situations where children are exposed to impoverished or unstructured input yet produce structured language — apparent demonstrations of an innate drive to grammaticalise. The linguist Derek Bickerton argued that children exposed to a pidgin (a rudimentary, grammar-poor contact language) can develop it into a fully grammatical creole in a single generation, supplying grammatical machinery the input lacked. Even more vivid is the emergence of Nicaraguan Sign Language: when deaf children were first brought together in schools in Nicaragua in the late twentieth century, successive cohorts of children spontaneously regularised and grammaticalised their shared signing into a rich, rule-governed language not modelled for them by adults. Both cases are read as children imposing structure that was not fully present in the input — exactly what the LAD/UG hypothesis predicts. Attribute such claims as "argued by" rather than as settled fact, but the cases are powerful illustrative evidence for an innate grammatical capacity.
The behaviourist–nativist clash partly turns on an empirical question: how good is the input? Nativists assume it is "degenerate"; later research suggests that child-directed speech is actually rather clean — slow, repetitive, grammatically well-formed and pitched to the child's level. This cuts both ways. It weakens the strong poverty-of-the-stimulus claim (the input is better than Chomsky implied) but supports interactionism (caregivers tune their speech helpfully). A sophisticated answer notes that the quality of input is itself contested evidence rather than a settled fact assumed by one theory.
It is worth stating plainly that no serious contemporary account is purely one of these theories. Most researchers accept some innate readiness (the species-specificity and universal sequence are hard to deny), substantial learning from rich input (vocabulary, the native sound system, idioms) and an essential role for social interaction (pragmatics above all). Usage-based theorists such as Michael Tomasello argue that children build grammar gradually from concrete, item-based patterns using general cognitive abilities (pattern-finding and intention-reading) rather than a dedicated innate grammar — a position that blends learning, cognition and interaction and directly challenges strong nativism. You do not need to resolve the debate in an exam; you need to stage it intelligently around the data in front of you, showing which theory each feature supports.
| Feature | Nativist | Behaviourist | Cognitive | Social Interactionist |
|---|---|---|---|---|
| Key figure(s) | Chomsky | Skinner | Piaget | Bruner, Vygotsky |
| Nature or nurture? | Nature | Nurture | Nature first, then language | Interaction of nature and nurture |
| Key mechanism | LAD / Universal Grammar | Imitation and reinforcement | Cognitive development | Scaffolding, ZPD, formats |
| Role of input | Triggers parameter setting | The whole source of learning | Secondary to cognition | Essential — provides the scaffold |
| Best explains | Universals; creativity; virtuous errors | Vocabulary; accent; social formulae | Concrete-before-abstract order | Pragmatics; the role of caregivers |
| Key weakness | LAD/UG unobservable | Cannot explain creativity or virtuous errors | Dissociation cases (e.g. Williams syndrome) | Cross-cultural counter-evidence |
The modern consensus is not that one theory wins. It is that acquisition is best explained by an interaction of an innate, species-specific readiness for language (nativism's contribution), general cognitive development (Piaget's contribution), powerful learning from input (the kernel of truth in behaviourism) and structured social support (the interactionists' contribution). In an exam, the candidates who score highest are usually those who treat the theories as complementary tools for explaining different features of the data, not as rivals one of which must be "right".
A typical Section B prompt asks you to evaluate ideas about how children acquire language with reference to data. Compare the two responses below to the same micro-task: explain what a child's virtuous error of "I drawed a picture" reveals about how language is acquired.
Mid-band response: "The child says 'drawed' which is wrong because the correct word is 'drew'. This is a virtuous error. It shows the child is learning. Skinner said children copy adults, so the child copied it wrong. This proves the behaviourist theory."
Top-band response: "The non-standard past-tense form drawed is a virtuous error: the child has over-applied the regular
-edrule to an irregular verb. Crucially, no caregiver models drawed, so it cannot be an imitation, which directly undermines Skinner's behaviourist account. Instead it supports a nativist reading: the child has abstracted a productive grammatical rule from the input and is generating a novel form from it, exactly the kind of creativity Chomsky argues the LAD makes possible. The error is also developmentally informative — it sits on the middle limb of the U-shaped curve, where children temporarily regularise forms they once produced correctly — so it is a sign of progress, not regression. A full reading would add that caregiver recasting ('Oh, you drew a picture!') gives the child interactional feedback consistent with Bruner's LASS, showing how nativist and interactionist accounts can be combined rather than opposed."
Examiner-style commentary: The Mid-band response correctly names the feature but mislabels its theoretical significance — a virtuous error actually challenges behaviourism, so the candidate has reversed the argument (a common and costly slip). The Top-band response is precise with terminology (over-application of the regular
-edmorpheme), uses the data feature to discriminate between theories rather than just attach a label, situates the error developmentally (the U-shaped curve), and synthesises two frameworks. That combination of accurate analysis (AO1) and evaluative theoretical understanding (AO2) is what lifts an answer into the top band.
It pays to know the shape of the task these theories serve. Section B of Paper 1: Language, the Individual and Society focuses on Children's Language Development (0–11 years) and is worth 30 marks. The paper as a whole is 2 hours 30 minutes, 100 marks, and 40% of the A-level. In Section B you answer one discursive essay chosen from two questions, where the accompanying data focuses on spoken, written or multimodal language. The essay is marked against five assessment objectives, which for this section weight as follows: AO1 26% (apply linguistic methods and terminology systematically and accurately), AO2 26% (demonstrate critical understanding of concepts and theories — this is where the four approaches earn their keep), AO3 23% (analyse contextual factors), AO4 15% (explore connections across texts) and AO5 10% (where relevant, your own writing skill).
The practical implication for theory is direct. AO2 rewards you for using and evaluating the right theory at the right moment, not for reproducing it. The strongest scripts treat the data as the centre of gravity and bring in Chomsky, Skinner, Piaget, Bruner or Vygotsky precisely when a feature calls for explanation — then weigh how convincingly that theory accounts for it. Because the data may be written or multimodal as well as spoken, be ready to apply these acquisition theories to literacy and to mixed-mode texts, not only to speech transcripts.
Several recurring errors cost candidates marks on this topic, and they are easy to pre-empt once you know them.
goed as support for Skinner. It is the opposite: because no one models goed, it cannot be imitation, so it undermines behaviourism and supports an active, rule-forming child. Get the direction of the argument right.Avoiding these six pitfalls, while keeping the data central, is most of what separates a secure answer from a shaky one.