AQA A-Level Biology: Populations and Evolution — Complete Revision Guide (7402)
AQA A-Level Biology: Populations and Evolution (7402)
Populations and evolution is the topic that links every other strand of A-Level Biology into a single explanatory framework. Sections 3.7.2 and 3.7.3 of AQA 7402 develop population genetics from the algebraic foundation of the Hardy-Weinberg principle through to the molecular and palaeontological evidence for descent with modification. The Phase 2 build adds two new lessons — molecular evidence for evolution and phylogeny, and the fossil, biogeographic and comparative evidence — that complete the "evidence for evolution" treatment the specification demands and that the Phase 1 catalogue had previously underserved.
This course sits as course 8 of the 11 in the LearningBro AQA A-Level Biology learning path. It depends on the genetics foundations laid in genetic information, which develops meiosis, chromosomal segregation and mutation, and on the biochemical fluency built in biological molecules, cells and energy transfers. It feeds into ecosystems and energy flow, where population dynamics meet ecological resource constraints, and into gene expression and biotechnology, where molecular phylogeny is operationalised through DNA sequencing.
This guide walks through all eight lessons of the Populations and Evolution course — the chi-squared test, Hardy-Weinberg, natural selection, genetic drift and gene flow, speciation, classification and taxonomy, molecular evidence and the comparative evidence trio — and links each into the wider AQA Biology programme.
Guide Overview
The course breaks Sections 3.7.2 and 3.7.3 into eight lessons. Lesson 1, chi-squared test, develops the statistical machinery that underpins genetic-ratio testing throughout the course. Lesson 2, Hardy-Weinberg principle, establishes the null model against which evolutionary change is detected. Lesson 3, natural selection, develops Darwin's mechanism in its modern population-genetic form. Lesson 4, genetic drift and gene flow, handles the non-selective evolutionary processes — the random sampling of alleles between generations and the movement of alleles between populations. Lesson 5, speciation, develops the allopatric and sympatric mechanisms by which new species arise. Lesson 6, classification and taxonomy, covers the Linnaean hierarchy, the three-domain system and the principles of cladistics.
The two Phase 2 additions complete the evidence base. Lesson 7, molecular evidence for evolution and phylogeny, develops DNA and amino-acid sequence comparison, the molecular clock and phylogenetic-tree construction. Lesson 8, evidence for evolution: fossil, biogeographic and comparative, develops the three classical evidence categories — palaeontology, biogeography and comparative anatomy — that anchor evolutionary theory outside the molecular domain.
AQA 7402 Specification Coverage
AQA Biology 7402 examines Sections 3.7.2 and 3.7.3 most heavily on Paper 2 (Sections 3.5-3.8), with synoptic items appearing on Paper 3. Quantitative population genetics — Hardy-Weinberg and chi-squared — is one of the most reliable sources of straightforward marks in the entire 7402 syllabus. Refer to the official AQA specification document for exact wording of every learning outcome.
| Sub-topic | Spec area | Typical paper weight |
|---|---|---|
| Chi-squared test | 3.7.2 / maths skills | 3-6 marks |
| Hardy-Weinberg principle | 3.7.2 | 4-8 marks |
| Natural selection | 3.7.2 | 5-9 marks |
| Genetic drift and gene flow | 3.7.2 | 3-5 marks |
| Speciation (allopatric and sympatric) | 3.7.3 | 4-8 marks |
| Classification and taxonomy | 3.7.3 | 3-5 marks |
| Molecular evidence and phylogeny | 3.7.3 | 4-6 marks |
| Fossil, biogeographic and comparative evidence | 3.7.3 | 3-5 marks |
These weights are estimates modelled on the structure of recent 7402 papers. What is reliable is that a Hardy-Weinberg calculation, a chi-squared analysis and a natural-selection long-answer item appear on essentially every series.
The Chi-Squared Test
The opening lesson on the chi-squared test develops the statistical machinery that underpins genetic-ratio testing across the course. The chi-squared statistic — introduced by Karl Pearson in 1900 — tests whether observed frequencies deviate significantly from expected frequencies. The formula is χ² = Σ((O − E)² / E), where O is the observed count, E is the expected count and the summation runs over all categories.
The procedure is procedural: state the null hypothesis (typically "the observed ratio matches the expected ratio"); calculate expected frequencies from the null model; compute χ² from the table; determine degrees of freedom (number of categories minus one); look up the critical value at p = 0.05; if the calculated χ² exceeds the critical value, reject the null hypothesis. The critical value for one degree of freedom at p = 0.05 is 3.84; for two degrees, 5.99; for three degrees, 7.81. AQA provides the critical-value table on the data sheet — students must know how to read it but need not memorise it.
The classical AQA application is testing a Mendelian dihybrid cross against the 9:3:3:1 expected ratio. Chi-squared is also applied to goodness-of-fit tests for Hardy-Weinberg ratios. A common pitfall is to apply the test to ratios rather than counts — chi-squared requires whole-number observed and expected frequencies.
The Hardy-Weinberg Principle
The Hardy-Weinberg principle — independently formulated by Hardy and Weinberg in 1908 — provides the null model against which evolutionary change is detected. The principle states that in a large, randomly-mating population with no mutation, no migration and no selection, allele frequencies and genotype frequencies remain constant from generation to generation.
The algebra is elementary. For a single locus with two alleles A (frequency p) and a (frequency q), with p + q = 1, the expected genotype frequencies are p² (AA), 2pq (Aa) and q² (aa), and these three frequencies must sum to 1. Given any one of the three input quantities (the recessive phenotype frequency is the most common starting point, because q² is directly observable), all the others can be calculated.
The classical AQA application uses cystic fibrosis. The disease affects approximately 1 in 2,500 newborns in the UK — so q² = 1/2,500, giving q = 0.020 and the heterozygote (carrier) frequency 2pq ≈ 1 in 25. Roughly four percent of the UK population carries one copy of the recessive cystic fibrosis allele despite the disease being rare. This calculation pattern recurs across sickle-cell, phenylketonuria and Tay-Sachs contexts.
The five Hardy-Weinberg assumptions — large population, random mating, no mutation, no migration, no selection — are the levers of evolutionary change. Deviation of observed genotype frequencies from Hardy-Weinberg expectations indicates that at least one assumption is being violated, and is the standard population-genetic signal that evolution is happening at the locus.
A common pitfall is to confuse allele frequency (p, q) with genotype frequency (p², 2pq, q²). The algebra works only if the distinction is held clearly.
Natural Selection
Natural selection develops Darwin's mechanism in its modern population-genetic form. The argument has four premises: organisms produce more offspring than can survive; individuals vary heritably; some variants survive and reproduce better than others; the better-adapted variants increase in frequency over generations. The fourth premise is the population-genetic restatement of "survival of the fittest" — fitness is operationalised as the relative reproductive success of a genotype.
AQA recognises three modes of selection by the shape of the survival-against-phenotype curve. Directional selection favours one extreme of the distribution and shifts the population mean over generations — antibiotic resistance in bacteria, peppered-moth melanism during the industrial revolution and Darwin's finches' beak depth in drought years are the canonical examples. Stabilising selection favours the population mean and reduces variance — human birth weight (extreme low and extreme high birth weights both carry higher mortality) is the textbook example. Disruptive selection favours both extremes against the mean, increasing variance and potentially driving speciation — African seedcrackers, in which large-billed and small-billed birds specialise on different seeds while medium-billed birds fare poorly, are the cited example.
The selection coefficient s measures the fitness disadvantage of one genotype relative to the most-fit. Even modest coefficients drive substantial allele-frequency change over geological timescales.
A common pitfall is to describe natural selection as acting "for the good of the species". It does not — natural selection acts on individuals (or, in modern formulations, on alleles), and the consequences for the species are emergent rather than purposeful.
Genetic Drift and Gene Flow
Genetic drift and gene flow develops the non-selective evolutionary processes. Genetic drift is the random change in allele frequencies between generations due to sampling — even in the absence of selection, finite populations show fluctuations in allele frequencies simply because the gametes that found the next generation are a finite sample of the parental gene pool.
Drift is strongest in small populations and weakest in large populations. The founder effect is the form of drift that operates when a small subset of a population establishes a new colony — the new population starts with whatever allele frequencies happened to be in the founders, which may differ substantially from the parent population. The Amish population of Pennsylvania, descended from approximately two hundred eighteenth-century founders, shows elevated frequencies of several otherwise rare recessive disorders for this reason. The bottleneck effect is the form of drift that operates when a population is dramatically reduced (by disease, natural disaster or hunting pressure) and then recovers — the recovered population reflects the allele frequencies of the survivors, which are again a finite and non-representative sample. Northern elephant seals, hunted nearly to extinction in the nineteenth century, show very low genetic diversity today as a consequence.
Gene flow is the movement of alleles between populations through migration of individuals or gametes. It tends to homogenise allele frequencies between populations and to oppose the divergence required for speciation. A single migrant per generation is, in classical population-genetic models, sufficient to prevent populations from diverging by drift alone.
A common pitfall is to describe drift as a force that selects against rare alleles. It does not — drift is non-directional. Rare alleles are more likely to be lost simply because there are fewer copies for the random sampling to preserve.
Speciation
Speciation develops the mechanisms by which new species arise. A species is most commonly defined under the biological species concept (Mayr, 1942) as a group of organisms that can interbreed to produce fertile offspring. The concept has well-known limitations — it does not apply to asexual organisms or to most fossil lineages — but is the operational definition used throughout the AQA spec.
Allopatric speciation occurs when a physical barrier separates two populations, gene flow ceases, the populations diverge by drift and differential selection in their respective environments, and ultimately become reproductively incompatible. The classical example is the Galápagos finches — Darwin's observation that thirteen finch species, each adapted to a particular ecological niche, occupy the archipelago, having diverged from a single mainland ancestor following colonisation. Geographical separation, ecological divergence and reproductive isolation are the three stages.
Sympatric speciation occurs without geographical separation — divergence happens within a single area through ecological, behavioural or chromosomal mechanisms. The Lake Victoria cichlids — hundreds of species derived from a single ancestor within the last fifteen thousand years through sexual selection on colouration and feeding morphology — are the most famous case. Polyploidy (chromosomal duplication during meiotic error) is a particularly rapid sympatric mechanism in plants — a single polyploid event can establish reproductive isolation in one generation, because the polyploid offspring cannot produce balanced gametes when crossed with diploid parents.
Reproductive isolating mechanisms are classified as prezygotic (temporal, behavioural, mechanical, gametic) or postzygotic (hybrid inviability, sterility, breakdown). The horse-donkey cross producing sterile mules is the canonical postzygotic example.
A common pitfall is to use "subspecies" as a synonym for "incipient species". Subspecies remain capable of interbreeding by definition; speciation is complete only when reproductive isolation prevents successful interbreeding.
Classification and Taxonomy
Classification and taxonomy covers the Linnaean hierarchy and the modern three-domain system. The Linnaean hierarchy — established by Carl Linnaeus in the eighteenth century — orders organisms into nested categories: domain, kingdom, phylum, class, order, family, genus, species. Each level contains the levels below it. The binomial name comprises genus and species — Homo sapiens, Felis catus, Quercus robur — and is universally recognised across languages.
The three-domain system — proposed by Carl Woese in 1977 on the basis of ribosomal RNA sequence comparisons — divides cellular life into Bacteria, Archaea and Eukarya. The system superseded the older five-kingdom scheme by recognising that the Archaea (extremophiles, methanogens) are as deeply diverged from the Bacteria as either group is from the Eukarya — a finding that emerged only from molecular analysis. The five-kingdom scheme remains in use as a teaching tool but is phylogenetically inaccurate at the deepest levels.
Cladistics — developed by Willi Hennig in the 1950s — classifies organisms strictly by shared derived characters (synapomorphies), producing monophyletic groups (a common ancestor plus all of its descendants). A paraphyletic group includes the common ancestor and some but not all of its descendants — "reptiles" excluding birds is paraphyletic, because birds are descended from reptilian ancestors. A polyphyletic group includes multiple ancestors and is regarded as a classification error.
A common pitfall is to confuse a phylogenetic tree (a hypothesis about evolutionary relationships) with a classification (a system of nested categories). The two are related — modern classifications aim to reflect phylogeny — but they are not identical, and the AQA mark scheme respects the distinction.
Molecular Evidence for Evolution and Phylogeny
Molecular evidence for evolution and phylogeny develops the most powerful modern evidence base. DNA sequence comparison — first applied at scale in the 1980s — measures the number of nucleotide differences between homologous genes in different species. Closely related species show few differences; distantly related species show many. Human and chimpanzee genomes differ at approximately one percent of nucleotide sites; human and mouse genomes at approximately fifteen percent; human and yeast at approximately fifty percent. The progressive divergence aligns precisely with the order of branching predicted by morphological and palaeontological evidence.
Amino-acid sequence comparison — pioneered by Sanger's work on insulin in the 1950s — applies the same logic to protein primary structure. Cytochrome c, an essential respiratory electron carrier present in all aerobes, varies by zero amino acids between humans and chimpanzees, by twelve between humans and dogs, and by fifty-six between humans and yeast. The pattern of sequence differences forms a phylogenetic signal that recovers the same evolutionary tree, regardless of which gene or protein is examined.
The molecular clock hypothesis — proposed by Zuckerkandl and Pauling in 1965 — exploits the approximately constant rate of neutral mutation per generation to estimate divergence times from sequence differences. The clock is not perfectly constant — different genes and different lineages evolve at different rates — but the technique, calibrated against well-dated fossil divergences, allows reasonably precise estimation of evolutionary timescales for which no fossil record exists.
Phylogenetic trees are constructed by clustering species so that the total branch length explaining the observed sequence differences is minimised (parsimony methods) or by computing the most likely tree given an explicit model of nucleotide substitution (maximum-likelihood methods). The molecular evidence converges with the morphological and palaeontological evidence to a striking degree — multiple independent lines of evidence pointing to the same evolutionary tree. A common pitfall is to treat molecular trees as "more accurate" than morphological trees; both have characteristic error sources, and the convergence is the strongest scientific argument.
Fossil, Biogeographic and Comparative Evidence
Evidence for evolution: fossil, biogeographic and comparative covers the three classical evidence categories that established evolutionary theory before molecular biology.
The fossil record documents the historical sequence of organisms. Lower (older) strata contain simpler organisms; higher strata contain progressively more complex and more recent forms. Transitional fossils — Archaeopteryx between reptiles and birds, Tiktaalik between fish and tetrapods, the Australopithecus / Homo sequence between great apes and modern humans — document the major evolutionary transitions in morphological terms. The fossil record is incomplete (soft-bodied organisms rarely fossilise; the geological column is patchy), but the patterns it preserves are unambiguous in their direction.
Biogeographic evidence — assembled in essence by Darwin and Wallace during their respective voyages of the 1830s and 1850s — documents the geographic distributions that descent with modification predicts and special creation does not. The marsupials of Australia, the lemurs of Madagascar and the unique flora and fauna of oceanic islands all reflect the joint history of evolutionary divergence and geographical isolation. Continental drift — confirmed by plate tectonics in the 1960s — explains the deep biogeographic patterns by which related taxa occur on continents that were once joined.
Comparative anatomy documents the homologous structures (similar structure, different function, shared evolutionary origin) that pervade related groups. The vertebrate forelimb — bat wing, whale flipper, horse leg, human arm — is built from the same skeletal elements arranged in the same pattern despite radically different functions. Homologies are the morphological signal of common descent. Analogous structures (similar function, different evolutionary origin — bird wing versus insect wing) document convergent evolution — independent lineages converging on similar morphological solutions to similar selective pressures. Vestigial structures — the human appendix, whale pelvic bones, snake hindlimbs — are evolutionary remnants of structures that were functional in ancestral forms.
The three classical evidence lines, together with the modern molecular evidence developed in Lesson 7, form an integrated case of converging independent inferences — exactly the form that scientific theories take when they are well-established. A common pitfall is to treat each evidence line in isolation. The strength of the evolutionary framework comes from the convergence of multiple independent lines, not from any single line alone.
Cross-Topic Synoptic Links
Populations and evolution connects to genetic information through the foundations of meiosis, chromosomal segregation, mutation and inheritance — every Hardy-Weinberg calculation and every chi-squared test on a Mendelian cross depends on this material. It connects to gene expression and biotechnology through DNA sequencing, polymerase chain reaction, gel electrophoresis and the molecular techniques that make phylogenetic analysis operational. It connects to ecosystems and energy flow through population dynamics, carrying capacity and the ecological context in which selection operates.
The cellular foundations from cells and the biochemical fluency from biological molecules underpin the molecular evidence developed here — every cytochrome-c comparison and every DNA-divergence calculation assumes a working understanding of protein and nucleic acid structure. The metabolic logic of energy transfers explains why cytochrome c is so highly conserved across the tree of life — its role in oxidative phosphorylation is so functionally constrained that almost any mutation is selected against.
Required Practical Anchors
This course does not directly own one of the twelve AQA required practicals. RP11 (sampling distribution of organisms using a quadrat or transect) is housed in course 9, ecosystems and energy flow, where its application to ecological population estimation is most naturally developed. The chi-squared lesson in this course supplies the statistical machinery that RP11 data analysis routinely depends on, and cross-referencing the two courses is part of the standard Paper 3 synoptic preparation.
How to Revise This Topic
The most effective revision for Sections 3.7.2 and 3.7.3 combines algebraic drill with comparative tables. The single highest-yield retrieval task is running a Hardy-Weinberg calculation from blank — start with a stated recessive disease frequency, derive q, then p, then the heterozygote frequency, then the homozygote dominant frequency — until the procedure runs in under two minutes without notes. The second highest-yield task is running a chi-squared calculation from a stated dihybrid cross to the accept/reject decision, including the degrees-of-freedom and critical-value steps.
Apply retrieval practice (Roediger and Karpicke, 2006) by writing one-page summaries of each evidence category from blank — fossil, biogeographic, comparative, molecular — and then comparing to the lesson notes. Apply spaced repetition (Ebbinghaus's forgetting curve) by revisiting the Hardy-Weinberg formula and the chi-squared formula at expanding intervals; both are mark-rich and easy to forget without regular drill. Interleave allopatric and sympatric speciation questions in the same session to force discrimination between the two mechanisms; AQA reliably rewards explicit naming of the mechanism and the isolating-barrier type.
Closing
Populations and evolution is the topic that unifies A-Level Biology — the framework against which every other section of the spec makes evolutionary sense. Start with the chi-squared and Hardy-Weinberg lessons to anchor the quantitative machinery, then walk through natural selection, drift, gene flow, speciation and classification in order. Finish with the molecular evidence and fossil-biogeographic-comparative evidence lessons for the converging-lines argument that strong A* candidates exploit on the extended-response items. The full Populations and Evolution course is course 8 of 11 in the LearningBro AQA A-Level Biology learning path, and the integrative thinking it trains is the conceptual capstone of the entire programme.