You are viewing a free preview of this lesson.
Subscribe to unlock all 8 lessons in this course and every other course on LearningBro.
Genetic fingerprinting (also called DNA profiling) is a technique used to identify individuals on the basis of variation at highly polymorphic regions of the genome. Except for monozygotic twins (whose DNA is identical at the level of genome sequence), no two people share the same DNA fingerprint at the resolution of modern STR-based systems. The technique has become indispensable in forensic science, paternity testing, immigration, disaster victim identification, conservation genetics and archaeology. The molecular basis is variation in repeat-DNA copy number; the practical technology depends almost entirely on the PCR and gel electrophoresis methods introduced in Lesson 5.
Spec mapping: This lesson sits in AQA 7402 Section 3.8.4 — Gene technologies allow the study and alteration of gene function. The specification expects candidates to describe the principle of genetic fingerprinting (DNA probes hybridising with VNTR/STR loci, separation by gel electrophoresis, generation of a unique banding pattern) and to evaluate applications and ethical considerations. (Refer to the official AQA specification document for exact wording.)
By the end of this lesson you should be able to: explain why non-coding tandem-repeat loci are so much more variable between individuals than coding sequences; describe the modern multiplex-PCR-plus-capillary-electrophoresis profiling workflow step by step; contrast it with the original Southern-blot VNTR method; calculate a multilocus random-match probability from per-locus frequencies; apply allele inheritance to a paternity problem; and evaluate the ethical, legal and statistical limits of DNA evidence.
Students often quote a figure such as "one in a billion" for a DNA match without understanding where it comes from. The number is not a property of any single locus; it is the product of the frequencies at many independent loci, and working one through makes the statistical logic concrete.
The reasoning rests on two population-genetics ideas that are examined synoptically with inheritance and populations. First, at a single locus, if an allele has frequency p in the population, then under Hardy–Weinberg expectations the frequency of a homozygous genotype is p2, and the frequency of a heterozygous genotype carrying two alleles of frequencies p and q is 2pq. Second, because the STR loci used in profiling lie on different chromosomes (or far apart on the same one), the genotypes at different loci are inherited essentially independently, so their frequencies multiply — this is the assumption of linkage equilibrium.
Take a simplified profile at four loci, and suppose the individual's genotype frequency works out at each locus as follows: locus one, a heterozygote at frequency 0.10; locus two, a heterozygote at 0.08; locus three, a homozygote at 0.04; locus four, a heterozygote at 0.05. The probability that a random unrelated person shares this exact four-locus genotype is the product:
0.10×0.08×0.04×0.05=1.6×10−5
That is roughly a one-in-sixty-thousand chance — already small, but not yet compelling on its own, because sixty thousand people is fewer than a small city. The power of a real forensic panel comes from adding loci. The UK DNA-17 system multiplies frequencies across seventeen loci; because each locus multiplies the running product by a small fraction, the combined random-match probability typically falls below 10−15 — far smaller than the entire human population, so that in practice only a monozygotic twin would be expected to share the full profile.
Two cautions belong in any top-band answer that quotes such a figure. First, the multiplication is only valid if the loci really are independent and the population frequencies are correct for the relevant sub-population; close relatives share alleles, so the random-match probability for a sibling is much higher than for an unrelated person. Second, the tiny random-match probability is not the same as the probability that the suspect is innocent: laboratory contamination, sample mix-ups and mixture-interpretation errors occur at rates that can far exceed 10−15, so in a well-run case the realistic dominant source of doubt is procedural error, not coincidence. Communicating both numbers honestly is part of the ethical use of DNA evidence.
Although approximately 99.9% of human DNA sequence is identical between unrelated individuals, the remaining 0.1% contains highly variable regions. Much of this variation falls in repetitive DNA — regions where a short sequence of bases is repeated multiple times in tandem (head-to-tail).
These tandem-repeat regions are far more variable between individuals than coding sequences because mutations that add or remove repeat units (by slippage during replication or by unequal crossing over during meiosis) accumulate freely there — they typically have no functional consequence, so selection does not eliminate them. The result is that the number of repeats at any given locus differs widely between individuals.
Key Definition: Short tandem repeats (STRs) are sequences of 2–6 base pairs that are repeated a variable number of times at specific loci in the genome. The number of repeats at each locus varies between individuals.
flowchart TD
A["Biological sample (blood, saliva, hair, skin cells)"] --> B["DNA extraction"]
B --> C["Multiplex PCR: 17 STR loci + amelogenin, primers fluorescently labelled"]
C --> D["Capillary electrophoresis: fragments separated by size"]
D --> E["Laser detection of fluorescent peaks"]
E --> F["Electropherogram: alleles at each STR locus"]
F --> G["Compare profile with reference / suspect / database / parent"]
G --> H["Match probability calculated → identity / paternity / exclusion"]
If the allele frequencies at each STR locus are known from population databases, the probability of a random unrelated individual sharing the full multilocus profile can be calculated by multiplying the per-locus genotype frequencies (assuming linkage equilibrium and Hardy–Weinberg equilibrium).
With 17 STR loci, typical match probabilities are <10⁻¹⁵ — vanishingly small, far smaller than the world's population. This is the statistical foundation of forensic DNA evidence.
The original DNA fingerprinting method, devised by Alec Jeffreys and his colleagues at Leicester, used Southern blotting to detect VNTRs. Although largely superseded by PCR-based methods, Southern blotting remains an important benchmark for understanding the principles and is still examined.
| Feature | Southern blot (Jeffreys, 1984) | Modern STR / capillary electrophoresis |
|---|---|---|
| Repeat type detected | VNTRs (minisatellites, 10-60 bp units) | STRs (microsatellites, 2-6 bp units) |
| Starting DNA required | Microgram quantities | Picogram quantities (single cells) |
| Sensitivity to degraded DNA | Poor (large fragments needed) | Good (small fragments amplify easily) |
| Time to result | 1-3 weeks | 1-2 days |
| Automation | Manual | Fully automated, multiplex |
| Throughput | Few samples per week | Thousands of samples per day |
| Sensitivity | Limited by autoradiography | Sub-femtomolar by fluorescence |
Worked Example — Paternity Analysis:
At a particular STR locus, a child has alleles with 8 and 12 repeats. The mother has alleles with 8 and 10 repeats. A potential father has alleles with 11 and 12 repeats. Is he likely to be the biological father?
Subscribe to continue reading
Get full access to this lesson and all 8 lessons in this course.