You are viewing a free preview of this lesson.
Subscribe to unlock all 10 lessons in this course and every other course on LearningBro.
The genetic code is the set of rules by which information encoded in the sequence of DNA bases is translated into sequences of amino acids in proteins. It is one of the most elegant and universal features of biology. This lesson covers the OCR A-Level Biology A specification point 2.1.3 (f) — the nature of the genetic code, including the key descriptors: triplet, degenerate, non-overlapping and (nearly) universal.
There are four bases in DNA (A, T, C, G) and 20 amino acids commonly found in proteins. The minimum number of bases required to specify one amino acid — assuming a fixed number per amino acid — can be worked out using powers of 4:
| Bases per codon | Possible combinations | Enough for 20 amino acids? |
|---|---|---|
| 1 | 4¹ = 4 | No — only 4 amino acids |
| 2 | 4² = 16 | No — only 16 amino acids |
| 3 | 4³ = 64 | Yes — with room to spare |
| 4 | 4⁴ = 256 | Yes, but wastefully so |
A triplet code — three bases per amino acid — is therefore the smallest workable unit that can specify all 20 amino acids. Evolution has converged on this solution: every cell in every organism on Earth uses a three-base codon.
Key Definition — Codon: A triplet of bases in mRNA (or a gene) that codes for a single amino acid (or for starting/stopping translation).
The OCR specification requires you to understand four specific descriptors of the genetic code:
Each amino acid is specified by a sequence of three consecutive bases — a codon on mRNA (or a triplet on the coding strand of DNA). There are 64 possible codons (4 × 4 × 4).
| 5' → 3' mRNA codon | AUG | GCA | UUU | UGG | UAA |
|---|---|---|---|---|---|
| Amino acid | Met | Ala | Phe | Trp | STOP |
With 64 codons but only 20 amino acids, there are more codons than amino acids. This means:
Key Definition — Degenerate code: A code in which more than one codon can specify the same amino acid.
Biological importance of degeneracy: Because several codons code for the same amino acid, some point mutations (changes in a single base) are "silent" — they produce a different codon that still codes for the same amino acid. The degenerate code therefore provides some resistance to the harmful effects of mutations.
The bases of the genetic code are read in sequence, one codon after another, without any overlap between codons. Each base belongs to only one codon; each codon is read separately.
Correct reading (non-overlapping — used in life): the mRNA bases are read as a sequence of adjacent, non-overlapping triplets. Each base belongs to exactly one codon.
| Codon 1 | Codon 2 | Codon 3 | Codon 4 |
|---|---|---|---|
| AUG | UUC | GCA | UGA |
| Met | Phe | Ala | STOP |
Hypothetical overlapping code (NOT used): if codons could share bases, the same nucleotide sequence would generate a different (interleaved) set of codons. Biology does not use this scheme — overlapping codes would constrain mutation tolerance and force most adjacent amino-acid pairs to be biochemically related, which is not observed.
| Codon 1 | Codon 2 | Codon 3 | Codon 4 |
|---|---|---|---|
| AUG | UGU | GUU | UUC |
Consequence: the reading frame matters enormously. If a base is inserted or deleted (a frameshift mutation), every codon downstream of the change is read in a different frame and the amino acid sequence becomes entirely wrong. This is much more damaging than a single-base substitution.
Exam Tip: Questions often ask about the effects of different types of mutation. A substitution changes only one codon (and may be silent if the code is degenerate). An insertion or deletion causes a frameshift, changing every subsequent codon — nearly always catastrophic.
The same codons specify the same amino acids in (almost) all organisms — bacteria, archaea, plants, fungi and animals.
Biological significance:
Exam Tip: Be careful with "universal". There are a few very minor exceptions — for example, mitochondria and some unicellular organisms have small variations. At A-Level you can describe the code as "nearly universal" or "universal with a few rare exceptions" for an extra mark of precision.
| Property | Meaning | Consequence |
|---|---|---|
| Triplet | Three bases per amino acid | 64 codons, enough to code for all 20 amino acids |
| Degenerate | Most amino acids coded by more than one codon | Some point mutations are silent |
| Non-overlapping | Each base belongs to only one codon | Insertions/deletions cause damaging frameshifts |
| Universal | The same codons mean the same amino acids in nearly all organisms | Evidence for evolution; enables genetic engineering |
graph LR
A[Genetic code] --> B[Triplet]
A --> C[Degenerate]
A --> D[Non-overlapping]
A --> E[Universal]
B --> F["64 codons<br/>code for 20 amino acids"]
C --> G[Most AAs have multiple codons]
D --> H[Reading frame critical]
E --> I[Enables genetic engineering]
Given a short mRNA sequence, you should be able to identify the amino acids it codes for using a codon table.
Example: 5' — AUGGCAUUUUGGUAA — 3'
Split into codons starting at the AUG:
Polypeptide: Met–Ala–Phe–Trp.
Model answer for (3): "The genetic code is degenerate, meaning that most amino acids are coded for by more than one codon. A point mutation may change one codon to another codon that codes for the same amino acid, so the amino acid sequence of the protein is unchanged. This is called a silent mutation."
Spec Mapping: This lesson is mapped to OCR H420 Module 2.1.3 — Nucleotides and nucleic acids, covering the nature of the genetic code (triplet, degenerate, non-overlapping, universal) and the relationship between gene base sequence and polypeptide amino-acid sequence (refer to the official OCR H420 specification document for exact wording).
The genetic code links DNA structure (Lesson 2) to protein synthesis (Lessons 6–7) and is the conceptual bridge between molecular biology and the genetics topics of Module 6.1. The four code properties — triplet, degenerate, non-overlapping, (nearly) universal — are essential AO1 mark points. Questions about mutations (Module 6.1) hinge on understanding the consequences of code properties: degeneracy allows silent mutations; non-overlap means insertions and deletions cause frameshifts; universality is why genetic engineering between species is possible.
The genetic code was deciphered through a coordinated experimental programme between 1961 and 1966.
Marshall Nirenberg and Heinrich Matthaei (1961, NIH) created a cell-free protein-synthesis system and added synthetic poly-U RNA, obtaining a polypeptide composed exclusively of phenylalanine. This established that the codon UUU codes for Phe — the first codon assignment. The school of thought to take into the exam: "synthetic templates of known sequence let you read out the code one codon at a time".
Har Gobind Khorana (1960s) used chemically synthesised RNAs of defined repeating sequence (e.g. poly-UC, alternating U and C) to assign codons whose sequence Nirenberg's random copolymers could not isolate. By 1966 the full codon table was complete: 61 sense codons + 3 stop codons + the AUG start.
Francis Crick, Sydney Brenner, Leslie Barnett and Richard Watts-Tobin (1961) used acridine-induced frameshift mutations in T4 bacteriophage to demonstrate that the code is read in non-overlapping triplets. Their experimental logic — that three nearby insertions (or three nearby deletions) restored function while one or two did not — is one of the canonical demonstrations of triplet code.
Crick's "wobble hypothesis" (1966): proposed that the third base of a codon pairs less strictly with its tRNA anticodon, explaining the structural basis of degeneracy. Most synonymous codons differ only in the third position.
Nirenberg, Khorana and Robert Holley shared the 1968 Nobel Prize in Physiology or Medicine for cracking the genetic code. Paraphrase the schools of thought — do not invent quotation.
This lesson connects forward to:
ocr-alevel-biology-nucleic-acids-enzymes — Transcription (Lesson 6): the mRNA codon table you learn here is read off the template strand of DNA by RNA polymerase.ocr-alevel-biology-nucleic-acids-enzymes — Translation (Lesson 7): tRNA anticodons read mRNA codons; the genetic code is the rule-book that links the two.ocr-alevel-biology-genetics-inheritance — Mutations: silent, missense, nonsense and frameshift mutations are defined relative to the code properties. Sickle-cell anaemia (E6V missense in β-globin) and cystic fibrosis (ΔF508 deletion in CFTR) are canonical examples.ocr-alevel-biology-genetics-inheritance — Genetic engineering: universality is the load-bearing reason why human insulin can be produced in E. coli — the bacterium reads human DNA codons identically.ocr-alevel-biology-biological-molecules — Protein structure: primary structure (the amino-acid sequence specified by the codons) determines tertiary structure (Anfinsen's principle), which determines function.Question (6 marks): Describe what is meant by the genetic code being triplet, degenerate, non-overlapping and (nearly) universal, and explain why each property is biologically important.
Mark scheme decomposition (AO breakdown):
Subscribe to continue reading
Get full access to this lesson and all 10 lessons in this course.