You are viewing a free preview of this lesson.
Subscribe to unlock all 10 lessons in this course and every other course on LearningBro.
Transcription is the first stage of gene expression. It converts the digital, double-stranded, base-coded information stored in DNA into a single-stranded, transportable, translatable messenger RNA (mRNA) intermediate. In eukaryotic cells, transcription is spatially separated from translation: the mRNA is synthesised in the nucleus, processed within the nucleus, and then exported through nuclear pores to ribosomes in the cytoplasm or on the rough endoplasmic reticulum. This separation creates an opportunity for extensive post-transcriptional regulation — capping, polyadenylation, splicing, and alternative splicing — that is absent from the simpler prokaryotic system.
Spec mapping: This lesson sits in AQA 7402 Section 3.4.2 — DNA and protein synthesis, with extension into Section 3.8 (control of gene expression) through the role of transcription factors. The relevant content covers the mechanism of transcription, the role of RNA polymerase, and the production of mature mRNA from pre-mRNA in eukaryotes. (Refer to the official AQA specification document for exact wording.)
DNA is the long-term, faithful, double-stranded store of genetic information. It is too valuable to send to the ribosomes — the working copies must therefore be derived from it. Transcription generates those working copies as mRNA, a single-stranded molecule that can:
The relationship between DNA and mRNA is asymmetric in another important way: only one strand of the DNA double helix is used as a template for any given gene. The other strand carries the complementary sequence and is bypassed by the polymerase.
| Strand | Role | Alternative names |
|---|---|---|
| Template strand | Read by RNA polymerase 3′→5′; complementary to mRNA | Antisense strand, non-coding strand |
| Coding strand | Same sequence as mRNA (except T replaces U); not used as template | Sense strand, non-template strand |
Key Definition: Transcription is the process by which the base sequence of one strand of a gene's DNA (the template strand) is used to synthesise a complementary single-stranded messenger RNA molecule. The mRNA is built 5′→3′ from free ribonucleoside triphosphates by the enzyme RNA polymerase.
flowchart TD
A["Initiation: TFs + RNA Pol II bind to promoter (TATA box)"] --> B["Transcription bubble opens"]
B --> C["Elongation: free NTPs added 5'→3'; template read 3'→5'"]
C --> D["DNA re-anneals behind the moving polymerase"]
D --> E["Termination at terminator sequence"]
E --> F["Pre-mRNA released for processing"]
A DNA template strand has the sequence 3′-TAC GCG ATT CCG GGA ATT-5′. The mRNA synthesised from this template by RNA polymerase will have a complementary sequence in the 5′→3′ direction. Write the sequence of the resulting mRNA.
Step 1. Note the template strand is presented 3′→5′ — the direction in which RNA polymerase reads.
Step 2. Build the complementary mRNA 5′→3′, replacing T with U:
| Template (3′→5′) | T | A | C | G | C | G | A | T | T | C | C | G | G | G | A | A | T | T |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| mRNA (5′→3′) | A | U | G | C | G | C | U | A | A | G | G | C | C | C | U | U | A | A |
Step 3. mRNA sequence: 5′-AUG CGC UAA GGC CCU UAA-3′.
Note this happens to contain an AUG start codon, a UAA stop codon (at position 4), and another UAA at the end. In a real gene the start and stop would be widely separated and the intervening codons would form the open reading frame.
The fundamental chemistry is identical, but the regulation, location and processing differ substantially:
| Feature | Prokaryotes | Eukaryotes |
|---|---|---|
| Location | Cytoplasm (no nucleus) | Nucleus |
| RNA polymerase | Single type | Three types — Pol I (rRNA), Pol II (mRNA), Pol III (tRNA + small RNAs) |
| Transcription factors | Largely unnecessary — RNA polymerase binds directly via a σ factor | Required for RNA polymerase to bind the promoter |
| Promoter elements | Pribnow box (−10), −35 element | TATA box (−25 to −30), CpG islands, enhancers |
| mRNA processing | None — translation begins immediately | Extensive — 5′ cap, poly-A tail, splicing |
| Coupling with translation | Yes — ribosomes attach to nascent mRNA while transcription is ongoing | No — mRNA must be processed and exported first |
| Introns | Rare | Common — must be spliced out |
| Operons | Common — multiple genes co-transcribed (e.g. lac operon) | Rare — most genes have individual promoters |
This regulatory complexity is one reason why eukaryotic gene expression is more amenable to fine-grained tissue-specific and developmental control: every step from chromatin remodelling onwards offers a potential regulatory checkpoint.
In eukaryotes the pre-mRNA undergoes three crucial modifications before it can be exported and translated:
flowchart TD
A["Pre-mRNA: Exon1 — Intron1 — Exon2 — Intron2 — Exon3"] --> B["snRNPs assemble at splice sites"]
B --> C["Intron 1 cut at 5' splice site; forms lariat at branch point"]
C --> D["Intron 1 released as lariat; Exon 1 joined to Exon 2"]
D --> E["Process repeats for Intron 2"]
E --> F["Mature mRNA: Exon1-Exon2-Exon3 + 5' cap + poly-A tail"]
F --> G["Export through nuclear pore to cytoplasm"]
Key Definition: Alternative splicing is the process by which different combinations of exons from the same pre-mRNA are joined to produce different mature mRNA molecules and therefore different polypeptides from a single gene.
This is the dominant mechanism by which complex organisms expand their proteome beyond their modest gene count:
In humans, the calcitonin / CGRP gene illustrates tissue-specific alternative splicing:
The same gene, the same genome, the same pre-mRNA — but two functionally distinct proteins generated by tissue-specific splicing decisions.
The fruit fly Dscam (Down syndrome cell adhesion molecule) gene contains four clusters of mutually exclusive exons (12, 48, 33, and 2 alternatives respectively). The combinatorial choice gives 12 × 48 × 33 × 2 = 38,016 possible mRNA variants from a single gene — more than twice the total number of genes in the Drosophila genome. This expansion underlies the specificity of neuronal wiring in the developing fly brain: different neurones express different Dscam isoforms, allowing them to recognise "self" axons (matching Dscam isoform → repulsion to prevent self-contact) while remaining unresponsive to "non-self" axons (different Dscam isoform).
The Dscam example is a striking illustration of how alternative splicing can expand a single gene's functional output to combinatorial levels that exceed what would be achievable by gene duplication alone.
A gene transcribes a 12 kb pre-mRNA containing 8 exons (each ~500 bp) and 7 introns (each ~1 kb). After splicing, what is the size of the mature mRNA?
Subscribe to continue reading
Get full access to this lesson and all 10 lessons in this course.