You are viewing a free preview of this lesson.
Subscribe to unlock all 10 lessons in this course and every other course on LearningBro.
This lesson covers the normal approximation to the binomial distribution as required by the Edexcel A-Level Mathematics specification (9MA0), Paper 3 Section A -- Statistics. You need to understand when and why this approximation is used, how to apply it with a continuity correction, and how to calculate probabilities.
When n is large, calculating binomial probabilities involves large factorials and summing many terms. The normal distribution provides a simpler approximation.
The approximation is appropriate when:
These conditions ensure the binomial is roughly symmetric and bell-shaped.
Exam Tip: Always check np > 5 and n(1 - p) > 5 and state your conclusion.
If X ~ B(n, p) and conditions are met:
X is approximately N(mu, sigma²) where:
X ~ B(100, 0.4): np = 40 > 5, n(1-p) = 60 > 5. So X approximately N(40, 24).
The binomial is discrete, the normal is continuous. We adjust by +/- 0.5:
| Binomial probability | Normal approximation |
|---|---|
| P(X = r) | P(r - 0.5 < Y < r + 0.5) |
| P(X ≤ r) | P(Y < r + 0.5) |
| P(X < r) | P(Y < r - 0.5) |
| P(X ≥ r) | P(Y > r - 0.5) |
| P(X > r) | P(Y > r + 0.5) |
Think of each integer r as occupying the interval (r - 0.5, r + 0.5).
Exam Tip: The continuity correction is essential. Forgetting it is a very common error.
X ~ B(80, 0.3). Find P(X ≤ 20).
mu = 24, sigma² = 16.8, sigma = 4.099.
P(X ≤ 20) --> P(Y < 20.5)
Z = (20.5 - 24)/4.099 = -0.854
P(Z < -0.854) = 1 - 0.803 = 0.197
X ~ B(200, 0.45). Find P(X ≥ 100).
mu = 90, sigma = 7.036.
P(X ≥ 100) --> P(Y > 99.5)
Z = (99.5 - 90)/7.036 = 1.350
P(Z > 1.350) = 1 - 0.9115 = 0.0885
X ~ B(50, 0.5). Find P(X = 25).
mu = 25, sigma = 3.536.
P(X = 25) --> P(24.5 < Y < 25.5)
Z1 = -0.141, Z2 = 0.141
P = 0.5557 - 0.4443 = 0.111 (exact binomial gives 0.1123 -- good approximation).
Edexcel 9MA0-03 specification, Statistics section 4 — Statistical distributions, sub-strand 4.4 and Section 7 — The normal distribution, sub-strand 7.5 covers select an appropriate probability distribution for a context, with appropriate reasoning, including recognising when the binomial or normal model may not be appropriate (refer to the official specification document for exact wording). The normal approximation to the binomial is examinable on 9MA0-03 Paper 3 (Statistics and Mechanics), sitting at the join between the binomial distribution (Year 1) and the normal distribution (Year 2). It is also synoptic with Section 5 (Statistical hypothesis testing) because larger n tests are routinely conducted using the normal approximation. The Edexcel formula booklet lists the normal distribution but does not state the approximation rule — students must recall X∼B(n,p)≈N(np,np(1−p)) and the continuity correction ±0.5 from memory.
Question (8 marks):
The random variable X∼B(100,0.4).
(a) Explain why X may be approximated by a normal distribution, stating the parameters of the approximating distribution. (3)
(b) Use a normal approximation, with continuity correction, to estimate P(X≤45). (4)
(c) State one reason why the value found in (b) is an approximation rather than an exact probability. (1)
Solution with mark scheme:
(a) Step 1 — justify the approximation.
Since n=100 is large and p=0.4 is not close to 0 or 1, both np=40 and n(1−p)=60 exceed 10, so the binomial distribution is approximately symmetric and the normal approximation is appropriate.
B1 — stating that n is large with p not extreme, or equivalently that np>5 and n(1−p)>5. Reasons in words score; numerical values without justification do not.
Step 2 — state the parameters.
μ=np=100×0.4=40 and σ2=np(1−p)=100×0.4×0.6=24.
B1 — correct mean.
B1 — correct variance 24 (or standard deviation σ=24≈4.899). So Y∼N(40,24).
(b) Step 1 — apply the continuity correction.
P(X≤45)≈P(Y<45.5), where Y∼N(40,24).
M1 — correct continuity correction in the right direction (upper bound +0.5 because X≤45 is inclusive on the upper end).
Step 2 — standardise.
Z=2445.5−40=4.89905.5≈1.1227
M1 — correct standardisation using μ=40 and σ=24.
Step 3 — read tables / use calculator.
P(Z<1.1227)≈0.8692.
A1 — correct probability value to at least 3 significant figures.
A1 — final stated probability P(X≤45)≈0.869 in context.
(c) Step 1 — explain the approximation source.
The binomial distribution is discrete whereas the normal distribution is continuous, so any conversion via continuity correction is inherently an approximation; agreement is best when n is large.
B1 — any one valid reason (discreteness vs continuity; finite n; tail behaviour differs slightly).
Total: 8 marks (B3 M2 A2 B1).
Question (6 marks): A factory produces components, of which historically 30% are classified as premium grade. A random sample of 200 components is taken; let X denote the number that are premium grade.
(a) Write down the exact distribution of X and justify the use of a normal approximation. (2)
(b) Using a suitable approximation with continuity correction, estimate P(55≤X≤70). (4)
Mark scheme decomposition by AO:
(a)
(b)
Total: 6 marks split AO1 = 4, AO2 = 2. This is balanced between procedural fluency (standardisation) and reasoning (interpreting the inclusive interval and correcting both endpoints in the right direction).
Connects to:
Section 4 — The binomial distribution: the approximation begins from X∼B(n,p). Confidence with the binomial pmf and its mean np and variance np(1−p) is prerequisite. Without these formulas memorised, the approximating normal cannot be parameterised.
Section 7 — The normal distribution: the entire machinery of standardisation Z=(Y−μ)/σ and looking up Φ(z) is reused unchanged. The approximation is essentially a parameter mapping from (n,p) to (μ,σ2); everything afterwards is normal-distribution practice.
Central limit theorem (informal at A-Level, formal at university): the binomial is a sum of n independent Bernoulli trials, so the CLT says the standardised sum tends to a standard normal. The approximation is the CLT applied at a fixed (large) n rather than as a limit.
Section 5 — Statistical hypothesis testing: for tests on a binomial proportion with large n, the test statistic is constructed using the normal approximation. Critical regions and p-values rely on the same standardisation step used here.
Modelling cycle (AO3): choosing the approximation is a modelling decision — students must justify why the binomial is acceptable, why the normal approximation is appropriate, and what the continuity correction compensates for. This is precisely the AO3 reasoning Edexcel rewards.
Normal-approximation questions on 9MA0-03 spread AO marks more evenly than pure procedural questions:
| AO | Typical share | Earned by |
|---|---|---|
| AO1 (knowledge / procedure) | 50–60% | Computing μ=np, σ2=np(1−p), applying standardisation, reading normal-table values |
| AO2 (reasoning / interpretation) | 25–35% | Justifying use of approximation, choosing correct continuity correction direction, interpreting result in context |
| AO3 (problem-solving / modelling) | 10–20% | Selecting between exact binomial and normal approximation, stating limitations, contextual conclusions |
Examiner-rewarded phrasing: "since n is large and p is not close to 0 or 1"; "applying a continuity correction because the binomial is discrete and the normal is continuous"; "let Y∼N(np,np(1−p))". Phrases that lose marks: writing "X∼N(np,np(1−p))" — the original variable is still binomial, only the approximating variable is normal; omitting the continuity correction "to save time"; using σ2=np (confusing with Poisson).
A specific Edexcel pattern to watch: when the question says "use a suitable approximation", students must state which approximation is being used. Silently swapping in a normal model and computing a probability scores the procedural marks but loses the AO2 marks for justification.
Question: X∼B(80,0.5). Use a normal approximation with continuity correction to estimate P(X≥45).
Grade C response (~190 words):
X∼B(80,0.5) so μ=80×0.5=40 and σ2=80×0.5×0.5=20.
So Y∼N(40,20).
P(X≥45)≈P(Y>44.5).
z=(44.5−40)/20=4.5/4.472≈1.006.
P(Z>1.006)=1−0.8427=0.1573.
So P(X≥45)≈0.157.
Examiner commentary: Full marks (3/3). The candidate identifies the parameters correctly, applies the continuity correction in the right direction (X≥45 becomes Y>44.5 because 45 is included in the original event so we shift down to 44.5), and standardises accurately. The final probability is given to three significant figures, matching examiner expectations. Working is brief but every step is verifiable. A typical Grade C answer for a procedural question — efficient and correct.
Grade A response (~240 words):*
The exact distribution is X∼B(80,0.5). Since n=80 is large and p=0.5, both np=40 and n(1−p)=40 comfortably exceed 10, so the normal approximation is appropriate.
Approximating: Y∼N(μ,σ2) with μ=np=40 and σ2=np(1−p)=20, giving σ=20=25.
Apply the continuity correction. Because the event X≥45 includes the value 45, the corresponding continuous event is Y>44.5 (we shift the boundary down by 0.5 to capture the discrete probability mass at X=45):
P(X≥45)≈P(Y>44.5)
Standardise:
Z=2544.5−40=254.5≈1.0062
Subscribe to continue reading
Get full access to this lesson and all 10 lessons in this course.