You are viewing a free preview of this lesson.
Subscribe to unlock all 10 lessons in this course and every other course on LearningBro.
This lesson covers the normal distribution, the most important continuous probability distribution at A-Level. The normal distribution is used to model a wide range of natural phenomena and is central to statistical inference and hypothesis testing.
The normal distribution is a continuous probability distribution with a characteristic bell-shaped curve. If X follows a normal distribution with mean μ and variance σ2, we write:
X∼N(μ,σ2)
Key properties:
| Region | Percentage of data |
|---|---|
| μ±σ | ≈68% |
| μ±2σ | ≈95% |
| μ±3σ | ≈99.7% |
Exam Tip: The variance is σ2, not σ. When a question says X∼N(50,16), the standard deviation is σ=16=4. A common error is to confuse σ and σ2.
The standard normal distribution has mean 0 and variance 1:
Z∼N(0,1)
Any normal variable X can be standardised to Z using:
Z=σX−μ
This allows us to use standard normal tables to find probabilities for any normal distribution.
To find P(X<a):
Example: X∼N(100,225), find P(X<115).
σ=225=15
z=15115−100=1
P(X<115)=P(Z<1)=Φ(1)=0.8413
Given a probability, find the corresponding value of x:
Example: X∼N(50,9). Find a such that P(X<a)=0.9.
σ=3
From tables: z=1.2816 (since Φ(1.2816)=0.9)
a=50+1.2816×3=53.84
If the mean or standard deviation is unknown, set up equations using given probability information and solve simultaneously.
Example: X∼N(μ,σ2). Given P(X<20)=0.2 and P(X>50)=0.1:
σ20−μ=−0.8416andσ50−μ=1.2816
Solving these simultaneous equations gives μ and σ.
The normal distribution can approximate the binomial distribution when n is large and p is not too close to 0 or 1. The rule of thumb is:
np>5andn(1−p)>5
When approximating B(n,p) by N(np,np(1−p)), apply a continuity correction:
| Binomial | Normal approximation |
|---|---|
| P(X≤r) | P(Y<r+0.5) |
| P(X≥r) | P(Y>r−0.5) |
| P(X=r) | P(r−0.5<Y<r+0.5) |
Exam Tip: Always show the standardisation step when finding normal probabilities. Write P(X<a)=P(Z<σa−μ) explicitly — this gains method marks even if the final answer is wrong.
AQA 7357 specification, Paper 3 — Statistics, Section R (Year 2 content) covers the Normal distribution as a model; find probabilities using a Normal distribution; link to histograms, mean, standard deviation, points of inflection and the binomial distribution; select an appropriate probability distribution for a context, with appropriate reasoning, including recognising when the binomial or Normal model may not be appropriate (refer to the official specification document for exact wording). Section R extends Section O (probability) and Section Q (the binomial distribution from Year 1) and feeds directly into Section S (statistical hypothesis testing). Calculator-based standardisation and inverse-normal queries are the dominant exam tasks; the standard-normal table is no longer required for routine work but appears in the AQA formulae booklet for reference. The Central Limit Theorem is referenced informally — examined explicitly only in the context of "the sample mean is approximately normal".
Question (8 marks):
A machine fills bottles with a nominal volume of 500 ml. The actual volume X ml dispensed is modelled as X∼N(μ,σ2).
(a) Given μ=502 and σ=1.5, find P(X<500). (2)
(b) Find the value of x such that P(X<x)=0.95. (2)
(c) A quality engineer wants P(X<500)=0.02 and P(X>504)=0.05. Find μ and σ. (4)
Solution with mark scheme:
(a) Step 1 — standardise.
Z=σX−μ=1.5500−502=−34≈−1.333
M1 — correct standardisation with the right values substituted. A common slip is to invert the fraction or to use σ2=2.25 in the denominator instead of σ=1.5. The standardisation formula uses σ, not σ2.
Step 2 — read probability.
P(X<500)=P(Z<−1.333)≈0.0912.
A1 — answer to 3 s.f. (calculator output). Acceptable range typically 0.091–0.092.
(b) Step 1 — inverse normal.
We need x with Φ(1.5x−502)=0.95. From inverse normal, z0.95≈1.6449.
M1 — identification of the correct z-value (positive because the cumulative probability exceeds 0.5 — sketch the normal curve and shade the left tail of area 0.95 to confirm sign).
Step 2 — un-standardise.
x=μ+zσ=502+1.6449×1.5≈504.47.
A1 — x≈504 ml (3 s.f.).
(c) Step 1 — translate each probability into a z-equation.
From P(X<500)=0.02:
σ500−μ=z0.02≈−2.0537
From P(X>504)=0.05⇔P(X<504)=0.95:
σ504−μ=z0.95≈1.6449
M1 — both z-values correctly identified, with correct sign on the lower tail (z0.02 is negative — students who write +2.0537 here lose both marks).
Step 2 — solve simultaneously.
500−μ=−2.0537σ and 504−μ=1.6449σ. Subtracting the first from the second:
4=(1.6449−(−2.0537))σ=3.6986σ⟹σ≈1.0815
M1 — eliminating μ to find σ.
Step 3 — back-substitute.
μ=500+2.0537×1.0815≈502.22.
A1 — μ≈502 ml.
A1 — σ≈1.08 ml, with both stated to a sensible accuracy and units carried through.
Total: 8 marks (M4 A4).
Question (6 marks): The lengths L mm of components produced by a process are modelled as L∼N(48,1.62).
(a) Find P(46<L<49). (2)
(b) A component is acceptable if its length is within 2 mm of the mean. Find the probability that a component is acceptable. (2)
(c) Of 200 components, find the expected number that are not acceptable. (2)
Mark scheme decomposition by AO:
(a)
(b)
(c)
Total: 6 marks split AO1 = 4, AO2 = 1, AO3 = 1. Section R questions are AO1-heavy at routine level but pivot to AO3 when the context demands modelling judgement (the rounding step in part (c)).
Connects to:
Section Q — the binomial distribution: when n is large and p is not too close to 0 or 1, Bin(n,p)≈N(np,np(1−p)). The continuity correction P(X≤k)≈P(Y<k+0.5) (with Y continuous-normal) is essential. AQA tests this approximation in Section R when the binomial calculation would otherwise be tedious.
Section S — hypothesis testing for the population mean: the test statistic Z=σ/nXˉ−μ0 is normal under H0 when the population is normal (or large n via CLT). Critical values ±1.96 (two-tailed, 5%) and ±1.6449 (one-tailed) come straight from inverse normal.
Central Limit Theorem (informal Year 2 content): for any distribution with finite variance, the sample mean Xˉn approaches normality as n→∞. This is why the normal distribution shows up everywhere — sums of independent effects tend toward it.
Section P — conditional probability: combining P(A∩B)/P(B) with normal probabilities (e.g., "given that X>50, find P(X>55)") tests both topics. The calculation reduces to a ratio of two upper-tail probabilities.
Statistical inference (university-level extension): confidence intervals Xˉ±1.96σ/n depend on the same z-values as hypothesis testing. The same standardisation underpins both procedures.
Section R questions split AO marks as follows:
| AO | Typical share | Earned by |
|---|---|---|
| AO1 (knowledge / procedure) | 50–60% | Standardising correctly, using the calculator's normal CDF and inverse-normal modes, simplifying to required accuracy |
| AO2 (reasoning / interpretation) | 25–35% | Interpreting context (e.g. "within 2 of the mean"), choosing between standard-normal and N(μ,σ2) form, justifying the use of normal as a model |
| AO3 (problem-solving / modelling) | 10–25% | Solving for μ and σ simultaneously, rounding sensibly, commenting on model fit |
Examiner-rewarded phrasing: "standardising using Z=(X−μ)/σ"; "from inverse normal, z=1.6449 to 4 d.p."; "expected number =np=200×0.211≈42 components". Phrases that lose marks: writing "z=1.6449⟹x=1.6449" (forgetting to un-standardise); using σ2 where σ is needed in standardisation; quoting probabilities to 1 s.f. when 3 s.f. is expected.
A specific AQA pattern: "give your answer to an appropriate degree of accuracy" is itself an AO2 cue — three significant figures for probabilities, two for expected counts of physical objects, integer for "number of components".
Question: X∼N(50,16). Find P(X>58).
Grade C response (~150 words):
σ=16=4. Standardise: Z=(58−50)/4=2.
P(X>58)=P(Z>2)=1−Φ(2)≈1−0.9772=0.0228.
Examiner commentary: Full marks (3/3). The candidate correctly extracts σ=4 from the variance σ2=16, standardises cleanly, and converts the upper-tail query to 1−Φ(z). The answer is given to 3 s.f., which matches AQA's expectation for normal probabilities. Many candidates lose the M1 here by treating the second parameter as σ, computing Z=(58−50)/16=0.5, and producing P(Z>0.5)≈0.309. That is the most common single error on Section R.
Grade A response (~210 words):*
The model is X∼N(50,16), so μ=50 and σ2=16, giving σ=4.
To find P(X>58), standardise to the standard normal:
Z=σX−μ=458−50=2
So P(X>58)=P(Z>2). Using the symmetry and Φ(2)≈0.9772:
Subscribe to continue reading
Get full access to this lesson and all 10 lessons in this course.