You are viewing a free preview of this lesson.
Subscribe to unlock all 10 lessons in this course and every other course on LearningBro.
This lesson covers hypothesis testing for a population mean using the normal distribution as required by the Edexcel A-Level Mathematics specification (9MA0), Paper 3 Section A -- Statistics. You need to test whether a sample provides evidence of a change in the population mean, using a test statistic and critical values.
Use this test when:
Z = (X-bar - mu0) / (sigma / sqrt(n))
where X-bar is the sample mean, mu0 is the hypothesised mean, sigma is the known population SD, and n is the sample size.
Standard error = sigma / sqrt(n)
As n increases, the standard error decreases -- larger samples give more precise estimates.
A manufacturer claims mean lifetime is 1200 hours (sigma = 100). A sample of 25 gives mean 1160 hours. Test at 5%.
H0: mu = 1200, H1: mu < 1200.
SE = 100/sqrt(25) = 20. Z = (1160 - 1200)/20 = -2.0.
Critical value (5%, one-tailed): -1.6449.
-2.0 < -1.6449 --> reject H0. Sufficient evidence the mean is less than 1200 hours.
p-value: P(Z < -2.0) = 0.0228 < 0.05. Same conclusion.
Old mean score was 65 (sigma = 10). New method with n = 40 gives mean 68. Test at 5%.
H0: mu = 65, H1: mu > 65.
SE = 10/sqrt(40) = 1.581. Z = (68 - 65)/1.581 = 1.897.
Critical value: 1.6449. 1.897 > 1.6449 --> reject H0. Sufficient evidence the mean has increased.
A company claims mean volume is 330 ml (sigma = 5). Inspector tests 50 bottles, gets mean 328.5. Test at 1%.
H0: mu = 330, H1: mu ≠ 330. alpha/2 = 0.005.
SE = 5/sqrt(50) = 0.707. Z = (328.5 - 330)/0.707 = -2.121.
Critical values: +/-2.5758. -2.121 is between -2.5758 and 2.5758.
Do not reject H0. Insufficient evidence at 1% level that the mean differs from 330 ml.
| Significance level | One-tailed | Two-tailed |
|---|---|---|
| 10% | +/-1.2816 | +/-1.6449 |
| 5% | +/-1.6449 | +/-1.9600 |
| 2.5% | +/-1.9600 | +/-2.2414 |
| 1% | +/-2.3263 | +/-2.5758 |
Exam Tip: Memorise z = 1.6449 (5% one-tailed), z = 1.9600 (5% two-tailed), z = 2.5758 (1% two-tailed).
When n = 1: Z = (x - mu0) / sigma (standard error = sigma).
Fish weight ~ N(2.5, 0.09). A fish weighs 3.2 kg. Test at 5% whether mean is higher.
Z = (3.2 - 2.5)/0.3 = 2.333. Critical value 1.6449. 2.333 > 1.6449 --> reject H0.
Always relate the conclusion to the original context. Include:
Good examples:
"There is sufficient evidence at the 5% significance level to conclude that the new fertiliser increases the mean yield of wheat."
"There is insufficient evidence at the 1% significance level to conclude that the mean delivery time has changed from 3 days."
Both tests follow the same five-step framework:
The difference:
Edexcel 9MA0-03 specification section 8 — Statistical hypothesis testing, sub-strands 8.1, 8.2 and 8.3 covers conduct a statistical hypothesis test for the mean of a normal distribution with known, given or assumed variance and interpret the results in context (refer to the official specification document for exact wording).; "Extend ideas of hypothesis testing to test for zero correlation"; "Understand and apply the language of statistical hypothesis testing." This sub-strand sits in Paper 3 — Statistics and Mechanics but builds directly on Year 1 Section 6 (Statistical distributions, Normal distribution) and Year 1 Section 7 (Hypothesis testing for the binomial proportion). The Edexcel formula booklet provides the standard normal CDF tables and the critical-value tables for the product-moment correlation coefficient (PMCC); the Z-test statistic itself must be constructed by the candidate.
Question (8 marks):
A factory fills jars labelled 500 g. The fill weight X grams is modelled as X∼N(μ,42) with σ=4 taken as known from long-run process data. A random sample of n=25 jars has mean xˉ=502.1 g. Test, at the 5% significance level, whether the mean fill weight differs from 500 g. (8)
Solution with mark scheme:
Step 1 — state hypotheses.
H0:μ=500, H1:μ=500 (two-tailed, since we are testing whether the mean differs from 500).
B1 — both hypotheses correctly stated, in terms of the population mean μ (not xˉ). A common slip is writing H0:xˉ=500 — the hypothesis is always about the parameter, never about the sample statistic.
Step 2 — state the distribution of the sample mean under H0.
Under H0, Xˉ∼N(500,2542)=N(500,0.64), so the standard error is σ/n=4/5=0.8.
M1 — correct sampling distribution of the mean with variance σ2/n, not σ2.
Step 3 — compute the test statistic.
z=σ/nxˉ−μ0=0.8502.1−500=0.82.1=2.625
M1 — substitution into the Z-formula. A1 — value z=2.625 to at least 3 s.f.
Step 4 — identify the critical region.
Two-tailed test at 5%: critical values are ±z0.025=±1.96. Reject H0 if ∣z∣>1.96.
B1 — correct critical value(s) with the two-tailed split (2.5% in each tail, not 5%).
Step 5 — compare and conclude.
Since 2.625>1.96, the test statistic lies in the critical region. Reject H0.
M1 — valid comparison of z with the critical value (or equivalently p with α).
A1 — conclusion in context: "There is sufficient evidence at the 5% level to suggest that the mean fill weight differs from 500 g."
A1 — explicit reference to context (jars / fill weight), not a bare "reject H0".
Total: 8 marks (B2 M3 A3, split as shown).
Question (6 marks): The reaction time T seconds of trained athletes to a starting signal is modelled as T∼N(μ,0.052). A coach claims a new training programme reduces the mean reaction time below the historical value of 0.18 s. A sample of n=16 athletes after the programme gives tˉ=0.165 s. Test the coach's claim at the 1% significance level.
Mark scheme decomposition by AO:
Total: 6 marks split AO1 = 3, AO2 = 2, AO3 = 1. Notice the AO3 mark for a contextual conclusion — examiners reserve AO3 marks for the candidate who connects the statistical decision back to the original claim under test.
Connects to:
Year 1 Section 6 — Normal distribution: the test relies on the linearity property Xˉ=n1∑Xi∼N(μ,σ2/n), which is exact when each Xi is normal. Computing P(Xˉ>502.1) uses the standard normal table from Year 1.
Year 1 Section 7 — Binomial hypothesis test: the logic — null/alternative, significance level, critical region, conclusion — is identical. Only the test statistic and its distribution change. A common Paper 3 stem mixes both: a binomial test in (a), a normal-mean test in (b).
Year 2 Section 4 — Correlation: the PMCC test H0:ρ=0 vs H1:ρ=0 uses critical r-values from the formula booklet. Same hypothesis-testing framework, different statistic.
Central limit theorem (background, not examined directly): even when Xi is not normal, Xˉ is approximately normal for large n. The 9MA0 spec assumes normality is given — but understanding why the test still works for non-normal X is the conceptual bridge to A-level Further Maths and undergraduate statistics.
Statistical inference (whole topic): the Z-test is the simplest example of a likelihood-based decision rule. Confidence intervals xˉ±z∗⋅σ/n are the dual of the test — accepting H0 at level α is equivalent to μ0 lying inside the (1−α) confidence interval.
Hypothesis-testing questions on 9MA0-03 split AO marks roughly as:
| AO | Typical share | Earned by |
|---|---|---|
| AO1 (knowledge / procedure) | 50–60% | Stating hypotheses correctly, computing the standard error, evaluating the Z statistic, identifying critical values from tables |
| AO2 (reasoning / interpretation) | 25–35% | Choosing one- vs two-tailed; comparing test statistic to critical region; recognising the sampling distribution |
| AO3 (problem-solving / modelling) | 10–20% | Conclusion in context; assessing whether the modelling assumption (known σ, normal X) is reasonable |
Examiner-rewarded phrasing: "Under H0, Xˉ∼N(μ0,σ2/n)"; "There is sufficient/insufficient evidence at the α% level to support the claim that …"; "The test statistic lies in / does not lie in the critical region." Phrases that lose marks: "accept H0" (the correct phrase is do not reject); "H0 is true" (we never prove H0); a bare "reject H0" with no reference to the original context (loses the contextual A1).
A specific Edexcel pattern: when the question gives the significance level as a percentage and the test is two-tailed, candidates must split the level — 5% two-tailed means 2.5% in each tail, critical values ±1.96. Forgetting to halve is the single most-penalised slip.
Question: The IQ scores of pupils at a school are modelled as N(μ,152). A sample of n=36 pupils has mean xˉ=105. State the distribution of Xˉ under H0:μ=100 and compute the test statistic z.
Grade C response (~180 words):
Under H0, the sample mean is normally distributed with mean 100 and variance 152/36=6.25. So Xˉ∼N(100,6.25).
The test statistic is z=(105−100)/6.25=5/2.5=2.
Subscribe to continue reading
Get full access to this lesson and all 10 lessons in this course.