You are viewing a free preview of this lesson.
Subscribe to unlock all 10 lessons in this course and every other course on LearningBro.
This lesson covers the fundamentals of hypothesis testing at A-Level. Hypothesis testing is a formal procedure for making decisions about a population parameter based on sample evidence. It is one of the most important topics in A-Level statistics and appears frequently in examinations.
A hypothesis test uses sample data to assess evidence against a claim about a population parameter.
| Term | Definition |
|---|---|
| Null hypothesis H0 | The default assumption; assumed true unless there is strong evidence against it |
| Alternative hypothesis H1 | The claim we are trying to find evidence for |
| Test statistic | A value calculated from the sample data used to decide the outcome |
| Significance level α | The probability of rejecting H0 when it is actually true (usually 5% or 1%) |
| Critical value | The boundary value that determines the rejection region |
| Critical region | The set of values of the test statistic that would lead to rejection of H0 |
| p-value | The probability of obtaining the observed result (or more extreme) assuming H0 is true |
Exam Tip: You must state both hypotheses and your conclusion in the context of the question. Writing "reject H0" without explaining what this means in the real-world context will lose marks.
Tests for a change in one direction only:
Tests for a change in either direction:
If X∼B(n,p), we test H0:p=p0 against H1:p>p0, H1:p<p0, or H1:p=p0.
Example: A manufacturer claims that 10% of items are defective. A sample of 20 items contains 5 defectives. Test at the 5% significance level whether the proportion of defectives has increased.
H0:p=0.1, H1:p>0.1 (one-tailed, right tail)
X∼B(20,0.1) under H0
P(X≥5)=1−P(X≤4)=1−0.9568=0.0432
Since 0.0432<0.05, reject H0. There is sufficient evidence at the 5% level to suggest the proportion of defectives has increased.
| H0 is true | H0 is false | |
|---|---|---|
| Reject H0 | Type I error (probability = α) | Correct decision |
| Do not reject H0 | Correct decision | Type II error |
The significance level α is the probability of a Type I error.
Exam Tip: When writing your conclusion, always use the phrase "there is sufficient evidence to suggest..." or "there is insufficient evidence to suggest..." rather than definitively stating that the hypothesis is true or false. Hypothesis tests assess evidence — they do not prove anything.
AQA 7357 specification, Paper 3 — Statistics, Section S: Statistical Hypothesis Testing. AQA requires candidates to "understand and apply the language of statistical hypothesis testing", to "conduct a statistical hypothesis test for the proportion in the binomial distribution B(n,p) using either a one-tailed or two-tailed test, and interpret the results in context". This sub-strand sits inside the Statistics route on Paper 3 and connects forward to Section T (sampling distribution of the sample mean and Z-test for a normal mean), Section R (correlation coefficient hypothesis test for ρ=0) and laterally into Section P (probability) and Section Q (the binomial model itself). The AQA formula booklet lists the binomial probability mass function but does not tabulate cumulative binomial probabilities — candidates must use the calculator's binomial CDF for critical-region work.
Question (8 marks):
A bakery claims that 30% of its sourdough loaves contain visible starter bubbles on the crust. A quality inspector suspects the true proportion is lower and inspects a random sample of 20 loaves, finding that only 2 have visible bubbles.
(a) Stating your hypotheses clearly, test at the 5% significance level whether there is evidence that the proportion is lower than claimed. (6)
(b) State, with a reason, whether your conclusion would change at the 1% significance level. (2)
Solution with mark scheme:
(a) Step 1 — define the variable and state hypotheses.
Let X be the number of loaves with visible bubbles in a sample of 20. Under the bakery's claim, X∼B(20,p).
H0:p=0.3 (the bakery's claim is correct) H1:p<0.3 (the proportion is lower than claimed)
B1 — both hypotheses correctly stated in terms of the population proportion p, not in terms of X or xˉ. A common error is writing H0:X=0.3 — that scores zero because X is a count, not a proportion.
Step 2 — identify the test as one-tailed (lower).
The inspector's suspicion ("lower than claimed") fixes a one-tailed lower test. Significance level α=0.05.
B1 — recognising one-tailed test and stating α.
Step 3 — compute the p-value.
Under H0, X∼B(20,0.3). The observed value is x=2. For a lower-tailed test, the p-value is
P(X≤2∣p=0.3)=∑k=02(k20)(0.3)k(0.7)20−k
Using the binomial CDF: P(X≤2)≈0.0355.
M1 — correct probability statement (cumulative, lower tail, under H0). A1 — correct numerical value to 3 significant figures or better.
Step 4 — compare and decide.
0.0355<0.05, so the result lies in the critical region. Reject H0.
M1 — explicit comparison of p-value with α and a decision about H0.
Step 5 — conclusion in context.
There is sufficient evidence at the 5% significance level to suggest that the true proportion of sourdough loaves with visible bubbles is lower than the bakery's claimed 30%.
A1 — non-assertive conclusion ("sufficient evidence to suggest"), in the context of the question (sourdough loaves, bakery), with the direction of the alternative hypothesis stated.
(b) At α=0.01: 0.0355>0.01, so we do not reject H0.
M1 — correct comparison at the new significance level.
There is insufficient evidence at the 1% level to suggest the proportion is lower than 30%; the conclusion changes.
A1 — context-aware statement of the changed conclusion, with the reason being that the p-value now exceeds α.
Total: 8 marks (B2 M3 A3, split as shown).
Question (6 marks): A coin is suspected of being biased. It is tossed 30 times and lands heads on 21 occasions. Test, at the 5% significance level, whether there is evidence that the coin is biased.
Mark scheme decomposition by AO:
Total: 6 marks split AO1 = 4, AO2 = 1, AO3 = 1. AQA awards the AO3 mark only when the conclusion is fully contextualised — "the coin is biased" is acceptable; "reject H0" alone is not.
Connects to:
Section Q — the binomial distribution itself. Hypothesis testing for p is impossible without confidence in B(n,p) probability calculation, the assumptions of independence and constant probability, and the cumulative distribution function. A candidate who cannot compute P(X≤k) accurately cannot run any test.
Section R — correlation coefficient r. The structure "test H0:ρ=0 vs H1:ρ=0 at the 5% level" reuses every element of binomial testing — null/alternative, one- vs two-tailed, p-value or critical-value comparison, contextual conclusion. Only the test statistic changes. AQA explicitly examines this parallel.
Section T — hypothesis test for a normal mean. When the sample size is large enough that Xˉ∼N(μ,σ2/n) approximately, the same testing framework applies with Z=(xˉ−μ0)/(σ/n) as the test statistic. Candidates who internalise the binomial test transfer it cleanly.
Statistical inference (broader). Confidence intervals are the dual of hypothesis tests: a 95% confidence interval for p that excludes p0 corresponds to rejecting H0:p=p0 at the 5% level (two-tailed). This duality is implicit at A-Level but explicit at university.
Modelling assumptions. The binomial test assumes n independent trials with constant probability p. If trials are correlated (e.g. multiple loaves from the same batch), the model breaks and inference is invalid. AQA examines this critical-thinking step in synoptic Paper 3 questions.
Hypothesis testing on AQA Paper 3 distributes AO marks distinctively:
| AO | Typical share | Earned by |
|---|---|---|
| AO1 (knowledge / procedure) | 50–60% | Stating H0 and H1 correctly, computing the p-value or critical value, identifying one- vs two-tailed |
| AO2 (reasoning / interpretation) | 20–30% | Choosing the correct tail, comparing p-value with α, justifying rejection or non-rejection |
| AO3 (problem-solving / modelling) | 15–25% | Interpreting the conclusion in the context of the question, commenting on model validity, addressing changes in α |
Examiner-rewarded phrasing: "there is sufficient/insufficient evidence at the X% level to suggest …"; "since the p-value (0.0355) is less than the significance level (0.05), we reject H0"; "in the context of this problem, we conclude that …". Phrases that lose marks: "the bakery is lying" (assertive, not contextual); "accept H0" (H0 is never accepted, only "not rejected"); "H0:xˉ=0.3" (wrong variable — hypotheses are about the parameter p, not the statistic).
A specific AQA pattern to watch: a question saying "test at the 5% significance level" without specifying one- or two-tailed expects you to read the alternative hypothesis from the wording. "Suspects the proportion is lower" is one-tailed lower; "wonders whether the proportion has changed" is two-tailed. Mis-reading direction costs the entire AO2 mark.
Question: Write down the null and alternative hypotheses for testing whether a coin showing heads on 12 of 20 tosses is biased toward heads. Identify whether the test is one-tailed or two-tailed.
Grade C response (~150 words):
Let p be the probability of heads.
H0:p=0.5 H1:p>0.5
This is a one-tailed test because we are testing whether the coin is biased toward heads (a specific direction).
Examiner commentary: Full marks (3/3). The candidate correctly states H0 and H1 in terms of the parameter p, identifies the one-tailed direction, and gives a one-line justification linking "biased toward heads" to "p>0.5". The reasoning is brief but every step is correct. This is the standard Grade C answer for a procedural question — efficient and correctly contextualised.
Grade A response (~190 words):*
Define p as the population probability that the coin lands heads on a single toss, modelling X∼B(20,p) where X is the number of heads in 20 tosses.
H0:p=0.5 (the coin is fair) H1:p>0.5 (the coin is biased toward heads)
The phrase "biased toward heads" specifies the direction of departure from fairness, so this is a one-tailed (upper) test. The entire significance level is concentrated in the upper tail of the binomial distribution under H0.
Subscribe to continue reading
Get full access to this lesson and all 10 lessons in this course.