You are viewing a free preview of this lesson.
Subscribe to unlock all 10 lessons in this course and every other course on LearningBro.
This lesson covers hypothesis testing as applied to real-world data — specifically in the contexts of correlation, proportion, and the mean. The emphasis is on setting up tests correctly, carrying them out using data from the AQA large data set, and interpreting the results in context.
A hypothesis test follows a structured procedure:
| Term | Definition |
|---|---|
| Null hypothesis (H0) | The default assumption — no effect, no difference, no correlation |
| Alternative hypothesis (H1) | The claim we are testing for |
| Significance level (α) | The probability of rejecting H0 when it is true (Type I error) |
| Test statistic | A value calculated from the data that is used to make the decision |
| Critical value | The boundary value — reject H0 if the test statistic exceeds this |
| Critical region | The set of values of the test statistic that lead to rejection of H0 |
| p-value | The probability of obtaining the observed result (or more extreme) under H0 |
This test uses the binomial distribution and is used when we want to test a claim about the probability of success p.
Under H0, the number of successes X∼B(n,p0).
A student claims that the proportion of days with measurable rainfall at Hurn in July is 0.30. From the large data set, in a sample of 31 July days, 14 have measurable rainfall.
H0:p=0.30, H1:p>0.30 (one-tailed test at 5% level)
Under H0, X∼B(31,0.30).
Calculate P(X≥14)=1−P(X≤13).
Using cumulative binomial tables or a calculator: P(X≤13)≈0.9614, so P(X≥14)≈0.0386.
Since 0.0386<0.05, reject H0. There is sufficient evidence at the 5% significance level to suggest that the proportion of rainy days at Hurn in July is greater than 0.30.
The critical region for this test is the set of values of X for which we reject H0. We need the smallest value c such that P(X≥c)≤0.05.
From the cumulative binomial tables: P(X≥14)≈0.0386<0.05, P(X≥13)≈0.0729>0.05.
So the critical region is X≥14, and the actual significance level is 0.0386 (3.86%).
When testing a claim about the population mean μ of a normally distributed variable with known variance:
Under H0, if X∼N(μ,σ2) and we take a sample of size n:
Xˉ∼N(μ0,nσ2)
The test statistic is:
z=σ/nxˉ−μ0
The long-term mean daily mean temperature at Leuchars in October is 9.5°C, with a standard deviation of 2.0°C. A sample of 25 October days from the large data set gives a mean of 10.3°C. Test at the 5% level whether the mean has increased.
H0:μ=9.5, H1:μ>9.5 (one-tailed)
z=2.0/2510.3−9.5=0.40.8=2.0
Critical value at 5% (one-tailed): z=1.6449
Since 2.0>1.6449, reject H0. There is sufficient evidence at the 5% level to suggest that the mean daily mean temperature at Leuchars in October has increased.
To test whether there is significant linear correlation in the population:
The sample PMCC r is compared with critical values from the PMCC table, which depend on the sample size n and the significance level α.
A student calculates the PMCC between daily mean temperature and daily total sunshine at Camborne for 15 days in June, obtaining r=0.58.
H0:ρ=0, H1:ρ>0 (one-tailed, at 5%)
Critical value for n=15, one-tailed, 5%: 0.4409
Since 0.58>0.4409, reject H0. There is sufficient evidence at the 5% level of a positive correlation between daily mean temperature and daily total sunshine at Camborne in June.
| Test type | H1 | Critical region |
|---|---|---|
| One-tailed (upper) | μ>μ0 or p>p0 or ρ>0 | Upper tail only |
| One-tailed (lower) | μ<μ0 or p<p0 or ρ<0 | Lower tail only |
| Two-tailed | μ=μ0 or p=p0 or ρ=0 | Both tails (α/2 in each) |
The choice between one-tailed and two-tailed depends on the question. If the question asks whether a parameter has increased, use a one-tailed (upper) test. If it asks whether it has changed, use a two-tailed test.
This is one of the most commonly examined skills and a frequent source of lost marks.
If rejecting H0: "There is sufficient evidence at the [significance level] to suggest that [contextual statement matching H1]."
If not rejecting H0: "There is insufficient evidence at the [significance level] to suggest that [contextual statement matching H1]."
| Error | Correct approach |
|---|---|
| "Accept H0" | Say "there is insufficient evidence to reject H0" |
| "Prove H1" | Say "there is evidence to suggest..." — hypothesis tests provide evidence, not proof |
| Stating the conclusion without context | Always relate back to the original claim and the real-world situation |
| Using the sample statistic in the hypotheses | Hypotheses are about population parameters (μ, p, ρ) |
"There is sufficient evidence at the 5% significance level to suggest that the mean daily mean temperature at Leuchars in October has increased from the long-term average of 9.5°C. This could be due to climate change, although other factors such as natural variability and the urban heat island effect may also play a role."
| Error type | Definition | Probability |
|---|---|---|
| Type I | Rejecting H0 when it is true | Equal to the significance level α |
| Type II | Failing to reject H0 when it is false | Depends on the true parameter value and sample size |
The relationship between Type I and Type II errors:
When working with the large data set for hypothesis testing:
Exam Tip: The most common reason for losing marks on hypothesis testing questions is giving a non-contextual conclusion. Never just write "reject H0" — always translate this into a sentence about the real-world situation: "There is sufficient evidence at the 5% level to suggest that the mean daily rainfall at Camborne has increased from the historical average of 3.2 mm."
AQA 7357 specification, Paper 3 — Statistics, sub-strands N1–N3 (hypothesis testing) and S1–S6 (probability and statistical distributions) covers the language of statistical hypotheses; conduct a statistical hypothesis test for the proportion in the binomial distribution B(n,p), for the mean of a normal distribution with known, given or assumed variance, and for a zero correlation coefficient using a given critical value (refer to the official specification document for exact wording). Hypothesis testing is the AO3-rich climax of A-Level Statistics: it threads through section S2 (binomial distribution and its tests), section S5 (normal distribution and tests on μ), section S6 (correlation, regression and the product-moment test) and connects synoptically to section S1 (sampling — every test rests on assumed sampling distributions). The AQA Large Data Set (LDS) — the published weather dataset spanning UK and overseas locations across two contrasting periods — is the explicit setting from which Paper 3 hypothesis-testing questions are drawn. The formula booklet provides binomial PMF, normal CDF inversion, and the product-moment correlation formula, but does not list the test procedure — candidates must structure the test themselves.
Question (8 marks):
A meteorologist claims that, in the months covered by the AQA Large Data Set, the daily mean wind speed at one of the UK locations exceeds the long-run climatological mean of μ0=9.5 knots. A random sample of n=25 days is taken from the LDS for that location. The sample mean wind speed is xˉ=10.4 knots. Assume daily mean wind speed is normally distributed with known standard deviation σ=2.1 knots.
Test, at the 5% significance level, whether the data support the meteorologist's claim. State your hypotheses, test statistic, P-value (or critical comparison), and conclusion in context. (8)
Solution with mark scheme:
Step 1 — state hypotheses.
Let μ be the population mean daily wind speed (in knots) at the LDS location over the period sampled.
H0:μ=9.5H1:μ>9.5
B1 — correct H0 stated as an equality on the population parameter μ (not on xˉ).
B1 — correct one-tailed H1 matching the directional claim "exceeds". Common error: writing H1:μ=9.5 (two-tailed) when the claim is directional, which doubles the rejection region and changes the conclusion.
Step 2 — identify the test and sampling distribution.
Under H0, since the population is normal with known σ, the sample mean satisfies:
Xˉ∼N(μ0,nσ2)=N(9.5,252.12)=N(9.5,0.1764)
so the standard error is σ/n=2.1/5=0.42.
M1 — correct sampling distribution of Xˉ under H0, including the σ2/n variance.
Step 3 — compute the test statistic.
z=σ/nxˉ−μ0=0.4210.4−9.5=0.420.9≈2.143
M1 — correct standardisation. A1 — correct value to at least 3 s.f.
Step 4 — find the P-value (or compare with critical value).
For the one-tailed upper test, the P-value is P(Z≥2.143). Using the standard normal CDF: P(Z≥2.143)=1−Φ(2.143)≈1−0.9839=0.0161.
Equivalently, the upper 5% critical value is z0.05=1.645, and 2.143>1.645.
M1 — correct identification of the upper-tail probability or correct critical value retrieval.
A1 — correct P-value ≈0.016 (or correct comparison 2.143>1.645 stated explicitly).
Step 5 — conclusion in context.
Since 0.0161<0.05, we reject H0 at the 5% significance level. There is sufficient evidence to support the meteorologist's claim that the mean daily wind speed at this LDS location exceeds 9.5 knots over the period sampled.
A1 — conclusion that (i) refers to rejecting/not rejecting H0 in the language of evidence, not certainty, and (ii) is phrased in context — naming wind speed, the location, and the directional claim. A bare "reject H0" earns no context mark.
Total: 8 marks (B2 M3 A3, split as shown).
Question (6 marks): A student claims that, at a UK LDS location, the proportion of "rain days" (days on which daily rainfall ≥1mm) during May–October is at most 0.3. From a random sample of n=30 days drawn from the LDS, X=14 are rain days. Conduct a hypothesis test at the 5% significance level to assess whether the data contradict the student's claim. State your conclusion in context.
Mark scheme decomposition by AO:
Total: 6 marks split AO1 = 3, AO2 = 1, AO3 = 2. This is an AO3-heavy specimen by Statistics standards: the student's claim is the alternative (because it is the inequality being tested against), so framing H0 as the equality boundary and H1 as the strict inequality is the AO2 reasoning step that many candidates botch. The AO3 marks reward the comparison and the contextual conclusion.
Connects to:
Section S1 — Sampling: every hypothesis test assumes a random sample. If the LDS days are chosen by systematic sampling (every 5th day) rather than simple random sampling, autocorrelation in weather data (today's wind correlates with yesterday's) inflates the effective sample size — invalidating the standard error σ/n. AO3 questions sometimes ask "comment on the validity of the test" — naming this autocorrelation issue is the A* response.
Section S2 — Binomial distribution: binomial tests on p require modelling each LDS day as an independent Bernoulli trial. The exchange between "rain / no rain" Bernoulli outcomes and the binomial count is the conceptual hinge. Independence is suspect for consecutive days — flagging this is AO3.
Section S5 — Normal distribution: tests on μ with known σ use Z=(Xˉ−μ0)/(σ/n). The Central Limit Theorem (Year 2) means even non-normal weather variables (rainfall is right-skewed) yield approximately normal sample means once n is large — this licenses the test under broader conditions than strict normality.
Section S6 — Correlation: the product-moment correlation test H0:ρ=0 versus H1:ρ=0 uses the sample r against tabulated critical values depending on n. Testing whether daily mean temperature and daily mean wind speed are correlated at an LDS location is a canonical Paper 3 question.
Modelling assumptions across the LDS: AQA explicitly tests whether candidates can criticise the modelling — assuming wind speed is normal when it is bounded below by 0; assuming rain days are independent when storm systems persist for days; assuming the LDS sample period is representative when it covers only specific months. Questions phrased "comment on the suitability of this model" are AO3.5a calls.
Hypothesis-testing questions on AQA 7357 Paper 3 split AO marks more evenly than algebra topics:
Subscribe to continue reading
Get full access to this lesson and all 10 lessons in this course.