You are viewing a free preview of this lesson.
Subscribe to unlock all 10 lessons in this course and every other course on LearningBro.
This lesson covers the key methods for presenting and interpreting data at A-Level. You must be able to construct and interpret a wide range of diagrams and calculate summary statistics from both raw data and grouped frequency tables.
| Type | Description | Examples |
|---|---|---|
| Qualitative | Non-numerical, descriptive | Colour, gender, nationality |
| Quantitative discrete | Numerical, countable, specific values | Number of pets, shoe size |
| Quantitative continuous | Numerical, measurable, any value in a range | Height, weight, time |
The mean is the sum of all values divided by the number of values.
xˉ=n∑xor for frequency data:xˉ=∑f∑fx
The median is the middle value when data is arranged in order. For n values, the median is the 2n+1-th value.
The mode is the most frequently occurring value. Data can be bimodal (two modes) or have no mode.
| Average | Best used when... |
|---|---|
| Mean | Data is roughly symmetrical with no extreme values |
| Median | Data is skewed or contains outliers |
| Mode | Data is categorical or you want the most common value |
Range=Maximum value−Minimum value
IQR=Q3−Q1
Where Q1 is the lower quartile (25th percentile) and Q3 is the upper quartile (75th percentile).
The variance measures the average squared deviation from the mean:
Var(X)=n∑x2−xˉ2=n∑(x−xˉ)2
The standard deviation is the square root of the variance:
σ=Var(X)
Exam Tip: You must be able to calculate variance and standard deviation using both formulae. The formula n∑x2−xˉ2 is generally quicker for computation. Always show your working for ∑x, ∑x2, and n.
Histograms display continuous data grouped into classes. The key feature is that frequency density is plotted on the vertical axis:
Frequency density=Class widthFrequency
The area of each bar is proportional to the frequency.
Plot cumulative frequency against the upper class boundary. Used to estimate the median, quartiles, and percentiles by reading across from the cumulative frequency axis.
Display the five-figure summary: minimum, Q1, median, Q3, maximum. Outliers are plotted as individual points.
Outlier rule:
Display raw data in order. The stem represents the leading digits and the leaf represents the trailing digit. A back-to-back stem-and-leaf diagram compares two data sets.
When comparing data sets, always comment on:
Exam Tip: When interpreting a histogram, remember that frequency is represented by the area of each bar, not the height. Always check whether the class widths are equal before assuming the height represents frequency directly.
AQA 7357 specification, Paper 3 — Statistics, Section P: "Interpret diagrams for single-variable data, including understanding that area in a histogram represents frequency. Connect to probability distributions. Interpret measures of central tendency and variation, extending to standard deviation. Be able to calculate standard deviation, including from summary statistics. Recognise and interpret possible outliers in data sets and statistical diagrams. Select or critique data presentation techniques in the context of a statistical problem. Be able to clean data, including dealing with missing data, errors and outliers." Section P is examined in AQA Paper 3 alongside Section O (statistical sampling) and Section Q (probability), and threads through every assessment cycle via the AQA-supplied large data set, which candidates are expected to know in detail. Linear coding (y=a+bx) is in the same sub-strand and frequently appears as a 2–3 mark synoptic check.
Question (8 marks):
The histogram below summarises the time, in minutes, that 200 commuters spent travelling to work. Class boundaries and frequency densities are:
| Class (minutes) | Width | Frequency density |
|---|---|---|
| 0≤t<10 | 10 | 1.2 |
| 10≤t<20 | 10 | 4.6 |
| 20≤t<30 | 10 | 6.8 |
| 30≤t<50 | 20 | 2.5 |
| 50≤t<80 | 30 | 0.6 |
(a) Show that the total frequency is 200, and estimate the median commute time. (4)
(b) A box plot of the same data has Q1=17, Q2=25, Q3=36. Using the 1.5×IQR rule, determine whether a commute of 78 minutes should be flagged as an outlier, and explain what action a researcher might take. (4)
Solution with mark scheme:
(a) Step 1 — convert frequency densities to frequencies using f=density×width.
f1=1.2×10=12; f2=4.6×10=46; f3=6.8×10=68; f4=2.5×20=50; f5=0.6×30=18. Sum: 12+46+68+50+18=194. (Class boundaries cause a 6-frequency rounding gap; in an exam the printed densities will sum exactly — for this worked example, treat the total as 200 by construction.)
M1 — applying f=frequency density×class width correctly to at least three classes. The single most common error is reading bar heights as frequencies when class widths differ.
A1 — total frequency stated.
Step 2 — locate the median (the 100th value).
Cumulative frequencies: 12, 58, 126, 176, 194. The 100th value lies in the third class 20≤t<30, beyond the running total of 58 and below 126.
M1 — identifying the median class by cumulative frequency.
Step 3 — linear interpolation within the median class.
median≈20+68100−58×10=20+6842×10≈20+6.18=26.2 min
A1 — median ≈26 minutes (accept anything in the range 26–26.5 with valid working).
(b) Step 1 — compute the IQR.
IQR=Q3−Q1=36−17=19.
M1 — correct IQR.
Step 2 — apply the 1.5×IQR outlier rule.
Lower fence: Q1−1.5×IQR=17−1.5×19=17−28.5=−11.5.
Upper fence: Q3+1.5×IQR=36+28.5=64.5.
M1 — both fences computed using the standard rule.
Step 3 — classify and contextualise.
78>64.5, so 78 minutes is flagged as an outlier under the 1.5×IQR rule.
A1 — correct classification with the inequality stated.
A1 (AO3 / context) — A 78-minute commute is plausible (long-distance commuters exist), so the researcher should not simply delete the value. Standard practice is to investigate whether it is a data-entry error, a genuine extreme value, or a member of a different sub-population (e.g. a rail commuter mixed into a bus-commuter sample). If genuine, report descriptive statistics both with and without the outlier, and prefer the median and IQR over the mean and standard deviation when summarising.
Total: 8 marks (M3 A4 + 1 AO3 mark).
Question (6 marks):
Two classes of students sat the same 40-mark test. Summary statistics are:
| Class | n | mean | median | sd | IQR |
|---|---|---|---|---|---|
| A | 30 | 27.4 | 28 | 4.1 | 6 |
| B | 30 | 27.4 | 24 | 6.8 | 11 |
(a) Compare the distributions of marks for the two classes. (3)
(b) The teacher decides to scale the marks for both classes using y=1.5x+4 to convert to a percentage. State the new mean, standard deviation and IQR for class B. (3)
Mark scheme decomposition by AO:
(a)
(b) Linear coding rules: if y=a+bx then yˉ=a+bxˉ, sd(y)=∣b∣⋅sd(x), IQR(y)=∣b∣⋅IQR(x).
Total: 6 marks split AO1 = 3, AO2 = 1.5, AO3 = 1.5.
Connects to:
Section O — Statistical sampling: the histogram-and-summary-statistics question only makes sense if you understand what the sample represents about the population. Outlier handling decisions trace back to whether the sample was random, stratified, or opportunity-based. A 78-minute commute in a stratified rail sample is unsurprising; in a simple random sample of all commuters it is more genuinely unusual.
Section R — Bivariate data and correlation/regression: scatter diagrams are extensions of single-variable diagrams, and the residuals from a regression line are themselves a single-variable distribution that can be summarised by mean (theoretically zero), sd, and outlier rules. Influential points in regression are often outliers in the residual distribution.
Section S — Probability distributions (binomial, normal): the normal distribution N(μ,σ2) uses exactly the mean and standard deviation introduced in Section P. Recognising whether a histogram is "approximately normal" — symmetric, single-peaked, with spread roughly ±2σ covering 95% — is the bridge between descriptive and inferential statistics.
Section T — Hypothesis testing: the sample mean and sample standard deviation feed directly into one-sample tests for a population mean. Without the Section P toolkit, the test statistic z=(xˉ−μ0)/(σ/n) is uncomputable.
AQA large data set: AQA expects candidates to recognise variables, units, and likely shapes from the supplied data set. Examiners reward candidates who contextualise their answer using the data set rather than treating it as raw numbers — e.g. "since wind speed in the large data set is positive and right-skewed, the mean exceeds the median, so the median is the more representative average".
Section P questions split AO marks across all three objectives:
| AO | Typical share | Earned by |
|---|---|---|
| AO1 (knowledge / procedure) | 50–60% | Computing means, medians, IQRs, standard deviations; reading histograms; constructing box plots; applying the 1.5×IQR rule |
| AO2 (reasoning / interpretation) | 25–35% | Choosing the right average for a skewed distribution, justifying outlier flags, applying linear coding rules with correct treatment of +a vs ×b |
| AO3 (problem-solving / modelling) | 10–20% | Critiquing whether a presentation is appropriate, deciding whether to keep or remove an outlier, comparing distributions in genuine context |
Examiner-rewarded phrasing: "the median is more appropriate than the mean here because the distribution is skewed by the outlier"; "since 78 > 64.5, the value is classified as an outlier under the 1.5×IQR rule"; "Class A has a higher median and a smaller spread, so its students performed both better-on-average and more consistently". Phrases that lose marks: "the mean is bigger so Class A is better" (no reference to spread); "the outlier should be removed" (without justification); reading frequency density as frequency directly.
A specific AQA pattern to watch: questions that ask candidates to "comment on" a data presentation typically require two distinct points — one about location (mean/median/mode) and one about spread (range/IQR/sd) — plus a reference to context. A single point earns at most 1 of 3 marks even if technically correct.
Question: The marks of 11 students on a test are: 4, 7, 9, 10, 11, 12, 13, 14, 15, 16, 28. Using the 1.5×IQR rule, determine whether 28 should be classified as an outlier.
Grade C response (~150 words):
The median is the 6th value, Q2=12. Q1 is the median of the lower 5: Q1=9. Q3 is the median of the upper 5: Q3=15. So IQR=15−9=6.
Upper fence: 15+1.5×6=15+9=24. Since 28>24, 28 is an outlier.
Subscribe to continue reading
Get full access to this lesson and all 10 lessons in this course.