Data Presentation & Interpretation

This lesson covers the key methods for presenting and interpreting data at A-Level. You must be able to construct and interpret a wide range of diagrams and calculate summary statistics from both raw data and grouped frequency tables.

Types of Data

Type	Description	Examples
Qualitative	Non-numerical, descriptive	Colour, gender, nationality
Quantitative discrete	Numerical, countable, specific values	Number of pets, shoe size
Quantitative continuous	Numerical, measurable, any value in a range	Height, weight, time

Measures of Location (Averages)

Mean

The mean is the sum of all values divided by the number of values.

$\bar{x} = \frac{\sum x}{n} \quad \text{or for frequency data:} \quad \bar{x} = \frac{\sum fx}{\sum f}$
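As a rough illustration of the frequency-table formula, the Python sketch below computes $\sum fx / \sum f$. The table (number of pets against number of households) and the function name `mean_from_frequencies` are invented for illustration and are not part of the lesson's data.

```python
# Hypothetical frequency table: number of pets (x) and number of households (f).
values = [0, 1, 2, 3, 4]
freqs = [5, 8, 4, 2, 1]

def mean_from_frequencies(xs, fs):
    """Return sum(f * x) / sum(f) for a frequency table."""
    total_fx = sum(f * x for x, f in zip(xs, fs))
    total_f = sum(fs)
    return total_fx / total_f

pets_mean = mean_from_frequencies(values, freqs)
print(pets_mean)  # sum fx = 26, sum f = 20, so the mean is 1.3
```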

Median

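The median is the middle value when the data are arranged in order. For $n$ values, the median is the $\frac{n+1}{2}$th value; when $n$ is even this position falls halfway between two values, so take the mean of the two middle values.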

Mode

The mode is the most frequently occurring value. Data can be bimodal (two modes) or have no mode.
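The three averages can be checked quickly with Python's standard `statistics` module. This is a minimal sketch on an invented list of shoe sizes, chosen so that the data set is bimodal and has an even number of values.

```python
import statistics

# Hypothetical data set of shoe sizes, invented for illustration.
sizes = [5, 6, 6, 7, 7, 8, 9, 11]

size_mean = statistics.mean(sizes)        # sum of values / number of values
size_median = statistics.median(sizes)    # mean of the two middle values when n is even
size_modes = statistics.multimode(sizes)  # every value tied for most frequent

print(size_mean, size_median, size_modes)
```

Here the largest value (11) pulls the mean above the median, and `multimode` reports both 6 and 7 because each occurs twice.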

Choosing the Appropriate Average

Average	Best used when...
Mean	Data is roughly symmetrical with no extreme values
Median	Data is skewed or contains outliers
Mode	Data is categorical or you want the most common value

Measures of Spread

Range

$\text{Range} = \text{Maximum value} - \text{Minimum value}$

Interquartile Range (IQR)

$\text{IQR} = Q_3 - Q_1$

Where $Q_1$ is the lower quartile (25th percentile) and $Q_3$ is the upper quartile (75th percentile).

Variance and Standard Deviation

The variance measures the average squared deviation from the mean:

$\text{Var}(X) = \frac{\sum x^2}{n} - \bar{x}^2 = \frac{\sum (x - \bar{x})^2}{n}$

The standard deviation is the square root of the variance:

$\sigma = \sqrt{\text{Var}(X)}$

Exam Tip: You must be able to calculate variance and standard deviation using both formulae. The formula $\frac{\sum x^2}{n} - \bar{x}^2$ is generally quicker for computation. Always show your working for $\sum x$, $\sum x^2$, and $n$.
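To see that the two variance formulae agree, the sketch below evaluates both on a small data set. The data values are invented for illustration; the variable names are assumptions, not notation from the specification.

```python
import math

# Hypothetical raw data, invented for illustration.
data = [2, 4, 4, 4, 5, 5, 7, 9]
n = len(data)
sum_x = sum(data)
sum_x2 = sum(x * x for x in data)
mean = sum_x / n

# Formula 1: mean of the squares minus the square of the mean.
var_shortcut = sum_x2 / n - mean ** 2

# Formula 2: mean squared deviation from the mean.
var_definition = sum((x - mean) ** 2 for x in data) / n

sd = math.sqrt(var_shortcut)
print(sum_x, sum_x2, var_shortcut, var_definition, sd)
```

With $\sum x = 40$, $\sum x^2 = 232$ and $n = 8$, both routes give a variance of 4 and a standard deviation of 2.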

Data Presentation Methods

Histograms

Histograms display continuous data grouped into classes. The key feature is that frequency density is plotted on the vertical axis:

$\text{Frequency density} = \frac{\text{Frequency}}{\text{Class width}}$

The area of each bar is proportional to the frequency.
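A short sketch of the frequency density calculation, using an invented grouped table with unequal class widths. It also checks that bar area (density times width) recovers each frequency.

```python
# Hypothetical grouped data: (lower boundary, upper boundary, frequency).
classes = [(0, 10, 8), (10, 15, 12), (15, 20, 15), (20, 40, 10)]

def frequency_densities(table):
    """Return frequency / class width for each class."""
    return [freq / (upper - lower) for lower, upper, freq in table]

densities = frequency_densities(classes)

# Bar area = density * width, which should recover the frequency of each class.
areas = [d * (upper - lower) for d, (lower, upper, _) in zip(densities, classes)]
print(densities)
print(areas)
```

Note that the widest class (20 to 40) has the lowest bar even though its frequency is larger than that of the first class.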

Cumulative Frequency Diagrams

Plot cumulative frequency against the upper class boundary of each class. The diagram is used to estimate the median, quartiles, and percentiles: read across from the required cumulative frequency to the curve, then down to the horizontal axis.

Box Plots (Box-and-Whisker Diagrams)

Display the five-figure summary: minimum, $Q_1$, median, $Q_3$, maximum. Outliers are plotted as individual points.

Outlier rule:

Below $Q_1 - 1.5 \times \text{IQR}$
Above $Q_3 + 1.5 \times \text{IQR}$
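The outlier rule can be sketched as two small helper functions. The names `outlier_fences` and `is_outlier` and the quartile values used are hypothetical, chosen only to show the rule in action.

```python
def outlier_fences(q1, q3, k=1.5):
    """Return (lower fence, upper fence) for the k * IQR rule."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr

def is_outlier(value, q1, q3):
    """True if the value lies strictly outside the fences."""
    lower, upper = outlier_fences(q1, q3)
    return value < lower or value > upper

# Hypothetical quartiles, invented for illustration: Q1 = 20, Q3 = 30, so IQR = 10.
lower, upper = outlier_fences(20, 30)
print(lower, upper)
print(is_outlier(46, 20, 30))
print(is_outlier(44, 20, 30))
```

With these quartiles the fences are 5 and 45, so 46 is flagged and 44 is not.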

Stem-and-Leaf Diagrams

Display raw data in order. The stem represents the leading digits and the leaf represents the trailing digit. A back-to-back stem-and-leaf diagram compares two data sets.
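As an illustration of how stems and leaves are separated, the sketch below builds a simple stem-and-leaf diagram for non-negative whole numbers, taking the tens as the stem and the units as the leaf. The marks and the function name `stem_and_leaf` are invented for illustration; stems with no data are simply omitted in this version.

```python
from collections import defaultdict

def stem_and_leaf(values):
    """Group non-negative integers by stem (tens) with ordered leaves (units)."""
    stems = defaultdict(list)
    for v in sorted(values):
        stem, leaf = divmod(v, 10)
        stems[stem].append(leaf)
    return dict(stems)

# Hypothetical test marks, invented for illustration.
marks = [23, 31, 27, 35, 31, 42, 29, 38]
diagram = stem_and_leaf(marks)
for stem, leaves in diagram.items():
    print(f"{stem} | {' '.join(str(leaf) for leaf in leaves)}")
print("Key: 2 | 3 means 23")
```

A key is always required on a stem-and-leaf diagram, which is why the last line prints one.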

Comparing Data Sets

When comparing data sets, always comment on:

A measure of location (mean or median) — which data set has higher/lower average?
A measure of spread (IQR or standard deviation) — which data set is more consistent?
Context — relate your comparison to the context of the data.

Summary

Classify data as qualitative, quantitative discrete, or quantitative continuous.
Calculate and interpret the mean, median, mode, range, IQR, variance, and standard deviation.
Construct and interpret histograms, cumulative frequency diagrams, box plots, and stem-and-leaf diagrams.
Identify outliers using the $1.5 \times \text{IQR}$ rule.
When comparing data sets, comment on location, spread, and context.

Exam Tip: When interpreting a histogram, remember that frequency is represented by the area of each bar, not the height. Always check whether the class widths are equal before assuming the height represents frequency directly.

A-Level Deep Dive: Data Presentation and Interpretation

Spec mapping

AQA 7357 specification, Paper 3 — Statistics, Section P: "Interpret diagrams for single-variable data, including understanding that area in a histogram represents frequency. Connect to probability distributions. Interpret measures of central tendency and variation, extending to standard deviation. Be able to calculate standard deviation, including from summary statistics. Recognise and interpret possible outliers in data sets and statistical diagrams. Select or critique data presentation techniques in the context of a statistical problem. Be able to clean data, including dealing with missing data, errors and outliers." Section P is examined in AQA Paper 3 alongside Section O (statistical sampling) and Section Q (probability), and threads through every assessment cycle via the AQA-supplied large data set, which candidates are expected to know in detail. Linear coding ( $y = a + bx$ ) is in the same sub-strand and frequently appears as a 2–3 mark synoptic check.

Worked example with full mark scheme

Question (8 marks):

The histogram below summarises the time, in minutes, that 200 commuters spent travelling to work. Class boundaries and frequency densities are:

Class (minutes)	Width	Frequency density
$0 \le t < 10$	10	1.2
$10 \le t < 20$	10	4.6
$20 \le t < 30$	10	6.8
$30 \le t < 50$	20	2.5
$50 \le t < 80$	30	0.6

(a) Show that the total frequency is 200, and estimate the median commute time. (4)

(b) A box plot of the same data has $Q_1 = 17$, $Q_2 = 25$, $Q_3 = 36$. Using the $1.5 \times \mathrm{IQR}$ rule, determine whether a commute of 78 minutes should be flagged as an outlier, and explain what action a researcher might take. (4)

Solution with mark scheme:

(a) Step 1 — convert frequency densities to frequencies using $f = \text{density} \times \text{width}$ .

$f_1 = 1.2 \times 10 = 12$; $f_2 = 4.6 \times 10 = 46$; $f_3 = 6.8 \times 10 = 68$; $f_4 = 2.5 \times 20 = 50$; $f_5 = 0.6 \times 30 = 18$. Sum: $12 + 46 + 68 + 50 + 18 = 194$. (The densities as tabulated give 194, which is 6 short of the 200 stated in the question, so the table is not fully consistent with the question. In an exam the printed densities will give the stated total exactly; for this worked example, take the total as 200 and continue.)

M1 — applying $f = \text{frequency density} \times \text{class width}$ correctly to at least three classes. The single most common error is reading bar heights as frequencies when class widths differ.

A1 — total frequency stated.

Step 2 — locate the median (the 100th value).

Cumulative frequencies: 12, 58, 126, 176, 194. The 100th value lies in the third class $20 \le t < 30$ , beyond the running total of 58 and below 126.

M1 — identifying the median class by cumulative frequency.

Step 3 — linear interpolation within the median class.

$\text{median} \approx 20 + \dfrac{100 - 58}{68} \times 10 = 20 + \dfrac{42}{68} \times 10 \approx 20 + 6.18 = 26.2 \text{ min}$

A1 — median $\approx 26$ minutes (accept anything in the range 26–26.5 with valid working).
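The arithmetic in part (a) can be checked with a short sketch. It uses the class boundaries and frequency densities from the question's table and, like the mark scheme, locates the median at the 100th value; the function name `interpolated_value` is an assumption made for this illustration.

```python
# Classes from the table in the question: (lower, upper, frequency density).
classes = [(0, 10, 1.2), (10, 20, 4.6), (20, 30, 6.8), (30, 50, 2.5), (50, 80, 0.6)]

# Frequency = frequency density * class width.
freqs = [density * (upper - lower) for lower, upper, density in classes]

def interpolated_value(table, frequencies, position):
    """Estimate the value at a cumulative position by linear interpolation."""
    running = 0.0
    for (lower, upper, _), f in zip(table, frequencies):
        if running + f >= position:
            # Fraction of the way through this class, scaled by the class width.
            return lower + (position - running) / f * (upper - lower)
        running += f
    raise ValueError("position exceeds total frequency")

median_estimate = interpolated_value(classes, freqs, 100)
print([round(f) for f in freqs])
print(round(median_estimate, 1))
```

The same function gives quartile estimates if the position is changed, for example to a quarter or three quarters of the total frequency.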

(b) Step 1 — compute the IQR.

$\mathrm{IQR} = Q_3 - Q_1 = 36 - 17 = 19$ .

M1 — correct IQR.

Step 2 — apply the $1.5 \times \mathrm{IQR}$ outlier rule.

Lower fence: $Q_1 - 1.5 \times \mathrm{IQR} = 17 - 1.5 \times 19 = 17 - 28.5 = -11.5$ .

Upper fence: $Q_3 + 1.5 \times \mathrm{IQR} = 36 + 28.5 = 64.5$ .

M1 — both fences computed using the standard rule.

Step 3 — classify and contextualise.

$78 > 64.5$ , so 78 minutes is flagged as an outlier under the $1.5 \times \mathrm{IQR}$ rule.

A1 — correct classification with the inequality stated.

A1 (AO3 / context) — A 78-minute commute is plausible (long-distance commuters exist), so the researcher should not simply delete the value. Standard practice is to investigate whether it is a data-entry error, a genuine extreme value, or a member of a different sub-population (e.g. a rail commuter mixed into a bus-commuter sample). If genuine, report descriptive statistics both with and without the outlier, and prefer the median and IQR over the mean and standard deviation when summarising.

Total: 8 marks (M4 A4, with the final A1 awarded for AO3 context).

Specimen question modelled on the AQA 7357 Paper 3 format

Question (6 marks):

Two classes of students sat the same 40-mark test. Summary statistics are:

Class	$n$	mean	median	sd	IQR
A	30	27.4	28	4.1	6
B	30	27.4	24	6.8	11

(a) Compare the distributions of marks for the two classes. (3)

(b) The teacher decides to scale the marks for both classes using $y = 1.5x + 4$ to give a scaled mark. State the new mean, standard deviation and IQR for class B. (3)

Mark scheme decomposition by AO:

(a)

B1 (AO1.2 / AO2.4) — comment on location: means are equal but Class A has the higher median, so Class A's "typical" performer scores higher despite identical means.
B1 (AO2.4) — comment on spread: Class A has a smaller standard deviation (4.1 vs 6.8) and a smaller IQR (6 vs 11), so Class A is more consistent.
B1 (AO3.5 / context) — interpret in context: the equal means but unequal medians indicate that Class B's mean is being pulled up by a small number of very high scorers, suggesting that Class B's marks are positively skewed and more polarised. The comparison must reference centre, spread and the context to earn this AO3 mark.

(b) Linear coding rules: if $y = a + bx$ then $\bar{y} = a + b\bar{x}$, $\mathrm{sd}(y) = |b| \cdot \mathrm{sd}(x)$, $\mathrm{IQR}(y) = |b| \cdot \mathrm{IQR}(x)$.

B1 (AO1.1b) — new mean: $1.5 \times 27.4 + 4 = 41.1 + 4 = 45.1$ .
B1 (AO1.1b) — new sd: $1.5 \times 6.8 = 10.2$ (the constant +4 has no effect on spread).
B1 (AO1.1b) — new IQR: $1.5 \times 11 = 16.5$ .
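The coding rules can be checked numerically. The sketch below applies $y = 4 + 1.5x$ to an invented list of raw marks (not the actual class B data, which is not given) and confirms that the mean is scaled and shifted while the standard deviation is only scaled. It then applies the same rules to the class B summary statistics.

```python
import statistics

def code(values, a, b):
    """Apply the linear coding y = a + b*x to every value."""
    return [a + b * x for x in values]

# Hypothetical raw marks, invented for illustration (not the class B data).
x = [12, 18, 20, 24, 26, 30, 31, 35]
y = code(x, a=4, b=1.5)

mean_x, mean_y = statistics.mean(x), statistics.mean(y)
sd_x, sd_y = statistics.pstdev(x), statistics.pstdev(y)

print(mean_y, 4 + 1.5 * mean_x)  # mean is scaled and shifted
print(sd_y, 1.5 * sd_x)          # sd is scaled only; the +4 has no effect

# Applying the same rules to the class B summary statistics from the question.
new_mean = 4 + 1.5 * 27.4
new_sd = 1.5 * 6.8
new_iqr = 1.5 * 11
print(round(new_mean, 1), round(new_sd, 1), round(new_iqr, 1))
```

`statistics.pstdev` is used because the lesson's variance formula divides by $n$ rather than $n - 1$.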

Total: 6 marks. By AO: AO1 = 3 (part b); AO2 = 2 and AO3 = 1 (part a), counting the shared AO1.2 / AO2.4 mark under AO2.

Synoptic links

Connects to:

Section O — Statistical sampling: the histogram-and-summary-statistics question only makes sense if you understand what the sample represents about the population. Outlier handling decisions trace back to whether the sample was random, stratified, or opportunity-based. A 78-minute commute in a stratified rail sample is unsurprising; in a simple random sample of all commuters it is more genuinely unusual.
Section R — Bivariate data and correlation/regression: scatter diagrams are extensions of single-variable diagrams, and the residuals from a regression line are themselves a single-variable distribution that can be summarised by mean (theoretically zero), sd, and outlier rules. Influential points in regression are often outliers in the residual distribution.
Section S — Probability distributions (binomial, normal): the normal distribution $N(\mu, \sigma^2)$ uses exactly the mean and standard deviation introduced in Section P. Recognising whether a histogram is "approximately normal" — symmetric, single-peaked, with spread roughly $\pm 2\sigma$ covering 95% — is the bridge between descriptive and inferential statistics.
Section T — Hypothesis testing: the sample mean and sample standard deviation feed directly into one-sample tests for a population mean. Without the Section P toolkit, the test statistic $z = (\bar{x} - \mu_0)/(\sigma/\sqrt{n})$ is uncomputable.
AQA large data set: AQA expects candidates to recognise variables, units, and likely shapes from the supplied data set. Examiners reward candidates who contextualise their answer using the data set rather than treating it as raw numbers — e.g. "since wind speed in the large data set is positive and right-skewed, the mean exceeds the median, so the median is the more representative average".

Mark-scheme literacy

Section P questions split AO marks across all three objectives:

AO	Typical share	Earned by
AO1 (knowledge / procedure)	50–60%	Computing means, medians, IQRs, standard deviations; reading histograms; constructing box plots; applying the $1.5 \times \mathrm{IQR}$ rule
AO2 (reasoning / interpretation)	25–35%	Choosing the right average for a skewed distribution, justifying outlier flags, applying linear coding rules with correct treatment of $+a$ vs $\times b$
AO3 (problem-solving / modelling)	10–20%	Critiquing whether a presentation is appropriate, deciding whether to keep or remove an outlier, comparing distributions in genuine context

Examiner-rewarded phrasing: "the median is more appropriate than the mean here because the distribution is skewed by the outlier"; "since 78 > 64.5, the value is classified as an outlier under the $1.5 \times \mathrm{IQR}$ rule"; "Class A has a higher median and a smaller spread, so its students performed both better on average and more consistently". Phrases that lose marks: "the mean is bigger so Class A is better" (no reference to spread); "the outlier should be removed" (without justification); reading frequency density as frequency directly.

A specific AQA pattern to watch: questions that ask candidates to "comment on" a data presentation typically require two distinct points — one about location (mean/median/mode) and one about spread (range/IQR/sd) — plus a reference to context. A single point earns at most 1 of 3 marks even if technically correct.

Grade-band model answers

3-mark question

Question: The marks of 11 students on a test are: 4, 7, 9, 10, 11, 12, 13, 14, 15, 16, 28. Using the $1.5 \times \mathrm{IQR}$ rule, determine whether 28 should be classified as an outlier.

Grade C response (~150 words):

The median is the 6th value, $Q_2 = 12$. $Q_1$ is the median of the lower 5 values: $Q_1 = 9$. $Q_3$ is the median of the upper 5 values: $Q_3 = 15$. So $\mathrm{IQR} = 15 - 9 = 6$.

Upper fence: $15 + 1.5 \times 6 = 15 + 9 = 24$. Since $28 > 24$, 28 is an outlier.
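The quartile method used in this response (medians of the lower and upper halves, leaving out the middle value when $n$ is odd) can be sketched as follows. This is one common school convention; software packages often use other quartile definitions and may give slightly different values. The function name `quartiles_by_halves` is an assumption made for this illustration.

```python
import statistics

marks = [4, 7, 9, 10, 11, 12, 13, 14, 15, 16, 28]

def quartiles_by_halves(values):
    """Return (Q1, Q2, Q3) using medians of the lower and upper halves.

    The middle value is excluded from both halves when n is odd.
    Assumes at least two values.
    """
    ordered = sorted(values)
    half = len(ordered) // 2
    lower_half = ordered[:half]
    upper_half = ordered[-half:]
    return (statistics.median(lower_half),
            statistics.median(ordered),
            statistics.median(upper_half))

q1, q2, q3 = quartiles_by_halves(marks)
iqr = q3 - q1
upper_fence = q3 + 1.5 * iqr
print(q1, q2, q3, iqr, upper_fence, 28 > upper_fence)
```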
