You are viewing a free preview of this lesson.
Subscribe to unlock all 10 lessons in this course and every other course on LearningBro.
This lesson introduces statistical distributions at A-Level, focusing on the concept of a random variable and its probability distribution. Understanding distributions is fundamental to modelling real-world phenomena and underpins all of inferential statistics.
A random variable is a variable whose value depends on the outcome of a random event. Random variables are denoted by capital letters (e.g., X), and their specific values by lowercase letters (e.g., x).
| Type | Description | Example |
|---|---|---|
| Discrete | Takes specific, countable values | Number of heads in 10 coin tosses |
| Continuous | Takes any value in a range | Height of a randomly chosen student |
A probability distribution for a discrete random variable X lists all possible values and their probabilities.
Properties:
Example:
| x | 1 | 2 | 3 | 4 |
|---|---|---|---|---|
| P(X=x) | 0.1 | 0.3 | 0.4 | 0.2 |
Check: 0.1+0.3+0.4+0.2=1 ✓
If a probability is unknown, use the fact that all probabilities sum to 1 to find it.
Example: If P(X=1)=0.2, P(X=2)=k, P(X=3)=2k, P(X=4)=0.1:
0.2+k+2k+0.1=1⟹3k=0.7⟹k=307
The expected value of a discrete random variable is the long-run average:
E(X)=μ=∑x⋅P(X=x)
Var(X)=E(X2)−[E(X)]2
where E(X2)=∑x2⋅P(X=x).
σ=Var(X)
Exam Tip: When calculating E(X) and Var(X), set up a table with columns for x, P(X=x), xP, and x2P. This organises your working clearly and reduces errors.
If Y=aX+b, then:
E(Y)=aE(X)+b Var(Y)=a2Var(X)
Note: Adding a constant b shifts the mean but does not affect the variance. Multiplying by a scales both the mean and the standard deviation.
A discrete uniform distribution assigns equal probability to each of n outcomes:
P(X=x)=n1
Example: A fair six-sided die has P(X=x)=61 for x=1,2,3,4,5,6.
For integers 1 to n: E(X)=2n+1,Var(X)=12n2−1
Exam Tip: Probability distribution questions often ask you to find an unknown probability by using the fact that all probabilities sum to 1. Always state this condition explicitly in your working.
AQA 7357 specification, Paper 3 — Statistics, Section R: Statistical Distributions. This section requires students to "understand and use simple, finite, discrete probability distributions (defined analytically, in tables or by formula), including the distribution of a random variable, expected value E(X), variance Var(X), and the discrete uniform distribution." Binomial B(n,p) and normal N(μ,σ2) distributions are examined under separate sub-strands and treated in their own lessons; this Deep Dive is restricted to the general discrete framework — probability functions, expectation, variance, and the uniform case. Although the Section R material is concentrated in Paper 3, the language of expectation propagates into Paper 2 Mechanics (impulse and average force as expected change in momentum) and Paper 1 Pure (sigma notation, infinite series convergence). The AQA formula booklet provides E(X)=∑xP(X=x) and Var(X)=E(X2)−[E(X)]2 — the second form is the computational form and is the one students are expected to apply.
Question (8 marks):
The discrete random variable X has probability function
P(X=x)=⎩⎨⎧kxk(6−x)0for x=1,2,3for x=4,5otherwise
(a) Show that k=91. (2)
(b) Find E(X). (2)
(c) Find Var(X). (4)
Solution with mark scheme:
(a) Step 1 — apply the total-probability axiom.
The probabilities must sum to 1:
∑x=15P(X=x)=k(1)+k(2)+k(3)+k(2)+k(1)=9k
M1 — recognising that ∑P(X=x)=1 over the support and writing the correct sum. The substitution P(X=4)=k(6−4)=2k and P(X=5)=k(6−5)=k must be evaluated correctly; many candidates double-count or misread the piecewise rule.
A1 — solving 9k=1⟹k=91 as required.
(b) Step 1 — list the probabilities and apply E(X)=∑xP(X=x).
| x | 1 | 2 | 3 | 4 | 5 |
|---|---|---|---|---|---|
| P(X=x) | 91 | 92 | 93 | 92 | 91 |
E(X)=1⋅91+2⋅92+3⋅93+4⋅92+5⋅91=91+4+9+8+5=927=3
M1 — correct expectation formula applied with all five terms.
A1 — E(X)=3. The symmetry of the distribution about x=3 provides a useful check: a symmetric distribution has its expectation at the centre of symmetry.
(c) Step 1 — compute E(X2).
E(X2)=1⋅91+4⋅92+9⋅93+16⋅92+25⋅91=91+8+27+32+25=993=331
M1 — applying E(X2)=∑x2P(X=x), not [E(X)]2.
A1 — E(X2)=331.
Step 2 — apply the computational form of variance.
Var(X)=E(X2)−[E(X)]2=331−9=331−27=34
M1 — using Var(X)=E(X2)−[E(X)]2.
A1 — Var(X)=34.
Total: 8 marks (M4 A4).
Question (6 marks): The discrete random variable Y has the discrete uniform distribution on {1,2,3,4,5,6}, so that P(Y=y)=61 for each y∈{1,…,6}.
(a) Write down E(Y). (1)
(b) Show that Var(Y)=1235. (3)
(c) The random variable W=2Y−5. Find E(W) and Var(W). (2)
Mark scheme decomposition by AO:
(a)
(b)
(c)
Total: 6 marks split AO1 = 5, AO2 = 1. This is an AO1-dominated question — AQA uses uniform-distribution and linear-transformation questions to test fluency with the algebraic identities for E and Var.
Connects to:
Binomial distribution B(n,p) (own lesson): the binomial is derived by treating n independent Bernoulli trials. Its mean np and variance np(1−p) are special cases of the general E and Var formulae applied to a sum of indicator variables. The general framework here is the foundation; the binomial is one named example.
Normal distribution N(μ,σ2) (own lesson): the parameters μ and σ2 are precisely E(X) and Var(X) for a continuous random variable. The discrete formulae generalise to integrals: E(X)=∫xf(x)dx. Understanding the discrete case first is the conceptual ladder to the continuous case.
Mechanics — impulse and average force: impulse equals change in momentum; over many small collisions, the expected impulse per collision drives the average force. Thinking of force as an expectation links Section R to Newtonian mechanics in a way A* candidates exploit in modelling questions.
Pure — series and sigma notation: E(X)=∑xP(X=x) is a finite series. Manipulating these sums uses the \sum\-notation rules from Pure: ∑(ax+b)P(X=x)=aE(X)+b. Linearity of expectation is a sigma-notation identity in disguise.
Modelling and expected value: in financial-decision contexts (insurance premiums, expected utility), E(X) is the long-run average payoff. Variance Var(X) measures risk. The economics intuition — "high variance = high risk" — is the same object as the AQA computational definition.
Statistical-distribution questions on AQA 7357 split AO marks heavily toward AO1:
| AO | Typical share | Earned by |
|---|---|---|
| AO1 (knowledge / procedure) | 60–75% | Computing ∑P(X=x)=1, applying E(X)=∑xP(X=x), computing E(X2), applying Var(X)=E(X2)−[E(X)]2 |
| AO2 (reasoning / interpretation) | 15–30% | Recognising symmetry to short-circuit E(X), justifying use of linearity E(aX+b)=aE(X)+b, interpreting variance as spread |
| AO3 (problem-solving) | 5–15% | Modelling a real-world scenario as a discrete distribution, choosing whether uniform/binomial/normal applies, justifying assumptions |
Examiner-rewarded phrasing: "since probabilities must sum to 1, we have ∑P(X=x)=1"; "using Var(X)=E(X2)−[E(X)]2"; "by linearity of expectation". Phrases that lose marks: writing E(X2) when meaning [E(X)]2 or vice versa (a costly notational slip); leaving Var(X) as a decimal when the question asks for an exact fraction; failing to verify that probabilities sum to 1 before computing E(X).
A specific AQA pattern to watch: questions phrased "show that k=…" demand that you derive k from the total-probability axiom and state the conclusion explicitly. Computing k but not writing the final equality k=91 can lose the A1.
Question: The discrete random variable X has probability distribution given by P(X=x)=10x for x=1,2,3,4. Find E(X).
Grade C response (~170 words):
E(X)=1⋅101+2⋅102+3⋅103+4⋅104=101+4+9+16=1030=3.
So E(X)=3.
Examiner commentary: Full marks (3/3). The candidate applies the expectation formula correctly, evaluates the sum cleanly, and gives the answer as an exact integer. Working is brief but every step is verifiable. A typical Grade C answer for a procedural question — efficient and correct. One minor point: the candidate could have first verified ∑P(X=x)=1010=1 to confirm the distribution is valid. On longer questions where validity is not given, this check is the difference between full and partial marks. Here it is implicit and the marker accepts the omission.
Grade A response (~210 words):*
First verify the distribution is valid: ∑x=14P(X=x)=101+2+3+4=1010=1. Confirmed.
Apply the definition of expectation:
Subscribe to continue reading
Get full access to this lesson and all 10 lessons in this course.