You are viewing a free preview of this lesson.
Subscribe to unlock all 10 lessons in this course and every other course on LearningBro.
A continuous random variable is described not by a list of probabilities but by a probability density function (pdf): a curve whose area gives probability. This lesson works two of the most important named continuous models in depth — the continuous uniform U(a,b) and the exponential Exp(λ) — deriving their means and variances from first principles, proving the exponential's defining memoryless property, and tying the exponential to the Poisson process that generates it. These two distributions, and the integration techniques used to handle them, underpin almost everything that follows in Further Statistics 2.
This is Paper 3 optional content — Statistics (7367/3S), one of the two applied options a Further-Maths student may choose (the others being Mechanics 7367/3M and Discrete 7367/3D). Paper 3 is 2 hours, 100 marks, weighted AO1 40% / AO2 25% / AO3 35% — markedly more problem-solving-heavy than the compulsory pure papers. This lesson leans on AO1 (carry out integration to find moments and probabilities) and AO2 (interpret the memoryless property; reason about which model fits a context), with AO3 appearing whenever a worded scenario must first be translated into the right distribution. It builds directly on A-Level Maths Statistics (the normal and the idea of a continuous distribution) and on A-Level Maths Pure (definite integration, integration by parts, and improper integrals — the ∫0∞ you meet in Further Calculus).
A continuous random variable X has a probability density function f(x) satisfying
f(x)≥0for all x,∫−∞∞f(x)dx=1.
Probability is area under the curve: P(a≤X≤b)=∫abf(x)dx. A single point carries zero area, so P(X=c)=0 for any constant c; this is why ≤ and < are interchangeable for continuous variables — a fact worth remembering when you read off table values. The cumulative distribution function (cdf) is
F(x)=P(X≤x)=∫−∞xf(t)dt,sof(x)=F′(x),
and moments are weighted integrals of the density:
E(X)=∫−∞∞xf(x)dx,E(X2)=∫−∞∞x2f(x)dx,Var(X)=E(X2)−[E(X)]2.
That last identity — Var(X)=E(X2)−[E(X)]2 — is the engine of almost every variance calculation below. Everything in this lesson is an application of these four formulae to two particular choices of f(x).
If X∼U(a,b), all sub-intervals of equal length inside [a,b] are equally likely, so the density is constant. For the total area to be 1 over a base of width b−a, the height must be 1/(b−a):
f(x)=b−a1,a≤x≤b,f(x)=0 otherwise.
Integrating gives the cdf, a straight ramp from 0 to 1:
F(x)=⎩⎨⎧0b−ax−a1x<aa≤x≤bx>b.
By symmetry the mean is the midpoint, but we derive it for practice:
E(X)=∫abb−axdx=b−a1[2x2]ab=2(b−a)b2−a2=2(b−a)(b−a)(b+a)=2a+b.
For the variance, first compute the second moment:
E(X2)=∫abb−ax2dx=b−a1⋅3b3−a3=3a2+ab+b2,
using b3−a3=(b−a)(a2+ab+b2). Then
Var(X)=E(X2)−[E(X)]2=3a2+ab+b2−(2a+b)2=124(a2+ab+b2)−3(a+b)2=12(b−a)2.
| Property | Formula |
|---|---|
| E(X) | 2a+b |
| E(X2) | 3a2+ab+b2 |
| Var(X) | 12(b−a)2 |
| F(x) on [a,b] | b−ax−a |
Notice that the variance depends only on the width b−a, not on the location of the interval — shifting U(a,b) along the axis changes the mean but never the spread.
The exponential models the waiting time until the next event in a process where events happen at a constant average rate λ per unit time. Its density decays from a peak at 0:
f(x)=λe−λx,x≥0,λ>0.
Integrating from 0 to x gives the cdf, and its complement is the survival function:
F(x)=1−e−λx,P(X>x)=1−F(x)=e−λx,x≥0.
The survival form P(X>x)=e−λx is the one you will reach for most often — "the probability of still waiting after time x."
E(X)=∫0∞xλe−λxdx.
Take u=x, dxdv=λe−λx, so dxdu=1 and v=−e−λx:
E(X)=[−xe−λx]0∞+∫0∞e−λxdx=0+[−λ1e−λx]0∞=λ1.
The boundary term vanishes because xe−λx→0 as x→∞ (exponential decay beats linear growth). A second integration by parts gives E(X2)=2/λ2, hence
Var(X)=E(X2)−[E(X)]2=λ22−λ21=λ21.
The median m solves F(m)=21: 1−e−λm=21⇒e−λm=21⇒m=λln2. The density is largest at x=0, so the mode is 0. For a right-skewed distribution the order mode < median < mean is exactly what we see: 0<λln2<λ1 (since ln2≈0.693).
| Property | Value |
|---|---|
| E(X) | 1/λ |
| Var(X) | 1/λ2 |
| Median | (ln2)/λ |
| Mode | 0 |
| P(X>x) | e−λx |
The exponential is the only continuous distribution that is memoryless: having already waited s, the further wait is distributed exactly as if you had just started.
P(X>s+t∣X>s)=P(X>t).
Proof. Using the conditional-probability definition and the survival function,
P(X>s+t∣X>s)=P(X>s)P(X>s+t∩X>s)=P(X>s)P(X>s+t)=e−λse−λ(s+t)=e−λt=P(X>t).
The event {X>s+t} is a subset of {X>s}, so their intersection is just {X>s+t} — that is the only subtle step. Interpretation: a component with an exponential lifetime "does not age." If a bus is equally likely to arrive in any small instant, then having waited ten minutes already tells you nothing about how much longer you must wait.
A Poisson process of rate λ is the standard model for events occurring randomly and independently at a constant average rate. It has two complementary descriptions:
The link between the first two is exact and worth seeing. "No event has occurred by time x" means zero events in [0,x], and the count there is Po(λx), so
P(X>x)=P(0 events in [0,x])=e−λx0!(λx)0=e−λx,
which is precisely the exponential survival function. So the exponential gap is not an extra assumption — it is forced by the Poisson count. This duality is the single most useful idea in the topic: any question phrased in terms of counts ("how many arrivals in an hour") is Poisson, while the same scenario phrased in terms of time ("how long until the next arrival") is exponential, and you may switch freely between the two descriptions. The Gamma generalises this: the sum of n independent Exp(λ) gaps is the time to the nth event, with density f(x)=(n−1)!λnxn−1e−λx, mean n/λ and variance n/λ2 (the exponential is the case n=1).
Faults occur on a cable in a Poisson process at a rate of 0.5 per km. (a) Find the probability that a 4 km stretch has no faults. (b) Find the probability that the distance to the first fault exceeds 4 km. (c) Comment on your two answers.
(a) Count in 4 km: N∼Po(0.5×4)=Po(2), so
P(N=0)=e−20!20=e−2=0.1353.
(b) Distance to first fault: X∼Exp(0.5), so
P(X>4)=e−0.5×4=e−2=0.1353.
(c) The answers are identical, because "no fault in 4 km" and "first fault beyond 4 km" are the same event described two ways — the exact Poisson–exponential link. Working either route is valid; choose whichever the question makes more natural.
A train is equally likely to arrive at any time between 09:00 and 09:20; let X∼U(0,20) be the minutes after 09:00. You reach the platform at 09:05. Find the probability you must wait more than a further 10 minutes.
You arrived at 5, and "more than 10 further minutes" means the train arrives after 15; since you know it has not yet arrived, condition on X>5:
P(X>15∣X>5)=P(X>5)P(X>15)(M1 conditional set-up) =(20−5)/20(20−15)/20=15/205/20=31.(A1)
(M1 for the conditional ratio with correct events; A1 for 31. Unlike the exponential, the uniform is not memoryless: the answer 31 differs from the unconditional P(X>10)=21.)
Customers arrive at a help desk in a Poisson process at a rate of 3 per hour. Let T (hours) be the time to the next arrival, so T∼Exp(3).
(a) Find P(T≤10 min).
Convert: 10 min =61 hour.
P(T≤61)=1−e−3×1/6=1−e−0.5.(M1 use of F(x)=1−e−λx) =1−0.6065=0.3935.(A1, 4 s.f.)
(b) Given that no customer has arrived in the first 20 minutes, find the probability of waiting a further 10 minutes.
By the memoryless property the extra wait restarts the clock:
P(T>30 min∣T>20 min)=P(T>10 min)=e−3×1/6=e−0.5=0.6065.(M1 memoryless; A1)
(M1 for converting units and applying the correct exponential formula; M1 for invoking memorylessness rather than recomputing a conditional integral; A1 values to 4 s.f.)
The time X (minutes) between successive calls to a switchboard is exponential with median 4 minutes. Find λ, the mean time between calls, and P(X>6).
Median: λln2=4⇒λ=4ln2=0.1733.(M1 median equation; A1 λ) E(X)=λ1=ln24=5.771 min.(B1) P(X>6)=e−λ×6=e−6ln2/4=e−1.5ln2=2−1.5=0.3536.(M1 survival fn; A1)
(M1/A1 for solving the median equation; B1 for the mean; M1/A1 for the survival probability. Neat check: e−1.5ln2=(eln2)−1.5=2−1.5.)
A continuous random variable X has X∼U(0,6). (a) Show that Var(X)=3. (b) Find P(∣X−3∣<1).
(a) Here a=0, b=6:
Var(X)=12(b−a)2=1262=1236=3.(M1 formula; A1)
For a "show that," quoting the formula and substituting is enough; you need not re-derive it from E(X2)−[E(X)]2 unless asked.
(b) The event ∣X−3∣<1 means 2<X<4, a sub-interval of length 2:
P(2<X<4)=6−04−2=62=31.(M1 interpret modulus; A1)
(M1/A1 for the variance; M1 turning the modulus inequality into an interval, A1 for 31. The mean is 3, so the event is "within 1 of the mean.")
(specimen-style — not from any past paper)
The lifetime X, in thousands of hours, of a certain electronic component is modelled by X∼Exp(0.4). (a) State the mean lifetime. (b) Find the probability a component lasts more than 3000 hours. (c) Given that a component has already lasted 2000 hours, find the probability it lasts at least 3000 hours more. (d) Comment on what your answers to (b) and (c) reveal about this model.
(a) E(X)=1/0.4=2.5 thousand hours, i.e. 2500 hours.
(b) P(X>3)=e−0.4×3=e−1.2=0.3012.
(c) By the memoryless property, P(X>5∣X>2)=P(X>3)=e−1.2=0.3012.
(d) The two probabilities are identical: under the exponential model the component "does not age," so prior use gives no information about remaining life. This is a known limitation — real components usually wear out, so an exponential lifetime is realistic only for failures caused by random external shocks rather than gradual wear.
Question: "The waiting time T (minutes) for a lift follows Exp(0.2). Find P(T>10) and interpret it."
Mid-band response. P(T>10)=e−0.2×10=e−2=0.135. Examiner-style commentary: Correct method and value to 3 s.f. The numerical answer earns the marks, but there is no interpretation, so any "comment/interpret" mark is lost.
Stronger response. As above, plus: "So about 13.5% of the time the wait exceeds 10 minutes; since the mean wait is 1/0.2=5 minutes, a wait beyond 10 minutes (twice the mean) is fairly unusual." Examiner-style commentary: The candidate links the probability to the mean and gives a contextual reading — the AO2/AO3 communication examiners reward.
Top-band response. As Stronger, plus: "Because the exponential is memoryless, this 0.135 is also the probability of waiting another 10 minutes given any wait so far; the model therefore assumes the lift's arrival is equally likely in every instant — reasonable for random demand but not if lifts run to a timetable." Examiner-style commentary: Connecting the numerical answer to the defining property and critiquing the modelling assumption is exactly the structured reasoning that separates the top band.
The memoryless property does not just hold for the exponential — it characterises it. Suppose a continuous lifetime X on [0,∞) satisfies P(X>s+t)=P(X>s)P(X>t) for all s,t≥0. Write g(x)=P(X>x); then g(s+t)=g(s)g(t). This is Cauchy's multiplicative functional equation, and its only monotone solutions are g(x)=e−λx for some λ≥0. Hence any memoryless continuous lifetime must be exponential — there is no other choice. (A STEP-style exercise: set h(x)=lng(x), deduce h(s+t)=h(s)+h(t), and argue that a monotone additive function is linear, so h(x)=−λx.)
A second thread: the sum of n independent Exp(λ) variables is Gamma(n,λ), the waiting time to the nth event of a Poisson process — the continuous cousin of "sum of geometrics is negative binomial." At university you would prove this with moment generating functions: the MGF of Exp(λ) is M(t)=λ/(λ−t) for t<λ, and the MGF of a sum of independents is the product, giving [λ/(λ−t)]n — the Gamma MGF. The same tool instantly recovers E(X)=M′(0)=1/λ and E(X2)=M′′(0)=2/λ2, confirming our by-parts results without any integration.
P1. X∼U(2,8). Find P(3≤X≤5) and Var(X). Answer: P=(5−3)/(8−2)=2/6=1/3; Var(X)=(8−2)2/12=36/12=3.
P2. X∼Exp(0.5). Find the median and P(X<2). Answer: median =ln2/0.5=2ln2=1.386; P(X<2)=1−e−0.5×2=1−e−1=0.632.
P3. Buses arrive in a Poisson process, one every 15 minutes on average. Find the probability the next bus takes more than 20 minutes. Answer: rate λ=1/15 per min, so P(X>20)=e−20/15=e−4/3=0.264.
P4. For X∼Exp(λ), show that P(X>2/λ)=e−2 and comment on why this is independent of λ. Answer: P(X>2/λ)=e−λ⋅2/λ=e−2=0.135. The threshold "two means" scales with 1/λ, so the exceedance probability is the same for every exponential — the distribution has a fixed shape.
P5. X∼U(0,a) has variance 12. Find a and E(X). Answer: (a−0)2/12=12⇒a2=144⇒a=12; E(X)=a/2=6.
P6. A machine's time-to-failure is Exp(λ) with P(X>100)=0.8 (hours). Find λ and the mean lifetime. Answer: e−100λ=0.8⇒λ=−1001ln0.8=0.00223; mean =1/λ=448 hours (3 s.f.).
P7. Calls arrive in a Poisson process at 4 per hour. Using the count description, find the probability of exactly 2 calls in the next 30 minutes; using the time description, find the probability the first call comes within 15 minutes. Answer: count in 30 min: Po(4×0.5)=Po(2), P(N=2)=e−222/2!=2e−2=0.271. Time to first: Exp(4), P(X<0.25)=1−e−4×0.25=1−e−1=0.632.
P8. The time to the 3rd arrival in a Poisson process of rate 2 per hour follows Gamma(3,2). State its mean and variance. Answer: mean =n/λ=3/2=1.5 hours; variance =n/λ2=3/4=0.75.
Aligned to AQA A-Level Further Mathematics 7367, Paper 3 Statistics option (7367/3S): continuous random variables, the continuous uniform U(a,b), the exponential Exp(λ) with its memoryless property, and the Poisson-process link. The content is equivalent to the corresponding sections of Edexcel Further Statistics 1 (9FM0/3B) and OCR (H245) Statistics — the distributions and moment formulae are identical across boards; only the option codes differ.
U(a,b): E=2a+b, Var=12(b−a)2Exp(λ): E=λ1, Var=λ21, P(X>x)=e−λx
| Feature | Uniform U(a,b) | Exponential Exp(λ) |
|---|---|---|
| Support | [a,b] (bounded) | [0,∞) (unbounded) |
| Shape | flat (rectangular) | decreasing, right-skewed |
| Mean | (a+b)/2 | 1/λ |
| Variance | (b−a)2/12 | 1/λ2 |
| Memoryless? | No | Yes (the defining property) |
| Typical use | random point in an interval | waiting time between events |
Recap. Probability for a continuous variable is area under its pdf; Var(X)=E(X2)−[E(X)]2 drives every variance derivation. The uniform spreads probability evenly with variance set purely by interval width; the exponential models constant-rate waiting times, has mean 1/λ, survival e−λx, and is the unique memoryless continuous distribution — the bridge from the Poisson process to all the inference that follows.