You are viewing a free preview of this lesson.
Subscribe to unlock all 10 lessons in this course and every other course on LearningBro.
This lesson introduces continuous random variables — variables that can take any value in an interval (or the whole real line). Unlike a discrete variable, a continuous variable assigns zero probability to any single value; instead, probability is area under a probability density function (PDF), computed by integration. This is the calculus-meets-statistics heart of 7367/3S, and it sets up the expectation, CDF and named-distribution work that follows.
This belongs to the Paper 3 Statistics option (7367/3S) (AO weighting AO1 40% / AO2 25% / AO3 35%). Setting up and evaluating the defining integrals is AO1; finding unknown constants and probabilities across piecewise boundaries is AO3; interpreting "density is not probability" is AO2. The prerequisite is A-Level Mathematics integration (definite integrals, areas) and the discrete-variable language from Lesson 1 — every formula here is the integral analogue of a discrete sum.
For a discrete variable, P(X=x) can be positive. For a continuous variable,
P(X=x)=0for every single value x,
because a single point has zero width, and area over zero width is zero. Probability is assigned to intervals:
P(a≤X≤b)=∫abf(x)dx,
where f is the PDF — the continuous analogue of the probability mass function, with ∫ replacing ∑.
Because endpoints carry no probability, for a continuous X all four are equal: P(a≤X≤b)=P(a<X<b)=P(a≤X<b)=P(a<X≤b).
| Property | Requirement |
|---|---|
| Non-negativity | f(x)≥0 for all x |
| Normalisation | ∫−∞∞f(x)dx=1 |
Crucially, f(x) is a density, not a probability — it may exceed 1. Only the area is a probability, and the total area is 1. For example f(x)=3 on [0,31] is a valid PDF: f=3>1, but ∫01/33dx=1.
The simplest model is X∼U(a,b) with constant density
f(x)=b−a1,a≤x≤b,
giving, by the standard results,
| Quantity | Value |
|---|---|
| E(X) | 2a+b |
| Var(X) | 12(b−a)2 |
Probabilities are just length ratios: P(c≤X≤d)=b−ad−c for a≤c≤d≤b.
Because probability is area under the PDF, a quick sketch is one of the most powerful checks available in this topic. Three habits pay off repeatedly.
First, the total area is always 1, so the height of the density is constrained by the width of its support. A uniform density on an interval of width 2 must have height 21; a uniform density on an interval of width 31 must have height 3. This is why a density can legitimately exceed 1: a narrow support forces a tall curve to keep the area at 1. Whenever you find a constant k, sanity-check that the resulting curve encloses unit area.
Second, the shape of the curve tells you where probability concentrates. For f(x)=83x2 on [0,2], the density rises steeply toward x=2, so most of the probability sits in the right-hand part of the interval — which is exactly why P(1≤X≤2)=87 is so much larger than P(0≤X≤1)=81. A glance at the graph predicts that imbalance before any integration. For the triangular density of Example 3, the symmetric "tent" shape immediately tells you the variable is most likely to land near the middle, x=1, and that P(0.5≤X≤1.5) (the central region) should be a clear majority of the area — confirmed as 0.75.
Third, a sketch fixes the limits of integration. Drawing the support and shading the required region is the surest way to avoid the single most common error in this topic: integrating over the wrong range. For a probability such as P(X>1.5) on a support [0,2], the picture makes it obvious that the limits are 1.5 to 2, not 1.5 to ∞.
These ideas extend to modelling a real continuous quantity. Suppose the time X (in minutes, 0≤X≤5) that a customer waits at a counter is modelled by a density that decreases linearly from a maximum at x=0 to zero at x=5 — capturing "short waits are common, long waits are rare". Such a density has the form f(x)=c(5−x); imposing ∫05c(5−x)dx=1 gives c[5x−2x2]05=c(25−12.5)=12.5c=1, so c=252=0.08. The probability of waiting under one minute is then P(X<1)=0.08[5x−2x2]01=0.08(4.5)=0.36. Choosing a sensible algebraic form for a density to match a described behaviour, then fixing its constant by normalisation, is a recurring AO3 modelling skill.
Every probability you compute in this lesson is an integral of the density from the left up to some point, and that observation leads directly to the cumulative distribution function F, studied in detail in the next lesson:
F(x)=P(X≤x)=∫−∞xf(t)dt.
F(x) is the running total of probability accumulated as you sweep from left to right. It starts at 0 (no area has accumulated below the support) and climbs to 1 (all the area, at and beyond the right end of the support). Two features follow immediately from "probability is area". First, F is non-decreasing: moving the upper limit to the right can only add more (non-negative) area. Second, because F is an integral of f, the Fundamental Theorem of Calculus gives f(x)=F′(x) — the density is the rate at which probability accumulates. Where the density is tall, probability piles up quickly and F rises steeply; where the density is zero (outside the support), F is flat.
This perspective re-derives the interval formula painlessly. Since the area from a to b is the area up to b minus the area up to a,
P(a≤X≤b)=∫abf(x)dx=F(b)−F(a).
For our worked density f(x)=83x2 on [0,2], accumulating area from 0 gives F(x)=∫0x83t2dt=8x3 for 0≤x≤2 (and 0 below, 1 above). Then P(1≤X≤2)=F(2)−F(1)=1−81=87, recovering the Example-1 result instantly — and the median will satisfy F(m)=0.5, the cleanest possible route to it. Keeping the area picture in mind throughout the continuous-distribution lessons is the thread that connects the PDF, the CDF, probabilities, the median and the quartiles into a single coherent story.
Piecewise densities deserve a second, more searching example, because they concentrate several of the skills above. Consider
f(x)=⎩⎨⎧kxk(4−x)00≤x≤22<x≤4otherwise,
a symmetric "tent" peaking at x=2. To find k, integrate each piece and set the total to 1:
∫02kxdx+∫24k(4−x)dx=k[2x2]02+k[4x−2x2]24=2k+k[(16−8)−(8−2)]=2k+2k=4k=1,
so k=41. The geometry confirms it: the graph is a triangle of base 4 and height k⋅2=21, area 21⋅4⋅21=1. Now compute P(1≤X≤3), a region straddling the join at x=2, so the integral must be split:
P(1≤X≤3)=∫124xdx+∫2344−xdx=41[2x2]12+41[4x−2x2]23=41(2−21)+41[(12−4.5)−(8−2)]=41(1.5)+41(1.5)=0.75.The symmetry of the tent about x=2 provides an instant check: the central band [1,3] is symmetric about the peak, the two half-integrals are equal (0.375 each), and the result 0.75 is the natural "central three-quarters". Recognising symmetry both shortcuts the arithmetic and guards against error — if your two pieces had come out unequal, you would know at once that a slip had crept in. Piecewise densities reward neatness: label each piece, integrate within its own limits, and split any probability that crosses a boundary.
So far every support has been a bounded interval, but a continuous variable can be defined on an unbounded range — for instance [0,∞) — provided the area is still finite and equal to 1. This requires the density to decay fast enough as x→∞, and it brings improper integrals into play. Consider
f(x)=ke−2xfor x≥0,f(x)=0 otherwise.
For normalisation,
∫0∞ke−2xdx=k[−21e−2x]0∞=k(0−(−21))=2k=1⟹k=2,
where the upper limit uses e−2x→0 as x→∞. This is the exponential distribution Exp(2), a continuous model you will study in its own right; here it simply illustrates that the PDF rules carry over unchanged to infinite supports — non-negativity holds (2e−2x>0) and the (improper) integral equals 1. Probabilities follow the same area principle:
P(X>1)=∫1∞2e−2xdx=[−e−2x]1∞=e−2=0.135,P(X≤0.5)=[−e−2x]00.5=1−e−1=0.632.
The key practical point is that the limit at infinity must be handled by recognising that the exponential term vanishes — writing the upper boundary as a limit and evaluating it. Densities on (−∞,∞), such as the normal, work the same way, with the area in both tails contributing. Far from being exotic, infinite-support densities are central to statistics: they model waiting times, lifetimes and measurement errors, and they obey precisely the framework — non-negative density, unit area, probability by integration — established in this lesson.
The continuous uniform distribution deserves a worked application, because its "all values equally likely" structure makes it the natural model for rounding and for a value known only to lie in an interval. Suppose a train is equally likely to arrive at any moment in the 15-minute window after its scheduled time; let X be the delay in minutes, so X∼U(0,15) with constant density f(x)=151 on [0,15].
Subscribe to continue reading
Get full access to this lesson and all 10 lessons in this course.