You are viewing a free preview of this lesson.
Subscribe to unlock all 10 lessons in this course and every other course on LearningBro.
Price discrimination occurs when a firm charges different prices to different consumers (or in different sub-markets) for the same or a very similar product, where the price differences are not justified by differences in cost. The italicised qualifier matters: charging more for a first-class rail seat that genuinely costs more to provide is not price discrimination; charging a commuter more than a leisure traveller for an identical seat is. Price discrimination is a strategy available only to firms with market power — price-makers — and its purpose is to convert consumer surplus into extra producer surplus (profit) by tailoring the price to each buyer's willingness to pay. This lesson sets out the three necessary conditions, distinguishes the first, second and third degrees, and develops the central diagram the specification requires: third-degree discrimination across two sub-markets with different price elasticities, alongside the combined picture. It closes with a careful, two-sided welfare analysis, because the effect of price discrimination on total welfare is genuinely ambiguous — one of the richest evaluation themes in the whole course.
AQA A-Level Economics (7136) — this lesson sits within 4.1.5 The market mechanism, market failure and government intervention in markets: price discrimination by firms with market power, its conditions, types and welfare effects. It builds on the monopoly and revenue lessons and links to oligopoly pricing and to competition policy.
Assessment Objectives developed here:
| AO | Skill | In this lesson |
|---|---|---|
| AO1 | Knowledge | Define price discrimination and its three conditions; distinguish first/second/third degree; state the MR1=MR2=MC rule |
| AO2 | Application | Apply to named markets; identify which group faces the higher price and why |
| AO3 | Analysis | Use the two-sub-market diagram to show different prices, and chain the effect on profit and consumer surplus |
| AO4 | Evaluation | Judge the ambiguous welfare effects — output, surplus distribution, equity and efficiency — conditional on context |
Three conditions must all hold for price discrimination to be possible and profitable.
| Condition | Explanation | Why it matters |
|---|---|---|
| Market power | The firm must be a price-maker facing a downward-sloping demand curve | A perfect competitor is a price-taker and cannot set different prices — it must accept the market price |
| Separable markets / no resale | The firm must be able to identify and separate consumer groups, and prevent arbitrage (resale between them) | If the low-price group could resell to the high-price group, the high price would collapse |
| Different price elasticities of demand | The separated groups must have different PEDs (different willingness to pay) | If every group had the same elasticity, a single price would already be optimal — there is nothing to exploit |
The second condition — preventing resale — is the practical linchpin, and it explains which products are price-discriminated in the real world. Services are ideal because they are consumed on the spot and cannot be resold: a haircut, a cinema seat, a flight, a meal, a gym session. The firm can demand proof of eligibility (a student card, age, a railcard) at the point of consumption, and the buyer cannot pass the cheaper service on to someone who would have paid more. Physical goods are far harder to discriminate over, because a buyer who gets a low price can resell to a high-price buyer, arbitraging away the difference — which is why "student discounts" are common for services but rare for, say, laptops. Anything that segments buyers and blocks resale — geography, timing, identity, the requirement to be present — is therefore the firm's essential toolkit.
graph LR
A["All three conditions met?"] --> B["Market power (price-maker)"]
A --> C["Markets separable + no resale"]
A --> D["Different PED across groups"]
B --> E["Set MR1 = MR2 = ... = MC"]
C --> E
D --> E
E --> F["Higher price where demand inelastic; lower where elastic"]
Exam Tip: Examiners reward candidates who explain why resale must be prevented, not merely that it must. The clearest framing is arbitrage: if the cheap-bought unit can be sold on at the high price, the two "separate" markets merge back into one and the strategy fails. This is why the discriminating firm spends effort segmenting buyers (railcards, age checks, fenced-off cabin classes) — segmentation is the act of blocking arbitrage.
It is also worth seeing the three conditions as jointly necessary rather than a checklist to recite. Strip out any one and the strategy collapses: without market power the firm is a helpless price-taker; with easy resale the low price leaks to high-value buyers and the premium evaporates; with identical elasticities a single price is already optimal and there is nothing to gain from splitting the market. The conditions also explain why discrimination is a matter of degree in the real world rather than all-or-nothing. Market separation is rarely perfect — some students lend their railcards, some shoppers cross borders to exploit price differences — so firms invest continuously in tightening the fences: photo-ID railcards, region-locked digital goods, non-transferable tickets tied to a named traveller, and contracts that forbid resale. The strength of those fences, and the firm's ability to observe the characteristic that proxies for elasticity (age, timing, location, purchase history), set the practical limit on how finely a firm can discriminate — which is why the data-rich digital economy has pushed discrimination far closer to its theoretical first-degree extreme than the offline world ever allowed.
The firm charges each individual consumer the maximum they are willing to pay — their reservation price — for each unit. This extracts all consumer surplus and turns it into producer surplus.
| Feature | Detail |
|---|---|
| Definition | Every unit sold at the highest price that buyer will pay |
| Consumer surplus | Eliminated entirely — wholly transferred to the firm |
| Output | The allocatively efficient level: the firm produces up to where P=MC for the last unit, because it need not cut price on earlier units to sell one more |
| Deadweight loss | None — output reaches the efficient level (an unexpected efficiency result) |
| Practicality | Essentially impossible in pure form — it requires perfect knowledge of every buyer's willingness to pay |
| Approximations | Individually negotiated prices: auctions, haggling at a car dealership, bespoke quotes; increasingly, personalised pricing from consumer data |
The striking theoretical point — examined surprisingly often — is that perfect price discrimination is allocatively efficient. Because the firm captures the whole of each unit's value, it has no reason to restrict output to protect a single price, so it produces right up to P=MC and the monopoly deadweight loss disappears. The catch is distributional: efficiency is achieved by transferring all the surplus from consumers to the producer, which is why the result is admired for its efficiency yet condemned on equity grounds.
The firm charges different prices for different quantities or versions, and lets consumers self-select by how much, or which version, they buy. The firm does not need to know each buyer's type in advance — the menu sorts them.
| Method | How it works | Illustrative example |
|---|---|---|
| Bulk discounts | Lower unit price for larger purchases | "Buy more, pay less per unit" multibuys |
| Block tariffs | Successive blocks of consumption priced differently | Utility tariffs with one rate for the first block and another beyond it |
| Bundling | Several products sold together below the sum of separate prices | Media and software suites sold as packages |
| Two-part tariffs | A fixed access fee plus a per-unit charge | A membership fee plus per-use charges |
The most common form: the firm splits consumers into identifiable groups by an observable characteristic (age, time of purchase, location, occupation) and charges each group a different price reflecting its elasticity.
| Group | Typically charged | Reason (PED) |
|---|---|---|
| Students | Lower | More elastic — lower incomes, more price-sensitive, more substitutes |
| Senior citizens | Lower | Often elastic — frequently on fixed incomes, flexible timing |
| Peak-time commuters | Higher | Inelastic — must travel at fixed times, few alternatives |
| Off-peak/leisure travellers | Lower | Elastic — can choose when to travel |
| Business travellers | Much higher | Very inelastic — employer pays, timing fixed, trip essential |
| Children | Lower | Families are price-sensitive; attracting them brings paying adults |
A multi-market discriminator maximises profit by allocating output so that marginal revenue is equal across all sub-markets and equal to marginal cost:
MR1=MR2=⋯=MC.
The logic is an arbitrage-of-effort argument: if MR were higher in market 1 than market 2, the firm could raise profit by shifting a unit from market 2 to market 1, and it keeps doing so until the marginal revenues are equalised. Combining this with the relationship between MR and elasticity gives the headline result: the sub-market with the more inelastic demand is charged the higher price, and the more elastic sub-market the lower price. Intuitively, inelastic buyers will keep buying even when charged more, so the firm extracts more from them; elastic buyers must be tempted with a lower price or they walk away.
Exam Tip: The single most common omission is failing to link the price difference to elasticity. Do not merely say "students pay less"; say "students pay less because their demand is more elastic, so a lower price is needed to retain them, whereas the inelastic group will tolerate a higher price." That causal link, ideally with the MR1=MR2=MC rule, is what secures the analysis marks.
The specification expects the standard three-panel picture: two sub-markets with different demand elasticities side by side, plus the combined market that determines the common marginal cost. The firm faces a single marginal cost (it makes one product), so it sets a common MR(=MC) and then equalises that marginal revenue across both sub-markets — charging a higher price in the inelastic market and a lower price in the elastic market.
Reading the panels together: the marginal cost line is the same height in both (one product, one MC), and the firm produces in each sub-market up to where that market's MR meets this common MC. Because demand in sub-market A is steeper (more inelastic), its MR meets MC at a point that supports a higher price Pa; because demand in sub-market B is flatter (more elastic), its lower MR meets the same MC at a lower price Pb. The result, Pa>Pb with MRa=MRb=MC, is the geometric statement of "charge the inelastic market more." Compared with a single-price monopoly that lumped both groups together, the firm now earns more profit and may serve more total output — because it picks up elastic-market buyers (at the low price) who would have been excluded by a single, higher price.
It is worth dwelling on why a discriminating firm often produces more than a single-price monopolist, because this is the crux of the favourable welfare argument. A single-price monopolist must charge one price to everyone, and in choosing it faces a trade-off: a high price extracts a lot from the inelastic buyers but drives away the elastic ones, while a low price wins the elastic buyers but sacrifices revenue from the inelastic ones. Whatever single price it picks is a compromise that excludes some willing buyers — the elastic-market customers whose willingness to pay lies below the single price but above marginal cost. Those excluded trades are precisely the monopoly deadweight loss: mutually beneficial exchanges that simply do not happen. Price discrimination unbundles this compromise. By charging the inelastic group a high price and the elastic group a separate low price, the firm can serve both — capturing the inelastic buyers' high willingness to pay while still selling, at a lower price, to elastic buyers it would otherwise have turned away. Output rises towards the efficient level and the deadweight loss shrinks. The distributional sting is that the extra surplus created, and much of the surplus that consumers previously enjoyed, ends up with the firm. This is the single most important analytical insight of the topic: discrimination can be efficiency-improving (more trades occur) and distributionally regressive (the firm captures the gains) at the same time, which is exactly why its welfare verdict cannot be settled without specifying the context.
Suppose marginal cost is constant at £4 and the firm faces two sub-markets with the hypothetical marginal-revenue functions MRa=20−2Qa (inelastic business segment) and MRb=12−2Qb (elastic leisure segment). Profit maximisation sets each MR equal to MC = £4:
20−2Qa=4⟹Qa=8;12−2Qb=4⟹Qb=4.
Subscribe to continue reading
Get full access to this lesson and all 10 lessons in this course.