Roots of Polynomials
One of the most elegant ideas in Further Mathematics is that you can know a great deal about the roots of a polynomial without ever solving it. The coefficients already encode the sum of the roots, the product of the roots, and every other symmetric function of them. The bridge between coefficients and roots is a set of identities usually called Vieta's formulae, and once you can move fluently in both directions — coefficients → symmetric functions, and symmetric functions → a new polynomial — a whole family of exam questions opens up.
This lesson builds that fluency from first principles: where the formulae come from, how to combine them to reach ∑α2, ∑α3, ∑α1 and the like, and how to construct the polynomial whose roots are a transformation of the original ones.
Where this sits in AQA 7367
This is the roots of polynomials / symmetric functions strand of the compulsory pure content examined on Paper 1 and Paper 2 (each 2 hours, 100 marks, 33⅓% of the A-Level). It builds directly on GCSE/A-Level factor and remainder work and on sum/product of quadratic roots, and it feeds forward into the summation of series and complex numbers strands (where conjugate roots and roots of unity are exactly symmetric-function questions in disguise). The routine evaluation of ∑α2 or the construction of a transformed equation is largely AO1; the "show that" and "hence deduce" framings that examiners hang on this topic carry the AO2 reasoning and communication marks.
Core theory: why the coefficients know the roots
Start with the fact you already trust for quadratics. If α and β are the roots of ax2+bx+c=0, then the polynomial factorises as a(x−α)(x−β). Expanding,
a(x−α)(x−β)=a(x2−(α+β)x+αβ)=ax2−a(α+β)x+aαβ.
Comparing this identically with ax2+bx+c forces
−a(α+β)=b⇒α+β=−ab,aαβ=c⇒αβ=ac.
The same factor-and-compare argument generalises to any degree. For the cubic ax3+bx2+cx+d=0=a(x−α)(x−β)(x−γ), it is worth expanding the product carefully once so the pattern is unmistakable. Multiplying the first two brackets,
(x−α)(x−β)=x2−(α+β)x+αβ,
then multiplying by (x−γ):
[x2−(α+β)x+αβ](x−γ)=x3−(α+β+γ)x2+(αβ+βγ+γα)x−αβγ.
Multiplying through by a and comparing with ax3+bx2+cx+d gives α+β+γ=−ab, αβ+βγ+γα=ac, αβγ=−ad. The same expansion, carried one bracket further, yields the quartic relations. The pattern is worth seeing all at once.
Vieta's formulae by degree
| Polynomial | ∑α | ∑αβ | ∑αβγ | product of all roots |
|---|
| ax2+bx+c | −ab | — | — | αβ=ac |
| ax3+bx2+cx+d | −ab | ac | — | αβγ=−ad |
| ax4+bx3+cx2+dx+e | −ab | ac | −ad | αβγδ=ae |
Here ∑αβ means the sum of all distinct pairwise products, and ∑αβγ the sum of all distinct triple products. These are the elementary symmetric polynomials e1,e2,e3,….
The sign pattern. Reading along a row, the signs alternate −,+,−,+,…. A clean way to remember it: the elementary symmetric function ek equals (−1)k times the coefficient of xn−k, divided by the leading coefficient a. So the even symmetric functions (product of pairs, product of all four) come out positive, the odd ones negative.
Building higher symmetric functions
Examiners rarely ask for ∑α directly — they ask for something like ∑α2, ∑α1 or ∑α3, which you must build from the elementary ones. The three identities below cover almost every case.
Sum of squares. Square the sum and subtract off the cross-terms:
∑α2=(∑α)2−2∑αβ.
For two roots this is α2+β2=(α+β)2−2αβ; for three, α2+β2+γ2=(α+β+γ)2−2(αβ+βγ+γα). The reason is simply that (∑α)2 expands to ∑α2+2∑αβ.
Sum of reciprocals. Put the reciprocals over a common denominator:
∑α1=αβγ∑αβ(cubic),α1+β1=αβα+β(quadratic).
The numerator is always "one symmetric function up" from the denominator — for a cubic, e2 over e3.
Sum of cubes. For three roots,
∑α3=(∑α)3−3(∑α)(∑αβ)+3αβγ.
For two roots the analogue is α3+β3=(α+β)3−3αβ(α+β). Both follow from expanding (∑α)3 and re-grouping; you do not need Newton's identities in their full generality at A-Level, but it is worth knowing these are the first two Newton–Girard results.
A fourth identity that exam questions love is the sum of squares of pairwise products, needed whenever you square the roots. For a cubic,
∑α2β2=(αβ)2+(βγ)2+(γα)2=(∑αβ)2−2αβγ∑α.
This is just the sum-of-squares identity applied to the three quantities αβ,βγ,γα: their sum is ∑αβ, and the cross-terms αβ⋅βγ+⋯ factor as αβγ(α+β+γ). You will use it in the specimen question below to build the equation with roots α2,β2,γ2. The general principle behind all four identities is the same: any symmetric function of the roots can be rebuilt from the elementary ones ∑α,∑αβ,αβγ — the only skill is knowing which combination to use.
Worked examples (with mark scheme)
Example 1 — squares and reciprocals of a cubic's roots
The equation x3−6x2+11x−6=0 has roots α,β,γ. Find α2+β2+γ2 and α1+β1+γ1.
Write down Vieta's formulae first (here a=1):
∑α=6,∑αβ=11,αβγ=6.(M1: state all three)
∑α2=(∑α)2−2∑αβ=62−2(11)=36−22=14.(M1 apply identity; A1)
∑α1=αβγ∑αβ=611.(A1)
(M1 for stating the symmetric functions; M1 for using ∑α2=(∑α)2−2∑αβ; A1 for 14; A1 for 611. As a check, this cubic factorises as (x−1)(x−2)(x−3), and indeed 1+4+9=14 and 1+21+31=611 ✓.)
Two habits from this example generalise to every "roots of polynomials" question. First, write the Vieta data down before doing anything else — the line "∑α=6,∑αβ=11,αβγ=6" earns a method mark in its own right and orients the rest of the work. Second, never try to find the roots: the whole power of the method is that ∑α2 and ∑α1 come straight from the coefficients, even for cubics that do not factorise nicely. Here the cubic happens to have roots 1,2,3, which lets us verify the answer, but the method would be identical (and the verification unavailable) for a cubic with irrational or complex roots.
Example 2 — forming a new equation by symmetric functions
The equation 2x2−5x+1=0 has roots α,β. Find the quadratic whose roots are α2 and β2.
α+β=25,αβ=21.(M1)
New sum of roots:
α2+β2=(α+β)2−2αβ=425−1=421.(M1; A1)
New product of roots:
α2β2=(αβ)2=41.(A1)
A monic quadratic with these as sum and product is x2−(sum)x+(product)=0:
x2−421x+41=0⟹4x2−21x+1=0.(A1 clear fractions)
(M1 for the original symmetric functions; M1/A1 for the new sum; A1 for the new product; A1 for a correctly cleared final equation. Multiplying through by 4 to remove fractions is good exam practice.)
Example 3 — a transformation via substitution
The cubic x3−3x+1=0 has roots α,β,γ. Find the cubic with roots 2α,2β,2γ.
The fastest route is substitution. If the new roots are y=2x, then x=2y; substitute into the original equation:
(2y)3−3(2y)+1=0⇒8y3−23y+1=0.(M1 substitute x=y/2)
Multiply through by 8:
y3−12y+8=0.(A1)
(M1 for the correct substitution; A1 for the cleared cubic. Sanity check via symmetric functions: original ∑α=0, so ∑2α=0 — matches the absent y2 term; original αβγ=−1, so (2α)(2β)(2γ)=8(−1)=−8=−(constant) ✓.)
The two routes to a transformed equation — substitution and recomputing symmetric functions — are worth comparing, because the right choice saves time. Substitution (x=f−1(y)) is fastest for linear transformations of the roots: scalings kα (substitute x=y/k) and shifts α+c (substitute x=y−c), as here and in practice problem P3. The symmetric-functions route is unavoidable for non-linear transformations such as α2 or α1, where there is no single substitution that maps each old root to its new one cleanly. A quick test: if the new roots are g(α) for an invertible g, try substitution; otherwise, build the new ∑,∑∑,∏ directly. Recognising which tool fits is part of what the examiner is assessing.
Specimen-style exam question
(specimen-style — not from any past paper)
The cubic equation x3+px2+qx+r=0 has roots α,β,γ. It is given that ∑α=4, ∑αβ=1 and αβγ=−6.
(a) Write down the values of p, q and r.
(b) Find α2+β2+γ2.
(c) Hence find the cubic equation (with integer coefficients) whose roots are α2,β2,γ2.
(a) Matching the monic cubic, p=−∑α=−4, q=∑αβ=1, r=−αβγ=6. So the equation is x3−4x2+x+6=0.
(b) ∑α2=(∑α)2−2∑αβ=42−2(1)=14.
(c) For the new roots α2,β2,γ2 we need the three elementary symmetric functions:
∑α2=14(from (b)),
∑α2β2=(∑αβ)2−2αβγ∑α=12−2(−6)(4)=1+48=49,
α2β2γ2=(αβγ)2=(−6)2=36.
So the required cubic is y3−(∑α2)y2+(∑α2β2)y−α2β2γ2=0, namely
y3−14y2+49y−36=0.
The middle identity ∑α2β2=(∑αβ)2−2αβγ∑α is the "sum of squares" identity applied to the pairwise products αβ,βγ,γα; it is the step that separates a top answer from a stalled one.
The structure of part (c) is the canonical "new equation from squared roots" task, and it is worth internalising the recipe: to build the cubic with roots α2,β2,γ2 you need its three elementary symmetric functions, which are ∑α2 (sum), ∑α2β2 (pair-sum) and α2β2γ2=(αβγ)2 (product). Each is obtained from the original symmetric functions by a standard identity — ∑α2=(∑α)2−2∑αβ, ∑α2β2=(∑αβ)2−2αβγ∑α, and squaring the product. Assemble them into y3−(sum)y2+(pair-sum)y−(product)=0, minding the alternating signs. Candidates who memorise "squared roots need these three identities" rarely get stuck; those who try to find the roots first almost always do.
Synoptic links
- A-Level Maths (sum/product of quadratic roots, factor theorem). Everything here is that idea promoted to higher degree; α+β=−b/a and αβ=c/a are the degree-2 special case.
- Complex numbers. For a real-coefficient polynomial, non-real roots occur in conjugate pairs — itself a statement about symmetric functions (the coefficients are real, so symmetric functions of the roots are real). Roots of unity satisfy zn−1=0, whose symmetric functions you can read straight off: ∑zk=0, ∏zk=(−1)n−1.
- Summation of series. Newton's identities link power sums ∑αk to the elementary symmetric functions, mirroring the standard-results algebra of the next lesson.
- Matrices (eigenvalues). The characteristic polynomial's coefficients are symmetric functions of the eigenvalues: tr(A)=∑λ and det(A)=∏λ. The same Vieta's machinery reappears there.
Mark-scheme literacy
- "Write down" (e.g. the values of the symmetric functions) signals one mark for a stated result — quote −b/a directly without re-deriving it.
- "Hence" demands you use the previous part. In the specimen question, part (c) must reuse ∑α2=14 from (b); recomputing from scratch can forfeit the "hence" mark.
- "Show that" requires every line of working visible and an explicit final statement; a bare correct answer scores the A-mark but loses the M-marks for unseen method.
- Follow-through (ft) is common here: if you mis-state one symmetric function but then apply the identities correctly, the method marks can follow through to a "correct-by-your-values" answer.
- Give integer coefficients when asked — always clear fractions (multiply through), as in Examples 2 and 3.
Grade-band model answers
Question: "The cubic x3−6x2+11x−6=0 has roots α,β,γ. Show that α3+β3+γ3=36."
Mid-band response.
∑α=6, ∑αβ=11, αβγ=6.
∑α3=63−3(6)(11)+3(6)=216−198+18=36.
Examiner-style commentary: Correct symmetric functions and the right identity, reaching 36 for full marks on the computation. It quotes the cubic-sum identity without justification, which is acceptable here but leaves the candidate exposed if they misremember a coefficient.
Stronger response. As above, but the identity is stated and named first: "Using ∑α3=(∑α)3−3(∑α)(∑αβ)+3αβγ …", then the substitution.
Examiner-style commentary: Stating the general identity before substituting is the AO2 communication examiners look for in a "show that" — the reader can follow the logic, not just the arithmetic.
Top-band response. As Stronger, plus a verification: "This cubic factorises as (x−1)(x−2)(x−3), so the roots are 1,2,3 and 13+23+33=1+8+27=36, confirming the result."
Examiner-style commentary: The independent check via the explicit roots demonstrates genuine command and would catch any slip in the identity. This self-checking habit is what protects accuracy marks across a whole paper.
Common misconceptions
- "∑αβ is the product of all the roots." No — for a cubic it is the sum of the three pairwise products αβ+βγ+γα. The product of all roots is αβγ.
- "The sign of ∑α is +b/a." It is −b/a. The alternating signs trip many candidates; only the even symmetric functions are positive.
- "α2+β2=(α+β)2." You must subtract the cross-term: (α+β)2−2αβ. Forgetting the −2αβ is the single most common error in the topic.
- "For 2x2−5x+1, α+β=5." You must divide by the leading coefficient: α+β=−(−5)/2=25, not 5. Vieta's formulae need the polynomial in the form with a explicit.
- "To get the equation with roots 2α,…, substitute x=2y." It is the inverse: new root y=2x means x=y/2. Substituting the wrong way scales in the wrong direction.
- "A repeated root counts once in the symmetric functions." It counts with multiplicity. A double root α=β contributes to ∑α twice and to αβ as α2.
- "You must find the roots to find ∑α2." The whole point is that you never solve the polynomial — symmetric functions come straight from the coefficients. This matters most when the roots are irrational or complex and cannot be found neatly; the method still works unchanged.
- "The new equation's constant term is +(product of new roots)." For a monic cubic y3+py2+qy+s, the constant is s=−(product) — the alternating sign applies to the new equation just as to the old. Reassembling with the wrong sign is a frequent final-step error.
Common errors
- Sign slips on the alternating pattern — writing αβγ=+d/a for a cubic instead of −d/a.
- Not dividing by a when the leading coefficient is not 1 (Example 2's 2x2−⋯).
- Dropping the −2αβ (or −2∑αβ) in the sum-of-squares identity.
- Substituting the transformation the wrong way round (x=2y instead of x=y/2).
- Forgetting to clear fractions, leaving x2−421x+41=0 when integer coefficients were asked for.
- Mismatching the constant-term sign when reassembling the new polynomial — the product of roots enters as −(product) for a monic cubic's constant.
Going further (stretch)
The identities you used are the first cases of Newton's identities (the Newton–Girard formulae), which give every power sum pk=∑αk recursively in terms of the elementary symmetric functions e1,e2,…. For three roots they read
p1=e1,p2=e1p1−2e2,p3=e1p2−e2p1+3e3,
and in general, for k≤n,
pk=e1pk−1−e2pk−2+⋯+(−1)k−1kek.
These let you climb to ∑α4, ∑α5, … without ever expanding a fourth power by hand — exactly the kind of efficient recursion a STEP or MAT question rewards. A typical Oxbridge-flavoured task: given a cubic with ∑α=0, show that p3=3e3 and hence that the three roots of x3+px+q=0 satisfy α3+β3+γ3=−3q. (With e1=0, Newton's recursion collapses to p3=3e3=3(−q)=−3q.)
A second deep idea is the Fundamental Theorem of Symmetric Polynomials: every symmetric polynomial in the roots can be written as a polynomial in the elementary symmetric functions e1,…,en — and therefore in the coefficients. This is why questions like "find ∑α2β2" always have an answer expressible in the coefficients: it is guaranteed in advance, not a happy accident. The theorem even tells you the result will be a polynomial (no division needed) when the symmetric function is itself a polynomial in the roots — which is why ∑α2 comes out polynomial in the coefficients, whereas ∑α1 (not polynomial in the roots) involves dividing by αβγ.
A third connection reaches into the rest of the course. The discriminant of a polynomial — the quantity that detects repeated roots — is itself a symmetric function: for a quadratic, (α−β)2=(α+β)2−4αβ=a2b2−4ac, recovering the familiar b2−4ac. For a cubic the analogous (α−β)2(β−γ)2(γ−α)2 is a (more elaborate) symmetric function of the roots, hence expressible in b,c,d; computing it is a satisfying STEP-style challenge. And the same elementary-symmetric machinery reappears verbatim for matrix eigenvalues, where the characteristic polynomial's coefficients are tr(A)=∑λ and det(A)=∏λ — so the algebra you drill here pays off directly in the matrices strand. Recognising one idea (coefficients are symmetric functions of roots) behind quadratics, cubics, discriminants and eigenvalues is exactly the kind of unifying insight that marks out a strong Further-Maths candidate.
Additional A* practice (with worked answers)
P1. The cubic x3+2x2−5x−6=0 has roots α,β,γ. Find ∑α, αβγ and ∑α2.
Answer: ∑α=−2, ∑αβ=−5, αβγ=6. Then ∑α2=(−2)2−2(−5)=4+10=14. (It factorises as (x−2)(x+1)(x+3): 4+1+9=14 ✓.)
P2. The roots of x2−3x+1=0 are α,β. Find α3+β3.
Answer: α+β=3, αβ=1. α3+β3=(α+β)3−3αβ(α+β)=27−3(1)(3)=18.
P3. Given that α,β,γ are the roots of x3−4x2+x+6=0, find the cubic whose roots are α+1,β+1,γ+1.
Answer: New roots y=x+1⇒x=y−1. Substitute: (y−1)3−4(y−1)2+(y−1)+6=0. Expanding, (y3−3y2+3y−1)−4(y2−2y+1)+(y−1)+6=y3−7y2+12y+0=0, i.e. y3−7y2+12y=0. (Check: original roots are −1,2,3; shifted roots 0,3,4 give y(y−3)(y−4)=y3−7y2+12y ✓.)
P4. The quartic x4−2x3+3x2−4x+5=0 has roots α,β,γ,δ. Find ∑α and ∑α2.
Answer: ∑α=−(−2)/1=2; ∑αβ=3. So ∑α2=(∑α)2−2∑αβ=4−6=−2. (A negative sum of squares is fine — the roots are not all real.)
P5. The cubic x3+px+q=0 (no x2 term) has roots α,β,γ. Show that α2+β2+γ2=−2p.
Answer: Here ∑α=0 (no x2 term) and ∑αβ=p. Hence ∑α2=(∑α)2−2∑αβ=0−2p=−2p. Note this forces p≤0 for the roots to be all real, since a sum of real squares cannot be negative — a neat consequence worth observing.
Board-alignment footer
Aligned to AQA A-Level Further Mathematics 7367, compulsory pure (Papers 1 & 2): relations between roots and coefficients for quadratics, cubics and quartics, symmetric functions of roots, and forming equations with transformed roots. Directly equivalent to Edexcel (9FM0) Core Pure "roots of polynomials" and OCR (H245) "polynomials and rational functions" — the Vieta machinery is identical across boards.
Visual summary
∑α=−ab,∑αβ=ac,αβγ=−ad;∑α2=(∑α)2−2∑αβ
| Task | Method |
|---|
| Read symmetric functions | Vieta's formulae from coefficients (mind the −,+,− signs) |
| ∑α2 | (∑α)2−2∑αβ |
| ∑1/α (cubic) | αβγ∑αβ |
| ∑α3 (cubic) | (∑α)3−3(∑α)(∑αβ)+3αβγ |
| New equation, roots f(α) | substitute x=f−1(y), then clear fractions |
Recap. The coefficients of a polynomial are its symmetric functions of roots, up to sign. Master the three Vieta relations and the building identities (∑α2, ∑1/α, ∑α3, ∑α2β2), and you can answer any "roots of polynomials" question without ever solving the equation. Choose substitution for linear transformations of the roots and the symmetric-functions route for non-linear ones, mind the alternating signs when reassembling a new equation, and verify with a quick numerical check whenever the roots happen to be findable.