An Introduction To Galois Theory
An Introduction To Galois Theory
An Introduction To Galois Theory
mathematicians: teacher
webinars
30 April (Primary), 1 May
(Secondary)
menu search
An Introduction to Galois
Theory
This is a short introduction to Galois theory. The level of this article is necessarily quite high compared
to some NRICH articles, because Galois theory is a very difficult topic usually only introduced in the final
year of an undergraduate mathematics degree. This article only skims the surface of Galois theory and
should probably be accessible to a 17 or 18 year old school student with a strong interest in
mathematics. There is a short and very vague overview of a two important applications of Galois theory
in the introduction below. If you want to know more about Galois theory the rest of the article is more in
depth, but also harder.
The two most important things to know about in order to understand the in depth part of the article are
complex numbers and group theory. If you've not come across complex numbers before you can read An
Introduction to Complex Numbers , which should be accessible to 15 or 16 year old students. If you
haven't come across group theory before, don't worry. I introduce the idea of a group below, although it
might be better to try and find a book or web site that goes into more detail.
1 Introduction
1.1 Motivation
Galois theory is a very big subject, and until you are quite immersed in mathematical study in a way
which is unusual unless studying for a degree in maths, it can seem quite pointless. However, there are
two problems which provide some motivation for studying Galois theory - the existence of polynomials
which aren't soluble by radicals, and some results about classical Euclidean geometry, for example that
you cannot trisect an angle using a ruler and compass, and that certain regular polygons cannot be
constructed using a ruler and compass.
The first problem is this, given a polynomial p(x) with rational coefficients, for example
p(x) = x + 3x + 1, can you express the roots of p(x) using only rational numbers, multiplication,
2
division, addition, subtraction and the operation of raising a number to the power 1/n for n an integer?
So, for example, we can solve ax2 + bx + c = 0 using only these operations, because we know that
the solutions are:
− − −−−−−
2
−b ± √ b − 4ac
x =
2a
The coefficients a, b, c are all rational, and we have only used multiplication, division, addition,
subtraction and square root (which is raising to the power of 1/2).
Definition When we can find the solutions for a polynomial with rational coefficients using only rational
numbers and the operations of addition, subtraction, division, multiplication and finding nth roots, we say that
p(x) is soluble by radicals.
Using Galois theory, you can prove that if the degree of p(x) (i.e. the highest power of x in p(x)) is less
than 5 then the polynomial is soluble by radicals, but there are polynomials of degree 5 and higher not
soluble by radicals. In other words, polynomials of degree 5 whose solutions cannot be written down
using nth roots and the arithmetical operations, no matter how complicated.
1.2 History
So, why is Galois theory called Galois theory? The answer is that it is named after a French
mathematician Evariste Galois (1811-1832) who did some very important work in this area. He had a very
dramatic and difficult life, failing to get much of his work recognised due to his great difficulty in
expressing himself clearly. For example, he wasn't admitted to the leading university in Paris, the Ecole
Polytechnique , and had to make do with the Ecole Normale . He also met with difficulty because of his
political sympathies, he was a republican. This led to him being expelled from the Ecole Normale when
he wrote a letter to a newspaper criticising the director of the school. He joined a republican branch of
the militia and was later imprisoned (twice) because of his membership. The second time whilst in
prison he fell in love with the daughter of the prison physician, Stephanie-Felice du Motel and after being
released died in a duel with Perscheux d'Herbinville . The reasons for the duel are not entirely clear, but it
seems likely it had something to do with Stephanie. His death started republican riots and rallies which
lasted for several days.
Although Galois is often credited with inventing group theory and Galois theory, it seems that an Italian
mathematician Paolo Ruffini (1765-1822) may have come up with many of the ideas first. Unfortunately
his ideas were not taken seriously by the rest of the mathematical community at the time. There are
some links at the end of this document for anyone interested in finding out more about the history of
group theory and Galois theory.
1.3 Overview
Galois theory is concerned with symmetries in the roots of a polynomial p(x). For example, if
p(x) = x − 2 then the roots are ±√2. A symmetry of the roots is a way of swapping the solutions
2
around in a way which doesn't matter in some sense. So, √2 and −√2 are the same because any
polynomial expression involving √2 will be the same if we replace √2 by −√2. For example, we know
2
that √2 + √2 + 1 = 3 + √2. Or α2 + α + 1 = 3 + α when α = √2. However, the same equation
is true when α = −√2, and this will be true for any expression involving only adding and multiplying
√2.
The way the result about solubility by radicals above is proved (using Galois theory) is to prove a result
about the collection of symmetries among the roots of a polynomial given that the roots are built up
using only the special operations above. (It turns out that the collection of symmetries must form what
is called a soluble group. More on this near the end of this article.) Then you find a polynomial for which
the symmetries of the roots does not have this special property, so you know that the roots couldn't be
built up from the special operations.
The subject of the rest of this article is making precise what we mean by a symmetry of the roots and
about the structure of the collection of these symmetries.
1.4 Notation
Throughout this article, I'll use the following notation. The set of integers will be written Z, so writing
n ∈ Z means that n is in Z , the set of integers, i.e. n is an integer. The set of rational numbers is Q , the
extensions at each stage. Don't be too put off by this seemingly alien language, every word is explained
as it is introduced. The best strategy for reading it is to go slowly and make sure you understand exactly
what every word means before going on to the next section, because that word will be used again and
again, and if you don't quite understand it then everything will just get more and more confusing as you
read on. However, if you are reading this online you can simply click on any of the underlined words and
the original definition will pop up in a small window.
Definition (Group):
A group G is a collection of objects with an operation ⋅ satisfying the following rules (axioms):
(1) For any two elements x and y in the group G we also have x ⋅ y in the group G.
(2) There is an element (usually written 1 or e, but sometimes 0) called the identity in G such that
for any x in the group G we have 1 ⋅ x = x = x ⋅ 1.
(3) For any elements x, y, z in G we have (x ⋅ y) ⋅ z = x ⋅ (y ⋅ z) (so it doesn't matter what order
we do the calculations in). This property is called associativity ; it means we can write x ⋅ y ⋅ z
unambiguously (otherwise it would not be clear what we meant by x ⋅ y ⋅ z: would it be x ⋅ (y ⋅ z)
or (x ⋅ y) ⋅ z?).
(4) Every element x in G has a unique inverse y (sometimes written −x or x−1 ) so that
x ⋅ y = y ⋅ x = 1.
For example, the integers Z are a group with the operation of addition (we write this group(Z, +) or
sometimes, lazily, just Z). We can check thefour axioms: (1) If n, m are integers then n + m is an
integer, so we're OK here. (2) n + 0 = n = 0 + n so 0 is the identity for the integers. (3)
(n + m) + p = n + m + p = n + (m + p) so + is associative. (4) n + (−n) = 0 = (−n) + n so
we have inverses.
However, the integers are not a group with multiplication, because the identity on the integers with
multiplication is 1, and there is no integer n with 2n = 1.
Another way of thinking about it, for those who are happy with the ideas of sets and functions, is to
define the symmetric group on a set X to be SX = {f : X → X|f is invertible} with the
operation that for the functions f , g ∈ SX we have the function f ⋅ g defined to be
(f ⋅ g)(x) = f (g(x)) . The symmetric group above, Sn = S{1,2,…,n} , is the symmetric group on a
At this point, you may want to check you've followed so far. See if you can prove that Sn is a group and
that it has n! elements. If you're happy with the idea of sets and functions then you can prove that SX is
a group even if X is an infinite set.
2.2 Fields
Definition (Field):
A field F is a bit like a group, but we have two operations, usually written ⋅ and +. F is a field if F
has elements 0 and 1 such that F with the operation + is agroup (i.e. (F , +) is a group), the set F
without the element 0 is a group with the operation ⋅ (i.e. (F ∖ {0}, ⋅) is a group) and we have
relations like (x + y) ⋅ z = x ⋅ z + y ⋅ z (we say that ⋅ is distributive over +), 0 ⋅ x = 0 = x ⋅ 0,
x ⋅ y = y ⋅ x and x + y = y + x (which isn't always true for a group) and so on.
The definition of a field above is quite abstract, all it means is that a field is a set in which you can
add, subtract and multiply any elements, and you can divide by any element other than 0.
A good example of a field is the real numbers or the rational numbers. (Check the axioms.)
A less obvious example of a field (the important example for Galois theory) is Q[√2]. This is the set of
all numbers which can be written a + b√2 for a and b rational numbers. It is not immediately obvious
that this is a field, because we do not know, for example, if 1/(a + b√2) can be written as c + d√2.
However, you can always do this. If x = 1/(a + b√2) then (multiplying the top and bottom by
a − b√2):
a − b√2
x =
(a + b√2)(a − b√2)
And (a + b√2)(a − b√2) = a2 − 2b2 = p say. So we have that x = a/p − (b/p)√2. So Q[√2]
really is a field (the other axioms are clearly true, check them if you like).
If α is an algebraic number then Q[α] is a field. We can think of Q[α] in two ways. Firstly, as the set of
elements a0 + a1 α + … + an−1 αn−1 where each ai is a rational number and n is the smallest
integer such that there is a polynomial p(x) of degree n with p(α) = 0. The second way is that Q[α] is
the smallest field extension of Q containing α, this is explained in the next section. You can try to prove
that Q[α] is a field if you like, but you need to know a theorem called the Remainder Theorem.
2
This gives us lots of examples of fields. For example, Q[√2] is a
3 3 3
= {a + b√2 + c√2 : a, b, c ∈ Q}
field.
You can extend this idea to define, for α, β both algebraic, Q[α, β] to be the set of all expressions like
2αβ , α + α β , and so on.
2
To test yourself, you might like to see if you can show that Q[α, β] = Q[α][β] (the right hand side
makes sense because Q[α][β] = K[β] where K = Q[α] which is a field). This shows that Q[α, β] is a
field.
This gives us even more examples of fields, for example
Q[√2, √3] = {a + b√2 + c√3 + d√6 : a, b, c, d ∈ Q} .
The example above, Q[√2] is a field extension of Q since if a ∈ Q then a + 0√2 ∈ Q[√2], so
Q ⊆ Q[√2] . More generally we have that Q[α] is a field extension of Q for α an algebraic number.
You can check that for the function f above really does satisfy all the conditions.
The idea of a field automorphism is that it is just a way of relabelling the elements of the field without
changing the structure at all. In other words, we can replace the symbol √2 with the symbol −√2, do
all our calculations and then change the symbol −√2 back to √2 and we get the right answer. Field
automorphisms are the right way of expressing this idea, because the conditions that
f (x + y) = f (x) + f (y) preserve multiplication, addition and so on.
Definition (F-Automorphism):
More specifically, if we have a field extension K of a field F , then an F -automorphism of K is an
automorphism f of K with the additional property that f (x) = x for all x in F .
This is the precise way of defining the symmetry of the roots that I talked about above, because the F -
automorphism leaves all elements of F unchanged and only relabels the new elements we added to
form K. It turns out that for Q[√2] the function f I defined above is the only Q-automorphism other
than the obvious g(x) = x .
If p(x) is any polynomial (with rational coefficients, as always), K/Q is a field extension, and f is a Q-
automorphism of K then f (p(x)) = p(f (x)), see if you can prove this.
The reason this is useful is that it shows that a Q-automorphism of a splitting field K of a polynomial
p(x) rearranges the roots of p(x). If p(α) = 0 then p(f (α)) = f (p(α)) = f (0) = 0, so f (α) is then
a root of p(x).
In fact, we can go further than this and show that knowing how a Q-automorphism of a splitting field
rearranges the roots of p(x) is enough to tell us precisely what that Q-automorphism does to every
element of the splitting field. However, not every rearrangement of the roots of p(x) comes from a Q-
automorphism. For example, if p(x) = x4 − 5x2 + 6 (which we showed has splitting field
K = Q[√2, √3] ) which has roots ±√2 and ±√3 then there is no Q -automorphism f of K with
2
. Suppose there was, then f (√2) because f preserves
2
f (√2) = √3 = f (√2 ) = f (2) = 2
2
multiplicative structure and f (x) for rational x. But if f (√2) then f (√2) , i.e.
2
= x = √3 = √3
So now we can see why a Q-automorphism of a splitting field gives us exactly the right idea of a
symmetry of the roots which doesn't matter (i.e. doesn't change the structure at all).
At this point, you may want to see if you can find the splitting field and the Q-automorphisms of
p(x) = x
2
(two Q-automorphisms), and if you know about complex numbers, you could try x4
− 5 − 1
(f ⋅ g)(x) = f (g(x)) - check that this really is a group). It is called the Galois group of the field
extension F over Q , usually written Gal(F /Q). If F is the splitting field of a polynomial p(x)
then G is called the Galois group of the polynomial p(x), usually written Gal(p).
So, the group G is the same as C2 , the cyclic group of order 2, or S2 , the symmetric group of order 2,
because we have a single element f with f 2 = f ⋅ f = 1 the identity on the group.
As an exercise, you might like to find the Galois group of p(x) = ax2 + bx = c. [Hint: there are two
cases to consider, b2 − 4ac = r2 for some rational r or b2 − 4ac ≠ r2 for any rational r.]
If you know a bit about complex numbers (specifically, roots of unit) and you're quite adventurous, you
might like to try and show that for p(x) = xq − 1 with q a prime number, Gal(p) = Cq−1 the cyclic
group of order q − 1 .
If you know about subgroups, you can use the fact that the Q-automorphisms of a splitting field
rearrange the roots (and that the rearrangement of the roots alone tells us what the Q-automorphism
is) is to show that Gal(p) ≤ Sn where n is the degree of p(x). In particular, all polynomials have finite
Galois group.
4 Solubility by Radicals
To go any further into Galois theory would, unfortunately, be too complicated. I'll sketch the rest of the
proof of the existence of polynomials that are not soluble by radicals.
Third, you prove that the Galois group of any radical field extensionis soluble. This is the hardest part by
a long, long way. In fact, I'm not even going to attempt to explain what a soluble group is here, because
it would take too long.
Fourth, you prove that the group S5 (the symmetric group on 5 elements) is not soluble. If you know a
bit of group theory, this isn't very difficult.
Fifth, you find a polynomial p(x) whose Galois groupis S5 . The splitting field of this polynomial cannot
be a radical field extension (because all radical field extensions have soluble Galois groups, so the roots
of p(x) cannot be built up from +, −, ×, / and the nth roots.
5 Trisecting Angles
As I mentioned above, you can use Galois theory to show that it is impossible to trisect all angles using
ruler and compass methods. I'll outline a proof that you cannot construct an angle of 20∘ using ruler
and compasses (and so you cannot trisect an angle of 60 ).
∘
−−
αi ∈ Q[√α1 , … , √αi−1 ]. We'll call a field extension that looks like this a constructible field
− −−−
−−−−−−
extension . So, for example, Q[√2] is a constructible field extension, and so is Q[√1 + √2] ,
−−−−−− − −−−−−
because you can write Q[√1 + √2] = Q[√2, √ 1 + √2 ] .
It's not obvious that any constructible number must lie in a field extension of this form, but we can sort
of see why because given line segments of length x, y, it is possible to construct other line segments of
length x + y, xy and 1/x using geometric constructions. Moreover, you can construct a line segment of
length √x using only geometric constructions. In fact, you can also show that these are the only things
you can do with geometric constructions. (If you want to try, the way to prove this is to use the fact that
all you can do with unmarked rulers and compasses is to find the intersection between two lines, which
only gives you arithmetical operations, find the intersection between a line and a circle, which gives you
square roots, and intersections between circles and circles, which gives you square roots.) Can you see
why this means that a number in a constructible field extension (as defined above) can be constructed
using only an unmarked ruler and compass, and that only numbers in constructible field extensions can
be made in this way?
Next, you show that if you have a cubic polynomial p(x) = ax3 + bx2 + cx + d whose roots are not
rational numbers then the roots are not constructible? This isn't very difficult to prove but requires some
knowledge beyond what I'm assuming for this article.
Here's the clever part. Suppose you could construct a 20∘ angle, then the number cos(20∘ ) would be
constructible (you can just drop a perpendicular from a point on a line at 20 to the horizontal, distance
∘
1 from the origin). However, you can show that α = cos(20 ) is a root of the equation
∘
8x − 6x − 1 = 0 (by expanding cos(60 ) in terms of cos(20 ) using the addition formula). It is easy
3 ∘ ∘
to show that this has no rational roots, and so the roots are not constructible. This means that we
couldn't have constructed a 20∘ angle, because then we would be able to construct cos(20∘ ) which is
impossible. So a 60∘ angle cannot be trisected.
You can use methods like this to prove other results about what shapes can or can't be constructed and
so forth.
6 Further Reading
1. http://mathworld.wolfram.com/CubicEquation.html (lots about solving polynomials of degree 3, quite
hard)
2. http://mathworld.wolfram.com/QuarticEquation.html (lots about solving polynomials of degree 4,
quite hard)
3. http://mathworld.wolfram.com/Group.html (information about group theory, quite hard but lots of
links to interesting things about group theory)
4. http://members.tripod.com/~dogschool/ (long introduction to group theory, seems quite good and not
too difficult)
5. https://mathshistory.st-andrews.ac.uk/HistTopics/Development_group_theory/ (history of work on
group theory, quite a lot about Galois theory)
6. https://mathshistory.st-andrews.ac.uk/HistTopics/Abstract_groups/ (history of the development of the
concept of a group)
7. https://mathshistory.st-andrews.ac.uk/Biographies/Galois/ (biography of Galois, whose life story is very
dramatic - involving duels and political riots)
8. https://mathshistory.st-andrews.ac.uk/Biographies/Abel/ (biography of Abel, another important person
in the development of Galois theory)
9. https://mathshistory.st-andrews.ac.uk/Biographies/Ruffini/ (biography of Ruffini, who is the first person
to have come up with a proof that there are quintic equations which are not soluble by radicals,
although his work was little recognised at the time)
10. http://mathworld.wolfram.com/Trisection.html (trisecting angles, no proofs)
11. http://mathworld.wolfram.com/ConstructiblePolygon.html (constructible polygons, no proofs)
12. http://www.cut-the-knot.org/arithmetic/rational.shtml (constructible numbers, with proofs)
13. http://www.cut-the-knot.com/arithmetic/cubic.shtml (trisecting angles, with proofs)
The NRICH Project aims to enrich the mathematical experiences of all learners. To support this aim, members of the
NRICH team work in a wide range of capacities, including providing professional development for teachers wishing
to embed rich mathematical tasks into everyday classroom practice.