We will prove Fundamental Theorem of Algebra which says that every nonconstant polynomial with complex coefficients has a complex root. Equivalently, every polynomial with complex coefficients can be broken completely into linear factors. For example, a polynomial of degree can be written in the form
where
The theorem says that once we have added the number , satisfying
to real numbers there is no further algebraic kind of number which is needed in order to solve polynomial equations. At first this is not obvious. Quadratic equations forced us to leave the real line: the equation
has no real solution, so we introduced
. But why should a cubic, quartic, or much larger polynomial not force us to introduce a new number beyond the complex numbers? Why should the process stop at
? The proof below answers exactly that question.
The basic idea is this. Suppose there really were a larger algebraic number system containing roots which are not complex numbers. Suppose that some irreducible real polynomial really did require a root beyond the complex numbers. Put all of its roots and create one extension . We will study the arithmetic symmetries of
: the ways of permuting its roots while preserving addition, multiplication, and all real numbers. The symmetry argument does not itself produce the final contradiction. Its purpose is to reduce the possible counterexample step by step. It shows that, if there were anything genuinely new beyond the complex numbers, then there would have to be a field
with
such that
In words: there would have to be one more quadratic layer after the complex numbers. The final calculation then deals directly with a proposed new element of this quadratic layer. After completing the square, that element would satisfy
But every complex number already has a complex square root. Thus
for some
, and
Therefore
so
. The supposedly new element was already complex. Thus the whole proof reduces every possible alleged number beyond
to a quadratic equation which already splits in
.
Only two facts about the real numbers are used from analysis. Every real polynomial of odd degree has a real root. Every positive real number has a real square root. Both facts follow from the intermediate value theorem. Everything else below is algebra. This proof in modern terms belongs to Galois theory. We will calculate directly with roots, coordinate systems, sums over symmetries, products over symmetries, and the fields fixed by those symmetries.
Before beginning the details, here is the route of the proof. We explain what it means for a number system (field) to have finite size over . We prove that there is no nontrivial finite enlargement of
of odd size. We prove that the only nontrivial quadratic enlargement of
is
. We build a temporary finite number system containing all roots of a given real polynomial. We count the arithmetic-preserving ways of replacing its roots by other roots. We show that the symmetry group must have size a power of
and must be generated by one repeated symmetry. We show that a larger system would force a quadratic enlargement of
. We prove directly that no such enlargement exists. At the end, we first obtain the result for real polynomials and then pass to arbitrary complex coefficients.
Finite extensions
A field is a number system in which we can add, subtract, multiply, and divide by every nonzero element. The real numbers and the complex numbers
are fields. Suppose that
is a field containing another field
. We say that
has degree
over
if there are elements
such that every element of
can be written in one and only one way as
We write The notation looks abstract, but its meaning is very concrete:
is the number of coordinates needed to describe an element of
. For example, every complex number has the form
The two numbers
are enough to describe every complex number, and the description is unique. Therefore
So one may think of the complex numbers as a two-dimensional coordinate system over the real numbers.
Suppose that is a field and that
is irreducible of degree
. We create a formal symbol
satisfying
Every polynomial expression in
can be reduced to the form
Why? The equation allows us to replace
by a combination of lower powers. Repeating this process reduces every higher power. We also need to know that nonzero expressions can be divided. Let
After reducing powers, assume that
The polynomials
and
have no common factor, because
is irreducible and has larger degree. The Euclidean algorithm therefore gives polynomials
such that
Substitute
. Since
, we get
Thus
is the inverse of
. So the formal expressions form a field. We denote it by
It has degree
Suppose that we have three fields Assume that
needs
real coordinates and that
needs
coordinates from
. In symbols, suppose
Then
This is the multiplication rule for degrees. To see it directly, choose numbers
which form a coordinate system for
over
, and choose numbers
which form a coordinate system for
over
. We claim that the
products
form a coordinate system for
over
. First, they span
. Take any
. Since the
form a basis over
, we can write
Each coefficient
belongs to
, so it can itself be written as
Putting these two expressions together gives
Thus the products span all of
. Now suppose that
Group the expression according to the
.
The
are independent over
, so every coefficient must vanish:
for every
But the
are independent over
. Hence every
is zero. So the products are independent. This proves the multiplication rule.
We will repeatedly use the following simple idea. If a number system has only finitely many real coordinates, then sufficiently many powers of any element must satisfy a relation. Suppose that and let
. Look at the
numbers
Since
has only
independent real coordinate directions, these
numbers cannot all be independent over
. Therefore there are real numbers
, not all zero, such that
In other words, satisfies some polynomial equation with real coefficients. Among all nonzero real polynomials which vanish at
, choose one of smallest degree. Call it
This is called the minimal polynomial of
over
. Suppose that
with both
and
having smaller positive degree. Substituting
gives
Therefore either
or
But this would produce a polynomial of smaller degree which vanishes at
, contradicting the choice of
. Therefore:
is irreducible over
Suppose that Then the powers
are independent over
. Indeed, a relation among them would be a polynomial equation for
of degree smaller than
, which is impossible. They also span every expression made from
. The equation
allows us to rewrite
as a linear combination of lower powers. Repeating this process reduces every higher power
to a combination of
Thus the number system generated by
, written
, has degree
This is important because it turns a polynomial degree into a concrete count of coordinates.
Odd-degree extension of
We now prove the first major restriction. There is no nontrivial finite extension of having odd degree. Suppose that
is a finite extension containing
and suppose that
is odd. We want to show that no genuinely new number can occur in
.
Assume, for contradiction, that there is some Let
be its minimal polynomial. We have already seen that
Since
the multiplication rule gives
Therefore
divides the odd number
. So
is itself odd.
But every real polynomial of odd degree has a real root. Thus there is some such that
Therefore
for some real polynomial
. This says that
is reducible. But the minimal polynomial was irreducible. Contradiction. Hence no such
exists. Therefore
So there are no finite extensions of
of odd degrees.
This is our first major piece of information. Any finite algebraic extension of the real numbers must have even degree.
Quadratic Extensions of
Now let us examine the smallest possible nontrivial extension. Suppose that Choose some
Then
form a basis of
over
. Since every element of
has the form
, the square
must have the form
Rearranging gives This is just a quadratic equation. Complete the square. Define
Then
Write
There are three possibilities. If
, then
for some real
. Hence
Since
is a field, one of the factors must be zero. Thus
So
, which would imply
, contrary to our choice of
. If
, then
Again this gives
which is again impossible. The only remaining possibility is
Write
Then
Define
Then
Since every element of
has the form
, it also has the form
Therefore
We have proved: Every nontrivial quadratic extension of is equal to
In particular, there is only one way to leave the real line by solving a quadratic equation.
Complex square roots:
The next fact looks familiar, but we will prove it carefully because it is the final obstruction which prevents any further quadratic extension. Let We claim that there is a complex number
such that
If
, then
is real. If
, then
is a square root of
. If
, then
is a square root of
. Now suppose that
Define
Since
, we have
In particular,
Define
Then
Also,
Since
we have
Therefore
Hence
Now calculate:
Thus every complex number has a complex square root. This fact means that no equation of the form forces us to leave the complex numbers.
Splitting field
We now begin the main part of the proof. Let be an irreducible real polynomial. We want to prove that its degree is at most
. At this point we are not allowed to assume that
already has complex roots, because that is exactly what we are trying to prove. Instead, we will temporarily build a larger algebraic number system in which all roots of
exist. Think of this as a calculation device. We introduce formal symbols which obey polynomial equations, and then calculate with them exactly as we would calculate with ordinary numbers. Start with the real numbers. Adjoin one root of
, then adjoin another root which is still missing, then another, and continue until all roots of
are present. After finitely many steps we obtain a field
which contains every root of
. Because each step has finite degree, the multiplication rule shows that
is finite. This field
is called a splitting field of
. We will not use any mysterious property of it. We will only use two simple facts: It contains every root of
. It has finitely many real coordinates. Now work inside the finite extension
.
Galois Group
A Galois symmetry is a map which preserves the ordinary arithmetic rules:
and which leaves every real number unchanged:
Such a map cannot destroy a real polynomial equation. If
then
Therefore
is another root of the same real polynomial. So the map permutes roots of a polynomial. For example, in
there are symmetries fixing
:
and
The second map fixes all real numbers and sends
It does not change addition or multiplication:
Let be the collection of all symmetries of
which fix
. We can combine two such symmetries by applying one after the other. We can also reverse one, because a symmetry is injective, and every injective map from a finite set to itself is automatically onto. Thus each symmetry has an inverse. So
is a finite group of invertible arithmetic symmetries.
The first important claim is: Here
means the number of symmetries. The number of arithmetic-preserving ways to move the roots around is exactly the number of real coordinates in the field.
Write the construction of as a sequence
At the
-th stage, we adjoin a root
:
Let
The multiplication rule gives
We now show that the number of symmetries is exactly the same product. Suppose that a symmetry has already been defined on
. The element
satisfies a minimal polynomial
of degree
. Since
is a root of the original polynomial
, the polynomial
divides
when we work over
. Apply the already-defined symmetry to the coefficients of
. This creates another polynomial of degree
. Its roots are still roots of
, and all roots of
lie in
. Therefore there are exactly
possible choices for the image of
. Once we choose one of these roots, the extension of the map is forced. Indeed, every element of
has the form
If we decide that
goes to some root
, then this element must go to
Thus each stage has exactly
possible choices. Multiplying the choices gives
So we have proved:
Fixed Fields
Let be a collection of symmetries which is closed under composition and inverses. In other words,
is a smaller symmetry subgroup inside
. Consider all elements of
which are unchanged by every symmetry in
:
These fixed elements form a field. Indeed, if , then for every
,
If
, then
so
Thus
as well. The fixed field
contains exactly the numbers which the symmetries in
are unable to move.
For any , define the sum over
by
and define the product over
by
These two expressions are fixed by every symmetry in
. For example, let
. Then
As
runs through
, so does
. Therefore
The same reordering argument gives
Thus
This is the general version of familiar complex-number formulas. If
consists of the identity and conjugation, then
and
We now prove the central fact: . Once this is proved, the multiplication rule gives
Since
and
, we obtain
The quotient
is usually written
and means the number of equally sized blocks into which
divides
.
Every element of fixes the field
. Choose finitely many elements
which generate
over
. We can build
in stages:
At each stage, the image of a new generator has at most as many choices as the degree of its minimal polynomial over the previous field. Therefore the number of possible
-preserving symmetries is at most
Since every symmetry in
is one of these possible maps,
Now write We will prove that
can be described using at most
coordinates from the fixed field
. The key claim is that the maps
are linearly independent as functions. Suppose, for contradiction, that there is a relation
for every , where the coefficients
are not all zero. Choose such a relation with as few nonzero coefficients as possible. It cannot have only one nonzero term, because each
is a nonzero map. So at least two maps occur. Choose two different maps, say
Because they are different maps, there is some
such that
Apply the relation to
instead of
:
Since each
preserves multiplication,
Thus
Now subtract
times the original relation:
The coefficient of
is now zero. But the coefficient of
is nonzero because
So we have produced a nonzero relation with fewer nonzero terms. This contradicts our minimal choice. Therefore the maps
are linearly independent.
Now form the column vector
As ranges over
, these vectors span all of
. Otherwise there would be a nonzero row vector
which annihilates every
, producing a forbidden relation
Therefore we can choose elements
such that the matrix
is invertible. Take any . Since
is invertible, there are unique coefficients
such that
Now apply any to this system. Applying
changes every row indexed by
into the row indexed by
. But as
runs through
, so does
. Thus applying
merely rearranges the rows of the system. The transformed system therefore has exactly the same equations in a different order. Since the solution is unique,
Hence
Now look at the row corresponding to the identity map. It gives
So every element of
is a linear combination of
with coefficients in
. Therefore
Combining the two inequalities gives This tells us exactly how the number of symmetries controls the size of the fixed field.
Sylow Theorems
We know that Write
where
is odd. We will prove that
In other words, the number of symmetries must be a power of
.
Choose a subgroup having the largest possible size among all subgroups whose sizes are powers of
. Write
We claim that
. Assume instead that
Consider the left cosets of
in
:
There are
such cosets. This number is even because
. The group
acts on these cosets by left multiplication:
Every orbit has a number of elements dividing
, so every orbit has power-of-two size. In particular, every orbit with more than one point has even size. The total number of cosets is even. Therefore the number of one-point orbits must also be even. One one-point orbit is certainly
itself. So there must be another fixed coset
Being fixed means that
for every
Equivalently,
Thus
carries the subgroup
back to itself under conjugation. Let
be the collection of all elements of
with this property:
The quotient
has even size because there are an even number of fixed cosets. In any finite group with even size, there is a nonidentity element whose square is the identity. Indeed, pair every element with its inverse. Elements which are not their own inverses occur in pairs. If the identity were the only self-inverse element, the total number of elements would be odd. Therefore there must be another self-inverse element. Thus there is some
such that
Now consider the union
Because
preserves
under conjugation and because
, this union is closed under multiplication. It is therefore a subgroup. It contains two distinct cosets of
, so it has size
But this is a larger power-of-two subgroup than
, contradicting the choice of
. Therefore
Now consider the fixed field The fixed-field counting rule gives
But
is odd, and we proved that there are no nontrivial odd-degree extensions of
. Hence
Therefore
We have proved:
Thus the possible size of the group of symmetries is a power of two and no odd prime factor can occur.
Now suppose that has index
.
The fixed-field counting rule gives
But every quadratic extension of
is
. Therefore
Different subgroups give different fixed fields, so there can be only one subgroup of index
.
We now use a fact about groups. Let be a maximal proper subgroup of a finite group whose size is a power of
. “Maximal proper” means that
is not the whole group, but there is no subgroup strictly between
and the whole group. Using the same coset-counting argument as above, one shows that
must be normal: every group element carries
back to itself under conjugation. Therefore the quotient group
is defined. The quotient has power-of-two size and has no nontrivial proper subgroup, because
was maximal. The only power-of-two group with no nontrivial proper subgroup has two elements. Hence
Therefore every maximal proper subgroup has index
.
But has only one subgroup of index
. Call it
. Choose
We claim that repeated application of
gives every element of
:
Suppose not. Then the subgroup generated by
, written
would be proper. Since the group is finite, it would lie inside some maximal proper subgroup. But every maximal proper subgroup has index
, and there is only one such subgroup, namely
. This would imply
contradicting the choice of
. Therefore
Suppose, for contradiction, that Since
is cyclic and has power-of-two size, write
Let
generate
. Consider the subgroup
The quotient
has four elements, so the fixed-field counting rule gives
Call this four-dimensional field
Inside
lies the unique quadratic field
Indeed, the subgroup generated by
has index
, so its fixed field has degree
over
, hence is
. Therefore
Since and
the multiplication rule gives
So a larger degree extension would force a nontrivial quadratic extension of the complex numbers.
We now show directly that such a thing cannot exist. Suppose that Choose some
Then
form a basis for
over
. Therefore
Complete the square by putting
Then
Write
We proved that every complex number has a complex square root. Therefore there is some
such that
Thus
Factor the left-hand side:
Since
is a field, one factor must vanish. Therefore
In either case,
But then
contradicting the choice of
. Therefore: There is no nontrivial quadratic extension of
This contradicts the conclusion of the previous section. Hence Since
is a power of
, we must have
Because
we obtain
Return to the irreducible polynomial We built a finite field
containing all of its roots. We proved that
Let
be any root of
. Then
Therefore
But
Hence
Thus every irreducible polynomial in
has degree
or
Every real polynomial can therefore be factored into real linear factors and real quadratic factors. Every real quadratic splits over
, by the quadratic formula. Therefore every polynomial with real coefficients splits completely into linear factors over
.
Now let We want to prove that
has a complex root. Form the polynomial obtained by conjugating each coefficient:
Now multiply:
The polynomial
has real coefficients. To see this, conjugate all of its coefficients. Since conjugation reverses
and
, we get
Hence every coefficient of
is real. We have already proved that every nonconstant real polynomial has a complex root. So there is some
such that
Therefore
or
In the second case, take complex conjugates:
Thus
has a complex root in either case. After finding one root, divide by the corresponding linear factor and repeat. Therefore every complex polynomial splits completely into linear factors.
Summary of the Proof:
Take a real irreducible polynomial and put all of its roots in a finite splitting field . The symmetry group of
has as many elements as
has real coordinates. Odd-degree extensions add no new elements, so the odd part of the symmetry group disappears. The only possible first quadratic layer over
is
. The uniqueness of that quadratic layer forces the remaining symmetry group into one cyclic chain. If the splitting field were larger than
, the chain would contain a degree-four field
over
, and therefore a quadratic extension of
:
But a proposed new element of such an extension can be completed to a square:
Since
already has a complex square root, this forces
and hence the proposed new element to lie in
. Thus no new layer remains after the complex numbers.
The precise conclusion is that there is no finite algebraic extension of obtained by adjoining roots of polynomial equations with complex coefficients. That is exactly what it means for
to be algebraically closed.