rings
Similar to group theory, ring theory studies specific rings. Almost all ring theory deals with commutative rings; non-commutative rings are comparatively niche (though we’ll explore them later).
Here’s my notes on ring theory.
-
November 27, 2023.
Introduction to ring theory
Questions:
- What is a ring?
- What kinds of elements are in a ring?
- What is an integral domain, and what is a field?
- How do rings compare to groups?
- What operations can be done on rings?
In 1892, David Hilbert, while working in algebraic number theory, coined the term “Zahlring” (number ring), which was later shortened to “Ring” and finally translated into English as “ring” to refer to the structure we’re about to introduce.
(R,+,⋅) is a ring, defined as some underlying set R whose elements have a notion of addition (+) and multiplication (⋅). There are various definitions of a ring (see the appendix) but for our purposes we choose the strongest definition, where the following axioms must hold for all rings:
- (R,+) forms an (additive) abelian group with identity 0 (called the zero element).
- (R,⋅) forms a (multiplicative) commutative monoid with identity 1 (called unity).
- ⋅ distributes over +.
One fact that holds for all rings is the binomial theorem, which we can derive from these axioms. We adopt a common notation for repeated addition and repeated multiplication:
- kr (where k is an integer) is r added to itself k times.
- r^k (where k is a nonnegative integer) is r multiplied by itself k times.
Binomial Theorem: For any r,s∈R for a ring R, (r+s)^n = ∑_{k=0}^{n} C(n,k) r^k s^{n−k} for all nonnegative n, where C(n,k) denotes the binomial coefficient.
Proof by induction on n.
- Base case n=0: (r+s)^0 = 1 = C(0,0) r^0 s^0. Note that this only exists because we have a multiplicative identity 1.
- Inductive case n>0: The inductive hypothesis gives (r+s)^{n−1} = ∑_{k=0}^{n−1} C(n−1,k) r^k s^{n−1−k}. Note that commutativity of the product lets us express every term of this sum as c · r^i s^j for some integers i,j with i+j = n−1 and integer coefficient c = C(n−1,i).
- Now observe:
  (r+s)^n = (r+s)(r+s)^{n−1}
  = r(r+s)^{n−1} + s(r+s)^{n−1}  (by distributivity)
  = r ∑_{i=0}^{n−1} C(n−1,i) r^i s^{n−1−i} + s ∑_{j=0}^{n−1} C(n−1,j) r^j s^{n−1−j}  (by IH)
  = ∑_{i=0}^{n−1} C(n−1,i) r^{i+1} s^{n−1−i} + ∑_{j=0}^{n−1} C(n−1,j) r^j s^{n−j}
  = ∑_{i=1}^{n} C(n−1,i−1) r^i s^{n−i} + ∑_{j=0}^{n−1} C(n−1,j) r^j s^{n−j}
  = ∑_{i=0}^{n} C(n−1,i−1) r^i s^{n−i} + ∑_{j=0}^{n} C(n−1,j) r^j s^{n−j}  (since C(n−1,−1) = C(n−1,n) = 0)
  = ∑_{i=0}^{n} [C(n−1,i−1) + C(n−1,i)] r^i s^{n−i}
  = ∑_{i=0}^{n} C(n,i) r^i s^{n−i}  (by Pascal’s identity)
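As a quick sanity check, here’s a sketch in Python that verifies the identity inside a specific commutative ring, taking Z_12 (which even has zero divisors) as the stand-in; the helper name `binomial_check` is invented for illustration.

```python
from math import comb

def binomial_check(r, s, n, m):
    """Verify (r+s)^n == sum of C(n,k) r^k s^(n-k) in the ring Z_m."""
    lhs = pow(r + s, n, m)
    rhs = sum(comb(n, k) * pow(r, k, m) * pow(s, n - k, m) for k in range(n + 1)) % m
    return lhs == rhs

# The identity holds for every pair of elements of Z_12.
assert all(binomial_check(r, s, 5, 12) for r in range(12) for s in range(12))
```

Note that the proof only used the ring axioms, so the check passes regardless of which modulus we pick.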
In this section, we go over some examples of rings.
Possibly the ring that is the most ring of them all is the integers Z. In fact the axioms above are basically modeled after integer addition and multiplication, which is why it is very easy to check that Z is a ring.
More interestingly, Zp, the integers mod p, form a ring. Just as with integers, addition and multiplication work with the elements ā of Zp (called residue classes), where ā = b̄ if a and b differ by a multiple of p.
Other examples include the rationals Q, the real numbers R, and the complex numbers C. There is also the zero ring (or trivial ring) 0, the ring containing just a zero element 0.
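To make residue classes concrete, here’s a toy Python model of Zn (the class name `Mod` is invented for this sketch; it stores a canonical representative and compares modulo n):

```python
class Mod:
    """A residue class in Z_n, stored as its canonical representative."""
    def __init__(self, a, n):
        self.n, self.a = n, a % n
    def __add__(self, other): return Mod(self.a + other.a, self.n)
    def __mul__(self, other): return Mod(self.a * other.a, self.n)
    def __eq__(self, other): return (self.n, self.a) == (other.n, other.a)

# 7 and 2 differ by a multiple of 5, so they name the same residue class in Z_5.
assert Mod(7, 5) == Mod(2, 5)
assert Mod(3, 5) + Mod(4, 5) == Mod(2, 5)   # 3 + 4 = 7 ≡ 2 (mod 5)
assert Mod(3, 5) * Mod(4, 5) == Mod(2, 5)   # 3 · 4 = 12 ≡ 2 (mod 5)
```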
In this section, we study operations on rings.
Just like with groups, we can operate on rings as mathematical objects in their own right.
First, the direct product of two rings R×S is the same as with groups – the elements of R×S are all the pairs (r,s) with r∈R and s∈S, with addition and multiplication defined pointwise. (When we have multiple rings like R and S, we typically distinguish their elements by adding the ring’s name as a subscript. For example, 1_R is the multiplicative identity in R, and 0_S is the additive identity in S.)
Second, we can adjoin a new element to a ring. Basically this means adding the element to the ring and taking the closure under addition and multiplication. For instance, the ring R[e] is the result of adjoining e∈R to ring R. Here are some examples:
- Q[√2] where √2∈R
- Z[i] where i∈C
- R[i]=C
We’ll make heavy use of this when we start talking about polynomial rings, but for now it’s just good to keep in mind that we can do this.
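As a sketch of what adjoining buys us: an element of Q[√2] can be modeled in Python as a pair (a, b) standing for a + b√2, and closure under addition and multiplication keeps everything in that form, since (√2)^2 = 2 (the helper names `add` and `mul` are invented for illustration):

```python
from fractions import Fraction as F

def add(x, y):
    """(a + b√2) + (c + d√2) = (a + c) + (b + d)√2"""
    return (x[0] + y[0], x[1] + y[1])

def mul(x, y):
    """(a + b√2)(c + d√2) = (ac + 2bd) + (ad + bc)√2"""
    return (x[0] * y[0] + 2 * x[1] * y[1], x[0] * y[1] + x[1] * y[0])

x = (F(1, 2), F(3))      # 1/2 + 3√2
y = (F(2), F(-1, 4))     # 2 - (1/4)√2
assert add(x, y) == (F(5, 2), F(11, 4))
assert mul(x, y) == (F(-1, 2), F(47, 8))
```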
In this section, we define the units of a ring.
We mentioned that the integers Z are the prototypical example of a ring. Integer addition and multiplication are commutative, we have an additive identity 0 and a multiplicative identity 1, there are additive inverses, and multiplication distributes over addition. So all the ring axioms hold on the integers.
What about multiplicative inverses? Do they exist for the integers? The multiplicative inverse of an integer z is a value z^{−1} such that z⋅z^{−1}=1. But the only integers that can satisfy this are z=1 and z=−1. So among the integers, only ±1 are invertible.
Generalizing from integers to arbitrary rings R, invertible elements of a ring (with respect to multiplication) are called its units. The units of any ring R form a multiplicative group, denoted R× or sometimes R∗. Note that for a nonzero ring R, R× is never a ring, because rings require the additive identity 0, and 0 is never a unit.
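For a concrete picture, here’s a small Python sketch computing the unit group of Z_12 by brute force (the helper `units` is invented):

```python
from math import gcd

def units(n):
    """The unit group of Z_n: residues with a multiplicative inverse mod n."""
    return {k for k in range(n) if any(k * j % n == 1 for j in range(n))}

U = units(12)
assert U == {1, 5, 7, 11}                          # exactly the k coprime to 12
assert all(gcd(u, 12) == 1 for u in U)
assert all(u * v % 12 in U for u in U for v in U)  # closed under multiplication
```

The last assertion checks the group claim: a product of units is again a unit.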
Theorem: 0 is never a unit of a nonzero ring R.
To be a unit, we’d need 0⋅0^{−1}=1, which is impossible since 0 multiplied by anything is 0, and 0 ≠ 1 in a nonzero ring.
Theorem: u,v are units iff their product uv is a unit.
- If u,v are units then (uv)(v^{−1}u^{−1}) = u(vv^{−1})u^{−1} = u·1·u^{−1} = 1 shows that uv is a unit.
- If uv is a unit then u(v(uv)^{−1}) = (uv)(uv)^{−1} = 1 shows u is a unit, and a symmetric argument shows v is a unit.
A nonzero ring consisting only of zero and units is called a field. All nonzero elements in a field are invertible and we have the identity element 1, so the nonzero elements of a field F actually form a multiplicative group F×.
Theorem: The multiplicative group F× of a field F has order |F|−1.
The multiplicative group consists of the units of F, of which there are |F|−1, since everything but zero is a unit in F.
Fields have many special properties, which we’ll dive into in depth in a different exploration.
In this section, we introduce zero divisors.
Unlike the integers, however, in a ring it is possible for a⋅b=0 for nonzero a,b. We call such a,b zero divisors because they effectively divide zero into two nonzero elements.
Zero divisors are generally undesirable, because their presence implies a lack of cancellability in the ring. In other words, when we have a⋅b=a⋅c, we’d like to claim that b=c as a result. But this relies on the fact that the map x↦a⋅x is injective, which is not the case when a is a zero divisor, because then both a⋅b=0 and a⋅0=0. So you can think of zero divisors as uncancellable elements in a ring.
A ring with no (nonzero) zero divisors is called an integral domain. Essentially, integral domains are exactly the rings that have cancellability, which is desirable. We also enforce the requirement that nonzero elements exist in integral domains, so as a special case, the zero ring 0 is not an integral domain.
Let’s see some examples:
Theorem: Zp is an integral domain if p is prime.
- Since p is prime, when p∣ab for a,b∈Zp, then either p∣a or p∣b.
- In Zp, this translates to: when ab ≡ 0 mod p, then either a ≡ 0 mod p or b ≡ 0 mod p.
- So a,b cannot be both zero divisors (ab ≡ 0 mod p) and nonzero (a,b ≢ 0 mod p).
- Therefore there are no nonzero zero divisors, making Zp an integral domain.
Theorem: The direct product of nonzero rings cannot be an integral domain.
- The result of a direct product of nonzero rings always contains the elements (1,0) and (0,1).
- Since (1,0)(0,1)=(0,0), both are zero divisors.
- Since there are zero divisors, the direct product is not an integral domain.
Let’s see how zero divisors and units interact.
Theorem: No element can be both a zero divisor and a unit.
A zero divisor a satisfies ab=0 for some nonzero b. But if a is a unit, then no such b exists, since left-multiplying ab=0 by a^{−1} gives b=0.
Corollary: Every field is an integral domain.
An integral domain must have no nonzero zero divisors. But every nonzero element of a field is a unit by definition, and therefore not a zero divisor.
Theorem: Every finite integral domain is a field.
- Take any nonzero a∈R. Since products of nonzero elements in an integral domain are nonzero, every power in {a, a^2, a^3, …} is nonzero.
- Since we’re in a finite ring, the sequence a, a^2, a^3, … eventually repeats, so there is some a^n equal to a^m where n>m.
- Since we’re in an integral domain, we can cancel a^m from both sides of a^n = a^m, producing a^{n−m} = 1 since n>m.
- Note that n−m>0, so this can be rewritten as a · a^{n−m−1} = 1, proving that a is a unit.
- Since every nonzero a∈R is a unit, R must be a field.
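The proof is effectively an algorithm for finding inverses; here is a Python sketch of it over the finite integral domain Z_7:

```python
# In the finite integral domain Z_7, the powers of a nonzero a must repeat,
# and cancelling gives a^(n-m) = 1, so an inverse of a appears among its powers.
p = 7
for a in range(1, p):
    powers = [pow(a, k, p) for k in range(1, p + 1)]
    assert 0 not in powers                 # no zero divisors: powers stay nonzero
    e = powers.index(1) + 1                # smallest e with a^e = 1
    assert a * pow(a, e - 1, p) % p == 1   # a · a^(e-1) = 1, so a is a unit
```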
In this section, we introduce idempotents.
An idempotent in a ring R is an element e∈R such that e^2 = e, and therefore e^k = e for all k≥1. For every ring, 0 and 1 are trivially idempotent since 0^2 = 0 and 1^2 = 1.
Theorem: Every nontrivial idempotent is a zero divisor.
For idempotents e that aren’t 0 or 1: since e^2 = e, we have e^2 − e = 0 and therefore e(e−1) = 0 by distributivity. Since e ≠ 1, the factor e−1 is nonzero, and since e ≠ 0, both e and e−1 must be zero divisors.
Therefore the nontrivial idempotents are a special subset of zero divisors. This means that if a nontrivial idempotent exists in a ring, the ring is not an integral domain.
Studying the idempotents themselves gives rise to some interesting structures:
Theorem: The idempotents of a ring form a partially ordered set.
A partially ordered set (poset) is a set together with a relation a≤b that satisfies
- reflexivity ∀a.a≤a,
- antisymmetry ∀a,b.a≤b∧b≤a⟹a=b, and
- transitivity ∀a,b,c.a≤b∧b≤c⟹a≤c.
On the idempotents, define e≤f iff ef=e. The idea is that e≤f when multiplying by f leaves e unchanged, the way multiplying by 1 does. This satisfies the requirements:
- Reflexivity: ee = e^2 = e, therefore e≤e.
- Antisymmetry: Assuming ef=e and fe=f, we get e = ef = fe = f.
- Transitivity: Assuming ef=e and fg=f, we get eg = (ef)g = e(fg) = ef = e.
Thus the idempotents form a poset.
Theorem: The idempotents of a ring form a boolean algebra.
A boolean algebra is a poset together with the operators ¬,∨,∧ corresponding to negation, disjunction, and conjunction respectively, as well as two distinguished elements 0 and 1, all satisfying (among the usual lattice laws) the following:
- ¬x=0 if x=1, and ¬x=1 if x=0
- x∨y=0 if x=y=0, and x∨y=1 if x=1 or y=1
- x∧y=1 if x=y=1, and x∧y=0 if x=0 or y=0
- ∨ and ∧ are commutative
- ∨ and ∧ are associative
Define:
- negation ¬e as 1−e.
- disjunction e∨f as e+f−ef.
- conjunction e∧f as ef.
- 0 as the additive identity 0.
- 1 as the multiplicative identity 1.
Then:
- ¬e=1−e, which is 0 if e=1 and 1 if e=0.
- e∨f=e+f−ef, which is 0 if e=f=0 and 1 if e or f is 1.
- e∧f=ef, which is 1 if e=f=1 and 0 if e or f is 0.
- Commutativity can be shown by observing that both e+f−ef and ef are unchanged when you swap e and f.
- Associativity of ∧ comes from associativity of the product in a ring. Associativity of ∨ is harder but routine:
  e∨(f∨g) = e∨(f+g−fg)
  = e + (f+g−fg) − e(f+g−fg)
  = e + f + g − fg − ef − eg + efg
  = (e+f−ef) + g − (e+f−ef)g
  = (e+f−ef)∨g
  = (e∨f)∨g
Corollary: Every finite ring contains 2^k idempotents for some k.
The proof of this isn’t really a ring theory proof, so I left it out. But it is a property of boolean algebras that every finite boolean algebra is isomorphic to the power set of a k-element set, therefore every finite ring contains 2^k idempotents.
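A quick Python check of both results on Z_30, which has three prime factors and hence 2^3 idempotents (the helper `idempotents` is invented):

```python
def idempotents(n):
    """All solutions of e^2 = e in Z_n."""
    return {e for e in range(n) if e * e % n == e}

E = idempotents(30)                 # 30 = 2 · 3 · 5 has three prime factors
assert E == {0, 1, 6, 10, 15, 16, 21, 25}
assert len(E) == 2 ** 3

# The boolean-algebra operations send idempotents to idempotents.
assert all((1 - e) % 30 in E for e in E)                      # negation 1 − e
assert all((e + f - e * f) % 30 in E for e in E for f in E)   # e ∨ f
assert all(e * f % 30 in E for e in E for f in E)             # e ∧ f
```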
If every element in a ring is idempotent, then the whole ring is a boolean algebra, so we call it a boolean ring.
In this section, we introduce nilpotents.
Another special case of zero divisors are those elements that, when raised to a suitable power, become zero. These are elements a that satisfy a^k = 0 for some k, and they are called nilpotents. The zero element 0 is always a nilpotent in every ring.
Theorem: A nonzero element e cannot be both idempotent and nilpotent.
An idempotent element e is still itself when raised to any positive power: e^k = e. That means that, unless e=0, powers of e never become zero, therefore e cannot be nilpotent.
Theorem: If r,s are nilpotent elements of a ring R, then r+s and r−s are also nilpotent.
Say r^n = 0 and s^m = 0. Then using the binomial theorem, (r+s)^{n+m} = ∑_{k=0}^{n+m} C(n+m,k) r^k s^{n+m−k}. Ignoring the coefficient, notice that each term r^k s^{n+m−k} vanishes: if k≥n then r^k = 0, and if k≤n then n+m−k ≥ m and thus s^{n+m−k} = 0. Therefore r+s is nilpotent. The same argument works for r−s.
Like all zero divisors, nilpotents cannot be units. However, the existence of nilpotents is special since it always implies the existence of units:
Theorem: For every nilpotent a in a ring R, the element 1−a is a unit of R.
If a^k = 0 for some k, let S = ∑_{i=0}^{k−1} a^i, the sum of all the powers of a up to a^{k−1}, which is an element of R. Observe that because a^k = 0, aS = ∑_{i=1}^{k} a^i = ∑_{i=1}^{k−1} a^i, which is every term of S except a^0 = 1. Then their difference S − aS must equal 1, and by factoring out S, we have S(1−a) = 1, meaning 1−a is a unit of R.
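A numeric sanity check of the telescoping argument in Z_81, where a = 3 is nilpotent because 3^4 = 81 ≡ 0:

```python
# In Z_81, a = 3 is nilpotent (3^4 = 81 ≡ 0), so 1 - a should be a unit with
# inverse S = 1 + a + a^2 + a^3, by the telescoping argument above.
n, a, k = 81, 3, 4
assert pow(a, k, n) == 0
S = sum(pow(a, i, n) for i in range(k)) % n
assert S == 40                   # 1 + 3 + 9 + 27
assert S * (1 - a) % n == 1      # S(1 - a) = 1, so 1 - a is a unit of Z_81
```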
Corollary: If a is nilpotent in a ring R and u is a unit, then u+ka is a unit of R for all k∈Z.
For u−a, use the same argument as above with the series S = ∑_{i=0}^{m−1} (au^{−1})^i, where a^m = 0 (so (au^{−1})^m = 0 as well). Then (u−a)(u^{−1}S) = S − (au^{−1})S = 1. This implies (u−a)−a is a unit as well, and so on, thus u+ka is a unit for all k≤0.
Similarly, for u+a, use the series S = ∑_{i=0}^{m−1} (−au^{−1})^i. Then (u+a)(u^{−1}S) = S − (−au^{−1})S = 1. This implies (u+a)+a is a unit as well, and so on, thus u+ka is a unit for all k≥0.
Corollary: u+b is a unit for every unit u and nilpotent b.
In this section, we introduce the characteristic of a ring.
The above proof that ∀k∈Z.u+ka is a unit seems to imply that every nilpotent can produce an infinite number of units. However, this isn’t always true — some rings are finite. Can you think of one?
You might have come up with the ring of integers mod n, denoted Zn. Specifically, let’s try Z8, where 2 is a nilpotent element because 2^3 = 0 in Z8. Using our formula u+ka with u=1 and a=2, this implies that 1, 3, 5, 7 are units of the ring. But because 1+2+2+2+2 = 9 = 1 in Z8, the ring “loops back” on itself at some point, so we don’t get any additional units.
This special ring property where addition “loops back” is called the characteristic. The characteristic of a ring R is the number of times you have to add 1 to itself before you get 0. This makes sense in the ring of integers mod n, because in that ring, adding 1 to itself n times gives 0. So Zn has characteristic n, and we write char Zn=n. For rings like the integers Z, however, no amount of adding 1 to itself will give 0, so for those rings the characteristic is defined to be 0. Thus char Z=0. The idea is that the only way to get 0 is to add 1 zero times to itself.
Since most rings we work with have characteristic 0, we will only mention characteristic when it matters. For instance, it turns out that characteristic 2 rings are particularly strange. Here’s an example of why.
Theorem: In a characteristic 2 ring R, addition is the same as subtraction.
Since a+a = 2a = 0 for every a∈R, every element in R is its own additive inverse: a = −a. Thus a+b = a−b = −a+b = −a−b.
While rings can be of any nonnegative characteristic in general, this is not true of integral domains.
Theorem: The characteristic of an integral domain is either 0 or a prime number p.
Suppose the characteristic were a composite number ab with a,b > 1. Then (a·1)(b·1) = (ab)·1 = 0, where a·1 and b·1 are nonzero (since ab is the least number of 1s summing to 0), making them zero divisors. But integral domains have no nonzero zero divisors, so the characteristic cannot be composite: it is either prime or zero.
Corollary: The characteristic of a field is either 0 or a prime number p.
Important note: Recall that when we adjoin an element from one ring to another, we take the additive and multiplicative closure of the result. For this closure to make sense, the two rings must have the same characteristic – so you can only adjoin elements of one ring to another when their characteristics match.
In this section, we talk about subrings.
A subring is a subset of elements of a ring R that satisfies the ring axioms. Additionally, it must include 1 as the multiplicative identity. This is enough to show it includes 0 as the additive identity as well, since 1−1=0.
This last requirement is curious, but it’s certainly possible for R to have subsets that are rings with a different multiplicative identity. They’re just not considered subrings.
Theorem: Every ring R of characteristic 0 includes a subring isomorphic to the integers Z.
We can define a correspondence between the integers and a subring of R. Assign every integer n∈Z to n·1, which is 1 added to itself n times. Since char R = 0, each n·1 is a distinct element of R. Then the subset of these n·1 elements is a ring, because addition and multiplication on the n parts behave exactly as in Z. Thus R includes Z as a subring.
Theorem: The intersection of two subrings is a subring of both.
- The intersection is an additive group, since both subrings are additive groups, and the intersection of additive groups is also an additive group.
- The intersection has 1 from the original ring, since that’s present in both subrings.
- The intersection is closed under product, since both subrings are closed under product.
- The intersection inherits the multiplicative identity and distributive laws from the original ring.
- Since the intersection is a subset of both given subrings, is an additive group, contains 1, is closed under product, and satisfies identity and distributive laws, it is a subring of both.
Theorem: Every subring of an integral domain is an integral domain.
Since integral domains have no nonzero zero divisors, none of their subrings can have nonzero zero divisors either, so every subring (which is nonzero, containing 1 ≠ 0) is also an integral domain.
Appendix A
Our earlier definition actually breaks down into four ring axioms:
- (R,+) forms an (additive) group with zero element 0.
- (R,⋅) forms a (multiplicative) monoid with unity (identity) 1.
- ⋅ distributes over +, and (together with the existence of 1) this actually implies + is commutative, so (R,+) must be an abelian group: expanding (1+1)(a+b) both ways gives a+b+a+b = a+a+b+b, hence b+a = a+b.
- ⋅ is also commutative.
While we will assume all of these axioms hold for a ring, one might define more general rings by relaxing these axioms. For completeness, here are the names for some ring variants:
- A rig or a semiring is a ring where (R,+) is only required to be a monoid, dropping the requirement for additive inverses (negatives).
- A rng is a ring where (R,⋅) is a semigroup rather than a monoid, dropping the requirement for 1 (the multiplicative identity). A ring with a multiplicative identity is sometimes explicitly called a unital ring.
- A near-ring doesn’t require + or ⋅ to be commutative. When ⋅ is not commutative, then left near-rings only require left distributivity (so only products on the left distribute), and similarly for right near-rings.
- Noncommutative rings are those where ⋅ is not commutative. Otherwise it’s a commutative ring. For our purposes, we assume all rings are commutative unless otherwise specified.
So what we call rings are technically what some people call unital commutative rings. We won’t mention these other names very much since having to deal with non-unital or non-commutative rings introduces a lot of complexity, and I’d rather get into that complexity much later.
-
November 28, 2023.
Exploration 1: Rings and ideals
Questions:
- How do you take quotients of rings?
- What is an ideal?
- How do you work with ideals?
- What do the elements in an ideal imply about the ideal?
Recall from group theory that a normal subgroup can be used to define a quotient group, by sending every element of the normal subgroup to the group identity.
If we do the same thing with rings, we run into a slight problem: taking R/H, where H is a subgroup of R’s additive group, can fail certain ring axioms. Let’s see why.
Given a ring R, let H be a subgroup of the additive group of R. Since that group is abelian, H is automatically normal, so R/H (sending elements of H to 0) is a quotient group. To be a quotient ring, however, we need to define multiplication in R/H.
Recall that the elements of R/H are equivalence classes [a] under the relation a∼b iff b−a∈H. Being a quotient group means we already have addition defined as [a]+[b]=[a+b]. To see what multiplication [x][a] must be, we’ll have to rely on the ring axioms:
- Multiplicative identity: if 1a=a=a1 in R, then [1][a]=[a]=[a][1] in R/H.
- Multiplication is associative: if (ab)c=a(bc) in R, then ([a][b])[c]=[a]([b][c]) in R/H.
- Multiplication is distributive: if a(b+c)=ab+ac in R, then [a]([b]+[c])=[a][b]+[a][c] in R/H.
So we get three constraints. The first constraint implies that the definition [a][b]=[ab] could work, and in fact this definition satisfies all the constraints:
- Multiplicative identity: [1][a]=[1a]=[a]=[a1]=[a][1]
- Multiplication is associative: ([a][b])[c]=[(ab)c]=[a(bc)]=[a]([b][c])
- Multiplication is distributive: [a]([b]+[c])=[a(b+c)]=[ab+ac]=[a][b]+[a][c]
So the definition [a][b]=[ab] seemingly allows R/H to satisfy the ring axioms. The only step left is to show well-definedness of [a][b]=[ab].
Therefore: In order for R/H to be a ring, H must be a subgroup of R’s additive group that additionally satisfies xH ⊆ H for all x∈R.
Recall that [x]+[a]=[x+a] only works because we showed x+[a]=[x+a], making it well defined. In the same vein, for [x][a]=[xa], we need to show that a∼b implies xa∼xb. Let’s see how this works out:
a∼b
⟹ b−a ∈ H
⟹ x(b−a) ∈ H
⟹ xb−xa ∈ H
⟹ xb ∈ [xa]
⟹ xa∼xb
Note that the step b−a ∈ H ⟹ x(b−a) ∈ H requires xH ⊆ H. This imposes xH ⊆ H as a requirement for multiplication in R/H to be well-defined.
These “additive subgroups that absorb multiplication” are known as ideals. Just as you need a normal subgroup to quotient a group, you need an ideal to quotient a ring. The elements of a quotient ring are still known as cosets, just like for quotient groups in group theory.
Theorem: I is an ideal of R iff I is nonempty and, for any a,b∈I and r∈R, we have a−b∈I and ra∈I.
- We just showed that an ideal I is an additive subgroup of R that absorbs multiplication by elements r∈R.
- a−b∈I implies closure under additive inverse (let a=0) and addition (let b=−b), thus I is an additive subgroup of R.
- ra∈I implies that I absorbs multiplication by elements r∈R.
In this section, we classify a ring by its ideals.
In general, to find all the ideals of a ring, you need to find all the additive subgroups of the ring’s additive group. Then the ideals are those additive subgroups that also absorb product. Let’s see some examples.
0 and R are always ideals of every ring R, where 0 refers to the zero ideal {0} containing only the zero element.
Therefore: Proper ideals do not contain 1.
What happens if an ideal contains 1? Since every product with 1 must be in the ideal, every element of the ring is in the ideal, thus the ideal must be equal to the ring itself.
Therefore: Proper ideals do not contain units.
What happens if an ideal contains a unit a? Then a^{−1}a = 1 is also in the ideal, which means the ideal must contain 1. But as we just observed, proper ideals don’t contain 1.
Now we are ready to explore specific kinds of ideals and how they interact with rings.
First, there is the ideal nR generated by an integer n, containing every element in R added to itself n times. These ideals are generally only interesting in finite rings or in the ring of integers.
Second, every element a∈R can generate a principal ideal (a) containing every possible product with a. 0 and R are both principal ideals, generated by 0 and 1 respectively, and so they are sometimes written (0) and (1).
Sometimes aR, the product of every element in R with a, is used to denote the principal ideal (a) because it’s the same thing. For example:
Theorem: Zp, the integers mod p, is isomorphic to Z/pZ, the integers quotiented by the principal ideal (p).
The integers mod p essentially set p=0, which is the same as what happens when you send the principal ideal (p) to 0.
Principal ideals (a) have an important property that if the generator a factors into non-units bc, each factor b or c generates a strictly larger principal ideal.
Theorem: In an integral domain, given a nonzero principal ideal (a) and a non-unit b, the ideal (ab) is strictly contained in (a).
Since a generates ab, it is clear that (ab)⊆(a). The case that (ab)=(a) is not possible – it would imply that ab generates a, i.e. there is some c such that abc = a, and cancelling a (valid in an integral domain for a ≠ 0) gives bc = 1, making b a unit. Therefore, if b is a non-unit, (ab)⊊(a).
Third, let’s look at prime ideals. They are similar to prime numbers: if p divides a product ab, then either p divides a or p divides b. By replacing “p divides” with “P contains”, you get a prime ideal P, an ideal with the property that if ab is in a prime ideal, either a or b is also in the prime ideal.
Theorem: Quotienting by a prime ideal results in an integral domain.
- Let [a],[b] be two nonzero cosets in the quotient. Let P be a prime ideal, so that ab∈P implies a∈P or b∈P.
- Translating this into coset terms: [ab]=[0] implies [a]=[0] or [b]=[0]. Since [a] and [b] are defined to be nonzero, the conclusion fails, so [ab] must not be [0].
- But that means [a][b]=[0] has no solutions with [a],[b] nonzero, i.e. there are no zero divisors.
- No zero divisors is the definition of an integral domain.
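Concretely, modeling the quotients Z/(n) as Zn in Python (the helper `zero_divisors` is invented):

```python
def zero_divisors(n):
    """Nonzero zero divisors of Z_n, our model of the quotient Z/(n)."""
    return {a for a in range(1, n) if any(a * b % n == 0 for b in range(1, n))}

# (7) is a prime ideal of Z, so Z/(7) is an integral domain: no zero divisors.
assert zero_divisors(7) == set()
# (6) is not prime (6 = 2 · 3), and indeed Z/(6) has zero divisors.
assert zero_divisors(6) == {2, 3, 4}
```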
Finally, we have one last type of ideal: a maximal ideal, one that isn’t contained in a larger (proper) ideal. It is possible that there are no maximal ideals (in certain infinite rings), or more than one maximal ideal.
Theorem: Quotienting by a maximal ideal results in a field.
- Let M be a maximal ideal of a ring R.
- We must show that every nonzero coset [a] in the quotient R/M is a unit, i.e. there is always some coset [b] such that [b][a]=[1].
- Note that since [a]≠[0], we have a∉M. Therefore the ideal ⟨a,M⟩, generated by M together with the element a∉M, is a strictly larger ideal.
- But since M is maximal, no proper ideal strictly contains M. So ⟨a,M⟩ must not be proper – it must be the whole ring R, and therefore 1∈⟨a,M⟩.
- This means that we somehow obtained 1 by adding some multiple of a to an element m∈M. That is, ka+m=1 for some k∈R.
- In coset terms, we have [k][a]+[m]=[1]. Since m∈M, [m]=[0], so this simplifies to [k][a]=[1].
- This implies that every element [a] of the quotient is a unit, and therefore the quotient is a field.
A final note: a simple ring is one where the only ideals are 0 and R. Since we’re only talking about commutative rings, we actually find that simple rings and fields coincide.
Theorem: The simple rings are exactly the fields.
- Since an ideal containing a unit must be R itself, and fields only contain zero and units, fields only have the ideals {0} and R.
- Conversely, if a ring’s only ideals are {0} and R, then any nonzero principal ideal (a) equals R, which contains 1. That means some multiple of a equals 1, and ka=1 makes every nonzero element a a unit. Therefore the ring contains only zero and units and is therefore a field.
Corollary: All maximal ideals are prime.
If ab∈M for a maximal ideal M, then [a][b]=[0] in the field R/M; fields have no zero divisors, so [a]=[0] or [b]=[0], i.e. a∈M or b∈M.
In this section, we learn how to manipulate ideals.
First, let’s talk about the sum of two ideals. It is an ideal.
Theorem: The sum of two ideals A+B, defined as the set {a+b ∣ a∈A, b∈B}, is an ideal.
- To prove it is an ideal, we need to prove it is a normal subgroup of the additive group of the ring, and that it absorbs product.
- The group sum A+B of two normal subgroups A,B is a normal subgroup.
- Since r(A+B) = rA+rB ⊆ A+B, we know A+B also absorbs product.
- Therefore A+B is an ideal.
What about the union and intersection of two ideals?
Theorem: The intersection of two ideals A∩B is an ideal.
- To prove it is an ideal, we need to prove it is a normal subgroup of the additive group of the ring, and that it absorbs product.
- The intersection A∩B of two normal subgroups A,B is a normal subgroup.
- Since rA ⊆ A and rB ⊆ B, any x∈A∩B has rx in both A and B, so r(A∩B) ⊆ A∩B, meaning A∩B absorbs product as well.
- Therefore A∩B is an ideal.
Theorem: The union of two ideals A∪B is not always an ideal.
Consider the counterexample (2)∪(3) in Z, which contains all multiples of 2 and all multiples of 3. But this subset of Z is not closed under addition (e.g. 2+3=5 is in neither ideal) and is therefore not an ideal.
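A small Python check of the counterexample, restricted to a finite window of Z:

```python
# The union (2) ∪ (3), viewed inside a finite window of Z, is not closed under
# addition: 5 = 2 + 3 escapes it. The sum of ideals (2) + (3) does catch 5.
N = 100
union = {z for z in range(-N, N) if z % 2 == 0 or z % 3 == 0}
assert 2 in union and 3 in union
assert 2 + 3 not in union        # 5 is a multiple of neither 2 nor 3
ideal_sum = {2 * a + 3 * b for a in range(-N, N) for b in range(-N, N)}
assert 5 in ideal_sum            # 5 = 2 · 1 + 3 · 1 lies in (2) + (3)
```

In fact (2) + (3) = (1) = Z, since 5 in the ideal forces 5 − 2 − 2 = 1 in as well.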
In this section, we explore what kinds of elements an ideal can contain.
We’ve already established that a proper ideal cannot contain units. What are some other limitations on what an ideal can contain?
First, consider: when does an ideal contain a zero divisor?
Therefore: Nontrivial ideals always contain a zero divisor, if one exists.
- Let’s say that the ring contains a zero divisor r, and let A be a nontrivial ideal.
- Since product with a zero divisor results in either zero or a zero divisor, rA contains nothing but zero and zero divisors. Then since ideals absorb product, rA⊆A.
- If rA contains a zero divisor, then that zero divisor is in A and we are done. Otherwise rA = {0}, but that means ra = 0 for all a∈A, so every nonzero a∈A is itself a zero divisor.
- Either way, as long as A is nontrivial (not the zero ring), then it contains a zero divisor.
Next, when does an ideal contain an idempotent?
Therefore: Prime ideals contain an idempotent element, if one exists.
- Let’s say that the ring contains an idempotent a and a prime ideal P.
- a^2 = a implies 0 = a(1−a). For a prime ideal, this is interesting: since all ideals contain 0, the prime ideal P must contain one of the factors of 0, i.e. either a or 1−a.
- Both are idempotent: a is by definition, and (1−a)^2 = a^2−2a+1 = a−2a+1 = 1−a.
Finally, when does an ideal contain a nilpotent?
Theorem: Every prime ideal P contains every nilpotent element.
- Like every ideal, P contains 0. Since 0 is nilpotent, every ideal contains at least one nilpotent element. So let r be an arbitrary nilpotent element, with r^n = 0 for some n.
- This means 0 can be factored into r · r^{n−1}. By the property of prime ideals, either r∈P (so that we are done), or r^{n−1}∈P, in which case we recursively apply this logic to r^{n−1}. Eventually we get to r·r, which proves that r∈P.
- Since r is an arbitrary nilpotent element, this shows that every nilpotent element is in P.
There is a deeper result here, though it requires Zorn’s lemma to prove.
Theorem: The intersection of all prime ideals in R is exactly the set N of all nilpotent elements of R.
- Since every prime ideal contains N, so does the intersection of all prime ideals. To prove that N contains the intersection of all prime ideals, we must prove every element r of the intersection is nilpotent. Assume towards contradiction that r is not nilpotent.
- Then S = {s∈R ∣ r^n s = 0 for some n≥1} is a proper ideal: it is nonempty (0∈S), it is closed under subtraction and absorbs product, and it excludes 1 (since 1∈S would mean r^n = 0, contradicting our assumption).
- Note that S cannot contain r or any of its powers r^i, because otherwise r^n r^i = 0 would imply r is nilpotent, violating our assumption.
- Zorn’s lemma states that every poset in which every nonempty chain has an upper bound contains a maximal element.
- Consider the poset of proper ideals that contain S and avoid every power of r, ordered by inclusion. It is nonempty (it contains S), a chain in it looks like I1 ⊆ I2 ⊆ I3 ⊆ …, and the union of a chain is again such an ideal, giving an upper bound for the chain.
- So by Zorn’s lemma, this poset has a maximal element M.
- M is prime: if ab∈M with a∉M and b∉M, then the strictly larger ideals M+(a) and M+(b) must each contain a power of r by maximality, say r^i ∈ M+(a) and r^j ∈ M+(b). But then r^{i+j} ∈ M+(ab) = M, contradicting that M avoids the powers of r.
- So M is a prime ideal that doesn’t contain r. But this contradicts the fact that r is in the intersection of all prime ideals. Therefore r must be nilpotent in order to be in the intersection of prime ideals.
- This means the intersection of prime ideals is exactly all the nilpotent elements of R.
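We can spot-check this in a finite ring: in Z_72 the prime ideals are (2) and (3), and both the nilpotents and the intersection of the prime ideals come out to the multiples of 6. A Python sketch (the helper `nilpotents` is invented):

```python
def nilpotents(n):
    """Elements of Z_n with some power equal to zero."""
    return {a for a in range(n) if any(pow(a, k, n) == 0 for k in range(1, n + 1))}

n = 72                                      # 72 = 2^3 · 3^2
primes = [2, 3]                             # the prime ideals of Z_72 are (2) and (3)
intersection = {a for a in range(n) if all(a % p == 0 for p in primes)}
assert nilpotents(n) == intersection        # both are the multiples of 6
assert nilpotents(n) == set(range(0, n, 6))
```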
-
November 30, 2023.
Exploration 2: Ring homomorphisms
Questions:
- TODO
Just like with groups, there are ring homomorphisms: maps between rings that preserve the ring properties. While group homomorphisms need only preserve the identity and the group product, ring homomorphisms must preserve addition, multiplication, and the multiplicative identity. (Since 1−1=0, preserving the multiplicative identity 1 also preserves the additive identity 0.) Therefore all ring homomorphisms are also group homomorphisms of the rings’ additive abelian groups.
Theorem: Ring homomorphisms θ preserve rational expressions (expressions involving addition, subtraction, multiplication, integer scalar multiplication, powers, and division by units).- By induction on the rational expression.
- θ(a+b)=θ(a)+θ(b) for all a,b∈R, since rings are closed under addition by definition, and homomorphisms preserve addition.
- θ(a−b)=θ(a)−θ(b), since homomorphisms preserve addition and therefore additive inverses: θ(b)+θ(−b)=θ(0)=0 means θ(−b)=−θ(b).
- θ(ab)=θ(a)θ(b), since rings are closed under multiplication by definition, and homomorphisms preserve multiplication. This also implies that units are preserved, because ab=1 implies θ(a)θ(b)=θ(1)=1.
- θ(ka)=kθ(a) for all a∈R and k∈Z, since this is just repeated addition, and homomorphisms preserve addition.
- θ(aᵏ)=θ(a)ᵏ for all a∈R and nonnegative k∈Z, since this is just repeated multiplication, and homomorphisms preserve multiplication.
- θ(a/b)=θ(a)/θ(b) for all a∈R and units b∈R×, since division by a unit is just multiplication by its inverse, and homomorphisms preserve multiplication and units (hence their inverses).
Theorem: Ring homomorphisms preserve units, idempotents, and nilpotents.- Since a ring homomorphism must preserve equations composed of only rational expressions, it preserves the equation ab=1. But that’s just the definition of being a unit.
- The same goes for r2=r (idempotents) and rn=0 (nilpotents).
Theorem: Ring homomorphisms θ:R→S send n⋅1R=0 to n⋅1S=0, so the characteristic of S divides the characteristic of R.- This is a direct consequence of preserving addition and the multiplicative identity 1: if n⋅1R=0 in R, then n⋅1S=θ(n⋅1R)=θ(0)=0 in S, and n⋅1S=0 exactly when char S divides n.
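As a sanity check, here is a minimal Python sketch (my own example, not from the notes) of the reduction map θ: Z/6 → Z/3, verifying that it preserves addition, multiplication, unity, and idempotents:

```python
# A minimal sketch (my own example, not from the notes): the reduction map
# theta : Z/6 -> Z/3 is a ring homomorphism, and structure is preserved.
R = range(6)                 # elements of Z/6
theta = lambda r: r % 3      # well-defined since 3 divides 6

# preserves addition, multiplication, and unity
assert all(theta((a + b) % 6) == (theta(a) + theta(b)) % 3 for a in R for b in R)
assert all(theta((a * b) % 6) == (theta(a) * theta(b)) % 3 for a in R for b in R)
assert theta(1) == 1

# idempotents map to idempotents
idempotents_R = [e for e in R if (e * e) % 6 == e]
print(idempotents_R)  # [0, 1, 3, 4]
assert all((theta(e) * theta(e)) % 3 == theta(e) for e in idempotents_R)

# note: char(Z/6) = 6 while char(Z/3) = 3 -- the characteristic of the
# image divides that of the source.
```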
The kernel ker θ of a ring homomorphism θ:R→S is the subset of elements in R that get mapped to 0S by θ.
Theorem: The kernel of a ring homomorphism θ:R→S is an ideal of R.Kernel is always an additive subgroup of R, like from group theory. To show it absorbs product and is therefore an ideal, note that θ(ker θ)={0} by definition, and 0 absorbs all products.
The image im θ of a ring homomorphism θ:R→S is the subset of elements in S that are mapped to by θ.
Theorem: The image of a ring homomorphism θ:R→S is a subring of S.Since θ is a ring homomorphism, this is implied. im θ is an additive subgroup closed under multiplication, and θ preserves unity.
Just like with group homomorphisms, if a ring homomorphism has trivial kernel, it is injective — θ(r)=θ(s) implies r=s.
Theorem: If a ring homomorphism θ:R→S has trivial kernel, then it is injective.- (→) If the kernel is trivial (only 0 maps to 0), then θ(r)=θ(s) i.e. θ(r−s)=0 implies r−s must be 0, which means r=s.
- (←) If θ(r)=θ(s) implies r=s, then in particular θ(r)=θ(0)=0 implies r=0, i.e. only 0 can map to 0, which means the kernel is trivial.
If R is a field F, then ring homomorphisms become field homomorphisms. Field homomorphisms are just ring homomorphisms in the sense that they need only respect the ring axioms. However, the nature of fields gives all field homomorphisms certain properties:
Theorem: Every field homomorphism is injective.- Since the kernel of a ring homomorphism θ:F→K is an ideal of F, and the only ideals in a field F are 0 and F itself, the kernel is either trivial (0) or F itself.
- But since ring homomorphisms must preserve the multiplicative identity 1, the kernel cannot include 1 (since it doesn’t get mapped to zero).
- Therefore the kernel is trivial, so θ is injective.
In this section, we learn some ways to construct ring homomorphisms.
We learned earlier that we can quotient a ring R by an ideal I to get a quotient ring R/I where all elements of I are sent to 0. Since the result is a ring, the coset map π:R→R/I is always a ring homomorphism.
What happens if we send an element r∈R to another element s?
Therefore: Renaming r∈R to s∈R, i.e. making the two elements equal, is achieved by a ring homomorphism.- Sending an element r∈R to s∈R is the same as making the two elements equal in the image.
- That is, we’re enforcing the equation r=s.
- One way to do this is to note that the equation is equivalent to r−s=0. Then sending the element r−s to 0 is the same as making r=s.
- But that’s the same as quotienting by the principal ideal (r−s), and we know that quotient is always a ring homomorphism.
In this section, we show how to use the First Isomorphism Theorem to quickly prove facts about rings by constructing a homomorphism.
First Isomorphism Theorem: Given a ring homomorphism θ:R→S, R/ker θ≅im θ.- Just like we did in proving the First Isomorphism Theorem for groups, we prove that the map θˉ=[r]↦θ(r):R/ker θ→im θ is a ring isomorphism.
- θˉ is well defined: ker θ is an ideal of R (proof) so R/ker θ is a well-defined factor ring. Then from the universal property of the group quotient, we know that θˉ is a well-defined group homomorphism.
- θˉ preserves unity and product: they are basically inherited from θ.
- θˉ([1])=θ(1)=1
- θˉ([a][b])=θˉ([ab])=θ(ab)=θ(a)θ(b)
- Therefore, θˉ is also a ring homomorphism.
- θˉ is bijective: θˉ is onto, since its outputs θ(r) for all r∈R cover all of im θ. And θˉ is one-to-one: θˉ([a])=θˉ([b]) means θ(a)=θ(b), i.e. θ(a−b)=0, so a−b∈ker θ and therefore [a]=[b].
- Therefore, θˉ is an isomorphism.
In general, we define a surjective homomorphism θ:R→S. Then the first isomorphism theorem gives you R/(ker θ)≅S. Here’s an example of how it can be used:
Theorem: For n∈Z, the quotient Z/(n) is isomorphic to Zn, the integers mod n.- Define the map θ:Z→Zn, which is just the residue map from the integers to their corresponding equivalence class in Zn. This map is surjective, since every class in Zn is represented by some integer in Z.
- The kernel of θ is exactly every multiple of n, i.e. the principal ideal generated by n.
- By the First Isomorphism Theorem, we have Z/ker θ≅im θ. With ker θ=(n) and im θ=Zn (since θ is surjective), we get Z/(n)≅Zn.
Here’s another isomorphism theorem:
Theorem: (R×S)/(A×B)≅R/A×S/B.- The map θ:(r,s)↦([r]A,[s]B) has kernel A×B and image R/A×S/B.
- By the First Isomorphism Theorem, (R×S)/ker θ≅im θ, so we get (R×S)/(A×B)≅R/A×S/B immediately.
Recall that the sum of two ideals is an ideal.
Chinese Remainder Theorem: Given A,B ideals of R, A+B=R⟹R/(A∩B)≅R/A×R/BLet ψ=r↦([r]A,[r]B), which you can check is a surjective homomorphism. Then the result is provided by the First Isomorphism Theorem, since ker ψ=A∩B, and by surjectivity, im ψ=R/A×R/B.
Corollary: When A∩B={0}, we get R/{0}≅R≅R/A×R/B.
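A concrete instance of the CRT can be sketched in Python with R = Z, A = (4), B = (9) (my own example, not from the notes). Since gcd(4, 9) = 1 we have A + B = Z, so the theorem predicts Z/(36) ≅ Z/4 × Z/9:

```python
# CRT sketch (assumed example): psi(r) = ([r] mod 4, [r] mod 9) should be
# onto Z/4 x Z/9 with kernel (4) ∩ (9) = (36).
pairs = {(r % 4, r % 9) for r in range(36)}
assert len(pairs) == 36  # psi hits all 4 * 9 = 36 elements of Z/4 x Z/9

# kernel: r maps to (0, 0) exactly when 36 = lcm(4, 9) divides r
kernel = [r for r in range(72) if r % 4 == 0 and r % 9 == 0]
print(kernel)  # [0, 36]
```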
-
December 1, 2023.
Exploration 3: Polynomials
Questions:
- How do we factor a polynomial with coefficients in a ring?
- How do we study the rest of ring theory from the lens of polynomial rings?
- What does the derivative of a polynomial mean in the context of ring theory?
Recall that to adjoin an element a to a ring R is to create a ring R[a] generated by the two: something like ⟨a,R⟩.
When you adjoin a symbol x that commutes with the entire ring R, the result is a polynomial ring. It turns out polynomial rings are fundamental in the sense that much of ring theory is built upon generalizing the properties of polynomials, such as irreducibility.
The elements f of a polynomial ring R[x] are called polynomials with coefficients in R, and look like this: f = aₙxⁿ + ⋯ + a₂x² + a₁x + a₀
where aᵢ∈R are known as the coefficients of the polynomial f, x is known as an indeterminate over R, and the exponent n of the leading term aₙxⁿ (with aₙ≠0) is known as the degree of the polynomial, denoted deg f. The degree of a nonzero constant polynomial c∈R is zero, and the degree of the zero polynomial 0 is undefined.
Let’s show some basic facts that hold when we form a polynomial ring.
Theorem: R is an integral domain iff R[x] is an integral domain.- WTS the product of two nonzero polynomials f,g in R[x] is always nonzero.
- Let an and bm be the leading coefficients of f,g respectively. Since f,g are nonzero, an and bm are nonzero.
- Then the leading coefficient of fg is anbm. Since R is an integral domain, this product is nonzero.
- Then since the leading coefficient of fg is nonzero, fg is nonzero, and we are done.
In this section, we define the operations possible on polynomials.
Polynomial addition/subtraction is straightforward – it is pairwise addition/subtraction of corresponding coefficients.
Polynomial multiplication is more complex. In general, it looks like (∑ᵢ₌₀ⁿ rᵢxⁱ)(∑ⱼ₌₀ᵐ sⱼxʲ) = ∑ₖ₌₀ⁿ⁺ᵐ (∑ᵢ₊ⱼ₌ₖ rᵢsⱼ) xᵏ. Note that this means that the resulting coefficients are the convolution of the coefficients rᵢ and sⱼ. Also note that whenever the product of the leading coefficients rₙsₘ is nonzero (for instance, in an integral domain), this implies deg fg=deg f+deg g.
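The convolution formula can be sketched directly in Python, with coefficient lists indexed by power of x (an illustrative snippet, not from the notes):

```python
# Polynomial multiplication as convolution of coefficient lists
# (index = power of x, lowest degree first). Assumed helper, not from the notes.
def poly_mul(f, g):
    """result[k] = sum of f[i] * g[j] over all i + j = k."""
    result = [0] * (len(f) + len(g) - 1)
    for i, fi in enumerate(f):
        for j, gj in enumerate(g):
            result[i + j] += fi * gj
    return result

# (1 + x)(1 + 2x + x^2) = 1 + 3x + 3x^2 + x^3
print(poly_mul([1, 1], [1, 2, 1]))  # [1, 3, 3, 1]
```

Note the result's length is (n+1)+(m+1)−1, matching deg fg = deg f + deg g when the leading product doesn't vanish.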
What about polynomial division? Dividing a polynomial f by a polynomial g≠0 with remainder r means finding unique polynomials q,r where deg r<deg g (or r=0) such that f=gq+r. Let’s explore how this can be done.
Therefore: For any two polynomials f,g∈R[x] (where g≠0) we can find unique polynomials q,r (where deg r<deg g or r=0) such that f=gq+r, but only if the leading coefficient of g is a unit. This theorem is known as the division algorithm.- Let n=deg f and m=deg g.
- First of all, if f=0 then the only solution is q=r=0. Otherwise if n<m, then the only solution is q=0 and r=f. Therefore we can induct on n with the assumption that n≥m.
- If n=0, then n≥m implies m=0 and therefore f,g are both constant polynomials ∈R.
- Then we have f=qg+r in R. In R, the condition that either r=0 or deg r<m collapses to just r=0 since m=0.
- Since r=0, f=qg+r becomes qg=f in R. Thus the solution is q=fg⁻¹ and r=0. This requires that g be a unit in R: when g is constant, it must be a unit.
- Otherwise, assume that for every dividend of degree less than n, there are some q,r such that it equals gq+r. WTS that’s true for deg f=n as well.
- With the intention of applying the induction hypothesis, we’d like to obtain a polynomial of degree less than n. This is easy when f and g have the same leading coefficient and degree; then f−g has degree less than n. Otherwise, we must use f−kg where k is a factor that transforms the leading coefficient and degree of g to match those of f.
- Let f=∑ᵢ₌₀ⁿ aᵢxⁱ and g=∑ⱼ₌₀ᵐ bⱼxʲ, where aₙ and bₘ are nonzero. Then the value of k that lets us do this is k=aₙbₘ⁻¹xⁿ⁻ᵐ.
- Note that this means that the leading coefficient bₘ of g must be a unit. (This matches our earlier finding that when g is constant, it must be a unit.) Then we have kg = k∑ⱼ₌₀ᵐ bⱼxʲ = k(bₘxᵐ + ∑ⱼ₌₀ᵐ⁻¹ bⱼxʲ) = aₙbₘ⁻¹xⁿ⁻ᵐ(bₘxᵐ + ∑ⱼ₌₀ᵐ⁻¹ bⱼxʲ) = aₙxⁿ + ∑ⱼ₌₀ᵐ⁻¹ aₙbₘ⁻¹bⱼxʲ⁺ⁿ⁻ᵐ
- Since f and kg have the same leading term aₙxⁿ, the difference f−kg cancels out the leading terms and therefore has degree less than n. Then the induction hypothesis applied to f−kg and g gives us polynomials q′,r′ such that f−kg=q′g+r′ (where either r′=0 or deg r′<deg g). Adding kg to both sides, f=(k+q′)g+r′.
- Thus we have a solution q=k+q′ and r=r′.
Corollary: The division algorithm above produces a unique solution q,r.To see this, towards contradiction assume we have two distinct solutions q,r and q′,r′, so f=gq+r=gq′+r′ implies g(q−q′)=r′−r.
- We know that deg r<deg g and deg r′<deg g (or they are zero), and therefore the RHS r′−r is either 0 or has degree <deg g.
- If q−q′≠0, then since the leading coefficient of g is a unit, the LHS g(q−q′) has degree deg g+deg (q−q′)≥deg g.
- Since the LHS and RHS should have the same degree, this forces q−q′=0, and then r′−r=0 as well — contradicting the assumption that the solutions are distinct.
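The induction above is effectively polynomial long division. Here is a minimal Python sketch over Q, using Fraction so that every nonzero leading coefficient is a unit (the function name and representation are my own, not from the notes):

```python
from fractions import Fraction

def poly_divmod(f, g):
    """Divide f by g (coefficient lists, lowest degree first), mirroring the
    induction in the text: repeatedly subtract k*g where k = lc(f)/lc(g) * x^(n-m).
    Requires the leading coefficient of g to be a unit (any nonzero Fraction)."""
    f = [Fraction(c) for c in f]
    g = [Fraction(c) for c in g]
    q = [Fraction(0)] * max(len(f) - len(g) + 1, 1)
    r = f[:]
    while len(r) >= len(g) and any(r):
        k_coeff = r[-1] / g[-1]      # a_n * b_m^{-1}
        shift = len(r) - len(g)      # the x^{n-m} part of k
        q[shift] += k_coeff
        for j, bj in enumerate(g):   # r := r - k*g
            r[j + shift] -= k_coeff * bj
        while r and r[-1] == 0:      # drop cancelled leading terms
            r.pop()
    return q, r

# x^2 + 3x + 2 divided by x + 1 gives quotient x + 2, remainder 0
q, r = poly_divmod([2, 3, 1], [1, 1])
print([int(c) for c in q], r)  # [2, 1] []
```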
In summary, we always have addition, subtraction, and product defined for R[x]. But you can only divide by polynomials whose leading coefficient is a unit in R. This means polynomial rings in general don’t admit a division algorithm, because they have nonzero polynomials that you can’t divide by. Rings that do admit such a division algorithm are known as Euclidean domains.
Theorem: If R is a field, then R[x] is a Euclidean domain.You can only divide by nonzero polynomials in R[x] whose leading coefficient is a unit in R, but every leading coefficient is a unit when R is a field. Thus you can divide by any nonzero polynomial in R[x], making R[x] a Euclidean domain.
Corollary: For any two polynomials f,g∈R[x] where g is a nonzero monic polynomial (a polynomial whose leading coefficient is 1), we can find unique polynomials q,r (where deg r<deg g or r=0) such that f=gq+r.Following from the division algorithm, you can only divide by polynomials whose leading coefficient is a unit, and 1 is always a unit. Therefore you can always divide by monic polynomials.
Corollary: Same, but for nonzero polynomials whose leading coefficient is a unit.
Theorem: Every field is a Euclidean domain.Since every nonzero element in a field is a unit, you can already divide by every element in a field: a=qb. Therefore all fields have a division algorithm and are Euclidean domains.
Note that the division algorithm in question requires a notion of degree for every nonzero element in the ring. To generalize the notion of degree to non-polynomial rings, this means having some function ϕ:R∖{0}→N mapping each nonzero element to a natural number ≥0, called the Euclidean norm, where ϕ(r)=0 for all units r∈R. The Euclidean norm also needs to satisfy the divisibility condition: for every a∈R and nonzero b∈R, there is a division a=bq+r where either r=0 or ϕ(r)<ϕ(b). (We don’t need to mention the norm ϕ in the above proof, since r=0 in that proof.)
Theorem: If a Euclidean norm ϕ exists in an integral domain R, then a division algorithm exists in R (making R a Euclidean domain.)The Euclidean norm condition requires that for every a∈R and nonzero b∈R, we have a=qb+r where either r=0 or ϕ(r)<ϕ(b). This is exactly the required division.
Then in general, a Euclidean domain is an integral domain for which a Euclidean norm is defined.
Remember that for polynomial rings R[x] that are not Euclidean domains, you can only divide by polynomials whose leading coefficient is a unit in R. Then:
Theorem: Let f and g be nonzero monic polynomials, each of which divides the other. Then f=g.- If f and g divide each other, we have f=gq and g=fq′.
- deg f=deg gq implies deg f≥deg g.
- deg g=deg fq′ implies deg g≥deg f.
- Therefore deg f=deg g, which implies that q,q′ are constant polynomials that don’t affect the degree of the product.
- The fact that both f,g are monic shows that the leading coefficient is 1 before and after multiplying by q,q′, which implies that q,q′ are both 1.
- Therefore f=g.
This last theorem actually applies in all integral domains. Let’s modify the statement a bit:
Theorem: Let f and g be nonzero elements in an integral domain R. Each of f and g generates the other iff they differ by a unit.- f and g being nonzero in an integral domain implies that neither of them are zero divisors.
- If f and g generate each other, we have f=gq and g=fq′, and therefore f=fqq′.
- Then 0=f(qq′−1) where f is not a zero divisor, implying qq′−1=0.
- This means qq′=1, i.e. q,q′ are units.
- Thus f=gq means f differs from g by a unit.
- The converse is trivial — if f,g differ by a unit u, then they generate each other via the unit: f=ug and g=u−1f.
In this section, we explore the consequences of evaluating polynomials.
One of the most important things we can do with polynomials is evaluation, in which we substitute x with some b∈R.Therefore: we can evaluate a polynomial to an element of R by substituting x with b.- Such a mapping is always a ring homomorphism.
- The resulting expression ∑ᵢ aᵢbⁱ is a rational expression, made up of elements of R connected with addition, subtraction, multiplication, integer scalar multiplication, and powers.
- Since ring homomorphisms preserve rational expressions, the resulting expression is an element of R.
This substitution is formalized as the evaluation map (a ring homomorphism) φb:R[x]→R. Applying the evaluation map to a polynomial f can be denoted φb(f), but is more often denoted f(b), the evaluation of f at b.
Lemma: The evaluation map is surjective.Since there is a constant polynomial r∈R[x] for every r∈R, and constant polynomials remain unchanged by the evaluation map, every element of R gets mapped to.
Theorem: R[x]/(x)≅R.- The evaluation map φ0:R[x]→R, which evaluates a polynomial at 0, has kernel (x). To see this, notice that for every f∈R[x], φ0(f)=f(0)=a₀, so f(0)=0 implies the constant coefficient a₀ is 0, and therefore f(x) = aₙxⁿ+…+a₂x²+a₁x = x(aₙxⁿ⁻¹+…+a₂x+a₁) ∈ (x), so the kernel of φ0 is (x).
- The first ring isomorphism theorem says R[x]/ker φ0≅im φ0. We just showed ker φ0=(x), and since φ0 is surjective, im φ0=R, therefore the above becomes R[x]/(x)≅R.
Remember that you can always divide by a monic polynomial in polynomial rings. We’ll see that division relates to evaluation in multiple ways:
Remainder Theorem: When f∈R[x] is divided by x−a, the remainder is f(a).Since x−a is monic, the division algorithm implies some unique q,r such that f=(x−a)q+r. Evaluating at a gives f(a)=(a−a)q+r=r. Therefore, the remainder r is f(a).
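The Remainder Theorem can be checked numerically with synthetic division (Horner's scheme), where the final accumulator is both f(a) and the remainder. A small sketch (my own example, not from the notes):

```python
# Dividing f = x^3 - 2x + 5 by the monic x - 3 via synthetic division
# (Horner's scheme); the final accumulator is the remainder, which should
# equal f(3) by the Remainder Theorem. Assumed example, not from the notes.
f = [1, 0, -2, 5]   # coefficients of x^3 + 0x^2 - 2x + 5, highest degree first
a = 3

acc, partials = 0, []
for c in f:
    acc = acc * a + c   # Horner step: multiply by a, add next coefficient
    partials.append(acc)
quotient, remainder = partials[:-1], partials[-1]

print(quotient, remainder)  # [1, 3, 7] 26, i.e. f = (x - 3)(x^2 + 3x + 7) + 26
assert remainder == sum(c * a ** i for i, c in enumerate(reversed(f)))  # = f(3)
```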
Factor Theorem: In polynomial rings over a field F[x], f(a)=0 iff f=(x−a)q for some unique polynomial q.The evaluation f(a) replaces x with a. If f=(x−a)q, then f(a)=(a−a)q=0q=0. Conversely, if f(a)=0, then since x−a is monic, by the remainder theorem f divided by x−a results in f=(x−a)q+f(a). But f(a)=0, so this becomes f=(x−a)q.
When f(a)=0, then a is called a root of f, and the following are equivalent:
- f(a)=0
- f=(x−a)q for some polynomial q
- f∈(x−a), the principal ideal generated by x−a
Since the factor theorem lets you factor a polynomial by finding its roots, finding the roots of polynomials is pretty important if you want to factor polynomials.
Theorem: Over an integral domain, a degree n polynomial has at most n roots.This is proved by induction, applying the factor theorem up to n times; with no zero divisors, any root b≠a of f=(x−a)q satisfies (b−a)q(b)=0 and must therefore be a root of q.
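The integral-domain hypothesis matters: with zero divisors the bound can fail. A quick Python check of a standard example (not from the notes) in Z/8:

```python
# In Z/8, which has zero divisors, the degree-2 polynomial x^2 - 1
# has four roots, exceeding the "at most 2" bound for integral domains.
roots = [x for x in range(8) if (x * x - 1) % 8 == 0]
print(roots)  # [1, 3, 5, 7]
```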
The theorems below are useful for factoring a polynomial (i.e. finding roots) in a given polynomial ring. Here is an important one for integer polynomials ∈Z[x]:
Rational Roots Theorem: Let f=a₀+a₁x+a₂x²+…+aₙxⁿ∈Z[x]. If a₀≠0 and aₙ≠0, then for every rational root c/d, c divides the constant a₀ and d divides the leading coefficient aₙ.- Let c/d be a rational root of the polynomial f=a₀+a₁x+a₂x²+…+aₙxⁿ∈Z[x].
- Assume c,d are coprime integers – if they are not, divide both c and d by their GCD to make them coprime.
- Since c/d is a root of f, we have f(c/d)=0: a0+a1(c/d)+a2(c/d)2+…+an(c/d)n=0
- Multiply both sides by dn: a0dn+a1dn−1c+a2dn−2c2+…+ancn=0
- From here, we can go in two directions. First, isolate the a0dn term and factor out c from the other terms: c(a1dn−1+a2dn−2c+…+ancn−1)=−a0dn This means c is a factor of −a0dn. Since c,d are coprime integers, c must divide a0.
- Second, isolate the ancn term instead and factor out d from the other terms: d(a0dn−1+a1dn−2c+a2dn−3c2+…+an−1cn−1)=−ancn Similarly, this means d is a factor of −ancn. Since c,d are coprime integers, d must divide an.
Corollary: when f is monic, the only rational roots are all the integer factors of a0.
To remember this theorem, you can think about the polynomial x−c which obviously has a root c/1. So the numerator divides the constant c, and the denominator divides the leading coefficient 1.
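Here is a small Python sketch of the theorem used as a root-finding procedure (the function name and structure are my own, not from the notes):

```python
from fractions import Fraction

def rational_roots(coeffs):
    """All rational roots of a0 + a1 x + ... + an x^n (integer coefficients,
    listed lowest degree first), by testing c/d with c | a0 and d | an."""
    a0, an = coeffs[0], coeffs[-1]
    assert a0 != 0 and an != 0
    divisors = lambda m: [k for k in range(1, abs(m) + 1) if m % k == 0]
    roots = set()
    for c in divisors(a0):
        for d in divisors(an):
            for cand in (Fraction(c, d), Fraction(-c, d)):
                if sum(a * cand ** i for i, a in enumerate(coeffs)) == 0:
                    roots.add(cand)
    return roots

# 2x^2 - 3x - 2 = (2x + 1)(x - 2) has rational roots -1/2 and 2
print(sorted(rational_roots([-2, -3, 2])))  # [Fraction(-1, 2), Fraction(2, 1)]
```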
Corollary: √m is irrational (∉Q) unless m is the square of an integer.- Try to interpret √m as a rational root (∈Q) of some polynomial ∈Z[x].
- If √m∈Q, it would be a rational root c/d of the polynomial x²−m. We know √m is one such root.
- By Rational Roots Theorem, c∣m and d∣1, therefore d=±1. Then c/d=±c so the root √m must be some integer ±c. Therefore m is the square of some integer.
Corollary: ⁿ√m∉Q unless m is the nth power of an integer.- (Same proof as above)
- If ⁿ√m∈Q, it would be a rational root c/d of the polynomial xⁿ−m. We know ⁿ√m is one such root.
- By Rational Roots Theorem, c∣m and d∣1, therefore d=±1. Then c/d=±c so the root ⁿ√m must be some integer ±c. Therefore m is the nth power of some integer.
In this section, we classify the unique factorization of real and complex polynomials.
All the work involved in factoring complex polynomials rests on this one theorem:
Fundamental Theorem of Algebra (FTA): If f∈C[x] is a nonconstant polynomial, then f has a root in C.The proof often requires complex analysis or topology. Here is a proof due to Artin, trying not to use many ideas outside of ring theory:
- It is a theorem that if you treat evaluation of the polynomial f∈C[x] as a map C→C, then f is continuous.
- Evaluate f at each point on a circle Cr of radius r around the origin of the complex plane. Since f is continuous, the images f(Cr) represent some loop on the complex plane. Each point on the circle Cr can be represented in polar coordinates as z=reiθ for some angle θ. Let its corresponding point on the loop f(Cr) be f(z).
- There are two loops we want to consider:
- First, with r approaching 0, we know the images f(Cr) are all close to the constant coefficient c0 of f. So f(Cr) is a tiny loop around c0. We assume c0 is nonzero – if it’s zero then f(0)=0 implies 0 is a root and we are done.
- Second, with r approaching ∞, we know the images f(Cr) are very large but still form some loop. For large r, the leading term cₙzⁿ of f(z) dominates the remaining terms f(z)−cₙzⁿ, so that ∣f(z)−cₙzⁿ∣<∣cₙzⁿ∣=∣cₙ∣rⁿ. This implies the distance between f(z) and the leading term cₙzⁿ is always less than ∣cₙ∣rⁿ, no matter what θ is. If you imagine θ increasing, then cₙzⁿ=cₙrⁿe^{inθ} walks a circle of radius ∣cₙ∣rⁿ around the origin, while f(z) (being continuous) follows at a distance less than ∣cₙ∣rⁿ behind, and therefore the loop it traces must also enclose the origin.
- Importantly, the first loop does not enclose the origin, while the second one does. Since f is continuous, varying r within (0,∞) will continuously vary the corresponding loop between one that doesn’t enclose the origin and one that does. So at some r between 0 and ∞, the resulting loop crosses the origin, and therefore we get a root f(re^{iθ})=0 for some θ.
Corollary (FTA in C[x]): For every complex polynomial f∈C[x], you can write it in the form f=k∏i(x−ui) where k is the leading coefficient and ui are all the roots.This is repeated application of the FTA. You apply FTA to find a root a, factor it out with the factor theorem to get f=(x−a)g, and repeat with g until g is a constant polynomial k. Since we’re always factoring out a monic polynomial x−a, the leading coefficient never changes, and so the final k is the leading coefficient of f.
Conjugate Root Theorem: if a+bi∈C is a root of a real polynomial f∈R[x], then a−bi is also a root.- Since complex conjugation z↦z̄ fixes the real numbers, conjugation commutes with any real polynomial: f(z̄) is the conjugate of f(z).
- Therefore, if f(a+bi)=0, then f(a−bi) is the conjugate of f(a+bi)=0, which is again 0, so a−bi is also a root of f.
Corollary (FTA in R[x]): For every real polynomial f∈R[x], you can write it in the form f=k∏i(x−ui)∏iqi where k is the leading coefficient, ui are all the real roots, and qi are all the monic irreducible real quadratics.- Do the same thing you did for complex polynomials, factoring f into a product f=k∏i(x−ui) where the roots ui are complex.
- Since the coefficients of f are real, by the Conjugate Root Theorem, the complex roots come in conjugate pairs. Then the product of their corresponding factors, (x−(a+bi))(x−(a−bi))=x²−2ax+(a²+b²), is a monic irreducible real quadratic, i.e. one of the qi.
- Therefore, after combining all complex factors into monic real quadratic factors, f can be written in the form k∏i(x−ui)∏iqi where qi are the product of each conjugate pair of complex factors within the original ui.
Corollary: all irreducible polynomials in R[x] are linear or quadratic (have degree 1 or 2).
We see that the Fundamental Theorem of Algebra implies that (nonconstant) polynomials in C[x] or R[x] factor uniquely into a constant times a product of (monic) irreducible factors.
In this section, we examine the ideals of a polynomial ring.
Let’s explore what the ideals A of a polynomial ring R[x] look like.
Therefore: If R[x] is a Euclidean domain, then every ideal in R[x] is principal – generated by a single element, in this case a unique monic polynomial.- If A is the zero ideal, it is generated by 0 and we are done.
- Otherwise, A contains a nonzero polynomial.
- We can divide by monic polynomials, but A doesn’t necessarily contain one — unless R[x] is a Euclidean domain, in which case every nonzero leading coefficient is a unit, and we can make any nonzero polynomial monic by multiplying it by the inverse of its leading coefficient. Since any product of an element of the ideal A with a ring element is also in A, A therefore contains a monic polynomial.
- Take a monic polynomial of minimal degree g in A. Every polynomial f∈A must have g as a factor because the division algorithm lets us divide by monic polynomials.
- Then the division algorithm says f=gq+r where either r=0 or deg r<deg g.
- Since r=f−gq, r must also be in A. If r were nonzero, we could also make it monic, and the result would also be in A.
- But deg r<deg g would contradict g being the monic polynomial of minimal degree in A. Therefore r=0.
- This means f=gq. Since f was arbitrary, g generates A.
- To show that g uniquely generates A, assume that h is another monic generator of A. Then g and h generate each other: g=hq and h=gq′. But monic polynomials that divide each other are equal, so g=h.
- deg g=deg hq implies deg g≥deg h, and deg h=deg gq′ implies deg h≥deg g. Therefore deg g=deg h.
- deg g=deg h implies that q,q′ are constant polynomials that don’t affect the degree of the product. The fact that both g,h are monic (leading coefficient is 1 before and after multiplying by q,q′) implies that q,q′ are both 1. This implies g=h.
This means we can write all quotient rings of F[x] (F a field) as F[x]/(h), where h is some monic polynomial. Since the generator of every ideal is unique, this implies a bijection between monic polynomials in F[x] and nonzero ideals in F[x].
When every ideal of a ring is principal, we have a principal ideal domain (PID).
It turns out every Euclidean domain is a PID.
Theorem: All Euclidean domains are PIDs.- Let R be a Euclidean domain with norm ϕ. Then for every element a∈R, we can divide a by nonzero b∈R to get a=bq+r where either r=0 or ϕ(r)<ϕ(b).
- We can show that for every nonzero ideal I in R, every element a of I is generated by some element b∈I with the smallest norm ϕ(b). That is, a=bq for some q∈R.
- This follows immediately from the division algorithm. We have a=bq+r with r=a−bq∈I, and since ϕ(b) is minimal in I, there is no nonzero r∈I with ϕ(r)<ϕ(b); therefore r=0.
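For R = Z with norm ϕ(a)=∣a∣, this is the familiar fact that the ideal generated by two integers is principal, generated by their gcd. A quick Python check (my own example, not from the notes):

```python
from math import gcd

a, b = 12, 42
g = gcd(a, b)  # 6: the smallest positive norm among Z-combinations of a and b

# sample the ideal (a, b) = {ax + by : x, y in Z} on a finite window
ideal = {a * x + b * y for x in range(-20, 21) for y in range(-20, 21)}

assert g in ideal                      # Bezout: g = ax + by for some x, y
assert all(e % g == 0 for e in ideal)  # every element of (a, b) is a multiple of g
print(g)  # 6
```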
In this section, we determine what it means to quotient in polynomial rings defined over a field F.
In particular, when F is a field, then F[x] is a Euclidean domain and therefore a PID. Let’s explore polynomial rings defined over a field F.
Therefore: The elements of F[x]/(h) are every polynomial in F[x] with degree less than deg h, under the relation h(x)=0.- When you quotient by some ideal (h), you’re effectively sending the polynomial h to 0 and therefore enforcing the relation h(x)=0 on the polynomial ring. That means whenever f=hq, f=0.
- But in a Euclidean domain, every polynomial f of degree at least deg h can be divided by h: f=hq+r with either r=0 or deg r<deg h. Since h=0 in the quotient, f is identified with its remainder r.
- So every element of the quotient is represented by a polynomial of degree less than deg h (or the zero polynomial).
Example: F[x]/(x²)≅ all linear and constant polynomials in F[x].We identify x² with 0, so every multiple of x² is sent to 0; by division, every polynomial is then identified with its remainder mod x², a polynomial of degree at most 1 (or the zero polynomial).
Example: R[x]/(x²+1)≅C.- Here we identify x²+1 with 0. We can understand this as x=√−1=i (in C).
- Since every resulting polynomial is at most degree 1, they are in the form c0+c1x.
- In other words, the elements of R[x]/(x2+1) are c0+c1i.
- This is exactly how we write elements a+bi of C, so the two rings are isomorphic.
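We can see this isomorphism computationally: represent a+bx ∈ R[x]/(x²+1) as the pair (a, b), multiply as polynomials, and reduce x² to −1. The product rule that falls out is exactly complex multiplication (a quick Python sketch, names my own):

```python
# Multiplication in R[x]/(x^2 + 1): represent a + bx as the pair (a, b)
# and reduce x^2 to -1. This should agree with Python's complex numbers.
def mul_mod_x2_plus_1(p, q):
    a, b = p
    c, d = q
    # (a + bx)(c + dx) = ac + (ad + bc)x + bd*x^2, and x^2 = -1
    return (a * c - b * d, a * d + b * c)

p, q = (1, 2), (3, -1)                 # 1 + 2x and 3 - x
print(mul_mod_x2_plus_1(p, q))         # (5, 5)
print(complex(1, 2) * complex(3, -1))  # (5+5j)
```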
Example: Q[x]/(x³−2)≅Q(∛2).- Here we identify x³−2 with 0. We can understand this as x=∛2 (in R).
- Since every resulting polynomial is at most degree 2, they are in the form c₀+c₁x+c₂x².
- In other words, the elements of Q[x]/(x³−2) are c₀+c₁∛2+c₂(∛2)².
- But this is the same as adjoining an element ∛2 to Q.
To generalize these last two examples, when F is a field and h is irreducible, we can take the quotient F[x]/(h) to form a ring isomorphic to F[c], where c is a solution to h=0 (i.e. a root of h). It turns out the result is a field:
Theorem: F[x]/(h) is a field iff h is irreducible.- Since F[x] is a PID, the result follows directly from this theorem.
- (Note that if h is reducible, then F[x]/(h) sends h=fg to 0, meaning there are zero divisors f,g so the result fails to be even an integral domain.)
In other words, the quotient rings of polynomial rings can be used to construct field extensions, which we’ll explore in depth later on.
In this section, we introduce how the derivative of a polynomial helps determine roots.
The derivative f′ of the polynomial f∈F[x] can be constructed by taking each term aᵢxⁱ of f and mapping it to iaᵢxⁱ⁻¹, where the integer i is mapped into F using the canonical homomorphism Z→F (i.e. adding 1 to itself i times).
The derivative helps us determine when a polynomial has multiple roots. α is a multiple root of f if f has a factor (x−α)n for some n≥2.
Theorem: If the derivative of a polynomial f shares a root α with f, then α is a multiple root of f.- Since f has α as a root, factor out f=(x−α)g. To show that α is a multiple root, we can show that g also has α as a root, i.e. g(α)=0.
- By the product rule, f′=(x−α)g′+g. Since f′(α)=0, we have 0=(α−α)g′(α)+g(α), implying 0=g(α), implying α is a root of g.
Corollary: If the derivative of an irreducible polynomial f is nonzero, then f and f′ share no factors.Irreducible polynomials f can only share a factor (itself) with either 0 or polynomials of greater or equal degree. Since the derivative f′ always has lesser degree, f′ has to be zero in order for f and f′ to share a factor.
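A small Python sketch of the formal derivative and the multiple-root test (example polynomial chosen by me, not from the notes): f=(x−1)²(x+2)=x³−3x+2 has 1 as a multiple root, and indeed f′ vanishes there too.

```python
def derivative(coeffs):
    """Formal derivative: a_i x^i -> i*a_i x^(i-1). Coefficients low to high."""
    return [i * a for i, a in enumerate(coeffs)][1:]

def evaluate(coeffs, x):
    return sum(a * x ** i for i, a in enumerate(coeffs))

f = [2, -3, 0, 1]   # 2 - 3x + x^3 = (x - 1)^2 (x + 2)
df = derivative(f)
print(df)           # [-3, 0, 3], i.e. f' = -3 + 3x^2

print(evaluate(f, 1), evaluate(df, 1))    # 0 0  -> 1 is a multiple root
print(evaluate(f, -2), evaluate(df, -2))  # 0 9  -> -2 is a simple root
```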
-
December 4, 2023.
Exploration 4: Factorization
Questions:
- When is a factorization unique?
To factor an element is to consider it as a product of other elements in the ring. For instance, in the ring of integers Z, the integer 6 typically factors into 2⋅3. By multiplying both factors by the unit −1, we obtain −2⋅−3 as another valid factorization. Are these the only factorizations of 6 (up to reordering)?
No — we also have 1⋅6 and −1⋅−6. In general, factorizations like a=1a, or a=u(u⁻¹a) (where u is a unit) are trivial factorizations, because such factorizations exist in any ring and are therefore not interesting. So when we talk about factorizations we generally mean nontrivial factorizations, i.e. factorizations into nonzero non-units.
When a nonzero, non-unit element cannot be factored further because all of its factorizations are trivial, we call it an irreducible element, or simply an irreducible. Otherwise, the element is reducible.
Another way to define an irreducible element is the following:
Theorem: If an irreducible a factors into bc, then either b or c is a unit.The fact that a is irreducible means any factorization bc is trivial, so either b or c is a unit.
In this section, we explore whether factorization terminates.
Let a be a reducible element. If we repeatedly perform nontrivial factorizations on a, we’d first obtain a=bc. Then nontrivial factorizations of b and c gives us a=(de)(fg). Ideally this process terminates once there are no more nontrivial factorizations possible, i.e. all our factors are irreducibles. But does this process ever terminate?
a₁ = a₂b₂ (assume a₂ is reducible)
  = (a₃b₃)b₂ (assume a₃ is reducible)
  = ((a₄b₄)b₃)b₂ (assume a₄ is reducible)
  = …
With every step we shave off some nonzero non-unit factor bᵢ. But nothing says that aᵢ eventually becomes irreducible, and nothing says that this process must terminate. In fact, even in a finite ring, it’s possible to shave infinitely many factors bᵢ off of a given element a₁. We’ll actually see an example of that later.
When we’re in an integral domain, there is a better way to characterize this process that avoids introducing extraneous variables like bi. We use a chain of inclusions of principal ideals — the above can be summarized as (a1)⊊(a2)⊊(a3)⊊(a4)⊊… The expression (a1)⊊(a2) says two things:
- (a1), being a subset of (a2), implies a2∣a1, equivalent to our a1=a2b2 above.
- (a1), being a proper subset of (a2), means this a1=a2b2 cannot be a trivial factorization, because in an integral domain a1,a2 differ by a unit b2 if and only if (a1)=(a2).
Therefore the existence of this “ascending chain” of principal ideals expresses the existence of a sequence of nontrivial factorizations, exactly what we were expressing before with ai and bi. Then in order to ensure that the process of factoring a1 terminates, we must guarantee that every such chain starting from (a1) is finite.
Expanding this guarantee to all elements in the ring results in a condition called the ascending chain condition on principal ideals (ACCP): the guarantee that there exists no infinite strictly ascending chain of principal ideals in the ring. If an integral domain satisfies the ACCP, then factorization always terminates, leaving you with a product of irreducibles.
Theorem: If an integral domain R satisfies the ACCP, then so does its polynomial ring R[x].- Let (f1)⊊(f2)⊊… be an ascending chain of principal ideals in R[x]. WTS this chain is finite.
- As mentioned previously, each strict inclusion is just a fancy way of saying fi=fi+1q where q is not a unit. In particular deg fi+1≤deg fi, so the degrees form a nonincreasing sequence of nonnegative integers, which can only drop finitely many times.
- Past the point where the degree stabilizes, each factor q is a nonzero non-unit constant of R, so the leading coefficients ci of the fi form a strictly ascending chain (ck)⊊(ck+1)⊊… of principal ideals in R.
- But since R satisfies the ACCP, that chain of leading coefficients is finite, and therefore so is the original chain. Therefore R[x] satisfies the ACCP.
If we are guaranteed that the process of factorization always terminates, then every (nonzero, nonunit) element a can be factored into a product of irreducibles ∏ipi (times some unit u). For this reason, integral domains for which ACCP holds are called factorization domains.
In this section, we briefly examine how zero divisors complicate factorization.
Our assumption above of being in an integral domain is in fact required: in the presence of zero divisors, even the ACCP no longer guarantees that factorization terminates.
The easiest example of this is when you have a nontrivial idempotent e (e≠0,1 with e²=e), which is always a zero divisor since e(e−1)=0. In that case, we have e=e⋅e=e⋅e⋅e=e⋅e⋅e⋅e=…, so e factors into arbitrarily many copies of itself, never reaching an irreducible element. Therefore the existence of a nontrivial idempotent shows that not every nonzero nonunit can be factored into a product of irreducibles, even if the ring satisfies the ACCP.
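As a concrete instance of this (the modulus 6 and the helper name are my own toy choices): in Z/6Z the element 3 is a nontrivial idempotent, so "factoring" it into copies of itself never terminates.

```python
# In Z/6Z, 3 is a nontrivial idempotent: 3*3 = 9 = 3 (mod 6).
# So 3 = 3*3 = 3*3*3 = ..., a factorization process that never terminates.

def is_nontrivial_idempotent(e, n):
    """Check that e*e = e in Z/nZ and that e is not 0 or 1."""
    return (e * e) % n == e % n and e % n not in (0, 1)

assert is_nontrivial_idempotent(3, 6)

# Repeatedly multiplying in another copy of 3 never changes the product,
# so we never reach a factorization into irreducibles.
product = 3
for _ in range(10):
    product = (product * 3) % 6
    assert product == 3
```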
This makes integral domains, which have no zero divisors, particularly suited for nontrivial factorizations. Here’s one property that showcases this fact:
Theorem: In an integral domain, two distinct nontrivial factorizations cannot share a factor.Since cancellability holds in integral domains, if an element a has two nontrivial factorizations a=bc=bd that share the factor b, then we cancel b to get c=d, implying that the two factorizations are not distinct.
We can use this to prove a useful fact about irreducibles in an integral domain:
Corollary: In an integral domain, if two elements a,b differ by a factor (a=bc for some c), then that factor c is unique.If a=bc=bd, then cancelling b gives c=d, so c is unique.
Corollary: In an integral domain, if two irreducible elements a,b differ by a factor (a=bc for some c), then c is a unique unit.This combines the above with the fact that irreducibles only have trivial factorizations: in a=bc the factor b is irreducible and hence not a unit, so c must be a unit, and by the above it is unique.
In this section, we explore what is required to make factorizations unique.
Recall the fundamental theorem of arithmetic: “every integer greater than 1 can be factored into a unique product of prime numbers.”
This statement mirrors the guarantee provided by factorization domains: every element can be factored into irreducibles. But is this factorization unique? In other words, does the fundamental theorem of arithmetic extend to all factorization domains? Let’s see:
Theorem: In a factorization domain, if every irreducible p dividing some product ∏ipi must divide one of the factors pi, then factorization is unique (up to units).- Let R be a factorization domain, i.e. an integral domain that satisfies the ACCP. Then every nonzero non-unit element a∈R can be written as a product of irreducible elements pi (and a unit u): a=u⋅p1p2⋯pn
- Recall that in an integral domain, an irreducible element b dividing a product a implies that a,b differ by a unique factor c. That means if a has two nontrivial factorizations that share a factor p1, then a=p1b, so they differ by a unique element b. If this b also has two nontrivial factorizations that share a factor p2, then b=p2c, so they differ by a unique element c, and so on.
- Thus if we enforce the property that every two factorizations of the same element must share a factor, then eventually this process stops at 1=1, where every factor pi must be unique, implying that factorizations are unique.
Note that in a factorization domain, two factorizations a=bc=bd of the same element share a factor iff they share some irreducible factor p, since the shared factor itself factors into irreducibles. So suppose a=pn, where n is unique by the corollary above. Then: a=bc=pn Since p is irreducible and the factorizations share it, p must be a factor of b or of c. Then the other factor (c or b respectively) divides n, so either n=cc′ or n=bb′: bc=p(cc′) or bc=p(bb′), and cancelling gives b=pc′ or c=pb′, i.e. p∣b or p∣c. So our original condition, that any two factorizations of the same element share a factor, is equivalent to saying that if an irreducible factor p divides a product bc, then either p divides b or p divides c: p∣bc⟹p∣b or p∣c When this condition holds for an element p for all factorizations, we say that p is prime.
Theorem: In an integral domain, the only non-unit divisors of a prime p are its unit multiples.- Say that q is a non-unit divisor of p, so p=qr for some r. Then p∣p implies p∣qr. By the prime property, either p∣q or p∣r.
- If p∣q, then p and q divide each other, so q is a unit multiple of p, and we are done.
- If p∣r, then r=ps for some s, meaning p=q(ps). Since we're in an integral domain, we cancel p on both sides to get 1=qs, implying that q is a unit — contradicting the assumption that q is a non-unit.
Thus if every irreducible in a factorization domain R has this property (every irreducible is prime), then the fundamental theorem of arithmetic holds, thus factorization in R is unique. When every irreducible is prime, we call R a unique factorization domain (UFD).
In fact, this property is enough to show the converse of the fundamental theorem:
Theorem: UFDs are exactly the factorization domains in which every irreducible is prime.- (→): Assuming factorization is unique, we must show that for an irreducible p, p∣ab implies p∣a or p∣b.
- If p∣ab, then we have pq=ab for some q.
- Let the unique factorizations of a,b,q be ua∏iai, ub∏ibi, uq∏iqi respectively. Then p⋅(uq∏iqi)=(ua∏iai)(ub∏ibi)
- Since factorization is unique, p appearing on the LHS implies it appears on the RHS as well (perhaps after multiplying by a unit). Then it will appear as either a factor ai or a factor bi.
- That means either p∣a or p∣b, and therefore the arbitrary irreducible p is prime.
- (←): proved earlier.
In this section, we explore the ideals generated by irreducibles.
Before we get deeper into primes, let’s check out the ideals generated by irreducibles. One reason irreducibles are interesting is because they are exactly the elements that generate maximal principal ideals:
Theorem: In an integral domain, the irreducible elements are exactly the elements that generate maximal principal ideals.- (→) If p is irreducible, WTS (p) is a maximal principal ideal.
- (p) is maximal among proper principal ideals if (p)⊆(q) for some other proper principal ideal (q) implies (p)=(q).
- (p)⊆(q) means p=qr for some r, which must be a unit because p is irreducible and q (being in a proper ideal (q)) is not a unit. (proof) So we can write q=r−1p.
- But that means (q)⊆(p). Therefore (p)=(q) and (p) is indeed maximal among proper principal ideals.
- (←) If (p) is a maximal principal ideal, then WTS p is irreducible.
- We just need to show that whenever p=qr, either q or r is a unit.
- If neither are units then we have (p)=(qr)⊊(q) (proof). Again since q is not a unit, (q) is a proper principal ideal. (proof)
- But then (p) is not maximal since it is properly contained in a proper principal ideal, contradiction. Therefore either q or r is a unit, and so p is irreducible.
Corollary: In a PID, irreducible elements are the elements that generate maximal ideals.
Recall that a principal ideal domain (PID) is one where every ideal is a principal ideal. They, too, are UFDs:
Theorem: Every PID is a UFD.- WTS PIDs satisfy the ACCP and thus are a factorization domain.
- In a PID, the union of any ascending chain of ideals (a1)⊊(a2)⊊… is itself an ideal, hence a principal ideal (a).
- The generator a must appear in some ideal (an) of the chain, so the union is just (an). But the union contains every ideal in the chain, so any ideal after (an) both contains and is contained in (an), hence equals it. Therefore no ascending chain of ideals can grow infinitely.
- Therefore PIDs satisfy the ACCP.
- WTS every irreducible in a PID is prime, so that the PID is a UFD.
- Let p be an irreducible and suppose p∣ab.
- Irreducibles are the elements that generate maximal principal ideals, so in particular, (p) is maximal.
- Assume WLOG that p∤a, so that proving p∣b is enough to show that p is prime.
- Let (p,a) be the ideal generated by p and a.
- If p∤a, then a∉(p), so (p,a) is strictly larger than (p). Since (p) is maximal, (p,a) must be the ideal (1), which is the entire ring.
- In that case, some linear combination of p and a generates 1, so we have some xp+ya=1 for some x,y. Multiply by b to get (xp)b+y(ab)=b, showing that xp and ab generate b. But p divides xp and ab, therefore p divides any element they generate, including b. Therefore p is prime.
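The Bezout step in this proof can be traced numerically in Z, itself a PID (the values p=5, a=3, b=10 and the helper `egcd` are my own illustrative choices):

```python
def egcd(a, b):
    """Extended Euclidean algorithm: return (g, x, y) with a*x + b*y = g = gcd(a, b)."""
    if b == 0:
        return (a, 1, 0)
    g, x, y = egcd(b, a % b)
    return (g, y, x - (a // b) * y)

# Mirror the proof: the prime p = 5 divides a*b with a = 3, b = 10, but not a.
p, a, b = 5, 3, 10
assert (a * b) % p == 0 and a % p != 0

# (p, a) = (1), so Bezout gives x*p + y*a = 1.
g, x, y = egcd(p, a)
assert g == 1 and x * p + y * a == 1

# Multiply through by b: (x*p)*b + y*(a*b) = b. p divides both terms, so p | b.
assert x * p * b + y * (a * b) == b
assert b % p == 0
```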
Corollary: Every polynomial ring over a field is a UFD.Polynomial rings over a field are Euclidean domains, hence PIDs, hence UFDs. Recalling that the integers Z are a PID, we also get that Z is a UFD; we'll see later that the integer polynomials Z[x] form a UFD as well.
Lemma: If R is a PID and p is prime, then R/(p) is a field.This is because nonzero prime ideals are maximal in a PID (proved later in this exploration), and quotients by maximal ideals are fields.
Theorem: If p is prime in Z, then Zp[x] is a UFD.By the lemma, Zp=Z/(p) is a field, so Zp[x] is a Euclidean domain, hence a PID, hence a UFD.
Corollary: If R is a PID and p is irreducible in R, then R/(p) is a field.This is just a restatement of the above, since PIDs are UFDs and irreducibles are prime in UFDs.
In this section, we explore the ideals generated by primes.
Let’s first get a little more intuition of why prime ideals are called prime:
Theorem: Prime elements p generate prime principal ideals (p).We need to show that ab∈(p) implies either a∈(p) or b∈(p). ab∈(p) means ab=kp for some k, so p∣ab. Because p is prime, p∣a or p∣b, i.e. a=k1p or b=k2p for some k1,k2, which means a∈(p) or b∈(p). Therefore, (p) is a prime ideal.
The converse holds in integral domains, with a similar proof.
Theorem: In an integral domain, all principal prime ideals (p) are generated by primes p.Given that (p) is a prime ideal, we need to show that p∣ab implies p∣a or p∣b. p∣ab means ab=kp for some k, so ab∈(p). Because (p) is a prime ideal, a∈(p) or b∈(p), i.e. a=k1p or b=k2p for some k1,k2, which means p∣a or p∣b.
Corollary: In an integral domain, the principal prime ideals are exactly the ideals generated by primes.
Corollary: p is prime iff R/(p) is an integral domain.This is because (p) is a prime ideal iff the quotient R/(p) is an integral domain.
We know that if every irreducible is prime, then we must be in a UFD. But when is every prime irreducible?
Theorem: In an integral domain, every prime is irreducible.Suppose a prime p factors as p=ab; WTS one of a,b is a unit. Since p∣ab, either p∣a or p∣b; WLOG assume p∣a, so a=kp for some k. Then: p=ab=kpb, so p−kpb=0, i.e. p(1−kb)=0. Since we're in an integral domain and p≠0, this means 1−kb=0, i.e. 1=kb, so b is a unit. Therefore every factorization of p is trivial, and p is irreducible.
-
December 5, 2023.
Exploration 5: Irreducibility
Questions:
- When is a polynomial irreducible?
- What happens to irreducibility when you turn a ring into a polynomial ring?
- How do you determine irreducibility given an element?
Recall that in a UFD, every element factors into a unique product of irreducible elements. For instance, in the ring of integers Z, elements uniquely factor into positive prime factors (and a sign). In the polynomial ring R[x], elements uniquely factor into monic factors (and a constant).
In an arbitrary UFD, there is no general way to identify such elements (like “positive” or “monic”), but we can do something else: call two elements associate (denoted a∼b) if they differ by a unit.
Theorem: Only the units are associate to 1.If an element r is associate to 1, it means r=1u for some unit u, which implies r is that unit.
Then instead of saying that a factorization is unique up to multiplication by units, we can equivalently say factorization is unique up to associates.
Here are some properties of associate elements:
Theorem: In an integral domain, two nonzero elements are associate iff each generates the other.
Corollary: In an integral domain, two nonzero elements are associate iff they divide each other.This is because if two nonzero elements a,b generate each other, then a=bx and b=ay implying a∣b and b∣a.
Theorem: In an integral domain, if a divides b, so do all the associates of a.Assume a∣b. If c is an arbitrary associate of a (c∼a), then a=uc for some unit u. Now a∣b means ad=b for some d, so (uc)d=b, i.e. c(ud)=b, and therefore c∣b.
In this section, we define GCD and LCM and their implications for UFDs.
The greatest common divisor (GCD) of two elements gcd(a,b) is a “greatest” element in the ring that divides both a and b, in the sense that any other divisor of both a and b divides a GCD.
Similarly, the least common multiple (LCM) of two elements lcm(a,b) is a “least” element in the ring that is a multiple of both a and b, in the sense that an LCM divides any other common multiple of a and b.
Note that this notion of “least” and “greatest” rely on some divisibility partial order, which only exists in some rings. gcd(a,b) is a greatest element among all divisors of both a and b, and lcm(a,b) is a least element among all multiples of both a and b.
Also note that there can exist multiple GCDs and LCMs in a ring, a fact that we’ll deal with immediately:
Theorem: In an integral domain, GCD and LCM are uniquely determined up to associates, if they exist.- Recall that elements that divide each other are associate.
- If there are two GCDs, then by definition they divide each other and are thus associates.
- Similarly, if there are two LCMs, then by definition they are a multiple of each other, meaning they divide each other, and are thus associates.
The GCD is very powerful, as the rest of this exploration will demonstrate. But first we need this important property of the GCD:
Theorem: gcd(ca,cb)∼c⋅gcd(a,b)- It is enough to prove that the two sides divide each other.
- gcd(ca,cb)∣c⋅gcd(a,b): any divisor of both a and b must divide gcd(a,b), by definition of GCD. Multiplying by c, any divisor of both ca and cb must divide c⋅gcd(a,b). But gcd(ca,cb) is such a divisor.
- c⋅gcd(a,b)∣gcd(ca,cb): since gcd(a,b) divides both a and b, multiply by c to find that c⋅gcd(a,b) divides both ca and cb. Therefore it also divides their greatest common divisor, gcd(ca,cb).
Using this we can prove that, in fact, the existence of GCDs in a factorization domain is equivalent to saying that every irreducible is prime.
Theorem: In a factorization domain, every irreducible is prime iff GCDs exist.- (→)
- Since every irreducible is prime in a factorization domain, it’s a UFD (proof) and therefore every element has a unique factorization into prime elements.
- Then to get the GCD, you can just take the unique product of all the common factors of two elements, which means taking the minimum exponent of each common prime power factor.
- (Note that you can similarly get the LCM by taking the product of every prime factor appearing in either element, raised to the maximum exponent.)
- (←)
- Given irreducible p, WTS it is prime.
- If p∣ab, then d=gcd(a,p) is a divisor of p.
- Since p is irreducible, either d∼p or d∼1.
- If d∼p, we already know that d∣a by definition of GCD. Since associates of divisors are also divisors (proof), we have p∣a.
- Otherwise, d∼1 means gcd(a,p)∼1. Multiply by b on both sides to get gcd(a,p)b∼b. Then by the lemma, gcd(ab,pb)∼b. Since p divides both ab (given) and pb, p divides gcd(ab,pb)∼b, and therefore p∣b.
- So either p∣a or p∣b, meaning irreducible p is prime.
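In Z, the min/max-exponent recipe from the forward direction can be sketched directly (a toy illustration; `factorize` uses naive trial division, and all helper names are my own):

```python
from collections import Counter
from math import gcd

def factorize(n):
    """Prime factorization of n >= 2 as a Counter {prime: exponent} (trial division)."""
    factors, d = Counter(), 2
    while d * d <= n:
        while n % d == 0:
            factors[d] += 1
            n //= d
        d += 1
    if n > 1:
        factors[n] += 1
    return factors

def gcd_via_exponents(a, b):
    """GCD = product of the common primes, each to its minimum exponent."""
    fa, fb = factorize(a), factorize(b)
    out = 1
    for p in fa.keys() & fb.keys():
        out *= p ** min(fa[p], fb[p])
    return out

def lcm_via_exponents(a, b):
    """LCM = product of every prime appearing in either, to its maximum exponent."""
    fa, fb = factorize(a), factorize(b)
    out = 1
    for p in fa.keys() | fb.keys():
        out *= p ** max(fa[p], fb[p])
    return out

# 360 = 2^3 * 3^2 * 5 and 84 = 2^2 * 3 * 7:
assert gcd_via_exponents(360, 84) == gcd(360, 84) == 12
assert lcm_via_exponents(360, 84) == 2520
```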
Corollary: A factorization domain is a UFD iff GCDs exist.This is because a factorization domain is a UFD iff every irreducible is prime, which we just proved is equivalent to saying GCDs exist.
In this section, we introduce the Bézout identity relating GCDs to elements in the ring.
We know that fields and Euclidean domains permit division by some division algorithm. Observe that both also have GCDs:
The division algorithm in a Euclidean domain supports what is called the extended Euclidean algorithm. This algorithm calculates the GCD of some given f and nonzero g as a linear combination of f and g, resulting in the Bézout identity gcd(f,g)=af+bg.
Theorem: For every f and nonzero g in a Euclidean domain R, their GCD can be expressed gcd(f,g)=af+bg for some a,b∈R.- If we divide some f∈R by some nonzero g∈R, we have f=gq1+r1 for some q1,r1∈R (where either r1=0 or deg r1<deg g.)
- If r1=0, we are done: then g∣f so gcd(f,g)=g, and we have gcd(f,g)=0f+1g.
- Otherwise, dividing g by r1 gives g=r1q2+r2.
- If r2=0, we are done: then gcd(f,g)=r1 and f=gq1+r1 implies gcd(f,g)=1f−q1g.
- Otherwise, dividing r1 by r2 gives r1=r2q3+r3.
- If r3=0, we are done: then gcd(f,g)=r2 and g=r1q2+r2 implies gcd(f,g)=g−r1q2, which simplifies (using f=gq1+r1) to gcd(f,g)=g−r1q2=g−(f−gq1)q2=(−q2)f+(1+q1q2)g
- Otherwise, the idea is to continue doing this until you get rk=0, thus gcd(f,g)=rk−1 and you can backsubstitute each ri to express the GCD as a linear combination gcd(f,g)=af+bg for some a,b.
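The back-substitution above can be carried out mechanically. Here is a sketch of the extended Euclidean algorithm in the Euclidean domain Zp[x], with polynomials as lowest-degree-first coefficient lists (the modulus P = 5, the example polynomials, and all helper names are my own choices):

```python
# Polynomials over Z_P as coefficient lists, lowest degree first.
P = 5  # an arbitrary prime modulus for illustration

def trim(f):
    """Normalize coefficients mod P and drop trailing zeros."""
    f = [c % P for c in f]
    while f and f[-1] == 0:
        f = f[:-1]
    return f

def add(f, g):
    n = max(len(f), len(g))
    f, g = f + [0] * (n - len(f)), g + [0] * (n - len(g))
    return trim([a + b for a, b in zip(f, g)])

def mul(f, g):
    if not f or not g:
        return []
    out = [0] * (len(f) + len(g) - 1)
    for i, a in enumerate(f):
        for j, b in enumerate(g):
            out[i + j] += a * b
    return trim(out)

def divmod_poly(f, g):
    """Division algorithm in Z_P[x]: f = g*q + r with deg r < deg g."""
    f, g = trim(f), trim(g)
    q = [0] * max(len(f) - len(g) + 1, 1)
    inv = pow(g[-1], P - 2, P)  # inverse of the leading coefficient (Fermat)
    while f and len(f) >= len(g):
        shift = len(f) - len(g)
        coef = (f[-1] * inv) % P
        q[shift] = coef
        f = add(f, mul([-coef], [0] * shift + g))  # cancel the leading term
    return trim(q), f

def extended_euclid(f, g):
    """Return (d, a, b) with d = gcd(f, g) = a*f + b*g in Z_P[x]."""
    r0, r1 = trim(f), trim(g)
    a0, a1, b0, b1 = [1], [], [], [1]
    while r1:
        quot, r = divmod_poly(r0, r1)
        r0, r1 = r1, r
        a0, a1 = a1, add(a0, mul([-1], mul(quot, a1)))
        b0, b1 = b1, add(b0, mul([-1], mul(quot, b1)))
    return r0, a0, b0

# gcd(x^2 - 1, x - 1) in Z_5[x] should be (an associate of) x - 1.
f, g = [4, 0, 1], [4, 1]  # x^2 + 4 = x^2 - 1 and x + 4 = x - 1 mod 5
d, a, b = extended_euclid(f, g)
assert d == [4, 1]
assert d == add(mul(a, f), mul(b, g))  # Bezout: d = a*f + b*g
```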
Finally, two elements are coprime if their GCD is a unit, i.e. the GCD is associate to 1.
Corollary: If f and g are coprime in a Euclidean domain R, then 1=af+bg for some a,b∈R.Coprime means the GCD is a unit u, so u=af+bg for some a,b; multiplying through by u−1 gives 1=(u−1a)f+(u−1b)g.
Corollary: If f is prime, then for every nonzero g, gcd(f,g) must be either 1 or f (up to associates).Since f is prime, its only non-unit divisors are the unit multiples of f. Therefore gcd(f,g)∼f if f∣g, and gcd(f,g)∼1 otherwise.
Theorem: In a PID, nonzero prime ideals (p) are maximal.(p) (being a nonzero principal ideal) contains every element with a factor of p, so any element r∈R outside of (p) must be coprime to p. Since adding any outside element r to (p) makes it equal to (p,r)=(gcd(p,r))=(1)=R, i.e. not a proper ideal anymore, (p) must be maximal.
Theorem: Every quotient of a Euclidean domain R by a prime ideal is a Euclidean domain.Since Euclidean domains are PIDs, and nonzero prime ideals (p) are maximal in PIDs, either (p) is zero or a maximal ideal. If (p) is zero then R/(p)≅R results in the same ring, which is a Euclidean domain. If (p) is maximal, then R/(p) is a field and therefore a Euclidean domain.
In this section, we revisit the relationship between rings and polynomial rings defined over them.
We already have enough to prove an interesting fact about polynomial rings that are also PIDs:
Theorem: If R[x] is a PID, then R is a field.
Corollary: If R[x] is a PID, then it is also a Euclidean domain.This is because polynomial rings defined over a field are Euclidean domains.
Now just like we did in rings in general, we call a non-constant polynomial f∈R[x] irreducible if it can’t be factored into two non-units. However, there are a number of facts about polynomials that help us determine whether a polynomial is irreducible.
First, since x has no inverse, the units in a polynomial ring over an integral domain can only be the units of R, viewed as constant (deg 0) polynomials.
Second, the inability to factor into two non-units is a hard property to prove in general, but there is a shortcut for polynomial rings:
Theorem: For polynomials with degree ≥2, irreducibility implies having no roots.According to the factor theorem, having a root a means f=(x−a)q, where deg q is at least 1 since deg f is at least 2. But that means you just factored into two non-unit factors (x−a) and q, meaning that f is reducible.
This means that (for degree ≥2) irreducibility is one way of saying a polynomial has no roots. Take the contrapositive, and you get a way to check reducibility: if such a polynomial has a root, then the polynomial is reducible.
The converse is not true in general, but it is true for degree 2 and 3 polynomials:
Theorem: For degree 2,3 polynomials, having no roots implies irreducibility.When we factor a polynomial f into a product ab, we have deg f=deg a+deg b. If deg f is 2 or 3, any factorization into two nonconstant factors must include a linear (deg 1) factor, which by the factor theorem corresponds to a root. So if f has no roots, one of deg a and deg b must be 0, i.e. a unit factor, meaning f is irreducible.
Otherwise, determining irreducibility in polynomial rings R[x] can be largely reduced to finding roots. For instance, x2+1 has no roots in R but does in C, so it’s irreducible in R[x] but reducible in C[x].
If we’re in Q[x] or Z[x] we could think about using the Rational Roots Theorem to classify all the rational roots of the polynomial. However, finding these roots is also an enumerative task, and we’d like to avoid that.
There is a theorem that leads to fast irreducibility checking for polynomials in Z[x], and it relies on the reduction homomorphism, which applies mod p (p prime) to the coefficients of an integer polynomial f∈Z[x]. This gives you the reduction of f modulo p, written fˉ. (Formally, the map Z[x]→Zp[x] sends f=∑iaixi to fˉ=∑iaˉixi, where aˉi=ai mod p.)
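On coefficient lists, the reduction homomorphism is a one-liner (a sketch; the name `reduce_mod_p` and the lowest-degree-first list convention are my own):

```python
def reduce_mod_p(f, p):
    """Reduction homomorphism Z[x] -> Z_p[x], applied to a lowest-degree-first
    coefficient list."""
    return [c % p for c in f]

# x^3 + 4x^2 + 6x + 2 reduces mod 3 to x^3 + x^2 + 2:
assert reduce_mod_p([2, 6, 4, 1], 3) == [2, 0, 1, 1]
```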
Modular Irreducibility Test: For a nonzero polynomial f∈Z[x], suppose there is a prime p where- p doesn’t divide the leading coefficient of f
- reduction fˉ mod p is irreducible in Zp[x].
Then f is irreducible in Q[x].
- Towards contradiction, assume f is not irreducible in Q[x]. Then there is a proper factorization f=gh where deg g<deg f, and by Gauss' lemma we can take g,h∈Z[x].
- We know that deg gˉ≤deg g since it’s possible that the leading coefficient of g gets mapped to 0, decreasing its degree. This isn’t possible for fˉ since it’s given that p doesn’t divide the leading coefficient of f. So overall, we have deg gˉ≤deg g<deg f=deg fˉ
- Similarly for h, deg hˉ≤deg h<deg f=deg fˉ
- But since f=gh, we know fˉ=gˉhˉ where deg gˉ<deg fˉ and deg hˉ<deg fˉ, implying there is a proper factorization of fˉ in Zp[x]. But this contradicts our second assumption.
Example: x3+4x2+6x+2 is irreducible in Q[x].The rational roots theorem works, but reduction mod 3 is much easier. Reduction becomes x3+x2+2 in Z3[x]. Trying each of x=0,1,2 shows that this has no root, therefore irreducible, therefore the original polynomial is irreducible.
Example: x4+2x3+2x2−x+1 is irreducible in Q[x].Reduction mod 2 gives x4+x+1 in Z2[x]. Trying each of x=0,1 shows that this has no root, so to be reducible it would have to factor into two quadratics with no root. The only quadratic with no root in Z2[x] is x2+x+1 (Example 5), so that must be the factor. But (x2+x+1)2=x4+x2+1 in Z2[x] (the cross terms vanish mod 2), which is not x4+x+1. So there is no proper factorization of x4+x+1 in Z2[x], so x4+2x3+2x2−x+1 is irreducible in Q[x].
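Both examples can be cross-checked by a brute-force search for factorizations in Zp[x] (a naive exponential-time sketch; polynomials are coefficient lists, lowest degree first, with leading coefficient nonzero mod p, and the helper names are my own):

```python
from itertools import product

def poly_mul(f, g, p):
    out = [0] * (len(f) + len(g) - 1)
    for i, a in enumerate(f):
        for j, b in enumerate(g):
            out[i + j] = (out[i + j] + a * b) % p
    return out

def is_irreducible(f, p):
    """Brute force: a nonconstant f in Z_p[x] (lowest-degree-first coefficients,
    leading coefficient nonzero mod p) is irreducible iff no pair of nonconstant
    polynomials multiplies to it. Exponential time -- only for tiny p and degree."""
    f = [c % p for c in f]
    n = len(f) - 1  # degree
    for d in range(1, n // 2 + 1):  # degree of the smaller factor
        for g in product(range(p), repeat=d + 1):
            if g[-1] == 0:  # enforce exact degree d
                continue
            for h in product(range(p), repeat=n - d + 1):
                if h[-1] == 0:
                    continue
                if poly_mul(list(g), list(h), p) == f:
                    return False
    return True

# x^3 + 4x^2 + 6x + 2 reduces mod 3 (inside is_irreducible, via % p)
# to x^3 + x^2 + 2, which is irreducible in Z_3[x]:
assert is_irreducible([2, 6, 4, 1], 3)
# x^4 + 2x^3 + 2x^2 - x + 1 reduces mod 2 to an irreducible quartic in Z_2[x]:
assert is_irreducible([1, -1, 2, 2, 1], 2)
# x^2 + 1 is reducible mod 2: it equals (x + 1)^2 there.
assert not is_irreducible([1, 0, 1], 2)
```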
In this section, we draw an equivalence between irreducibility in Z[x] and in Q[x].
In Z[x], we can look at the GCD of every coefficient of a polynomial. For instance, if f=3x3+6x+9, then let c(f)=gcd(3,6,9)=3. c(f) is the content of f. In general, content is defined for any ring that defines a GCD, including UFDs.
Gauss’ Lemma: In a UFD, c(fg)∼c(f)c(g) for nonzero polynomials f,g.- If ai and bj are the coefficients of f and g respectively, then every coefficient of fg is some sum ∑aibj.
- But c(fg), being the GCD of every coefficient ∑aibj, contains exactly the irreducible factors common to all aibj.
- In a UFD, factorization is unique (up to associates), so containing all irreducible factors common to all aibj implies containing all irreducible factors common to all ai (=c(f)) along with all irreducible factors common to all bj (=c(g)).
- This implies c(fg) and c(f)c(g) have the same unique factorization up to associates, and are therefore associate.
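Gauss' lemma can be spot-checked on integer polynomials (the polynomials f, g below are my own examples; coefficient lists are lowest degree first):

```python
from functools import reduce
from math import gcd

def content(f):
    """Content of an integer polynomial (lowest-degree-first coefficient list):
    the GCD of its coefficients."""
    return reduce(gcd, (abs(c) for c in f))

def poly_mul(f, g):
    out = [0] * (len(f) + len(g) - 1)
    for i, a in enumerate(f):
        for j, b in enumerate(g):
            out[i + j] += a * b
    return out

f = [3, 6, 9]  # 3 + 6x + 9x^2, content 3
g = [2, 4]     # 2 + 4x, content 2
assert content(f) == 3 and content(g) == 2
# Gauss' lemma: c(fg) = c(f) * c(g) (up to associates, i.e. up to sign in Z):
assert content(poly_mul(f, g)) == content(f) * content(g) == 6
```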
If c(f)∼1 (i.e. is a unit) then call f a primitive polynomial.
Corollary: In a UFD, products of primitive polynomials are primitive.
Lemma: Any polynomial ring R[x] over a UFD R is a UFD.- We need to prove that (1) the ACCP holds in R[x] and (2) all irreducible elements of R[x] are prime.
- Since R is a UFD, it satisfies ACCP, and therefore R[x] satisfies ACCP.
- WTS irreducibles are primes. Say we have some irreducible p∈R. Then, note that the quotient R[x]/(p), polynomials where p is sent to 0, is isomorphic to (R/(p))[x], polynomials whose coefficients are taken mod p.
- Then for every irreducible p∈R: p is prime in R (since R is a UFD), so R/(p) is an integral domain, and so is (R/(p))[x]≅R[x]/(p). Therefore (p) is a prime ideal in R[x], meaning p remains prime as an element of R[x]. (The non-constant irreducibles of R[x] are handled by passing to the field of fractions, as we do later in this exploration.)
Theorem: R[x] is a UFD iff R is a UFD.- This requires proving the converse of the above: R[x] is a UFD implies R is a UFD.
- Take any constant polynomial c∈R[x]. Since R[x] is a UFD, there is a unique factorization of c into irreducibles in R[x]. Since for c=ab we must have deg c=deg a+deg b, and deg c=0, the degree of every factor in the factorization of c must be 0. This implies that any factorization of c involves only constant polynomials ∈R, and therefore if it’s unique in R[x], it’s unique in R.
- Then we need only prove that the units of R[x] are the units of R, and that the elements of R irreducible in R[x] are also irreducible in R.
- If r∈R[x] is a unit, then rr−1=1 by the same degree argument means both r and r−1 must be constant polynomials, and therefore units in R.
- Any a∈R reducible in R is reducible in R[x]. Taking the contrapositive, any a∈R irreducible in R[x] is irreducible in R.
- Therefore, the unique factorization of c into irreducibles in R[x] (up to associates in R[x]) is also a unique factorization of c into irreducibles in R (up to associates in R). Since this is true for every c∈R, R is a UFD.
Now let’s get into irreducibility in Q and Z.
Imagine taking all the nonzero non-units a in an integral domain R and making them units by adjoining all the elements a−1. (You can't do this when a is a zero divisor: if ab=0 for some b≠0, then inverting a would force b=a−1ab=0, a contradiction — which is why we require an integral domain.) The result is a field, since every nonzero element is now a unit. We call this the field of fractions of R. The representative example is that Q is isomorphic to the field of fractions of Z.
Lemma: Taking the field of fractions preserves GCDs.- WTS if gcd(a,b)=c in R, then gcd(a/1,b/1)=c/1 in F.
- Take any common divisor d/f of a/1,b/1 in F. Since F is a field, we can consider numerators and denominators separately. The numerators imply d∣a and d∣b in R, and since 1 has only the divisor 1, the denominators imply f=1.
- WTS d/1 divides the GCD c/1. Since d is a common divisor of a,b in R, and c is the GCD of a,b in R, d divides c. But then d/1 divides c/1. Therefore, c/1 is also the GCD in F.
Corollary: The content of polynomials in R[x] is the same as the content of polynomials in F[x], where F is the field of fractions of R. In particular, if a polynomial is primitive in R[x], then it is primitive in F[x].
Theorem: If F is the field of fractions of the UFD R, then the factorization for f∈R[x] is the same as its factorization in F[x], but only when f is primitive.- (→) Let f be a primitive polynomial in R[x]. By Gauss’ lemma, which states c(ab)∼c(a)c(b), any factors fi of f are also primitive. Since primitive polynomials in R[x] are primitive in F[x], fi are primitive in F[x]. That means there are no factors to be pulled out of each fi in F[x], implying the two factorizations are the same (up to associates).
- (←) Let f be a primitive polynomial in F[x]. Write any factorization f=(a/b)⋅(c/d) where a,c∈R[x] and b,d∈R are nonzero. Then bd⋅f=ac, and this equation also holds in R[x]. Since f is primitive, ac is also primitive (by Gauss' lemma), and therefore bd must be a unit. Therefore f=(bd)−1ac is a factorization in R[x], and the factorizations are the same (up to associates).
Corollary: Primitive polynomials are reducible in R[x] iff they are reducible in F[x], since they have the same factorization.
Corollary: Primitive polynomials are irreducible in R[x] iff they are irreducible in F[x].
Corollary: A primitive polynomial f is irreducible in Z[x] iff f is irreducible in Q[x].Recall that the representative example of fields of fractions is Q being the field of fractions of Z.
This result leads to a very useful test for irreducibility in Q[x].
Eisenstein Criterion: For a polynomial f∈Z[x] of degree ≥1, suppose there is a prime p where- p divides all coefficients of f, except
- p does not divide the leading coefficient, and
- p2 does not divide the constant coefficient.
Then f is irreducible in Q[x].
- First, we prove that if p divides all coefficients of f∈Z[x] except the leading coefficient, then f being reducible in Z[x] implies p2 must divide the constant coefficient.
- If every coefficient of f but the leading coefficient has a factor p, then take the reduction mod p to get fˉ=axn∈Zp[x].
- Since f is reducible in Z[x], it is reducible and therefore has a proper factorization in Zp[x].
- p is prime, so Zp[x] is a UFD. (proof) Since p doesn't divide the leading coefficient of f, the two factors keep their degrees under reduction, and since x is irreducible in Zp[x], the proper factorization of axn must look like axn=(bxi)(cxj) with i,j≥1.
- Since the factors bxi,cxj are zero in all their non-leading coefficients, back in Z[x] both factors of f must have every non-leading coefficient divisible by p — in particular, both constant coefficients.
- But this implies that the constant coefficient of axn=(bxi)(cxj) in Z[x] has a factor p2.
- Take the contrapositive: if the constant coefficient has no factor p2, then f is irreducible in Z[x].
- Then use Gauss’ Lemma: f irreducible over Z[x] iff f irreducible over Q[x].
Corollary: f(x) is irreducible iff f(x+a) is irreducible.- The shift x↦x+a is an automorphism of R[x] (its inverse is x↦x−a), so it maps units to units and factorizations to factorizations, preserving irreducibility.
- Combine with Eisenstein: if the criterion holds for f(x+a) for some shift a, then the original polynomial f(x) is irreducible.
Corollary: xn±p (for any n≥1 and prime p) is always irreducible by Eisenstein.
Here are a couple of examples of applying the Eisenstein criterion on various polynomials.
Example: 2x5+27x3−18x+12 is irreducible in Q[x]Eisenstein criterion with p=3.
Example: Q[x] contains an irreducible polynomial of every degree n≥1.This polynomial is xn−2, which is irreducible by the Eisenstein criterion with p=2.
Example: For a prime p, xn±p is always irreducible, except in characteristic p.Apply the Eisenstein criterion with the prime p: p divides every non-leading coefficient (each is 0 or ±p), p does not divide the leading coefficient 1, and p2 does not divide the constant coefficient ±p. In characteristic p, however, xn±p=xn, which is reducible for n≥2.
Example: The pth cyclotomic polynomial Φp=xp−1+xp−2+⋯+x+1=(xp−1)/(x−1) is irreducible (for prime p).Replace x with x+1 and apply the binomial theorem: Φp(x+1)=((x+1)p−1)/x=∑k=1p(kp)xk−1. The leading coefficient is (pp)=1, the constant coefficient is (1p)=p, and p divides every binomial coefficient (kp) for 0<k<p, while p2 does not divide the constant p. Eisenstein with p then shows Φp(x+1), and hence Φp, is irreducible.
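The substitution argument can be verified numerically: the coefficients of Φp(x+1)=((x+1)p−1)/x are binomial coefficients, which satisfy Eisenstein at p (the choice p=7 and the helper name are my own):

```python
from math import comb

def shifted_cyclotomic_coeffs(p):
    """Coefficients (lowest degree first) of Phi_p(x+1) = ((x+1)^p - 1) / x.
    Expanding, the coefficient of x^(k-1) is C(p, k) for k = 1..p."""
    return [comb(p, k) for k in range(1, p + 1)]

p = 7
coeffs = shifted_cyclotomic_coeffs(p)
assert coeffs[0] == p and coeffs[-1] == 1    # constant term p, leading term 1
assert all(c % p == 0 for c in coeffs[:-1])  # p divides all non-leading coefficients
assert coeffs[0] % (p * p) != 0              # p^2 does not divide the constant term
```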
TODO prove that every polynomial ring over any field has at least one nonconstant irreducible polynomial
Summary
In this exploration we touched on a number of ways to determine reducibility and irreducibility for polynomial rings. Here we introduce a grab-bag of methods:
Proving f not irreducible:
- If there’s no constant term, then 0 is a root.
- If the coefficients sum to 0, then 1 is a root.
- (in Zp[x]) If there’s a factorization in Z[x] then there is a factorization in its quotient Zp[x].
- (in Q[x]) Rational Roots Theorem. If you find a rational root then not irreducible over Q. (This does not help in Zp[x] – every nonzero element of Zp is a unit, so the candidate list is everything.)
Proving f irreducible:
- (general) Irreducible in Zp[x] (for a prime p not dividing the leading coefficient) implies irreducible in Q[x], which for primitive polynomials implies irreducible in Z[x].
- (in Q[x]) Modular Irreducibility Test. If coefficients are integers, take reduction mod p (p prime) where p doesn’t divide leading coefficient. Try all elements in Zp[x] to see if they are roots – if not, it’s irreducible in Zp[x] and therefore irreducible in Q[x].
- (in Q[x]) Eisenstein criterion. If there is a prime p dividing all coefficients such that p doesn’t divide the leading coefficient and p2 doesn’t divide the constant term, then irreducible in Q[x].
- (in Q[x]) Reduce to Eisenstein criterion by replacing x with x+a. For example: x2+1 is irreducible via x↦x+1 giving x2+2x+2 (Eisenstein with p=2). x2+x+1 is irreducible via x↦x+1 giving x2+3x+3 (Eisenstein with p=3). x4+1 is irreducible via x↦x+1 giving x4+4x3+6x2+4x+2 (Eisenstein with p=2).
- (in Z[x]) Gauss’ Lemma. If you can show Eisenstein criterion for irreducibility in Q[x], then assuming the polynomial is primitive (can’t factor out a unit) the polynomial is also irreducible in Z[x].
- Last resort: try every possible factorization (e.g. given deg 5, try deg 4 and deg 1, then deg 3 and deg 2), letting variables stand in for coefficients. If there is no possible assignment of variables in F such that f=gh, then no possible factorization.
Both:
- Degree 2 or 3? Then having no roots is the same as irreducibility. (for quadratics: use discriminant)
We end with a classification of polynomial rings as integral domains, based on all we’ve proved.
-
December 6, 2023.
Exploration 5.1: Norm
Questions:
- What is a norm?
Recall that Euclidean domains have a division algorithm. This division algorithm required a notion of degree, which is defined as the degree of the polynomial in the case of polynomial rings. In non-polynomial rings, this notion of degree generalizes to a function R∖{0}→N mapping each nonzero element to a natural number, called the Euclidean norm.
In this exploration, we'll be looking at the Euclidean domain Z[i]. This is a fine example of a Euclidean domain that is not a field – it contains 2 but not 1/2, so it lacks multiplicative inverses and is not a field, yet it admits a division algorithm.
Theorem: The Gaussian integers Z[i] are a Euclidean domain.- We can define an explicit division for Gaussian integers, thus proving the condition for being a Euclidean domain.
- We can just use division in the complex numbers C, keeping a remainder. Given two Gaussian integers a+bi and c+di, compute (a+bi)/(c+di) as we do in C:
(a+bi)/(c+di) = (a+bi)(c−di)/((c+di)(c−di)) = (ac+bd + (bc−ad)i)/(c^2+d^2) = (ac+bd)/(c^2+d^2) + ((bc−ad)/(c^2+d^2))i
- The two fractions here are rational numbers, so round each to the nearest integer to get the quotient q1+q2i, where q1 = round((ac+bd)/(c^2+d^2)) and q2 = round((bc−ad)/(c^2+d^2)). Due to the rounding, each fraction differs from its nearest integer by at most 1/2. Define the remainder by r1+r2i = (a+bi) − (q1+q2i)(c+di), so that a+bi = (q1+q2i)(c+di) + (r1+r2i). This completes the division algorithm.
This division algorithm differs from the one for polynomial rings — instead of enforcing that the remainder has degree less than that of the divisor, we enforce that the norm of the remainder is less than the norm of c+di. If we define the Euclidean norm N(⋅) in the Gaussian integers to be N(a+bi) = a^2+b^2, then since N is multiplicative and (r1+r2i)/(c+di) is exactly the rounding error (whose real and imaginary parts are each at most 1/2 in absolute value), we can bound the norm of the remainder r1+r2i:
N(r1+r2i) = N(c+di)·N((r1+r2i)/(c+di)) ≤ N(c+di)·((1/2)^2 + (1/2)^2) = N(c+di)/2 < N(c+di)
This shows N(r1+r2i) < N(c+di), which is the generalized form of deg r < deg g in the polynomial version. Indeed, the degree is the Euclidean norm in polynomial rings.
So in general — in particular for non-polynomial rings — a Euclidean domain defines a Euclidean norm N: R∖{0}→N, and admits a division algorithm: dividing a by nonzero b yields a quotient q and remainder r with a = qb + r, where either r = 0 or N(r) < N(b).
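The division algorithm above is short to implement. Here is a Python sketch using exact rational arithmetic for the rounding step, representing a Gaussian integer as an integer pair (`gauss_divmod` is my own name):

```python
from fractions import Fraction

def gauss_divmod(a, b):
    """Divide Gaussian integer a by nonzero b (as (real, imag) integer pairs),
    returning (q, r) with a = q*b + r and N(r) < N(b)."""
    (ar, ai), (br, bi) = a, b
    n = br * br + bi * bi                  # N(b) = c^2 + d^2
    x = Fraction(ar * br + ai * bi, n)     # real part of a/b: (ac+bd)/(c^2+d^2)
    y = Fraction(ai * br - ar * bi, n)     # imag part of a/b: (bc-ad)/(c^2+d^2)
    q1, q2 = round(x), round(y)            # nearest integers
    # remainder r = a - q*b, where q*b = (q1 + q2 i)(br + bi i)
    r1 = ar - (q1 * br - q2 * bi)
    r2 = ai - (q1 * bi + q2 * br)
    return (q1, q2), (r1, r2)

q, r = gauss_divmod((27, 23), (8, 1))
assert q == (4, 2) and r == (-3, 3)
assert r[0]**2 + r[1]**2 < 8**2 + 1**2   # N(r) < N(b)
```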
TODO proof that norm is all that is required to define a division algorithm
In this section, we explore how to define PIDs via norms.
We just saw that an integral domain R is a Euclidean domain if and only if it admits a Euclidean norm. Can we extend this kind of definition to other types of domains?
The answer is yes: for PIDs, we can similarly define such a norm.
We will use this to prove that the Gaussian integers are a PID, without using the fact that it is a Euclidean domain.
Let N(a+bi) = a^2+b^2, a function R∖{0}→N.
Previously we only discussed irreducibility in the context of polynomial rings. However, elements in non-polynomial rings can also be irreducible – we only require that the element not be factorable into two nonunits.
-
December 10, 2023.
Exploration 6: Finite fields
Questions:
- How do we work with positive characteristic?
- What do finite fields look like?
Recall that the characteristic of every integral domain must be either zero or prime. Since all fields are integral domains, each field F has either char F=0 (such fields are always infinite) or char F=p for some prime p. Every finite field falls in the latter case (though infinite fields of characteristic p exist too).
Let’s begin with the finite fields. By the end of this exploration, we will have classified all of the finite fields.
In this section, we discuss the ring of integers mod n.
All we know so far about a finite field is that it must be of some prime characteristic p. Let’s first study the case where its order is exactly p — in other words, let’s study the ring of integers mod p.
First we prove that the integers mod p form a field:
Theorem: If p is prime, Zp is a field.- Recall that quotienting a Euclidean domain by a prime ideal results in a Euclidean domain. But Zp≅Z/(p) where Z is a Euclidean domain (since integer division exists) and (p) is a prime ideal, therefore Zp is a Euclidean domain as well.
- To show that Zp is also a field, recall Bezout's identity: in a Euclidean domain, coprime elements n,m satisfy 1=an+bm for some a,b. In Z, the prime p is coprime to every positive integer k less than p, so 1=ap+bk for some integers a,b. Reducing mod p, since [p]=[0] in Zp, we get [1]=[a][0]+[b][k]=[b][k], which makes every [k] for 1≤k<p a unit in Zp. Therefore every nonzero element in Zp is a unit, making Zp a field.
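The Bezout computation is exactly the extended Euclidean algorithm, so inverses in Zp can be computed directly. A small Python sketch (function names are my own):

```python
def ext_gcd(a, b):
    """Return (g, x, y) with a*x + b*y == g == gcd(a, b)."""
    if b == 0:
        return a, 1, 0
    g, x, y = ext_gcd(b, a % b)
    return g, y, x - (a // b) * y

def inverse_mod(k, p):
    """Bezout gives 1 = a*p + b*k, so [b][k] = [1] in Z_p."""
    g, a, b = ext_gcd(p, k)
    assert g == 1  # p prime and 1 <= k < p, so p and k are coprime
    return b % p

p = 7
assert all(k * inverse_mod(k, p) % p == 1 for k in range(1, p))
```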
Why study Zp when this is about finite fields? It turns out every finite field (with prime characteristic p) contains Zp:
Theorem: Every prime characteristic ring contains a copy of Zp, the integers mod p.- Every ring contains the elements {1,(1+1),(1+1+1),…}, but for prime characteristic rings where p⋅1=0, this only goes up to adding p−1 copies of 1.
- These are exactly the elements that form Zp, the integers mod p, so every prime characteristic ring contains Zp in the sense that you can define an isomorphism between those elements and Zp.
In this section, we study the effects of prime characteristic.
Rings of prime characteristic have some interesting properties. For instance, recall the binomial theorem true for all rings: for every a,b∈R and every nonnegative integer n, we have
(a+b)^n = Σ_{k=0}^{n} (n choose k) a^k b^{n−k}
In a ring of prime characteristic, this sum simplifies significantly to:
(a+b)^p = a^p + b^p
Theorem: (a+b)^p = a^p + b^p for all a,b in a ring of prime characteristic p.- By the binomial theorem, (a+b)^p = Σ_{k=0}^{p} (p choose k) a^k b^{p−k}. Let's focus on the coefficients (p choose k).
- (p choose k) = p!/(k!(p−k)!) always includes a factor p in the numerator, but p (being prime) is not a factor of the denominator unless k=0 or k=p.
- Thus for all other values of k, the coefficient (p choose k) is divisible by p, and therefore vanishes in a ring of characteristic p. For the special cases k=0 and k=p, the coefficient (p choose k) is equal to 1.
- Therefore (a+b)^p = a^p + 0 + … + 0 + b^p = a^p + b^p.
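We can spot-check this "freshman's dream" identity in Zp with a quick brute force:

```python
p = 7
for a in range(p):
    for b in range(p):
        # (a+b)^p = a^p + b^p holds in Z_p
        assert pow(a + b, p, p) == (pow(a, p, p) + pow(b, p, p)) % p
```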
This theorem is rather important, because it makes the map a↦a^p look awfully like a homomorphism on rings of prime characteristic p. Which it is:
Theorem: The Frobenius endomorphism a↦a^p is an endomorphism on a ring of prime characteristic p.- Let ϕ be the map a↦a^p. By the previous proof we observe ϕ(a+b)=ϕ(a)+ϕ(b).
- To be a homomorphism, ϕ must also preserve unity, additive inverses, and multiplication. Unity is immediate: ϕ(1) = 1^p = 1.
- Preserving additive inverses is easy for odd primes p, because ϕ(−a) = (−a)^p = −a^p = −ϕ(a). For the remaining prime p=2, recall that every element is its own additive inverse in characteristic 2, so ϕ(−a) = ϕ(a) = −ϕ(a).
- Preserving multiplication is straightforward (recall our rings are commutative): ϕ(ab) = (ab)^p = a^p b^p = ϕ(a)ϕ(b).
- Thus ϕ is a homomorphism from the ring to itself, i.e. an endomorphism.
Note that Fermat's little theorem states that for any integer a, we have a^p ≡ a mod p. So specifically for the integers mod p, Zp, the Frobenius endomorphism a↦a^p is actually the identity automorphism, mapping every a to itself. We'll make use of this fact soon.
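In code, Frobenius on Zp is just a ↦ pow(a, p, p), and Fermat's little theorem says this map is the identity:

```python
p = 11
frobenius = lambda a: pow(a, p, p)
# On Z_p itself, Frobenius is the identity map (Fermat's little theorem)
assert all(frobenius(a) == a for a in range(p))
```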
In this section, we classify the finite fields.
Now let’s use our knowledge of the properties of prime characteristic to study all finite fields.
Let F[x] be a polynomial ring over a finite field F.
Theorem: Every element of Zp is a root of the polynomial x^p − x ∈ Zp[x].- Fermat's little theorem implies that for any integer a not divisible by p (i.e. any nonzero element of Zp, since p is prime), we have a^{p−1} − 1 ≡ 0 mod p.
- In other words, the polynomial x^{p−1} − 1 has every nonzero element of Zp as a root. We can add the zero element as a root if we multiply by x to get x^p − x.
- Thus x^p − x ∈ Zp[x] is a degree p polynomial with p distinct roots: exactly the elements of Zp.
TODO extend to arbitrary finite field F.
Now consider the quotient F[x]/(f) for some polynomial f∈F[x]. We actually already know two facts about F[x]/(f):
Theorem: F[x]/(f) is a field iff f is irreducible.
Theorem: F[x]/(f) is finite with |F|^{deg f} elements.F[x]/(f) consists of cosets [g] whose representatives are polynomials g∈F[x]. Earlier we proved that every coset has a representative of degree less than deg f, which therefore has at most deg f coefficients, each with |F| possible values. This implies that there are |F|^{deg f} distinct cosets in F[x]/(f).
Corollary: Given an irreducible degree n polynomial f∈F[x], if F is a finite field of prime order p, then F[x]/(f) is a finite field of prime power order p^n.
So we can create fields of prime power order pn by taking the quotient Zp[x]/(f), where deg f=n.
One way to visualize a finite field created in this manner is by considering its elements as polynomials, whose coefficients form a tuple. The elements of the prime order finite field Zp are constant polynomials represented by 1-tuples: for example, Z5 is just the set {(0),(1),(2),(3),(4)}. Quotienting Z5[x] by a degree 5 polynomial irreducible over Z5 — for example x^5 − x − 1 (note that Eisenstein is unavailable here, since Z5 has no primes; but polynomials of the form x^p − x − a with a ≠ 0 are always irreducible over Zp) — gives us a quotient field whose cosets are represented by polynomials of degree at most 4, as shown earlier. Such polynomials ax^4+bx^3+cx^2+dx+e can be represented by 5-tuples (a,b,c,d,e), where there are |Z5|=5 choices for each component.
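To make the tuple picture concrete, here is a Python sketch of multiplication in Z5[x]/(x^5 − x − 1), with elements stored as 5-tuples of coefficients, low degree first (`polymul_mod` is my own helper, and I'm taking on faith that x^5 − x − 1 is irreducible over Z5):

```python
p = 5
f = [-1, -1, 0, 0, 0, 1]  # x^5 - x - 1, coefficients low-to-high (monic)

def polymul_mod(a, b):
    """Multiply two elements of Z_5[x]/(f), each a list of 5 coefficients."""
    n = len(f) - 1
    prod = [0] * (2 * n - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            prod[i + j] = (prod[i + j] + ai * bj) % p
    # reduce using x^n = -(f[0] + f[1] x + ... + f[n-1] x^(n-1)), since f is monic
    for k in range(len(prod) - 1, n - 1, -1):
        c, prod[k] = prod[k], 0
        for t in range(n):
            prod[k - n + t] = (prod[k - n + t] - c * f[t]) % p
    return prod[:n]

# x * x^4 = x^5, which reduces to x + 1 in the quotient:
assert polymul_mod([0, 1, 0, 0, 0], [0, 0, 0, 0, 1]) == [1, 1, 0, 0, 0]
```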
In this section, we prove the uniqueness of the finite fields up to isomorphism.
We just established that a finite field of prime power order pn exists for every prime p and n≥1, and that we can construct said field as Zp[x]/(f) where f is a degree n irreducible polynomial over Zp. It is a well-established fact that its multiplicative group (consisting of all nonzero elements) is cyclic:
Theorem: Given F a finite field of prime power order p^n, its multiplicative group F^× is cyclic.F^×, like all finite abelian groups, decomposes into cyclic groups of prime power order: F^× ≅ C_{q1} × C_{q2} × … × C_{qm}. If two of the orders q_i shared a prime factor ℓ, then F^× would contain at least ℓ^2 solutions to x^ℓ = 1 — but the polynomial x^ℓ − 1 has at most ℓ roots over a field. So the orders q1, q2, …, qm are pairwise coprime, and their direct product is cyclic.
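For intuition, a quick search for a generator of the cyclic group Z13^× (the helper `order` is my own):

```python
p = 13

def order(a):
    """Multiplicative order of nonzero a in Z_p."""
    k, x = 1, a
    while x != 1:
        x = x * a % p
        k += 1
    return k

# Z_13^x is cyclic of order 12; 2 turns out to be a generator
assert order(2) == p - 1
assert sorted(pow(2, k, p) for k in range(p - 1)) == list(range(1, p))
```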
This has a direct implication about the fields of prime power order.
Theorem: All fields of prime power order p^n are unique up to isomorphism.- We just showed that for an irreducible degree n polynomial g∈Zp[x], the quotient Zp[x]/(g) is a finite field of order p^n. So finite fields of order p^n exist for every prime p and integer n≥1.
- If we can exhibit an isomorphism between two arbitrary fields of prime power order p^n, we are done. (Any nonzero field homomorphism between them would suffice, because field homomorphisms are injective and the domain and codomain have the same finite order p^n.)
- Consider the multiplicative group K^× of an arbitrary field K of prime power order p^n. The order of K^× is exactly p^n − 1. By Lagrange's theorem, the order of every element a of K^× must divide the order of K^×, so a^{p^n − 1} = 1, which (multiplying through by a) can be rearranged to a^{p^n} − a = 0.
- Then a^{p^n} − a = 0 holds for every a∈K^×. But since 0^{p^n} − 0 = 0, this extends to all a∈K.
- In other words, every element of K is a distinct root of the polynomial f = x^{p^n} − x over the prime subfield Zp ⊆ K. Since |K| = p^n = deg f, the elements of K are exactly the roots of f — that is, f splits over K, and K is generated by the roots of f, so K is a splitting field of f over Zp.
- Since this holds for arbitrary fields of prime power order p^n, any two such fields K1, K2 are splitting fields of the same polynomial f over Zp. Splitting fields are unique up to isomorphism (a fact we prove at the end of these notes), so K1 ≅ K2 and we are done.
Since all fields of order pn are isomorphic to each other, this means that instead of writing the cumbersome term Zp[x]/(f), we can just refer to the field of prime power order q=pn, which is denoted Fq or sometimes GF(q).
Significantly, it turns out there are no other finite fields.
Theorem: Every finite ring not of prime power order is not a field.- If a finite ring R is not of prime power order, then the prime factorization of its order p1^{k1} p2^{k2} … pm^{km} must contain at least two distinct primes.
- Consider the principal ideals p_i^{k_i}R. Since the prime powers p_i^{k_i} are pairwise coprime, the Chinese Remainder Theorem can be used to show R ≅ R/p1^{k1}R × R/p2^{k2}R × … × R/pm^{km}R.
- But a direct product of two or more nontrivial rings contains zero divisors, and zero divisors cannot be units, so R contains nonzero nonunits and is therefore not a field.
Corollary: The only finite fields are the ones of prime power order.
In this section we describe primitive elements.
The fact that Fq^× is cyclic implies that every finite field Fq has a primitive element α: a generator of Fq^×.
Recall that we can adjoin elements of one ring to another ring of the same characteristic. What happens when we adjoin α to another finite field of the same characteristic?
Note that the finite fields of any given characteristic p have a subfield ordering:
Theorem: Fpn is a subfield of Fpm iff n∣m.The result follows pretty quickly after we prove a number theoretic lemma:
Lemma: n∣m iff pn−1∣pm−1.- (→) This is a straightforward calculation. nmpmpm−1pm−1pn−1∣m=kn=pkn=pkn−1=(pn−1)(pn(k−1)+pn(k−2)+…+pn+1)∣pm−1
- (←) WLOG assume n ≤ m. Then
gcd(p^m − 1, p^n − 1)
= gcd(p^{m−n}·p^n − 1, p^n − 1)
= gcd(p^{m−n}·p^n − 1 − (p^n − 1), p^n − 1)    [using gcd(a,b) = gcd(a−kb, b)]
= gcd(p^{m−n}·p^n − p^n, p^n − 1)
= gcd(p^n(p^{m−n} − 1), p^n − 1)
= gcd(p^{m−n} − 1, p^n − 1)    [because p^n and p^n − 1 are coprime]
Thus gcd(p^m − 1, p^n − 1) = gcd(p^{m−n} − 1, p^n − 1). Applying this recursively amounts to applying the Euclidean algorithm on the exponents (m,n), resulting in
gcd(p^m − 1, p^n − 1) = p^{gcd(m,n)} − 1
Since p^n − 1 ∣ p^m − 1 means p^n − 1 is the GCD of the two, the LHS becomes p^n − 1. Then:
p^n − 1 = p^{gcd(m,n)} − 1  ⟹  p^n = p^{gcd(m,n)}  ⟹  n = gcd(m,n)  ⟹  n ∣ m
as required.
Then:
- (→) By Lagrange’s theorem, the order of the subfield’s multiplicative group ∣Fpn×∣ divides that of the larger field ∣Fpm×∣. Thus pn−1∣pm−1, so n∣m by the lemma.
- (←) If n ∣ m, then
n ∣ m
⟹ p^n − 1 ∣ p^m − 1    [by the lemma]
⟹ x^{p^n − 1} − 1 ∣ x^{p^m − 1} − 1    [by the lemma's forward calculation, with x in place of p]
⟹ x^{p^m − 1} − 1 = f·(x^{p^n − 1} − 1)    [for some polynomial f∈Z[x]]
⟹ x^{p^m} − x = f·(x^{p^n} − x)    [multiplying both sides by x]
which shows that every root of x^{p^n} − x is a root of x^{p^m} − x. But since the roots of x^{p^n} − x are exactly F_{p^n}, this implies every element of F_{p^n} is in F_{p^m}, i.e. F_{p^n} is a subfield of F_{p^m}.
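The lemma (and the gcd identity driving its proof) is easy to spot-check numerically:

```python
from math import gcd

p = 3
rng = range(1, 8)
# n | m  iff  p^n - 1 | p^m - 1:
assert all((m % n == 0) == ((p**m - 1) % (p**n - 1) == 0) for n in rng for m in rng)
# the gcd identity from the proof: gcd(p^m - 1, p^n - 1) = p^gcd(m, n) - 1
assert all(gcd(p**m - 1, p**n - 1) == p**gcd(m, n) - 1 for n in rng for m in rng)
```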
Corollary: Suppose one of F_{p^n}, F_{p^m} is a subfield of the other (i.e. n ∣ m or m ∣ n). Adjoining the primitive element α of F_{p^n} to F_{p^m} results in the larger of the two finite fields.- If F_{p^n} is the larger, then F_{p^m} is a subfield of F_{p^n}. Since α generates F_{p^n}, the resulting field is F_{p^n}, which already includes F_{p^m}.
- If F_{p^m} is the larger, then F_{p^n} is a subfield of F_{p^m} and therefore α is already in F_{p^m}, so the resulting field is F_{p^m}.
Corollary: Adjoining two primitive elements α,β (whose fields are comparable as above) is the same as adjoining the one with larger order.This follows directly from the previous result. WLOG assume α generates the larger field. Then α generates β, thus you need only adjoin α.
-
December 11, 2023.
Exploration 7: Fields
Questions:
- TODO
Recall that we can categorize each field F by its characteristic: char F=0 or char F=p. Last time we discussed finite fields, which all have char F=p. Now we extend the discussion to fields of characteristic 0, which are always infinite.
Theorem: Every field of characteristic 0 contains the rationals Q.In characteristic 0, the rationals Q are generated by 1: first generate the integers Z via repeated addition and subtraction of 1 (no such sum vanishes, since the characteristic is 0), then treat each integer as a unit and take closure under products and inverses of every nonzero integer to get Q. Since every field contains 1, which generates Q, every field of characteristic 0 contains Q.
Now recall that constructing a finite field Fq (for some prime power q=p^n) is done by quotienting the polynomial ring Zp[x] (over Fp) by an irreducible polynomial ∈Zp[x] of degree n. We can do this with infinite fields as well. Let F be a field, and let f be an irreducible polynomial in F[x], so that (f) is maximal and therefore F[x]/(f) is a field. This exploration will be all about studying the characteristics of these quotient fields F[x]/(f).
In this section, we study quotient fields.
First of all, quotienting by (f) sends f∈F[x] to 0. Where does this quotient send x∈F[x]?
Therefore: for polynomials f irreducible in F[x], the quotient F[x]/(f) maps x to a root of f that need not exist in the field F.- Let's say we have f = x^2+1, which is irreducible in R[x]. Then the quotient R[x]/(f) sends f to the zero coset [0], meaning we have [x^2+1]=[0], which implies [x]^2+[1]=[0].
- This means that x gets mapped to an element [x] that is a root of the polynomial x^2+[1] (where the coefficients are cosets from the quotient R[x]/(f)).
- In summary, x^2+1 has no roots in R, but it does in R[x]/(f) (namely the coset [x]). The idea is that the act of adjoining x and then quotienting creates this root.
Since R[x]/(f) contains R as a subfield, R[x]/(f) is an extension of R. So just like with finite fields, our new field is “larger” in some sense, which we’ll learn how to measure later. The important point is that when we extend a field in this manner, we’re doing so by adding a new root of an irreducible polynomial f∈F[x] to F!
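Concretely, arithmetic in R[x]/(x^2+1) is just arithmetic on pairs (a, b) standing for the coset [a + bx], reduced by the rule [x]^2 = [−1] — which is exactly how the complex numbers work:

```python
# Multiplication in R[x]/(x^2 + 1): represent the coset [a + b x] as (a, b)
def mul(u, v):
    a, b = u
    c, d = v
    # (a + b x)(c + d x) = ac + (ad + bc) x + bd x^2, and x^2 = -1 in the quotient
    return (a * c - b * d, a * d + b * c)

i = (0, 1)                   # the coset [x]
assert mul(i, i) == (-1, 0)  # [x]^2 = [-1]: the quotient created a root of x^2 + 1
```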
In this section, we show another perspective for extending fields.
Recall that one way to turn an integral domain into a field is to adjoin units to it. That is, for every nonzero element a, adjoin an element a^{−1} so that a becomes a unit. This results in a field known as the field of fractions, the idea being that for every nonzero a in the domain, you adjoin 1/a, so now everything looks like a fraction.
The field of fractions gives us another way to extend fields. If we adjoin some new element α to an existing field F, and then take the field of fractions of the result, you get a new field, which we denote F(α). This operation is known as a simple extension of F by α. In this case, α is the generator of the simple extension.
So we have two ways of extending a field F: the first way is to quotient its polynomial ring by an irreducible polynomial, and the second way is to adjoin a new element and take the field of fractions. Surprisingly, these turn out to be equivalent. We'll first prove a very interesting lemma about simple extensions.
Lemma: If α is the root of any polynomial f∈F[x], then F[α]=F(α).WLOG we can assume f is irreducible in F[x]. This is because if f is reducible, α must be a root of at least one of the irreducible factors of f, and we can take that factor to be f instead.
F[α] is exactly what you get when you evaluate every polynomial F[x] at this new root α. To show that F[α] is a field, let g(α) be an arbitrary nonzero element of F[α], where g is some nonzero polynomial in F[x]. WTS g(α) is a unit.
- By the extended Euclidean algorithm, we have gcd(f,g)=af+bg for some a,b∈F[x].
- Since f is irreducible in F[x], and therefore prime, gcd(f,g) must be 1 or f. Because g(α) is nonzero, α is not a root of g, implying g doesn’t have a factor f (which does have α as a root.) Therefore gcd(f,g) must be 1.
- Using the assumption that α is a root of f, we get f(α)=0. Then apply the evaluation map at α to both sides:
gcd(f,g) = a·f + b·g
gcd(f,g)(α) = a(α)·f(α) + b(α)·g(α)
1 = a(α)·0 + b(α)·g(α)
1 = b(α)·g(α)
which shows that our arbitrary nonzero g(α) is a unit (with inverse b(α)).
Therefore every nonzero element of F[α] is a unit, and thus F[α] is its own field of fractions equal to F(α).
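As a tiny instance of the lemma: take F = Z2 and α a root of f = x^2 + x + 1 (irreducible over Z2 since it has no roots there). Every nonzero element of Z2[α] is a unit, so Z2[α] is already a field — the 4-element field. A brute-force Python check (representations are my own):

```python
p = 2
# Z_2[alpha] with alpha^2 = alpha + 1 (from alpha^2 + alpha + 1 = 0);
# represent a + b*alpha as the pair (a, b)
def mul(u, v):
    a, b = u
    c, d = v
    # (a + b alpha)(c + d alpha) = ac + (ad + bc) alpha + bd alpha^2
    #                            = (ac + bd) + (ad + bc + bd) alpha
    return ((a * c + b * d) % p, (a * d + b * c + b * d) % p)

nonzero = [(a, b) for a in range(p) for b in range(p) if (a, b) != (0, 0)]
# every nonzero element has a multiplicative inverse, so Z_2[alpha] is a field
assert all(any(mul(u, v) == (1, 0) for v in nonzero) for u in nonzero)
```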
Now we show that the two methods of extending a field — quotienting by an irreducible and doing a simple extension — are equivalent.
Theorem: If f is irreducible over F, the quotient F[x]/(f) is isomorphic to a simple extension F(α) where α is some root of f.- Define a kind of evaluation map g↦g(α) and call it φα:F[x]→F(α).
- The kernel of φα is every polynomial that evaluates to 0 at α. f is one such polynomial.
- Since f is irreducible, it generates a maximal ideal (f) of F[x], and (f) ⊆ ker φα. Since φα(1) = 1 ≠ 0, the kernel is a proper ideal, so by maximality the kernel of φα is exactly (f).
- As for the image of φα, note that evaluating all polynomials in F[x] at α results in exactly F[α], which by the above lemma is exactly F(α).
- Then by the first isomorphism theorem, which states F[x]/ker φα≅im φα, we have F[x]/(f)≅F(α).
Since there are two isomorphic ways to extend fields, in general, should we write the quotient field F[x]/(f) or should we write simple extension F(α)? A quotient field makes explicit that we add roots of f to the field, so implicitly the result has the root α of f. A simple extension makes explicit that we add α to the field, so implicitly we’ve added the roots of its polynomial f.
In fact we can ignore this detail, and simply write K/F or “K is a field extension of F” (not to be confused with the identical notation used for quotients). Unlike the above, this notation can express any size of field extension. Specifically, K/F could denote that K is a (perhaps infinite) composition of simple extensions of F, for example, K=F(α)(β)(γ)(δ).
The general definition of a field extension of F, written K/F, is any field K that contains F as a subfield. Since you can theoretically build K from F by continually taking simple extensions of elements in K not in F, every field extension can be expressed as a (perhaps infinite) composition of simple extensions.
Another notation is useful when we start getting into field extensions of field extensions (called towers): K⊇L⊇F. This notation pretty much only exists since K/L/F is ambiguous (is the slash quotienting or a field extension?). Like K/F, it doesn’t tell you how the extensions were obtained, but it does tell you about the existence of an intermediate extension L.
But for the purposes of this exploration where we are explicitly focusing on the structure of quotient fields, we’ll use the F[x]/(f) form throughout.
In this section, we explore the properties of the roots within the extended fields.
The lemma we proved earlier about F(α)=F[α] is actually quite foundational. In fact, we can prove a kind of converse:
Theorem: F(α)≅F[α] iff α is the root of some polynomial f∈F[x].- To show the forward direction, note that if F(α)≅F[α], then F[α] is a field, so in particular α^{−1} is an element of F[α], i.e. α^{−1} = g(α) for some polynomial g∈F[x].
- Since α^{−1}·α = 1, we have g(α)·α − 1 = 0.
- Then the polynomial f = gx − 1 ∈ F[x] has α as a root, by construction.
- The aforementioned lemma proves the backward direction.
This is a pretty important equivalence to have, so important that when α is the root of a polynomial in F[x], where F is a field, we say that α is algebraic over F.
Corollary: F(α)=F[α] iff α is algebraic over F.
Whenever we have an element α that is algebraic over F, we know that the simple extension F(α), by the theorem we proved, results in a field isomorphic to F[x]/(f) for some irreducible polynomial f with root α.
Consider the relationship between the root α and its irreducible polynomial f. It's easy to identify the root α given f — α is exactly the coset [x] in F[x]/(f). But given α, can we identify an irreducible polynomial f where α is a root?
For this, we need look no further than the kernel of the evaluation map φα, which consists of all the polynomials g∈F[x] for which g(α)=0. In fact, since F[x] is a Euclidean domain and therefore a PID, the ideal ker φα must be principal, i.e. generated by some polynomial f. If we can identify f, then we have identified a polynomial with α as a root. We can deduce a couple things about f. First of all, f is irreducible:
Theorem: Over a field F, the polynomial f∈F[x] generating ker φα is irreducible.Suppose f = gh for polynomials g,h∈F[x] of smaller degree. Applying φα gives 0 = φα(f) = g(α)h(α). Since F(α) (being a field) has no zero divisors, g(α)=0 or h(α)=0, so g or h lies in ker φα = (f); but a nonzero multiple of f cannot have smaller degree than f, a contradiction. Therefore f is irreducible.
Second, f is unique up to associates:
Theorem: Over a field F, the polynomial f∈F[x] generating ker φα is unique (up to associates).If g also generates ker φα, then f and g divide each other and therefore differ by a unit. Then f is unique up to associates. (If we make f monic, then it is unique, period.)
Corollary: Over a field F, the monic polynomial f∈F[x] generating ker φα is unique.
Thus there is some unique monic irreducible polynomial f that generates the kernel of φα, known as the minimal polynomial of α over F. So indeed, every algebraic element α has a minimal polynomial f∈F[x] that is unique, irreducible over the base field F, and has α as a root.
Thus every time we quotient by some irreducible polynomial f (i.e. F[x]/(f)), we're actually adjoining a new root α=[x], and every time we adjoin an algebraic element α (i.e. F(α)), we're actually quotienting F[x] by the ideal generated by the minimal polynomial of α.
Corollary: F(α)≅F[x]/(f) whenever f is the minimal polynomial of α.
This lets us prove an interesting fact about simple extensions: if a minimal polynomial f has two roots α,β, then adjoining α has the same effect as adjoining β.
Theorem: If α and β have the same minimal polynomial over F, then the simple extensions are isomorphic: F(α)≅F(β).If α and β have the same minimal polynomial f, then F(α) and F(β) are both isomorphic to F[x]/(f), and therefore to each other.
Corollary: Equivalently, if α and β are roots of the same irreducible polynomial f over F, then F(α)≅F(β).
There is also a deeper theorem about automorphisms of field extensions that we’ll use later. We prove it here now:
Theorem: Every automorphism σ:F(α)→F(α) that fixes F will permute the roots of the minimal polynomial f of α.Suppose f(α)=0 for f the minimal polynomial of α over F. Every automorphism σ that fixes F must satisfy
f(α) = 0
σ(f(α)) = σ(0) = 0
f(σ(α)) = 0    [since the coefficients of f lie in F and are fixed by σ]
implying that σ(α) is a root of f as well.
In summary, what we’ve shown here is that given any algebraic element α, we can refer to its minimal polynomial, which is unique and always exists. Different simple extensions, then, can be identified with the minimal polynomial of its generator.
What about larger extensions? Are we able to identify extensions like F(α)(β)=F(α,β) with some kind of minimal polynomial?
In this section, we demonstrate how non-simple extensions can be reduced to a simple extension.
If we find that F(α,β) is equal to some simple extension F(γ), then we can classify the extension with the minimal polynomial of γ. But when does such a γ exist?
Lemma: Every extension F(α,β) by two elements α,β algebraic over F is equal to a simple extension, provided that the minimal polynomials of α and β have distinct roots.First, if F is a finite field, then F(α,β) is again a finite field, whose multiplicative group is cyclic with some generator γ; then F(α,β)=F(γ). This trivially makes F(α,β) a simple extension.
Now we prove the case where F is infinite. Consider an arbitrary two-element extension F(α,β) by algebraic elements α,β. We will argue that this extension is equal to the simple extension F(α+cβ) for some nonzero c∈F.
The goal is to choose c such that β∈F(α+cβ). This is because then (α+cβ)−cβ=α∈F(α+cβ) as well, making F(α+cβ) equal to F(α,β).
To do this, we will prove that the number of nonzero c∈F for which β ∉ F(α+cβ) is finite. Since the base field F is infinite, that leaves infinitely many c for which β ∈ F(α+cβ).
- Suppose c is one of the bad values, i.e. β ∉ F(α+cβ). Note that F(α+cβ) is always a subfield of F(α,β).
- Adjoin every root of the minimal polynomials f,g of α,β respectively to F(α+cβ). This results in a field K, which has F(α+cβ) as a subfield. Then every automorphism σ of K that fixes F(α+cβ) must permute the roots of f and g.
- For such a σ we have:
α+cβ = σ(α+cβ) = σ(α) + cσ(β)
α − σ(α) = cσ(β) − cβ = c(σ(β) − β)
c = (α − σ(α))/(σ(β) − β)
For this equation to determine a nonzero c, both the numerator and denominator must be nonzero, i.e. σ(α) ≠ α and σ(β) ≠ β. Here we use the assumption that f and g have distinct roots, so that such automorphisms σ exist and the division makes sense.
- This implies there are only finitely many bad c, since f and g have finitely many other roots (deg f − 1 and deg g − 1 respectively) that α and β can be permuted to, hence finitely many possible values of (α − σ(α))/(σ(β) − β).
Since β ∉ F(α+cβ) implies that c is one of finitely many values, the contrapositive says that all other choices of c∈F give β ∈ F(α+cβ). Since F is an infinite field, there are infinitely many choices, so some c exists with β ∈ F(α+cβ), implying F(α+cβ)=F(α,β).
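The classic example is F = Q with α = √2, β = √3: the single element γ = √2 + √3 already generates both, e.g. √2 = (γ^3 − 9γ)/2. A quick numeric check (floating point, so approximate):

```python
from math import sqrt, isclose

gamma = sqrt(2) + sqrt(3)
# gamma^3 = 11*sqrt(2) + 9*sqrt(3), so gamma^3 - 9*gamma = 2*sqrt(2)
assert isclose((gamma**3 - 9 * gamma) / 2, sqrt(2))
assert isclose(gamma - (gamma**3 - 9 * gamma) / 2, sqrt(3))
```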
Given a couple of definitions, this lemma implies our main result:
Primitive Element Theorem: Every finite separable extension of F is simple.A finite extension is one obtained by adjoining finitely many elements: F(α,β,…) (TODO: the adjoined elements must be algebraic). We can iteratively apply the lemma to show that all finite separable extensions of F are simple.
An algebraic extension is simply a field extension by algebraic elements. All finite extensions are algebraic.
When a polynomial has distinct roots (i.e. no multiple roots) in some extension field, we call it a separable polynomial. Similarly, when every minimal polynomial is separable for elements in a field extension K/F, we say it is a separable extension.
Then we can reword the above lemma concisely as a key theorem:
This is called the Primitive Element Theorem because when a field extension K/F can be expressed as a simple extension F(α), we say that α is a primitive element for the extension K/F, since α generates the extension K/F. So another way to word it is:
Primitive Element Theorem: Every finite separable extension has a primitive element.
In this section, we discuss separability.
So how do we prove separability?
In characteristic 0, recall that one can identify multiple roots using the derivative of a polynomial. Recall that the derivative of a polynomial f = a_n x^n + … + a_2 x^2 + a_1 x + a_0 is defined as
f′ = n·a_n x^{n−1} + … + 2·a_2 x + a_1
The product rule of derivatives states that the derivative of a product is always
(fg)′ = f′g + fg′
The implication for multiple roots is this. If f has a multiple root α, then it factors as f = (x−α)^2 g. The product rule guarantees that the derivative of f is
f′ = ((x−α)^2 g)′ = ((x−α)^2)′ g + (x−α)^2 g′ = 2(x−α)g + (x−α)^2 g′ = (x−α)(2g + (x−α)g′)
This implies that if f has multiple root α, then f′ also has α as a root. Since the converse is also true, this means having multiple roots is the same as sharing roots with the derivative.
In other words, to check if f is separable, all we need to do is check if f shares any roots with its derivative f′ in its splitting field. As a matter of fact, this is true for any irreducible polynomial in characteristic 0.
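This check is computable: take gcd(f, f′) over Q and see whether it is constant. A self-contained Python sketch using exact rationals (all helper names are my own):

```python
from fractions import Fraction

def trim(f):
    """Drop trailing zero coefficients (highest-degree side)."""
    while len(f) > 1 and f[-1] == 0:
        f.pop()
    return f

def deriv(f):
    """Formal derivative; coefficients low-to-high."""
    return trim([Fraction(k * c) for k, c in enumerate(f)][1:] or [Fraction(0)])

def polymod(a, b):
    """Remainder of a divided by b over Q; coefficients low-to-high."""
    a = trim([Fraction(c) for c in a])
    b = trim([Fraction(c) for c in b])
    while len(a) >= len(b) and any(a):
        c = a[-1] / b[-1]
        shift = len(a) - len(b)
        for j, bj in enumerate(b):
            a[shift + j] -= c * bj
        a = trim(a)
        if not any(a):
            break
    return a

def polygcd(a, b):
    while any(b):
        a, b = b, polymod(a, b)
    return a

def is_separable(f):
    """f has no repeated roots iff gcd(f, f') is constant."""
    return len(polygcd([Fraction(c) for c in f], deriv(f))) == 1

assert is_separable([1, 0, 0, 0, 1])    # x^4 + 1: no repeated roots
assert not is_separable([2, -3, 0, 1])  # x^3 - 3x + 2 = (x-1)^2 (x+2)
```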
Theorem: In characteristic 0, every irreducible polynomial is separable.- It is enough to prove that gcd(f,f′) is constant in the splitting field, because that means they cannot share any factor (x−α), and therefore f has no multiple root.
- Since f is irreducible, gcd(f,f′) is equal to either f or a constant polynomial. If it’s constant, we are done. Otherwise, gcd(f,f′)=f is only possible if f′=0, since f′ has smaller degree than f.
- But in characteristic 0, the only polynomials that have a zero derivative are the constant polynomials. This is because any nonconstant term a_k x^k (with k ≥ 1 and a_k ≠ 0) would contribute the term k·a_k x^{k−1} to the derivative, and k·a_k is always nonzero in characteristic 0.
- Since f is irreducible, it is not a constant polynomial, and therefore f′ is nonzero, and thus gcd(f,f′) cannot be f and must be a constant.
Corollary: Every field extension in characteristic 0 is separable.Since minimal polynomials are irreducible, every minimal polynomial is separable by the above theorem, and therefore every extension in characteristic 0 is separable.
Since fields of characteristic 0 are always infinite, and every extension in characteristic 0 is separable, this gives us a corollary of the Primitive Element Theorem: every finite extension in characteristic 0 has a primitive element.
In summary, since this exploration deals only with finite extensions of infinite fields, we can pretty much always assume that field extensions are simple, unless otherwise stated.
In this section, we go wild with adding roots to a field.
When we add a root α to a field via quotienting by some polynomial f irreducible over F, we’re also making it so that f is actually reducible over the resulting field, in the sense that f now factors into (x−α)g for some g. You can imagine factoring g into irreducibles, and repeating the process of adjoining their roots to make more linear factors (x−β), until you’ve decomposed the original f into linear factors over an extension field K/F.
The resulting field is known as a splitting field of f over F. If K is a splitting field of f, we say that f splits over K, or equivalently f splits in K[x]. For example: x²+1 splits over Q(i), because x²+1=(x+i)(x−i).
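As a quick numeric illustration (a floating-point sketch, not exact field arithmetic), we can confirm these factorizations for x²+1 and for x³−2:

```python
import cmath

# Check x^2 + 1 = (x + i)(x - i) at a few sample points.
for x in [0.5 + 0.2j, -1.3j, 2 + 1j]:
    assert abs((x**2 + 1) - (x + 1j) * (x - 1j)) < 1e-9

# The roots of x^3 - 2 in C are 2^(1/3) * zeta_3^k for k = 0, 1, 2.
zeta3 = cmath.exp(2j * cmath.pi / 3)
roots = [2 ** (1 / 3) * zeta3**k for k in range(3)]
for r in roots:
    assert abs(r**3 - 2) < 1e-9
print("x^2 + 1 and x^3 - 2 split as claimed")
```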
You could go even further than splitting fields. Imagine constructing an extension field K by splitting every polynomial over F, and then every polynomial over the resulting splitting field, and repeating until you’ve created a field extension K/F so large such that every polynomial in K[x] splits.
Then we’ve taken the algebraic closure of F. The result K is an algebraically closed field, i.e. one in which every nonconstant polynomial in K[x] has a root in K. The canonical example of an algebraically closed field is C. The statement that C is algebraically closed is the Fundamental Theorem of Algebra, which we’ve previously proved.
In this section, we examine how to compute the splitting field of a given polynomial over F.
How can we identify the splitting field of f over F? It turns out that splitting fields are unique (up to isomorphism):
Lemma: For any symbols α,β, F(α)(β)=F(β)(α).- When we do a simple extension by an element F(α), we adjoin the element α to F and take the field of fractions. By definition, this is the smallest field containing F and α.
- Then F(α)(β) is the smallest field containing F(α) and β, i.e. it is the smallest field containing F, α, and β.
- Likewise, F(β)(α) is the smallest field containing F(β) and α, i.e. it is the smallest field containing F, β, and α.
- Therefore these are the exact same field.
Theorem: The splitting field of f over F is unique up to isomorphism.- The splitting field always exists, since we can always quotient by a non-linear irreducible factor of f to adjoin a new root α.
- So each step adjoins a root α of f to F, giving f=(x−α)g for some g∈F(α)[x].
- Repeating this, we get a splitting field isomorphic to F(α1)(α2)…(αn).
- By the lemma, all reorderings of simple extensions are equal to each other. Since that means constructing a splitting field gives you a field isomorphic to F(α1)(α2)…(αn) regardless of the order you adjoin roots, all splitting fields of f are isomorphic.
Just like how we can refer to the minimal polynomial of a given root α, this theorem lets us refer to the splitting field of a given polynomial f.
As an example, take the splitting field of x³−2 over F, where x³−2 is irreducible over F. The roots of this polynomial (in C) are ∛2, ζ₃∛2, and ζ₃²∛2, where ζ₃ is a primitive third root of unity. None of these roots exist in F, since x³−2 is irreducible over F.
In general, taking the splitting field of a polynomial f over F means finding the minimal extension field K/F that contains all deg f roots of f, so that f splits into linear factors in K. When f is irreducible, this means adjoining all deg f roots to F.
Since splitting fields are unique up to isomorphism, we can add the roots in any order. First, let’s adjoin ∛2 to obtain F(∛2). If the base field contained ζ₃, this would add all the other roots as well, and we would be done. Otherwise, the other roots are not added, since it is impossible to express ζ₃∛2 in F(∛2) without ζ₃.
The next step is to adjoin ζ₃∛2. Taking the closure under field operations then adds the remaining root ζ₃²∛2 as well, and since we’ve now added all three roots, the result is the splitting field of x³−2.
Therefore, the process of constructing a splitting field for a polynomial f∈F[x] is inherently tied to knowing the roots of the polynomial, and involves the following steps:
- Find new roots αi of f that generate extensions F(αi) which are linearly disjoint over F: extensions where each element in α1,α2,… cannot be expressed in terms of the others.
- Adjoin each αi to the base field F, obtaining F(α1,α2,…).
- Done!
Now the problem becomes: how do we find roots that generate linearly disjoint extensions? What does it take for two roots to be linearly disjoint?
In this section, we discover the conditions for when two elements generate linearly disjoint extensions.
Recall that given a base field F, all simple extensions of F can be distinguished by their minimal polynomials, in the sense that different simple extensions have different minimal polynomials.
The degree of an extension is the degree of its minimal polynomial, or infinity if the extension is infinite. If the extension is written K/F, then its degree is written [K:F].
Theorem: Finite field extensions K/F have finite degree [K:F].Since all algebraic elements have a minimal polynomial (necessarily of finite degree), every extension by algebraic elements is finite. This means TODO
Theorem: Two simple extensions F(α) and F(β) are linearly disjoint iff the degree [F(α,β):F] is equal to the product [F(α):F]⋅[F(β):F].To be linearly disjoint, the two extensions must have the property that F(α) doesn’t contain β and F(β) does not contain α.
WLOG assume the first case, where F(α) doesn’t contain β. That would mean the minimal polynomial of β over F(α) is the same as the minimal polynomial of β over F, i.e. [F(α)(β):F(α)]=[F(β):F]. By a symmetric argument, we have [F(β)(α):F(β)]=[F(α):F]. This implies that [F(α,β):F]=[F(α)(β):F(α)]⋅[F(α):F]=[F(β):F]⋅[F(α):F].
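As a numeric sanity check of this degree-product claim (an illustrative floating-point sketch, not a proof): with α=√2 and β=√3, the element √2+√3 satisfies a polynomial of degree 4 = 2·2 over Q, and no quadratic with small integer coefficients:

```python
import math

alpha, beta = math.sqrt(2), math.sqrt(3)
gamma = alpha + beta          # candidate primitive element of Q(sqrt2, sqrt3)

# gamma satisfies the degree-4 polynomial x^4 - 10x^2 + 1, and
# 4 = [Q(sqrt2):Q] * [Q(sqrt3):Q], as the theorem predicts.
assert abs(gamma**4 - 10 * gamma**2 + 1) < 1e-9

# gamma satisfies no monic quadratic with small integer coefficients:
# numeric evidence (not a proof) that its minimal polynomial has degree 4.
for b in range(-20, 21):
    for c in range(-20, 21):
        assert abs(gamma**2 + b * gamma + c) > 1e-6
print("sqrt(2) + sqrt(3) has degree 4 over Q")
```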
TODO verify this
TODO left off here
we’ve covered:
primitive element theorem: every finite extension is a simple extension, which is generated by a primitive element
field has degree equal to the minimal polynomial of its primitive element
two extensions of F are linearly disjoint iff its degree is equal to the product of degrees of its components
splitting field definition based on linear disjointness
theorem: finite field extensions have finite degree
An element that is not algebraic over F is called transcendental over F. A key property of transcendental elements α is that the extensions F(α) they generate are linearly disjoint from every extension of F (except the ones that include α).
Theorem: If α is transcendental over F, then F(α) is linearly disjoint from any extension of F that doesn’t include α.TODO
But in general if you have two algebraic elements α and β, then whether they are linearly disjoint depends on their minimal polynomials.
Theorem: Two algebraic elements α and β generate linearly disjoint extensions if their minimal polynomials are coprime. TODO this only works one direction; non-coprime polynomials can still generate linearly disjoint extensions.Let f and g be the minimal polynomials of α and β respectively.
To be honest, this rigorous construction of the splitting field of f is just a minor detail. Just like with minimal polynomials, we can ignore this actual construction, and just refer to the fact that given any polynomial f, we can refer to its splitting field, which is unique (up to isomorphism) and always exists.
-
December 12, 2023.
Exploration 8: Galois theory
Questions:
- Which polynomial equations can be solved?
In the finite fields exploration, we’ve seen that given the finite field Fp of prime order p, taking the quotient Fp[x]/(f) by an irreducible polynomial f of degree n gives you the finite field of prime power order pⁿ.
We’ve also seen that quotienting by an irreducible polynomial is equivalent to adjoining a root α of that irreducible polynomial f. In other words, if you extend a finite field F by an algebraic element α, then F(α) has its order raised to the nth power, where n is the degree of the minimal polynomial of α.
So we have two notions of field extension that are equivalent:
- We can quotient its polynomial ring by a degree n irreducible polynomial, or
- we can adjoin an algebraic element whose minimal polynomial is degree n.
For simplicity, we might want to just refer to the field extension K/F that raises its order by n. Call that a degree n extension, and we denote degree with [K:F]=n. Then the degree of an algebraic element is the degree of its minimal polynomial.
Note that this notion of degree carries over to infinite fields too. Although a degree n extension of an infinite field is still infinite, the idea is that every element in the new field is an n-tuple of elements in the original field. TODO describe finite fields as tuples
We let the degree of a simple extension F(α) be the degree of its primitive element α. We can also define the degree [E:F] of a finite field extension E/F in general: decompose it into a composition of simple extensions by algebraic elements, and multiply the degrees of each simple extension.
Using the concept of the degree of an extension, we can start classifying some field extensions.
Theorem: A degree 1 field extension gives back a field isomorphic to the original field.- If F(α) is degree 1, its minimal polynomial is x−α.
- We can show that F[x]/(x−α)≅F using the first ring isomorphism theorem.
- The evaluation map at α, φα:F[x]→F, has a kernel consisting of all polynomials with α as a root. In other words, all polynomials that have the factor x−α, which is just the ideal (x−α).
- Since the evaluation map is surjective, im φα=F.
- The theorem states that F[x]/ker φα≅im φα. In other words, F[x]/(x−α)≅F.
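Concretely, reducing a polynomial mod (x−α) is the same as evaluating it at α (the remainder theorem), which is exactly what the evaluation map describes. A small sketch, with hypothetical helper names of my own:

```python
from fractions import Fraction

def rem_by_linear(coeffs, alpha):
    # Remainder of f(x) modulo (x - alpha), computed by Horner's rule.
    # This equals f(alpha), matching the evaluation map F[x] -> F.
    acc = Fraction(0)
    for c in coeffs:  # coefficients from highest degree down
        acc = acc * alpha + c
    return acc

f = [Fraction(2), Fraction(-3), Fraction(1)]  # 2x^2 - 3x + 1
alpha = Fraction(5)
print(rem_by_linear(f, alpha))  # 36 = f(5), so f is congruent to 36 mod (x - 5)
```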
Let’s explore the degree 2 extensions.
The quadratic extensions
The canonical example of a degree 2 extension, or a quadratic extension, is R[x]/(x²+1)≅C, which we proved a while ago.
Since degree 2 extensions are formed by quotienting by a monic irreducible degree 2 polynomial, the general form of a quadratic extension must be the quotient F[x]/(x²+bx+c). We’ll now prove a theorem that simplifies this.
Theorem: If F is of characteristic ≠2, every quadratic extension of F can be written as F[x]/(x²−δ), which is isomorphic to F(√δ), for some element δ∈F.- By definition, quadratic extensions are extensions of degree 2. Since finite field extensions are quotients by a suitable irreducible polynomial of the same degree, the quotient F[x]/(x²+bx+c) can be used to represent any degree 2 extension.
- You can always find the roots of x²+bx+c using the quadratic formula: ½(−b±√(b²−4c)).
- Since 2 is in the denominator, this requires the field to not be of characteristic 2 (otherwise 2=0 and you can’t divide by zero).
- This means every root adjoined by quotienting by (x²+bx+c) can be expressed in terms of elements of F together with √(b²−4c). Since b²−4c∈F, this means we need only adjoin a square root of an element in F to add every root of x²+bx+c, implying that F[x]/(x²+bx+c)≅F(√(b²−4c)).
- The minimal polynomial of √(b²−4c) is x²−(b²−4c).
- Therefore, every quadratic extension F[x]/(x²+bx+c) is isomorphic to the quadratic extension F[x]/(x²−(b²−4c)). If we let δ=b²−4c, then this becomes F[x]/(x²−δ), which is isomorphic to F(√δ).
The implication is that every quadratic extension can be obtained by adjoining a square root of some element in the base field. So the general form of a quadratic extension is F(√δ) for some δ∈F. Note that the proof implies that the roots adjoined are ±√δ. Therefore the roots are symmetric in a sense – adjoining one root √δ always adds the other root, because you can obtain it via −√δ.
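A numeric sketch of this (the coefficients b, c below are arbitrary example values): both roots of x²+bx+c live in the field obtained by adjoining the single square root √(b²−4c).

```python
import cmath

# Roots of x^2 + bx + c are (-b +/- sqrt(b^2 - 4c)) / 2, so adjoining
# sqrt(delta) with delta = b^2 - 4c adjoins both roots at once.
b, c = 3, 1                      # arbitrary example coefficients
delta = b * b - 4 * c
sqrt_delta = cmath.sqrt(delta)
r_plus = (-b + sqrt_delta) / 2
r_minus = (-b - sqrt_delta) / 2

for r in (r_plus, r_minus):
    assert abs(r**2 + b * r + c) < 1e-9
# The roots are conjugate: each is expressible from the other over F.
assert abs(r_plus + r_minus + b) < 1e-9   # r_minus = -b - r_plus
print("both roots lie in F(sqrt(b^2 - 4c))")
```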
Whenever two roots of the same irreducible polynomial can be expressed in terms of each other within the base field, we call them algebraic conjugates, or just conjugates. This generalizes the concept of complex conjugates in C to any field extension by an algebraic element. In general, quadratic extensions always produce two conjugate roots, but larger extensions might have a larger set of roots (that are all conjugate to each other). We’ll see an example now.
The cyclotomic extensions
An nth root of unity ζₙᵏ=exp(2πik/n) is primitive if it can be used to express all the other nth roots of unity. This is true exactly when k is coprime to n.
The nth cyclotomic polynomial, Φn, is the minimal polynomial over Q for any primitive nth root of unity ζn. It’s irreducible and unique by definition of minimal polynomial, therefore we can use it to write the nth cyclotomic extension Q[x]/(Φn).
Theorem: The roots of the nth cyclotomic polynomial are exactly the primitive nth roots of unity.- First of all it is clear that all primitive nth roots of unity ζn share the same minimal polynomial (Φn), and are therefore roots of Φn.
- Now we need to show these are the only roots of Φn. To do this, we make a degree argument.
- First, note that the polynomial xⁿ−1 has all nth roots of unity as roots, not just the primitive ones. Since every nth root of unity is a primitive dth root of unity for some divisor d of n, and xⁿ−1 (by virtue of containing all nth roots of unity) must contain all these primitive dth roots of unity for each d, we can conclude that each Φd is a factor of xⁿ−1. This includes Φn.
- Second, note that these Φd are disjoint (share no roots). If Φd₁ and Φd₂ share a root ζ, then by definition both of those polynomials must be the minimal polynomial of ζ. But minimal polynomials are unique, which implies Φd₁=Φd₂.
- This proves the identity xⁿ−1=∏d∣n Φd.
- Finally, note that if any root ζ of Φn is not a primitive nth root of unity, then it must be a primitive dth root of unity for some d∣n. But then Φd and Φn would share a root ζ, which contradicts the fact that each Φd is disjoint. Thus the only roots of Φn are all the primitive nth roots of unity ζn.
Since primitive nth roots of unity, by definition, can express all the nth roots of unity, adding one root adds them all. In other words, the roots of Φn are conjugate. Like the quadratic extensions, the cyclotomic extensions are an example of where all of the roots of the minimal polynomial are conjugate.
Since there are φ(n) primitive nth roots of unity, cyclotomic extensions are degree φ(n). Note that when p is prime, φ(p)=p−1, and thus pth cyclotomic extensions are always of degree p−1.
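The identity xⁿ−1=∏d∣n Φd from the proof above also gives a direct way to compute cyclotomic polynomials. A minimal Python sketch (integer coefficient lists, highest degree first; helper names are mine):

```python
from math import gcd

def phi(n):
    # Euler's totient: count of k in 1..n coprime to n.
    return sum(1 for k in range(1, n + 1) if gcd(k, n) == 1)

def polydiv(num, den):
    # Exact division of integer polynomials, highest-degree coefficient first.
    num, q = num[:], []
    while len(num) >= len(den):
        coef = num[0] // den[0]
        q.append(coef)
        for i in range(len(den)):
            num[i] -= coef * den[i]
        assert num.pop(0) == 0      # leading term cancels at every step
    return q

def cyclotomic(n):
    # Phi_n = (x^n - 1) / product of Phi_d over proper divisors d of n.
    poly = [1] + [0] * (n - 1) + [-1]    # x^n - 1
    for d in range(1, n):
        if n % d == 0:
            poly = polydiv(poly, cyclotomic(d))
    return poly

print(cyclotomic(6))                            # [1, -1, 1], i.e. x^2 - x + 1
assert len(cyclotomic(12)) - 1 == phi(12)       # deg Phi_n = phi(n)
```

For example, `cyclotomic(6)` returns `[1, -1, 1]`, i.e. x²−x+1, and the degree of Φ12 comes out to φ(12)=4.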
In this section, we discover properties of conjugate roots.
In both quadratic and cyclotomic extensions, all the roots we adjoined were conjugate to each other.
In general, not all roots adjoined by a field extension are conjugate to each other. One case is when a field extension is made up of multiple extensions. For example, a biquadratic extension F(√α,√β) consists of two different quadratic extensions, where there is no q∈F such that α=q²β. That last condition ensures that √α and √β cannot be expressed in terms of each other over the base field F. This means we’ve added the roots ±√α (which are conjugates) and ±√β (which are conjugates), but √α is not conjugate to √β.
Here’s a more concrete example. Over F, take the splitting field of x³−2, whose roots can be expressed as ∛2, ζ₃∛2, and ζ₃²∛2. Taking F[x]/(x³−2) will give us one of the roots (perhaps ∛2) but not the others. In order to split x³−2, we must add one of the remaining roots, say ζ₃∛2. Note that each of these three roots can be expressed in terms of the other two:
- ∛2 = (ζ₃∛2)⁻¹(ζ₃²∛2)²
- ζ₃∛2 = (∛2)⁻¹(ζ₃²∛2)²
- ζ₃²∛2 = (∛2)⁻¹(ζ₃∛2)²
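These three identities can be verified numerically (a floating-point sketch):

```python
import cmath

zeta3 = cmath.exp(2j * cmath.pi / 3)      # primitive third root of unity
cbrt2 = 2 ** (1 / 3)
r0, r1, r2 = cbrt2, zeta3 * cbrt2, zeta3**2 * cbrt2   # roots of x^3 - 2

# Each root is a product of powers of the other two, as listed above.
assert abs(r0 - r1**-1 * r2**2) < 1e-9
assert abs(r1 - r0**-1 * r2**2) < 1e-9
assert abs(r2 - r0**-1 * r1**2) < 1e-9
print("each root of x^3 - 2 is expressible from the other two")
```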
So we only need to adjoin two of the roots in order to split x³−2. Unlike the previous example, where taking a single quotient F[x]/(x²+1) was enough to split x²+1, taking a single quotient F[x]/(x³−2) is not enough to create the splitting field of x³−2, since it only adds one root. We can distinguish these two types of extensions.
Define a normal extension K/F as a field extension of F such that K is the splitting field for some polynomial f over F. Then F[x]/(x²+1) is clearly normal, since it’s the splitting field of x²+1. To show that F[x]/(x³−2) is not normal, we rely on the following theorem:
Theorem: If K/F is normal, then every polynomial irreducible over F that has a root in K splits in K.- If K/F is normal, then by definition K is the splitting field for some polynomial f over F.
- Any polynomial g irreducible over F with a root in K is a factor of f, since K (being the splitting field of f) adds only roots of f and nothing more.
- But since f splits in its splitting field K, any factor of f splits in K, therefore g splits in K.
Corollary: By the contrapositive, if some polynomial irreducible over F with a root in K doesn’t split in K, then K/F is not normal.
So to prove an extension K/F is normal, we need to show that it is the splitting field for some polynomial over F. To prove an extension is not normal, we need to show that it doesn’t split some polynomial irreducible over F with a root in K. Since F[x]/(x³−2) doesn’t split x³−2 but contains one of its roots, F[x]/(x³−2) is not normal.
Normal extensions are interesting because they have the following property that we like to have:
Theorem: If K/F is normal and contains a root α, then it contains all the conjugates of α.By the previous theorem, if K contains a root α, then it splits its minimal polynomial f. Therefore it contains all the roots of f, which are the conjugate roots of α.
We’ll see the implications of this property in the next section.
If we work with splitting fields, which are always normal extensions by definition, then we can always assume the following:
Therefore: Splitting fields in characteristic 0 are Galois extensions.- Splitting fields are normal, since normal extensions are by definition exactly the splitting fields. Thus they contain all conjugates of their roots.
- Splitting fields are finite extensions, since they need only adjoin finitely many roots to split a polynomial, and all field extensions in characteristic 0 are separable. Being finite and separable means that splitting fields in characteristic 0 are simple extensions F(α). Since splitting fields are normal, α generates every root (not just some roots) of every irreducible polynomial over F that has a root in K.
- Such extensions (finite separable and normal) are known as Galois extensions, and they have some very interesting properties which we will explore in the next section.
In this section, we explore the intermediate fields of a Galois extension.
Again, consider the splitting field of x³−2: K=F(∛2, ζ₃∛2). The intermediate fields of K/F are all the subfields of K that contain F (including K and F themselves). It turns out that K has six intermediate fields:
The basic idea is that you can try to adjoin every combination of elements of K not in F, i.e. every expression composed of the adjoined roots. The intermediate fields are the results. Unfortunately, there are infinitely many possible sets of elements to adjoin, and adjoining one set of elements often gives you the same field as adjoining a different set. In general, finding intermediate fields is highly non-trivial without the tools we’re about to present.
The main way we define intermediate fields of K/F is by defining F-automorphisms, automorphisms σ:K→K that fix F.
Theorem: Elements in K/F fixed by an F-automorphism σ on K form an intermediate field of K/F.- All automorphisms fix the identities 0 and 1. So to prove the fixed elements of σ form a subfield of K, we just check that they contain all additive inverses, all nonzero multiplicative inverses, and are closed under addition and multiplication.
- Inverses: σ(−a)=−σ(a)=−a and for nonzero a, σ(a−1)=σ(a)−1=a−1
- Closure: σ(a+b)=σ(a)+σ(b)=a+b and σ(ab)=σ(a)σ(b)=ab
- Since all F-automorphisms fix F by definition, F is included in the fixed elements, which we just proved is a subfield of K. That means the fixed elements must form an intermediate field of K/F.
This means the task of finding intermediate fields of K/F can be reduced to the task of identifying F-automorphisms of K/F.
F-automorphisms have a number of properties that we can leverage in order to find them. First of all:
Theorem: If K/F is a finite separable extension of F, any F-automorphism σ:K→K that fixes F must map each root to one of its conjugates.We have already proved this for the case that K is a simple extension F(α) of F. Since the Primitive Element Theorem guarantees that all finite separable extensions are simple, the proof extends to all finite separable extensions.
Corollary: As all automorphisms are permutations of the underlying field, F-automorphisms are exactly the permutations that only permute the roots adjoined by the field extension.
Corollary: Since normal extensions K/F contain all conjugate roots, the F-automorphisms of a normal extension represent permutations of all the roots adjoined to K.
Corollary: Consider Galois extensions, which are normal and also separable (the roots adjoined to them are all distinct). This means that unique permutations on those roots correspond to unique F-automorphisms.
Thus for non-Galois extensions it is possible that two distinct F-automorphisms correspond to the same intermediate field, but for Galois extensions we can simply identify all permutations on roots to identify all F-automorphisms and therefore all intermediate fields. This is the goal of the next section.
Side note: the reason normal extensions are called normal is the same reason normal subgroups are called normal subgroups – their F-automorphisms happen to be invariant under conjugation.
Theorem: If σ and τ are F-automorphisms of a normal extension K/F, then στσ⁻¹ is also an F-automorphism of K/F.- First of all, since each of σ,τ,σ⁻¹ fixes F, the composition fixes F as well.
- Second, since each of σ,τ,σ−1 are automorphisms in a normal extension, they represent permutations on the roots of polynomials.
- Permutations compose, so the composition also permutes the roots of polynomials, and is thus an F-automorphism of K/F.
In this section, we identify the permutations on roots of a field extension.
The last corollary showed that for Galois extensions, we can count the number of F-automorphisms by counting the number of permutations of the roots in K. Define the Galois group G(K/F) of an extension K/F as the set of all F-automorphisms on K, which represent permutations on the roots adjoined to K. These permutations form a group, since you can always invert a permutation, and the composition of two permutations is a permutation and is an associative operation, and there is an identity permutation that swaps no roots.
Here are some theorems about Galois groups and extensions which we’ll use later:
Theorem: The degree of any Galois extension K/F is equal to the order of its Galois group G(K/F).- The order of G(K/F) is precisely the number of F-automorphisms of K.
- Recall that a Galois extension is finite and separable, and therefore has a primitive element α (which is a root of its minimal polynomial f). Since every root adjoined to K can be expressed in terms of the primitive element α, each F-automorphism σ is characterized by where it sends α.
- Since K/F is normal, every root of f is in K and can be mapped to (i.e. are candidates for σ(α)). Since K/F is separable, these roots are distinct. Thus there are deg f possible F-automorphisms, one for each of the deg f distinct roots of f.
- In other words, the order of the Galois group ∣G(K/F)∣ is equal to the degree of f, which is the degree of K/F by definition.
Theorem: The roots of an irreducible polynomial are interchangeable under the Galois group. In other words, if K/F is a Galois extension and f is irreducible over F, the Galois group G(K/F) acts transitively on the roots of f.If α and β are roots of f, then F(α) and F(β) are isomorphic. But this isomorphism is an F-automorphism in the splitting field K, since it fixes F and swaps α and β. Therefore there is indeed an F-automorphism in G(K/F) that maps arbitrary α to arbitrary β.
Theorem: If K/F is Galois of prime degree p, its Galois group G(K/F) is cyclic.Since ∣G(K/F)∣=[K:F]=p is prime, every non-identity element generates the whole group by Lagrange’s theorem, so G(K/F) is cyclic.
Lemma: A Galois extension K/F is also a Galois extension K/L over every intermediate field L of K/F.- Lemma 1: If K/F is normal, then for any intermediate field L, K/L is normal.
K/F, being normal, is the splitting field for some polynomial f over F. Since it is a splitting field, K is already the minimal field containing F together with the roots of f. Then K is also the minimal field containing any intermediate field L of K/F together with the roots of f, meaning it is also the splitting field for f over L, and thus K/L is normal.
- Lemma 2: If K/F is separable, then for any intermediate field L, K/L is separable.
- Recall that an extension K/F is separable if every element of K is the root of a separable polynomial over F. Recall that a separable polynomial has no repeated roots.
- Every minimal polynomial g over L is potentially of lesser degree than the corresponding minimal polynomial f over F, due to L containing more roots than F. Thus, g divides f over L.
- That means every root of g is a root of f. That implies that if f∈F[x] has distinct roots, then g∈L[x] has distinct roots too. So separable polynomials in F[x] are also separable in L[x].
- K/F is separable, so every element of K is a root of some separable polynomial over F, which we just showed are also separable polynomials over L, implying that K/L is separable.
- A Galois extension is one that is normal and separable. If K/F is normal and separable, then K/L is normal by Lemma 1, and separable by Lemma 2, thus Galois.
Recall that quadratic extensions (degree 2 Galois extensions) are always obtainable by adjoining a square root. We can generalize this to prime degree Galois extensions. It turns out that, like quadratic extensions, they are always obtainable by adjoining a pth root, provided that the base field contains the pth roots of unity. This last requirement was not necessary for quadratic extensions, since the 2nd roots of unity are 1 and −1, which always exist.
Theorem: Degree 2 extensions are Galois.- Quadratic extensions (in characteristic ≠2) are always expressible by adjoining a square root. Thus they are simple extensions by an algebraic element, which are finite and (in characteristic 0) separable.
- Degree 2 is the minimum degree required to split an irreducible quadratic polynomial. Since every irreducible with a root in the quadratic extension must split in the extension, quadratic extensions are normal.
- So quadratic extensions are finite separable and normal, meaning they’re Galois extensions.
Kummer’s Theorem: (for prime p) A degree p extension K/F is Galois iff K is expressible by adjoining a pth root of an element a∈F, whenever F is a subfield of C containing the pth roots of unity ζₚ.- (→) If K/F is Galois of degree p, then K=F(α) for some pth root α of an element in F.
- Since K/F is Galois of degree p, its Galois group G(K/F) is cyclic with a generator σ.
- Since p≠1, σ is not the identity. Therefore there must be some element α∈K not fixed by σ. Let f be the minimal polynomial of α.
- The orbit of α under G(K/F) contains all the roots of f, since G(K/F) must act transitively on the roots of f. Since K/F is separable, these roots are distinct; and since the orbit size divides ∣G(K/F)∣=p and is greater than 1, there are exactly p of them.
- As f is a degree p irreducible polynomial over a field F containing the pth roots of unity, it must be of the form xᵖ−a for some a∈F. This implies α is a pth root of an element of F.
- Since every root is expressible in terms of α, F(α) is a splitting field of f. But since K/F is normal, K is also a splitting field of f containing the exact same roots. Thus they must be the same field: K=F(α) (not merely an isomorphism)
- Thus K is obtained by adjoining a pth root.
- (←) If K=F(α) for some pth root α of an element a in F, then K/F is Galois of degree p.
- α is a root of its minimal polynomial xᵖ−a, and therefore is a pth root of a. We can write the pth roots of a as ζₚᵏα, and they are all in K since we assume ζₚ∈F. Thus xᵖ−a splits in K.
- The derivative of xᵖ−a is pxᵖ⁻¹. Assuming we’re not in characteristic p, xᵖ−a shares no roots with its derivative and is therefore separable.
- Any polynomial with a root in K can express that root in terms of α, since K=F(α) is simple. Since the minimal polynomial of α splits in K into distinct roots, any polynomial with a root in K must also split in K into distinct roots. Therefore K/F is normal and separable, and therefore Galois. Since xᵖ−a is the minimal polynomial of α and is of degree p, K/F is of degree p.
Such extensions K are Kummer extensions, characterized by the fact that if a field contains the pth roots of unity, you can obtain a degree p Galois extension by adjoining any pth root that isn’t in the base field. This is a generalization of quadratic extensions (where p=2), in which you obtain a degree 2 Galois extension by adjoining any square root not in the base field.
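A numeric sketch of the Kummer situation, with illustrative values p=3 and a=5 (my own example, not from the text): once ζ₃ and one cube root of 5 are available, x³−5 splits into three distinct linear factors.

```python
import cmath

p, a = 3, 5                          # illustrative values
zeta = cmath.exp(2j * cmath.pi / p)  # a primitive pth root of unity
alpha = a ** (1 / p)                 # one real pth root of a
roots = [alpha * zeta**k for k in range(p)]

# Every zeta^k * alpha is a root of x^p - a ...
for r in roots:
    assert abs(r**p - a) < 1e-9
# ... and the roots are pairwise distinct, so x^p - a is separable.
for i in range(p):
    for j in range(i + 1, p):
        assert abs(roots[i] - roots[j]) > 1e-6
print("x^3 - 5 splits into 3 distinct linear factors")
```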
In this section, we generalize a correspondence between the Galois group of an extension and its intermediate fields.
Let’s revisit the intermediate fields of the splitting field of x3−2:
It turns out the Galois group for this extension is the symmetric group S3. Now notice how the subgroup diagram of S3 below looks exactly like the intermediate field diagram above, but flipped upside-down:
In fact, each subgroup of S3 directly corresponds to a set of permutations of the three roots of x³−2. The correspondence between subgroups of S3 and intermediate fields is that each intermediate field L consists of exactly the elements KH that are fixed by every F-automorphism in the corresponding subgroup H of the Galois group G(K/F). Call KH the fixed field of H.
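We can verify by brute force that S3 has exactly six subgroups, matching the six intermediate fields. A small Python sketch (in a finite group, a subset containing the identity and closed under composition is automatically a subgroup):

```python
from itertools import permutations, combinations

# Elements of S3 as permutations of the three roots, with composition.
S3 = list(permutations(range(3)))
def compose(p, q):
    return tuple(p[q[i]] for i in range(3))

identity = (0, 1, 2)
subgroups = []
for r in range(1, len(S3) + 1):
    for subset in combinations(S3, r):
        s = set(subset)
        if identity in s and all(compose(a, b) in s for a in s for b in s):
            subgroups.append(s)

print(len(subgroups))                         # 6 subgroups
print(sorted(len(s) for s in subgroups))      # orders [1, 2, 2, 2, 3, 6]
```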
Lemma: The intersection of two subfields is a subfield of both.- Since F1 and F2 are fields and therefore integral domains, their subring F1∩F2 is also an integral domain.
- All that is left is to show every nonzero element of F1∩F2 has a multiplicative inverse. But since F1 and F2 both have the same multiplicative inverses for all their nonzero elements, so does F1∩F2.
Theorem: The fixed field KH of a subgroup H of the Galois group of K/F is an intermediate field of K/F.We already know that a single F-automorphism defines an intermediate field of K/F. A subgroup containing multiple F-automorphisms will fix the elements KH corresponding to the intersection of the fixed elements of each individual F-automorphism. Being an intersection of subfields of K, KH must be a subfield of each of those subfields, and is therefore also a subfield of K containing F (i.e. an intermediate field of K/F).
This is enough to prove that in general, the intermediate fields of a Galois extension correspond to the subgroups of its Galois group.
Fundamental Theorem of Galois Theory: If K/F is Galois, its intermediate fields L are in a one-to-one correspondence with the subgroups of its Galois group G(K/F).First, for every intermediate field L, we can identify the permutations of K that fix L, which is the Galois group G(K/L). Call this map σ=L↦G(K/L).
Second, for every subgroup H≤G(K/F), we can identify the elements of K fixed by every permutation in H, which is its fixed field KH. Call this map τ=H↦KH.
The theorem states that σ and τ are inverses. In other words, the subset relations below are actually equalities:
This essentially only requires us to prove two things:
For σ=L↦G(K/L): we must have [K:L]=∣G(K/L)∣.
- Let G=G(K/L). Recall that G is exactly all the automorphisms of K that fix L. This is a subgroup of G(K/F).
- Since K/L is finite and separable, it has a primitive element α. Let f be its minimal polynomial.
- Apply the orbit-stabilizer theorem on the action of G on the roots of f. The orbit-stabilizer theorem states that the size of the orbit of α is equal to the size of the group divided by the size of the stabilizer of α: ∣Oα∣=∣G∣/∣Gα∣. Choosing α as a primitive element makes it so its stabilizer Gα is trivial, since if an automorphism fixes α, it fixes the whole extension.
- We know that ∣Gα∣=1, therefore the order of G matches the order of the orbit of α. We just need to show that there are [K:L] elements in the orbit of α.
- Since the minimal polynomial f of α is irreducible over L, the Galois group G acts transitively on its roots. Therefore all of the roots of f are in the orbit of α. Since K/L is separable, the roots of f are distinct, meaning the number of roots is equal to the degree [K:L] of the extension.
For τ=H↦KH: we must have ∣H∣=[K:KH].
- Since KH is an intermediate field of K/F, we know K/KH is Galois, and therefore ∣G(K/KH)∣=[K:KH].
- The automorphisms of K that fix the fixed field of H are exactly H, so G(K/KH)=H by definition. Therefore ∣H∣=[K:KH].
Because of this fundamental correspondence, the arduous task of studying the intermediate fields of a Galois extension can be reduced to the much simpler task of studying the subgroups of its Galois group.
Recall that all splitting fields in characteristic 0 are Galois extensions. Since each splitting field is defined over an irreducible polynomial, we can assign to each irreducible polynomial the Galois group of its splitting field. In fact, you can go to this website to look up the Galois group of every irreducible polynomial in Z[x]!
-
December 21, 2023.
Exploration 9: Solvability
Questions:
- When exactly does a formula for the roots of a polynomial exist?
Recall that we discussed quadratic extensions and cyclotomic extensions. What are their degrees and Galois groups?
For quadratic extensions, it is very straightforward. Since all quadratic extensions are simple extensions by a root of a degree 2 polynomial, every quadratic extension is degree 2, with Galois group C2.
What about cyclotomic extensions? Recall that the nth cyclotomic extension is created by adjoining a primitive nth root of unity. Since the minimal polynomial of such a root is the nth cyclotomic polynomial, whose roots are the φ(n) primitive nth roots of unity, the degree of a cyclotomic extension is φ(n), with abelian Galois group (Z/nZ)× of order φ(n).
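As a sanity check, here is a short Python sketch (the helper names `phi` and `unit_group` are mine) verifying that the order of (Z/nZ)× is φ(n), and that (Z/8Z)× is abelian but not cyclic, since every element squares to the identity:

```python
from math import gcd

def phi(n):
    """Euler's totient: count of 1 <= k <= n with gcd(k, n) == 1."""
    return sum(1 for k in range(1, n + 1) if gcd(k, n) == 1)

def unit_group(n):
    """The underlying set of the multiplicative group (Z/nZ)^x."""
    return {k for k in range(1, n + 1) if gcd(k, n) == 1}

# degree of the nth cyclotomic extension = |(Z/nZ)^x| = phi(n)
for n in [5, 7, 8, 12]:
    assert len(unit_group(n)) == phi(n)

# (Z/8Z)^x = {1, 3, 5, 7} is abelian but NOT cyclic: every element squares to 1
assert all(pow(k, 2, 8) == 1 for k in unit_group(8))
```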
Now let’s move on to cubics.
Solving cubic polynomials
Cubic polynomials have the form x³ + a2x² + a1x + a0. That's a lot, but we can eliminate the a2x² term with the substitution x ↦ x − a2/3 (the depressed cubic substitution):
(x − a2/3)³ + a2(x − a2/3)² + a1(x − a2/3) + a0
= (x³ − 3(a2/3)x² + 3(a2²/9)x − a2³/27) + a2(x² − 2(a2/3)x + a2²/9) + a1(x − a2/3) + a0
= x³ − a2x² + (a2²/3)x − a2³/27 + a2x² − (2a2²/3)x + a2³/9 + a1x − a1a2/3 + a0
= x³ + (a1 − a2²/3)x + (2a2³/27 − a1a2/3 + a0)
= x³ + ((3a1 − a2²)/3)x + ((2a2³ − 9a1a2 + 27a0)/27)
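The substitution can be verified mechanically. This sketch (helper names mine, exact rational arithmetic via `fractions`) composes a sample cubic with x − a2/3 and checks that the x² term vanishes and the remaining coefficients match the derived formulas:

```python
from fractions import Fraction

def polymul(p, q):
    """Multiply coefficient lists (index = degree, lowest degree first)."""
    r = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            r[i + j] += a * b
    return r

def polyadd(p, q):
    n = max(len(p), len(q))
    p = p + [Fraction(0)] * (n - len(p))
    q = q + [Fraction(0)] * (n - len(q))
    return [a + b for a, b in zip(p, q)]

def compose(p, q):
    """p(q(x)) via Horner's rule."""
    result = [Fraction(0)]
    for c in reversed(p):
        result = polyadd(polymul(result, q), [Fraction(c)])
    return result

def trim(p):
    """Drop trailing zero coefficients."""
    while len(p) > 1 and p[-1] == 0:
        p = p[:-1]
    return p

# sample cubic x^3 + 6x^2 + x + 2, so a2, a1, a0 = 6, 1, 2
a2, a1, a0 = Fraction(6), Fraction(1), Fraction(2)
f = [a0, a1, a2, Fraction(1)]
g = trim(compose(f, [-a2 / 3, Fraction(1)]))  # substitute x - a2/3

assert g[2] == 0                            # the x^2 term is gone
assert g[1] == a1 - a2**2 / 3               # new x coefficient
assert g[0] == 2*a2**3/27 - a1*a2/3 + a0    # new constant term
```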
Thus we can always express cubic polynomials as x³ + a1x + a0. In the splitting field, we have x³ + a1x + a0 = (x − α1)(x − α2)(x − α3). Expanding the RHS and comparing coefficients gives us
α1 + α2 + α3 = 0
α1α2 + α2α3 + α1α3 = a1
α1α2α3 = −a0
The first equation, α1+α2+α3=0, shows that α3 is generated by α1,α2. Thus the splitting field K/F can be written F(α1,α2) if α1 does not generate α2, and F(α1) otherwise.
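These relations are easy to check numerically: pick three roots that sum to 0, build a1 and a0 from them, and verify that each root satisfies x³ + a1x + a0 = 0.

```python
# three roots summing to 0, as the first relation requires
r1, r2, r3 = 1, 2, -3
assert r1 + r2 + r3 == 0

a1 = r1*r2 + r2*r3 + r1*r3   # second relation
a0 = -(r1 * r2 * r3)         # third relation

# f(x) = x^3 + a1*x + a0 vanishes at each chosen root
for r in (r1, r2, r3):
    assert r**3 + a1*r + a0 == 0
```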
So there’s a tower of fields:
Consider f=(x−α1)h over F(α1). Either h splits (so α1 generates α2) and the top two fields are equal, or h doesn’t split and is irreducible (so α2 is degree 2, since h is quadratic).
This means the degree of the splitting field of a cubic is either [K:F]=3 or [K:F]=6.
Splitting fields of a polynomial over F are Galois extensions over F. Since the degree of this splitting field is either 3 or 6, and the Galois group must consist of permutations of the three roots, G(K/F) must be a subgroup of the symmetric group S3 of order 3 or 6: either the cyclic group C3, or S3 itself.
In conclusion: while the splitting fields of quadratics are always degree 2 with Galois group C2, splitting fields of cubics are either degree 3 or 6, with Galois group C3 or S3 respectively.
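There is a standard criterion (not derived in these notes) for deciding between the two cases: the depressed cubic x³ + a1x + a0 has discriminant Δ = −4a1³ − 27a0², and an irreducible cubic over Q has Galois group C3 exactly when Δ is a square. A small sketch (helper names mine):

```python
from math import isqrt

def disc(a1, a0):
    """Discriminant of the depressed cubic x^3 + a1*x + a0."""
    return -4 * a1**3 - 27 * a0**2

def is_square(n):
    return n >= 0 and isqrt(n) ** 2 == n

# x^3 - 3x + 1 (irreducible over Q): square discriminant -> Galois group C3
assert disc(-3, 1) == 81
# x^3 - 2: non-square discriminant -> Galois group S3
assert disc(0, -2) == -108
assert is_square(disc(-3, 1)) and not is_square(disc(0, -2))
```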
Solving quartic polynomials
There’s some dark magic involving nested square roots and factoring into quadratics, so I’ve ignored this bit. Just know that for any quartic polynomial f, you can define a resolvent cubic g and a discriminant D, which together determine the Galois group.
It turns out that the Galois group G for splitting fields of quartic polynomials are also subgroups of S4 for the same reason (they permute the four roots). S4 is order 24, so there are a lot more than just two subgroups like in the cubic case.
The full breakdown is as follows:
- If g is irreducible, then G≅A4 when D is a rational square, and G≅S4 otherwise.
- If g splits partially into a linear times quadratic, then G≅D4 when f is irreducible over F(√D), and G≅C4 otherwise.
- If g splits completely, then G≅K4.
Again, I won’t get into quartics too much, since the important one is quintics. Just know that we can solve quartics similarly to how we solved cubics.
Solving quintic polynomials
Solving quintic (degree 5) equations was the original goal of Galois.
Like with the other cases, the Galois group of the splitting field is a subgroup of S5. It turns out that S5, unlike S4 and S3, is not solvable.
In this section, we describe solvability and what it means.
A root α∈C is solvable over F if α can be written in terms of elements of F using field operations and radicals (square roots, nth roots, possibly nested).
Motivation: Galois showed that the roots α∈C of certain irreducible f∈Q[x] of degree 5 are not solvable.
For instance, take the root ⁵√10 of x⁵ − 10 ∈ Q[x]. It is solvable over Q since we expressed it as a radical over Q. In other words, Q ⊆ Q(⁵√10) = Q(α).
Another example: take the root ⁵√(1 + ³√2) of (x⁵ − 1)³ − 2 ∈ Q[x]. It is solvable over Q since we could express it in radicals over Q. In other words, Q ⊆ Q(³√2) ⊆ Q(⁵√(1 + ³√2)).
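Both examples can be checked numerically in floating point:

```python
# alpha = 10^(1/5) is a root of x^5 - 10
alpha = 10 ** (1 / 5)
assert abs(alpha ** 5 - 10) < 1e-9

# beta = (1 + 2^(1/3))^(1/5) is a root of (x^5 - 1)^3 - 2
beta = (1 + 2 ** (1 / 3)) ** (1 / 5)
assert abs((beta ** 5 - 1) ** 3 - 2) < 1e-9
```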
In general, to prove α is solvable over F, we prove there’s a chain of fields F⊆F1⊆F2⊆F3⊆…⊆K such that (conditions A):
- α∈K (root is in the final extension)
- ∀i.∃αi∈Fi.Fi=Fi−1(αi) (each extension is built by adjoining to previous extension)
- ∀i.∃m>0.αim∈Fi−1 (each adjoined element is an mth root of an element in the previous extension)
We shall show that, to establish the above, it is enough to establish the following:
α is solvable over F if there’s a chain of subfields F⊆F1⊆F2⊆F3⊆…⊆K such that (conditions B):
- α∈K (root is in the final extension)
- ∀i.Fi/Fi−1 is Galois of prime degree
Theorem: Every Galois extension K/F of prime degree p can be broken down into a cyclotomic and a Kummer extension.
- Since K is Galois and thus normal, K is a splitting field for xᵖ − a for some a∈F, which implies K contains the pth roots of unity.
- Let L=F(ζp) so that L/F is a cyclotomic extension that contains the pth roots of unity, so that L is a subfield of K that extends F, i.e. K⊇L⊇F.
- Then since K/F is Galois, K/L is also Galois. Since L contains the pth roots of unity, and K/L is Galois, by Kummer’s theorem, K/L is a Kummer extension of degree p.
Since both cyclotomic and Kummer extensions are obtained by adjoining some root of an element in their base field, each prime-degree Galois step can be realized by adjunctions that satisfy conditions A. Thus, given a chain that satisfies conditions B, we can construct a chain that satisfies conditions A.
Because of the Fundamental Theorem of Galois Theory, towers of field extensions F⊆F1⊆F2⊆F3⊆…⊆K correspond to chains of subgroups. In particular, towers of Galois extensions correspond to chains of normal subgroups:
Theorem: Every intermediate Galois extension L1/L2 corresponds to a normal subgroup relation H1⊲H2 in the original Galois group G(K/F).
- (→)
- According to the fundamental theorem of Galois theory, L1/L2 indeed corresponds to a subgroup relation H1≤H2 between unique subgroups H1 and H2. To prove that this is a normal subgrouping, we use the fact that L1/L2 is Galois, and therefore normal.
- Since H1=G(K/L1) and H2=G(K/L2) under the correspondence, we can see that the elements of H1 are L1-automorphisms and the elements of H2 are L2-automorphisms. In particular, the subgrouping H1≤H2 implies that all L1-automorphisms are L2-automorphisms.
- Recall that in normal extensions like L1/L2, every L2-automorphism σ maps L1 to itself. So for any τ∈H1 and α∈L1, we have σ(α)∈L1, τ fixes σ(α), and σ⁻¹ carries it back to α. Thus σ⁻¹τσ fixes L1 pointwise, i.e., σ⁻¹τσ∈H1, implying H1⊲H2.
- (←)
- Say you have H1⊲H2. Again, H1=G(K/L1) contains L1-automorphisms τ and H2=G(K/L2) contains L2-automorphisms σ under the correspondence. The normal subgrouping implies that σ⁻¹τσ is also an L1-automorphism in H1.
- Let α∈L1 be the root of some polynomial f irreducible over L2. Since G(K/L2)=H2 permutes the roots of f, σ(α) represents an arbitrary conjugate of α. To show L1/L2 is normal, we just need to show that σ(α) is in L1, implying that f splits in L1, and thus L1/L2 is normal.
- But since σ⁻¹τσ fixes L1, it must fix α∈L1. So we have σ⁻¹τ(σ(α))=α, implying τ(σ(α))=σ(α): every τ∈H1 fixes σ(α). Since K/L1 is Galois, the fixed field of H1=G(K/L1) is exactly L1, so σ(α)∈L1.
This implies:
Theorem: Every Galois extension K/F whose Galois group is abelian can be broken down into cyclotomic and Kummer extensions.
- If G(K/F) is abelian, every subgroup H is normal, and by the above theorem, must correspond to an intermediate field L whose extension is Galois.
- If G(K/F) is abelian, then it decomposes into a direct product of cyclic groups, which by the Chinese Remainder Theorem we may take to have prime power order. Each cyclic factor of order pᵐ has its own chain {1} ⊲ Cp ⊲ Cp² ⊲ … ⊲ Cpᵐ in which every quotient has order p. Stacking these chains factor by factor gives a chain of normal subgroups of G(K/F) in which every quotient has prime order: {1} ⊲ G1 ⊲ G2 ⊲ … ⊲ G(K/F)
- These subgroups all correspond to a prime degree intermediate Galois field extension, and we can break each of those down into a cyclotomic and Kummer extension.
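For a concrete instance of such a chain, take the abelian group Z/12Z ≅ C4 × C3. The sketch below (the helper name `subgroup` is mine) exhibits the chain {0} ⊂ ⟨6⟩ ⊂ ⟨3⟩ ⊂ Z/12Z, whose successive quotients have prime orders 2, 2, 3 — note that the chain has to pass through ⟨6⟩ inside the C4 part, since C4 itself is not a product of prime-order groups:

```python
def subgroup(g, n=12):
    """Cyclic subgroup of Z/nZ generated by g (additively)."""
    h, x = {0}, g % n
    while x not in h:
        h.add(x)
        x = (x + g) % n
    return h

chain = [subgroup(0), subgroup(6), subgroup(3), subgroup(1)]
orders = [len(h) for h in chain]                       # [1, 2, 4, 12]
quotients = [orders[i + 1] // orders[i] for i in range(3)]

assert quotients == [2, 2, 3]                          # every quotient is prime
assert all(chain[i] <= chain[i + 1] for i in range(3))  # each step is a subgroup
```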
This gives us conditions C:
α is solvable over F if there’s a chain of subfields F⊆F1⊆F2⊆F3⊆…⊆K such that (conditions C):
- α∈K (root is in the final extension)
- ∀i.Fi/Fi−1 is Galois with an abelian Galois group
Finally, we can convert this into the realm of group theory using the fact that intermediate Galois extensions correspond to normal Galois subgroups. This gives us conditions D:
α is solvable over F if there’s a chain of normal subgroups {1}⊲…⊲G(K/L2)⊲G(K/L1)⊲G(K/F) such that (conditions D):
- α∈K (root is in the final extension)
- ∀i. the quotient group G(K/Li)/G(K/Li+1) is abelian
Using this final version of conditions for solvability, we can prove some basic theorems about solvability.
Theorem: Subgroups of solvable groups are solvable.
Let H be the subgroup and let {1}=G0⊲G1⊲…⊲Gn=G be a chain witnessing that G is solvable. Intersect each term with H: Hi=H∩Gi. Each Hi is still normal in Hi+1, since normality survives intersection with a subgroup, and each quotient Hi+1/Hi embeds into the abelian group Gi+1/Gi, so it is abelian. Thus the chain {1}=H0⊲H1⊲…⊲Hn=H shows H is solvable.
Theorem: Non-abelian simple groups are not solvable.
If G is simple, then its only proper normal subgroup is {1}, so the only possible chain is {1}⊲G. But G/{1}≅G is non-abelian, so this chain does not satisfy conditions D, and G is not solvable.
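To make "solvable" concrete, here is a brute-force Python sketch (helper names mine) verifying the chain {1} ⊲ V4 ⊲ A4 ⊲ S4 that witnesses the solvability of S4. At each step it checks that all commutators of the bigger group land in the smaller one, which is equivalent to the smaller group being normal with abelian quotient (any subgroup containing all commutators is automatically normal):

```python
from itertools import permutations

def compose(p, q):
    """(p o q)(i) = p[q[i]]: apply q first, then p."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    inv = [0] * len(p)
    for i, v in enumerate(p):
        inv[v] = i
    return tuple(inv)

def sign(p):
    """+1 for even permutations, -1 for odd (count inversions)."""
    s = 1
    for i in range(len(p)):
        for j in range(i + 1, len(p)):
            if p[i] > p[j]:
                s = -s
    return s

def commutator(a, b):
    """a b a^-1 b^-1."""
    return compose(compose(a, b), compose(inverse(a), inverse(b)))

S4 = set(permutations(range(4)))
A4 = {p for p in S4 if sign(p) == 1}
e = (0, 1, 2, 3)
V4 = {e, (1, 0, 3, 2), (2, 3, 0, 1), (3, 2, 1, 0)}  # Klein four-group
chain = [{e}, V4, A4, S4]

# at each step, all commutators of the big group lie in the small group
for small, big in zip(chain, chain[1:]):
    assert all(commutator(a, b) in small for a in big for b in big)
```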
In this section, we prove that degree 5 polynomials are not solvable.
Note that all degree 5 polynomials have 5 roots, and thus the Galois group is at maximum S5, the symmetric group for five elements. This means all it takes to show not all degree 5 polynomials are solvable is to prove that S5 is not solvable.
Theorem: S5 is not solvable.
The alternating group A5≤S5 is a non-abelian simple group (a standard fact we won't prove here). If S5 were solvable, then by the first theorem above its subgroup A5 would be solvable too; but by the second theorem, non-abelian simple groups are never solvable. Therefore S5 is not solvable.
-
April 15, 2024.
Exploration 10: Noncommutative rings
Questions:
- TODO
Domains
In our travels we’ve built up a hefty classification of integral domains: PIDs, UFDs, fields, Euclidean domains, and more. Here is a complete classification:
Now is the time to get into noncommutative rings. We begin with the ring at the bottom, a domain: a noncommutative integral domain.
In this section, we describe left and right versions of zero divisors.
The definition of domain is not exactly “no zero divisors”, though that is a sufficient definition. A more precise definition is to say that a domain has no left zero divisors and no right zero divisors. This is to distinguish elements a where some nonzero b exists so that ab=0 (making a a left zero divisor and b a right zero divisor) and elements a where some nonzero b exists so that ba=0 (making b a left zero divisor and a a right zero divisor). Our original notion of zero divisor in a commutative ring is a two-sided zero divisor, which is an element that is both a left and right zero divisor.
The distinction becomes important because each one-sided condition gives you one side of the cancellation property of integral domains: if a is not a left zero divisor, then you can cancel it on the left, since ab=ac implies a(b−c)=0, which implies b=c.
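2×2 matrices give a concrete example of the left/right asymmetry. In the sketch below, AB = 0, making A a left zero divisor and B a right zero divisor, yet BA ≠ 0:

```python
def matmul(A, B):
    """2x2 integer matrix product."""
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

ZERO = [[0, 0], [0, 0]]
A = [[0, 1], [0, 0]]
B = [[1, 0], [0, 0]]

assert matmul(A, B) == ZERO   # A is a left zero divisor, B a right zero divisor
assert matmul(B, A) != ZERO   # but the product in the other order is nonzero
```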
A domain is then a ring (not necessarily commutative) with no nonzero left or right zero divisors.
In this section, we describe left and right versions of ideals.
What happens to ideals in the noncommutative scenario? Let’s again go through how we defined an ideal, and spot where commutativity was used.
Therefore: for noncommutative rings, a left ideal can be defined as one that absorbs multiplication on the left.
We need to show that x[a]=[xa] for every representative x of [x]. This is the same as proving that xa∼xb iff a∼b. Let's see how this works out:
xa ∼ xb
⟺ xb ∈ [xa]
⟺ xb − xa ∈ H
⟺ x(b − a) ∈ H
⟺ b − a ∈ H
⟺ b ∈ [a]
⟺ a ∼ b
Note that the step between x(b−a)∈H and b−a∈H is what requires H to absorb multiplication by x: the direction b−a∈H ⟹ x(b−a)∈H is exactly xH⊆H. This imposes xH⊆H as a requirement for multiplication in R/H to be well-defined.
Similarly, a right ideal absorbs multiplication on the right, and our original notion of ideal is a two-sided ideal, which is both a left and a right ideal.
To quotient a ring with an ideal it must be both a left and right ideal, so there are no changes needed for our understanding of quotient rings.
In this section, we measure the degree of commutativity of a ring.
Just like with groups, we can see how commutative a ring is by studying elements that commute with all other elements. Just like with groups, the set of all such elements is called the center Z(R) of the ring R.
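For a small concrete example, we can brute-force the center of the noncommutative ring of 2×2 matrices over F2; it turns out to be exactly the scalar matrices, here the zero matrix and the identity:

```python
from itertools import product

def matmul(A, B):
    """2x2 matrix product over F2."""
    return tuple(tuple(sum(A[i][k] * B[k][j] for k in range(2)) % 2
                       for j in range(2)) for i in range(2))

# all 16 matrices over F2, as nested tuples ((a, b), (c, d))
M2F2 = [((a, b), (c, d)) for a, b, c, d in product(range(2), repeat=4)]

center = [A for A in M2F2
          if all(matmul(A, B) == matmul(B, A) for B in M2F2)]

# the center is exactly the scalar matrices: the zero matrix and the identity
assert sorted(center) == [((0, 0), (0, 0)), ((1, 0), (0, 1))]
```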
Theorem: Forming a polynomial ring R[x] from a ring R preserves its center Z(R). That is, Z(R[x]) = Z(R)[x].
- The center of a ring Z(R) is all elements that (multiplicatively) commute with every element in R.
- We must show that an arbitrary element ∑izixi∈Z(R)[x] commutes with every element in R[x].
- Let r be an arbitrary element of R[x]. Since each term zixi commutes with everything, we can show that
r(∑i zixi) = ∑i rzixi = ∑i zixir = (∑i zixi)r
- Therefore an arbitrary element ∑izixi∈Z(R)[x] commutes with all of R[x]. Therefore Z(R)[x]⊆Z(R[x]).
- The converse, Z(R[x]) ⊆ Z(R)[x], is similarly straightforward: if an element ∑i rixi is in the center of R[x], then we need to show that each ri commutes with an arbitrary element s∈R.
s(∑i rixi) = (∑i rixi)s
∑i srixi = ∑i rixis
∑i srixi = ∑i risxi
sri = ris
implying each coefficient ri is in Z(R), meaning the polynomial ∑i rixi is in Z(R)[x].
Corollary: If R is a commutative ring, so is R[x].