tinytheorems - about groups - rings - modules - models atom/rss

Exploration 6: Finite fields

.

Questions:


Recall that the characteristic of every integral domain must be either zero or prime. Since all fields are integral domains, we can categorize each field FF as either an infinite field (char F=0\char F=0) or a finite field (char F=p\char F=p).

Let’s begin with the finite fields. By the end of this exploration, we will have classified all of the finite fields.

In this section, we discuss the ring of integers mod nn.

All we know so far about a finite field is that it must be of some prime characteristic pp. Let’s first study the case where its order is exactly pp — in other words, let’s study the ring of integers mod pp.

First we prove that the integers mod pp form a field:

Theorem: If pp is prime, Zp\ZZ_p is a field.

Why study Zp\ZZ_p when this is about finite fields? It turns out every finite field (with prime characteristic pp) contains Zp\ZZ_p:

Theorem: Every prime characteristic ring contains a copy of Zp\ZZ_p, the integers mod pp.
  • Every ring contains the elements {1,(1+1),(1+1+1),}\{1,(1+1),(1+1+1),\ldots\}, but for prime characteristic rings where p1=0p\cdot 1=0, this only goes up to adding p1p-1 copies of 11.
  • These are exactly the elements that form Zp\ZZ_p, the integers mod pp, so every prime characteristic ring contains Zp\ZZ_p in the sense that you can define an isomorphism between those elements and Zp\ZZ_p.

In this section, we study the effects of prime characteristic.

Rings of prime characteristic have some interesting properties. For instance, recall the binomial theorem true for all rings: for every a,bRa,b\in R and every nonnegative integer nn, we have (a+b)n=k=0n(nk)akbnk(a+b)^n=\sum_{k=0}^n{n\choose k}a^kb^{n-k}

In a ring of prime characteristic, this sum simplifies significantly to: (a+b)p=ap+bp(a+b)^p=a^p+b^p

Theorem: (a+b)p=ap+bp(a+b)^p=a^p+b^p for all a,ba,b in a ring of prime characteristic pp.
  • By the binomial theorem, (a+b)p=k=0p(pk)akbpk(a+b)^p=\sum_{k=0}^p{p\choose k}a^kb^{p-k}. Let’s focus on the coefficients (pk)p\choose k.
  • (pk)=p!k!(pk)!{p\choose k}=\frac{p!}{k!(p-k)!} always includes a factor pp in the numerator, but pp (being prime) is not a factor of the denominator unless k=0k=0 or k=pk=p.
  • Thus for other values of kk, the coefficient (pk)p\choose k must be zero since it includes a factor pp in a field of characteristic pp. For the special cases k=0k=0 and k=pk=p, the coefficient (pk)p\choose k is equal to 11.
  • Therefore (a+b)p=1ap+0++0+1bp=ap+bp(a+b)^p=1a^p+0+\ldots+0+1b^p=a^p+b^p.

This theorem is rather important, because it makes the map aapa\mapsto a^p look awfully like an homomorphism on rings of prime characteristic pp. Which it is:

Theorem: The Frobenius endomorphism aapa\mapsto a^p is an endomorphism on a ring of prime characteristic pp.
  • Let ϕ\phi be the map aapa\mapsto a^p. By the previous proof we observe ϕ(a+b)=ϕ(a)+ϕ(b)\phi(a+b)=\phi(a)+\phi(b).
  • To be a homomorphism, ϕ\phi must also preserve additive inverses and multiplication.
    • Preserving subtraction is easy for odd primes pp, because ϕ(a)=(a)p=ap=ϕ(a)\phi(-a)=(-a)^p=-a^p=-\phi(a). For the remaining prime p=2p=2, recall that every element is its own additive inverse in characteristic 22, so we can directly write ϕ(a)=ϕ(a)\phi(a)=-\phi(a).
    • Preserving multiplication is straightforward. ϕ(ka)=(ka)p=kpap=ϕ(k)ϕ(a)\phi(ka)=(ka)^p=k^pa^p=\phi(k)\phi(a).
  • Thus ϕ\phi is a homomorphism from the ring to itself, i.e. an endomorphism.

Note that Fermat’s little theorem states that for any integer aa, we have apa mod pa^p\equiv a\mod p. So specifically for the integers mod pp, Zp\ZZ_p, the Frobenius endomorphism aap(a)a\mapsto a^p(\equiv a) is actually the identity automorphism, mapping every aa to itself. We’ll make use of this fact soon.

In this section, we classify the finite fields.

Now let’s use our knowledge of the properties of prime characteristic to study all finite fields.

Let F[x]F[x] be a polynomial ring over a finite field FF.

Theorem: Every element of Zp\ZZ_p is a root of the polynomial xpxZp[x]x^p-x\in\ZZ_p[x].
  • Fermat’s little theorem implies that for any integer aa not divisible by pp (i.e. any nonzero, since pp is prime), we have ap110 mod pa^{p-1}-1\equiv 0\mod p.
  • In other words, the polynomial xp11x^{p-1}-1 has every nonzero element of Zp[x]\ZZ_p[x] as a root. We can add the zero element as a root if we multiply by xx to get xpxx^p-x.
  • Thus, if you have a polynomial ring defined over a prime characteristic field like Zp[x]\ZZ_p[x], xpxx^p-x is a polynomial that has all pp distinct roots.

TODO extend to arbitrary finite field FF.

Now csonsider the quotient F[x]/(f)F[x]/(f) for some polynomial fF[x]f\in F[x]. We actually already know two facts about F[x]/(f)F[x]/(f):

Theorem: F[x]/(f)F[x]/(f) is a field iff ff is irreducible.

Theorem: F[x]/(f)F[x]/(f) is finite with Fdeg f|F|^{\deg f} elements.

F[x]/(f)F[x]/(f) consists of cosets [f][f] whose representatives are polynomials fF[x]f\in F[x]. Earlier we proved that the degree of every representative is less than deg f\deg f, and therefore has at most deg f\deg f terms with F|F| possible coefficients for each term. This implies that there are Fdeg f|F|^{\deg f} distinct cosets in F[x]/(f)F[x]/(f).

Corollary: Given an irreducible degree nn polynomial fF[x]f\in F[x], if FF is a finite field of prime order pp, then F[x]/(f)F[x]/(f) is a finite field of prime power order pnp^n.

So we can create fields of prime power order pnp^n by taking the quotient Zp[x]/(f)\ZZ_p[x]/(f), where deg f=n\deg f=n.

One way to visualize a finite field created in this manner is by considering its elements as polynomials, whose coefficients are essentially a tuple. The elements of the prime order finite field Zp\ZZ_p are constant polynomials represented by 11-tuples: for example, Z5\ZZ_5 is just the set {(0),(1),(2),(3),(4)}\{(0),(1),(2),(3),(4)\}. Quotienting Z5\ZZ_5 by x53x^5-3 (which is irreducible by Eisenstein) gives us a quotient field whose cosets are represented by polynomials up to degree 44, as shown earlier. Degree 44 polynomials ax4+bx3+cx2+dx+eax^4+bx^3+cx^2+dx+e can be represented by 55-tuples (a,b,c,d,e)(a,b,c,d,e), where there Z5=5|\ZZ_5|=5 choices for each component representing the coefficient of its representative polynomial.

In this section, we prove the uniqueness of the finite fields up to isomorphism.

We just established that a finite field of prime power order pnp^n exists for every prime pp and n1n\ge 1, and that we can construct said field as Zp[x]/(f)\ZZ_p[x]/(f) where ff is a degree nn irreducible polynomial over Zp\ZZ_p. It is a well-established fact that its multiplicative group (consisting of all nonzero elements) is cyclic:

Theorem: Given FF a finite field of prime power order pnp^n, its multiplicative group F×F^\times is cyclic.

F×F^\times, like all finite abelian groups, decomposes into cyclic groups of prime power order: F×Cq1×Cq2××CqnF^\times\iso C_{q_1}\times C_{q_2}\times\ldots\times C_{q_n} Since the order of the component cyclic fields q1,q2,,qiq_1,q_2,\ldots,q_i are pairwise coprime, their direct product is cyclic.

This has a direct implication about the fields of prime power order.

Theorem: All fields of prime power order pnp^n are unique up to isomorphism.
  • We just showed that for an irreducible degree nn polynomial gF[x]g\in F[x], the quotient Zp[x]/(g)\ZZ_p[x]/(g) is a finite field of order pnp^n. So finite fields of order pnp^n exist for every prime pp and integer n1n\ge 1.
  • If we can define a field homomorphism between two arbitrary fields of prime power order pnp^n, then they are isomorphic because all field homomorphisms are injective and the domain and codomain have the same order pnp^n.
  • Consider the multiplicative group K×K^\times of an arbitrary field KK of prime power order pnp^n. The order of K×K^\times is exactly pn1p^n-1. By Lagrange’s theorem, the order of every element aa of K×K^\times must divide the order of K×K^\times, so apn1=1a^{p^n-1}=1, which can be rearranged to apna=0a^{p^n}-a=0.
  • Then apna=0a^{p^n}-a=0 holds for every aK×a\in K^\times. But since 0pn0=00^{p^n}-0=0, this extends to all aKa\in K. Since this holds for arbitrary fields of prime power order, let K1,K2K_1,K_2 be two such fields.
  • Then every element of aK1a\in K_1 or bK2b\in K_2 is a distinct root of the polynomial f=xpnxf=x^{p^n}-x, in the sense that f(a)=0f(a)=0 for all aK1a\in K_1 and f(b)=0f(b)=0 for all bK2b\in K_2.
  • Recall that ring homomorphisms (and therefore field homomorphisms) preserve rational expressions, including polynomial expressions. This means if f(a)=0f(a)=0, then applying ϕ\phi to both sides gives ϕ(f(a))=f(ϕ(a))=0\phi(f(a))=f(\phi(a))=0, implying that ϕ(a)\phi(a) is a root of ff as well.
  • Since ff, by construction, has pnp^n distinct roots, ϕ\phi maps all pnp^n elements of K1K_1 to some element in K2K_2. Therefore ϕ\phi is a well-defined field homomorphism and we are done.

Since all fields of order pnp^n are isomorphic to each other, this means that instead of writing the cumbersome term Zp[x]/(f)\ZZ_p[x]/(f), we can just refer to the field of prime power order q=pnq=p^n, which is denoted Fq\FF_q or sometimes GF(q)\GF(q).

Significantly, it turns out there are no other finite fields.

Theorem: every ring not of a prime power order is not a field.
  • If a ring RR is not of a prime power order, then the prime factorization of its order p1k1p2k2pnknp_1^{k_1}p_2^{k_2}\ldots p_n^{k_n} must contain at least two distinct prime powers.
  • Consider the principal ideal pikiRp_i^{k_i}R for some ii. Since prime powers are pairwise coprime, that means the Chinese Remainder Theorem can be used to show RR/p1k1R×R/p2k2R××R/pnknRR\iso R/p_1^{k_1}R\times R/p_2^{k_2}R\times\ldots\times R/p_n^{k_n}R.
  • But since a direct product of rings contains zero divisors, and zero divisors cannot be units, RR contains nonzero nonunits and is therefore not a field.

Corollary: The only finite fields are the ones of prime power order.

In this section we describe primitive elements.

The fact that Fq×\FF_q^\times is cyclic implies that every finite field Fq\FF_q has a primitive element α\alpha, the generator of F×F^\times.

Recall that we can adjoin elements of one ring to another ring of the same characteristic. What happens when we adjoin α\alpha to another finite field of the same characteristic?

Note that the finite fields of any given characteristic pp have a subfield ordering:

Theorem: Fpn\FF_{p^n} is a subfield of Fpm\FF_{p^m} iff nmn\mid m.

The result follows pretty quickly after we prove a number theoretic lemma:

Lemma: nmn\mid m iff pn1pm1p^n-1\mid p^m-1.
  • (\to) This is a straightforward calculation. nmm=knpm=pknpm1=pkn1pm1=(pn1)(pn(k1)+pn(k2)++pn+1)pn1pm1\begin{aligned} n&\mid m\\ m&=kn\\ p^m&=p^{kn}\\ p^m-1&=p^{kn}-1\\ p^m-1&=(p^n-1)(p^{n(k-1)}+p^{n(k-2)}+\ldots+p^n+1)\\ p^n-1&\mid p^m-1 \end{aligned}
  • (\from) We know that n<mn\lt m. Then gcd(pm1,pn1)=gcd(pmnpn1,pn1)=gcd(pmnpn1(pn1),pn1) using gcd(a,b)=gcd(akb,b)=gcd(pmnpnpn,pn1)=gcd(pmnpnpn,pn1)=gcd(pn(pmn1),pn1)=gcd(pmn1,pn1) because pn and pn1 are coprime\begin{aligned} &\gcd(p^m-1,p^n-1)\\ &=\gcd(p^{m-n}p^n-1,p^n-1)\\ &=\gcd(p^{m-n}p^n-1-(p^n-1),p^n-1)&\text{ using }\gcd(a,b)=\gcd(a-kb,b)\\ &=\gcd(p^{m-n}p^n-p^n,p^n-1)\\ &=\gcd(p^{m-n}p^n-p^n,p^n-1)\\ &=\gcd(p^n(p^{m-n}-1),p^n-1)\\ &=\gcd(p^{m-n}-1,p^n-1)&\text{ because }p^n\text{ and }p^n-1\text{ are coprime}\\ \end{aligned} Thus gcd(pm1,pn1)=gcd(pmn1,pn1)\gcd(p^m-1,p^n-1)=\gcd(p^{m-n}-1,p^n-1). Applying this recursively amounts to applying the Euclidean algorithm on the powers (m,n)(m,n), resulting in gcd(pm1,pn1)=gcd(pgcd(m,n)1,pgcd(m,n)1)=pgcd(m,n)1\begin{aligned} &\gcd(p^m-1,p^n-1)\\ &=\gcd(p^{\gcd(m,n)}-1,p^{\gcd(m,n)}-1)\\ &=p^{\gcd(m,n)}-1 \end{aligned} Since pn1pm1p^n-1\mid p^m-1 implies pn1p^n-1 is the GCD of the two, the LHS becomes pn1p^n-1. Then: pn1=pgcd(m,n)1pn=pgcd(m,n)n=gcd(m,n)nm\begin{aligned} p^n-1&=p^{\gcd(m,n)}-1\\ p^n&=p^{\gcd(m,n)}\\ n&=\gcd(m,n)\\ n&\mid m \end{aligned} as required.

Then:

  • (\to) By Lagrange’s theorem, the order of the subfield’s multiplicative group Fpn×|\FF_{p^n}^\times| divides that of the larger field Fpm×|\FF_{p^m}^\times|. Thus pn1pm1p^n-1\mid p^m-1, so nmn\mid m by the lemma.
  • (\from) If nmn\mid m, then nmpn1pm1 by the lemmaxpn11xpm11 by the lemmaf(xpn11)=xpm11 for some polynomial fZ[x]f(xpnx)=xpmx multiply both sides by x\begin{aligned} n&\mid m\\ p^n-1&\mid p^m-1&\text{ by the lemma}\\ x^{p^n-1}-1&\mid x^{p^m-1}-1&\text{ by the lemma}\\ f\cdot (x^{p^n-1}-1)&=x^{p^m-1}-1&\text{ for some polynomial }f\in\ZZ[x]\\ f\cdot (x^{p^n}-x)&=x^{p^m}-x&\text{ multiply both sides by }x\\ \end{aligned} which shows that every root of xpnxx^{p^n}-x is a root of xpmxx^{p^m}-x. But since the roots of xpnxx^{p^n}-x are exactly Fpn\FF_{p^n}, this implies every element of Fpn\FF_{p^n} is in Fpm\FF_{p^m}, i.e. Fpn\FF_{p^n} is a subfield of Fpm\FF_{p^m}.

Corollary: Adjoining the primitive element α\alpha of Fpn\FF_{p^n} to Fpm\FF_{p^m} results the larger of the two finite fields.
  • If Fpn\FF_{p^n} is larger, then Fpm\FF_{p^m} is a subfield of Fpn\FF_{p^n}. Since α\alpha generates Fpn\FF_{p^n}, the resulting field is Fpn\FF_{p^n}, since that already includes Fpm\FF_{p^m}.
  • If Fpm\FF_{p^m} is larger, then Fpn\FF_{p^n} is a subfield of Fpm\FF_{p^m} and therefore α\alpha is already in Fpm\FF_{p^m}, so the resulting field is Fpm\FF_{p^m}.

Corollary: Adjoining two primitive elements α,β\alpha,\beta is the same as adjoining the one with larger order.

This follows directly from the previous result. WLOG assume α\alpha generates the larger field. Then α\alpha generates β\beta, thus you need only adjoin α\alpha.

< Back to category Exploration 6: Finite fields (permalink)
Exploration 5.1: Norm Exploration 7: Fields