Exploration 3: Polynomials

December 1, 2023.

Questions:

How do we factor a polynomial with coefficients in a ring?
How do we study the rest of ring theory from the lens of polynomial rings?
What does the derivative of a polynomial mean in the context of ring theory?

Recall that to adjoin an element $a$ to a ring $R$ is to create a ring $R[a]$ generated by the two: something like $\<a,R\>$ .

When you adjoin a symbol $x$ that commutes with the entire ring $R$ , the result is a polynomial ring. It turns out polynomial rings are fundamental in the sense that much of ring theory is built upon generalizing the properties of polynomials, such as irreducibility.

The elements $f$ of a polynomial ring $R[x]$ are called polynomials with coefficients in $R$, and look like this:

$f=a_0+a_1x+a_2x^2+\ldots+a_nx^n$

where $a_i\in R$ are known as the coefficients of the polynomial $f$, $x$ is known as an indeterminate over $R$, and the exponent $n$ of the leading term $a_nx^n$ (with $a_n\ne 0$) is known as the degree of the polynomial, denoted $\deg f$. The degree of a nonzero constant polynomial $c\in R$ is zero, and the degree of the zero polynomial $0$ is undefined.

Let’s show some basic facts that hold when we form a polynomial ring.

Theorem: $R$ is an integral domain iff $R[x]$ is an integral domain.

($\Rightarrow$) WTS the product of two nonzero polynomials $f,g$ in $R[x]$ is always nonzero.
Let $a_n$ and $b_m$ be the leading coefficients of $f,g$ respectively. Since $f,g$ are nonzero, $a_n$ and $b_m$ are nonzero.
Then the coefficient of $x^{n+m}$ in $fg$ is $a_nb_m$. Since $R$ is an integral domain, this product is nonzero.
Then since $fg$ has a nonzero coefficient, $fg$ is nonzero, and we are done.
($\Leftarrow$) Conversely, $R$ sits inside $R[x]$ as the constant polynomials, so if $R[x]$ has no zero divisors then neither does $R$.

In this section, we define the operations possible on polynomials.

Polynomial addition/subtraction is straightforward – it is pairwise addition/subtraction of corresponding coefficients.

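Polynomial multiplication is more complex. In general, it looks like $(\sum_{i=0}^n r_ix^i)(\sum_{j=0}^m s_jx^j)=\sum_{k=0}^{n+m}\left(\sum_{i+j=k} r_is_j\right)x^k$. Note that this means that the resulting coefficients are the convolution of the coefficients $r_i$ and $s_j$. Also note that the coefficient of $x^{n+m}$ is $r_ns_m$, so $\deg fg=\deg f+\deg g$ whenever $r_ns_m\ne 0$ (always the case when $R$ is an integral domain). In general we only get $\deg fg\le\deg f+\deg g$.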
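The convolution of coefficients can be sketched in a few lines of Python. This is only an illustration: the representation (a list of coefficients, lowest degree first, with the empty list standing for the zero polynomial) and the name `poly_mul` are assumptions made for this example, not anything fixed by the text.

```python
def poly_mul(r, s):
    """Multiply two polynomials given as coefficient lists (lowest degree first).

    Assumption for this sketch: the empty list represents the zero polynomial.
    """
    if not r or not s:
        return []
    out = [0] * (len(r) + len(s) - 1)
    for i, ri in enumerate(r):
        for j, sj in enumerate(s):
            # r_i * s_j contributes to the coefficient of x^(i+j)
            out[i + j] += ri * sj
    return out


# (1 + 2x)(3 + x + x^2) = 3 + 7x + 3x^2 + 2x^3
product = poly_mul([1, 2], [3, 1, 1])
print(product)
```

Each output coefficient collects every product $r_is_j$ with $i+j=k$, which is exactly the convolution of the two coefficient lists.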

What about polynomial division? Dividing a polynomial $f$ by a polynomial $g\ne 0$ with remainder $r$ means finding unique polynomials $q,r$ where $\deg r<\deg g$ (or $r=0$ ) such that $f=gq+r$ . Let’s explore how this can be done.

Therefore: For any two polynomials $f,g\in R[x]$ (where $g\ne 0$) we can find unique polynomials $q,r$ (where $\deg r<\deg g$ or $r=0$) such that $f=gq+r$, but only if the leading coefficient of $g$ is a unit. This theorem is known as the division algorithm.

Let $n=\deg f$ and $m=\deg g$ .
First of all, if $f=0$ then the only solution is $q=r=0$ . Otherwise if $n<m$ , then the only solution is $q=0$ and $r=f$ . Therefore we can induct on $n$ with the assumption that $n\ge m$ .
If $n=0$, then $n\ge m$ implies $m=0$ and therefore $f,g$ are both constant polynomials $\in R$.
- Then we have $f=qg+r$ in $R$ . In $R$ , the condition that either $r=0$ or $\deg r<m$ collapses to just $r=0$ since $m=0$ .
- Since $r=0$ , $f=qg+r$ becomes $qg=f$ in $R$ . Thus the solution is $q=fg^{-1}$ and $r=0$ . This requires $g$ be a unit in $R$ so when $g$ is constant it must be a unit.
Otherwise, assume that for all $\deg f<n$, there is some $q,r$ such that $f=gq+r$. WTS that’s true for $\deg f=n$ as well.
- With the intention of applying the induction hypothesis, we’d like to obtain a polynomial of degree less than $n$ . This is easy when $f$ and $g$ have the same leading coefficient and degree; then $f-g$ has degree less than $n$ . Otherwise, we must use $f-kg$ where $k$ is a factor that transforms the leading coefficient and degree of $g$ to match those of $f$ .
- Let $f=\sum_{i=0}^n a_ix^i$ and $g=\sum_{j=0}^m b_jx^j$, where $a_n$ and $b_m$ are nonzero. Then the value of $k$ that lets us do this is $k=a_nb_m^{-1}x^{n-m}$.
- Note that this means that the leading coefficient $b_m$ of $g$ must be a unit. (This matches our earlier finding that when $g$ is constant, it must be a unit.) Then we have $\begin{aligned} kg&=k\sum_{j=0}^m b_jx^j\\ kg&=k\left(b_mx^m+\sum_{j=0}^{m-1} b_jx^j\right)\\ kg&=a_nb_m^{-1}x^{n-m}\left(b_mx^m+\sum_{j=0}^{m-1} b_jx^j\right)\\ kg&=a_nx^n+\sum_{j=0}^{m-1} a_nb_m^{-1}b_jx^{j+n-m} \end{aligned}$
- Since $f$ and $kg$ have the same leading term $a_nx^n$, the difference $f-kg$ cancels out the leading terms and therefore is either zero or has degree less than $n$. Then the induction hypothesis with $f-kg$ and $g$ gives us polynomials $q',r'$ such that $f-kg=q'g+r'$ (where either $r'=0$ or $\deg r'<\deg g$). $\begin{aligned} f-kg&=q'g+r'\\ f&=(k+q')g+r' \end{aligned}$
- Thus we have a solution $q=k+q'$ and $r=r'$ .
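The induction in this proof unrolls into a loop: keep subtracting $kg$ with $k=a_nb_m^{-1}x^{n-m}$ until the degree drops below $\deg g$. Here is a minimal sketch, assuming coefficients in $\QQ$ (so every nonzero leading coefficient is a unit) and coefficient lists stored lowest degree first. The names `strip` and `poly_divmod` are made up for this example.

```python
from fractions import Fraction


def strip(p):
    """Drop zero coefficients from the high-degree end; [] is the zero polynomial."""
    p = list(p)
    while p and p[-1] == 0:
        p.pop()
    return p


def poly_divmod(f, g):
    """Return (q, r) with f = g*q + r and r == [] or deg r < deg g, over Q."""
    r = strip(Fraction(c) for c in f)
    g = strip(Fraction(c) for c in g)
    if not g:
        raise ZeroDivisionError("cannot divide by the zero polynomial")
    q = [Fraction(0)] * max(len(r) - len(g) + 1, 0)
    while len(r) >= len(g):
        shift = len(r) - len(g)      # n - m
        k = r[-1] / g[-1]            # a_n * b_m^{-1}, needs b_m to be a unit
        q[shift] += k
        for j, bj in enumerate(g):
            r[shift + j] -= k * bj   # r <- r - k x^(n-m) g
        r = strip(r)                 # the leading term has cancelled
    return q, r


# Divide x^2 + 1 by 2x + 1: q = -1/4 + (1/2)x, r = 5/4
q, r = poly_divmod([1, 0, 1], [1, 2])
print(q, r)
```

Using `Fraction` keeps the arithmetic exact, so the leading term cancels to exactly zero at every step and the loop is guaranteed to terminate.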

Corollary: The division algorithm above produces a unique solution $q,r$.

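To see this, towards contradiction assume we have two distinct solutions $q,r$ and $q',r'$. Then $f=gq+r=gq'+r'$ implies $g(q-q')=r'-r$. If $q=q'$ then $r=r'$ as well, so distinct solutions must have $q\ne q'$.

We know that $r$ and $r'$ are each either zero or of degree $<\deg g$, and therefore the RHS $r'-r$ is either zero or has degree $<\deg g$.
Since the leading coefficient of $g$ is a unit (so not a zero divisor) and $q-q'\ne 0$, the LHS $g(q-q')$ is nonzero with degree $\deg g+\deg(q-q')\ge\deg g$.
Since the LHS and RHS should be equal, this is a contradiction.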

In summary, we always have addition, subtraction, and product defined for $R[x]$ . But you can only divide by polynomials whose leading coefficient is a unit in $R$ . This means polynomial rings in general don’t admit a division algorithm, because they have nonzero polynomials that you can’t divide by. Rings that do admit such a division algorithm are known as Euclidean domains.

Theorem: If $R$ is a field, then $R[x]$ is a Euclidean domain.

You can only divide by nonzero polynomials in $R[x]$ whose leading coefficient is a unit in $R$ , but every leading coefficient is a unit when $R$ is a field. Thus you can divide by any nonzero polynomial in $R[x]$ , making $R[x]$ a Euclidean domain.

Corollary: For any two polynomials $f,g\in R[x]$ where $g$ is a nonzero monic polynomial (a polynomial whose leading coefficient is $1$), we can find unique polynomials $q,r$ (where $\deg r<\deg g$ or $r=0$) such that $f=gq+r$.

Following from the division algorithm, you can only divide by polynomials whose leading coefficient is a unit, and $1$ is always a unit. Therefore you can always divide by monic polynomials.

Corollary: Same, but for nonzero polynomials whose leading coefficient is a unit.

Theorem: Every field is a Euclidean domain.

Since every nonzero element in a field is a unit, you can already divide by every nonzero element $b$ in a field: $a=qb$ with $q=ab^{-1}$ and remainder $0$. Therefore all fields have a division algorithm and are Euclidean domains.

Note that the division algorithm in question requires a notion of degree for every nonzero element in the ring. To generalize the notion of degree to non-polynomial rings, this means having some function $\phi:R\setminus\{0\}\to\NN$ mapping each nonzero element to a natural number $\ge 0$ called the Euclidean norm, where $\phi(r)=0$ for all units $r\in R$. The Euclidean norm also needs to satisfy the division condition where for every $a\in R$ and nonzero $b\in R$, there is a division $a=bq+r$ where either $r=0$ or $\phi(r)<\phi(b)$. (We don’t need to mention the norm $\phi$ in the above proof, since $r=0$ in that proof.)

Theorem: If a Euclidean norm $\phi$ exists in an integral domain $R$, then a division algorithm exists in $R$ (making $R$ a Euclidean domain).

The division condition on the Euclidean norm requires that for every $a\in R$ and nonzero $b\in R$, we have $a=qb+r$ where either $r=0$ or $\phi(r)<\phi(b)$. This is exactly the required division.

Then in general, a Euclidean domain is an integral domain for which a Euclidean norm is defined.

Remember that for polynomial rings $R[x]$ that are not Euclidean domains, you can only divide by polynomials whose leading coefficient is a unit in $R$ . Then:

Theorem: Let $f$ and $g$ be nonzero monic polynomials, each of which divides the other. Then $f=g$.

If $f$ and $g$ divide each other, we have $f=gq$ and $g=fq'$ .
$\deg f=\deg gq$ implies $\deg f\ge\deg g$ .
$\deg g=\deg fq'$ implies $\deg g\ge\deg f$ .
Therefore $\deg f=\deg g$ , which implies that $q,q'$ are constant polynomials that don’t affect the degree of the product.
The fact that both $f,g$ are monic shows that the leading coefficient is $1$ before and after multiplying by $q,q'$, which implies that $q,q'$ are both $1$.
Therefore $f=g$ .

This last theorem actually applies in all integral domains. Let’s modify the statement a bit:

Theorem: Let $f$ and $g$ be nonzero elements in an integral domain $R$. Each of $f$ and $g$ generates the other iff they differ by a unit.

$f$ and $g$ being nonzero in an integral domain implies that neither of them are zero divisors.
If $f$ and $g$ generate each other, we have $f=gq$ and $g=fq'$ , and therefore $f=fqq'$ .
Then $0=f(qq'-1)$ where $f$ is not a zero divisor, implying $qq'-1=0$ .
This means $qq'=1$ , i.e. $q,q'$ are units.
Thus $f=gq$ means $f$ differs from $g$ by a unit.
The converse is trivial — if $f,g$ differ by a unit $u$ , then they generate each other via the unit: $f=ug$ and $g=u^{-1}f$ .

In this section, we explore the consequences of evaluating polynomials.

One of the most important things we can do with polynomials is evaluation, in which we substitute $x$ with $a\in R$.

Therefore: we can evaluate a polynomial to an element of $R$ by substituting $x$ with $b$.

Such a mapping is always a ring homomorphism.
The resulting expression $\sum_i a_ib^i$ is a rational expression, made up of elements of $R$ connected with addition, subtraction, multiplication, integer scalar multiplication, powers, and division by units.
Since ring homomorphisms preserve rational expressions, the resulting expression is an element of $R$ .

This substitution is formalized as the evaluation map (a ring homomorphism) $\varphi_b:R[x]\to R$ . Applying the evaluation map to a polynomial $f$ can be denoted $\varphi_b(f)$ , but is more often denoted $f(b)$ , the evaluation of $f$ at $b$ .
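To make the evaluation map concrete, here is a small sketch that computes $f(b)$ and spot-checks the homomorphism property on one example. It uses Horner's rule, a standard evaluation scheme that the text does not mention. The coefficient-list representation (lowest degree first) and the name `evaluate` are assumptions for this example.

```python
def evaluate(f, b):
    """Evaluate the polynomial with coefficient list f (lowest degree first) at b.

    Uses Horner's rule: a_0 + b*(a_1 + b*(a_2 + ...)).
    """
    result = 0
    for a in reversed(f):
        result = result * b + a
    return result


f = [1, 0, 2]            # 1 + 2x^2
g = [3, 1]               # 3 + x
sum_fg = [4, 1, 2]       # f + g
prod_fg = [3, 1, 6, 2]   # f * g = 3 + x + 6x^2 + 2x^3
b = 5

# Evaluation respects addition and multiplication.
print(evaluate(sum_fg, b) == evaluate(f, b) + evaluate(g, b))
print(evaluate(prod_fg, b) == evaluate(f, b) * evaluate(g, b))
```

A single numeric check like this is not a proof, of course. It only illustrates what "evaluation is a ring homomorphism" means in practice.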

Lemma: The evaluation map is surjective.

Since there is a constant polynomial $r\in R[x]$ for every $r\in R$ , and constant polynomials remain unchanged by the evaluation map, every element of $R$ gets mapped to.

Theorem: $R[x]/(x)\iso R$

The evaluation map $\varphi_0:R[x]\to R$, which evaluates a polynomial at $0$, has kernel $(x)$. To see this, notice that for every $f\in R[x]$, $\varphi_0(f)=f(0)=0$ implies the constant coefficient $a_0$ is $0$, and therefore $\begin{aligned} f(x)&=a_nx^n+\ldots+a_2x^2+a_1x+0\\ f(x)&=a_nx^n+\ldots+a_2x^2+a_1x\\ f(x)&=x(a_nx^{n-1}+\ldots+a_2x+a_1)\\ f(x)&\in (x) \end{aligned}$ Conversely, every multiple $xg$ of $x$ evaluates to $0\cdot g(0)=0$, so the kernel of $\varphi_0$ is exactly $(x)$.
The first ring isomorphism theorem says $R[x]/\ker\varphi_0\iso\im\varphi_0$. We just showed $\ker\varphi_0=(x)$, and since $\varphi_0$ is surjective, $\im\varphi_0=R$, therefore the above becomes $R[x]/(x)\iso R$.

Remember that you can always divide by a monic polynomial in polynomial rings. We’ll see that division relates to evaluation in multiple ways:

Remainder Theorem: When $f\in R[x]$ is divided by $x-a$, the remainder is $f(a)$.

Since $x-a$ is monic, the division algorithm implies some unique $q,r$ such that $f=(x-a)q+r$, where $r=0$ or $\deg r<\deg(x-a)=1$, so $r$ is a constant. Evaluating at $a$ gives $f(a)=(a-a)q(a)+r=r$. Therefore, the remainder $r$ is $f(a)$.
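The remainder theorem can be checked on a concrete polynomial. The sketch below divides by $x-a$ using synthetic division, a standard shortcut for linear monic divisors that is not part of the text. The function name `divide_by_linear` and the coefficient-list convention (lowest degree first) are assumptions for this example.

```python
def divide_by_linear(f, a):
    """Divide f by (x - a). Returns (q, r) with f = (x - a) q + r, r a constant.

    f and q are coefficient lists, lowest degree first.
    """
    running = []          # quotient coefficients, highest degree first
    carry = 0
    for coeff in reversed(f):
        carry = carry * a + coeff
        running.append(carry)
    # The last running value is the remainder; the rest form the quotient.
    r = running.pop() if running else 0
    return running[::-1], r


# f = x^3 - 2x + 1 divided by x - 2
f = [1, -2, 0, 1]
q, r = divide_by_linear(f, 2)
print(q, r)   # q = 2 + 2x + x^2, r = 5, and indeed f(2) = 8 - 4 + 1 = 5
```

The running values are the same intermediate values Horner's rule produces when evaluating $f(a)$, which is one way to see why the final value is both the remainder and $f(a)$.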

Factor Theorem: In polynomial rings over a field $F[x]$, $f(a)=0$ iff $f=(x-a)q$ for some unique polynomial $q$.

The evaluation $f(a)$ replaces $x$ with $a$ . If $f=(x-a)q$ , then $f(a)=(a-a)q=0q=0$ . Conversely, if $f(a)=0$ , then since $x-a$ is monic, by the remainder theorem $f$ divided by $x-a$ results in $f=(x-a)q+f(a)$ . But $f(a)=0$ , so this becomes $f=(x-a)q$ .

When $f(a)=0$ , then $a$ is called a root of $f$ , and the following are equivalent:

$f(a)=0$
$f=(x-a)q$ for some polynomial $q$
$f\in (x-a)$ , the principal ideal generated by $x-a$

Since the factor theorem lets you factor a polynomial by finding its roots, finding the roots of polynomials is pretty important if you want to factor polynomials.

Theorem: A degree $n$ polynomial has at most $n$ roots.

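This is proved by induction on $n$ using the factor theorem. A nonzero constant (degree $0$) has no roots. If a degree $n$ polynomial $f\in F[x]$ has a root $a$, then $f=(x-a)q$ with $\deg q=n-1$. Any other root $b\ne a$ satisfies $(b-a)q(b)=0$, and since a field has no zero divisors, $q(b)=0$. By induction $q$ has at most $n-1$ roots, so $f$ has at most $n$.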

The theorems below are useful for factoring a polynomial (i.e. find roots) in a given polynomial ring. Here is an important one for integer polynomials $\in\ZZ[x]$ :

Rational Roots Theorem: Let $f=a_0+a_1x+a_2x^2+\ldots+a_nx^n\in\ZZ[x]$. If $a_0\ne 0$ and $a_n\ne 0$, then for every rational root $c/d$ written in lowest terms, $c$ divides the constant $a_0$ and $d$ divides the leading coefficient $a_n$.

Let $c/d$ be a rational root of the polynomial $f=a_0+a_1x+a_2x^2+\ldots+a_nx^n\in\ZZ[x]$ .
Assume $c,d$ are coprime integers – if they are not, divide both $c$ and $d$ by their GCD to make them coprime.
Since $c/d$ is a root of $f$ , we have $f(c/d)=0$ : $a_0+a_1(c/d)+a_2(c/d)^2+\ldots+a_n(c/d)^n=0$
Multiply both sides by $d^n$ : $a_0d^n+a_1d^{n-1}c+a_2d^{n-2}c^2+\ldots+a_nc^n=0$
From here, we can go in two directions. First, isolate the $a_0d^n$ term and factor out $c$ from the other terms: $c(a_1d^{n-1}+a_2d^{n-2}c+\ldots+a_nc^{n-1})=-a_0d^n$ This means $c$ is a factor of $-a_0d^n$ . Since $c,d$ are coprime integers, $c$ must divide $a_0$ .
Second, isolate the $a_nc^n$ term instead and factor out $d$ from the other terms: $d(a_0d^{n-1}+a_1d^{n-2}c+a_2d^{n-3}c^2+\ldots+a_{n-1}c^{n-1})=-a_nc^n$ Similarly, this means $d$ is a factor of $-a_nc^n$ . Since $c,d$ are coprime integers, $d$ must divide $a_n$ .

Corollary: when $f$ is monic, every rational root is an integer that divides $a_0$.

To remember this theorem, you can think about the polynomial $x-c$ which obviously has a root $c/1$ . So the numerator divides the constant $c$ , and the denominator divides the leading coefficient $1$ .
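The Rational Roots Theorem turns root-finding over $\QQ$ into a finite search: list every $\pm c/d$ with $c\mid a_0$ and $d\mid a_n$, then test each candidate. A minimal sketch follows, assuming integer coefficients stored lowest degree first. The helper names `divisors` and `rational_roots` are invented for this example.

```python
from fractions import Fraction


def divisors(n):
    """Positive divisors of a nonzero integer n."""
    n = abs(n)
    return [d for d in range(1, n + 1) if n % d == 0]


def rational_roots(coeffs):
    """All rational roots of a_0 + a_1 x + ... + a_n x^n (a_0 != 0, a_n != 0)."""
    a0, an = coeffs[0], coeffs[-1]
    candidates = {
        Fraction(sign * c, d)
        for c in divisors(a0)
        for d in divisors(an)
        for sign in (1, -1)
    }

    def value(x):
        result = Fraction(0)
        for a in reversed(coeffs):   # Horner evaluation, exact arithmetic
            result = result * x + a
        return result

    return sorted(x for x in candidates if value(x) == 0)


# 2x^3 - 3x^2 - 3x + 2 = (x + 1)(2x - 1)(x - 2)
print(rational_roots([2, -3, -3, 2]))
# x^2 - 2 has no rational roots, matching the irrationality of sqrt(2)
print(rational_roots([-2, 0, 1]))
```

The brute-force divisor search is fine for small coefficients. It is meant to mirror the statement of the theorem, not to be efficient.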

Corollary: $\sqrt{m}$ is irrational ($\notin\QQ$) unless $m$ is the square of an integer.

Try to interpret $\sqrt{m}$ as a rational root ( $\in\QQ$ ) of some polynomial $\in\ZZ[x]$ .
If $\sqrt{m}\in\QQ$ , it would be a rational root $c/d$ of the polynomial $x^2-m$ . We know $\sqrt{m}$ is one such root.
By Rational Roots Theorem, $c\mid m$ and $d\mid 1$ , therefore $d=\pm 1$ . Then $c/d=\pm c$ so the root $\sqrt{m}$ must be some integer $\pm c$ . Therefore $m$ is the square of some integer.

Corollary: $\sqrt[n]{m}\notin\QQ$ unless $m$ is the $n$th power of an integer.

(Same proof as above)
If $\sqrt[n]{m}\in\QQ$ , it would be a rational root $c/d$ of the polynomial $x^n-m$ . We know $\sqrt[n]{m}$ is one such root.
By Rational Roots Theorem, $c\mid m$ and $d\mid 1$ , therefore $d=\pm 1$ . Then $c/d=\pm c$ so the root $\sqrt[n]{m}$ must be some integer $\pm c$ . Therefore $m$ is the $n$ th power of some integer.

In this section, we classify the unique factorization of real and complex polynomials.

All the work involved in factoring complex polynomials rests on this one theorem:

Fundamental Theorem of Algebra (FTA): If $f\in\CC[x]$ is a nonconstant polynomial, then $f$ has a root in $\CC$.

The proof often requires complex analysis or topology. Here is a proof due to Artin, trying not to use many ideas outside of ring theory:

It is a theorem that if you treat evaluation of the polynomial $f\in\CC[x]$ as a map $\CC\to\CC$ , then $f$ is continuous.
Evaluate $f$ at each point on a circle $C_r$ of radius $r$ around the origin of the complex plane. Since $f$ is continuous, the images $f(C_r)$ represent some loop on the complex plane. Each point on the circle $C_r$ can be represented in polar coordinates as $z=re^{i\theta}$ for some angle $\theta$ . Let its corresponding point on the loop $f(C_r)$ be $f(z)$ .
There are two loops we want to consider:
- First, with $r$ approaching $0$ , we know the images $f(C_r)$ are all close to the constant coefficient $c_0$ of $f$ . So $f(C_r)$ is a tiny loop around $c_0$ . We assume $c_0$ is nonzero – if it’s zero then $f(0)=0$ implies $0$ is a root and we are done.
- Second, with $r$ approaching $\infty$, we know the images $f(C_r)$ are very large but also represent some loop. Since for large values the leading term $c_nz^n$ of $f(z)$ dominates the other terms, which we can write as $f(z)-c_nz^n$, we know $|f(z)-c_nz^n|<|c_nz^n|$ and (because $|c_nz^n|=|c_n||r^ne^{in\theta}|=|c_n|r^n$) therefore $|f(z)-c_nz^n|<|c_n|r^n$. This implies the distance between $f(z)$ and the leading term $c_nz^n$ is always less than $|c_n|r^n$, no matter what $\theta$ is. If you imagine $\theta$ increasing, then $c_nz^n=c_nr^ne^{in\theta}$ walks a circle of radius $|c_n|r^n$ around the origin, while $f(z)$ (being continuous) follows at a distance less than $|c_n|r^n$, and therefore the loop it traces must also enclose the origin.
Importantly, the first loop does not enclose the origin, while the second one does. Since $f$ is continuous, varying $r$ within $(0,\infty)$ will continuously vary the corresponding loop between one that doesn’t enclose the origin and one that does. So at some $r$ between $0$ and $\infty$, the resulting loop crosses the origin, and therefore we get a root $f(re^{i\theta})=0$ for some $\theta$.

Corollary (FTA in $\CC[x]$): For every complex polynomial $f\in\CC[x]$, you can write it in the form $f=k\prod_i(x-u_i)$ where $k$ is the leading coefficient and $u_i$ are all the roots.

This is repeated application of the FTA. You apply FTA to find a root $a$ , factor it out with the factor theorem to get $f=(x-a)g$ , and repeat with $g$ until $g$ is a constant polynomial $k$ . Since we’re always factoring out a monic polynomial $x-a$ , the leading coefficient never changes, and so the final $k$ is the leading coefficient of $f$ .

Conjugate Root Theorem: if $a+bi\in\CC$ is a root of a real polynomial $f\in\RR[x]$, then $a-bi$ is also a root.

Since complex conjugation $\overline{x}$ respects addition and multiplication and fixes the real numbers, $f(\overline{a+bi})=\overline{f(a+bi)}$ when $f$ is a real polynomial.
Therefore, if $f(a+bi)=0$, then $f(a-bi)=f(\overline{a+bi})=\overline{f(a+bi)}=\overline{0}=0$, which implies $a-bi$ is also a root of $f$.

Corollary (FTA in $\RR[x]$): For every real polynomial $f\in\RR[x]$, you can write it in the form $f=k\prod_i(x-u_i)\prod_iq_i$ where $k$ is the leading coefficient, $u_i$ are all the real roots, and $q_i$ are all the monic irreducible real quadratics.

Do the same thing you did for complex polynomials, factoring $f$ into a product $f=k\prod_i(x-u_i)$ where the roots $u_i$ are complex.
Since the coefficients of $f$ are real, by the Conjugate Root Theorem, the non-real roots come in conjugate pairs. Then the product of their corresponding factors, $(x-(a+bi))(x-(a-bi))=x^2-2ax+(a^2+b^2)$, is a monic real quadratic, and it is irreducible over $\RR$ because its roots $a\pm bi$ are not real, i.e. it is one of the $q_i$.
Therefore, after combining all complex factors into monic real quadratic factors, $f$ can be written in the form $k\prod_i(x-u_i)\prod_iq_i$ where $q_i$ are the product of each conjugate pair of complex factors within the original $u_i$ .
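As a small worked instance of this pairing (the polynomial $x^4-1$ is chosen here purely for illustration), the conjugate pair $\pm i$ merges into a single real quadratic:

```latex
\begin{aligned}
x^4-1 &= (x-1)(x+1)(x-i)(x+i) && \text{in } \CC[x]\\
      &= (x-1)(x+1)(x^2+1)    && \text{in } \RR[x]
\end{aligned}
```

Here $k=1$, the real roots are $u_1=1$ and $u_2=-1$, and the only quadratic factor is $q_1=x^2+1$, which matches $x^2-2ax+(a^2+b^2)$ with $a=0$ and $b=1$.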

Corollary: all irreducible polynomials in $\RR[x]$ are linear or quadratic (have degree $1$ or $2$ ).

We see that the Fundamental Theorem of Algebra implies that (nonconstant) polynomials in $\CC[x]$ or $\RR[x]$ factor uniquely into a constant times a product of (monic) irreducible factors.

In this section, we examine the ideals of a polynomial ring.

Let’s explore what the ideals $A$ of a polynomial ring $R[x]$ look like.

Therefore: If $R[x]$ is a Euclidean domain, then every ideal in $R[x]$ is principal – generated by a single element, in this case a unique monic polynomial.

If $A$ is the zero ideal, it is generated by $0$ and we are done.
Otherwise, $A$ contains a nonzero polynomial.
We can divide by monic polynomials, but $A$ doesn’t necessarily contain one, unless $R[x]$ is a Euclidean domain – then we can make any polynomial monic by dividing the polynomial by the leading coefficient. Since any product with an element in the ideal $A$ is also in $A$ , $A$ always contains a monic polynomial.
Take a monic polynomial $g$ of minimal degree in $A$. We claim every polynomial $f\in A$ has $g$ as a factor.
Since $g$ is monic, the division algorithm says $f=gq+r$ where either $r=0$ or $\deg r<\deg g$.
Since $r=f-gq$, $r$ must also be in $A$. If $r\ne 0$, we can also make $r$ monic and the result is also in $A$.
But a monic polynomial in $A$ of degree $\deg r<\deg g$ contradicts $g$ being the monic polynomial of minimal degree in $A$. Therefore $r=0$.
This means $f=gq$ . Since $f$ was arbitrary, $g$ generates $A$ .
To show that $g$ uniquely generates $A$, assume that $h$ is another monic generator of $A$. Then $g$ and $h$ generate each other: $g=hq$ and $h=gq'$. But monic polynomials that divide each other are equal, so $g=h$.
- $\deg g=\deg hq$ implies $\deg g\ge\deg h$, and $\deg h=\deg gq'$ implies $\deg h\ge\deg g$. Therefore $\deg g=\deg h$.
- $\deg g=\deg h$ implies that $q,q'$ are constant polynomials that don’t affect the degree of the product. The fact that both $g,h$ are monic (leading coefficient is $1$ before and after multiplying by $q,q'$) implies that $q,q'$ are both $1$. This implies $g=h$.

This means we can write all quotient rings of $F[x]$ ( $F$ a field) as $F[x]/(h)$ , where $h$ is some monic polynomial. Since the generator of every ideal is unique, this implies a bijection between monic polynomials in $F[x]$ and nonzero ideals in $F[x]$ .

When every ideal of a ring is principal, we have a principal ideal domain (PID).

It turns out every Euclidean domain is a PID.

Theorem: All Euclidean domains are PIDs.

Let $R$ be a Euclidean domain with norm $f$ . Then for every element $a\in R$ , we can divide $a$ by nonzero $b\in R$ to get $a=bq+r$ where either $r=0$ or $f(r)<f(b)$ .
We can show that every nonzero ideal $I$ in $R$ is generated by a nonzero element $b\in I$ with the smallest norm $f(b)$. That is, every $a\in I$ can be written $a=bq$ for some $q\in R$. (The zero ideal is generated by $0$.)
This follows from the division algorithm. We have $a=bq+r$, and $r=a-bq$ is in $I$. If $r\ne 0$ then $f(r)<f(b)$, contradicting the minimality of $f(b)$. Therefore $r=0$.

In this section, we determine what it means to quotient in polynomial rings defined over a field $F$ .

In particular, when $F$ is a field, then $F[x]$ is a Euclidean domain and therefore a PID. Let’s explore polynomial rings defined over a field $F$ .

Therefore: The elements of $F[x]/(h)$ are every polynomial in $F[x]$ with degree less than $\deg h$, under the relation $h(x)=0$.

When you quotient by some ideal $\<h\>$ , you’re effectively sending the polynomial $h$ to $0$ and therefore enforcing the relation $h(x)=0$ on the polynomial ring. That means whenever $f=hq$ , $f=0$ .
But in a Euclidean domain, every polynomial $f$ can be divided by $h$: $f=hq+r$ where either $r=0$ or $\deg r<\deg h$. Since $hq$ gets sent to $0$, $f$ is identified with its remainder $r$.
So every element of the quotient is represented by a polynomial of degree less than $\deg h$ (or by zero).

Example: $F[x]/(x^2)\iso$ all linear and constant polynomials in $F[x]$

We identify $x^2$ with $0$, so every term of degree $2$ or more gets sent to $0$, leaving only the polynomials of degree $1$ and $0$ (and the zero polynomial).

Example: $\RR[x]/(x^2+1)\iso\CC$

Here we identify $x^2+1$ with $0$ . We can understand this as $x=\sqrt{-1}=i$ (in $\CC$ ).
Since every resulting polynomial is at most degree $1$ , they are in the form $c_0+c_1x$ .
In other words, the elements of $\RR[x]/(x^2+1)$ are $c_0+c_1i$ .
This is exactly how we write elements $a+bi$ of $\CC$ , so the two rings are isomorphic.
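One way to see the match with $\CC$ is to multiply two representatives $c_0+c_1x$ and reduce with $x^2=-1$. The sketch below does this on pairs `(c0, c1)` and compares the result against Python's built-in `complex` type. The function name `mul_mod_x2_plus_1` is made up for this example.

```python
def mul_mod_x2_plus_1(p, q):
    """Multiply c0 + c1*x by d0 + d1*x in R[x]/(x^2 + 1).

    Elements are pairs (constant coefficient, coefficient of x).
    """
    c0, c1 = p
    d0, d1 = q
    # (c0 + c1 x)(d0 + d1 x) = c0 d0 + (c0 d1 + c1 d0) x + c1 d1 x^2,
    # and x^2 is identified with -1.
    return (c0 * d0 - c1 * d1, c0 * d1 + c1 * d0)


p, q = (1, 2), (3, -1)             # 1 + 2x and 3 - x
product = mul_mod_x2_plus_1(p, q)
z = complex(*p) * complex(*q)      # (1 + 2i)(3 - i)
print(product, z)
```

Both computations give $5+5x$ and $5+5i$ respectively, so the quotient's multiplication rule is the familiar rule for complex numbers.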

Example: $\QQ[x]/(x^3-2)\iso\QQ(\sqrt[3]{2})$

Here we identify $x^3-2$ with $0$ . We can understand this as $x=\sqrt[3]{2}$ (in $\RR$ ).
Since every resulting polynomial is at most degree $2$ , they are in the form $c_0+c_1x+c_2x^2$ .
In other words, the elements of $\QQ[x]/(x^3-2)$ are $c_0+c_1\sqrt[3]{2}+c_2\sqrt[3]{2}^2$.
But this is the same as adjoining an element $\sqrt[3]{2}$ to $\QQ$ .
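For instance, products that overflow past degree $2$ are brought back down using $x^3=2$. The element $1+x^2$ below is an arbitrary choice for illustration:

```latex
\begin{aligned}
(1+x^2)^2 &= 1+2x^2+x^4\\
          &= 1+2x^2+x\cdot x^3\\
          &= 1+2x+2x^2 && \text{since } x^3=2
\end{aligned}
```

Reading $x$ as $\sqrt[3]{2}$, this is the identity $(1+\sqrt[3]{2}^2)^2=1+2\sqrt[3]{2}+2\sqrt[3]{2}^2$ in $\QQ(\sqrt[3]{2})$.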

To generalize these last two examples, when $F$ is a field and $h$ is irreducible, we can take the quotient $F[x]/(h)$ to form a ring isomorphic to $F[c]$ , where $c$ is a solution to $h=0$ (i.e. a root of $h$ ). It turns out the result is a field:

Theorem: $F[x]/(h)$ is a field iff $h$ is irreducible.

Since $F[x]$ is a PID, the result follows directly from the general fact that in a PID, the quotient by a principal ideal $(h)$ is a field iff $h$ is irreducible.
(Note that if $h$ is reducible, then $F[x]/(h)$ sends $h=fg$ to $0$ , meaning there are zero divisors $f,g$ so the result fails to be even an integral domain.)

In other words, the quotient rings of polynomial rings can be used to construct field extensions, which we’ll explore in depth later on.

In this section, we introduce how the derivative of a polynomial helps determine roots.

The derivative $f'$ of the polynomial $f\in F[x]$ can be constructed by taking each term of $f$ ( $a_ix^i$ ), and mapping it to $ia_ix^{i-1}$ , where $i$ is mapped into $F$ using the canonical homomorphism $\ZZ\to F$ (i.e. taking $1+1+\ldots$ but $i$ times.)

The derivative helps us determine when a polynomial has multiple roots. $\alpha$ is a multiple root of $f$ if $f$ has a factor $(x-\alpha)^n$ for some $n\ge 2$ .
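The formal derivative is easy to compute on coefficient lists, and it gives a quick test for multiple roots via the standard criterion (stated here as background, not taken from the text) that $\alpha$ is a multiple root of $f$ exactly when $f(\alpha)=0$ and $f'(\alpha)=0$. A minimal sketch with integer coefficients, lowest degree first, follows. The names `derivative` and `evaluate` are assumptions for this example.

```python
def derivative(f):
    """Formal derivative: a_i x^i maps to i * a_i x^(i-1)."""
    return [i * a for i, a in enumerate(f)][1:]


def evaluate(f, b):
    """Evaluate a coefficient list (lowest degree first) at b."""
    result = 0
    for a in reversed(f):
        result = result * b + a
    return result


# f = (x - 1)^2 (x + 2) = x^3 - 3x + 2
f = [2, -3, 0, 1]
df = derivative(f)                       # 3x^2 - 3

# 1 is a double root: both f and f' vanish there.
print(evaluate(f, 1), evaluate(df, 1))
# -2 is a simple root: f vanishes but f' does not.
print(evaluate(f, -2), evaluate(df, -2))
```

No limits are involved: the derivative here is a purely algebraic operation on coefficients, which is why it makes sense over any field.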

Corollary: If the derivative of an irreducible polynomial $f$ is nonzero, then $f$ and $f'$ share no factors.

Irreducible polynomials $f$ can only share a factor (itself) with either 0 or polynomials of greater or equal degree. Since the derivative $f'$ always has lesser degree, $f'$ has to be zero in order for $f$ and $f'$ to share a factor.
