tinytheorems - about groups - rings - modules - models atom/rss

Introduction to ring theory

.

Questions:


In 1892, David Hilbert, while working with algebraic number theory, coined the term “Zahlring” (number ring) , which was later shortened to “Ring”, and finally translated in English to “ring” to refer to the structure we’re about to introduce.

(R,+,)(R,+,\cdot) is a ring, defined as some underlying set RR whose elements have a notion of addition (++) and multiplication (\cdot). There are various definitions of a ring (see the appendix) but for our purposes we choose the strongest definition, where the following axioms must hold for all rings:

  1. (R,+)(R,+) forms an (additive) abelian group with identity 00 (called the zero element).
  2. (R,)(R,\cdot) forms a (multiplicative) commutative monoid with identity 11 (called unity).
  3. \cdot distributes over ++.

One fact that holds for all rings is the binomial theorem, which we can derive from these axioms. We adopt a common notation for repeated addition and repeated multiplication:

Binomial Theorem: For any r,sRr,s\in R for a ring RR, (r+s)n=k=0n(nk)rksnk(r+s)^n=\sum_{k=0}^n{n\choose k}r^ks^{n-k} for all nonnegative nn.

Proof by induction on nn.

  • Base case n=0n=0: (r+s)0=1=(00)r0s0(r+s)^0=1={0\choose 0}r^0s^0. Note that this only exists since we have a multiplicative identity 11.
  • Inductive case n>0n>0: The inductive hypothesis gives (r+s)n1=k=0n1(n1k)rksn1k(r+s)^{n-1}=\sum_{k=0}^{n-1}{n-1\choose k}r^ks^{n-1-k}. Note that commutativity of the product lets us express every term of this sum as crisjc\cdot r^is^j for some integers i,ji,j and some integer coefficient c=(i+j1i)c={i+j-1\choose i}.
  • Now observe (r+s)n=(r+s)(r+s)n1=r(r+s)n1+s(r+s)n1 by distributivity=ri=0n1(n1i)risn1i+sj=0n1(n1j)rjsn1j by IH=i=0n1(n1i)ri+1sn1i+j=0n1(n1j)rjsnj=i=1n(n1i1)risni+j=0n1(n1j)rjsnj=i=0n(n1i1)risni+j=0n(n1j)rjsnj since (n11)=(n1n)=0=i=0n[(n1i1)+(n1i)]risni=i=0n(ni)risni by Pascal’s identity\begin{aligned} (r+s)^n&=(r+s)(r+s)^{n-1}\\ &=r(r+s)^{n-1}+s(r+s)^{n-1}&\text{ by distributivity}\\ &=r\sum_{i=0}^{n-1}{n-1\choose i}r^is^{n-1-i}+s\sum_{j=0}^{n-1}{n-1\choose j}r^js^{n-1-j}&\text{ by IH}\\ &=\sum_{i=0}^{n-1}{n-1\choose i}r^{i+1}s^{n-1-i}+\sum_{j=0}^{n-1}{n-1\choose j}r^js^{n-j}\\ &=\sum_{i=1}^{n}{n-1\choose i-1}r^is^{n-i}+\sum_{j=0}^{n-1}{n-1\choose j}r^js^{n-j}\\ &=\sum_{i=0}^{n}{n-1\choose i-1}r^is^{n-i}+\sum_{j=0}^n{n-1\choose j}r^js^{n-j}&\text{ since }{\textstyle {n-1\choose -1}={n-1\choose n}=0}\\ &=\sum_{i=0}^{n}\left[{n-1\choose i-1}+{n-1\choose i}\right]r^is^{n-i}\\ &=\sum_{i=0}^{n}{n\choose i}r^is^{n-i}&\text{ by Pascal's identity} \end{aligned}

In this section, we go over some examples of rings.

Possibly the ring that is the most ring of them all is the integers Z\ZZ. In fact the axioms above are basically modeled after integer addition and multiplication, which is why it is very easy to check that Z\ZZ is a ring.

More interestingly, Zp\ZZ_p, the integers mod pp, form a ring. Just as with integers, addition and multiplication work with the elements aˉ\bar{a} of Zp\ZZ_p (called residue classes) where aˉ=bˉ\bar{a}=\bar{b} if a,ba,b differ by a factor of pp.

Other examples include the rationals Q\QQ, the real numbers R\RR, and the complex numbers C\CC. There is also the zero ring (or trivial ring) 00, the ring containing just a zero element 00.

In this section, we study operations on rings.

Just like with groups, we can operate on rings as mathematical objects in their own right.

First, the direct product of two rings R×SR\times S is the same as with groups – the elements of R×SR\times S are all the pairs (rR,sS)(r_R,s_S), with addition and multiplication defined pointwise. (When we have multiple rings like RR and SS, we typically distinguish their elements by adding the ring’s name as a subscript. For example, 1R1_R is the multiplicative identity in RR, and 0S0_S is the additive identity in SS.)

Second, we can adjoin a new element to a ring. Basically this means adding the element to the ring and taking the closure under addition and multiplication. For instance, the ring R[e]R[e] is the result of adjoining echar "338Re\notin R to ring RR. Here are some examples:

We’ll make heavy use of this when we start talking about polynomial rings, but for now it’s just good to keep in mind that we can do this.

In this section, we define the units of a ring.

We mentioned that the integers Z\ZZ are the prototypical example of a ring. Integer addition and multiplication are commutative, we have an additive identity 00 and a multiplicative identity 11, there are additive inverses, and multiplication distributes over addition. So all the ring axioms hold on the integers.

What about multiplicative inverses? Do they exist for the integers? The multiplicative inverse of an integer zz is a value z1z^{-1} such that zz1=1z\cdot z^{-1}=1. But the only integers that can satisfy this are z=1z=1 and z=1z=-1. So it seems like only unit values of the integers can be multiplicative inverses.

Generalizing from integers to arbitrary rings RR, invertible elements of a ring (with respect to multiplication) are called its units. The units of any ring RR form a multiplicative group, denoted R×R^\times or sometimes RR^*. Note that R×R^\times is never a ring, because rings require the additive identity 00, and 00 is never a unit.

Theorem: 00 is never a unit of a ring RR.

To be a unit, we’d have to have 001=10\cdot 0^{-1}=1, but that’s impossible since 00 multiplied by anything is zero.

Theorem: u,vu,v are units iff their product uvuv is a unit.
  • If u,vu,v are units then (uv)(v1u1)=u(vv1)u1=u1u1=1(uv)(v^{-1}u^{-1})=u(vv^{-1})u^{-1}=u1u^{-1}=1 shows that uvuv is a unit.
  • If uvuv is a unit then u(v(uv)1)=(uv)(uv)1=1u(v(uv)^{-1})=(uv)(uv)^{-1}=1 shows uu is a unit, and a symmetric argument shows vv is a unit.

A ring comprised of only zero and units is called a field. We know that all nonzero elements in a field are invertible and have the identity element 11, so the nonzero elements of a field FF actually form a multiplicative group F×F^\times.

Theorem: The multiplicative group F×F^\times of a field FF has order F1|F|-1.

The multiplicative group takes only the units of FF, of which there are F1|F|-1 since everything but zero is a unit in FF.

Fields have many special properties, which we’ll dive into in depth in a different exploration.

In this section, we introduce zero divisors.

Unlike the integers, however, in a ring it is possible for ab=0a\cdot b=0 for nonzero a,ba,b. We call such a,ba,b zero divisors because they effectively divide zero into two nonzero elements.

Zero divisors are generally undesirable, because their presence implies a lack of cancellability in the ring. In other words, when we have ab=aca\cdot b=a\cdot c, we’d like to claim that b=cb=c as a result. But this relies on the fact that the map xaxx\mapsto a\cdot x is injective, which is not the case when aa is a zero divisor, because then both ab=0a\cdot b=0 and a0=0a\cdot 0=0. So you can think of zero divisors as uncancellable elements in a ring.

A ring with no (nonzero) zero divisors is called an integral domain. Essentially, integral domains are exactly the rings that have cancellability, which is desirable. We also enforce the requirement that nonzero elements exist in integral domains, so as a special case, the zero ring 00 is not an integral domain.

Let’s see some examples:

Theorem: Zp\ZZ_p is an integral domain if pp is prime.
  • Since pp is prime, when pabp\mid ab for a,bZpa,b\in\ZZ_p then either pap\mid a or pbp\mid b.
  • In Zp\ZZ_p, this translates to: when ab0 mod pab\equiv 0\mod p, then either a0 mod pa\equiv 0\mod p or b0 mod pb\equiv 0\mod p.
  • So a,ba,b cannot be both zero divisors (ab0 mod p)(ab\equiv 0\mod p) and nonzero (a,bchar "3380 mod p)(a,b\not\equiv 0\mod p).
  • Therefore there are no nonzero zero divisors, making Zp\ZZ_p an integral domain.

Theorem: The direct product of nonzero rings cannot be an integral domain.
  • The result of a direct product of nonzero rings always contains the elements (1,0)(1,0) and (0,1)(0,1).
  • Since (1,0)(0,1)=(0,0)(1,0)(0,1)=(0,0), both are zero divisors.
  • Since there are zero divisors, the direct product is not an integral domain.

Let’s see how zero divisors and units interact.

Theorem: No element can be both a zero divisor and a unit.

A zero divisor aa satisfies ab=0ab=0 for some nonzero bb. But if aa is a unit, then no such bb exists since left-multiplying ab=0ab=0 by a1a^{-1} gives b=0b=0.

Corollary: Every field is an integral domain.

An integral domain must have no nonzero zero divisors. But every nonzero in a field is a unit by definition, and therefore not a zero divisor.

Theorem: Every finite integral domain is a field.
  • Take any nonzero aRa\in R, so that the set {a,a2,a3,}\{a,a^2,a^3,\ldots\} is not all zeros.
  • Since we’re in a finite ring, {a,a2,a3,}\{a,a^2,a^3,\ldots\} eventually repeats, so that there is some ana^n equal to ama^m where n>mn>m.
  • Since we’re in an integral domain, we can cancel ama^m from both sides of an=ama^n=a^m, producing anm=1a^{n-m}=1 since n>mn>m.
  • Note that nm>0n-m>0, so this can be rewritten as aanm1=1a\cdot a^{n-m-1}=1, proving that aa is a unit.
  • Since every nonzero aRa\in R is a unit, RR must be a field.

In this section, we introduce idempotents.

An idempotent in a ring RR is an element eRe\in R such that e2=ee^2=e, and therefore ek=ee^k=e for all k1k\ge 1. For every ring, 00 and 11 are trivially idempotent since 02=00^2=0 and 12=11^2=1.

Theorem: Every nontrivial idempotent is a zero divisor.

For idempotents ee that aren’t 00 or 11, we can show that since e2=ee^2=e, we have e2e=0e^2-e=0 and therefore e(e1)=0e(e-1)=0 by distributivity. Since echar "338=1e\ne 1 implies e1e-1 is nonzero, and echar "338=0e\ne 0, ee and e1e-1 must both be zero divisors.

Therefore the nontrivial idempotents are a special subset of zero divisors. This means that if a nontrivial idempotent exists in a ring, the ring is not an integral domain.

Studying the idempotents themselves gives rise to some interesting structures:

Theorem: The idempotents of a ring form a partially ordered set.

A partially ordered set (poset) is a set where a partial order aba\le b is defined for all elements a,ba,b, so that \le satifies

  • reflexivity a.aa\forall a\ldotp a\le a,
  • antisymmetry a,b.abba    a=b\forall a,b\ldotp a\le b\land b\le a\implies a=b, and
  • transitivity a,b,c.abbc    ac\forall a,b,c\ldotp a\le b\land b\le c\implies a\le c.

On the idempotents, define efe\le f iff ef=eef=e or ef=fef=f. The idea is that all factors are “less than” their products. This satisfies the requirements:

  • Reflexivity: ee=e2=eee=e^2=e, therefore eee\le e.
  • Antisymmetry: Assuming ef=eef=e and fe=ffe=f we get e=ef=fe=fe=ef=fe=f.
  • Transitivity: Assuming ef=fef=f end fg=gfg=g we get eg=efg=fg=geg=efg=fg=g.

Thus the idempotents form a poset.

Theorem: The idempotents of a ring form a boolean algebra.

A boolean algebra is a poset together with the operators ¬,,\lnot,\lor,\land corresponding to negation, disjunction, and conjunction respectively, as well as two distinguished elements 00 and 11, all satisfying the following:

  • ¬x=0\lnot x=0 iff x=1x=1, ¬x=1\lnot x=1 iff x=0x=0
  • xy=0x\lor y=0 iff x=y=0x=y=0, otherwise xy=1x\lor y=1
  • xy=1x\land y=1 iff x=y=1x=y=1, otherwise xy=0x\land y=0
  • \lor and \land are commutative
  • \lor and \land are associative

Define:

  • negation ¬e\lnot e as 1e1-e.
  • disjunction efe\lor f as e+fefe+f-ef.
  • conjunction efe\land f as efef.
  • 00 as the additive identity 00.
  • 11 as the multiplicative identity 11.

Then:

  • ¬e=1e\lnot e=1-e, which is 00 if e=1e=1 and 11 if e=0e=0.
  • ef=e+fefe\lor f=e+f-ef, which is 00 if e=f=0e=f=0 and 11 otherwise.
  • ef=efe\land f=ef, which is 11 if e=f=1e=f=1 and 00 otherwise.
  • Commutativity can be shown by observing that both e+fefe+f-ef and efef are unchanged when you swap ee and ff.
  • Associativity of \land comes from associativity of the product in a ring. Associativity of \lor is harder but routine: e(fg)=e(f+gfg)=e+(f+gfg)e(f+gfg)=e+f+gfg(ef+egefg)=e+fef+g(eg+fgefg)=(e+fef)+gg(e+fef)=(e+fef)g=(ef)g\begin{aligned} e\lor (f\lor g) &=e\lor (f+g-fg)\\ &=e+(f+g-fg)-e(f+g-fg)\\ &=e+f+g-fg-(ef+eg-efg)\\ &=e+f-ef+g-(eg+fg-efg)\\ &=(e+f-ef)+g-g(e+f-ef)\\ &=(e+f-ef)\lor g\\ &=(e\lor f)\lor g \end{aligned}

Corollary: Every finite ring contains 2k2^k idempotents for some kk.

The proof of this isn’t really a ring theory proof, so I left it out. But it is a property of boolean algebras that every boolean algebra is isomorphic to the power set of a kk-element set, therefore every finite ring contains 2k2^k idempotents.

If every element in a ring is idempotent, then the whole ring is a boolean algebra, so we call it a boolean ring.

In this section, we introduce nilpotents.

Another special case of zero divisors are those elements that, when raised to a suitable power, become zero. These are elements aa that satisfy k.ak=0\exists k\ldotp a^k=0, and they are called nilpotents. The zero element 00 is always a nilpotent in every ring.

Theorem: A nonzero element ee cannot be both idempotent and nilpotent.

An idempotent element ee is still itself when raised to an arbitrary power ek=ee^k=e. That means that, unless e=0e=0, powers of ee never become zero, therefore ee cannot be nilpotent.

Theorem: If r,sr,s are nilpotent elements of a ring RR, then r+sr+s and rsr-s are also nilpotent.

Say rn=0r^n=0 and sm=0s^m=0. Then using the binomial theorem, (r+s)n+m=k=0n+m(n+mk)rksn+mk(r+s)^{n+m}=\sum_{k=0}^{n+m}{n+m\choose k}r^ks^{n+m-k}. Ignoring the coefficient, notice that each term rksn+mkr^ks^{n+m-k} vanishes, because if knk\ge n then rk=0r^k=0 and if knk\le n then n+mkmn+m-k\ge m thus sn+mk=0s^{n+m-k}=0. Therefore r+sr+s is nilpotent. The same argument works for rsr-s.

Like all zero divisors, nilpotents cannot be units. However, the existence of nilpotents is special since it always implies the existence of units:

Theorem: For every nilpotent aa in a ring RR, the element 1a1-a is a unit of RR.

If ak=0a^k=0 for some kk, then let S=i=0k1aiS=\sum_{i=0}^{k-1}a^i be the sum of all the powers of aa up to ak1a^{k-1}, which is an element of RR. Observe that because ak=0a^k=0, aS=(i=1kai)aS=\left(\sum_{i=1}^{k}a^i\right) is exactly every element of SS except for a0=1a^0=1. Then their difference SaSS-aS must be equal to 1, and by factoring out SS, we have S(1a)=1S(1-a)=1, meaning 1a1-a is a unit of RR.

Corollary: If aa is nilpotent in a ring RR and uu is a unit, u+kau+ka for all kZk\in\ZZ are also units of RR.

For uau-a, use the same argument as above with the series S=i=0k1(au1)iS=\sum_{i=0}^{k-1}(au^{-1})^i. (ua)(u1S)=S(au1)S=1(u-a)(u^{-1}S)=S-(au^{-1})S=1 This implies (ua)a(u-a)-a is a unit as well, and so on, thus u+kau+ka is a unit for all k0k\le 0.

Similarly, for u+au+a, use the series S=i=0k1(au1)iS=\sum_{i=0}^{k-1}(-au^{-1})^i. (u+a)(u1S)=S(au1)S=1(u+a)(u^{-1}S)=S-(-au^{-1})S=1 This implies (u+a)+a(u+a)+a is a unit as well, and so on, thus u+kau+ka is a unit for all k0k\ge 0.

Corollary: u+bu+b is a unit for every unit uu and nilpotent bb.
  • Assume bk=0b^k=0. Then note that u+b=u+uu1b=u(1+u1b)=u(1r)u+b=u+uu^{-1}b=u(1+u^{-1}b)=u(1-r) where r=u1br=-u^{-1}b is also nilpotent: rk=ukbk=0r^k=-u^{-k}b^k=0.
  • Since rr is nilpotent, 1r1-r is a unit, so u(1r)=u+bu(1-r)=u+b is a unit.

In this section, we introduce the characteristic of a ring.

The above proof that kZ.u+ka\forall k\in\ZZ\ldotp u+ka is a unit seems to imply that every nilpotent can produce an infinite number of units. However, this isn’t always true — some rings are finite. Can you think of one?

You might have come up with the ring of integers mod nn, denoted Zn\ZZ_n. Specifically let’s try Z8\ZZ_8, so that 22 is a nilpotent element because 23=02^3=0 in Z8\ZZ_8. Using our formula 1k21-k2, this implies that 1,3,5,71,3,5,7 are units of the ring. But because 1+2+2+2+2=11+2+2+2+2=1, the ring “loops back” on itself at some point, so we don’t get any additional units.

This special ring property where addition “loops back” is called the characteristic. The characteristic of a ring RR is the number of times you have to add 11 to itself before you get 00. This makes sense in the ring of integers mod nn, because in that ring, adding 11 to itself nn times gives 00. So Zn\ZZ_n has characteristic nn, and we write char Zn=n\char\ZZ_n=n. For rings like the integers Z\ZZ, however, no amount of adding 11 to itself will give 00, so for those rings the characteristic is defined to be 00. Thus char Z=0\char\ZZ=0. The idea is that the only way to get 00 is to add 11 zero times to itself.

Since most rings we work with have characteristic 00, we will only mention characteristic when it matters. For instance, it turns out that characteristic 22 rings are particularly strange. Here’s an example of why.

Theorem: In a characteristic 22 ring RR, addition is the same as subtraction.

Since a+a=2a=0a+a=2a=0 for every aRa\in R, every element in RR is its own additive inverse: a=aa=-a Thus a+b=ab=a+b=aba+b=a-b=-a+b=-a-b

While rings can be of any nonnegative characteristic in general, this is not true of integral domains.

Theorem: The characteristic of an integral domain is either 00 or a prime number pp.

(ab)1=ab=0(ab)1=ab=0 is precisely the requirement for the characteristic to be a composite number abab. But since integral domains have no nonzero zero divisors, it’s not possible that ab=0ab=0 for nonzero a,ba,b. Therefore, the characteristic is either prime or zero.

Corollary: The characteristic of a field is either 00 or a prime number pp.

Important note: Recall that when we adjoin an element from one ring to another, we take the additive and multiplicative closure of the result. For closure to make sense, the two rings must be of the same characteristic. So you can only adjoin elements of one ring to another if they have the same characteristic.

In this section, we talk about subrings.

A subring is a subset of elements of a ring RR that satisfies the ring axioms. Additionally, it must includes 11 as the multiplicative identity. This is enough to show it includes 00 as the additive identity as well, since 11=01-1=0.

This last requirement is curious, but it’s certainly possible for RR to have subsets that are rings with a different multiplicative identity. They’re just not considered subrings.

Theorem: Every ring RR of characteristic 00 includes a subring isomorphic to the integers Z\ZZ.

We can define a correspondence between the integers and a subring of RR. Assign every integer nZn\in\ZZ to n1n1, which is 11 added to itself nn times. Since char R=0\char R=0, each n1n1 is a unique element in RR. Then we know that the subset of these n1n1 elements is a ring, because we can use the ring Z\ZZ to define addition and multiplication on the nn part of these elements. Thus RR includes Z\ZZ as a subring.

Theorem: The intersection of two subrings is a subring of both.
  • The intersection is an additive group, since both subrings are additive groups, and the intersection of additive groups is also an additive group.
  • The intersection has 11 from the original ring, since that’s present in both subrings.
  • The intersection is closed under product, since both subrings are closed under product.
  • The intersection inherits the multiplicative identity and distributive laws from the original ring.
  • Since the intersection is a subset of both given subrings, is an additive group, contains 11, is closed under product, and satisfies identity and distributive laws, it is a subring of both.

Theorem: Every subring of an integral domain is an integral domain.

Since integral domains don’t have zero divisors, none of its subrings can have zero divisors, so every subring is also an integral domain.

Appendix A

Our earlier definition actually breaks down into four ring axioms:

  1. (R,+)(R,+) forms an (additive) group with zero element 00.
  2. (R,)(R,\cdot) forms a (multiplicative) monoid with unity (identity) 11.
  3. \cdot distributes over ++, and this actually implies ++ is commutative, so (R,+)(R,+) must be an abelian group.
  4. \cdot is also commutative.

While we will assume all of these axioms hold for a ring, one might define more general rings by relaxing these axioms. For completeness, here are the names for some ring variants:

So what we call rings are technically what some people call unital commutative rings. We won’t mention these other names very much since having to deal with non-unital or non-commutative rings introduces a lot of complexity, and I’d rather get into that complexity much later.

< Back to category Introduction to ring theory (permalink)
Exploration 1: Rings and ideals