tinytheorems - about groups - rings - modules - models atom/rss

Exploration 1: Commutativity

.

Questions:


When taking products of group elements, like ghgh, the ordering matters. In other words, gh=hggh=hg is not true in general, unless gg or hh is the identity element ee. We saw this with the symmetric group, where the ordering of composing permutations matters.

In this exploration, we’ll look at all the ways gh=hggh=hg can actually be true. That is, we’ll explore how we can actually make gg commute with hh.

In this section, we look at elements that commute with everything.

Every group has an identity element ee, and the identity element always commutes with any element: eg=g=geeg=g=ge. In the worst case, the identity element is the only element that commutes with every element.

On the flipside, abelian groups are exactly those groups where every element commutes with every element. So at maximum, every element commutes with every element. The cyclic group is a good example of this.

When some given element gg commutes with every element in the group, we say that gg is central. The central elements of a group GG are collectively referred to as the center of GG, written Z(G)Z(G). As discussed, the center is always in between the minimum “just the identity element” (a trivial center) and the maximum “every element” (in the case of abelian groups.)

Example: The group of integers under addition is abelian.

Integer addition is commutative, so g+h=h+gg+h=h+g for all g,hZg,h\in\ZZ.

Example: The symmetric group with n3n\ge 3 elements has a trivial center.
  • To be in the center, a permutation σ\sigma must commute with every permutation.
  • In particular, σ\sigma must commute with (a b)(a~b), so we have σ(a b)=(a b)σ\sigma(a~b)=(a~b)\sigma, which implies σ\sigma either fixes a,ba,b or swaps a,ba,b.
  • Using the same logic for (b c)(b~c), σ\sigma either fixes b,cb,c or swaps b,cb,c. But σ\sigma can’t swap b,cb,c since that would mean σ\sigma neither fixes nor swaps a,ba,b (using the assumption that n3n\ge 3 so that a,b,ca,b,c are distinct.) So the only option is that σ\sigma fixes a,b,ca,b,c.
  • Applying this argument inductively to all elements a,b,c,d,a,b,c,d,\ldots we show that σ\sigma must fix every element in order to commute with all these 22-cycles. Therefore, σ\sigma can only be the identity permutation.

Example: The center of the group of invertible n×nn\times n matrices under matrix multiplication are exactly scalar multiples of the identity matrix.
  • Let ZZ be a central matrix, so that it commutes with every invertible matrix in the group.
  • In particular, it commutes with the matrix E=I+EijE=I+E_{ij} (where ichar "338=ji\ne j), which is the identity matrix except with a 11 at an off-diagonal entry at row ii column jj. For example: E=[100011001](for n=3,i=2,j=3)E=\left[\begin{matrix}1&0&0\\0&1&1\\0&0&1\end{matrix}\right]\quad(\text{for }n=3,i=2,j=3) This matrix EE is a member of our group because it is invertible (it is triangular and therefore det(E)\det(E) is the product of the diagonal, which is 11, thus invertible.)
  • Note that by matrix multiplication, (ZE)ik(ZE)_{ik} is equal to the inner product of (the iith row of ZZ) and (the kkth column of EE), which by construction is exactly (ZE)ik={Zik+0 for kchar "338=jZik+Zii for k=j(ZE)_{ik}=\begin{cases}Z_{ik}+{\color{red}0}&\text{ for }k\ne j\\Z_{ik}+{\color{red}Z_{ii}}&\text{ for }k=j\end{cases} ZE=Z(I+Eij)=Z+[Z1,1Z1,2Z1,3Z2,1Z2,2Z2,3Z3,1Z3,2Z3,3][000001000]=Z+[00Z1,200Z2,200Z3,2]ZE=Z(I+E_{ij})=Z+\left[\begin{matrix}Z_{1,1}&Z_{1,2}&Z_{1,3}\\Z_{2,1}&Z_{2,2}&Z_{2,3}\\Z_{3,1}&Z_{3,2}&Z_{3,3}\end{matrix}\right]\left[\begin{matrix}0&0&0\\0&0&1\\0&0&0\end{matrix}\right]=Z+\left[\begin{matrix}0&0&Z_{1,2}\\{\color{red}0}&{\color{red}0}&{\color{red}Z_{2,2}}\\0&0&Z_{3,2}\end{matrix}\right] Similarly, (EZ)kj={Zkj+0 for kchar "338=iZkj+Zjj for k=i(EZ)_{kj}=\begin{cases}Z_{kj}+{\color{blue}0}&\text{ for }k\ne i\\Z_{kj}+{\color{blue}Z_{jj}}&\text{ for }k=i\end{cases} EZ=(I+Eij)Z=Z+[000001000][Z1,1Z1,2Z1,3Z2,1Z2,2Z2,3Z3,1Z3,2Z3,3]=Z+[000Z3,1Z3,2Z3,3000]EZ=(I+E_{ij})Z=Z+\left[\begin{matrix}0&0&0\\0&0&1\\0&0&0\end{matrix}\right]\left[\begin{matrix}Z_{1,1}&Z_{1,2}&Z_{1,3}\\Z_{2,1}&Z_{2,2}&Z_{2,3}\\Z_{3,1}&Z_{3,2}&Z_{3,3}\end{matrix}\right]=Z+\left[\begin{matrix}0&0&{\color{blue}0}\\Z_{3,1}&Z_{3,2}&{\color{blue}Z_{3,3}}\\0&0&{\color{blue}0}\end{matrix}\right]
  • Since ZE=EZZE=EZ due to ZZ being in the center, equate the above at index i,ji,j to obtain (ZE)ij=(EZ)ijZij+Zii=Zij+ZjjZii=Zjj\begin{aligned} (ZE)_{ij}&=(EZ)_{ij}\\ Z_{ij}+{\color{red}Z_{ii}}&=Z_{ij}+{\color{blue}Z_{jj}}\\ {\color{red}Z_{ii}}&={\color{blue}Z_{jj}}\\ \end{aligned} for all ichar "338=ji\ne j. This implies all the diagonal entries of ZZ are equal.
  • As for the off-diagonal entries, note that ZEijZE_{ij} is ZkiZ_{ki} only on column jj and zero otherwise, and EijZE_{ij}Z is ZjkZ_{jk} only on row ii and zero otherwise. That means for kchar "338=ik\ne i, (ZE)kj=(EZ)kj    Zkj+Zki=Zkj+0    Zki=0(ZE)_{kj}=(EZ)_{kj}\implies Z_{kj}+Z_{ki}=Z_{kj}+0\implies Z_{ki}=0 and for kchar "338=jk\ne j, (ZE)ik=(EZ)ik    Zik+0=Zik+Zjk    Zjk=0(ZE)_{ik}=(EZ)_{ik}\implies Z_{ik}+0=Z_{ik}+Z_{jk}\implies Z_{jk}=0 meaning that any off-diagonal entry of ZZ must be zero, i.e. ZZ is diagonal.
  • Since ZZ is diagonal with every diagonal entry equal, ZZ must be a scalar multiple of the identity matrix λI\lambda I. So every central matrix must be in the form λI\lambda I.
  • Conversely, all matrices in the form λI\lambda I are central because (λI)M=λMI=M(λI)(\lambda I)M=\lambda MI=M(\lambda I) for an arbitrary matrix MM.
  • Therefore the center of the group of invertible matrices consists of exactly the scalar multiples of the identity matrix.

In this section, we learn about subsets that commute with everything.

Being a central element is a rather strong property. A central element must commute with every element.

Instead of considering single elements that commute with all elements in a group, consider subsets that commute with all elements in a group. Let SGS\subseteq G be a subset of GG, and let SgSg (resp. gSgS) represent the result of right-multiplying (resp. left-multiplying) all of SS by some gGg\in G. Then if Sg=gSSg=gS for all gGg\in G, we can characterize SS as something like being a “central” subset of GG. How do its properties differ from those of central elements?

Notice that since Sg=gSSg=gS implies S=gSg1S=gSg^{-1} for all gGg\in G, SS has the property that for any of its elements sSs\in S, then the element gsg1gsg^{-1} is also in SS, for every gGg\in G.

This map sgsg1s\mapsto gsg^{-1} is called conjugation by gg. When t=gsg1t=gsg^{-1}, we say that tt is a conjugate of ss by gg.

So our earlier property that Sg=gSSg=gS for all gGg\in G can be rewritten as S=gSg1S=gSg^{-1} for all gGg\in G. Since the idea is that SS is left unchanged after conjugation by arbitrary gg, we call this property invariance under conjugation. Specifically, a set SS or element xx is invariant under conjugation when conjugation by every gGg\in G leaves SS or xx unchanged, i.e. gSg1=SgSg^{-1}=S or gxg1=xgxg^{-1}=x.

Theorem: The central elements are exactly the elements invariant under conjugation.
  • zz is central iff zg=gzzg=gz for all gGg\in G.
  • zz is invariant under conjugation iff z=gzg1z=gzg^{-1} for all gGg\in G.
  • But again, zg=gzzg=gz and z=gzg1z=gzg^{-1} imply each other, so these conditions are equivalent.

Theorem: The subsets SS that commute with every gGg\in G are exactly those that are invariant under conjugation.
  • To commute with every gGg\in G, SS must satisfy Sg=gSSg=gS for all gGg\in G.
  • To be invariant under conjugation, SS must satisfy S=gSg1S=gSg^{-1} for all gGg\in G.
  • But Sg=gSSg=gS and S=gSg1S=gSg^{-1} imply each other, so these conditions are equivalent.

Corollary: The union of two subsets S,TS,T invariant under conjugation is also invariant under conjugation.

Since group product distributes over set union, we have (ST)g=SgTg(S\cup T)g=Sg\cup Tg for all gGg\in G. But since S,TS,T are invariant under conjugation, we have from the previous theorem that both S,TS,T commute with every element gGg\in G, and therefore (ST)g=SgTg=gSgT=g(ST)(S\cup T)g=Sg\cup Tg=gS\cup gT=g(S\cup T) implying that STS\cup T commutes with everything and is therefore invariant under conjugation.

So if we want to study subsets that are “central”, we can think about constructing subsets that are invariant under conjugation. The obvious way to create such subsets is to take every conjugate of some element gGg\in G to arrive at the subset [g][g]. Since every conjugate of gg is in [g][g], [g][g] must be invariant under conjugation. We can prove that more rigorously:

Theorem: The set [g][g] of all conjugates of gGg\in G is invariant under conjugation.
  • Since every element in CC is a conjugate of gg by some element hGh\in G, we can represent every element of [g][g] as hgh1hgh^{-1}.
  • To show that conjugating this element (by arbitrary kGk\in G) gives the element in [g][g], note that k(hgh1)k1=(kh)g(kh)1[g]k(hgh^{-1})k^{-1}=(kh)g(kh)^{-1}\in [g], i.e conjugating an arbitrary element hgh1[g]hgh^{-1}\in [g] by an arbitrary element kGk\in G results in an element (kh)g(kh)1(kh)g(kh)^{-1} in [g][g], thus [g][g] is invariant under conjugation.

Thus every element gGg\in G gives rise to a subset [g]G[g]\subseteq G that contains exactly all the conjugates of gg, which automatically makes it invariant under conjugation.

In this section, we discuss equivalence relations.

Imagine doing this process for every element in GG, so that for every element gGg\in G you can identify a subset [g][g] that gg belongs to.

In fact, every element gg belongs to exactly one subset [g][g]. To see this, note that if g[g]g\in [g] was also in a second subset [h][h], then gg must be a conjugate of hh. But since [g],[h][g],[h] must both be invariant under conjugation (as we just proved), every conjugate of gg must also be in [h][h] (so [g][h][g]\subseteq [h]), and every conjugate of hh must also be in [g][g] (so [h][g][h]\subseteq [g].) This implies [g]=[h][g]=[h].

Then the subsets [g][g] collectively partition the set GG. A partition of a set is defined as any grouping of all elements into subsets such that each element belongs to exactly one subset.

The reason that conjugation gives rise to a partition is due to three essential properties:

So any relation that is transitive, symmetric, and reflexive gives rise to a partition of the underlying set into these subsets [g][g]. Such a relation is called an equivalence relation, often denoted \sim. The partitions [g][g] are called equivalence classes, each containing all elements equivalent to its representative gg by the given equivalence relation \sim.

Like all equivalence relations, conjugation partitions the group into equivalence classes, called conjugacy classes. Each conjugacy class [g][g] contains all elements hgh1hgh^{-1} conjugate to gg.

In this section, we learn how to determine the size of conjugacy classes.

The size of a conjugacy class [g]|[g]| is the number of distinct elements in the form hgh1hgh^{-1} for some hGh\in G. How do we determine the number of distinct conjugates of gg?

To think about this problem, imagine taking the conjugate of gg by every element in GG, so you have potentially G|G| conjugates of gg. Whenever two conjugates of gg are equal, aga1=bgb1aga^{-1}=bgb^{-1}, then they are two ways to write the same conjugate. Note that this condition aga1=bgb1aga^{-1}=bgb^{-1} is trivially an equivalence relation aba\sim b, since it’s based on equality:

NB: This \sim is a different equivalence relation than the one we gave for conjugacy.

Being an equivalence relation, \sim divides GG into equivalence classes, where two elements a,ba,b are in the same equivalence class iff conjugating gg by aa and by bb results in the same element. Since each equivalence class represents a distinct conjugate of gg, the number of distinct conjugates of gg is equal to the number of equivalence classes.

So, how do we find the number of equivalence classes?

Here is the key insight: the equality aga1=bgb1aga^{-1}=bgb^{-1} can be rewritten as (b1a)g=g(b1a)(b^{-1}a)g=g(b^{-1}a) meaning two conjugates of gg are equal every time the element b1ab^{-1}a commutes with gg. Let CG(g)C_G(g) denote the set of such elements in GG that commute with gg, called the centralizer of gg. Then the condition aga1=bgb1aga^{-1}=bgb^{-1} simplifies to the condition b1aCG(g)b^{-1}a\in C_G(g), which further simplifies to abCG(g)a\in bC_G(g). This directly characterizes the equivalence classes: every equivalence class [a][a] under \sim is in the form bCG(g)bC_G(g) for some bGb\in G.

There is a bijection between any two equivalence classes bCG(g)bC_G(g) and aCG(g)aC_G(g). Simply left-multiply bCG(g)bC_G(g) by ab1ab^{-1} to obtain aCG(g)aC_G(g). This is a bijection because there exists an inverse: left-multiplying by ba1ba^{-1}. Why is this important? Because the existence of a bijection between every equivalence class implies that every equivalence class has the same size: CG(g)|C_G(g)|.

If the size of every equivalence class is CG(g)|C_G(g)|, then the number of equivalence classes is G/CG(g)|G|/|C_G(g)|. Since each equivalence class represents a distinct conjugate of gg, we have obtained the number of distinct conjugates of gg: [g]=G/CG(g)|[g]|=|G|/|C_G(g)|

Thus the size of the conjugacy class [g][g] is the order of the group divided by the size of gg’s centralizer. This reduces the problem to finding the size of a centralizer. For example:

Example: The centralizer of every element in an abelian group is the group itself.

Since everything commutes with everything in an abelian group, the centralizer of every element consists of the whole group.

Corollary: Abelian groups are exactly the groups where every conjugacy class is of size G/CG(g)=1|G|/|C_G(g)|=1.

We can also do this for permutations and matrices, but please skip these if the domain is not familiar to you!

Example: The centralizer of a permutation σ\sigma in SnS_n consists of all permutations that preserve the cycle structure of σ\sigma, of which there are jjNjNj!\prod_j j^{N_j}N_j! (where NjN_j denotes the number of disjoint cycles of length jj in σ\sigma.)

Express σ\sigma in disjoint cycle notation so that we can take an arbitrary cycle (a1 a2  an)(a_1~a_2~\ldots~a_n).

If τ\tau is in the centralizer of σ\sigma, then it commutes with σ\sigma. In particular, for each aia_i, we have τ(σ(ai))=σ(τ(ai))\tau(\sigma(a_i))=\sigma(\tau(a_i)), which simplifies to τ(ai+1)=σ(τ(ai))\tau(a_{i+1})=\sigma(\tau(a_i)). Because this equation implies that σ\sigma takes τ(ai)\tau(a_i) to τ(ai+1)\tau(a_{i+1}), we know that τ(ai)\tau(a_i) and τ(ai+1)\tau(a_{i+1}) must be in the same cycle. Using the same argument with τ1\tau^{-1} (which also commutes with σ\sigma), we know τ1(ai)\tau^{-1}(a_i) and τ1(ai+1)\tau^{-1}(a_{i+1}) must be in the same cycle.

If τ\tau takes elements in the same cycle to the same cycle, and the preimage τ1\tau^{-1} also takes elements of the same cycle to the same cycle, then any τ\tau in the centralizer of σ\sigma simply permutes the elements within each cycle of σ\sigma. This means each cycle in σ\sigma is mapped to a cycle of the same length, thus preserving the cycle structure of σ\sigma.

The number of such τ\tau is the number of such permutations, which is the product of the number of permutations of each cycle of σ\sigma. If there are NjN_j cycles of length jj in σ\sigma, then the number of τ\tau (the size of the centralizer of σ\sigma) is equal to CSn(σ)=jjNjNj!|C_{S_n}(\sigma)|=\prod_j j^{N_j}N_j!

Corollary: The size of the conjugacy class of a permutation σ\sigma in SnS_n is G/CG(σ)=n!/jjNjNj!|G|/|C_G(\sigma)|=n!/\prod_j j^{N_j}N_j! (where NjN_j denotes the number of disjoint cycles of length jj in σ\sigma.)

Example: The centralizer of a diagonalizable matrix MM in the group of invertible n×nn\times n matrices under matrix multiplication consists of all matrices block-diagonalizable in the same eigenbasis of MM, of which there are jFnj2nj\prod_j |F|^{n_j^2-n_j} (where njn_j denotes the multiplicity of the jjth distinct eigenvalue, and F|F| denotes the size of the field that the matrices are defined over.)
  • A diagonalizable matrix MM is one that can be represented as M=PDP1M=PDP^{-1} where DD is a diagonal matrix and the columns of PP form an eigenbasis of MM. To find the centralizer, we’re trying to find all matrices AA that commute with PDP1PDP^{-1}, i.e. APDP1=PDP1AAPDP^{-1}=PDP^{-1}A
  • Let A=PBP1A=PBP^{-1} for some BB. Then we have PBP1PDP1=PDP1PBP1PBP^{-1}PDP^{-1}=PDP^{-1}PBP^{-1} which simplifies to BD=DBBD=DB
  • Thus BB must commute with the diagonal matrix DD. By definition of matrix multiplication, (BD)ij(BD)_{ij} must be equal to kBikDkj=BijDjj\sum_k B_{ik}D_{kj}=B_{ij}D_{jj}, since DD is diagonal meaning DkjD_{kj} is zero everywhere except when k=jk=j. Likewise, (DB)ij=kDikBkj=DiiBij(DB)_{ij}=\sum_k D_{ik}B_{kj}=D_{ii}B_{ij}. Since BD=DBBD=DB we get (BD)ij=(DB)ij    BijDjj=DiiBij    BijDjj=BijDii(BD)_{ij}=(DB)_{ij}\implies B_{ij}D_{jj}=D_{ii}B_{ij}\implies B_{ij}D_{jj}=B_{ij}D_{ii}.
  • BijDjj=BijDiiB_{ij}D_{jj}=B_{ij}D_{ii} is trivially true if i=ji=j. For ichar "338=ji\ne j, it means that BijB_{ij} is nonzero only if Dii=DjjD_{ii}=D_{jj}. This means BB has nonzero entries only within blocks where the eigenvalues DiiD_{ii} are equal. In other words, BB is block-diagonal, with each block corresponding to the eigenspace of each distinct eigenvalue.
  • BB is block-diagonal, so A=PBP1A=PBP^{-1} is block-diagonalizable in the same eigenbasis PP of MM. Therefore, the matrices AA that commute with MM are exactly the ones block-diagonalizable in the same eigenbasis of MM.
  • Thus the size of the centralizer of MM can be computed as the product of the number of possible matrices for each block in the block diagonal form of BB.
  • Then we can count then number of matrices by counting the number of possible BB. Each n×nn\times n block in BB has n2nn^2-n entries (all entries but the diagonal) that can be filled with any value, therefore each n×nn\times n block contributes Fn2n|F|^{n^2-n} possibilities where F|F| denotes the size of the field that the matrices are defined over. Blocks are determined by the multiplicity of eigenvalues of MM, thus we take the product jFnj2nj\prod_j |F|^{n_j^2-n_j} where njn_j denotes the multiplicity of the jjth distinct eigenvalue.

Corollary: The size of the conjugacy class of a diagonalizable matrix MM in the group of invertible n×nn\times n matrices under matrix multiplication is G/CG(M)=(i=0n1FnFi)/(jFnj2nj)|G|/|C_G(M)|=(\prod_{i=0}^{n-1} |F|^n-|F|^i)/(\prod_j |F|^{n_j^2-n_j}) (where njn_j denotes the multiplicity of the jjth distinct eigenvalue, and F|F| denotes the size of the field that the matrices are defined over.)

In this section, we learn about the relationship between conjugacy classes and central elements.

Here’s an easy-to-prove fact:

Theorem: Central elements are exactly the elements that are invariant under conjugation.

Being central means gz=zggz=zg for every gGg\in G, and being invariant under conjugation means z=gzg1z=gzg^{-1} for every gGg\in G. But gz=zg    z=gzg1gz=zg\iff z=gzg^{-1}.

Corollary: The conjugacy class [z][z] of a central element zz is always a singleton set, and the representative zz of a singleton conjugacy class [z][z] is a central element.

The conjugacy class of zz contains all elements conjugate to zz. But since zz is invariant under conjugation, as we just proved, the only element conjugate to zz is zz itself. Therefore zz is in a singleton conjugacy class.

Conversely, if zz is in a singleton conjugacy class, it means only zz is conjugate to zz, therefore zz is invariant under conjugation, therefore central.

In particular, the identity element ee (which is always central) is always in a singleton conjugacy class.

An important result that arises from this is that we can split a group’s conjugacy classes into two types: the central conjugacy classes (which are the singleton conjugacy classes) and the non-central conjugacy classes. This relationship is described below: G=Z(G)+i[gi]|G|=|Z(G)|+\sum_i|[g_i]| where [gi]|[g_i]| denotes the size of the iith non-central conjugacy class. Using what we learned earlier, we can rewrite this as G=Z(G)+iG/CG(gi)|G|=|Z(G)|+\sum_i|G|/|C_G(g_i)|

The result above is known as the class equation of a group, and is used in many number-theoretic proofs about the center. For example:

Theorem: Every group with prime power order (a pp-group) has a non-trivial center, whose order is divisible by pp.
  • pp-groups are of prime power order, so G=pn|G|=p^n where n1n\ge 1.
  • Using the fact that [gi]=G/CG(gi)|[g_i]|=|G|/|C_G(g_i)|, we know that the size of the conjugacy class of gig_i must be a factor of G|G|, i.e. it must be a prime power pkip^{k_i} where kink_i\le n.
  • Then we can write the class equation as pn=Z(G)+ipkip^n=|Z(G)|+\sum_i p^{k_i} Since the LHS is divisible by pp, so must the RHS. The sum ipki\sum_i p^{k_i} is divisible by pp, because each of its terms is divisible by pp. Then in order for the RHS as a whole to be divisible by pp, Z(G)|Z(G)| must also be divisible by pp.
  • But if Z(G)|Z(G)| is divisible by pp then it is not 11, therefore the center is non-trivial.

In this section, we learn about subgroups.

A subgroup of a group GG is a subset of GG that is also a group. In other words, it is a subset of GG that includes identity and is closed under product and inverse.

Every element gg generates a subgroup g\<g\> by taking powers of itself: g={,g3,g2,g1,e,g,g2,g3,}\<g\>=\{\ldots,g^{-3},g^{-2},g^{-1},e,g,g^2,g^3,\ldots\}. We already know this as the cyclic group generated by gg. You can also generate subgroups from multiple elements by taking all products and inverses: g,h={e,g,h,g2,gh,hg,h2,g3,}\<g,h\>=\{e,g,h,g^2,gh,hg,h^2,g^3,\ldots\}

In fact, some of the subsets we’ve been working with are actually subgroups, as proven below:

Theorem: The center Z(G)Z(G) is a subgroup of GG.
  • The identity ee is always central, so eZ(G)e\in Z(G).
  • The product of two central elements g1,g2Z(G)g_1,g_2\in Z(G) is central.
    • Proof: (g1g2)h=g1hg2=h(g1g2)(g_1g_2)h=g_1hg_2=h(g_1g_2) for all hGh\in G, thus g1g2Z(G)g_1g_2\in Z(G).
  • The inverse of a central element gZ(G)g\in Z(G) is central.
    • Proof: gh=hgg1ghg1=g1hgg1hg1=g1h\begin{aligned}gh&=hg\\g^{-1}ghg^{-1}&=g^{-1}hgg^{-1}\\hg^{-1}&=g^{-1}h\end{aligned} for all hCG(g)h\in C_G(g), thus h1CG(g)h^{-1}\in C_G(g).
  • Thus Z(G)Z(G) forms a group, and is a subgroup of GG.

Theorem: The centralizer CG(S)C_G(S) is a subgroup of GG.
  • Recall that to be in the centralizer, every gCG(S)g\in C_G(S) must commute with SS.
  • CG(S)C_G(S) has identity: Clearly eCG(S)e\in C_G(S) because eS=S=SeeS=S=Se.
  • CG(S)C_G(S) has inverses: If gCG(S)g\in C_G(S) then for all hGh\in G we have hg=ghhg=gh, which is the same as saying gh1=h1ggh^{-1}=h^{-1}g, therefore g1CG(S)g^{-1}\in C_G(S).
  • CG(S)C_G(S) has products: If g,hCG(S)g,h\in C_G(S), then for all kGk\in G we have gk=kggk=kg and hk=kghk=kg. Then (gh)k=gkh=k(gh)(gh)k=gkh=k(gh), thus ghCG(S)gh\in C_G(S).
  • Since it has identity, inverses, and products, the centralizer CG(S)C_G(S) is always a subgroup of GG.

Theorem: The intersection of two subgroups is a subgroup of both.
  • Both subgroups contain ee, so their intersection contains ee.
  • Both subgroups are closed under product, so their intersection is closed under product.
    • To make this more clear, this is because the product of two elements that are contained in both subgroups must remain contained in both subgroups by definition of subgroup.
  • Both subgroups are closed under inverses, so their intersection is closed under inverses.
  • Since the intersection is a subset of both given subgroups, contains identity, and is closed under product and inverses, it is a subgroup of both.

Since the center is the intersection of all centralizers of a group, the above theorem provides an alternate proof that the center is a subgroup (using the fact that all centralizers are subgroups.)

In this section, we make a group the least abelian it can be.

To make a group the least abelian it can be, we want it to have the smallest possible center. A group whose center is trivial is called centerless.

One way to make the center of a group smaller is to map every central element zZ(G)z\in Z(G) to the identity element ee. Let π\pi be this map.

The problem is, if az=baz=b, then when we map zz to ee we get a=ba=b under π\pi. So when defining π\pi, not only do we need to map central elements to ee, we need to collapse elements differing by some zz into the same element. How do we achieve this definition? Equivalence relations!

Say aba\sim b when a,ba,b differ by a central element, so that a,ba,b are in the same equivalence class exactly when they differ by a central element (and therefore a=ba=b under π\pi.) Then as usual, \sim partitions GG into equivalence classes [g][g], and as each class represents a distinct element under π\pi, we can define π\pi as sending every element gGg\in G to its corresponding equivalence class π(g)\pi(g). This accomplishes the goal of mapping central elements to an identity element (the equivalence class of ee), while collapsing elements differing by some central elements to the same equivalence class.

Well, that’s the plan, anyways. We don’t yet have any idea what the structure of π(g)\pi(g) is, much less a guarantee that it’s a group.

To ensure it’s a group, note that for a,ba,b to differ by a central element, we can say az=baz=b for some zZ(G)z\in Z(G). Let’s shorten that condition to baZ(G)b\in aZ(G). To explain: aZ(G)aZ(G) contains every element that differs from aa by some central element, and bb is one of them iff a,ba,b differ by some central element.

But the equivalence relation ab    baZ(G)a\sim b\iff b\in aZ(G) just checks for set membership. In other words, we have essentially b[a]    baZ(G)b\in [a]\iff b\in aZ(G), meaning that we can identify aZ(G)aZ(G) itself as the equivalence class [a][a]! Thus the following manipulations are valid: g[a]=g(aZ(G))=(ga)Z(G)=[ga]g[a]=g(aZ(G))=(ga)Z(G)=[ga] [a][b]=aZ(G)bZ(G)=(ab)Z(G)=[ab][a][b]=aZ(G)bZ(G)=(ab)Z(G)=[ab] [g]1=Z(G)1g1=g1Z(G)=[g1][g]^{-1}=Z(G)^{-1}g^{-1}=g^{-1}Z(G)=[g^{-1}] Note that this is only possible since Z(G)Z(G) commutes with every element by definition of being the center, and that Z(G)Z(G) is a subgroup of GG: Z(G)Z(G)=Z(G)Z(G)Z(G)=Z(G) because subgroups are closed under product and Z(G)1=Z(G)Z(G)^{-1}=Z(G) because subgroups are closed under inverse.

Armed with these manipulations and the knowledge that π(a)=[a]=aZ(G)\pi(a)=[a]=aZ(G), let’s ensure that the image of π\pi is indeed a group.

So im π\im\pi is indeed a group!

Above, we used a number of mechanisms to arrive at a new group by sending the center to ee. Here’s a summary:

This overall operation, of sending a subgroup to ee to obtain a new group, is called quotienting the group GG by the subgroup Z(G)Z(G). The resulting group of equivalence classes is known as a quotient group, and in this case, it is denoted G/Z(G)G/Z(G). (reads as “GG mod the center of GG.”)

This quotient group gives rise to one of the most efficient ways to tell if a group is abelian.

Theorem: GG is abelian iff G/Z(G)G/Z(G) is cyclic.

If GG is abelian, then G=Z(G)G=Z(G) by definition, so G/Z(G)G/Z(G) sends every element of G=Z(G)G=Z(G) to the identity ee. Thus G/Z(G)G/Z(G) is trivial, therefore cyclic. To show the other direction, assume G/Z(G)G/Z(G) is cyclic, generated by some generating equivalence class [g]=gZ(G)[g]=gZ(G).

Every element of the cyclic group G/Z(G)G/Z(G) is expressible as a power of the generator (gZ(G))k=gkZ(G)(gZ(G))^k=g^kZ(G). Each of these equivalence classes gkZ(G)g^kZ(G) consists of products of gkg^k with each element in Z(G)Z(G). Since the equivalence classes cover the entire group GG, every element of GG is expressible as some element gkzg^kz where zZ(G)z\in Z(G).

But then two arbitrary elements gk1z1,gk2z2g^{k_1}z_1,g^{k_2}z_2 must commute via (gk1z1)(gk2z2)=z2gk1gk2z1=z2gk1+k2z1=z2gk2gk1z1=(gk2z2)(gk1z1)(g^{k_1}z_1)(g^{k_2}z_2)=z_2g^{k_1}g^{k_2}z_1=z_2g^{k_1+k_2}z_1=z_2g^{k_2}g^{k_1}z_1=(g^{k_2}z_2)(g^{k_1}z_1) thus GG is abelian.

In this section, we formalize quotient groups.

In general, given a subgroup HGH\le G, we can attempt to find the quotient group G/HG/H by doing the following:

Since we want to send all elements of HH to ee, we want to consider two elements equivalent if they differ by any element hHh\in H. The idea is that after mapping HH to ee, every pair of elements that differ by an element in HH now differ by ee, and are therefore the same element. Define \sim so that aba\sim b whenever a,ba,b differ by an element in HH. Like before, we can express this relation more succinctly: ab    a,b differ by some hH    ah=b for some hH    h=a1b for some hH    a1bH    baH\begin{aligned} a\sim b\iff&a,b\text{ differ by some }h\in H\\ \iff&ah=b&\text{ for some }h\in H\\ \iff&h=a^{-1}b&\text{ for some }h\in H\\ \iff&a^{-1}b\in H\\ \iff&b\in aH \end{aligned} the idea being that aHaH contains all elements that differ from aa by a factor of some hHh\in H, and if bb is one of them, then a,ba,b differ by some hh as required.

Is \sim a equivalence relation? Let’s see:

Since \sim is an equivalence relation, we can use it to partition GG into equivalence classes [g][g]. Like before, we note that b[a]b\in [a] iff aba\sim b iff baHb\in aH, so each equivalence class [a][a] is exactly aHaH.

Now, do these equivalence classes form a group? Let’s see:

Here we encounter a problem: unlike the center Z(G)Z(G), the subgroup HH doesn’t commute with all elements, and therefore aHbHaHbH is not equal to abHabH in general.
  • Contains identity: [e][e] satisfies [e][g]=[eg]=[g]=[ge]=[g][e][e][g]=[eg]=[g]=[ge]=[g][e], and is therefore the identity.
  • Contains product: We need to show that the product [a][b][a][b] yields another equivalence class. [a][b]=(aH)(bH)=aHbHchar "338=abH=[ab][a][b]=(aH)(bH)=aHbH\ne abH=[ab]

Recall that a subset SS of GG commutes with all elements of GG exactly when SS is invariant under conjugation. So we need the subgroup HH to be invariant under conjugation, and then we can proceed with showing that the equivalence classes aHaH form a group by letting HH commute with every element.

Therefore: If HGH\le G is invariant under conjugation, then the equivalence classes of GG under the relation aba\sim b iff baHb\in aH form a group.
  • Contains identity: [e][e] satisfies [e][g]=[eg]=[g]=[ge]=[g][e][e][g]=[eg]=[g]=[ge]=[g][e], and is therefore the identity.
  • Contains product: [a][b]=(aH)(bH)=aHbH=abHH=abH=[ab][a][b]=(aH)(bH)=aHbH=abHH=abH=[ab] so the product of two equivalence classes [a],[b][a],[b] is [ab][ab], which exists because abHab\in H by closure under product in HH.
  • Contains inverse: [g]1=(gH)1=H1g1=g1H=[g1][g]^{-1}=(gH)^{-1}=H^{-1}g^{-1}=g^{-1}H=[g^{-1}] so the inverse [g1][g^{-1}] of [g][g] exists because g1Hg^{-1}\in H by closure under inverse in HH.

Theorem: If a subgroup HH has unique order, then it is normal.

First, note that the conjugate of a subgroup gHg1gHg^{-1} is a subgroup of the same order:

  • Contains identity: e=geg1gHg1e=geg^{-1}\in gHg^{-1}
  • Absorbs product: (gHg1)(gHg1)=gHHg1=gHg1(gHg^{-1})(gHg^{-1})=gHHg^{-1}=gHg^{-1}
  • Absorbs inverse: (gHg1)1=gH1g1=gHg1(gHg^{-1})^{-1}=gH^{-1}g^{-1}=gHg^{-1}
  • Same order: the map hghg1h\mapsto ghg^{-1} is a map HgHg1H\to gHg^{-1} and is a bijection since it has an inverse hg1hgh\mapsto g^{-1}hg. Thus H=gHg1|H|=|gHg^{-1}|.

Since the conjugate of a subgroup must have the same order but HH has unique order, HH must be invariant under conjugation, thus normal.

When a subgroup HGH\le G is invariant under conjugation, we call it a normal subgroup, and denote it by HGH\lhd G. And as we’ve shown, sending the elements of HH to ee forms a quotient group G/HG/H if and only if HH is a normal subgroup. The elements of every quotient group, the equivalence classes aHaH, are called cosets of HH. So the above result can be written as:

Theorem: HGH\lhd G iff the cosets of HH form a quotient group G/HG/H.

This enshrines the importance of normal subgroups – they are exactly the subgroups HH that you can send to ee to construct a quotient group. We already know of such a subgroup: the center of a group Z(G)Z(G) is always a normal subgroup of GG. This makes sense since Z(G)Z(G) is made up of elements that commute with every element in GG, so it must be invariant under conjugation. In fact any element of the center must generate a normal subgroup:

Theorem: Central elements generate normal subgroups.

Given that central elements zZ(G)z\in Z(G) are invariant under conjugation, it follows that any product of a central element with itself is also invariant under conjugation. This means the subgroup z\<z\> generated by zz is necessarily invariant under conjugation in GG, and therefore a normal subgroup of GG.

Given HH a subgroup, aHaH is a coset even if HH is not a normal subgroup — being a normal subgroup just means the cosets form a group. Nevertheless, the fact that you can make cosets out of any subgroup leads directly to a very important theorem:

Lagrange’s Theorem: The order of every subgroup divides the order of the group.

For any subgroup HGH\le G, consider its cosets aHaH. There is a bijection between any two cosets aHaH and bHbH: left-multiply aHaH by ab1ab^{-1} to get bHbH. It’s a bijection because the inverse is left-multiplying by ba1ba^{-1}. The existence of a bijection between every coset implies that every coset has the same size, and in particular they are all the same size as eH=HeH=H, which is H|H|.

Since cosets are equivalence classes, they form a partition of the group GG. But since this partitions GG into equally-sized equivalence classes of size H|H|, H|H| must divide G|G|.

Corollary: The order of G/HG/H is G/H|G|/|H|.

We showed earlier that the cosets partition GG. Since the cosets are all the same size, H|H|, the number of such cosets must be G/H|G|/|H|. But the cosets are exactly the elements of the quotient G/HG/H, so the order of G/HG/H is G/H|G|/|H|.

We proved earlier that conjugacy classes are subsets of GG that are invariant under conjugation. Conjugacy classes have a close relationship with normal subgroups:

Theorem: Normal subgroups are exactly subgroups that are a union of conjugacy classes.

It is easy to see that if a subgroup is a union of conjugacy classes, then it is normal, since union preserves invariance under conjugation.

To show the converse, let HGH\lhd G be a normal subgroup of GG and consider an arbitrary element hHh\in H. Since HH must be invariant under conjugation, conjugating hh must give another element in HH. That is, HH contains all conjugates of hh, which means HH contains the conjugacy class [h][h]. Since this is true for arbitrary hHh\in H, HH fully contains the conjugacy class of each of its elements, meaning HH must be a union of conjugacy classes.

This theorem gives us a method to determine whether a subgroup is normal beyond checking for invariance under conjugation. All we need to do is find the conjugacy classes — then you can obtain every normal subgroup of the group by taking unions of conjugacy classes, and checking which unions form a subgroup.

For instance:

Example: {e}\{e\}, Z(G)Z(G), and GG are always normal subgroups of GG.

We already know these three are subgroups of GG. To prove that they are normal, note that they are all unions of conjugacy classes:

  • ee is central and therefore is only conjugate to itself, so it is in a singleton conjugacy class.
  • Z(G)Z(G) consists of central elements and by the same argument, is a union of singleton conjugacy classes.
  • GG trivially contains all the conjugacy classes in GG.

Since all three subgroups are unions of conjugacy classes, they must be normal.

In this section, we make a group abelian.

Is there a way to quotient a group G/HG/H such that the resulting group is abelian?

To do this, we first look at the equation gh=hggh=hg. If we move everything to one side, we get g1h1gh=eg^{-1}h^{-1}gh=e. This tells us that if g1h1ghg^{-1}h^{-1}gh is the identity element ee, then gg and hh commute. The element g1h1ghg^{-1}h^{-1}gh is called the commutator of gg and hh, and is written [g,h][g,h].

(Note that ghg1h1ghg^{-1}h^{-1} is also a commutator of gg and hh. We want to stick to one definition, so let’s use the first one, g1h1ghg^{-1}h^{-1}gh.)

What if the only commutator in the group is the identity ee? That means no matter what gg and hh you pick, their commutator [g,h][g,h] is ee and therefore they commute. So if the only commutator in the group is ee, the group is abelian.

The idea here is that if we can send all the commutators to ee, the resulting group will be abelian. But to do this, we need to make sure that the commutators form a normal subgroup HGH\lhd G, so that the quotient G/HG/H sends HH to ee. Do the commutators form a normal subgroup?

The commutators almost form a subgroup. The identity element is a commutator. The inverse of a commutator [g,h]1[g,h]^{-1} is [h,g][h,g], a commutator. But the group product of two commutators need not be a commutator.

We can solve this problem by having the commutators generate a subgroup, i.e. include all their products as part of the subgroup. Let GG' be the subgroup generated by all the commutators. We call GG' the commutator subgroup, and much later we will see why it is also called the derived subgroup. For now, let’s prove that it is normal:

Theorem: The commutator subgroup GG' is a normal subgroup of GG.

WTS the conjugate of arbitrary [g,h]G[g,h]\in G' is also a commutator. We can prove this directly.

k[g,h]k1the conjugate of [g,h]=k(ghg1h1)k1[g,h] is a commutator ghg1h1=(kgk1)(khk1)(kg1k1)(kh1k1)distribute=(kgk1)(khk1)(kgk1)1(khk1)1=[kgk1,khk1]\begin{aligned} &k[g,h]k^{-1}&\text{the conjugate of }[g,h]\\ =&k(ghg^{-1}h^{-1})k^{-1}&[g,h]\text{ is a commutator }ghg^{-1}h^{-1}\\ =&(kgk^{-1})(khk^{-1})(kg^{-1}k^{-1})(kh^{-1}k^{-1})&\text{distribute}\\ =&(kgk^{-1})(khk^{-1})(kgk^{-1})^{-1}(khk^{-1})^{-1}\\ =&[kgk^{-1},khk^{-1}] \end{aligned}

The conjugate of an arbitrary commutator in GG' is a commutator, so GG' is invariant under conjugation, and therefore a normal subgroup.

Now if we take the quotient G/GG/G', we send all the commutators to ee and are left with a group whose only commutator is ee, which is an abelian group. This quotient GabG/GG^{ab}\equiv G/G' is called the abelianization of GG.

This easily extends to quotienting by any normal subgroup containing GG' as well.

Theorem: A quotient G/NG/N is abelian iff NN includes GG'.
  • (\to) If G/NG/N is abelian, its only commutator is ee, which means any commutator of GG was sent to ee after quotienting by NN, which means NN includes the commutators of GG.
  • (\from) If NN includes all the commutators of GG, it means G/NG/N sends all the commutators to ee, which means G/NG/N is abelian.

In this section, we review abelian and centerless groups.

Abelian groups and centerless groups are both subclasses of groups. Moreover, you can study the abelian part and the centerless part separately; simply quotient by the commutator subgroup GG' to get the abelian part, and quotient by the center Z(G)Z(G) to get the centerless part. Doing both gives you the trivial group {e}\{e\}, since it is the only group that is both abelian and centerless.

In this section, we learn some more ways to find normal subgroups.

So far, we’ve encountered a few normal subgroups that always exist for a group GG:

We also found that a subgroup is normal iff it is a union of conjugacy classes, but that requires finding the conjugacy classes of a group. Are there any other shortcuts?

Here are a few other shortcuts:

Theorem: Every subgroup of an abelian group is normal.

Since gh=hggh=hg implies g=hgh1g=hgh^{-1} for every g,hGg,h\in G, every element of an abelian group is invariant under conjugation, and therefore so is every subgroup. So every subgroup of an abelian group is normal.

Theorem: For finite groups, a subgroup with unique order is normal.
  • Lemma: In a finite group, the conjugate of a subgroup gHg1gHg^{-1} is a subgroup with the same order.
    • We prove closure under group product, an identity element, and inverses exist for gHg1gHg^{-1}.
    • Closure: The product of gh1g1gh_1g^{-1} and gh2g1gh_2g^{-1} is g(h1h2)g1g(h_1h_2)g^{-1}, so gHg1gHg^{-1} is closed under group product.
    • Identity: geg1=egeg^{-1}=e shows HH contains ee.
    • Inverse: (ghg1)1=gh1g1(ghg^{-1})^{-1}=gh^{-1}g^{-1} shows HH has inverses.
    • Therefore gHg1gHg^{-1} is indeed a subgroup. To show that it has the same order, notice that product and inverse above are the same as in the original group, except with the g,g1g,g^{-1} around them. This means all elements in the subgroup were merely renamed, which is a bijection HgHg1H\leftrightarrow gHg^{-1}.
    • Since the group is finite and a bijection exists between the subgroup and its conjugate, they must have the same order.
  • Since HH has unique order, we necessarily have gHg1=HgHg^{-1}=H, which by definition means HH is invariant under conjugation, i.e. normal.

Theorem: The product HKHK of two subgroups H,KGH,K\lhd G is a subgroup iff at least one of H,KH,K is normal.

Let hHh\in H and kKk\in K, then we can show that HKHK is a subgroup:

  • Product is preserved: (hk)(hk)=hk(k1hk)k=(hh)(kk)HK(hk)(h'k')=hk(k^{-1}h''k)k'=(hh'')(kk')\in HK. This uses the fact that kh=hkkh'=h''k, assuming that HH is normal. The same argument can be made with kh=hkkh'=h'k'' when KK is normal.
  • Inverse is preserved: (hk)1=k1h1=k1(khk1)=hk1HK(hk)^{-1}=k^{-1}h^{-1}=k^{-1}(kh'k^{-1})=h'k^{-1}\in HK.
  • Contains identity: Being subgroups, both H,KH,K contain eGe\in G, thus HKHK also contains ee.

Theorem: The product HKHK of two normal subgroups H,KGH,K\lhd G is normal.

Since H,KH,K are normal, HKHK is a subgroup. To show HKHK is normal, consider gGg\in G. The conjugate of hkhk by gg is ghkg1ghkg^{-1}, which is equal to (ghg1)(gkg1)(ghg^{-1})(gkg^{-1}). Since HH and KK are themselves normal, this is equal to some element hkHKh'k'\in HK, so HKHK is invariant under conjugation.

Theorem: The intersection HKH\cap K of two normal subgroups H,KGH,K\lhd G is a normal subgroup of both.

We know that the intersection of two subgroups is a subgroup of both. To show that the subgrouping is a normal subgrouping, note that if all conjugates of an element in HH is in HH, and all conjugates of an element in KK is in KK, then all conjugates of an element in both HH and KK are in both HH and KK.

In this section, we measure the normality of a subgroup.

Recall that in the very beginning, we noted that the set of all elements that commute with every element is somewhere between “just the identity element” and “every element.”

Similarly, the set of all elements that commute with some gg (the centralizer of gg) is somewhere between g\<g\> and “every element.”

We can apply a similar approach to subgroups regarding normality. Recall that a subgroup HGH\le G is normal iff every element gGg\in G commutes with it: gH=HggH=Hg. That is to say, iff the set of elements gGg\in G satisfying gH=HggH=Hg consists of the entire group. Call this set the normalizer of HH, denoted NG(H)N_G(H).

Theorem: Every subgroup HH is a normal subgroup of its normalizer NG(H)N_G(H).

The normalizer NG(H)N_G(H) is the set of elements gGg\in G satisfying gH=HggH=Hg. But that is exactly the condition gHg1=HgHg^{-1}=H, i.e. HH is invariant under conjugation under elements in the normalizer, i.e. HH is normal in NG(H)N_G(H).

Then a subgroup HGH\le G is normal in GG when its normalizer NG(H)N_G(H) is “every element,” i.e. equal to GG. That’s the maximum; what’s the minimum? The set of elements gg such that gH=HggH=Hg must always include the elements hHh\in H, since hH=H=HhhH=H=Hh. So at minimum, NG(H)=HN_G(H)=H meaning HH is self-normalizing, and at maximum, NG(H)=GN_G(H)=G i.e. HGH\lhd G.

Note these parallels between the centralizer and the normalizer of a subgroup:

These statements hold true for arbitrary subsets SGS\subseteq G as well (but replace “normal” with “invariant under conjugation.”) Note that again, the centralizer and normalizer are always subgroups of GG, even when SS is not a subgroup of GG.

Like the centralizer, the normalizer is always a subgroup:

Theorem: The normalizer NG(S)N_G(S) is a subgroup of GG.
  • Identity: Clearly eNG(S)e\in N_G(S) because eS=S=SeeS=S=Se.
  • Inverses: If gNG(S)g\in N_G(S) we have gS=SggS=Sg, which is the same as saying Sg1=g1SSg^{-1}=g^{-1}S, therefore g1NG(S)g^{-1}\in N_G(S).
  • Product: If g,hNG(S)g,h\in N_G(S), we have gS=SggS=Sg and hS=ShhS=Sh. Another way to write that is gSg1=SgSg^{-1}=S and hSh1=ShSh^{-1}=S. Substituting, we get ghSh1g1=SghSh^{-1}g^{-1}=S, which is equivalent to saying (gh)S=S(gh)(gh)S=S(gh), thus ghNG(S)gh\in N_G(S).
  • Since it has identity, inverses, and product, NG(S)N_G(S) is always a subgroup of GG.

Recall that to arrive at the class equation: G=Z(G)+iG/CG(gi)|G|=|Z(G)|+\sum_i|G|/|C_G(g_i)| we had to use the fact that every conjugacy class is in the form bCG(g)bC_G(g) for some bGb\in G. This was because the equality aga1=bgb1aga^{-1}=bgb^{-1} can be rewritten as (b1a)g=g(b1a)(b^{-1}a)g=g(b^{-1}a) meaning two conjugates of gg are equal every time the element b1ab^{-1}a commutes with gg, and the set of these elements b1ab^{-1}a are the centralizer CG(g)C_G(g).

How many conjugates of a subgroup HH are there? In other words, what is the size of the conjugacy class of HH? In this case, we are looking at the equality aHa1=bHb1aHa^{-1}=bHb^{-1} which can be rewritten as (b1a)H=H(b1a)(b^{-1}a)H=H(b^{-1}a) meaning two conjugates of HH are equal every time the element b1ab^{-1}a commutes with HH. The set of these elements b1ab^{-1}a are exactly the normalizer NG(H)N_G(H). So distinct conjugates of HH are in one-to-one correspondence with the cosets of the form b1aNG(H)b^{-1}aN_G(H). Thus by the same logic as before (all cosets are the same size), the number of conjugates of HH is equal to G/NG(H)|G|/|N_G(H)|. TODO make these both theorems

TODO this directly shows that if normalizer is group, then H is invariant under conjugation since there’s only one conjugate

By studying the normalizer, we get an alternate way of identifying normal subgroups. Here’s how.

Given a subgroup HH, consider the partition of GG into cosets that look like gHgH. Also consider the partition of GG into cosets that look like HgHg. To distinguish the two, the cosets gHgH are called left cosets and the cosets HgHg are called right cosets. Note that an element gg is in the normalizer of HH iff gH=HggH=Hg, i.e. the left coset gHgH coincides with the right coset HgHg. If this is true of gg, it must be true of every element in its coset gHgH. Because of this, the normalizer of HH is a union of cosets. More precisely, NG(H)N_G(H) is exactly the union of cosets common to both partitions of GG.

One consquence of the normalizer having elements gNG(H)g\in N_G(H) that satisfy gH=HggH=Hg is that they are exactly the elements that make HH invariant over conjugation: gHg1=HgHg^{-1}=H. In other words, the subgroup HH is normal in the normalizer: HNG(H)H\lhd N_G(H). This means that we can take the quotient NG(H)/HN_G(H)/H.

An interesting fact arises when HH is a pp-subgroup. Recall that pp-groups are groups of prime power order pnp^n. Similarly, a pp-subgroup is a subgroup of prime power order pnp^n.

Theorem: For every pp-subgroup HGH\le G, NG(H)/HG/H(modp)|N_G(H)/H|\equiv |G/H|\pmod p.

Recall that in every quotient, like NG(H)/HN_G(H)/H, TODO left off here

the larger group NG(H)N_G(H) is partitioned into equally sized partitions with

Summary

We’ve learned that:

< Back to category Exploration 1: Commutativity (permalink)
An unconventional intro to group theory Exploration 2: Products