Exploration 3: Representation theory

January 6, 2024.

Questions:

TODO

Last exploration, we studied the properties of individual $R$ -matrices. This time we’re studying $R$ -matrices collectively.

First, we define $\Hom_R(M,N)$ as the set of all $R$ -module homomorphisms $M\to N$ . When $M,N$ are free $R$ -modules, the elements of $\Hom_R(M,N)$ are $R$ -matrices. Generally this means $\Hom_R(M,N)$ is a noncommutative ring, given that matrix multiplication is not commutative in general.

Let’s start by representing groups using subsets of $\Hom$ .

For instance, we can express the group $\CC^\times$ (nonzero complex numbers under multiplication) as $2\times 2$ real matrices. We define a map $\rho:\CC^\times\to\Hom_\RR(\RR^2,\RR^2)$ :

$\rho(1)=\left[\begin{matrix}1&0\\0&1\end{matrix}\right]\quad \rho(i)=\left[\begin{matrix}0&1\\-1&0\end{matrix}\right]\quad \rho(a+bi)=\left[\begin{matrix}a&b\\-b&a\end{matrix}\right]$

so that $\begin{aligned} \rho((a+bi)(c+di)) &=\rho(a+bi)\rho(c+di)\\ &=\left[\begin{matrix}a&b\\-b&a\end{matrix}\right] \left[\begin{matrix}c&d\\-d&c\end{matrix}\right]\\ &=\left[\begin{matrix}ac-bd&ad+bc\\-ad-bc&ac-bd\end{matrix}\right]\\ &=\rho((ac-bd)+(ad+bc)i) \end{aligned}$ and $\begin{aligned} \rho((a+bi)^{-1}) &=(\rho(a+bi))^{-1}\\ &=\left[\begin{matrix}a&b\\-b&a\end{matrix}\right]^{-1}\\ &=\det\left(\left[\begin{matrix}a&b\\-b&a\end{matrix}\right]\right)\adj\left(\left[\begin{matrix}a&b\\-b&a\end{matrix}\right]\right)\\ &=\frac{1}{a^2+b^2}\left[\begin{matrix}a&-b\\b&a\end{matrix}\right]\\ &=\rho\left(\frac{a-bi}{a^2+b^2}\right) \end{aligned}$

Since it preserves product and inverse, these matrices ( homomorphisms) define a group, isomorphic to $\CC^\times$ . This is just one of many encodings of complex numbers into matrices. The key is that $i^2=-I$ , and that $1,i$ don’t interact with each other.

Here we found that $\Hom_\RR(\RR^2,\RR^2)$ was a suitable subgroup of $R$ -module homomorphisms. In general, to represent a group, we must select the subgroup of $\Hom_R(M,N)$ that consists of invertible $R$ -module homomorphisms. In practice, $M$ and $N$ are free modules $R^m$ and $R^n$ , and invertibility implies $m=n$ . Thus, when representing a group, we can choose among the invertible matrices in $\Hom_R(R^n,R^n)$ , also written as $\Aut(R^n)$ , the group of automorphisms on $R^n$ , whose elements we express as invertible $n\times n$ $R$ -matrices.

So we assign to each group element an invertible $n\times n$ $R$ -matrix. A matrix $R$ -representation of a group is a group homomorphism $G\to\Aut(R^n)$ .

Theorem: There is a one-to-one correspondence between

R

-module automorphisms and matrix

R

-representations of a finite group

G

Every representation $G\to\Aut(R^n)$ assigns each $g\in G$ to a specific permutation matrix in $\Aut(R^n)$ .
But this implicitly defines an action of $G$ on $R^n$ . The action is given by matrix muliplication in $R^n$ , where each $g$ acts by multiplying by its corresponding matrix.
Conversely, by Cayley’s theorem every group is isomorphic to a permutation group, and therefore can act on $R^{|G|}$ by permuting the $|G|$ basis $R$ -vectors. But permutations are automorphisms, so this essentially assigns each element $g\in G$ an automorphism over $R^{|G|}$ , i.e. constructing a matrix $R$ -representation $G\to\Aut(R^{|G|})$ .

The representation $G\to\Aut(R^{|G|})$ (mentioned in the above proof) is known as the regular representation of a finite group $G$ . The main idea of this representation is to have $G$ permute the indices of $R^{|G|}$ according to the permutation group isomorphic to $G$ .

In this section, we show the module analog of group actions.

In the same manner that adjoining an indeterminate symbol $x$ to $R$ gives you a polynomial ring $R[x]$ , we can adjoin a whole group to $R$ to get a group ring $R[G]$ . Elements look like $R$ -linear combinations of elements of $G$ : $\sum_{g\in G}r_gg$

Theorem: Every group ring

R[G]

is a free module.

Since every element in $R[G]$ is by definition uniquely represented as a linear combination of elements $g\in G$ , we know that $G$ spans $R[G]$ . Like in polynomial rings, the only linear combination equal to zero is the one where every coefficient is zero, so the elements of $G$ are linearly independent and therefore form a basis of $R[G]$ . Having a basis means the elements of $R[G]$ can be represented by $R$ -vectors of coefficients, and therefore the group ring $R[G]$ is a free module by construction.

Thus a group ring is simultaneously a group and a free $R$ -module.

Theorem: The group ring

R[G]

is exactly the regular representation of

G

over

R

Since $R[G]$ is a free $R$ -module, it has a basis whose elements are precisely the elements of the group $G$ . It is a property of groups that any element $g\in G$ permutes the elements of $g$ by left-multiplication. Permuting the elements is an automorphism on $G$ and therefore an automorphism on $R^|G|$ , thus $R[G]$ is exactly the regular representation $G\to\Aut(R^{|G|})$ .

Therefore: Every group action of

G

on an

R

-module

M

can be encoded by an appropriate group ring

R[G]

A representation $G\to\Aut(M)$ assigns an automorphism on the $R$ -module $M$ to every element $g\in G$ .

Since the automorphisms on $R$ -modules are exactly the linear transformations by definition, every $g\in G$ is assigned a linear transformation on $G$ . In other words, the action of $g$ on $M$ must be linear in $M$ .

But $R[G]$ is exactly every linear combinations of $G$ with respect to $R$ , and therefore contains every possible action of $G$ on $M$ .

This means that every action defined for $G$ on some module $M$ can be linearly extended to an action of $R[G]$ on $M$ . Specifically, if the group action $g\cdot m$ is defined for all $g\in G, m\in M$ , then we define $(\sum_{g\in G}r_gg)\cdot m$ as $\sum_{g\in G}r_g(g\cdot m)$ . Such a module $M$ is known as a $R[G]$ -module, a module over a group ring.

Theorem: Every group ring

R[G]

is an

R[G]

-module.

$R[G]$ is a free $R$ -module, and $G$ has a natural group action on $R[G]$ by left-multiplication in the ring. Thus it is a $R[G]$ -module.

To preserve the $R[G]$ -module structure, $R[G]$ -submodules of $R[G]$ -modules must be $G$ -invariant: just as how $R[G]$ -modules are closed under the action of $R[G]$ , $R[G]$ -submodules $W$ must be closed under the action of $R[G]$ in the sense that for all $g\in R[G],w\in W$ , $gw\in W$ .

Similarly, homomorphisms $\sigma:M\to N$ between $R[G]$ -modules $M,N$ must be $G$ -equivariant: they must commute with the action of $R[G]$ in the sense that for all $g\in R[G],m\in M$ , $g\sigma(m)=\sigma(gm)$ .

Theorem: Given finite

G

, we can always construct a

G

-equivariant version

\tilde{\sigma}:M\to M

of any endomorphism

\sigma:M\to M

of the

R[G]

-module

M

The trick to get $G$ -equivariance is to take a sum over all actions of $G$ : $\tilde{\sigma}(m)=\sum_{g\in G}g^{-1}\sigma(gm)$ which we can do because $G$ is finite. Then we can show that for $h\in G$ : $\begin{aligned} h\tilde{\sigma}(m)&=h\sum_{g\in G}g^{-1}\sigma(gm)\\ &=\sum_{g\in G}hg^{-1}\sigma(gm)\\ &=\sum_{g\in G}h(gh)^{-1}\sigma((gh)m)&\text{ since }g\mapsto gh\text{ simply reorders the sum over }G\\ &=\sum_{g\in G}g^{-1}\sigma(g(hm))\\ &=\tilde{\sigma}(hm) \end{aligned}$ and thus $\tilde{\sigma}$ is $G$ -equivariant.

Theorem: If

G

is finite and

|G|

is a unit in

R

and

M

is a free

R[G]

-module, then every

R[G]

-submodule

W

M

implicitly defines a

G

-equivariant projection

M\to W

Since $M$ is free, it has a basis, and so does the submodule $W$ . We can obtain a projection $\pi:M\to W$ by mapping the basis vectors not in $W$ to basis vectors in $W$ .
Recall that endomorphisms like projections can be made $G$ -equivariant via an averaging trick. To ensure that the resulting endomorphism $\tilde{\pi}:M\to M$ is still a projection, we need to ensure that it is idempotent and that its image is $W$ . It is enough to construct $\tilde{\pi}$ as a map that fixes elements $w\in W$ (thus ensuring idempotence and that the image is at least $W$ ) and maps other elements in $M$ to an element in $W$ (thus ensuring that the image is at most $W$ ).
Given that $|G|$ is a unit in $R$ , the map obtained by the averaging trick can be modified to ensure the above properties: $\tilde{\pi}(m)=\frac{1}{|G|}\sum_{g\in G}g^{-1}\sigma(gm)$
This $\tilde{\pi}$ fixes $w\in W$ : $\begin{aligned} \tilde{\pi}(w)&=\frac{1}{|G|}\sum_{g\in G}g^{-1}\pi(gw)\\ &=\frac{1}{|G|}\sum_{g\in G}g^{-1}\pi(w')&\text{ since }W\text{ is }G\text{-invariant}\\ &=\frac{1}{|G|}\sum_{g\in G}g^{-1}w'&\text{ since }\pi\text{ is a projection onto }W\\ &=\frac{1}{|G|}\sum_{g\in G}w&\text{ since }w'=gw\\ &=\frac{1}{|G|}|G|w\\ &=w \end{aligned}$ and maps all $m\in M$ to an element in $W$ , since $\begin{aligned} &\pi(gm)\in W&\text{ by definition of }\pi\\ \implies&g^{-1}\pi(gm)\in W&\text{ since $W$ is $G$-invariant}\\ \implies&\frac{1}{|G|}\sum_{g\in G}g^{-1}\sigma(gm)\in W&\text{ by linearity of the group action}\\ \implies&\tilde{\pi}(m)\in W \end{aligned}$
Thus $\tilde{\pi}$ as defined is a $G$ -equivariant projection onto $W$ .

In this section, we consider representations as algebraic structures in their own right.

Note that for our representation for $\CC^\times$ in the beginning, each element of $G$ is assigned a distinct element of $\Aut(\RR^n)$ , i.e. the representation is injective. When the representation is injective (i.e. the domain is isomorphic to its image), we call it a faithful representation, and they are the typical ones where each element of $G$ is represented by a distinct matrix in $\Aut(R^n)$ .

There are also non-faithful representations. An example of a very not faithful representation is the trivial representation, which represents every element as the zero element in the zero module. Another is the sign representation: $S_n\to\Aut(\ZZ)$ , which simply takes the sign ( $1$ or $-1$ ) of every permutation in $S_n$ . Another example is $\det\circ\rho$ , where $\rho$ is some matrix representation. In general, a representation is a group homomorphism $G\to\Aut(M)$ (where $M$ is some $R$ -module).

Theorem: A

R[G]

-module

M

fully describes a representation

G\to\Aut(M)

This result comes naturally from the fact that a group ring $R[G]$ encodes all possible linear group actions and therefore its action on a module $M$ models every possible automorphism on the $R$ -modules $M$ , since automorphisms on $R$ -modules are linear by definition.

Theorem: A faithful representation

\rho:G\to\Aut(M)

is one where no

g\in G

acts as the identity action on

V

except for the identity element

e\in G

A faithful representation is injective, i.e. its domain $G$ is isomorphic to its image $\Aut(M)$ . This means that only one element $g\in G$ maps to the identity automorphism, so only one element $g\in G$ acts as identity on $M$ .

Theorem: A faithful representation

\rho

, when represented as a

R[G]

-module

M

, is one where each

x\in R[G]

has a unique action on

M

If two elements $x,y\in R[G]$ have the same action on $M$ , then $x-y$ must be the zero action (sending all elements of $M$ to zero). But $x-y=0$ implies $x=y$ .

The regular representation $\rho:G\to\Aut(R^{|G|})$ (or equivalently, $\rho=R[G]$ ) represents elements of $G$ as automorphisms on the free $R$ -module $R^{|G|}$ (whose elements are $R$ -vectors). It is faithful, since it is composed of distinct permutations of $G$ , so there is always a faithful matrix representation of any group $G$ . Can we do better? Ideally, we want to represent elements of a group $G$ using $R$ -vectors that are perhaps smaller than $|G|$ , without affecting the faithfulness of $\rho$ .

By using the above fact that any representation $\rho$ is equivalent to some $R[G]$ -module $M$ , we can take subrepresentations of $\rho$ as $R[G]$ -submodules of $M$ . To begin, given a representation $\rho:G\to\Aut(M)$ , we can think about quotienting the corresponding $R[G]$ -module $M$ by one of its $R[G]$ -submodules. However, quotienting might create a representation that isn’t faithful. When does quotienting the underlying module $M$ of a faithful representation $\rho:G\to\Aut(M)$ preserve faithfulness?

Therefore: Faithful representations are exactly those where the group action is injective.

Recall that preserving the group action means that every element of the group is identified with a distinct action. In particular, that means there is only one element that behaves like the identity action: the identity element $e\in G$ . But if the representation $\rho:G\to\Aut(R^n)$ maps only $e$ to the identity in $\Aut(R^n)$ , then it is injective, i.e. faithful.

Therefore, quotienting $M$ preserves faithfulness of $\rho$ exactly when the quotient $M/W$ preserves the group action on $M$ . In other words, the submodule $W$ must be $G$ -invariant: it must remain unchanged under the group action of $G$ . But since $R[G]$ -submodules are $G$ -invariant by definition, quotienting by an $R[G]$ -submodule always preserves faithfulness.

As it turns out, by quotienting repeatedly by different $R[G]$ -submodules, we can “factor” $M$ into a direct sum of submodules known as irreducible representations, or irreps.

Theorem: For a finite group

G

, if

W

is a

G

-invariant submodule of the

R[G]

-module

M

, then

M/W

is another

G

-invariant submodule with

M\iso M/W\oplus W

, assuming

\char R\nmid |G|

Recall that if a projection exists on $M$ $M$ , then $M$ is isomorphic to $\ker\sigma\oplus\im\sigma$ . So it is enough to define a projection $\sigma:M\to M$ $σ : M \to M$ where $\ker\sigma\iso M/W$ $ker σ ≅ M / W$ and $\im\sigma=W$ $im σ = W$ . Let’s see what those conditions imply.
- To ensure that $\ker\sigma\iso M/W$ , we need to ensure that every element in $\ker\sigma$ differs by some element in $W$ .
- The requirement $\im\sigma=W$ implies that $\sigma$ maps $M$ to the $G$ -invariant submodule $W$ of $M$ . Thus we need $\sigma$ to be $G$ -equivariant ( $g\sigma(m)=\sigma(gm)$ ) so that it preserves $G$ -invariance.
- Finally, to be a projection, $\sigma$ must be idempotent.
We’ll start with the requirement that $\sigma$ is $G$ -equivariant. In order to get $G$ -equivariance, one trick is to take the sum of all products with $g$ : $\tilde{\sigma}(m)=\sum_{g\in G}gm$ Then we can show that for $h\in G$ : $\begin{aligned} h\tilde{\sigma}(m)&=h\sum_{g\in G}gm\\ &=\sum_{g\in G}hgm&\text{ by linearity in }R[G]\text{-modules}\\ &=\sum_{g\in G}gm&\text{ since }g\mapsto hg\text{ is an automorphism on }G\\ &=\tilde{\sigma}(hm) \end{aligned}$ where $\sum_{g\in G}hgm=\sum_{g\in G}gm$ because the act of multiplying every element of the sum by $h$ just permutes the order of the sum. Therefore, $\tilde{\sigma}$ commutes with the action of $G$ on $M$ , and is therefore $G$ -equivariant.
Next up is ensuring that the elements of $\ker\sigma$ differ by elements of $W$ . One way is to send every $gm$ to $W$ via some projection $\pi:M\to W$ . We redefine $\tilde{\sigma}$ : $\tilde{\sigma}(m)=\sum_{g\in G}\pi(gm)$ Then $\tilde{\sigma}$ is a linear combination of elements of $W$ . This means that $\im\tilde{\sigma}$ is a subset of $W$ , and we can show that two elements of $\ker\tilde{\sigma}$ differ by $\pi(gm)-\pi(gn)\in W$ : $\begin{aligned} \tilde{\sigma}(m)-\tilde{\sigma}(n)&=\sum_{g\in G}\pi(gm)-\sum_{g\in G}\pi(gn)\\ &=\sum_{g\in G}\pi(gm)-\pi(gn) \end{aligned}$
Finally, $\sigma$ must be a projection. Therefore, it must be idempotent, and every element of $W$ must be mapped to. Without losing the previous properties, we can have $\sigma(w)=w$ for every $w\in W$ by taking the inverse action of $G$ , and dividing by $|G|$ (which works since $\char R\nmid |G|$ ): $\sigma(m)=\frac{1}{|G|}\sum_{g\in G}g^{-1}\pi(gm)$ because $\begin{aligned} \sigma(w)&=\frac{1}{|G|}\sum_{g\in G}g^{-1}\pi(gw)\\ &=\frac{1}{|G|}\sum_{g\in G}g^{-1}\pi(w')&\text{ since }W\text{ is }G\text{-invariant}\\ &=\frac{1}{|G|}\sum_{g\in G}g^{-1}w'&\text{ since }\pi\text{ is a projection onto }W\\ &=\frac{1}{|G|}\sum_{g\in G}w&\text{ since }w'=gw\\ &=\frac{1}{|G|}|G|w\\ &=w \end{aligned}$ Therefore $\sigma(w)=w$ for all $w\in W$ , meaning $W\subseteq\im\sigma$ . By definition, $\sigma(m)$ remains a linear combination of elements of $W$ , and therefore $\im\sigma\subseteq W$ . Therefore $\im\sigma=W$ .
Finally, since $\sigma$ is an idempotent endomorphism, we know that $M$ decomposes into $\ker\sigma\oplus\im\sigma$ . Since $\ker\sigma\iso M/W$ and $\im\sigma=W$ by construction, we have $M\iso M/W\oplus W$ .

Maschke’s Theorem: Every representation

\rho

of a finite group

G

(over a field with characteristic not dividing

|G|

) is a direct sum of irreducible representations.

If the $R[G]$ -module $M$ corresponding to the representation $\rho$ doesn’t reduce into irreps, there is a proper $G$ -invariant submodule $W$ of $M$ .
By the above theorem, which we can use since $\char R\nmid |G|$ , there is some complementary $G$ -invariant submodule $W'$ such that $M\iso W\oplus W'$ .
By recursively decomposing proper $G$ -invariant submodules $W$ of each of the factors, you get smaller and smaller submodules. Since $G$ is finite, this process eventually stops when you are left with a direct sum of irreps that is isomorphic to the given representation $\rho$ .

A $R[G]$ module that can be written as a direct sum of simple modules is semisimple. Hence:

Corollary: A $R[G]$ -module is semisimple if $G$ is finite and $\char R\nmid |G|$ .

In this section, we try to find irreps of any matrix representation.

Now that we know by Maschke’s theorem that we can factor any finite group representation into irreps (assuming $\char R\nmid |G|$ ), let’s do so for matrix representations (which are always finite).

A matrix representation $G\to\Aut(R^n)$ is special compared to more general representations $G\to\Aut(M)$ because it implies that the module $M=R^n$ is a free module. This simplifies finding irreps considerably.

In this section, we find an easier way to discover the eigenvalues in an algebraically closed field.

Find the eigenvalues of a given $R$ -matrix, where $R$ is a PID.

I know that the characteristic equation is used to find eigenvalues in vector spaces, which are defined over fields. What happens if we move to the world of modules defined over integral domains?

< Back to category Exploration 3: Representation theory (permalink)
Exploration 2: Module actions Exploration 4: Character theory