In mathematics, the adjoint representation (or adjoint action) of a Lie group G is a way of representing the elements of the group as linear transformations of the group's Lie algebra, considered as a vector space. For example, in the case where G is the Lie group of invertible matrices of size n, GL(n), the Lie algebra is the vector space of all (not necessarily invertible) n-by-n matrices. So in this case the adjoint representation is the vector space of n-by-n matrices $x$, and any element g in GL(n) acts as a linear transformation of this vector space given by conjugation: $x \mapsto g x g^{-1}$.

For any Lie group, this natural representation is obtained by linearizing (i.e. taking the differential of) the action of G on itself by conjugation. The adjoint representation can be defined for linear algebraic groups over arbitrary fields.

## Definition

Let G be a Lie group and let $\mathfrak g$ be its Lie algebra (which we identify with TeG, the tangent space to the identity element in G). Define the map

$\Psi : G \to \mathrm{Aut}(G), \, g \mapsto \Psi_g$

where Aut(G) is the automorphism group of G and the automorphism Ψg is defined by

$\Psi_g(h) = ghg^{-1}\,$

for all h in G. The differential of Ψg at the identity is an automorphism of the Lie algebra $\mathfrak g$. We denote this map by Adg:

$d(\Psi_g)_e=\mathrm{Ad}_g\colon \mathfrak g \to \mathfrak g.$

To say that Adg is a Lie algebra automorphism is to say that Adg is a linear transformation of $\mathfrak g$ that preserves the Lie bracket. The map

$\mathrm{Ad}\colon G \to \mathrm{Aut}(\mathfrak g), \, g \mapsto \mathrm{Ad}_g$

is called the adjoint representation of G. This is indeed a representation of G since $\mathrm{Aut}(\mathfrak g)$ is a closed[1] Lie subgroup of $\mathrm{GL}(\mathfrak g)$ and the above adjoint map is a Lie group homomorphism. Note Ad is a trivial map if G is abelian.

If G is an (immersed) Lie subgroup of the general linear group $GL_n(\mathbb{C})$, then, since the exponential map is the matrix exponential: $\operatorname{exp}(X) = e^X$, taking the derivative of $\Psi_g(\operatorname{exp}(tX)) = ge^{t X}g^{-1}$ at t = 0, one gets: for g in G and X in $\mathfrak{g}$,

$\operatorname{Ad}_g(X) = gX g^{-1}$

where on the right we have the products of matrices.

### Adjoint representation of a Lie algebra

One may always pass from a representation of a Lie group G to a representation of its Lie algebra by taking the derivative at the identity.

Taking the derivative of the adjoint map

$\mathrm{Ad}\colon G \to \mathrm{Aut}(\mathfrak g)$

gives the adjoint representation of the Lie algebra $\mathfrak g$:

$d(\mathrm{Ad})_x: T_x(G) \to T_{Ad(x)}(\mathrm{Aut}(\mathfrak g))$
$\mathrm{ad}\colon \mathfrak g \to \mathrm{Der}(\mathfrak g).$

Here $\mathrm{Der}(\mathfrak g)$ is the Lie algebra of $\mathrm{Aut}(\mathfrak g)$ which may be identified with the derivation algebra of $\mathfrak g$. The adjoint representation of a Lie algebra is related in a fundamental way to the structure of that algebra. In particular, one can show that

$\mathrm{ad}_x(y) = [x,y]\,$

for all $x,y \in \mathfrak g$.[2]

## Examples

• If G is abelian of dimension n, the adjoint representation of G is the trivial n-dimensional representation.
• If G is a matrix Lie group (i.e. a closed subgroup of GL(n,C)), then its Lie algebra is an algebra of n×n matrices with the commutator for a Lie bracket (i.e. a subalgebra of $\mathfrak{gl}_n(\mathbf C)$). In this case, the adjoint map is given by Adg(x) = gxg−1.
• If G is SL(2, R) (real 2×2 matrices with determinant 1), the Lie algebra of G consists of real 2×2 matrices with trace 0. The representation is equivalent to that given by the action of G by linear substitution on the space of binary (i.e., 2 variable) quadratic forms.

## Properties

The following table summarizes the properties of the various maps mentioned in the definition

 $\Psi\colon G \to \mathrm{Aut}(G)\,$ $\Psi_g\colon G \to G\,$ Lie group homomorphism: $\Psi_{gh} = \Psi_g\Psi_h$ Lie group automorphism: $\Psi_g(ab) = \Psi_g(a)\Psi_g(b)$ $(\Psi_g)^{-1} = \Psi_{g^{-1}}$ $\mathrm{Ad}\colon G \to \mathrm{Aut}(\mathfrak g)$ $\mathrm{Ad}_g\colon \mathfrak g \to \mathfrak g$ Lie group homomorphism: $\mathrm{Ad}_{gh} = \mathrm{Ad}_g\mathrm{Ad}_h$ Lie algebra automorphism: $\mathrm{Ad}_g$ is linear $(\mathrm{Ad}_g)^{-1} = \mathrm{Ad}_{g^{-1}}$ $\mathrm{Ad}_g[x,y] = [\mathrm{Ad}_g x,\mathrm{Ad}_g y]$ $\mathrm{ad}\colon \mathfrak g \to \mathrm{Der}(\mathfrak g)$ $\mathrm{ad}_x\colon \mathfrak g \to \mathfrak g$ Lie algebra homomorphism: $\mathrm{ad}$ is linear $\mathrm{ad}_{[x,y]} = [\mathrm{ad}_x,\mathrm{ad}_y]$ Lie algebra derivation: $\mathrm{ad}_x$ is linear $\mathrm{ad}_x[y,z] = [\mathrm{ad}_x y ,z] + [y,\mathrm{ad}_x z]$

The image of G under the adjoint representation is denoted by Ad(G). If G is connected, the kernel of the adjoint representation coincides with the kernel of Ψ which is just the center of G. Therefore the adjoint representation of a connected Lie group G is faithful if and only if G is centerless. More generally, if G is not connected, then the kernel of the adjoint map is the centralizer of the identity component G0 of G. By the first isomorphism theorem we have

$\mathrm{Ad}(G) \cong G/Z_G(G_0).$

Given a finite-dimensional real Lie algebra $\mathfrak{g}$, by Lie's third theorem, there is a connected Lie group $\operatorname{Int}(\mathfrak{g})$ whose Lie algebra is the image of the adjoint representation of $\mathfrak{g}$ (i.e., $\operatorname{Lie}(\operatorname{Int}(\mathfrak{g})) = \operatorname{ad}(\mathfrak{g})$.) It is called the adjoint group of $\mathfrak{g}$.

Now, if $\mathfrak{g}$ is the Lie algebra of a connected Lie group G, then $\operatorname{Int}(\mathfrak{g})$ is the image of the adjoint representation of G: $\operatorname{Int}(\mathfrak{g}) = \operatorname{Ad}(G)$.

## Roots of a semisimple Lie group

If G is semisimple, the non-zero weights of the adjoint representation form a root system. To see how this works, consider the case G = SL(n, R). We can take the group of diagonal matrices diag(t1, ..., tn) as our maximal torus T. Conjugation by an element of T sends

$\begin{bmatrix} a_{11}&a_{12}&\cdots&a_{1n}\\ a_{21}&a_{22}&\cdots&a_{2n}\\ \vdots&\vdots&\ddots&\vdots\\ a_{n1}&a_{n2}&\cdots&a_{nn}\\ \end{bmatrix} \mapsto \begin{bmatrix} a_{11}&t_1t_2^{-1}a_{12}&\cdots&t_1t_n^{-1}a_{1n}\\ t_2t_1^{-1}a_{21}&a_{22}&\cdots&t_2t_n^{-1}a_{2n}\\ \vdots&\vdots&\ddots&\vdots\\ t_nt_1^{-1}a_{n1}&t_nt_2^{-1}a_{n2}&\cdots&a_{nn}\\ \end{bmatrix}.$

Thus, T acts trivially on the diagonal part of the Lie algebra of G and with eigenvectors titj−1 on the various off-diagonal entries. The roots of G are the weights diag(t1, ..., tn) → titj−1. This accounts for the standard description of the root system of G = SLn(R) as the set of vectors of the form eiej.

### Example SL(2, R)

Let us compute the root system for one of the simplest cases of Lie Groups. Let us consider the group SL(2, R) of two dimensional matrices with determinant 1. This consists of the set of matrices of the form:

$\begin{bmatrix} a & b\\ c & d\\ \end{bmatrix}$

with a, b, c, d real and ad − bc = 1.

A maximal compact connected abelian Lie subgroup, or maximal torus T, is given by the subset of all matrices of the form

$\begin{bmatrix} t_1 & 0\\ 0 & t_2\\ \end{bmatrix} = \begin{bmatrix} t_1 & 0\\ 0 & 1/t_1\\ \end{bmatrix} = \begin{bmatrix} \exp(\theta) & 0 \\ 0 & \exp(-\theta) \\ \end{bmatrix}$

with $t_1 t_2 = 1$. The Lie algebra of the maximal torus is the Cartan subalgebra consisting of the matrices

$\begin{bmatrix} \theta & 0\\ 0 & -\theta \\ \end{bmatrix} = \theta\begin{bmatrix} 1 & 0\\ 0 & 0 \\ \end{bmatrix}-\theta\begin{bmatrix} 0 & 0\\ 0 & 1 \\ \end{bmatrix} = \theta(e_1-e_2).$

If we conjugate an element of SL(2, R) by an element of the maximal torus we obtain

$\begin{bmatrix} t_1 & 0\\ 0 & 1/t_1\\ \end{bmatrix} \begin{bmatrix} a & b\\ c & d\\ \end{bmatrix} \begin{bmatrix} 1/t_1 & 0\\ 0 & t_1\\ \end{bmatrix} = \begin{bmatrix} a t_1 & b t_1 \\ c / t_1 & d / t_1\\ \end{bmatrix} \begin{bmatrix} 1 / t_1 & 0\\ 0 & t_1\\ \end{bmatrix} = \begin{bmatrix} a & b t_1^2\\ c t_1^{-2} & d\\ \end{bmatrix}$

The matrices

$\begin{bmatrix} 1 & 0\\ 0 & 0\\ \end{bmatrix} \begin{bmatrix} 0 & 0\\ 0 & 1\\ \end{bmatrix} \begin{bmatrix} 0 & 1\\ 0 & 0\\ \end{bmatrix} \begin{bmatrix} 0 & 0\\ 1 & 0\\ \end{bmatrix}$

are then 'eigenvectors' of the conjugation operation with eigenvalues $1,1,t_1^2, t_1^{-2}$. The function Λ which gives $t_1^2$ is a multiplicative character, or homomorphism from the group's torus to the underlying field R. The function λ giving θ is a weight of the Lie Algebra with weight space given by the span of the matrices.

It is satisfying to show the multiplicativity of the character and the linearity of the weight. It can further be proved that the differential of Λ can be used to create a weight. It is also educational to consider the case of SL(3, R).

## Variants and analogues

The adjoint representation can also be defined for algebraic groups over any field.

The co-adjoint representation is the contragredient representation of the adjoint representation. Alexandre Kirillov observed that the orbit of any vector in a co-adjoint representation is a symplectic manifold. According to the philosophy in representation theory known as the orbit method (see also the Kirillov character formula), the irreducible representations of a Lie group G should be indexed in some way by its co-adjoint orbits. This relationship is closest in the case of nilpotent Lie groups.

## Notes

1. ^ The condition that a linear map is a Lie algebra homomorphism is a closed condition.
2. ^ \begin{align} \mathrm{ad}_x(y) & = d (\mathrm{Ad}_{x})_{e}(y) \\ & = \lim_{\varepsilon \to 0}\frac{(I+\varepsilon x)y(I+\varepsilon x)^{-1}-y}{\varepsilon} \\ & = \lim_{\varepsilon \to 0}\frac{(I+\varepsilon x)y(I-\varepsilon x +(\varepsilon x)^2+O(\varepsilon^3))-y}{\varepsilon} \\ & = \lim_{\varepsilon \to 0}\frac{((I+\varepsilon x)yI- (I+\varepsilon x)y\varepsilon x +(I+\varepsilon x)y(\varepsilon x)^2 +O(\varepsilon^3))-y}{\varepsilon} \\ & = \lim_{\varepsilon \to 0}\frac{(I y I+\varepsilon x y I- I y \varepsilon x-\varepsilon x y \varepsilon x +Iy(\varepsilon x)^2+\varepsilon xy(\varepsilon x)^2 +O(\varepsilon^3))-y}{\varepsilon} \\ & = \lim_{\varepsilon \to 0}\frac{y+ x y \varepsilon - y x \varepsilon- x y x \varepsilon^{2} +y x^{2}\varepsilon^2 + x y x^{2}\varepsilon^2 +O(\varepsilon^3) -y}{\varepsilon} \\ & = \lim_{\varepsilon \to 0}x y - y x - x y x \varepsilon +y x^{2}\varepsilon + x y x^{2}\varepsilon +O(\varepsilon^2) \\ & = [x,y] \end{align}