Rotation group SO(3)

From Wikipedia, the free encyclopedia
  (Redirected from So(3))
Jump to: navigation, search
"SO(3)" redirects here. For its definition over an arbitrary field, see special orthogonal group.

In mechanics and geometry, the 3D rotation group, often denoted SO(3), is the group of all rotations about the origin of three-dimensional Euclidean space R3 under the operation of composition.[1] By definition, a rotation about the origin is a transformation that preserves the origin, Euclidean distance (so it is an isometry), and orientation (i.e. handedness of space). Every non-trivial rotation is determined by its axis of rotation (a line through the origin) and its angle of rotation. Composing two rotations results in another rotation; every rotation has a unique inverse rotation; and the identity map satisfies the definition of a rotation. Owing to the above properties (along with the associative property, which rotations obey), the set of all rotations is a group under composition. Moreover, the rotation group has a natural structure as a manifold for which the group operations are smooth; so it is in fact a Lie group. It is compact and has dimension 3.

Rotations are linear transformations of R3 and can therefore be represented by matrices once a basis of R3 has been chosen. Specifically, if we choose an orthonormal basis of R3, every rotation is described by an orthogonal 3x3 matrix (i.e. a 3x3 matrix with real entries which, when multiplied by its transpose, results in the identity matrix) with determinant 1. The group SO(3) can therefore be identified with the group of these matrices under matrix multiplication. These matrices are known as "special orthogonal matrices", explaining the notation SO(3).

The group SO(3) is used to describe the possible rotational symmetries of an object, as well as the possible orientations of an object in space. Its representations are important in physics, where they give rise to the elementary particles of integer spin.

Length and angle[edit]

Besides just preserving length, rotations also preserve the angles between vectors. This follows from the fact that the standard dot product between two vectors u and v can be written purely in terms of length:

\mathbf{u}\cdot\mathbf{v} = \tfrac{1}{2}\left(\|\mathbf{u}+\mathbf{v}\|^2 - \|\mathbf{u}\|^2 - \|\mathbf{v}\|^2\right).

It follows that any length-preserving transformation in R3 preserves the dot product, and thus the angle between vectors. Rotations are often defined as linear transformations that preserve the inner product on R3, which is equivalent to requiring them to preserve length. See classical group for a treatment of this more general approach, where SO(3) appears as a special case.

Orthogonal and rotation matrices[edit]

Every rotation maps an orthonormal basis of R3 to another orthonormal basis. Like any linear transformation of finite-dimensional vector spaces, a rotation can always be represented by a matrix. Let R be a given rotation. With respect to the standard basis e1, e2, e3 of R3 the columns of R are given by (Re1, Re2, Re3). Since the standard basis is orthonormal, and since R preserves angles and length, the columns of R form another orthonormal basis. This orthonormality condition can be expressed in the form

R^\mathsf{T}R = I,

where RT denotes the transpose of R and I is the 3 × 3 identity matrix. Matrices for which this property holds are called orthogonal matrices. The group of all 3 × 3 orthogonal matrices is denoted O(3), and consists of all proper and improper rotations.

In addition to preserving length, proper rotations must also preserve orientation. A matrix will preserve or reverse orientation according to whether the determinant of the matrix is positive or negative. For an orthogonal matrix R, note that det RT = det R implies (det R)2 = 1, so that det R = ±1. The subgroup of orthogonal matrices with determinant +1 is called the special orthogonal group, denoted SO(3).

Thus every rotation can be represented uniquely by an orthogonal matrix with unit determinant. Moreover, since composition of rotations corresponds to matrix multiplication, the rotation group is isomorphic to the special orthogonal group SO(3).

Improper rotations correspond to orthogonal matrices with determinant −1, and they do not form a group because the product of two improper rotations is a proper rotation.

Group structure[edit]

The rotation group is a group under function composition (or equivalently the product of linear transformations). It is a subgroup of the general linear group consisting of all invertible linear transformations of the real 3-space R3.[2]

Furthermore, the rotation group is nonabelian. That is, the order in which rotations are composed makes a difference. For example, a quarter turn around the positive x-axis followed by a quarter turn around the positive y-axis is a different rotation than the one obtained by first rotating around y and then x.

The orthogonal group, consisting of all proper and improper rotations, is generated by reflections. Every proper rotation is the composition of two reflections, a special case of the Cartan–Dieudonné theorem.

Axis of rotation[edit]

Every nontrivial proper rotation in 3 dimensions fixes a unique 1-dimensional linear subspace of R3 which is called the axis of rotation (this is Euler's rotation theorem). Each such rotation acts as an ordinary 2-dimensional rotation in the plane orthogonal to this axis. Since every 2-dimensional rotation can be represented by an angle φ, an arbitrary 3-dimensional rotation can be specified by an axis of rotation together with an angle of rotation about this axis. (Technically, one needs to specify an orientation for the axis and whether the rotation is taken to be clockwise or counterclockwise with respect to this orientation).

For example, counterclockwise rotation about the positive z-axis by angle φ is given by

R_z(\varphi) = \begin{bmatrix}\cos\varphi & -\sin\varphi & 0 \\ \sin\varphi & \cos\varphi & 0 \\ 0 & 0 & 1\end{bmatrix}.

Given a unit vector n in R3 and an angle φ, let R(φ, n) represent a counterclockwise rotation about the axis through n (with orientation determined by n). Then

  • R(0, n) is the identity transformation for any n
  • R(φ, n) = R(−φ, −n)
  • R(π + φ, n) = R(π − φ, −n).

Using these properties one can show that any rotation can be represented by a unique angle φ in the range 0 ≤ φ ≤ π and a unit vector n such that

  • n is arbitrary if φ = 0
  • n is unique if 0 < φ < π
  • n is unique up to a sign if φ = π (that is, the rotations R(π, ±n) are identical).

Topology[edit]

The Lie group SO(3) is diffeomorphic to the real projective space RP3.

Consider the solid ball in R3 of radius π (that is, all points of R3 of distance π or less from the origin). Given the above, for every point in this ball there is a rotation, with axis through the point and the origin, and rotation angle equal to the distance of the point from the origin. The identity rotation corresponds to the point at the center of the ball. Rotation through angles between 0 and −π correspond to the point on the same axis and distance from the origin but on the opposite side of the origin. The one remaining issue is that the two rotations through π and through −π are the same. So we identify (or "glue together") antipodal points on the surface of the ball. After this identification, we arrive at a topological space homeomorphic to the rotation group.

Indeed, the ball with antipodal surface points identified is a smooth manifold, and this manifold is diffeomorphic to the rotation group. It is also diffeomorphic to the real 3-dimensional projective space RP3, so the latter can also serve as a topological model for the rotation group.

These identifications illustrate that SO(3) is connected but not simply connected. As to the latter, in the ball with antipodal surface points identified, consider the path running from the "north pole" straight through the interior down to the south pole. This is a closed loop, since the north pole and the south pole are identified. This loop cannot be shrunk to a point, since no matter how you deform the loop, the start and end point have to remain antipodal, or else the loop will "break open". In terms of rotations, this loop represents a continuous sequence of rotations about the z-axis starting and ending at the identity rotation (i.e. a series of rotation through an angle φ where φ runs from 0 to ).

Surprisingly, if you run through the path twice, i.e., run from north pole down to south pole, jump back to the north pole (using the fact that north and south poles are identified), and then again run from north pole down to south pole, so that φ runs from 0 to 4π, you get a closed loop which can be shrunk to a single point: first move the paths continuously to the ball's surface, still connecting north pole to south pole twice. The second half of the path can then be mirrored over to the antipodal side without changing the path at all. Now we have an ordinary closed loop on the surface of the ball, connecting the north pole to itself along a great circle. This circle can be shrunk to the north pole without problems. The Balinese plate trick and similar tricks demonstrate this practically.

The same argument can be performed in general, and it shows that the fundamental group of SO(3) is cyclic group of order 2. In physics applications, the non-triviality of the fundamental group allows for the existence of objects known as spinors, and is an important tool in the development of the spin-statistics theorem.

The universal cover of SO(3) is a Lie group called Spin(3). The group Spin(3) is isomorphic to the special unitary group SU(2); it is also diffeomorphic to the unit 3-sphere S3 and can be understood as the group of versors (quaternions with absolute value 1). The connection between quaternions and rotations, commonly exploited in computer graphics, is explained in quaternions and spatial rotations. The map from S3 onto SO(3) that identifies antipodal points of S3 is a surjective homomorphism of Lie groups, with kernel {±1}. Topologically, this map is a two-to-one covering map.

Connection between SO(3) and SU(2)[edit]

Stereographic projection from the sphere of radius 1/2 from the north pole (x, y, z) = (0, 0, 1/2) onto the plane M given by z = −1/2 coordinatized by (ξ, η), here shown in cross section.

The general reference for this section is Gelfand, Minlos & Shapiro (1963). The points P on the sphere S = {(x, y, z) ∈ ℝ3: x2 + y2 + z2 = 1/4} can, barring the north pole N, be put into one-to-one bijection with points S(P) = P´ on the plane M defined by z = −1/2, see figure. The map S is called stereographic projection.

Let the coordinates on M be (ξ, η). The line L passing through N and P can be written

L = N + t(N - P) = (0,0,1/2) + t( (0,0,1/2) - (x, y, z) ), \quad t\in \mathbb{R}.

Demanding that the z-coordinate equals 1/2, one finds t = 1/z12, hence

S:\mathbf{S} \rightarrow M; P \mapsto P'; (x,y,z) \mapsto (\xi, \eta) = \left(\frac{x}{\frac{1}{2} - z}, \frac{y}{\frac{1}{2} - z}\right) \equiv \zeta = \xi + i\eta,

where, for later convenience, the plane M is identified with the complex plane .

For the inverse, write L as

L = N + s(P'-N) = (0,0,\frac{1}{2}) + s\left( (\xi, \eta, -\frac{1}{2}) - (0,0,\frac{1}{2})\right),

and demand x2 + y2 + z2 = 1/4 to find s = 1/1 + ξ2 + η2 and thus

S^{-1}:M \rightarrow \mathbf{S}; P' \mapsto P;(\xi, \eta) \mapsto (x,y,z) = \left(\frac{\xi}{1 + \xi^2 + \eta^2}, \frac{\eta}{1 + \xi^2 + \eta^2}, \frac{-1 + \xi^2 + \eta^2}{2 + 2\xi^2 + 2\eta^2}\right).

If g ∈ SO(3) is a rotation, then it will take points on S to points on S by its standard action Πs(g) on the embedding space 3. By composing this action with S one obtains a transformation S ∘ Πs(g) ∘ S−1 of M, ζ = P ↦ Πs(g)P =gPS(gP) ≡ Πu(g)ζ = ζ´. Thus Πu(g) is a transformation of associated to the transformation Πs(g) of 3.

It turns out that g ∈ SO(3) represented in this way by Πu(g) can be expressed as a matrix Πu(g) ∈ SU(2) (where the notation is recycled to use the same name for the matrix as for the transformation of it represents). To identify this matrix, consider first a rotation gφ about the z-axis through an angle φ,

\begin{align}x' &= x\cos \varphi - y \sin \varphi,\\
                   y' &= x\sin \varphi + y \cos \varphi,\\
                   z' &= z.\end{align}

Hence

\zeta' = \frac{x' + iy'}{\frac{1}{2} - z'} = \frac{e^{i\varphi}(x + iy)}{\frac{1}{2} - z} = e^{i\varphi}\zeta = \frac{\cos \varphi \zeta  + i \sin \varphi }{0 \zeta + 1},

which, unsurprisingly, is a rotation in the complex plane. In an analogous way, if gθ is a rotation about the x-axis through and angle θ, then

w' = e^{i\theta}w, \quad w = \frac{y + iz}{\frac{1}{2} - x},

which, after a little algebra, becomes

\zeta' = \frac{\cos \frac{\theta}{2}\zeta +i\sin \frac{\theta}{2} }{i \sin\frac{\theta}{2}\zeta + \cos\frac{\theta}{2}}.

These two rotations, gφ, gθ, thus correspond to bilinear transforms of 2 ≃ ℂ ≃ M, namely, they are examples of Möbius transformations.

A general Möbius transformation is given by

\zeta' = \frac{\alpha \zeta + \beta}{\gamma \zeta + \delta}, \quad \alpha\delta - \beta\gamma \ne 0..

The rotations, gφ, gθ generate all of SO(3) and the composition rules of the Möbius transformations show that any composition of gφ, gθ translates to the corresponding composition of Möbius transformations. The Möbius transformations can be represented by matrices

\left(\begin{matrix}\alpha & \beta\\ \gamma & \delta\end{matrix}\right), \quad \quad \alpha\delta - \beta\gamma = 1,

since a common factor of α, β, γ, δ cancels.

For the same reason, the matrix is not uniquely defined since multiplication by I has no effect on either the determinant or the Möbius transformation. The composition law of Möbius transformations follow that of the corresponding matrices. The conclusion is that each Möbius transformation corresponds to two matrices g, −g ∈ SL(2, ℂ).

Using this correspondence one may write

\begin{align}\Pi_u(g_\varphi) &= \Pi_u\left[\left(\begin{matrix}
\cos \varphi & -\sin \varphi & 0\\
\sin \varphi & \cos \varphi & 0\\
0            &            0 & 0
\end{matrix}\right)\right] = \pm

\left(\begin{matrix}
e^{i\frac{\varphi}{2}} & 0\\
0 & e^{-i\frac{\varphi}{2}}
\end{matrix}\right),\\

\Pi_u(g_\theta) &= \Pi_u\left[\left(\begin{matrix}
0            &           0 & 0\\
0 & \cos \theta & -\sin \theta\\
0 & \sin \theta & \cos \theta
\end{matrix}\right)\right] = \pm

\left(\begin{matrix}
\cos\frac{\theta}{2} & i\sin\frac{\theta}{2}\\
i\sin\frac{\theta}{2} & \cos\frac{\theta}{2}
\end{matrix}\right).\end{align}

These matrices are unitary and thus Πu(SO(3)) ⊂ SU(2) ⊂ SL(2, ℂ). In terms of Euler angles[nb 1] one finds for a general rotation

\begin{align}g(\varphi, \theta, \psi) &= g_\varphi g_\theta g_\psi  = 
\left(\begin{matrix}
\cos \varphi & -\sin \varphi & 0\\
\sin \varphi & \cos \varphi  & 0\\
           0 &            0  & 1
\end{matrix}\right)
\left(\begin{matrix}
1 &           0 &            0\\
0 & \cos \theta & -\sin \theta\\
0 & \sin \theta & \cos \theta
\end{matrix}\right)
\left(\begin{matrix}
\cos \psi & -\sin \psi & 0\\
\sin \psi & \cos \psi  & 0\\
        0 &         0  & 1
\end{matrix}\right)\\
&= 
\left(\begin{matrix}
\cos\varphi\cos\psi - \cos\theta\sin\varphi\sin\psi & -\cos\varphi\sin\psi - \cos\theta\sin\varphi\cos\psi & \sin\varphi\sin\theta\\
\sin\varphi\cos\psi + \cos\theta\cos\varphi\sin\psi & -\sin\varphi\sin\psi + \cos\theta\cos\varphi\cos\psi & -\cos\varphi\sin\theta\\
\sin\psi\sin\theta                                  & \cos\psi\sin\theta                                   & \cos\theta
\end{matrix}\right),\end{align}

 

 

 

 

(1)

one has[3]

\begin{align}\Pi_u(g(\varphi, \theta, \psi)) &= \pm
\left(\begin{matrix}
e^{i\frac{\varphi}{2}} & 0\\
0 & e^{-i\frac{\varphi}{2}}
\end{matrix}\right)
\left(\begin{matrix}
\cos\frac{\theta}{2} & i\sin\frac{\theta}{2}\\
i\sin\frac{\theta}{2} & \cos\frac{\theta}{2}
\end{matrix}\right)
\left(\begin{matrix}
e^{i\frac{\psi}{2}} & 0\\
0 & e^{-i\frac{\psi}{2}}
\end{matrix}\right)\\
&= \pm
\left(\begin{matrix}
\cos\frac{\theta}{2}e^{i\frac{\varphi + \psi}{2}} & i\sin\frac{\theta}{2}e^{i\frac{\varphi - \psi}{2}}\\
i\sin\frac{\theta}{2}e^{-i\frac{\varphi - \psi}{2}} & \cos\frac{\theta}{2}e^{-i\frac{\varphi + \psi}{2}}
\end{matrix}\right).\end{align}

 

 

 

 

(2)

For the converse, consider a general matrix

\pm\Pi_u(g_{\alpha,\beta}) = \pm\left(\begin{matrix}
\alpha & \beta\\
-\overline{\beta} & \overline{\alpha}
\end{matrix}\right) \in \mathrm{SU}(2).

Make the substitutions

\begin{align}\cos\frac{\theta}{2} &= |\alpha|,\quad \sin\frac{\theta}{2} = |\beta|, \quad (0 \le \theta \le \pi),\\
\frac{\varphi + \psi}{2} &= \arg \alpha, \quad \frac{\psi - \varphi}{2} = \arg \beta.\end{align}

With the substitutions, Π(gα, β) assumes the form of the right hand side (RHS) of (2), which corresponds under Πu to a matrix on the form of the RHS of (1) with the same φ, θ, ψ. In terms of the complex parameters α, β,

g_{\alpha,\beta} = 
\left(\begin{matrix}
\frac{1}{2}(\alpha^2 - \beta^2 + \overline{\alpha^2} - \overline{\beta^2}) & \frac{i}{2}(-\alpha^2 - \beta^2 + \overline{\alpha^2} + \overline{\beta^2}) & -\alpha\beta-\overline{\alpha}\overline{\beta}\\
\frac{i}{2}(\alpha^2 - \beta^2 - \overline{\alpha^2} + \overline{\beta^2}) & \frac{1}{2}(\alpha^2 + \beta^2 + \overline{\alpha^2} + \overline{\beta^2}) & -i(+\alpha\beta-\overline{\alpha}\overline{\beta})\\
\alpha\overline{\beta} + \overline{\alpha}\beta & i(-\alpha\overline{\beta} + \overline{\alpha}\beta) & \alpha\overline{\alpha} - \beta\overline{\beta}
\end{matrix}\right).

To verify this, substitute for α. β the elements of the matrix on the RHS of (2). After some manipulation, the matrix assumes the form of the RHS of (1).

It is clear from the explicit form in terms of Euler angles that the map p:SU(2) → SO(3);Π(±gαβ) ↦ gαβ just described is a smooth, 2:1 and onto group homomorphism. It is hence an explicit description of the universal covering map of SO(3) from the universal covering group SU(2).

Quaternions of unit norm[edit]

SU(2) is isomorphic to the quaternions of unit norm via a map given by

q = a\mathrm{1} + b\mathrm{i} + c\mathrm{j} + d\mathrm{k} = \alpha + j\beta \leftrightarrow \begin{bmatrix}\alpha & -\overline \beta \\ \beta & \overline \alpha\end{bmatrix} = U, \quad q \in \mathbb{H},\quad a,b,c,d \in \mathbb{R}, \quad \alpha, \beta \in \mathbb{C},\quad U \in \mathrm{SU}(2).[4]

This means that there is a 2:1 homomorphism from quaternions of unit norm to SO(3). Concretely, a unit quaternion, q, with

\begin{align}
 q &{}= w + \bold{i}x + \bold{j}y + \bold{k}z , \\
 1 &{}= w^2 + x^2 + y^2 + z^2 ,
\end{align}

is mapped to the rotation matrix

 Q = \begin{bmatrix}
    1 - 2 y^2 - 2 z^2 & 2 x y - 2 z w & 2 x z + 2 y w \\
    2 x y + 2 z w & 1 - 2 x^2 - 2 z^2 & 2 y z - 2 x w \\
    2 x z - 2 y w & 2 y z + 2 x w & 1 - 2 x^2 - 2 y^2
\end{bmatrix}.

This is a rotation around the vector (x,y,z) by an angle 2θ, where cos θ = w and |sin θ| = ||(x,y,z)||. The proper sign for sin θ is implied, once the signs of the axis components are fixed. The 2:1-nature is apparent since both q and q map to the same Q.

Lie algebra[edit]

Associated with every Lie group is its Lie algebra, a linear space of the same dimension as the Lie group, closed under a bilinear alternating product called the Lie bracket. The Lie algebra of SO(3) is denoted by so(3) and consists of all skew-symmetric 3 × 3 matrices. This may be seen by differentiating the orthogonality condition, ATA = I, A ∈ SO(3).[nb 2] The Lie bracket of two elements of so(3) is, as for the Lie algebra of every matrix group, given by the matrix commutator, [A1, A2] = A1A2A2A1, which is again a skew-symmetric matrix. The Lie algebra bracket captures the essence of the Lie group product in a sense made precise by the Baker–Campbell–Hausdorff formula.

The elements of so(3) are the "infinitesimal generators" of rotations, i.e. they are the elements of the tangent space of the manifold SO(3) at the identity element. If R(φ, n) denotes a counter-clockwise rotation with angle φ about the axis specified by the unit vector n, then

\left.{\operatorname{d}\over\operatorname{d}\phi} \right|_{\phi=0} R(\phi,\boldsymbol{n}) \boldsymbol{x} = \boldsymbol{n} \times \boldsymbol{x}

for every vector x in R3. This can be used to show that the Lie algebra so(3) (with commutator) is isomorphic to the Lie algebra R3 (with cross product). Under this isomorphism, a vector \boldsymbol{\omega}\in\mathbb R^3 corresponds to the linear map \bold{\tilde\omega} defined by \bold{\tilde\omega}(\boldsymbol{x})=\boldsymbol{\omega}\times\boldsymbol{x}.

In more detail, a most often suitable basis for so(3) as a 3-dimensional vector space is


 L_{\bold{x}} = \begin{bmatrix}0&0&0\\0&0&-1\\0&1&0\end{bmatrix} , \quad
 L_{\bold{y}} = \begin{bmatrix}0&0&1\\0&0&0\\-1&0&0\end{bmatrix} , \quad
 L_{\bold{z}} = \begin{bmatrix}0&-1&0\\1&0&0\\0&0&0\end{bmatrix}.

The commutation relations of these basis elements are,


 [L_{\bold{x}}, L_{\bold{y}}] = L_{\bold{z}}, \quad
 [L_{\bold{z}}, L_{\bold{x}}] = L_{\bold{y}}, \quad
 [L_{\bold{y}}, L_{\bold{z}}] = L_{\bold{x}}

which agree with the relations of the three standard unit vectors of R3 under the cross product.

As announced above, one can identify any matrix in this Lie algebra with a vector in ℝ3,[5]

\begin{align}
 \boldsymbol{\omega} &= (x,y,z) \in \mathbb{R}^3,\\
 \boldsymbol{\tilde{\omega}}  &=\boldsymbol{\omega\cdot L} = x L_{\bold{x}} + y L_{\bold{y}} + z L_{\bold{z}}
                             = \begin{bmatrix}0&-z&y\\z&0&-x\\-y&x&0\end{bmatrix} \in \mathfrak{so}(3).
\end{align}

This identification is sometimes called the hat-map.[6] Under this identification, the so(3) bracket corresponds in 3 to the cross product,

 [\tilde{\bold{u}},\tilde{\bold{v}}] = \widetilde{\bold{u}\!\times\!\bold{v} }.

The matrix identified with a vector u has the property that

 \tilde{\bold{u}} \bold{v} = \bold{u} \times \bold{v},

where ordinary matrix multiplication is implied on the left hand side. This implies that u is in the null space of the skew-symmetric matrix with which it is identified, because u × u = 0.

Isomorphism with su(2)[edit]

The Lie algebras so(3) and su(2) are isomorphic. One basis for su(2) is given by

t_1 = \frac{1}{2}\begin{bmatrix}0 & -i\\ -i & 0\end{bmatrix}, \quad t_2 = \frac{1}{2}\begin{bmatrix}0 & -1\\ 1 & 0\end{bmatrix}, \quad t_3 = \frac{1}{2}\begin{bmatrix}-i & 0\\ 0 & i\end{bmatrix}.

These are related to the Pauli matrices by ti1/2iσi. The Pauli matrices abide the physicist convention for Lie algebras. In that convention, Lie algebra elements are multiplied by i, the exponential map (below) is defined with an extra factor of i in the exponent and the structure constants remain the same, but the definition of them acquires a factor of i. Likewise, commutation relations acquire a factor of i. The commutation relations for the ti are

[t_i, t_j] = \epsilon_{ijk}t_k,

where εijk is the totally anti-symmetric symbol with ε123 = 1. The isomorphism between so(3) and su(2) can be set up in several ways. For later convenience, so(3) and su(2) are identified by mapping

L_x \leftrightarrow t_1, \quad L_y \leftrightarrow t_2, \quad L_z \leftrightarrow t_3,

and extending by linearity.

Exponential map[edit]

The exponential map for SO(3), is, since SO(3) is a matrix Lie group, defined using the standard matrix exponential series,


 \exp \colon \mathfrak{so}(3) \to SO(3);
                            A \mapsto e^A = \sum_{k=0}^{\infty} \frac{1}{k!} A^k
                              = I + A + \tfrac{1}{2} A^2 + \cdots + \tfrac{1}{k!} A^k + \cdots
.

For any skew-symmetric matrix Aso(3), eA is always in SO(3). The level of difficulty of proof depends on how a matrix group Lie algebra is defined. Hall (2003) defines the Lie algebra as the set of matrices AMn(ℝ)| etA ∈ SO(3) ∀t, in which case it is trivial. Rossmann (2002) uses for a definition derivatives of smooth curve segments in SO(3) through the identity taken at the identity, in which case it is harder.[7]

For a fixed A ≠ 0, etA, −∞ < t < ∞ is a one-parameter subgroup along a geodesic in SO(3). That this gives a one-parameter subgroup follows directly from properties of the exponential map.[8]

The exponential map provides a diffeomorphism between a neighborhood of the origin in the so(3) and a neighborhood of the identity in the SO(3).[9] For a proof, see Closed subgroup theorem.

The exponential map is surjective. This follows from the fact that that every R ∈ SO(3), since every rotation leaves an axis fixed (Euler's rotation theorem), and is conjugate to a block diagonal matrix of the form

D=\left(\begin{matrix}\cos \theta & -\sin \theta & 0\\
                     \sin \theta &  \cos \theta & 0\\
                               0 &            0 & 1\end{matrix}\right) = e^{\theta L_z},

such that A = BDB−1, and that

Be^{\theta L_z}B^{-1} = e^{B\theta L_zB^{-1}},

together with the fact that so(3) is closed under the adjoint action of SO(3), meaning that BθLzB−1so(3).

As shown above, every element Aso(3) is associated with a vector ω = θ u, where u = (x,y,z) is a unit magnitude vector. Since u is in the null space of A, if one now rotates to a new basis, through some other orthogonal matrix O, with u as the z axis, the final column and row of the rotation matrix in the new basis will be zero.

Thus, we know in advance from the formula for the exponential that exp(OAOT) must leave u fixed. It is mathematically impossible to supply a straightforward formula for such a basis as a function of u, because its existence would violate the hairy ball theorem; but direct exponentiation is possible, and yields

\begin{align}
 \exp( \tilde{\boldsymbol{\omega}} )
 &{}= \exp \left( \begin{bmatrix} 0 & -z \theta & y \theta \\ z \theta & 0&-x \theta \\ -y \theta & x \theta & 0 \end{bmatrix} \right)\\
 &{}= \boldsymbol{I} + 2cs~\boldsymbol{u\cdot L} + 2s^2 ~(\boldsymbol{u\cdot L} )^2 \\
 &{}= \begin{bmatrix}
    2 (x^2 - 1) s^2 + 1 & 2 x y s^2 - 2 z c s & 2 x z s^2 + 2 y c s \\
    2 x y s^2 + 2 z c s & 2 (y^2 - 1) s^2 + 1 & 2 y z s^2 - 2 x c s \\
    2 x z s^2 - 2 y c s & 2 y z s^2 + 2 x c s & 2 (z^2 - 1) s^2 + 1
  \end{bmatrix} ,
\end{align}

where c = cosθ2, s = sinθ2. This is recognized as a matrix for a rotation around axis u by the angle θ: cf. Rodrigues' rotation formula.

Logarithm map[edit]

Given R ∈ SO(3), let

A = \frac{R - R^{\mathrm{T}}}{2}

denote the antisymmetric part.

Then, the logarithm of A is given by[10]

\log R = \frac{\sin^{-1}||A||}{||A||}A.

This is manifest by inspection of the mixed symmetry form of Rodrigues' formula,

e^X = I + \frac{\sin \theta}{\theta}X + 2\frac{\sin^2\frac{\theta}{2}}{\theta^2}X^2, \quad \theta = ||X||,

where the first and last term on the right-hand side are symmetric.

Baker–Campbell–Hausdorff formula[edit]

Suppose X and Y in the Lie algebra are given. Their exponentials, exp(X) and exp(Y), are rotation matrices, which can be multiplied. Since the exponential map is a surjection, for some Z in the Lie algebra, exp(Z) = exp(X) exp(Y), and one may tentatively write

 Z = C(X, Y),

for C some expression in X and Y. When exp(X) and exp(Y) commute, then Z = X + Y, mimicking the behavior of complex exponentiation.

The general case is given by the more elaborate BCH formula, a series expansion of nested Lie brackets.[11] For matrices, the Lie bracket is the same operation as the commutator, which monitors lack of commutativity in multiplication. This general expansion unfolds as follows,[nb 3]

 Z = C(X, Y) = X + Y + \tfrac12 [X, Y] + \tfrac{1}{12} [X,[X,Y]] - \tfrac{1}{12} [Y,[X,Y]] + \cdots ~.

The infinite expansion in the BCH formula for SO(3) reduces to a compact form,

Z = \alpha X + \beta Y + \gamma[X , Y],

for suitable trigonometric function coefficients (α, β, γ).

It is worthwhile to write this composite rotation generator as

\alpha X + \beta Y + \gamma[X , Y] ~\underset{\mathfrak{so}(3)}{=} ~ X + Y + \tfrac12 [X, Y] + \tfrac{1}{12} [X,[X,Y]] - \tfrac{1}{12} [Y,[X,Y]] + \cdots   ,

to emphasize that this is a Lie algebra identity.

The above identity holds for all faithful representations of so(3). The kernel of a Lie algebra homomorphism is an ideal, but so(3), being simple, has no nontrivial ideals and all nontrivial representations are hence faithful. It holds in particular in the doublet or spinor representation. The same explicit formula thus follows in a simpler way through Pauli matrices, cf. the 2×2 derivation for SU(2).

For the general n × n case, one might use Ref.[12]

Infinitesimal rotations[edit]

The matrices in the Lie algebra are not themselves rotations; the skew-symmetric matrices are derivatives. An actual "differential rotation", or infinitesimal rotation matrix has the form

 I + A \, d\theta ~,

where is vanishingly small and Aso(3).

These matrices do not satisfy all the same properties as ordinary finite rotation matrices under the usual treatment of infinitesimals .[13] To understand what this means, one considers

 dA_{\bold{x}} = \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & -d\theta \\ 0 & d\theta & 1 \end{bmatrix}~ .

First, test the orthogonality condition, QTQ = I. The product is

 dA_{\bold{x}}^T \, dA_{\bold{x}} = \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1+d\theta^2 & 0 \\ 0 & 0 & 1+d\theta^2 \end{bmatrix} ,

differing from an identity matrix by second order infinitesimals, discarded here. So, to first order, an infinitesimal rotation matrix is an orthogonal matrix.

Next, examine the square of the matrix,

 dA_{\bold{x}}^2 = \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1-d\theta^2 & -2d\theta \\ 0 & 2d\theta & 1-d\theta^2 \end{bmatrix}~.

Again discarding second order effects, note that the angle simply doubles. This hints at the most essential difference in behavior, which we can exhibit with the assistance of a second infinitesimal rotation,

 dA_{\bold{y}} = \begin{bmatrix} 1 & 0 & d\phi \\ 0 & 1 & 0 \\ -d\phi & 0 & 1 \end{bmatrix} .

Compare the products dAxdAy to dAydAx,

\begin{align}
 dA_{\bold{x}}\,dA_{\bold{y}} &{}= \begin{bmatrix} 1 & 0 & d\phi \\ d\theta\,d\phi & 1 & -d\theta \\ -d\phi & d\theta & 1 \end{bmatrix} \\
 dA_{\bold{y}}\,dA_{\bold{x}} &{}= \begin{bmatrix} 1 & d\theta\,d\phi & d\phi \\ 0 & 1 & -d\theta \\ -d\phi & d\theta & 1 \end{bmatrix}. \\
\end{align}

Since dθ dφ is second order, we discard it: thus, to first order, multiplication of infinitesimal rotation matrices is commutative. In fact,

 dA_{\bold{x}}\,dA_{\bold{y}} = dA_{\bold{y}}\,dA_{\bold{x}} , \,\!

again to first order. In other words, the order in which infinitesimal rotations are applied is irrelevant.

This useful fact makes, for example, derivation of rigid body rotation relatively simple. But one must always be careful to distinguish (the first order treatment of) these infinitesimal rotation matrices from both finite rotation matrices and from Lie algebra elements. When contrasting the behavior of finite rotation matrices in the BCH formula above with that of infinitesimal rotation matrices, where all the commutator terms will be second order infinitesimals one finds a bona fide vector space. Technically, this dismissal of any second order terms amounts to Group contraction.

Realizations of rotations[edit]

We have seen that there are a variety of ways to represent rotations:

See also: Charts on SO(3)

A note on representations[edit]

The Lie group SO(3) is compact and simple of rank 1, and so it has a single independent Casimir element, a quadratic invariant function of the three generators which commutes with all of them. The Killing form for the rotation group is just the Kronecker delta, and so this Casimir invariant is simply the sum of the squares of the generators, J_x,\, J_y,\, J_z, of the algebra


 [J_{\bold{x}}, J_{\bold{y}}] = J_{\bold{z}}, \quad
 [J_{\bold{z}}, J_{\bold{x}}] = J_{\bold{y}}, \quad
 [J_{\bold{y}}, J_{\bold{z}}] = J_{\bold{x}}.

That is, the Casimir invariant is given by

J^2\equiv \boldsymbol{J\cdot J} =  J_x^2+J_y^2+J_z^2 \propto I~.

For unitary irreducible representations Dj, the eigenvalues of this invariant are real and discrete, and characterize each representation, which is finite dimensional, of dimensionality 2j+1. That is, the eigenvalues of this Casimir operator are

J^2=- j(j+1) ~I_{2j+1} ~,

where j is integer or half-integer, and referred to as the spin or angular momentum.

So, above, the 3×3 generators L displayed act on the triplet (spin 1) representation, while the 2×2 ones (t) act on the doublet (spin-½) representation. By taking Kronecker products of D1/2 with itself repeatedly, one may construct all higher irreducible representations Dj. That is, the resulting generators for higher spin systems in three spatial dimensions, for arbitrarily large j, can be calculated using these spin operators and ladder operators.

For every unitary irreducible representations Dj there is an equivalent one, D−j−1. All infinite-dimensional irreducible representations must be non-unitary, since the group is compact.

In quantum mechanics, the Casimir invariant is the "angular-momentum-squared" operator; integer values of spin j characterize bosonic representations, while half-integer values fermionic representations, respectively. The antihermitean matrices used above are utilized as spin operators, after they are multiplied by i, so they are now hermitean (like the Pauli matrices). Thus, in this language,


 [J_{\bold{x}}, J_{\bold{y}}] = iJ_{\bold{z}}, \quad
 [J_{\bold{z}}, J_{\bold{x}}] = iJ_{\bold{y}}, \quad
 [J_{\bold{y}}, J_{\bold{z}}] = iJ_{\bold{x}}.

and hence

J^2= j(j+1) ~I_{2j+1} ~.

Explicit expressions for these Dj are,

\begin{align} 
\left (  J_z^{(j)}\right ) _{ba} &= (j+1-a)~\delta_{ab,a}\\
\left (J_x^{(j)}\right )_{ba} &=\frac{1}{2}(\delta_{b,a+1}+\delta_{b+1,a}  ) \sqrt{(j+1)(a+b-1)-ab}\\
\left (J_y^{(j)}\right )_{ba} &=\frac{1}{2i}(\delta_{b,a+1}-\delta_{b+1,a}  ) \sqrt{(j+1)(a+b-1)-ab}\\
     &1 \le a, b \le 2j+1~,
\end{align}

for arbitrary j.

For example, the resulting spin matrices for spin 1, spin 3/2, and 5/2 are:

For j = 1

\begin{align}
  J_x &= \frac{1}{\sqrt{2}}
    \begin{pmatrix}
      0 &1 &0\\
      1 &0 &1\\
      0 &1 &0
    \end{pmatrix} \\
  J_y &= \frac{1}{\sqrt{2}}
    \begin{pmatrix}
      0 &-i &0\\
      i &0  &-i\\
      0 &i  &0
    \end{pmatrix} \\
  J_z &=
    \begin{pmatrix}
      1 &0 &0\\
      0 &0 &0\\
      0 &0 &-1
   \end{pmatrix}
\end{align}

(Note, however, how these are in an equivalent, but different basis than the above i Ls.)

For j=\textstyle\frac{3}{2}:

\begin{align}
  J_x &= \frac{1}{2}
    \begin{pmatrix}
      0        &\sqrt{3} &0        &0\\
      \sqrt{3} &0        &2        &0\\
      0        &2        &0        &\sqrt{3}\\
      0        &0        &\sqrt{3} &0
    \end{pmatrix} \\
  J_y &= \frac{1}{2}
    \begin{pmatrix}
      0         &-i\sqrt{3} &0         &0\\
      i\sqrt{3} &0          &-2i       &0\\
      0         &2i         &0         &-i\sqrt{3}\\
      0         &0          &i\sqrt{3} &0
    \end{pmatrix} \\
  J_z &=\frac{1}{2}
    \begin{pmatrix}
      3 &0 &0  &0\\
      0 &1 &0  &0\\
      0 &0 &-1 &0\\
      0 &0 &0  &-3
     \end{pmatrix}.
\end{align}

For j = \textstyle\frac{5}{2}:

\begin{align}
  J_x &= \frac{1}{2}
    \begin{pmatrix}
      0        &\sqrt{5}  &0         &0         &0         &0 \\
      \sqrt{5} &0         &2\sqrt{2} &0         &0         &0 \\
      0        &2\sqrt{2} &0         &3         &0         &0 \\
      0        &0         &3         &0         &2\sqrt{2} &0 \\
      0        &0         &0         &2\sqrt{2} &0         &\sqrt{5} \\
      0        &0         &0         &0         &\sqrt{5}  &0
    \end{pmatrix} \\
  J_y &= \frac{1}{2}
    \begin{pmatrix}
      0         &-i\sqrt{5} &0           &0          &0           &0 \\
      i\sqrt{5} &0          &-2i\sqrt{2} &0          &0           &0 \\
      0         &2i\sqrt{2} &0           &-3i        &0           &0 \\
      0         &0          &3i          &0          &-2i\sqrt{2} &0 \\
      0         &0          &0           &2i\sqrt{2} &0           &-i\sqrt{5} \\
      0         &0          &0           &0          &i\sqrt{5}   &0
    \end{pmatrix} \\
  J_z &= \frac{1}{2}
    \begin{pmatrix}
      5 &0 &0 &0  &0  &0 \\
      0 &3 &0 &0  &0  &0 \\
      0 &0 &1 &0  &0  &0 \\
      0 &0 &0 &-1 &0  &0 \\
      0 &0 &0 &0  &-3 &0 \\
      0 &0 &0 &0  &0  &-5
    \end{pmatrix}.
\end{align}

and so on.

Generalizations[edit]

The rotation group generalizes quite naturally to n-dimensional Euclidean space, Rn with its standard Euclidean structure. The group of all proper and improper rotations in n dimensions is called the orthogonal group O(n), and the subgroup of proper rotations is called the special orthogonal group SO(n), which is a Lie group of dimension n(n − 1)/2.

In special relativity, one works in a 4-dimensional vector space, known as Minkowski space rather than 3-dimensional Euclidean space. Unlike Euclidean space, Minkowski space has an inner product with an indefinite signature. However, one can still define generalized rotations which preserve this inner product. Such generalized rotations are known as Lorentz transformations and the group of all such transformations is called the Lorentz group.

The rotation group SO(3) can be described as a subgroup of E+(3), the Euclidean group of direct isometries of Euclidean R3. This larger group is the group of all motions of a rigid body: each of these is a combination of a rotation about an arbitrary axis and a translation along the axis, or put differently, a combination of an element of SO(3) and an arbitrary translation.

In general, the rotation group of an object is the symmetry group within the group of direct isometries; in other words, the intersection of the full symmetry group and the group of direct isometries. For chiral objects it is the same as the full symmetry group.

See also[edit]

Remarks[edit]

  1. ^ This is affected by first applying a rotation gφ through φ about the z-axis to take the x-axis to the line L, the intersection between the planes xy and x´y´, the latter being the rotated xy-plane. Then rotate with gθ through θ about L to obtain the new z-axis from the old one, and finally rotate by gψ through an angle ψ abut the new z-axis, where ψ is the angle between L and the new x-axis. In the equation, gθ and gψ are expressed in a temporary rotated basis at each step, which is seen from their simple form. To transform these back to the original basis, observe that gθ =gφgθgφ−1. Here boldface means that the rotation is expressed in the original basis. Likewise, gψ =gφgθgφ−1gφgψ[gφgθgφ−1gφ]−1. Thus gψgθgφ = gφgθgφ−1gφgψ[gφgθgφ−1gφ]−1*gφgθgφ−1*gφ = gφgθgψ.
  2. ^ For an alternative derivation of so(3), see Classical group.
  3. ^ For a full proof, see Derivative of the exponential map. Issues of convergence of this series to the correct element of the Lie algebra are here swept under the carpet. Convergence is guaranteed when ||X|| + ||Y|| < log 2 and ||Z|| < log 2. The series may still converge even if these conditions aren't fulfilled. A solution always exists since exp is onto in the cases under consideration.

Notes[edit]

  1. ^ Jacobson (2009), p. 34, Ex. 14.
  2. ^ n × n real matrices are identical to linear transformations of Rn expressed in its standard basis.
  3. ^ These expressions were, in fact, seminal in the development of quantum mechanics in the 1930s, cf. Ch III,  § 16, B.L. van der Waerden, 1932/1932
  4. ^ Rossmann 2002 p. 95.
  5. ^ Rossmann 2002
  6. ^ Engø 2001
  7. ^ See Rossmann 2002, theorem 3, section 2.2.
  8. ^ Rossmann 2002 Section 1.1.
  9. ^ Hall 2003 Theorem 2.27.
  10. ^ Engø 2001
  11. ^ Hall 2003, Ch. 3; Varadarajan 1984, §2.15
  12. ^ Curtright, Fairlie & Zachos 2014 Group elements of SU(2) are expressed in closed form as finite polynomials of the Lie algebra generators, for all definite spin representations of the rotation group.
  13. ^ (Goldstein, Poole & Safko 2002, §4.8)

References[edit]

  • Hall, Brian C. (2003), Lie Groups, Lie Algebras, and Representations An Elementary Introduction, Graduate texts in mathematics 222, Springer Publishing, ISBN 0-387-40122-9 
  • Jacobson, Nathan (2009), Basic algebra 1 (2nd ed.), Dover Publications, ISBN 978-0-486-47189-1 
  • Joshi, A. W. (2007), Elements of Group Theory for Physicists, New Age International, pp. 111ff, ISBN 81-224-0975-X 
  • Rossmann, Wulf (2002), Lie Groups – An Introduction Through Linear Groups, Oxford Graduate Texts in Mathematics, Oxford Science Publications, ISBN 0 19 859683 9 
  • van der Waerden, B. L. (1952), Group Theory and Quantum Mechanics, Springer Publishing, ISBN 978-3642658624  (translation of the original 1932 edition, Die Gruppentheoretische Methode in Der Quantenmechanik).