Jordan matrix

From Wikipedia, the free encyclopedia
Jump to: navigation, search

In the mathematical discipline of matrix theory, a Jordan block over a ring R (whose identities are the zero 0 and one 1) is a matrix composed of 0 elements everywhere except for the diagonal, which is filled with a fixed element \lambda\in R, and for the superdiagonal, which is composed of ones. The concept is named after Camille Jordan.

\begin{pmatrix}
\lambda & 1       & 0       & \cdots  & 0 \\
0       & \lambda & 1       & \cdots  & 0 \\
\vdots  & \vdots  & \vdots& \ddots  & \vdots \\
0       & 0       & 0        & \lambda & 1       \\
0       & 0       & 0       & 0       & \lambda \end{pmatrix}

Every Jordan block is thus specified by its dimension n and its eigenvalue \lambda and is indicated as J_{\lambda,n}. Any block diagonal matrix whose blocks are Jordan blocks is called a Jordan matrix; using either the \oplus or the “\mathrm{diag}” symbol, the (l+m+n)\times (l+m+n) block diagonal square matrix whose first diagonal block is J_{\alpha,l}, whose second diagonal block is J_{\beta,m} and whose third diagonal block is J_{\gamma,n} is compactly indicated as J_{\alpha,l}\oplus J_{\beta,m}\oplus J_{\gamma,n} or \mathrm{diag}\left(J_{\alpha,l}, J_{\beta,m}, J_{\gamma,n}\right), respectively. For example the matrix


J=\left(\begin{matrix}
0 & 1 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & i & 1 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & i & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & i & 1 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & i & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 & 7 & 1 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 7 & 1 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 7 \end{matrix}\right)

is a 10\times 10 Jordan matrix with a 3\times 3 block with eigenvalue 0, two 2\times 2 blocks with eigenvalue the imaginary unit and a 3\times 3 block with eigenvalue 7. Its Jordan-block structure can also be written as either J_{0,3}\oplus J_{i,2}\oplus J_{i,2}\oplus J_{7,3} or \mathrm{diag}\left(J_{0,3},J_{i,2},J_{i,2},J_{7,3}\right).

Linear algebra[edit]

Any n\times n square matrix A whose elements are in an algebraically closed field K is similar to a Jordan matrix J, also in \mathbb{M}_n (K), which is unique up to a permutation of its diagonal blocks themselves. J is called the Jordan normal form of A and corresponds to a generalization of the diagonalization procedure.[1][2][3] A diagonalizable matrix is similar, in fact, to a special case of Jordan matrix: the matrix whose blocks are all 1\times 1.[4][5][6]

More generally, given a Jordan matrix J=J_{\lambda_1,m_1}\oplus J_{\lambda_2,m_2} \oplus\ldots\oplus J_{\lambda_N,m_N}, i.e. whose k^\text{th} diagonal block, 1\leq k\leq N is the Jordan block J_{\lambda_k,m_k} and whose diagonal elements \lambda_k may not all be distinct, the geometric multiplicity of \lambda\in K for the matrix J, indicated as \mathrm{gmul}_J \lambda\,, corresponds to the number of Jordan blocks whose eigenvalue is \lambda. Whereas the index of an eigenvalue \lambda for J, indicated as \mathrm{idx}_J \lambda\,, is defined as the dimension of the largest Jordan block associated to that eigenvalue.

The same goes for all the matrices A similar to J, so \mathrm{idx}_A \lambda\, can be defined accordingly with respect to the Jordan normal form of A for any of its eigenvalues \lambda \in\mathrm{spec}A. In this case one can check that the index of \lambda for A is equal to its multiplicity as a root of the minimal polynomial of A (whereas, by definition, its algebraic multiplicity for A, \mathrm{mul}_A \lambda\,, is its multiplicity as a root of the characteristic polynomial of A, i.e. \det(A-xI)\in K[x]). An equivalent necessary and sufficient condition for A to be diagonalizable in K is that all of its eigenvalues have index equal to 1, i.e. its minimal polynomial has only simple roots.

Note that knowing a matrix's spectrum with all of its algebraic/geometric multiplicities and indexes does not always allow for the computation of its Jordan normal form (this may be a sufficient condition only for spectrally simple, usually low-dimensional matrices): the Jordan decomposition is, in general, a computationally challenging task. From the vector space point of view, the Jordan decomposition is equivalent to finding an orthogonal decomposition (i.e. via direct sums of eigenspaces represented by Jordan blocks) of the domain which the associated generalized eigenvectors make a basis for.

Functions of matrices[edit]

Let A\in\mathbb{M}_n (\mathbb{C}) (i.e. a n\times n complex matrix) and C\in\mathrm{GL}_n (\mathbb{C}) be the change of basis matrix to the Jordan normal form of A, i.e. A=C^{-1}JC. Now let f(z) be a holomorphic function on an open set \mathit{\Omega} such that \mathrm{spec}A\subset \mathit{\Omega}\subseteq\mathbb{C}, i.e. the spectrum of the matrix is contained inside the domain of holomorphy of f. Let

f(z)=\sum_{h=0}^{\infty}a_h (z-z_0)^h

be the power series expansion of f around z_0\in\mathit{\Omega}\backslash\mathrm{spec}A, which will be hereinafter supposed to be 0 for simplicity's sake. The matrix f(A) is then defined via the following formal power series

f(A)=\sum_{h=0}^{\infty}a_h A^h

is absolutely convergent with respect to the Euclidean norm of \mathbb{M}_n (\mathbb{C}). To put it another way, f(A)\, converges absolutely for every square matrix whose spectral radius is less than the radius of convergence of f around 0 and is uniformly convergent on any compact subsets of \mathbb{M}_n (\mathbb{C}) satisfying this property in the matrix Lie group topology.

The Jordan normal form allows the computation of functions of matrices without explicitly computing an infinite series, which is one of the main achievements of Jordan matrices. Using the facts that the k^\mathrm{th} power (k\in\mathbb{N}_0) of a diagonal block matrix is the diagonal block matrix whose blocks are the k^\mathrm{th} powers of the respective blocks, i.e. \left(A_1 \oplus A_2 \oplus A_3 \oplus\ldots\right)^k=A^k_1 \oplus A_2^k \oplus A_3^k \oplus\ldots, and that A^k=C^{-1}J^k C\,, the above matrix power series becomes

f(A)=C^{-1}f(J)C=C^{-1}\left(\bigoplus_{k=1}^N f\left(J_{\lambda_k ,m_k}\right)\right)C

where the last series must not be computed explicitly via power series of every Jordan block. In fact, if \lambda\in\mathit{\Omega}, any holomorphic function of a Jordan block f(J_{\lambda,n})\, is the following upper triangular matrix:

f(J_{\lambda,n})=\left(\begin{matrix}
f(\lambda) & f^\prime (\lambda) & \frac{f^{\prime\prime}(\lambda)}{2} & \cdots & \frac{f^{(n-2)}(\lambda)}{(n-2)!} & \frac{f^{(n-1)}(\lambda)}{(n-1)!} \\
0 & f(\lambda) & f^\prime (\lambda) & \cdots & \frac{f^{(n-3)}(\lambda)}{(n-3)!} & \frac{f^{(n-2)}(\lambda)}{(n-2)!} \\
0 & 0 & f(\lambda) & \cdots & \frac{f^{(n-4)}(\lambda)}{(n-4)!} & \frac{f^{(n-3)}(\lambda)}{(n-3)!} \\
\vdots & \vdots & \vdots & \ddots & \vdots & \vdots \\
0 & 0 & 0 & \cdots & f(\lambda) & f^\prime (\lambda) \\
0 & 0 & 0 & \cdots & 0 & f(\lambda) \\
\end{matrix}\right)=\left(\begin{matrix}
a_0 & a_1 & a_2 & \cdots & a_{n-1} \\
0 & a_0 & a_1 & \cdots & a_{n-2} \\
0 & 0 & a_0 & \cdots & a_{n-3} \\
\vdots & \vdots & \vdots & \ddots & \vdots \\
0 & 0 & 0 & \cdots & a_1 \\
0 & 0 & 0 & \cdots & a_0
\end{matrix}\right).

As a consequence of this, the computation of any functions of a matrix is straightforward whenever its Jordan normal form and its change-of-basis matrix are known. Also, \mathrm{spec}f(A)=f(\mathrm{spec}A), i.e. every eigenvalue \lambda\in\mathrm{spec}A corresponds to the eigenvalue f(\lambda)\in\mathrm{spec}f(A), but it has, in general, different algebraic multiplicity, geometric multiplicity and index. However, the algebraic multiplicity may be computed as follows:

\text{mul}_{f(A)}f(\lambda)=\sum_{\mu\in\text{spec}A\cap f^{-1}(f(\lambda))}~\text{mul}_A \mu.\,

The function f(T) of a linear transformation T between vector spaces can be defined in a similar way according to the holomorphic functional calculus, where Banach space and Riemann surface theories play a fundamental role. In the case of finite-dimensional spaces, both theories perfectly match.

Dynamical systems[edit]

Now suppose a (complex) dynamical system is simply defined by the equation

\dot{\mathbf{z}}(t)=A(\mathbf{c})\mathbf{z}(t),
\mathbf{z}(0)=\mathbf{z}_0 \in\mathbb{C}^n,

where \mathbf{z}:\mathbb{R_+}\rightarrow \mathcal{R} is the (n-dimensional) curve parametrization of an orbit on the Riemann surface \mathcal{R} of the dynamical system, whereas A(\mathbf{c}) is an n\times n complex matrix whose elements are complex functions of a d-dimensional parameter \mathbf{c}\in\mathbb{C}^d. Even if A\in\mathbb{M}_n \left(\mathrm{C}^0(\mathbb{C}^d)\right) (i.e. A continuously depends on the parameter \mathbf{c}) the Jordan normal form of the matrix is continuously deformed almost everywhere on \mathbb{C}^d but, in general, not everywhere: there is some critical submanifold of \mathbb{C}^d on which the Jordan form abruptly changes its structure whenever the parameter crosses or simply “travels” around it (monodromy). Such changes mean that several Jordan blocks (either belonging to different eigenvalues or not) join together to a unique Jordan block, or vice versa (i.e. one Jordan block splits into two or more different ones). Many aspects of bifurcation theory for both continuous and discrete dynamical systems can be interpreted with the analysis of functional Jordan matrices.

From the tangent space dynamics, this means that the orthogonal decomposition of the dynamical system's phase space changes and, for example, different orbits gain periodicity, or lose it, or shift from a certain kind of periodicity to another (such as period-doubling, cfr. logistic map).

In a sentence, the qualitative behaviour of such a dynamical system may substantially change as the versal deformation of the Jordan normal form of A(\mathbf{c}).

Linear ordinary differential equations[edit]

The simplest example of a dynamical system is a system of linear, constant-coefficient, ordinary differential equations, i.e. let A\in\mathbb{M}_n (\mathbb{C}) and \mathbf{z}_0 \in\mathbb{C}^n:

\dot{\mathbf{z}}(t)=A\mathbf{z}(t),
\mathbf{z}(0)=\mathbf{z}_0,

whose direct closed-form solution involves computation of the matrix exponential:

\mathbf{z}(t)=e^{tA}\mathbf{z}_0.

Another way, provided the solution is restricted to the local Lebesgue space of n-dimensional vector fields \mathbf{z}\in\mathrm{L}_{\mathrm{loc}}^1 (\mathbb{R}_+)^n, is to use its Laplace transform \mathbf{Z}(s)=\mathcal{L}[\mathbf{z}](s). In this case

\mathbf{Z}(s)=\left(sI-A\right)^{-1}\mathbf{z}_0.

The matrix function \left(A-sI\right)^{-1} is called the resolvent matrix of the differential operator \frac{\mathrm{d}}{\mathrm{d}t}-A. It is meromorphic with respect to the complex parameter s\in\mathbb{C} since its matrix elements are rational functions whose denominator is equal for all to \det (A-sI). Its polar singularities are the eigenvalues of A, whose order equals their index for it, i.e. \mathrm{ord}_{(A-sI)^{-1}}\lambda=\mathrm{idx}_A \lambda.

See also[edit]

Notes[edit]

  1. ^ Beauregard & Fraleigh (1973, pp. 310-316)
  2. ^ Golub & Van Loan (1996, p. 317)
  3. ^ Nering (1970, pp. 118-127)
  4. ^ Beauregard & Fraleigh (1973, pp. 270-274)
  5. ^ Golub & Van Loan (1996, p. 316)
  6. ^ Nering (1970, pp. 113-118)

References[edit]

Further reading[edit]