Eigenvalues and eigenvectors

(Redirected from Algebraic multiplicity)
In this shear mapping the red arrow changes direction but the blue arrow does not. The blue arrow is an eigenvector of this shear mapping because it doesn't change direction, and since its length is unchanged, its eigenvalue is 1.

In linear algebra, an eigenvector or characteristic vector of a square matrix is a vector that points in a direction which is invariant under the associated linear transformation. In other words, if v is a vector which is not zero, then it is an eigenvector of a square matrix A if Av is a scalar multiple of v. This condition could be written as the equation

$A\mathbf{v} = \lambda \mathbf{v},$

(1)

where λ is a number (also called a scalar) known as the eigenvalue or characteristic value associated with the eigenvector v.

There is a correspondence between n by n square matrices and linear transformation from an n-dimensional vector space to itself. For this reason, it is equivalent to define eigenvalues and eigenvectors using either the language of matrices or the language of linear transformations.

Overview

If two-dimensional space is visualized as a rubber sheet, a linear map with two eigenvectors would be a stretching along two directions corresponding to the eigenvectors. For example, the sheet could be stretched by a factor λ1 along the x-axis and λ2 along the y-axis. In two dimensions, there can be two such independent stretching directions, but they do not have to be at right angles to each other. A rotation in two dimensions is a linear map with no eigenvectors, and a shear, as in the photo, has only one eigenvector, with eigenvalue 1. Eigenvectors do not change in direction, and the stretch factor for each is its respective eigenvalue. Other vectors will change in direction, unless the two eigenvalues are equal, in which case all vectors will be eigenvectors with that eigenvalue, yielding a magnification, a linear map which alters neither shape nor direction, but only magnitude. A reflection may be viewed as stretching a line perpendicular to the axis of reflection by a factor of −1 while stretching the axis of reflection by a factor of 1. For 3D rotations, the axis of rotation is an eigenvector of eigenvalue 1.

A three-coordinate vector may be seen as an arrow in three-dimensional space starting at the origin. In that case, an eigenvector $v$ is an arrow whose direction is either preserved or exactly reversed after multiplication by $A$. The corresponding eigenvalue determines how the length of the arrow is changed by the operation, and whether its direction is reversed or not, determined by whether the eigenvalue is negative or positive.

In abstract linear algebra, these concepts are naturally extended to more general situations, where the set of real scalar factors is replaced by any field of scalars (such as algebraic or complex numbers); the set of Cartesian vectors $\mathbb{R}^n$ is replaced by any vector space (such as the continuous functions, the polynomials or the trigonometric series), and matrix multiplication is replaced by any linear operator that maps vectors to vectors (such as the derivative from calculus). In such cases, the "vector" in "eigenvector" may be replaced by a more specific term, such as "eigenfunction", "eigenmode", "eigenface", or "eigenstate". Thus, for example, the exponential function $f(x) = e^{\lambda x}$ is an eigenfunction of the derivative operator, $'$ , with eigenvalue $\lambda$, since its derivative is $f'(x) = \lambda e^{\lambda x} = \lambda f(x)$.

The set of all eigenvectors of a matrix (or linear operator), each paired with its corresponding eigenvalue, is called the eigensystem of that matrix.[1] Any nonzero scalar multiple of an eigenvector is also an eigenvector corresponding to the same eigenvalue. An eigenspace or characteristic space of a matrix $A$ is the set of all eigenvectors of $A$ corresponding to the same eigenvalue, together with the zero vector.[2][3] An eigenbasis for $A$ is any basis for the set of all vectors that consists of linearly independent eigenvectors of $A$. Not every matrix has an eigenbasis, but every symmetric matrix does.

The prefix eigen- is adopted from the German word eigen for "own-", "unique to", "peculiar to", or "belonging to" in the sense of "idiosyncratic" in relation to the originating matrix.

Eigenvalues and eigenvectors have many applications in both pure and applied mathematics. They are used in matrix factorization, in quantum mechanics, and in many other areas.

History

Eigenvalues are often introduced in the context of linear algebra or matrix theory. Historically, however, they arose in the study of quadratic forms and differential equations.

In the 18th century Euler studied the rotational motion of a rigid body and discovered the importance of the principal axes.[4] Lagrange realized that the principal axes are the eigenvectors of the inertia matrix.[5] In the early 19th century, Cauchy saw how their work could be used to classify the quadric surfaces, and generalized it to arbitrary dimensions.[6] Cauchy also coined the term racine caractéristique (characteristic root) for what is now called eigenvalue; his term survives in characteristic equation.[7]

Fourier used the work of Laplace and Lagrange to solve the heat equation by separation of variables in his famous 1822 book Théorie analytique de la chaleur.[8] Sturm developed Fourier's ideas further and brought them to the attention of Cauchy, who combined them with his own ideas and arrived at the fact that real symmetric matrices have real eigenvalues.[6] This was extended by Hermite in 1855 to what are now called Hermitian matrices.[7] Around the same time, Brioschi proved that the eigenvalues of orthogonal matrices lie on the unit circle,[6] and Clebsch found the corresponding result for skew-symmetric matrices.[7] Finally, Weierstrass clarified an important aspect in the stability theory started by Laplace by realizing that defective matrices can cause instability.[6]

In the meantime, Liouville studied eigenvalue problems similar to those of Sturm; the discipline that grew out of their work is now called Sturm–Liouville theory.[9] Schwarz studied the first eigenvalue of Laplace's equation on general domains towards the end of the 19th century, while Poincaré studied Poisson's equation a few years later.[10]

At the start of the 20th century, Hilbert studied the eigenvalues of integral operators by viewing the operators as infinite matrices.[11] He was the first to use the German word eigen which means "own", to denote eigenvalues and eigenvectors in 1904,[12] though he may have been following a related usage by Helmholtz. For some time, the standard term in English was "proper value", but the more distinctive term "eigenvalue" is standard today.[13]

The first numerical algorithm for computing eigenvalues and eigenvectors appeared in 1929, when Von Mises published the power method. One of the most popular methods today, the QR algorithm, was proposed independently by John G.F. Francis[14] and Vera Kublanovskaya[15] in 1961.[16]

Real matrices

Matrix $A$ acts by stretching the vector $x$, not changing its direction, so $x$ is an eigenvector of $A$.

Consider n-dimensional vectors that are formed as a list of n real numbers, such as the three dimensional vectors,

$\mathbf{u} = \begin{Bmatrix}1\\3\\4\end{Bmatrix}\quad\mbox{and}\quad \mathbf{v} = \begin{Bmatrix}-20\\-60\\-80\end{Bmatrix}.$

These vectors are said to be scalar multiples of each other, also parallel or collinear, if there is a scalar λ, such that

$\mathbf{u}=\lambda\mathbf{v}.$

In this case λ = −1/20.

Now consider the linear transformation of n-dimensional vectors defined by an n×n matrix A, that is,

$A\mathbf{v}=\mathbf{w},$

or

$\begin{bmatrix} A_{1,1} & A_{1,2} & \ldots & A_{1,n} \\ A_{2,1} & A_{2,2} & \ldots & A_{2,n} \\ \vdots & \vdots & \ddots & \vdots \\ A_{n,1} & A_{n,2} & \ldots & A_{n,n} \\ \end{bmatrix} \begin{Bmatrix} v_1 \\ v_2 \\ \vdots \\ v_n \end{Bmatrix} = \begin{Bmatrix} w_1 \\ w_2 \\ \vdots \\ w_n \end{Bmatrix}$

where, for each index $i$,

$w_i = A_{i,1} v_1 + A_{i,2} v_2 + \cdots + A_{i,n} v_n = \sum_{j = 1}^{n} A_{i,j} v_j$.

If it occurs that w and v are scalar multiples, that is if

$A\mathbf{v}=\lambda\mathbf{v},$

then v is an eigenvector of the linear transformation A and the scale factor λ is the eigenvalue corresponding to that eigenvector.

The transformation matrix A = $\bigl[ \begin{smallmatrix} 2 & 1\\ 1 & 2 \end{smallmatrix} \bigr]$ preserves the direction of vectors parallel to v = (1,−1)T (in purple) and w = (1,1)T (in blue). The vectors in red are not parallel to either eigenvector, so, their directions are changed by the transformation. See also: An extended version, showing all four quadrants.

Two dimensional example

Consider the transformation matrix A, given by,

$A = \begin{bmatrix} 2 & 1\\1 & 2 \end{bmatrix}.$

The figure shows the effect of this transformation on point coordinates in the plane. The eigenvectors v of this transformation satisfy the equation,

$A\mathbf{v}=\lambda\mathbf{v}.$

Rearrange this equation to obtain

$(A-\lambda I)\mathbf{v}=0,$

which has a solution only when its determinant | AλI | equals zero.

Set the determinant to zero to obtain the polynomial equation,

$p(\lambda)=|A-\lambda I| = 3 -4\lambda + \lambda^2 = 0,$

known as the characteristic polynomial of the matrix A. In this case, it has the roots λ = 1 and λ = 3.

For λ = 1, the equation becomes,

$(A- I)\mathbf{v}=\begin{bmatrix} 1 & 1\\ 1 & 1\end{bmatrix}\begin{Bmatrix}v_1 \\ v_2\end{Bmatrix} =\begin{Bmatrix}0 \\ 0\end{Bmatrix},$

which has the solution,

$\mathbf{v} = \begin{Bmatrix} 1 \\ -1 \end{Bmatrix}.$

For λ = 3, the equation becomes,

$(A-3 I)\mathbf{w}= \begin{bmatrix} -1 & 1\\ 1 & -1\end{bmatrix}\begin{Bmatrix}w_1 \\ w_2\end{Bmatrix} =\begin{Bmatrix}0 \\ 0\end{Bmatrix},$

which has the solution,

$\mathbf{w} = \begin{Bmatrix} 1 \\ 1 \end{Bmatrix}.$

Thus, the vectors v and w are eigenvectors of A associated with the eigenvalues λ = 1 and λ = 3, respectively.

Three dimensional example

The eigenvectors v of the 3×3 matrix A,

$A= \begin{bmatrix} 2 & 0 & 1\\0 & 2 & 0\\ 1 & 0 & 2\end{bmatrix},$

satisfy the equation

$(A-\lambda I)\mathbf{v}=0.$

This equation has solutions only if the determinant | AλI | equals zero, which yields the characteristic polynomial,

$p(\lambda)=|A-\lambda I| = 6-11\lambda +6\lambda^2-\lambda^3,$

with the roots λ = 1, λ = 2 and λ = 3.

Associated with the roots λ = 1, λ = 2 and λ = 3 are the respective eigenvectors,

$\mathbf{u}=\begin{Bmatrix}1\\0\\-1\end{Bmatrix},\quad \mathbf{v}=\begin{Bmatrix}0\\1\\0\end{Bmatrix}, \quad \mbox{and}\quad \mathbf{w}=\begin{Bmatrix}1\\0\\1\end{Bmatrix},$

Diagonal matrices

Matrices with entries only along the main diagonal are called diagonal matrices. It is easy to see that the eigenvalues of a diagonal matrix are the diagonal elements themselves. Consider the matrix A,

$A=\begin{bmatrix} 1 & 0 & 0\\ 0&2& 0\\0&0&3\end{bmatrix}.$

The characteristic polynomial of A is given by

$p(\lambda)=|A-\lambda I| = (1-\lambda)(2-\lambda)(3-\lambda)=0,$

which has the roots λ = 1, λ = 2 and λ = 3.

Associated with these roots are the eigenvectors,

$\mathbf{u}=\begin{Bmatrix}1\\0\\0\end{Bmatrix}, \quad \mathbf{v}=\begin{Bmatrix}0\\1\\0\end{Bmatrix},\quad \mbox{and} \quad\mathbf{w}=\begin{Bmatrix}0\\0\\1\end{Bmatrix},$

respectively.

Triangular matrices

A matrix with elements above the main diagonal that are all zeros is described as a triangular matrix, or in this case, lower triangular. If the elements below the main diagonal are all zeros then the matrix is upper triangular. The eigenvalues of triangular matrices are the elements of the main diagonal, in the same way as for diagonal matrices.

Consider the lower triangular matrix A,

$A=\begin{bmatrix} 1 & 0 & 0\\ 1&2& 0\\2&3&3\end{bmatrix}.$

The characteristic polynomial of A is given by

$p(\lambda)=|A-\lambda I| = (1-\lambda)(2-\lambda)(3-\lambda)=0,$

which has the roots λ = 1, λ = 2 and λ = 3.

Associated with these roots are the eigenvectors,

$\mathbf{u}=\begin{Bmatrix}1\\-1\\1/2\end{Bmatrix}, \quad \mathbf{v}=\begin{Bmatrix}0\\1\\-3\end{Bmatrix},\quad \mbox{and} \quad\mathbf{w}=\begin{Bmatrix}0\\0\\1\end{Bmatrix},$

respectively.

Eigenvector basis

In this section, it is shown that a change of coordinates of a matrix A to a basis formed by its eigenvectors results in a diagonal matrix.

Let A be an n×n linear transformation

$\mathbf{X}=A\mathbf{x},$

that has n linearly independent eigenvectors vi, and consider the change of coordinates of A so that it is defined relative to its eigenvector basis.

Recall that the eigenvectors vi of A satisfy the eigenvalue equation,

$A\mathbf{v}_i=\lambda_i \mathbf{v}_i\quad i=1\ldots, n.$

Assemble these eigenvectors into the matrix V, which is invertible because these vectors are assumed to be linearly independent. This means the coordinates of x and X relative to the basis vi can be computed to be,

$\mathbf{y}=V^{-1}\mathbf{x},\quad \mbox{and}\quad \mathbf{Y}=V^{-1}\mathbf{X}.$

This yields the change of coordinates

$K=V^{-1}AV.$

To see the effect of this change of coordinates on A, introduce I=VV−1 into the eigenvalue equation

$AV(V^{-1}\mathbf{v}_i)=\lambda_i V(V^{-1} \mathbf{v}_i),$

and multiply both side by V−1 to obtain

$V^{-1}AV(V^{-1}\mathbf{v}_i)=\lambda_i (V^{-1} \mathbf{v}_i),$

Notice that

$V^{-1}\mathbf{v}_i=\mathbf{e}_i,$

which is the natural basis vector. Thus,

$V^{-1}AV\mathbf{e}_i=K\mathbf{e}_i=\lambda_i \mathbf{e}_i,$

and the matrix K is found to be a diagonal matrix with the eigenvalues λi as its diagonal elements.

This shows that a matrix A with a linearly independent system of eigenvectors is similar to a diagonal matrix formed from its eigenvalues.

Matrices

Characteristic polynomial

The eigenvalue equation for a matrix $A$ is

$A v - \lambda v = 0,$

which is equivalent to

$(A-\lambda I)v = 0,$

where $I$ is the $n\times n$ identity matrix. It is a fundamental result of linear algebra that an equation $M v = 0$ has a non-zero solution $v$ if, and only if, the determinant $\det(M)$ of the matrix $M$ is zero. It follows that the eigenvalues of $A$ are precisely the real numbers $\lambda$ that satisfy the equation

$\det(A-\lambda I) = 0$

The left-hand side of this equation can be seen (using Leibniz' rule for the determinant) to be a polynomial function of the variable $\lambda$. The degree of this polynomial is $n$, the order of the matrix. Its coefficients depend on the entries of $A$, except that its term of degree $n$ is always $(-1)^n\lambda^n$. This polynomial is called the characteristic polynomial of $A$; and the above equation is called the characteristic equation (or, less often, the secular equation) of $A$.

For example, let $A$ be the matrix

$A = \begin{bmatrix} 2 & 0 & 0 \\ 0 & 3 & 4 \\ 0 & 4 & 9 \end{bmatrix}$

The characteristic polynomial of $A$ is

$\det (A-\lambda I) \;=\; \det \left(\begin{bmatrix} 2 & 0 & 0 \\ 0 & 3 & 4 \\ 0 & 4 & 9 \end{bmatrix} - \lambda \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix}\right) \;=\; \det \begin{bmatrix} 2 - \lambda & 0 & 0 \\ 0 & 3 - \lambda & 4 \\ 0 & 4 & 9 - \lambda \end{bmatrix}$

which is

$(2 - \lambda) \bigl[ (3 - \lambda) (9 - \lambda) - 16 \bigr] = -\lambda^3 + 14\lambda^2 - 35\lambda + 22$

The roots of this polynomial are 2, 1, and 11. Indeed these are the only three eigenvalues of $A$, corresponding to the eigenvectors $[1,0,0]',$ $[0,2,-1]',$ and $[0,1,2]'$ (or any non-zero multiples thereof).

Real domain

Since the eigenvalues are roots of the characteristic polynomial, an $n\times n$ matrix has at most $n$ eigenvalues. If the matrix has real entries, the coefficients of the characteristic polynomial are all real; but it may have fewer than $n$ real roots, or no real roots at all.

For example, consider the cyclic permutation matrix

$A = \begin{bmatrix} 0 & 1 & 0\\0 & 0 & 1\\ 1 & 0 & 0\end{bmatrix}$

This matrix shifts the coordinates of the vector up by one position, and moves the first coordinate to the bottom. Its characteristic polynomial is $1 - \lambda^3$ which has one real root $\lambda_1 = 1$. Any vector with three equal non-zero coordinates is an eigenvector for this eigenvalue. For example,

$A \begin{bmatrix} 5\\5\\5 \end{bmatrix} = \begin{bmatrix} 5\\5\\5 \end{bmatrix} = 1 \cdot \begin{bmatrix} 5\\5\\5 \end{bmatrix}$

Complex domain

The fundamental theorem of algebra implies that the characteristic polynomial of an $n\times n$ matrix $A$, being a polynomial of degree $n$, has exactly $n$ complex roots. More precisely, it can be factored into the product of $n$ linear terms,

$\det(A-\lambda I) = (\lambda_1 - \lambda )(\lambda_2 - \lambda)\cdots(\lambda_n - \lambda)$

where each $\lambda_i$ is a complex number. The numbers $\lambda_1$, $\lambda_2$, ... $\lambda_n$, (which may not be all distinct) are roots of the polynomial, and are precisely the eigenvalues of $A$.

Even if the entries of $A$ are all real numbers, the eigenvalues may still have non-zero imaginary parts (and the coordinates of the corresponding eigenvectors will therefore also have non-zero imaginary parts). Also, the eigenvalues may be irrational numbers even if all the entries of $A$ are rational numbers, or all are integers. However, if the entries of $A$ are algebraic numbers (which include the rationals), the eigenvalues will be (complex) algebraic numbers too.

The non-real roots of a real polynomial with real coefficients can be grouped into pairs of complex conjugate values, namely with the two members of each pair having the same real part and imaginary parts that differ only in sign. If the degree is odd, then by the intermediate value theorem at least one of the roots will be real. Therefore, any real matrix with odd order will have at least one real eigenvalue; whereas a real matrix with even order may have no real eigenvalues.

In the example of the 3×3 cyclic permutation matrix $A$, above, the characteristic polynomial $1 - \lambda^3$ has two additional non-real roots, namely

$\lambda_2 = -1/2 + \mathbf{i}\sqrt{3}/2\quad\quad$ and $\quad\quad\lambda_3 = \lambda_2^* = -1/2 - \mathbf{i}\sqrt{3}/2$,

where $\mathbf{i}= \sqrt{-1}$ is the imaginary unit. Note that $\lambda_2\lambda_3 = 1$, $\lambda_2^2 = \lambda_3$, and $\lambda_3^2 = \lambda_2$. Then

$A \begin{bmatrix} 1 \\ \lambda_2 \\ \lambda_3 \end{bmatrix} = \begin{bmatrix} \lambda_2\\ \lambda_3 \\1 \end{bmatrix} = \lambda_2 \cdot \begin{bmatrix} 1\\ \lambda_2 \\ \lambda_3 \end{bmatrix} \quad\quad$ and $\quad\quad A \begin{bmatrix} 1 \\ \lambda_3 \\ \lambda_2 \end{bmatrix} = \begin{bmatrix} \lambda_3 \\ \lambda_2 \\ 1 \end{bmatrix} = \lambda_3 \cdot \begin{bmatrix} 1 \\ \lambda_3 \\ \lambda_2 \end{bmatrix}$

Therefore, the vectors $[1,\lambda_2,\lambda_3]'$ and $[1,\lambda_3,\lambda_2]'$ are eigenvectors of $A$, with eigenvalues $\lambda_2$, and $\lambda_3$, respectively.

Algebraic multiplicity

Let $\lambda_i$ be an eigenvalue of an $n\times n$ matrix $A$. The algebraic multiplicity $\mu_A(\lambda_i)$ of $\lambda_i$ is its multiplicity as a root of the characteristic polynomial, that is, the largest integer $k$ such that $(\lambda - \lambda_i)^k$ divides evenly that polynomial.[17][18]

Like the geometric multiplicity $\gamma_A(\lambda_i)$, we have $1 \leq \mu_A(\lambda_i) \leq n$; and the sum $\boldsymbol{\mu}_A$ of $\mu_A(\lambda_i)$ over all distinct eigenvalues also cannot exceed $n$. If complex eigenvalues are considered, $\boldsymbol{\mu}_A$ is exactly $n$.

It can be proved that the geometric multiplicity $\gamma_A(\lambda_i)$ of an eigenvalue never exceeds its algebraic multiplicity $\mu_A(\lambda_i)$. Therefore, $\boldsymbol{\gamma}_A$ is at most $\boldsymbol{\mu}_A$.[17]

If $\mu_A(\lambda_i) = 1$, then $\lambda_i$ is said to be a simple eigenvalue.[18]

If $\gamma_A(\lambda_i) = \mu_A(\lambda_i)$, then $\lambda_i$ is said to be a semisimple eigenvalue.

Example

For the matrix: $A= \begin{bmatrix} 2 & 0 & 0 & 0 \\ 1 & 2 & 0 & 0 \\ 0 & 1 & 3 & 0 \\ 0 & 0 & 1 & 3 \end{bmatrix},$

the characteristic polynomial of $A$ is $\det (A-\lambda I) \;=\; \det \begin{bmatrix} 2- \lambda & 0 & 0 & 0 \\ 1 & 2- \lambda & 0 & 0 \\ 0 & 1 & 3- \lambda & 0 \\ 0 & 0 & 1 & 3- \lambda \end{bmatrix}= (2 - \lambda)^2 (3 - \lambda)^2$,
being the product of the diagonal with a lower triangular matrix.

The roots of this polynomial, and hence the eigenvalues, are 2 and 3. The algebraic multiplicity of each eigenvalue is 2; in other words they are both double roots. On the other hand, the geometric multiplicity of the eigenvalue 2 is only 1, because its eigenspace is spanned by the vector $[0,1,-1,1]$, and is therefore 1-dimensional. Similarly, the geometric multiplicity of the eigenvalue 3 is 1 because its eigenspace is spanned by $[0,0,0,1]$. Hence, the total algebraic multiplicity of A, denoted $\mu_A$, is 4, which is the most it could be for a 4 by 4 matrix. The geometric multiplicity $\gamma_A$ is 2, which is the smallest it could be for a matrix which has two distinct eigenvalues.

Diagonalization and eigendecomposition

If the sum $\boldsymbol{\gamma}_A$ of the geometric multiplicities of all eigenvalues is exactly $n$, then $A$ has a set of $n$ linearly independent eigenvectors. Let $Q$ be a square matrix whose columns are those eigenvectors, in any order. Then we will have $A Q = Q\Lambda$, where $\Lambda$ is the diagonal matrix such that $\Lambda_{i i}$ is the eigenvalue associated to column $i$ of $Q$. Since the columns of $Q$ are linearly independent, the matrix $Q$ is invertible. Premultiplying both sides by $Q^{-1}$ we get $Q^{-1}A Q = \Lambda$. By definition, therefore, the matrix $A$ is diagonalizable.

Conversely, if $A$ is diagonalizable, let $Q$ be a non-singular square matrix such that $Q^{-1} A Q$ is some diagonal matrix $D$. Multiplying both sides on the left by $Q$ we get $A Q = Q D$. Therefore each column of $Q$ must be an eigenvector of $A$, whose eigenvalue is the corresponding element on the diagonal of $D$. Since the columns of $Q$ must be linearly independent, it follows that $\boldsymbol{\gamma}_A = n$. Thus $\boldsymbol{\gamma}_A$ is equal to $n$ if and only if $A$ is diagonalizable.

If $A$ is diagonalizable, the space of all $n$-coordinate vectors can be decomposed into the direct sum of the eigenspaces of $A$. This decomposition is called the eigendecomposition of $A$, and it is preserved under change of coordinates.

A matrix that is not diagonalizable is said to be defective. For defective matrices, the notion of eigenvector can be generalized to generalized eigenvectors, and that of diagonal matrix to a Jordan form matrix. Over an algebraically closed field, any matrix $A$ has a Jordan form and therefore admits a basis of generalized eigenvectors, and a decomposition into generalized eigenspaces

Further properties

Let $A$ be an arbitrary $n\times n$ matrix of complex numbers with eigenvalues $\lambda_1$, $\lambda_2$, ... $\lambda_n$. (Here it is understood that an eigenvalue with algebraic multiplicity $\mu$ occurs $\mu$ times in this list.) Then

• The trace of $A$, defined as the sum of its diagonal elements, is also the sum of all eigenvalues:
$\operatorname{tr}(A) = \sum_{i=1}^n A_{i i} = \sum_{i=1}^n \lambda_i = \lambda_1+ \lambda_2 +\cdots+ \lambda_n$.
• The determinant of $A$ is the product of all eigenvalues:
$\operatorname{det}(A) = \prod_{i=1}^n \lambda_i=\lambda_1\lambda_2\cdots\lambda_n$.
• The eigenvalues of the $k$th power of $A$, i.e. the eigenvalues of $A^k$, for any positive integer $k$, are $\lambda_1^k,\lambda_2^k,\dots,\lambda_n^k$
• The matrix $A$ is invertible if and only if all the eigenvalues $\lambda_i$ are nonzero.
• If $A$ is invertible, then the eigenvalues of $A^{-1}$ are $1/\lambda_1,1/\lambda_2,\dots,1/\lambda_n$. Clearly, the geometric multiplicities coincide. Moreover, since the characteristic polynomial of the inverse is the reciprocal polynomial for that of the original, they share the same algebraic multiplicity.
• If $A$ is equal to its conjugate transpose $A^*$ (in other words, if $A$ is Hermitian), then every eigenvalue is real. The same is true of any a symmetric real matrix. If $A$ is also positive-definite, positive-semidefinite, negative-definite, or negative-semidefinite every eigenvalue is positive, non-negative, negative, or non-positive respectively.
• Every eigenvalue of a unitary matrix has absolute value $|\lambda|=1$.

Left and right eigenvectors

The use of matrices with a single column (rather than a single row) to represent vectors is traditional in many disciplines. For that reason, the word "eigenvector" almost always means a right eigenvector, namely a column vector that must be placed to the right of the matrix $A$ in the defining equation

$A v = \lambda v$.

There may be also single-row vectors that are unchanged when they occur on the left side of a product with a square matrix $A$; that is, which satisfy the equation

$u A = \lambda u$

Any such row vector $u$ is called a left eigenvector of $A$.

The left eigenvectors of $A$ are transposes of the right eigenvectors of the transposed matrix $A^\mathsf{T}$, since their defining equation is equivalent to

$A^\mathsf{T} u^\mathsf{T} = \lambda u^\mathsf{T}$

It follows that, if $A$ is Hermitian, its left and right eigenvectors are complex conjugates. In particular if $A$ is a real symmetric matrix, they are the same except for transposition.

Variational characterization

Main article: Min-max theorem

In the Hermitian case, eigenvalues can be given a variational characterization. The largest eigenvalue of $H$ is the maximum value of the quadratic form $x^\mathsf{T} H x/x^\mathsf{T} x$. A value of $x$ that realizes that maximum, is an eigenvector.

General definition

The concept of eigenvectors and eigenvalues extends naturally to abstract linear transformations on abstract vector spaces. Namely, let $V$ be any vector space over some field $K$ of scalars, and let $T$ be a linear transformation mapping $V$ into $V$. We say that a non-zero vector $v$ of $V$ is an eigenvector of $T$ if (and only if) there is a scalar $\lambda$ in $K$ such that

$T(v)=\lambda v$.

This equation is called the eigenvalue equation for $T$, and the scalar $\lambda$ is the eigenvalue of $T$ corresponding to the eigenvector $v$. Note that $T(v)$ means the result of applying the operator $T$ to the vector $v$, while $\lambda v$ means the product of the scalar $\lambda$ by $v$.[19]

The matrix-specific definition is a special case of this abstract definition. Namely, the vector space $V$ is the set of all column vectors of a certain size $n$×1, and $T$ is the linear transformation that consists in multiplying a vector by the given $n\times n$ matrix $A$.

Some authors allow $v$ to be the zero vector in the definition of eigenvector.[20] This is reasonable as long as we define eigenvalues and eigenvectors carefully: If we would like the zero vector to be an eigenvector, then we must first define an eigenvalue of $T$ as a scalar $\lambda$ in $K$ such that there is a nonzero vector $v$ in $V$ with $T(v) = \lambda v$. We then define an eigenvector to be a vector $v$ in $V$ such that there is an eigenvalue $\lambda$ in $K$ with $T(v) = \lambda v$. This way, we ensure that it is not the case that every scalar is an eigenvalue corresponding to the zero vector.

Geometric multiplicity

The geometric multiplicity $\gamma_T(\lambda)$ of an eigenvalue $\lambda$ is the dimension of the eigenspace associated with $\lambda$, i.e., the maximum number of vectors in any linearly independent set of eigenvectors with that eigenvalue.[17][18] It is clear from the definition of eigenvalue in the eigenvalue equation (1) that we always have $\gamma_T(\lambda) \geq 1.$

Eigenspace and spectrum

If $v$ is an eigenvector of $T$, with eigenvalue $\lambda$, then any scalar multiple $\alpha v$ of $v$ with nonzero $\alpha$ is also an eigenvector with eigenvalue $\lambda$, since $T(\alpha v) = \alpha T(v) = \alpha(\lambda v) = \lambda(\alpha v)$. Moreover, if $u$ and $v$ are eigenvectors with the same eigenvalue $\lambda$ and $u \neq -v$, then $u+v$ is also an eigenvector with the same eigenvalue $\lambda$. Therefore, the set of all eigenvectors with the same eigenvalue $\lambda$, together with the zero vector, is a linear subspace of $V$, called the eigenspace of $T$ associated to $\lambda$.[21][22][23] If that subspace has dimension 1, it is sometimes called an eigenline.[24]

The eigenspaces of T always form a direct sum (and as a consequence any family of eigenvectors for different eigenvalues is always linearly independent). Therefore the sum of the dimensions of the eigenspaces cannot exceed the dimension n of the space on which T operates, and in particular there cannot be more than n distinct eigenvalues.[25]

Any subspace spanned by eigenvectors of $T$ is an invariant subspace of $T$, and the restriction of T to such a subspace is diagonalizable.

The set of eigenvalues of $T$ is sometimes called the spectrum of $T$.

Eigenbasis

An eigenbasis for a linear operator $T$ that operates on a vector space $V$ is a basis for $V$ that consists entirely of eigenvectors of $T$ (possibly with different eigenvalues). Such a basis exists precisely if the direct sum of the eigenspaces equals the whole space, in which case one can take the union of bases chosen in each of the eigenspaces as eigenbasis. The matrix of T in a given basis is diagonal precisely when that basis is an eigenbasis for T, and for this reason T is called diagonalizable if it admits an eigenbasis.

Dynamic equations

The simplest difference equations have the form

$x_t=a_1x_{t-1} +a_2x_{t-2}+\cdots +a_kx_{t-k}.$

The solution of this equation for x in terms of t is found by using its characteristic equation

$\lambda^k-a_1\lambda^{k-1} -a_2\lambda^{k-2}-\cdots -a_{k-1}\lambda-a_k=0,$

which can be found by stacking into matrix form a set of equations consisting of the above difference equation and the k–1 equations $x_{t-1}=x_{t-1}, \dots , x_{t-k+1}=x_{t-k+1},$ giving a k-dimensional system of the first order in the stacked variable vector $[x_t, \dots , x_{t-k+1}]$ in terms of its once-lagged value, and taking the characteristic equation of this system's matrix. This equation gives k characteristic roots $\lambda_1, \dots , \lambda_k,$ for use in the solution equation

$x_t=c_1\lambda _1^t+\cdots + c_k\lambda _k^t.$

A similar procedure is used for solving a differential equation of the form

$\frac{d^{k}x}{dt^k}+a_{k-1}\frac{d^{k-1}x}{dt^{k-1}}+\cdots +a_1\frac{dx}{dt} +a_0x=0.$

Calculation

Main article: Eigenvalue algorithm

Eigenvalues

The eigenvalues of a matrix $A$ can be determined by finding the roots of the characteristic polynomial. Explicit algebraic formulas for the roots of a polynomial exist only if the degree $n$ is 4 or less. According to the Abel–Ruffini theorem there is no general, explicit and exact algebraic formula for the roots of a polynomial with degree 5 or more.

It turns out that any polynomial with degree $n$ is the characteristic polynomial of some companion matrix of order $n$. Therefore, for matrices of order 5 or more, the eigenvalues and eigenvectors cannot be obtained by an explicit algebraic formula, and must therefore be computed by approximate numerical methods.

In theory, the coefficients of the characteristic polynomial can be computed exactly, since they are sums of products of matrix elements; and there are algorithms that can find all the roots of a polynomial of arbitrary degree to any required accuracy.[26] However, this approach is not viable in practice because the coefficients would be contaminated by unavoidable round-off errors, and the roots of a polynomial can be an extremely sensitive function of the coefficients (as exemplified by Wilkinson's polynomial).[26]

Efficient, accurate methods to compute eigenvalues and eigenvectors of arbitrary matrices were not known until the advent of the QR algorithm in 1961. [26] Combining the Householder transformation with the LU decomposition results in an algorithm with better convergence than the QR algorithm.[citation needed] For large Hermitian sparse matrices, the Lanczos algorithm is one example of an efficient iterative method to compute eigenvalues and eigenvectors, among several other possibilities.[26]

Eigenvectors

Once the (exact) value of an eigenvalue is known, the corresponding eigenvectors can be found by finding non-zero solutions of the eigenvalue equation, that becomes a system of linear equations with known coefficients. For example, once it is known that 6 is an eigenvalue of the matrix

$A = \begin{bmatrix} 4 & 1\\6 & 3 \end{bmatrix}$

we can find its eigenvectors by solving the equation $A v = 6 v$, that is

$\begin{bmatrix} 4 & 1\\6 & 3 \end{bmatrix}\begin{bmatrix}x\\y\end{bmatrix} = 6 \cdot \begin{bmatrix}x\\y\end{bmatrix}$

This matrix equation is equivalent to two linear equations

$\left\{\begin{matrix} 4x + {\ }y &{}= 6x\\6x + 3y &{}=6 y\end{matrix}\right. \quad\quad\quad$ that is $\left\{\begin{matrix} -2x+ {\ }y &{}=0\\+6x-3y &{}=0\end{matrix}\right.$

Both equations reduce to the single linear equation $y=2x$. Therefore, any vector of the form $[a,2a]'$, for any non-zero real number $a$, is an eigenvector of $A$ with eigenvalue $\lambda = 6$.

The matrix $A$ above has another eigenvalue $\lambda=1$. A similar calculation shows that the corresponding eigenvectors are the non-zero solutions of $3x+y=0$, that is, any vector of the form $[b,-3b]'$, for any non-zero real number $b$.

Some numeric methods that compute the eigenvalues of a matrix also determine a set of corresponding eigenvectors as a by-product of the computation.

Generalizations to infinite-dimensional spaces

For more details on this topic, see Spectral theorem.

The definition of eigenvalue of a linear transformation $T$ remains valid even if the underlying space $V$ is an infinite dimensional Hilbert or Banach space. Namely, a scalar $\lambda$ is an eigenvalue if and only if there is some nonzero vector $v$ such that $T(v) = \lambda v$.

Eigenfunctions

A widely used class of linear operators acting on infinite dimensional spaces are the differential operators on function spaces. Let $D$ be a linear differential operator in on the space $\mathbf{C^\infty}$ of infinitely differentiable real functions of a real argument $t$. The eigenvalue equation for $D$ is the differential equation

$D f = \lambda f$

The functions that satisfy this equation are commonly called eigenfunctions of $D$. For the derivative operator $d/dt$, an eigenfunction is a function that, when differentiated, yields a constant times the original function. The solution is an exponential function

$f(t) = Ae^{\lambda t} ,$

including when $\lambda$ is zero when it becomes a constant function. Eigenfunctions are an essential tool in the solution of differential equations and many other applied and theoretical fields. For instance, the exponential functions are eigenfunctions of the shift operators. This is the basis of Fourier transform methods for solving problems.

Spectral theory

If $\lambda$ is an eigenvalue of $T$, then the operator $T-\lambda I$ is not one-to-one, and therefore its inverse $(T-\lambda I)^{-1}$ does not exist. The converse is true for finite-dimensional vector spaces, but not for infinite-dimensional ones. In general, the operator $T - \lambda I$ may not have an inverse, even if $\lambda$ is not an eigenvalue.

For this reason, in functional analysis one defines the spectrum of a linear operator $T$ as the set of all scalars $\lambda$ for which the operator $T-\lambda I$ has no bounded inverse. Thus the spectrum of an operator always contains all its eigenvalues, but is not limited to them.

Associative algebras and representation theory

More algebraically, rather than generalizing the vector space to an infinite dimensional space, one can generalize the algebraic object that is acting on the space, replacing a single operator acting on a vector space with an algebra representation – an associative algebra acting on a module. The study of such actions is the field of representation theory.

A closer analog of eigenvalues is given by the representation-theoretical concept of weight, with the analogs of eigenvectors and eigenspaces being weight vectors and weight spaces.

Applications

Eigenvalues of geometric transformations

The following table presents some example transformations in the plane along with their 2×2 matrices, eigenvalues, and eigenvectors.

 scaling unequal scaling rotation horizontal shear hyperbolic rotation illustration matrix $\begin{bmatrix}k & 0\\0 & k\end{bmatrix}$ $\begin{bmatrix}k_1 & 0\\0 & k_2\end{bmatrix}$ $\begin{bmatrix}c & -s \\ s & c\end{bmatrix}$ $c=\cos\theta$ $s=\sin\theta$ $\begin{bmatrix}1 & k\\ 0 & 1\end{bmatrix}$ $\begin{bmatrix} c & s \\ s & c \end{bmatrix}$ $c=\cosh \varphi$ $s=\sinh \varphi$ characteristic polynomial $\ (\lambda - k)^2$ $(\lambda - k_1)(\lambda - k_2)$ $\lambda^2 - 2c\lambda + 1$ $\ (\lambda - 1)^2$ $\lambda^2 - 2c\lambda + 1$ eigenvalues $\lambda_i$ $\lambda_1 = \lambda_2 = k$ $\lambda_1 = k_1$ $\lambda_2 = k_2$ $\lambda_1 = e^{\mathbf{i}\theta}=c+s\mathbf{i}$ $\lambda_2 = e^{-\mathbf{i}\theta}=c-s\mathbf{i}$ $\lambda_1 = \lambda_2 = 1$ $\lambda_1 = e^\varphi$ $\lambda_2 = e^{-\varphi}$, algebraic multipl. $\mu_i=\mu(\lambda_i)$ $\mu_1 = 2$ $\mu_1 = 1$ $\mu_2 = 1$ $\mu_1 = 1$ $\mu_2 = 1$ $\mu_1 = 2$ $\mu_1 = 1$ $\mu_2 = 1$ geometric multipl. $\gamma_i = \gamma(\lambda_i)$ $\gamma_1 = 2$ $\gamma_1 = 1$ $\gamma_2 = 1$ $\gamma_1 = 1$ $\gamma_2 = 1$ $\gamma_1 = 1$ $\gamma_1 = 1$ $\gamma_2 = 1$ eigenvectors All non-zero vectors $u_1 = \begin{bmatrix}1\\0\end{bmatrix}$ $u_2 = \begin{bmatrix}0\\1\end{bmatrix}$ $u_1 = \begin{bmatrix}{\ }1\\-\mathbf{i}\end{bmatrix}$ $u_2 = \begin{bmatrix}{\ }1\\ +\mathbf{i}\end{bmatrix}$ $u_1 = \begin{bmatrix}1\\0\end{bmatrix}$ $u_1 = \begin{bmatrix}{\ }1\\{\ }1\end{bmatrix}$ $u_2 = \begin{bmatrix}{\ }1\\-1\end{bmatrix}.$

Note that the characteristic equation for a rotation is a quadratic equation with discriminant $D = -4(\sin\theta)^2$, which is a negative number whenever $\theta$ is not an integer multiple of 180°. Therefore, except for these special cases, the two eigenvalues are complex numbers, $\cos\theta \pm \mathbf{i}\sin\theta$; and all eigenvectors have non-real entries. Indeed, except for those special cases, a rotation changes the direction of every nonzero vector in the plane.

Schrödinger equation

The wavefunctions associated with the bound states of an electron in a hydrogen atom can be seen as the eigenvectors of the hydrogen atom Hamiltonian as well as of the angular momentum operator. They are associated with eigenvalues interpreted as their energies (increasing downward: $n=1,2,3,\ldots$) and angular momentum (increasing across: s, p, d, ...). The illustration shows the square of the absolute value of the wavefunctions. Brighter areas correspond to higher probability density for a position measurement. The center of each figure is the atomic nucleus, a proton.

An example of an eigenvalue equation where the transformation $T$ is represented in terms of a differential operator is the time-independent Schrödinger equation in quantum mechanics:

$H\psi_E = E\psi_E \,$

where $H$, the Hamiltonian, is a second-order differential operator and $\psi_E$, the wavefunction, is one of its eigenfunctions corresponding to the eigenvalue $E$, interpreted as its energy.

However, in the case where one is interested only in the bound state solutions of the Schrödinger equation, one looks for $\psi_E$ within the space of square integrable functions. Since this space is a Hilbert space with a well-defined scalar product, one can introduce a basis set in which $\psi_E$ and $H$ can be represented as a one-dimensional array and a matrix respectively. This allows one to represent the Schrödinger equation in a matrix form.

The bra–ket notation is often used in this context. A vector, which represents a state of the system, in the Hilbert space of square integrable functions is represented by $|\Psi_E\rangle$. In this notation, the Schrödinger equation is:

$H|\Psi_E\rangle = E|\Psi_E\rangle$

where $|\Psi_E\rangle$ is an eigenstate of $H$ and $E$ represents the eigenvalue. It is an observable self adjoint operator, the infinite dimensional analog of Hermitian matrices. As in the matrix case, in the equation above $H|\Psi_E\rangle$ is understood to be the vector obtained by application of the transformation $H$ to $|\Psi_E\rangle$.

Molecular orbitals

In quantum mechanics, and in particular in atomic and molecular physics, within the Hartree–Fock theory, the atomic and molecular orbitals can be defined by the eigenvectors of the Fock operator. The corresponding eigenvalues are interpreted as ionization potentials via Koopmans' theorem. In this case, the term eigenvector is used in a somewhat more general meaning, since the Fock operator is explicitly dependent on the orbitals and their eigenvalues. If one wants to underline this aspect one speaks of nonlinear eigenvalue problem. Such equations are usually solved by an iteration procedure, called in this case self-consistent field method. In quantum chemistry, one often represents the Hartree–Fock equation in a non-orthogonal basis set. This particular representation is a generalized eigenvalue problem called Roothaan equations.

Geology and glaciology

In geology, especially in the study of glacial till, eigenvectors and eigenvalues are used as a method by which a mass of information of a clast fabric's constituents' orientation and dip can be summarized in a 3-D space by six numbers. In the field, a geologist may collect such data for hundreds or thousands of clasts in a soil sample, which can only be compared graphically such as in a Tri-Plot (Sneed and Folk) diagram,[27][28] or as a Stereonet on a Wulff Net.[29]

The output for the orientation tensor is in the three orthogonal (perpendicular) axes of space. The three eigenvectors are ordered $v_1, v_2, v_3$ by their eigenvalues $E_1 \geq E_2 \geq E_3$;[30] $v_1$ then is the primary orientation/dip of clast, $v_2$ is the secondary and $v_3$ is the tertiary, in terms of strength. The clast orientation is defined as the direction of the eigenvector, on a compass rose of 360°. Dip is measured as the eigenvalue, the modulus of the tensor: this is valued from 0° (no dip) to 90° (vertical). The relative values of $E_1$, $E_2$, and $E_3$ are dictated by the nature of the sediment's fabric. If $E_1 = E_2 = E_3$, the fabric is said to be isotropic. If $E_1 = E_2 > E_3$, the fabric is said to be planar. If $E_1 > E_2 > E_3$, the fabric is said to be linear.[31]

Principal components analysis

PCA of the multivariate Gaussian distribution centered at $(1,3)$ with a standard deviation of 3 in roughly the $(0.878,0.478)$ direction and of 1 in the orthogonal direction. The vectors shown are unit eigenvectors of the (symmetric, positive-semidefinite) covariance matrix scaled by the square root of the corresponding eigenvalue. (Just as in the one-dimensional case, the square root is taken because the standard deviation is more readily visualized than the variance.

The eigendecomposition of a symmetric positive semidefinite (PSD) matrix yields an orthogonal basis of eigenvectors, each of which has a nonnegative eigenvalue. The orthogonal decomposition of a PSD matrix is used in multivariate analysis, where the sample covariance matrices are PSD. This orthogonal decomposition is called principal components analysis (PCA) in statistics. PCA studies linear relations among variables. PCA is performed on the covariance matrix or the correlation matrix (in which each variable is scaled to have its sample variance equal to one). For the covariance or correlation matrix, the eigenvectors correspond to principal components and the eigenvalues to the variance explained by the principal components. Principal component analysis of the correlation matrix provides an orthonormal eigen-basis for the space of the observed data: In this basis, the largest eigenvalues correspond to the principal components that are associated with most of the covariability among a number of observed data.

Principal component analysis is used to study large data sets, such as those encountered in data mining, chemical research, psychology, and in marketing. PCA is popular especially in psychology, in the field of psychometrics. In Q methodology, the eigenvalues of the correlation matrix determine the Q-methodologist's judgment of practical significance (which differs from the statistical significance of hypothesis testing; cf. criteria for determining the number of factors). More generally, principal component analysis can be used as a method of factor analysis in structural equation modeling.

Vibration analysis

1st lateral bending
Main article: Vibration

Eigenvalue problems occur naturally in the vibration analysis of mechanical structures with many degrees of freedom. The eigenvalues are the natural frequencies (or eigenfrequencies) of vibration, and the eigenvectors are the shapes of these vibrational modes. In particular, undamped vibration is governed by

$m\ddot x + kx = 0$

or

$m\ddot x = -k x$

that is, acceleration is proportional to position (i.e., we expect $x$ to be sinusoidal in time).

In $n$ dimensions, $m$ becomes a mass matrix and $k$ a stiffness matrix. Admissible solutions are then a linear combination of solutions to the generalized eigenvalue problem

$-kx= \omega^2 mx$

where $\omega^2$ is the eigenvalue and $\omega$ is the angular frequency. Note that the principal vibration modes are different from the principal compliance modes, which are the eigenvectors of $k$ alone. Furthermore, damped vibration, governed by

$m\ddot x + c \dot x + kx = 0$

$(\omega^2 m + \omega c + k)x = 0.$

This can be reduced to a generalized eigenvalue problem by clever use of algebra at the cost of solving a larger system.

The orthogonality properties of the eigenvectors allows decoupling of the differential equations so that the system can be represented as linear summation of the eigenvectors. The eigenvalue problem of complex structures is often solved using finite element analysis, but neatly generalize the solution to scalar-valued vibration problems.

Eigenfaces

Eigenfaces as examples of eigenvectors
Main article: Eigenface

In image processing, processed images of faces can be seen as vectors whose components are the brightnesses of each pixel.[32] The dimension of this vector space is the number of pixels. The eigenvectors of the covariance matrix associated with a large set of normalized pictures of faces are called eigenfaces; this is an example of principal components analysis. They are very useful for expressing any face image as a linear combination of some of them. In the facial recognition branch of biometrics, eigenfaces provide a means of applying data compression to faces for identification purposes. Research related to eigen vision systems determining hand gestures has also been made.

Similar to this concept, eigenvoices represent the general direction of variability in human pronunciations of a particular utterance, such as a word in a language. Based on a linear combination of such eigenvoices, a new voice pronunciation of the word can be constructed. These concepts have been found useful in automatic speech recognition systems for speaker adaptation.

Tensor of moment of inertia

In mechanics, the eigenvectors of the moment of inertia tensor define the principal axes of a rigid body. The tensor of moment of inertia is a key quantity required to determine the rotation of a rigid body around its center of mass.

Stress tensor

In solid mechanics, the stress tensor is symmetric and so can be decomposed into a diagonal tensor with the eigenvalues on the diagonal and eigenvectors as a basis. Because it is diagonal, in this orientation, the stress tensor has no shear components; the components it does have are the principal components.

Graphs

In spectral graph theory, an eigenvalue of a graph is defined as an eigenvalue of the graph's adjacency matrix $A$, or (increasingly) of the graph's Laplacian matrix due to its Discrete Laplace operator, which is either $T - A$ (sometimes called the combinatorial Laplacian) or $I - T^{-1/2}A T^{-1/2}$ (sometimes called the normalized Laplacian), where $T$ is a diagonal matrix with $T_{i i}$ equal to the degree of vertex $v_i$, and in $T^{-1/2}$, the $i$th diagonal entry is $1/\sqrt{\operatorname{deg}(v_i)}$. The $k$th principal eigenvector of a graph is defined as either the eigenvector corresponding to the $k$th largest or $k$th smallest eigenvalue of the Laplacian. The first principal eigenvector of the graph is also referred to merely as the principal eigenvector.

The principal eigenvector is used to measure the centrality of its vertices. An example is Google's PageRank algorithm. The principal eigenvector of a modified adjacency matrix of the World Wide Web graph gives the page ranks as its components. This vector corresponds to the stationary distribution of the Markov chain represented by the row-normalized adjacency matrix; however, the adjacency matrix must first be modified to ensure a stationary distribution exists. The second smallest eigenvector can be used to partition the graph into clusters, via spectral clustering. Other methods are also available for clustering.

Basic reproduction number

The basic reproduction number ($R_0$) is a fundamental number in the study of how infectious diseases spread. If one infectious person is put into a population of completely susceptible people, then $R_0$ is the average number of people that one typical infectious person will infect. The generation time of an infection is the time, $t_G$, from one person becoming infected to the next person becoming infected. In a heterogeneous population, the next generation matrix defines how many people in the population will become infected after time $t_G$ has passed. $R_0$ is then the largest eigenvalue of the next generation matrix.[33][34]

Notes

1. ^ William H. Press, Saul A. Teukolsky, William T. Vetterling, Brian P. Flannery (2007), Numerical Recipes: The Art of Scientific Computing, Chapter 11: Eigensystems., pages=563–597. Third edition, Cambridge University Press. ISBN 9780521880688
2. ^ Wolfram Research, Inc. (2010) Eigenvector. Accessed on 2010-01-29.
3. ^ Nering (1970, p. 107)
4. ^ Note:
• In 1751, Leonard Euler proved that any body has a principal axis of rotation: Leonard Euler (presented: October 1751 ; published: 1760) "Du mouvement d'un corps solide quelconque lorsqu'il tourne autour d'un axe mobile" (On the movement of any solid body while it rotates around a moving axis), Histoire de l'Académie royale des sciences et des belles lettres de Berlin, pp.176-227. On p. 212, Euler proves that any body contains a principal axis of rotation: "Théorem. 44. De quelque figure que soit le corps, on y peut toujours assigner un tel axe, qui passe par son centre de gravité, autour duquel le corps peut tourner librement & d'un mouvement uniforme." (Theorem. 44. Whatever be the shape of the body, one can always assign to it such an axis, which passes through its center of gravity, around which it can rotate freely and with a uniform motion.)
• In 1755, Johann Andreas Segner proved that any body has three principal axes of rotation: Johann Andreas Segner, Specimen theoriae turbinum [Essay on the theory of tops (i.e., rotating bodies)] ( Halle ("Halae"), (Germany) : Gebauer, 1755). On p. XXVIIII (i.e., 29), Segner derives a third-degree equation in t, which proves that a body has three principal axes of rotation. He then states (on the same page): "Non autem repugnat tres esse eiusmodi positiones plani HM, quia in aequatione cubica radices tres esse possunt, et tres tangentis t valores." (However, it is not inconsistent [that there] be three such positions of the plane HM, because in cubic equations, [there] can be three roots, and three values of the tangent t.)
• The relevant passage of Segner's work was discussed briefly by Arthur Cayley. See: A. Cayley (1862) "Report on the progress of the solution of certain special problems of dynamics," Report of the Thirty-second meeting of the British Association for the Advancement of Science; held at Cambridge in October 1862, 32 : 184-252 ; see especially pages 225-226.
5. ^ See Hawkins 1975, §2
6. ^ a b c d See Hawkins 1975, §3
7. ^ a b c See Kline 1972, pp. 807–808
8. ^ See Kline 1972, p. 673
9. ^ See Kline 1972, pp. 715–716
10. ^ See Kline 1972, pp. 706–707
11. ^ See Kline 1972, p. 1063
12. ^ See:
• David Hilbert (1904) "Grundzüge einer allgemeinen Theorie der linearen Integralgleichungen. (Erste Mitteilung)" (Fundamentals of a general theory of linear integral equations. (First report)), Nachrichten von der Gesellschaft der Wissenschaften zu Göttingen, Mathematisch-Physikalische Klasse (News of the Philosophical Society at Göttingen, mathematical-physical section), pp. 49-91. From page 51: "Insbesondere in dieser ersten Mitteilung gelange ich zu Formeln, die die Entwickelung einer willkürlichen Funktion nach gewissen ausgezeichneten Funktionen, die ich Eigenfunktionen nenne, liefern: … (In particular, in this first report I arrive at formulas that provide the [series] development of an arbitrary function in terms of some distinctive functions, which I call eigenfunctions: … ) Later on the same page: "Dieser Erfolg ist wesentlich durch den Umstand bedingt, daß ich nicht, wie es bisher geschah, in erster Linie auf den Beweis für die Existenz der Eigenwerte ausgehe, … " (This success is mainly attributable to the fact that I do not, as it has happened until now, first of all aim at a proof of the existence of eigenvalues, … )
• For the origin and evolution of the terms eigenvalue, characteristic value, etc., see: Earliest Known Uses of Some of the Words of Mathematics (E)
13. ^ See Aldrich 2006
14. ^ Francis, J. G. F. (1961), "The QR Transformation, I (part 1)", The Computer Journal 4 (3): 265–271, doi:10.1093/comjnl/4.3.265 and Francis, J. G. F. (1962), "The QR Transformation, II (part 2)", The Computer Journal 4 (4): 332–345, doi:10.1093/comjnl/4.4.332
15. ^ Kublanovskaya, Vera N. (1961), "On some algorithms for the solution of the complete eigenvalue problem", USSR Computational Mathematics and Mathematical Physics 3: 637–657. Also published in: "О некоторых алгорифмах для решения полной проблемы собственных значений" [On certain algorithms for the solution of the complete eigenvalue problem], Журнал вычислительной математики и математической физики (Journal of Computational Mathematics and Mathematical Physics) 1 (4), 1961: 555–570
16. ^ See Golub & van Loan 1996, §7.3; Meyer 2000, §7.3
17. ^ a b c Nering (1970, p. 107)
18. ^ a b c Golub & Van Loan (1996, p. 316)
19. ^ See Korn & Korn 2000, Section 14.3.5a; Friedberg, Insel & Spence 1989, p. 217
20. ^ Axler, Sheldon, "Ch. 5", Linear Algebra Done Right (2nd ed.), p. 77
21. ^ Shilov 1977, p. 109
22. ^ Lemma for the eigenspace
23. ^ Nering (1970, p. 107)
24. ^
25. ^ For a proof of this lemma, see Roman 2008, Theorem 8.2 on p. 186; Shilov 1977, p. 109; Hefferon 2001, p. 364; Beezer 2006, Theorem EDELI on p. 469; and Lemma for linear independence of eigenvectors
26. ^ a b c d Trefethen, Lloyd N.; Bau, David (1997), Numerical Linear Algebra, SIAM
27. ^ Graham, D.; Midgley, N. (2000), "Graphical representation of particle shape using triangular diagrams: an Excel spreadsheet method", Earth Surface Processes and Landforms 25 (13): 1473–1477, doi:10.1002/1096-9837(200012)25:13<1473::AID-ESP158>3.0.CO;2-C
28. ^ Sneed, E. D.; Folk, R. L. (1958), "Pebbles in the lower Colorado River, Texas, a study of particle morphogenesis", Journal of Geology 66 (2): 114–150, Bibcode:1958JG.....66..114S, doi:10.1086/626490
29. ^ Knox-Robinson, C.; Gardoll, Stephen J. (1998), "GIS-stereoplot: an interactive stereonet plotting module for ArcView 3.0 geographic information system", Computers & Geosciences 24 (3): 243, Bibcode:1998CG.....24..243K, doi:10.1016/S0098-3004(97)00122-2
30. ^ Stereo32 software
31. ^ Benn, D.; Evans, D. (2004), A Practical Guide to the study of Glacial Sediments, London: Arnold, pp. 103–107
32. ^ Xirouhakis, A.; Votsis, G.; Delopoulus, A. (2004), Estimation of 3D motion and structure of human faces (PDF), National Technical University of Athens
33. ^ Diekmann O, Heesterbeek JAP, Metz JAJ (1990), "On the definition and the computation of the basic reproduction ratio R0 in models for infectious diseases in heterogeneous populations", Journal of Mathematical Biology 28 (4): 365–382, doi:10.1007/BF00178324, PMID 2117040
34. ^ Odo Diekmann and J. A. P. Heesterbeek (2000), Mathematical epidemiology of infectious diseases, Wiley series in mathematical and computational biology, West Sussex, England: John Wiley & Sons

References

• Korn, Granino A.; Korn, Theresa M. (2000), "Mathematical Handbook for Scientists and Engineers: Definitions, Theorems, and Formulas for Reference and Review", New York: McGraw-Hill (2nd Revised ed.) (Dover Publications), Bibcode:1968mhse.book.....K, ISBN 0-486-41147-8.
• Lipschutz, Seymour (1991), Schaum's outline of theory and problems of linear algebra, Schaum's outline series (2nd ed.), New York: McGraw-Hill Companies, ISBN 0-07-038007-4.
• Friedberg, Stephen H.; Insel, Arnold J.; Spence, Lawrence E. (1989), Linear algebra (2nd ed.), Englewood Cliffs, New Jersey 07632: Prentice Hall, ISBN 0-13-537102-3.
• Aldrich, John (2006), "Eigenvalue, eigenfunction, eigenvector, and related terms", in Jeff Miller (Editor), Earliest Known Uses of Some of the Words of Mathematics, retrieved 2006-08-22
• Strang, Gilbert (1993), Introduction to linear algebra, Wellesley-Cambridge Press, Wellesley, Massachusetts, ISBN 0-9614088-5-5.
• Strang, Gilbert (2006), Linear algebra and its applications, Thomson, Brooks/Cole, Belmont, California, ISBN 0-03-010567-6.
• Bowen, Ray M.; Wang, Chao-Cheng (1980), Linear and multilinear algebra, Plenum Press, New York, ISBN 0-306-37508-7.
• Cohen-Tannoudji, Claude (1977), "Chapter II. The mathematical tools of quantum mechanics", Quantum mechanics, John Wiley & Sons, ISBN 0-471-16432-1.
• Fraleigh, John B.; Beauregard, Raymond A. (1995), Linear algebra (3rd ed.), Addison-Wesley Publishing Company, ISBN 0-201-83999-7.
• Golub, Gene H.; Van Loan, Charles F. (1996), Matrix computations (3rd ed.), Johns Hopkins University Press, Baltimore, Maryland, ISBN 978-0-8018-5414-9.
• Hawkins, T. (1975), "Cauchy and the spectral theory of matrices", Historia Mathematica 2: 1–29, doi:10.1016/0315-0860(75)90032-4.
• Horn, Roger A.; Johnson, Charles F. (1985), Matrix analysis, Cambridge University Press, ISBN 0-521-30586-1.
• Kline, Morris (1972), Mathematical thought from ancient to modern times, Oxford University Press, ISBN 0-19-501496-0.
• Meyer, Carl D. (2000), Matrix analysis and applied linear algebra, Society for Industrial and Applied Mathematics (SIAM), Philadelphia, ISBN 978-0-89871-454-8.
• Nering, Evar D. (1970), Linear Algebra and Matrix Theory (2nd ed.), New York: Wiley, LCCN 76091646
• Brown, Maureen (October 2004), Illuminating Patterns of Perception: An Overview of Q Methodology.
• Golub, Gene F.; van der Vorst, Henk A. (2000), "Eigenvalue computation in the 20th century", Journal of Computational and Applied Mathematics 123: 35–65, doi:10.1016/S0377-0427(00)00413-1.
• Akivis, Max A.; Goldberg, Vladislav V. (1969), Tensor calculus, Russian, Science Publishers, Moscow.
• Gelfand, I. M. (1971), Lecture notes in linear algebra, Russian, Science Publishers, Moscow.
• Alexandrov, Pavel S. (1968), Lecture notes in analytical geometry, Russian, Science Publishers, Moscow.
• Carter, Tamara A.; Tapia, Richard A.; Papaconstantinou, Anne, Linear Algebra: An Introduction to Linear Algebra for Pre-Calculus Students, Rice University, Online Edition, retrieved 2008-02-19.
• Roman, Steven (2008), Advanced linear algebra (3rd ed.), New York: Springer Science + Business Media, LLC, ISBN 978-0-387-72828-5.
• Shilov, Georgi E. (1977), Linear algebra, Translated and edited by Richard A. Silverman, New York: Dover Publications, ISBN 0-486-63518-X.
• Hefferon, Jim (2001), Linear Algebra, Online book, St Michael's College, Colchester, Vermont, USA.
• Kuttler, Kenneth (2007), An introduction to linear algebra (PDF), Online e-book in PDF format, Brigham Young University.
• Demmel, James W. (1997), Applied numerical linear algebra, SIAM, ISBN 0-89871-389-7.
• Beezer, Robert A. (2006), A first course in linear algebra, Free online book under GNU licence, University of Puget Sound.
• Lancaster, P. (1973), Matrix theory, Russian, Moscow, Russia: Science Publishers.
• Halmos, Paul R. (1987), Finite-dimensional vector spaces (8th ed.), New York: Springer-Verlag, ISBN 0-387-90093-4.
• Pigolkina, T. S. and Shulman, V. S., Eigenvalue (Russian), In:Vinogradov, I. M. (Ed.), Mathematical Encyclopedia, Vol. 5, Soviet Encyclopedia, Moscow, 1977.
• Greub, Werner H. (1975), Linear Algebra (4th ed.), Springer-Verlag, New York, ISBN 0-387-90110-8.
• Larson, Ron; Edwards, Bruce H. (2003), Elementary linear algebra (5th ed.), Houghton Mifflin Company, ISBN 0-618-33567-6.
• Curtis, Charles W., Linear Algebra: An Introductory Approach, 347 p., Springer; 4th ed. 1984. Corr. 7th printing edition (August 19, 1999), ISBN 0-387-90992-3.
• Shores, Thomas S. (2007), Applied linear algebra and matrix analysis, Springer Science+Business Media, LLC, ISBN 0-387-33194-8.
• Sharipov, Ruslan A. (1996), Course of Linear Algebra and Multidimensional Geometry: the textbook, arXiv:math/0405323, ISBN 5-7477-0099-5.
• Gohberg, Israel; Lancaster, Peter; Rodman, Leiba (2005), Indefinite linear algebra and applications, Basel-Boston-Berlin: Birkhäuser Verlag, ISBN 3-7643-7349-0.