Eigenvalue perturbation
|
|
This article has multiple issues. Please help improve it or discuss these issues on the talk page.
|
In mathematics, eigenvalue perturbation is a perturbation approach to finding eigenvalues and eigenvectors of systems perturbed from one with known eigenvectors and eigenvalues. It also allows one to determine the sensitivity of the eigenvalues and eigenvectors with respect to changes in the system. The following derivations are essentially self-contained and can be found in many texts on numerical linear algebra[1] or numerical functional analysis.
Contents |
Example [edit]
Suppose we have solutions to the generalized eigenvalue problem,
That is, we know
and
for
. Now suppose we want to change the matrices by a small amount. That is, we want to let
and
where all of the
terms are much smaller than the corresponding term. We expect answers to be of the form
and
Steps [edit]
We assume that the matrices are symmetric and positive definite and assume we have scaled the eigenvectors such that
where
is the Kronecker delta.
Now we want to solve the equation
Substituting, we get
which expands to
Canceling from (1) leaves
Removing the higher-order terms, this simplifies to
When the matrix is symmetric, the unperturbed eigenvectors are orthogonal and so we use them as a basis for the perturbed eigenvectors. That is, we want to construct
where the
are small constants that are to be determined. Substituting (4) into (3) and rearranging gives
Or:
By equation (1):
Because the eigenvectors are orthogonal, we can remove the summations by left multiplying by
:
By use of equation (1) again:
The two terms containing
are equal because left-multiplying (1) by
gives
Canceling those terms in (6) leaves
Rearranging gives
But by (2), this denominator is equal to 1. Thus
■
Then, by left-multiplying equation (5) by
(for
):
Or by changing the name of the indices:
To find
, use
Summary [edit]
and
for infinitesimal
and
(the high order terms in (3) being negligible)
Results [edit]
This means it is possible to efficiently do a sensitivity analysis on
as a function of changes in the entries of the matrices. (Recall that the matrices are symmetric and so changing
will also change
, hence the
term.)
and
Similarly
and
See also [edit]
Bounds exist that do not rely on approximations in the Bauer-Fike theorem
References [edit]
- ^ Trefethen, Lloyd N. (1997). Numerical Linear Algebra. SIAM (Philadelphia, PA). p. 258. ISBN 0-89871-361-7.
![[K_0] \mathbf{x}_{0i} = \lambda_{0i} [M_0] \mathbf{x}_{0i}. \qquad (1)](http://upload.wikimedia.org/math/4/c/c/4cc02fbae31d18f5288e7ce56139bab3.png)
![[K] = [K_0]+[\delta K] \,](http://upload.wikimedia.org/math/9/7/0/970e961f31d12ea7fb3708995eb01b6a.png)
![[M] = [M_0]+[\delta M] \,](http://upload.wikimedia.org/math/3/b/c/3bcd3ea5455fdacb6f69144b567d9bc1.png)


![\mathbf{x}_{0j}^\top[M_0]\mathbf{x}_{0i} = \delta_i^j \qquad(2)](http://upload.wikimedia.org/math/e/d/3/ed3497ecd38fcf8f87a7b1466b87fe02.png)
![[K]\mathbf{x}_i = \lambda_i [M] \mathbf{x}_i.](http://upload.wikimedia.org/math/a/d/6/ad6752f926dd003a55a874a6735760a3.png)
![([K_0]+[\delta K])(\mathbf{x}_{0i} + \delta \mathbf{x}_i) = (\lambda_{0i}+\delta\lambda_{i})([M_0]+[\delta M])(\mathbf{x}_{0i}+\delta\mathbf{x}_{0i}),](http://upload.wikimedia.org/math/7/d/6/7d6b18355e7b731b5e73dc3632a7b48d.png)
![\begin{align}
\left[K_0\right]\mathbf{x}_{0i} & + [\delta K]\mathbf{x}_{0i} + [K_0]\delta \mathbf{x}_i + [\delta K]\delta \mathbf{x}_i \\[6pt]
& = \lambda_{0i}[M_0]\mathbf{x}_{0i}+
\lambda_{0i}[M_0]\delta\mathbf{x}_i +
\lambda_{0i}[\delta M]\mathbf{x}_{0i} +
\delta\lambda_i[M_0]\mathbf{x}_{0i} \\[6pt]
& {} + \lambda_{0i}[\delta M]\delta\mathbf{x}_i +
\delta\lambda_i[\delta M]\mathbf{x}_{0i} +
\delta\lambda_i[M_0]\delta\mathbf{x}_i +
\delta\lambda_i[\delta M]\delta\mathbf{x}_i.
\end{align}](http://upload.wikimedia.org/math/b/0/7/b0721edc84082bde5befd7baa155d533.png)
![\begin{align}
\left[\delta K\right]\mathbf{x}_{0i} & + [K_0]\delta \mathbf{x}_i + [\delta K]\delta \mathbf{x}_i \\[6pt]
& = \lambda_{0i}[M_0]\delta\mathbf{x}_i +
\lambda_{0i}[\delta M]\mathbf{x}_{0i} +
\delta\lambda_i[M_0]\mathbf{x}_{0i} \\[6pt]
& {} + \lambda_{0i}[\delta M]\delta\mathbf{x}_i +
\delta\lambda_i[\delta M]\mathbf{x}_{0i} +
\delta\lambda_i[M_0]\delta\mathbf{x}_i +
\delta\lambda_i[\delta M]\delta\mathbf{x}_i.
\end{align}](http://upload.wikimedia.org/math/2/f/2/2f2d7da728cd02996144ef7c24d71f97.png)
![[K_0] \delta\mathbf{x}_i+[\delta K] \mathbf{x}_{0i} = \lambda_{0i}[M_0] \delta \mathbf{x}_i + \lambda_{0i}[\delta M]\mathrm{x}_{0i} + \delta \lambda_i [M_0]\mathbf{x}_{0i}. \qquad(3)](http://upload.wikimedia.org/math/1/7/a/17a7e20f293fddd6a69e135b866f96c3.png)

![[K_0]\sum_{j=1}^N \epsilon_{ij} \mathbf{x}_{0j} + [\delta K]\mathbf{x}_{0i} = \lambda_{0i} [M_0] \sum_{j=1}^N \epsilon_{ij} \mathbf{x}_{0j} + \lambda_{0i} [\delta M] \mathbf{x}_{0i} + \delta\lambda_i [M_0] \mathbf{x}_{0i}. \qquad (5)](http://upload.wikimedia.org/math/a/7/8/a785ccbfe4b7fa4822ad4408b9f493b9.png)
![\sum_{j=1}^N \epsilon_{ij} [K_0] \mathbf{x}_{0j} + [\delta K]\mathbf{x}_{0i} = \lambda_{0i} [M_0] \sum_{j=1}^N \epsilon_{ij} \mathbf{x}_{0j} + \lambda_{0i} [\delta M] \mathbf{x}_{0i} + \delta\lambda_i [M_0] \mathbf{x}_{0i}.](http://upload.wikimedia.org/math/6/e/c/6ec6fd4ef36fa5c0f1d99d4971eee5e1.png)
![\sum_{j=1}^N \epsilon_{ij} \lambda_{0j} [M_0] \mathbf{x}_{0j} + [\delta K]\mathbf{x}_{0i} = \lambda_{0i} [M_0] \sum_{j=1}^N \epsilon_{ij} \mathbf{x}_{0j} + \lambda_{0i} [\delta M] \mathbf{x}_{0i} + \delta\lambda_i [M_0] \mathbf{x}_{0i}.](http://upload.wikimedia.org/math/6/d/9/6d95e623ae726d769756440599691e60.png)
![\mathbf{x}_{0i}^\top \epsilon_{ii} \lambda_{0i} [M_0] \mathbf{x}_{0i} + \mathbf{x}_{0i}^\top[\delta K]\mathbf{x}_{0i} = \lambda_{0i} \mathbf{x}_{0i}^\top[M_0] \epsilon_{ii} \mathbf{x}_{0i} + \lambda_{0i}\mathbf{x}_{0i}^\top [\delta M] \mathbf{x}_{0i} + \delta\lambda_i\mathbf{x}_{0i}^\top [M_0] \mathbf{x}_{0i}.](http://upload.wikimedia.org/math/e/1/0/e10feb171e11311422a54d3db85bee67.png)
![\mathbf{x}_{0i}^\top[K_0] \epsilon_{ii} \mathbf{x}_{0i} + \mathbf{x}_{0i}^\top[\delta K]\mathbf{x}_{0i} = \lambda_{0i} \mathbf{x}_{0i}^\top[M_0] \epsilon_{ii} \mathbf{x}_{0i} + \lambda_{0i}\mathbf{x}_{0i}^\top [\delta M] \mathbf{x}_{0i} + \delta\lambda_i\mathbf{x}_{0i}^\top [M_0] \mathbf{x}_{0i}. ~~(6)](http://upload.wikimedia.org/math/b/0/3/b03c68a292546272f0e39ef1cf1b5c97.png)
![\mathbf{x}_{0i}^\top[K_0]\mathbf{x}_{0i} = \lambda_{0i}\mathbf{x}_{0i}^\top[M_0]\mathbf{x}_{0i}.](http://upload.wikimedia.org/math/a/b/c/abc389374f72d3ade9455eaba904ab91.png)
![\mathbf{x}_{0i}^\top[\delta K]\mathbf{x}_{0i} = \lambda_{0i} \mathbf{x}_{0i}^\top[\delta M] \mathbf{x}_{0i} + \delta\lambda_i \mathbf{x}_{0i}^\top [M_0] \mathbf{x}_{0i}.](http://upload.wikimedia.org/math/c/c/0/cc06f086b402590b07b397ea28002047.png)
![\delta\lambda_i = \frac{\mathbf{x}^\top_{0i}([\delta K] - \lambda_{0i}[\delta M] )\mathbf{x}_{0i}}{\mathbf{x}_{0i}^\top[M_0] \mathbf{x}_{0i}}](http://upload.wikimedia.org/math/e/3/c/e3ceea2979643babcc269d13dc945f2e.png)
■![\epsilon_{ik} = \frac{\mathbf{x}^\top_{0k}([\delta K] - \lambda_{0i}[\delta M])\mathbf{x}_{0i}}{\lambda_{0i}-\lambda_{0k}}, \qquad i\neq k.](http://upload.wikimedia.org/math/5/9/e/59eabc19ab92f9c41184e1c03292579a.png)
![\epsilon_{ij} = \frac{\mathbf{x}^\top_{0j}([\delta K] - \lambda_{0i}[\delta M])\mathbf{x}_{0i}}{\lambda_{0i}-\lambda_{0j}}, \qquad i\neq j.](http://upload.wikimedia.org/math/d/5/0/d50918780fff7dd9653920b2e216332f.png)
![\mathbf{x}^\top_i[M]\mathbf{x}_i = 1 \Rightarrow \epsilon_{ii}=-\frac{1}{2}\mathbf{x}^\top_{0i}[\delta M]\mathbf{x}_{0i}.](http://upload.wikimedia.org/math/8/f/5/8f573587fcc41e17f192acdae20af543.png)
![\lambda_i = \lambda_{0i} + \mathbf{x}^\top_{0i} ([\delta K] - \lambda_{0i}[\delta M]) \mathbf{x}_{0i}](http://upload.wikimedia.org/math/2/2/9/2292a53f9e45ef74249c42afce7ea7b8.png)
![\mathbf{x}_i = \mathbf{x}_{0i}(1 - \frac{1}{2} \mathbf{x}^\top_{0i}[\delta M] \mathbf{x}_{0i}) + \sum_{j=1\atop j\neq i}^N \frac{\mathbf{x}^\top_{0j}([\delta K] - \lambda_{0i}[\delta M])\mathbf{x}_{0i}}{\lambda_{0i}-\lambda_{0j}}\mathbf{x}_{0j}](http://upload.wikimedia.org/math/8/5/b/85b93ff0f55dff25ce56d18898570b77.png)
![\frac{\partial \lambda_i}{\partial K_{(k\ell)}} = \frac{\partial}{\partial K_{(k\ell)}}\left(\lambda_{0i} + \mathbf{x}^\top_{0i} ([\delta K] - \lambda_{0i}[\delta M]) \mathbf{x}_{0i}\right) = x_{0i(k)} x_{0i(\ell)} (2 - \delta_k^\ell)](http://upload.wikimedia.org/math/e/5/e/e5e55bd88a52ff5148f0192719e25702.png)
![\frac{\partial \lambda_i}{\partial M_{(k\ell)}} = \frac{\partial}{\partial M_{(k\ell)}}\left(\lambda_{0i} + \mathbf{x}^\top_{0i} ([\delta K] - \lambda_{0i}[\delta M]) \mathbf{x}_{0i}\right) =
\lambda_i x_{0i(k)} x_{0i(\ell)}(2-\delta_k^\ell).](http://upload.wikimedia.org/math/e/4/4/e44d36cc3be58cf72db39758afefa501.png)

