Spectral theory

From Wikipedia, the free encyclopedia

In mathematics, spectral theory is an inclusive term for theories extending the eigenvector and eigenvalue theory of a single square matrix to a much broader theory of the structure of operators in a variety of mathematical spaces.[1] It is a result of studies of linear algebra and the solutions of systems of linear equations and their generalizations.[2] The theory is connected to that of analytic functions because the spectral properties of an operator are related to analytic functions of the spectral parameter.[3]

Mathematical background

The name spectral theory was introduced by David Hilbert in his original formulation of Hilbert space theory, which was cast in terms of quadratic forms in infinitely many variables. The original spectral theorem was therefore conceived as a version of the theorem on principal axes of an ellipsoid, in an infinite-dimensional setting. The later discovery in quantum mechanics that spectral theory could explain features of atomic spectra was therefore fortuitous.

There have been three main ways to formulate spectral theory, all of which retain their usefulness. After Hilbert's initial formulation, the later development of abstract Hilbert space and the spectral theory of a single normal operator on it proceeded largely in parallel with the requirements of physics, particularly at the hands of von Neumann.[4] The further theory built on this to include Banach algebras, which can be given abstractly. This development leads to the Gelfand representation, which covers the commutative case, and further into non-commutative harmonic analysis.

The difference can be seen in making the connection with Fourier analysis. The Fourier transform on the real line is in one sense the spectral theory of differentiation qua differential operator. But for that to cover the phenomena one has already to deal with generalized eigenfunctions (for example, by means of a rigged Hilbert space). On the other hand it is simple to construct a group algebra, the spectrum of which captures the Fourier transform's basic properties, and this is carried out by means of Pontryagin duality.

One can also study the spectral properties of operators on Banach spaces. For example, compact operators on Banach spaces have many spectral properties similar to those of matrices.

Physical background

The background in the physics of vibrations has been explained in this way:[5]

Spectral theory is connected with the investigation of localized vibrations of a variety of different objects, from atoms and molecules in chemistry to obstacles in acoustic waveguides. These vibrations have frequencies, and the issue is to decide when such localized vibrations occur, and how to go about computing the frequencies. This is a very complicated problem since every object has not only a fundamental tone but also a complicated series of overtones, which vary radically from one body to another.

The mathematical theory is not dependent on such physical ideas on a technical level, but there are examples of mutual influence (see for example Mark Kac's question Can you hear the shape of a drum?). Hilbert's adoption of the term "spectrum" has been attributed by Jean Dieudonné to an 1897 paper of Wilhelm Wirtinger on the Hill differential equation, and the term was taken up by Hilbert's students during the first decade of the twentieth century, among them Erhard Schmidt and Hermann Weyl. The conceptual basis for Hilbert space was developed from Hilbert's ideas by Erhard Schmidt and Frigyes Riesz.[6][7] It was almost twenty years later, when quantum mechanics was formulated in terms of the Schrödinger equation, that the connection was made to atomic spectra; a connection with the mathematical physics of vibration had been suspected before, as remarked by Henri Poincaré, but rejected for simple quantitative reasons, absent an explanation of the Balmer series.[8] The later discovery in quantum mechanics that spectral theory could explain features of atomic spectra was therefore fortuitous, rather than being an object of Hilbert's spectral theory.

A definition of spectrum

Consider a bounded linear transformation T defined everywhere over a general Banach space. We form the transformation:

 R_{\zeta} = \left( \zeta I -  T \right)^{-1}.

Here I is the identity operator and ζ is a complex number. The inverse of an operator T, written T^{-1}, is defined by:

T T^{-1} = T^{-1} T = I.

If the inverse exists, T is called regular. If it does not exist, T is called singular.

With these definitions, the resolvent set of T is the set of all complex numbers ζ such that R_ζ exists and is bounded. This set is often denoted by ρ(T). The spectrum of T is the set of all complex numbers ζ such that R_ζ fails to exist or is unbounded. The spectrum of T is often denoted by σ(T). The function R_ζ for all ζ in ρ(T) (that is, wherever R_ζ exists as a bounded operator) is called the resolvent of T. The spectrum of T is therefore the complement of the resolvent set of T in the complex plane.[9] Every eigenvalue of T belongs to σ(T), but σ(T) may contain non-eigenvalues.[10]
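As a finite-dimensional illustration of these definitions, the following minimal sketch tests whether a complex number lies in the spectrum of a 2×2 matrix by attempting to form the resolvent. The matrix T, the helper name resolvent, and the tolerance are assumptions chosen for the example. In finite dimensions an inverse that exists is automatically bounded, so the spectrum here reduces to the set of eigenvalues, which is not true for general operators on Banach spaces.

```python
def resolvent(T, zeta, tol=1e-12):
    """Return (zeta*I - T)^{-1} for a 2x2 matrix T, or None when the
    inverse does not exist, i.e. zeta is in the spectrum of T.
    The tolerance is a numerical heuristic, not part of the definition."""
    (a, b), (c, d) = T
    # Entries of zeta*I - T
    p, q, r, s = zeta - a, -b, -c, zeta - d
    det = p * s - q * r
    if abs(det) < tol:
        return None
    # Closed-form inverse of a 2x2 matrix
    return [[s / det, -q / det], [-r / det, p / det]]


# Illustrative upper-triangular matrix: its eigenvalues are 2 and 5.
T = [[2.0, 3.0], [0.0, 5.0]]

for zeta in (2.0, 5.0, 1.0 + 1.0j):
    status = "spectrum" if resolvent(T, zeta) is None else "resolvent set"
    print(zeta, "lies in the", status)
```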

This definition applies to a Banach space, but other types of space exist as well; for example, topological vector spaces include Banach spaces but can be more general.[11][12] On the other hand, Banach spaces include Hilbert spaces, and it is these spaces that find the greatest application and the richest theoretical results.[13] With suitable restrictions, much can be said about the structure of the spectra of transformations in a Hilbert space. In particular, for self-adjoint operators, the spectrum lies on the real line and (in general) is a spectral combination of a point spectrum of discrete eigenvalues and a continuous spectrum.[14]

Spectral theory briefly

Main article: Spectral theorem

In functional analysis and linear algebra the spectral theorem establishes conditions under which an operator can be expressed in simple form as a sum of simpler operators. As a full rigorous presentation is not appropriate for this article, we take an approach that avoids much of the rigor and satisfaction of a formal treatment with the aim of being more comprehensible to a non-specialist.

This topic is easiest to describe by introducing the bra–ket notation of Dirac for operators.[15][16] As an example, a very particular linear operator L might be written as a dyadic product:[17][18]

 L = | k_1 \rangle \langle b_1 |,

in terms of the "bra"  \langle b_1 | and the "ket"  | k_1 \rangle . A function f is described by a ket as  | f \rangle . The function f(x) defined on the coordinates (x_1, x_2, x_3, \dots) is denoted as:

 f(x)=\langle x, f\rangle

and the magnitude of f by:

 \|f \|^2 = \langle f, f\rangle =\int \langle f, x\rangle \langle x, f \rangle \, dx = \int f^*(x) f(x) \, dx

where the notation '*' denotes a complex conjugate. This inner product choice defines a very specific inner product space, restricting the generality of the arguments that follow.[13]

The effect of L upon a function f is then described as:

 L | f\rangle = | k_1 \rangle \langle b_1 | f \rangle

expressing the result that the effect of L on f is to produce a new function  | k_1 \rangle multiplied by the inner product represented by \langle b_1 | f \rangle .
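The action of a dyadic product can be made concrete with finite vectors standing in for functions. This is only a sketch: the vectors k1, b1 and f and the helper names are made up for illustration, and the inner product conjugates its first argument, matching the convention used for the norm above.

```python
def inner(b, f):
    """Inner product <b|f>, conjugating the first argument."""
    return sum(bi.conjugate() * fi for bi, fi in zip(b, f))


def apply_dyad(k, b, f):
    """Apply L = |k><b| to |f>: the ket |k> scaled by the number <b|f>."""
    c = inner(b, f)
    return [c * ki for ki in k]


k1 = [1, 2]   # the ket |k_1>
b1 = [3, 1]   # the bra <b_1|
f = [1, 4]    # the function |f>

# <b_1|f> = 3*1 + 1*4 = 7, so L|f> = 7 |k_1>
print(apply_dyad(k1, b1, f))  # [7, 14]
```

Whatever f is, the output is always a multiple of |k_1⟩, which is why a single dyad is described as a very particular operator.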

A more general linear operator L might be expressed as:

 L = \lambda_1 | e_1\rangle\langle f_1| +  \lambda_2 | e_2\rangle \langle f_2| +   \lambda_3 | e_3\rangle\langle f_3| + \dots ,

where the  \{ \, \lambda_i \, \} are scalars and the  \{ \, | e_i \rangle \, \} are a basis and the  \{ \, \langle f_i | \, \} a reciprocal basis for the space. The relation between the basis and the reciprocal basis is described, in part, by:

 \langle f_i | e_j \rangle = \delta_{ij}

If such a formalism applies, the  \{ \, \lambda_i \, \} are eigenvalues of L and the functions  \{ \, | e_i \rangle \, \} are eigenfunctions of L. The eigenvalues are in the spectrum of L.[19]
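A small finite-dimensional sketch may help here. It assumes a two-dimensional real space with a non-orthogonal basis, a matching reciprocal basis, and two illustrative eigenvalues (all chosen for the example, not taken from the references). It assembles L from its dyads and checks that each basis vector is an eigenvector.

```python
def outer(e, f):
    """Matrix of the dyad |e><f| for real vectors."""
    return [[ei * fj for fj in f] for ei in e]


def matvec(A, v):
    """Matrix-vector product A v."""
    return [sum(a * x for a, x in zip(row, v)) for row in A]


basis = [[1, 0], [1, 1]]          # |e_1>, |e_2> (deliberately not orthogonal)
reciprocal = [[1, -1], [0, 1]]    # <f_1|, <f_2|
eigenvalues = [2, 5]              # lambda_1, lambda_2

# Biorthogonality: gram[i][j] = <f_i|e_j> should be the Kronecker delta.
gram = [[sum(fk * ek for fk, ek in zip(f, e)) for e in basis] for f in reciprocal]

# L = sum_i lambda_i |e_i><f_i|
L = [[0, 0], [0, 0]]
for lam, e, f in zip(eigenvalues, basis, reciprocal):
    D = outer(e, f)
    L = [[L[r][c] + lam * D[r][c] for c in range(2)] for r in range(2)]

print(gram)                  # [[1, 0], [0, 1]]
print(L)                     # [[2, 3], [0, 5]]
print(matvec(L, basis[0]))   # [2, 0]  = 2 * e_1
print(matvec(L, basis[1]))   # [5, 5]  = 5 * e_2
```

Because the basis is not orthogonal, the reciprocal basis differs from the basis itself; for an orthonormal basis the two coincide.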

Some natural questions are: under what circumstances does this formalism work, and for what operators L are expansions in series of other operators like this possible? Can any function f be expressed in terms of the eigenfunctions (are they a Schauder basis) and under what circumstances does a point spectrum or a continuous spectrum arise? How do the formalisms for infinite-dimensional spaces and finite-dimensional spaces differ, or do they differ? Can these ideas be extended to a broader class of spaces? Answering such questions is the realm of spectral theory and requires considerable background in functional analysis and matrix algebra.

Resolution of the identity

This section continues in the rough and ready manner of the above section using the bra–ket notation, and glossing over the many important details of a rigorous treatment.[20] A rigorous mathematical treatment may be found in various references.[21] In particular, the dimension n of the space will be finite.

Using the bra–ket notation of the above section, the identity operator may be written as:

I = \sum _{i=1} ^{n} | e_i \rangle \langle f_i |

where it is supposed as above that { |e_i\rangle } are a basis and the {  \langle f_i | } a reciprocal basis for the space satisfying the relation:

\langle f_i | e_j\rangle = \delta_{ij} .

This expression of the identity operation is called a representation or a resolution of the identity.[20][21] This formal representation satisfies the basic property of the identity:

 I^k = I\,

valid for every positive integer k.

Applying the resolution of the identity to any function in the space | \psi \rangle, one obtains:

I |\psi \rangle = |\psi \rangle = \sum_{i=1}^{n} | e_i \rangle \langle f_i | \psi \rangle =  \sum_{i=1}^{n} \ c_i | e_i \rangle

which is the generalized Fourier expansion of ψ in terms of the basis functions { e_i }.[22] Here c_i = \langle f_i | \psi \rangle.
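The expansion can be checked numerically in a small case. The sketch below assumes a two-dimensional real space with an illustrative non-orthogonal basis and its reciprocal basis; it computes the coefficients c_i = ⟨f_i|ψ⟩ and rebuilds ψ from them.

```python
basis = [[1, 0], [1, 1]]          # |e_1>, |e_2>
reciprocal = [[1, -1], [0, 1]]    # <f_1|, <f_2| with <f_i|e_j> = delta_ij
psi = [3, 2]                      # the vector to expand


def dot(u, v):
    """Real inner product of two vectors."""
    return sum(a * b for a, b in zip(u, v))


# Generalized Fourier coefficients c_i = <f_i|psi>
coefficients = [dot(f, psi) for f in reciprocal]

# Reconstruction: sum_i c_i |e_i>
reconstructed = [sum(c * e[k] for c, e in zip(coefficients, basis)) for k in range(2)]

print(coefficients)    # [1, 2]
print(reconstructed)   # [3, 2]
```

Note that the coefficients are taken against the reciprocal basis, not against the basis vectors themselves; the two choices agree only when the basis is orthonormal.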

Given some operator equation of the form:

O | \psi \rangle = | h \rangle

with h in the space, this equation can be solved in the above basis through the formal manipulations:

 O | \psi \rangle = \sum_{i=1}^{n} c_i \left( O | e_i \rangle \right)  =  \sum_{i=1}^{n} | e_i \rangle \langle f_i |  h \rangle ,
\langle f_j|O| \psi \rangle = \sum_{i=1}^{n}  c_i \langle f_j| O | e_i \rangle  =  \sum_{i=1}^{n} \langle f_j| e_i \rangle \langle f_i | h \rangle  = \langle f_j |  h \rangle, \quad \forall j

which converts the operator equation to a matrix equation determining the unknown coefficients c_j in terms of the generalized Fourier coefficients \langle f_j | h \rangle of h and the matrix elements O_{ji}= \langle f_j| O | e_i \rangle of the operator O.
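The reduction to a matrix equation can be sketched for n = 2. The operator O, the right-hand side h, and the basis below are illustrative assumptions; exact rational arithmetic from the standard fractions module keeps the check free of rounding error, and the 2×2 system is solved by Cramer's rule.

```python
from fractions import Fraction

basis = [[1, 0], [1, 1]]          # |e_1>, |e_2>
reciprocal = [[1, -1], [0, 1]]    # <f_1|, <f_2|
O = [[2, 1], [1, 3]]              # illustrative operator
h = [3, 5]                        # illustrative right-hand side


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def matvec(A, v):
    return [dot(row, v) for row in A]


# Matrix elements O_ji = <f_j|O|e_i> and coefficients <f_j|h>
O_mat = [[dot(f, matvec(O, e)) for e in basis] for f in reciprocal]
rhs = [dot(f, h) for f in reciprocal]

# Solve sum_i O_ji c_i = <f_j|h> by Cramer's rule (2x2 only)
det = O_mat[0][0] * O_mat[1][1] - O_mat[0][1] * O_mat[1][0]
c = [
    Fraction(rhs[0] * O_mat[1][1] - O_mat[0][1] * rhs[1], det),
    Fraction(O_mat[0][0] * rhs[1] - O_mat[1][0] * rhs[0], det),
]

# Rebuild psi = sum_i c_i |e_i> and confirm that O psi = h
psi = [sum(ci * e[k] for ci, e in zip(c, basis)) for k in range(2)]
print(psi)
print(matvec(O, psi) == h)
```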

The role of spectral theory arises in establishing the nature and existence of the basis and the reciprocal basis. In particular, the basis might consist of the eigenfunctions of some linear operator L:

L | e_i \rangle = \lambda_i | e_i \rangle \, ;

with the { λ_i } the eigenvalues of L from the spectrum of L. Then the resolution of the identity above provides the dyad expansion of L:

LI = L = \sum_{i=1}^{n} L | e_i \rangle \langle f_i|  = \sum_{i=1}^{n} \lambda _i | e_i \rangle \langle f_i | .

Resolvent operator

Main article: Resolvent formalism

Using spectral theory, the resolvent operator R:

R =  (\lambda I - L)^{-1},\,

can be evaluated in terms of the eigenfunctions and eigenvalues of L, and the Green's function corresponding to L can be found.

Applying R to some arbitrary function in the space, say \varphi,

R  |\varphi \rangle = (\lambda I - L)^{-1} |\varphi \rangle = \sum_{i=1}^n \frac{1}{\lambda- \lambda_i} |e_i \rangle \langle f_i | \varphi \rangle.

This function has poles in the complex λ-plane at each eigenvalue of L. Thus, using the calculus of residues:

\frac{1}{2\pi i } \oint_C R  |\varphi \rangle d \lambda = -\sum_{i=1}^n |e_i \rangle   \langle f_i | \varphi \rangle  = -|\varphi \rangle,

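where the line integral is over a contour C that encloses all the eigenvalues of L. The minus sign corresponds to traversing C clockwise; with the usual counterclockwise orientation each simple pole at λ_i contributes its residue | e_i \rangle \langle f_i | \varphi \rangle with a plus sign, and the integral equals +| \varphi \rangle.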
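The residue computation can be checked numerically in a small case. The sketch below assumes the eigen-expansion of R = (λI − L)^{-1} given above, an illustrative two-dimensional basis with eigenvalues 2 and 5, and a circular contour of centre 3.5 and radius 3 that encloses both. The contour is traversed counterclockwise, for which the integral returns +φ; reversing the orientation flips the sign.

```python
import cmath
import math

basis = [[1, 0], [1, 1]]          # |e_1>, |e_2>
reciprocal = [[1, -1], [0, 1]]    # <f_1|, <f_2|
eigenvalues = [2, 5]              # poles of the resolvent
phi = [3, 2]                      # arbitrary vector


def resolvent_apply(lam, vec):
    """R|vec> = sum_i |e_i><f_i|vec> / (lam - lambda_i)."""
    out = [0j, 0j]
    for lam_i, e, f in zip(eigenvalues, basis, reciprocal):
        coeff = sum(fk * vk for fk, vk in zip(f, vec)) / (lam - lam_i)
        out = [o + coeff * ek for o, ek in zip(out, e)]
    return out


def contour_integral(center, radius, n=400):
    """(1 / 2 pi i) * counterclockwise integral of R|phi> over a circle,
    approximated with the trapezoidal rule on n equally spaced points."""
    total = [0j, 0j]
    for k in range(n):
        theta = 2 * math.pi * k / n
        lam = center + radius * cmath.exp(1j * theta)
        dlam = 1j * radius * cmath.exp(1j * theta) * (2 * math.pi / n)
        r = resolvent_apply(lam, phi)
        total = [t + rk * dlam for t, rk in zip(total, r)]
    return [t / (2j * math.pi) for t in total]


result = contour_integral(3.5, 3.0)
print([round(z.real, 9) for z in result])
```

Shrinking the circle so that it encloses only one eigenvalue picks out the single term | e_i ⟩⟨ f_i | φ ⟩, the projection of φ onto the corresponding eigenvector.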

Suppose our functions are defined over some coordinates {x_j}, that is:

\langle x, \varphi \rangle = \varphi (x_1, x_2, ...).

Introducing the notation

 \langle x , y \rangle = \delta (x-y),

where δ(x − y) = δ(x_1 − y_1, x_2 − y_2, x_3 − y_3, ...) is the Dirac delta function,[23] we can write

\langle x, \varphi \rangle = \int \langle x , y \rangle \langle y, \varphi \rangle dy.

Then:

\begin{align}
\left\langle x, \frac{1}{2\pi i } \oint_C \frac{\varphi}{\lambda I - L} d \lambda\right\rangle &= \frac{1}{2\pi i }\oint_C d \lambda \left \langle x, \frac{\varphi}{\lambda I - L} \right \rangle\\
&= \frac{1}{2\pi i } \oint_C d \lambda \int dy \left \langle x,  \frac{y}{\lambda I - L} \right \rangle  \langle y, \varphi \rangle
\end{align}

The function G(x, y; λ) defined by:

\begin{align}
G(x, y; \lambda) &= \left \langle x, \frac{y}{\lambda I - L} \right \rangle \\
&= \sum_{i=1}^n \sum_{j=1}^n \langle x, e_i \rangle \left \langle f_i, \frac{e_j}{\lambda I - L} \right \rangle \langle f_j , y\rangle \\
&= \sum_{i=1}^n \frac{\langle x,  e_i \rangle \langle f_i , y\rangle }{\lambda  - \lambda_i} \\
&= \sum_{i=1}^n \frac{e_i (x) f_i^*(y) }{\lambda  - \lambda_i},
\end{align}

is called the Green's function for operator L, and satisfies:[24]

\frac{1}{2\pi i }\oint_C G(x,y;\lambda)  d \lambda = -\sum_{i=1}^n  \langle x, e_i \rangle \langle f_i , y\rangle = -\langle x, y\rangle = -\delta (x-y).

Operator equations

Consider the operator equation:

(O-\lambda I ) |\psi \rangle = |h \rangle;

in terms of coordinates:

\int \langle x, (O-\lambda I)y \rangle \langle y, \psi \rangle dy = h(x).

A particular case is λ = 0.

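The Green's function is now defined as the kernel of (O − λI)^{-1}, which differs by an overall sign from the resolvent (λI − L)^{-1} of the previous section: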

\langle y, G(\lambda) z\rangle = \left \langle y, (O-\lambda I)^{-1} z \right \rangle = G(y, z; \lambda),

and satisfies:

\int \langle x, (O - \lambda I) y \rangle \langle y, G(\lambda) z \rangle dy = \int \langle x, (O-\lambda I) y \rangle \left \langle y, (O-\lambda I)^{-1} z \right \rangle dy = \langle x , z \rangle = \delta (x-z).

Using this Green's function property:

\int \langle x, (O-\lambda I) y \rangle G(y, z; \lambda ) dy = \delta (x-z).

Then, multiplying both sides of this equation by h(z) and integrating:

\int dz h(z) \int dy \langle x, (O-\lambda I)y \rangle G(y, z; \lambda)=\int dy \langle x, (O-\lambda I) y \rangle \int dz h(z)G(y, z; \lambda) = h(x),

which suggests the solution is:

\psi(x) = \int h(z) G(x, z; \lambda) dz.

That is, the function ψ(x) satisfying the operator equation is found if we can find the spectrum of O, and construct G, for example by using the eigenfunction expansion of (O-\lambda I)^{-1}:

G(x, z; \lambda)  = \sum_{i=1}^n \frac{e_i (x) f_i^*(z)}{\lambda_i - \lambda}.
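In finite dimensions the integral over z becomes a sum and the delta function becomes the Kronecker delta, so the construction can be tested directly. The sketch below assumes the sign convention of this section, in which G is the kernel of (O − λI)^{-1} and the eigen-expansion therefore has denominators λ_i − λ. The operator, basis, λ, and h are illustrative choices, and exact fractions are used to avoid rounding.

```python
from fractions import Fraction

basis = [[1, 0], [1, 1]]          # eigenvectors |e_1>, |e_2> of O
reciprocal = [[1, -1], [0, 1]]    # reciprocal basis <f_1|, <f_2|
eigenvalues = [2, 5]
O = [[2, 3], [0, 5]]              # equals sum_i lambda_i |e_i><f_i|
lam = 1                           # not an eigenvalue of O
h = [3, 2]

# G[x][z] = sum_i e_i(x) f_i(z) / (lambda_i - lam)
G = [
    [
        sum(Fraction(e[x] * f[z], lam_i - lam)
            for lam_i, e, f in zip(eigenvalues, basis, reciprocal))
        for z in range(2)
    ]
    for x in range(2)
]

# psi(x) = sum_z G(x, z) h(z), the discrete analogue of the integral
psi = [sum(G[x][z] * h[z] for z in range(2)) for x in range(2)]

# Check (O - lam*I) psi = h
residual = [
    sum(O[x][z] * psi[z] for z in range(2)) - lam * psi[x] - h[x]
    for x in range(2)
]
print(psi)
print(residual == [0, 0])
```

If λ coincides with an eigenvalue, one denominator vanishes and G does not exist, which is the finite-dimensional counterpart of λ lying in the spectrum of O.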

There are many other ways to find G.[25] See the articles on Green's functions and on Fredholm integral equations. It must be kept in mind that the above mathematics is purely formal, and a rigorous treatment involves sophisticated mathematics, including a good background knowledge of functional analysis, Hilbert spaces, distributions and so forth. Consult these articles and the references for more detail.

Spectral theorem and Rayleigh quotient

Optimization problems provide some of the most useful illustrations of the combinatorial significance of the eigenvalues and eigenvectors of symmetric matrices, especially through the Rayleigh quotient with respect to a matrix M.

Theorem Let M be a symmetric matrix and let x be a non-zero vector that maximizes the Rayleigh quotient with respect to M. Then x is an eigenvector of M with eigenvalue equal to the Rayleigh quotient. Moreover, this eigenvalue is the largest eigenvalue of M.

Proof Assume the spectral theorem. Let the eigenvalues of M be \lambda_1\le\lambda_2\le\cdots\le\lambda_n, with corresponding eigenvectors v_1, \dots, v_n. Since the {v_i} form an orthonormal basis, any vector x can be expressed in this basis as

x = \sum_{i} (v_i^{T} x) \, v_i

This formula is easy to verify: taking the inner product of the right-hand side with any v_j gives

v_j^{T}\sum_{i} (v_i^{T} x) \, v_i
 = \sum_{i} (v_i^{T} x) \, v_j^{T} v_i
 = (v_j^{T} x) \, v_j^{T} v_j
 = v_j^{T} x

so both sides have the same coefficients in the basis. The same expansion gives x^{T} x = \sum_{j} (v_j^{T} x)^2. Now evaluate the numerator of the Rayleigh quotient with respect to x:

\begin{align}
x^{T} M x &= \left(\sum_{i} (v_i^{T} x) v_i\right)^{T} M \left(\sum_{j} (v_j^{T} x) v_j\right) \\
&= \left(\sum_{i} (v_i^{T} x) v_i\right)^{T} \left(\sum_{j} (v_j^{T} x) \lambda_j v_j\right) \\
&= \sum_{i,j} (v_i^{T} x)(v_j^{T} x) \lambda_j \, v_i^{T} v_j \\
&= \sum_{j} (v_j^{T} x)^2 \lambda_j \\
&\le \lambda_n \sum_{j} (v_j^{T} x)^2 = \lambda_n \, x^{T} x
\end{align}

Dividing by x^{T} x > 0 shows that the Rayleigh quotient is never greater than \lambda_n. Equality holds exactly when (v_j^{T} x)^2 (\lambda_n - \lambda_j) = 0 for every j, that is, when x lies in the eigenspace of \lambda_n. A maximizer x is therefore an eigenvector of M whose eigenvalue \lambda_n equals the Rayleigh quotient at x.[26]
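The bound can be observed numerically. The following sketch assumes a particular symmetric 2×2 matrix with eigenvalues 1 and 3 (chosen for illustration) and samples the Rayleigh quotient over unit vectors on a circle. Sampling is only a check of the theorem in one example, not a practical way to compute eigenvalues.

```python
import math

M = [[2.0, 1.0], [1.0, 2.0]]   # symmetric; eigenvalues 1 and 3
lambda_max = 3.0


def rayleigh(A, x):
    """Rayleigh quotient x^T A x / x^T x for a non-zero vector x."""
    Ax = [sum(a * xi for a, xi in zip(row, x)) for row in A]
    return sum(xi * yi for xi, yi in zip(x, Ax)) / sum(xi * xi for xi in x)


# Unit vectors at 3600 equally spaced angles (the grid contains pi/4)
samples = [
    [math.cos(2 * math.pi * k / 3600), math.sin(2 * math.pi * k / 3600)]
    for k in range(3600)
]

best = max(samples, key=lambda x: rayleigh(M, x))
print(rayleigh(M, best))   # close to 3, the largest eigenvalue
print(best)                # close to +/-(1, 1)/sqrt(2), an eigenvector for 3
```

For this matrix the quotient on the unit circle is 2 + sin 2θ, so the maximum 3 is reached in the direction (1, 1)/√2 and its negative, in agreement with the theorem.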

Notes

  1. ^ Jean Alexandre Dieudonné (1981). History of functional analysis. Elsevier. ISBN 0-444-86148-3. 
  2. ^ William Arveson (2002). "Chapter 1: spectral theory and Banach algebras". A short course on spectral theory. Springer. ISBN 0-387-95300-0. 
  3. ^ Viktor Antonovich Sadovnichiĭ (1991). "Chapter 4: The geometry of Hilbert space: the spectral theory of operators". Theory of Operators. Springer. p. 181 et seq. ISBN 0-306-11028-8. 
  4. ^ John von Neumann (1996). The mathematical foundations of quantum mechanics; Volume 2 in Princeton Landmarks in Mathematics series (Reprint of translation of original 1932 ed.). Princeton University Press. ISBN 0-691-02893-1. 
  5. ^ E. Brian Davies, quoted on the King's College London analysis group website "Research at the analysis group". 
  6. ^ Nicholas Young (1988). An introduction to Hilbert space. Cambridge University Press. p. 3. ISBN 0-521-33717-8. 
  7. ^ Jean-Luc Dorier (2000). On the teaching of linear algebra; Vol. 23 of Mathematics education library. Springer. ISBN 0-7923-6539-9. 
  8. ^ Cf. Spectra in mathematics and in physics by Jean Mawhin, p.4 and pp. 10-11.
  9. ^ Edgar Raymond Lorch (2003). Spectral Theory (Reprint of Oxford 1962 ed.). Textbook Publishers. p. 89. ISBN 0-7581-7156-0. 
  10. ^ Nicholas Young. op. cit. p. 81. ISBN 0-521-33717-8. 
  11. ^ Helmut H. Schaefer, Manfred P. H. Wolff (1999). Topological vector spaces (2nd ed.). Springer. p. 36. ISBN 0-387-98726-6. 
  12. ^ Dmitriĭ Petrovich Zhelobenko (2006). Principal structures and methods of representation theory. American Mathematical Society. ISBN 0821837311. 
  13. ^ a b Edgar Raymond Lorch (2003). "Chapter III: Hilbert Space". op. cit.. p. 57. ISBN 0-7581-7156-0. 
  14. ^ Edgar Raymond Lorch (2003). "Chapter V: The Structure of Self-Adjoint Transformations". op. cit.. p. 106 ff. ISBN 0-7581-7156-0. 
  15. ^ Bernard Friedman (1990). Principles and Techniques of Applied Mathematics (Reprint of 1956 Wiley ed.). Dover Publications. p. 26. ISBN 0-486-66444-9. 
  16. ^ PAM Dirac (1981). The principles of quantum mechanics (4th ed.). Oxford University Press. p. 29 ff. ISBN 0-19-852011-5. 
  17. ^ Jürgen Audretsch (2007). "Chapter 1.1.2: Linear operators on the Hilbert space". Entangled systems: new directions in quantum physics. Wiley-VCH. p. 5. ISBN 3-527-40684-0. 
  18. ^ R. A. Howland (2006). Intermediate dynamics: a linear algebraic approach (2nd ed.). Birkhäuser. p. 69 ff. ISBN 0-387-28059-6. 
  19. ^ Bernard Friedman (1990). "Chapter 2: Spectral theory of operators". op. cit. p. 57. ISBN 0-486-66444-9. 
  20. ^ a b See discussion in Dirac's book referred to above, and Milan Vujičić (2008). Linear algebra thoroughly explained. Springer. p. 274. ISBN 3-540-74637-4. 
  21. ^ a b See, for example, the fundamental text of John von Neumann. op. cit. ISBN 0-691-02893-1; Arch W. Naylor, George R. Sell (2000). Linear Operator Theory in Engineering and Science; Vol. 40 of Applied mathematical science. Springer. p. 401. ISBN 0-387-95001-X; Steven Roman (2008). Advanced linear algebra (3rd ed.). Springer. ISBN 0-387-72828-7; and I︠U︡riĭ Makarovich Berezanskiĭ (1968). Expansions in eigenfunctions of selfadjoint operators; Vol. 17 in Translations of mathematical monographs. American Mathematical Society. ISBN 0-8218-1567-9. 
  22. ^ See for example, Gerald B Folland (2009). "Convergence and completeness". Fourier Analysis and its Applications (Reprint of Wadsworth & Brooks/Cole 1992 ed.). American Mathematical Society. pp. 77 ff. ISBN 0-8218-4790-2. 
  23. ^ PAM Dirac. op. cit. p. 60 ff. ISBN 0-19-852011-5. 
  24. ^ Bernard Friedman. op. cit. p. 214, Eq. 2.14. ISBN 0-486-66444-9. 
  25. ^ For example, see Sadri Hassani (1999). "Chapter 20: Green's functions in one dimension". Mathematical physics: a modern introduction to its foundations. Springer. p. 553 et seq. ISBN 0-387-98579-4.  and Qing-Hua Qin (2007). Green's function and boundary elements of multifield materials. Elsevier. ISBN 0-08-045134-9. 
  26. ^ Spielman,Daniel A. "Lecture Note of Spectral Graph Theory" Yale University(2012) http://cs.yale.edu/homes/spielman/561/ .

General references

  • Shmuel Kantorovitz (1983). Spectral Theory of Banach Space Operators. Springer. 
