Orthonormality

From Wikipedia, the free encyclopedia

In linear algebra, two vectors in an inner product space are orthonormal if they are orthogonal and both are unit vectors. A set of vectors forms an orthonormal set if all vectors in the set are mutually orthogonal and all are of unit length. An orthonormal set which forms a basis is called an orthonormal basis.

Intuitive overview

The construction of orthogonality of vectors is motivated by a desire to extend the intuitive notion of perpendicular vectors to higher-dimensional spaces. In the Cartesian plane, two vectors are said to be perpendicular if the angle between them is 90° (i.e. if they form a right angle). This definition can be formalized in Cartesian space by defining the dot product and specifying that two vectors in the plane are orthogonal if their dot product is zero.

Similarly, the construction of the norm of a vector is motivated by a desire to extend the intuitive notion of the length of a vector to higher-dimensional spaces. In Cartesian space, the norm of a vector is the square root of the vector dotted with itself. That is,

\| \mathbf{x} \| = \sqrt{ \mathbf{x} \cdot \mathbf{x}}

Many important results in linear algebra deal with collections of two or more orthogonal vectors. Often, however, it is easier to work with vectors of unit length, that is, vectors whose norm equals 1. The notion of restricting orthogonal pairs of vectors to only those of unit length is important enough to be given a special name: two vectors which are orthogonal and of length 1 are said to be orthonormal.
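The two conditions can be checked numerically. The following is a minimal Python sketch, assuming real vectors stored as tuples and the ordinary dot product; the helper names `dot`, `norm` and `is_orthonormal_pair`, and the tolerance `tol`, are illustrative choices and not part of any standard library.

```python
import math

def dot(u, v):
    """Euclidean dot product of two equal-length sequences."""
    return sum(a * b for a, b in zip(u, v))

def norm(u):
    """Euclidean norm: square root of the vector dotted with itself."""
    return math.sqrt(dot(u, u))

def is_orthonormal_pair(u, v, tol=1e-12):
    """True if u and v are orthogonal and both have unit length (up to tol)."""
    return (abs(dot(u, v)) <= tol
            and abs(norm(u) - 1.0) <= tol
            and abs(norm(v) - 1.0) <= tol)

u = (3 / 5, 4 / 5)
v = (-4 / 5, 3 / 5)
w = (3.0, 4.0)  # orthogonal to (-4, 3), but of length 5 rather than 1

print(is_orthonormal_pair(u, v))
print(is_orthonormal_pair(w, (-4.0, 3.0)))
```

The second pair is orthogonal but not orthonormal, which illustrates that the unit-length requirement is an additional restriction on top of orthogonality.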

Simple example

What does a pair of orthonormal vectors in 2-D Euclidean space look like?

Let u = (x1, y1) and v = (x2, y2). Consider the restrictions on x1, x2, y1, y2 required to make u and v form an orthonormal pair.

  • From the orthogonality restriction, u · v = 0.
  • From the unit length restriction on u, ||u|| = 1.
  • From the unit length restriction on v, ||v|| = 1.

Expanding these terms gives 3 equations:

  1. x_1 x_2 + y_1 y_2 = 0
  2. \sqrt{{x_1}^2 + {y_1}^2} = 1
  3. \sqrt{{x_2}^2 + {y_2}^2} = 1

Converting from Cartesian to polar coordinates, with x_i = r_i \cos \theta_i and y_i = r_i \sin \theta_i, Equation (2) and Equation (3) immediately give r1 = r2 = 1. In other words, requiring the vectors to be of unit length restricts them to lie on the unit circle.

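After substitution, Equation (1) becomes \cos \theta_1 \cos \theta_2 + \sin \theta_1 \sin \theta_2 = 0. Rearranging gives \tan \theta_1 = - \cot \theta_2 (when \cos \theta_1 and \sin \theta_2 are nonzero; the remaining cases, in which the vectors lie along the coordinate axes, lead to the same conclusion). Using a trigonometric identity to convert the cotangent term gives

 \tan ( \theta_1 ) = \tan \left( \theta_2 + \tfrac{\pi}{2} \right)
 \Rightarrow \theta_1 = \theta_2 + \tfrac{\pi}{2} + k\pi, \quad k \in \mathbb{Z}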

It is clear that in the plane, orthonormal vectors are simply radii of the unit circle whose difference in angles equals 90°.
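The conclusion of this example can be spot-checked numerically. The sketch below, using only Python's standard `math` module, builds pairs of vectors on the unit circle whose angles differ by π/2 and evaluates the three conditions; the function names and the sample angles are arbitrary choices made for illustration.

```python
import math

def unit_vector(theta):
    """Point on the unit circle at polar angle theta (r = 1)."""
    return (math.cos(theta), math.sin(theta))

def dot(u, v):
    """Euclidean dot product of two equal-length sequences."""
    return sum(a * b for a, b in zip(u, v))

def check_pair(theta2):
    """Return (u . v, ||u||, ||v||) for theta1 = theta2 + pi/2."""
    v = unit_vector(theta2)
    u = unit_vector(theta2 + math.pi / 2)
    return dot(u, v), math.sqrt(dot(u, u)), math.sqrt(dot(v, v))

for theta2 in (0.0, 0.3, 1.0, 2.5):
    d, nu, nv = check_pair(theta2)
    print(f"theta2={theta2}: u.v={d:.1e}, |u|={nu:.6f}, |v|={nv:.6f}")
```

Up to floating-point rounding, each dot product is zero and each norm is one, in agreement with Equations (1) to (3).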

Definition

Let \mathcal{V} be an inner-product space. A set of vectors

 \left\{ u_1 , u_2 , \ldots , u_n , \ldots \right\} \subseteq \mathcal{V}

is called orthonormal if and only if

 \forall i,j : \langle u_i , u_j \rangle = \delta_{ij}

where \delta_{ij} is the Kronecker delta and \langle \cdot , \cdot \rangle is the inner product defined over \mathcal{V}.
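For a finite set, the Kronecker delta condition can be tested directly by comparing every pairwise inner product with \delta_{ij}. The Python sketch below assumes, purely for illustration, that \mathcal{V} is C^n with the standard inner product, taken here to be conjugate-linear in the second argument; the names `inner`, `kronecker_delta` and `is_orthonormal` are hypothetical helpers.

```python
import math

def inner(u, v):
    """Standard inner product on C^n (assumed convention:
    conjugate-linear in the second argument)."""
    return sum(a * complex(b).conjugate() for a, b in zip(u, v))

def kronecker_delta(i, j):
    """1 if the indices are equal, 0 otherwise."""
    return 1 if i == j else 0

def is_orthonormal(vectors, tol=1e-12):
    """Check <u_i, u_j> = delta_ij for all i, j, up to tol."""
    return all(abs(inner(u, v) - kronecker_delta(i, j)) <= tol
               for i, u in enumerate(vectors)
               for j, v in enumerate(vectors))

s = 1 / math.sqrt(2)
good = [(s, s * 1j), (s, -s * 1j)]   # orthonormal in C^2
bad = [(1, 0), (1, 1)]               # neither orthogonal nor all unit length

print(is_orthonormal(good))
print(is_orthonormal(bad))
```

Note that the conjugate matters in the complex case: without it, the first vector of `good` would appear to have "length" zero.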

Significance

Orthonormal sets are not especially significant on their own. However, they display certain features that make them fundamental in exploring the notion of diagonalizability of certain operators on vector spaces.

Properties

Orthonormal sets have certain very appealing properties, which make them particularly easy to work with.

  • Theorem. If {e1, e2,...,en} is an orthonormal list of vectors, then for all scalars a1, a2,...,an
||a_1 e_1 + a_2 e_2 + \cdots + a_n e_n||^2 = |a_1|^2 + |a_2|^2 + \cdots + |a_n|^2
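The identity holds because, when the squared norm is expanded as an inner product, every cross term \langle e_i , e_j \rangle with i ≠ j vanishes and every term \langle e_i , e_i \rangle equals 1. A small numerical check in Python follows; the particular orthonormal list in R^3 and the coefficients are arbitrary choices made for illustration.

```python
import math

def dot(u, v):
    """Euclidean dot product of two equal-length sequences."""
    return sum(a * b for a, b in zip(u, v))

def linear_combination(coeffs, vectors):
    """Return a_1 e_1 + ... + a_n e_n as a tuple of components."""
    dim = len(vectors[0])
    return tuple(sum(a * e[k] for a, e in zip(coeffs, vectors))
                 for k in range(dim))

s = 1 / math.sqrt(2)
basis = [(s, s, 0.0), (s, -s, 0.0), (0.0, 0.0, 1.0)]  # an orthonormal list
coeffs = (2.0, -1.0, 3.0)

x = linear_combination(coeffs, basis)
lhs = dot(x, x)                        # ||a_1 e_1 + a_2 e_2 + a_3 e_3||^2
rhs = sum(abs(a) ** 2 for a in coeffs) # |a_1|^2 + |a_2|^2 + |a_3|^2

print(lhs, rhs)
```

The two printed values agree up to floating-point rounding.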

Existence

  • Gram-Schmidt theorem. If {v1, v2,...,vn} is a linearly independent list of vectors in an inner-product space \mathcal{V}, then there exists an orthonormal list {e1, e2,...,en} of vectors in \mathcal{V} such that span(e1, e2,...,en) = span(v1, v2,...,vn).

The proof of the Gram-Schmidt theorem is constructive, and is discussed at length elsewhere. The Gram-Schmidt theorem guarantees that every finite-dimensional inner-product space admits an orthonormal basis; with the axiom of choice, an analogous statement holds for every Hilbert space, where an orthonormal basis is understood as an orthonormal set whose span is dense. This is possibly the most significant use of orthonormality, as this fact permits operators on inner-product spaces to be discussed in terms of their action on the space's orthonormal basis vectors. What results is a deep relationship between the diagonalizability of an operator and how it acts on the orthonormal basis vectors. This relationship is characterized by the Spectral Theorem.
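The construction behind the theorem can be sketched in a few lines of Python. This is a minimal illustration, assuming real vectors with the ordinary dot product, and it uses the variant usually called modified Gram-Schmidt, in which projections are subtracted from the running remainder one at a time; the function name `gram_schmidt` and the tolerance `tol` are choices made here, not a reference implementation.

```python
import math

def dot(u, v):
    """Euclidean dot product of two equal-length sequences."""
    return sum(a * b for a, b in zip(u, v))

def gram_schmidt(vectors, tol=1e-12):
    """Return an orthonormal list e_1..e_n with the same span as the
    linearly independent input list v_1..v_n."""
    basis = []
    for v in vectors:
        w = list(v)
        # Remove the component of w along each vector found so far.
        for e in basis:
            c = dot(w, e)
            w = [wi - c * ei for wi, ei in zip(w, e)]
        length = math.sqrt(dot(w, w))
        if length <= tol:
            raise ValueError("vectors are not linearly independent")
        basis.append(tuple(wi / length for wi in w))
    return basis

vs = [(1.0, 1.0, 0.0), (1.0, 0.0, 1.0), (0.0, 1.0, 1.0)]
es = gram_schmidt(vs)
for e in es:
    print(e)
```

Because each e_k is built only from v_1, ..., v_k, the span of the first k output vectors equals the span of the first k input vectors, which is slightly stronger than the statement of the theorem.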

Examples

Standard basis

The standard basis for the coordinate space F^n is

 \{ e_1 , e_2 , \ldots , e_n \} \quad \text{where}
 e_1 = (1, 0, \ldots, 0)
 e_2 = (0, 1, \ldots, 0)
 \vdots
 e_n = (0, 0, \ldots, 1)

Any two vectors ei, ej where i≠j are orthogonal, and all vectors are clearly of unit length. So {e1, e2,...,en} forms an orthonormal basis.

Real-valued functions

When referring to real-valued functions, usually the L² inner product is assumed unless otherwise stated. Two functions \phi(x) and \psi(x) are orthonormal over the interval [a,b] if

(1)\quad\langle\phi(x),\psi(x)\rangle = \int_a^b\phi(x)\psi(x)dx = 0,\quad{\rm and}
(2)\quad||\phi(x)||_2 = ||\psi(x)||_2 = \left[\int_a^b|\phi(x)|^2dx\right]^\frac{1}{2} = \left[\int_a^b|\psi(x)|^2dx\right]^\frac{1}{2} = 1.
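Conditions (1) and (2) can be checked approximately by numerical integration. The Python sketch below uses a composite Simpson's rule written from scratch; the functions `simpson`, `inner` and `l2_norm` are illustrative helpers, and the example pair on [−1, 1] (a constant and a scaled linear function) is an assumption chosen for the demonstration rather than something taken from the text above.

```python
import math

def simpson(f, a, b, n=1000):
    """Composite Simpson's rule on [a, b] with n subintervals (n made even)."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3

def inner(f, g, a, b):
    """Approximate the inner product: integral of f(x) g(x) over [a, b]."""
    return simpson(lambda x: f(x) * g(x), a, b)

def l2_norm(f, a, b):
    """Approximate the 2-norm induced by the inner product above."""
    return math.sqrt(inner(f, f, a, b))

phi = lambda x: 1 / math.sqrt(2)       # constant function
psi = lambda x: math.sqrt(1.5) * x     # scaled linear function

print(inner(phi, psi, -1.0, 1.0))
print(l2_norm(phi, -1.0, 1.0), l2_norm(psi, -1.0, 1.0))
```

Simpson's rule integrates polynomials of degree up to three without discretization error, so for this pair the only deviations from 0 and 1 come from floating-point rounding.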

Fourier series

The Fourier series is a method of expressing a periodic function in terms of sinusoidal basis functions. Taking C[−π,π] to be the space of all real-valued functions continuous on the interval [−π,π] and taking the inner product to be

\langle f, g \rangle = \int_{-\pi}^{\pi} f(x)g(x)dx

it can be shown that

\left\{ \frac{1}{\sqrt{2\pi}}, \frac{\sin(x)}{\sqrt{\pi}}, \frac{\sin(2x)}{\sqrt{\pi}}, \ldots, \frac{\sin(nx)}{\sqrt{\pi}}, \frac{\cos(x)}{\sqrt{\pi}}, \frac{\cos(2x)}{\sqrt{\pi}}, \ldots, \frac{\cos(nx)}{\sqrt{\pi}} \right\}, \quad n \in \mathbb{N}

forms an orthonormal set.
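This claim can be checked numerically for small n. The Python sketch below builds the functions for n up to 3, approximates every pairwise inner product on [−π, π] with a composite Simpson's rule, and reports the largest deviation from the Kronecker delta; the helper names `simpson`, `fourier_set` and `gram_matrix`, and the cutoff of 3, are assumptions made for illustration.

```python
import math

def simpson(f, a, b, n=2000):
    """Composite Simpson's rule on [a, b] with n subintervals (n made even)."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3

def fourier_set(N):
    """The constant, sin(kx) and cos(kx) functions for k = 1..N, normalized."""
    fs = [lambda x: 1 / math.sqrt(2 * math.pi)]
    fs += [lambda x, k=k: math.sin(k * x) / math.sqrt(math.pi)
           for k in range(1, N + 1)]
    fs += [lambda x, k=k: math.cos(k * x) / math.sqrt(math.pi)
           for k in range(1, N + 1)]
    return fs

def gram_matrix(fs):
    """Matrix of approximate inner products <f_i, f_j> on [-pi, pi]."""
    return [[simpson(lambda x: f(x) * g(x), -math.pi, math.pi) for g in fs]
            for f in fs]

G = gram_matrix(fourier_set(3))
size = len(G)
max_dev = max(abs(G[i][j] - (1.0 if i == j else 0.0))
              for i in range(size) for j in range(size))
print(max_dev)
```

A small value of `max_dev` is consistent with orthonormality; it is numerical evidence for the chosen cutoff, not a proof of the general statement.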

However, this is of little consequence, because C[−π,π] is infinite-dimensional, and a finite set of vectors cannot span it. But removing the restriction that n be finite makes the span of the set dense in C[−π,π] with respect to the norm induced by the inner product, and the set is therefore an orthonormal basis of C[−π,π] in that sense.
