Tensor contraction

From Wikipedia, the free encyclopedia
Jump to: navigation, search

In multilinear algebra, a tensor contraction is an operation on one or more tensors that arises from the natural pairing of a finite-dimensional vector space and its dual. In components, it is expressed as a sum of products of scalar components of the tensor(s) caused by applying the summation convention to a pair of dummy indices which are bound to each other in an expression. The contraction of a single mixed tensor occurs when a pair of literal indices (one a subscript, the other a superscript) of the tensor are set equal to each other and summed over. In the Einstein notation this summation is built into the notation. The result is another tensor with order reduced by 2.

Tensor contraction can be seen as a generalization of the trace.

Abstract formulation[edit]

Let V be a vector space over a field k. The core of the contraction operation, and the simplest case, is the natural pairing of V with its dual vector space V*. The pairing is the linear transformation from the tensor product of these two spaces to the field k:

 C : V^* \otimes V \rightarrow k

corresponding to the bilinear form

 \langle f, v \rangle = f(v)

where f is in V* and v is in V. The map C defines the contraction operation on a tensor of type (1,1), which is an element of V^* \otimes V . Note that the result is a scalar (an element of k). Using the natural isomorphism between V^* \otimes V and the space of linear transformations from V to V,[1] one obtains a basis-free definition of the trace.

In general, a tensor of type (m, n) (with m ≥ 1 and n ≥ 1) is an element of the vector space

V \otimes \cdots \otimes V \otimes V^{*} \otimes \cdots \otimes V^{*}

(where there are m V factors and n V* factors).[2][3] Applying the natural pairing to the kth V factor and the lth V* factor, and using the identity on all other factors, defines the (k, l) contraction operation, which is a linear map which yields a tensor of type (m − 1, n − 1).[2] By analogy with the (1,1) case, the general contraction operation is sometimes called the trace.

Contraction in index notation[edit]

In abstract index notation, the basic contraction of a vector and a dual vector is denoted by

 \tilde f (\vec v) = f_\gamma v^\gamma

which is shorthand for the explicit coordinate summation[4]

 f_\gamma v^\gamma = f_1 v^1 + f_2 v^2 + \cdots + f_n v^n

(where vi are the components of v in a particular basis and fi are the components of f in the corresponding dual basis).

Since a general mixed dyadic tensor is a linear combination of decomposable tensors of the form f \otimes v, the explicit formula for the dyadic case follows: let

 \mathbf{T} = T^i{}_j \mathbf{e}_i \mathbf{e}^j

be a mixed dyadic tensor. Then its contraction is

 T^i {}_j \mathbf{e}_i \cdot \mathbf{e}^j = T^i {}_j \delta_i {}^j 
= T^j {}_j = T^1 {}_1 + \cdots + T^n {}_n .

A general contraction is denoted by labeling one covariant index and one contravariant index with the same letter, summation over that index being implied by the summation convention. The resulting contracted tensor inherits the remaining indices of the original tensor. For example, contracting a tensor T of type (2,2) on the second and third indices to create a new tensor U of type (1,1) is written as

 T^{ab} {}_{bc} = \sum_{b}{T^{ab}{}_{bc}} = T^{a1} {}_{1c} + T^{a2} {}_{2c} + \cdots + T^{an} {}_{nc} = U^a {}_c .

By contrast, let

 \mathbf{T} = \mathbf{e}^i \mathbf{e}^j

be an unmixed dyadic tensor. This tensor does not contract; if its base vectors are dotted the result is the contravariant metric tensor,

 g^{ij} = \mathbf{e}^i \cdot \mathbf{e}^j ,

whose rank is 2.

Metric contraction[edit]

As in the previous example, contraction on a pair of indices that are either both contravariant or both covariant is not possible in general. However, in the presence of an inner product (also known as a metric) g, such contractions are possible. One uses the metric to raise or lower one of the indices, as needed, and then one uses the usual operation of contraction. The combined operation is known as metric contraction.[5]

Application to tensor fields[edit]

Contraction is often applied to tensor fields over spaces (e.g. Euclidean space, manifolds, or schemes). Since contraction is a purely algebraic operation, it can be applied pointwise to a tensor field, e.g. if T is a (1,1) tensor field on Euclidean space, then in any coordinates, its contraction (a scalar field) U at a point x is given by

U(x) = \sum_{i} T^{i}_{i}(x)

Since the role of x is not complicated here, it is often suppressed, and the notation for tensor fields becomes identical to that for purely algebraic tensors.

Over a Riemannian manifold, a metric (field of inner products) is available, and both metric and non-metric contractions are crucial to the theory. For example, the Ricci tensor is a non-metric contraction of the Riemann curvature tensor, and the scalar curvature is the unique metric contraction of the Ricci tensor.

One can also view contraction of a tensor field in the context of modules over an appropriate ring of functions on the manifold[5] or the context of sheaves of modules over the structure sheaf;[6] see the discussion at the end of this article.

Tensor divergence[edit]

As an application of the contraction of a tensor field, let V be a vector field on a Riemannian manifold (for example, Euclidean space). Let  V^\alpha {}_{;\beta} be the covariant derivative of V (in some choice of coordinates). In the case of Cartesian coordinates in Euclidean space, one can write

 V^\alpha {}_{;\beta} = {\partial V^\alpha \over \partial x^\beta}.

Then changing index β to α causes the pair of indices to become bound to each other, so that the derivative contracts with itself to obtain the following sum:

 V^\alpha {}_{;\alpha} = V^0 {}_{;0} + \cdots + V^n {}_{;n}

which is the divergence div V. Then

 \mathrm{div} V = V^\alpha {}_{;\alpha} = 0

is a continuity equation for V.

In general, one can define various divergence operations on higher-rank tensor fields, as follows. If T is a tensor field with at least one contravariant index, taking the covariant differential and contracting the chosen contravariant index with the new covariant index corresponding to the differential results in a new tensor of rank one lower than that of T.[5]

Contraction of a pair of tensors[edit]

One can generalize the core contraction operation (vector with dual vector) in a slightly different way, by considering a pair of tensors T and U. The tensor product T \otimes U is a new tensor, which, if it has at least one covariant and one contravariant index, can be contracted. The case where T is a vector and U is a dual vector is exactly the core operation introduced first in this article.

In abstract index notation, to contract two tensors with each other, one places them side by side (juxtaposed) as factors of the same term. This implements the tensor product, yielding a composite tensor. Contracting two indices in this composite tensor implements the desired contraction of the two tensors.

For example, matrices can be represented as tensors of type (1,1) with the first index being contravariant and the second index being covariant. Let  \Lambda^\alpha {}_\beta be the components of one matrix and let  \Mu^\beta {}_\gamma be the components of a second matrix. Then their multiplication is given by the following contraction, an example of the contraction of a pair of tensors:

 \Lambda^\alpha {}_\beta \Mu^\beta {}_\gamma = \Nu^\alpha {}_\gamma .

Also, the interior product of a vector with a differential form is a special case of the contraction of two tensors with each other.

More general algebraic contexts[edit]

Let R be a commutative ring and let M be a finite free module over R. Then contraction operates on the full (mixed) tensor algebra of M in exactly the same way as it does in the case of vector spaces over a field. (The key fact is that the natural pairing is still perfect in this case.)

More generally, let OX be a sheaf of commutative rings over a topological space X, e.g. OX could be the structure sheaf of a complex manifold, analytic space, or scheme. Let M be a locally free sheaf of modules over OX of finite rank. Then the dual of M is still well-behaved[6] and contraction operations make sense in this context.

See also[edit]

Notes[edit]

  1. ^ Let L(V,V) be the space of linear transformations from V to V. Then the natural map
    V^* \otimes V \rightarrow L(V,V)
    is defined by
    f \otimes v \mapsto g
    where g(w) = f(w)v. Suppose that V is finite-dimensional. If {vi} is a basis of V and {fi} is the corresponding dual basis, then f^i \otimes v_j maps to the transformation whose matrix in this basis has only one nonzero entry, a 1 in the i,j position. This shows that the map is an isomorphism.
  2. ^ a b William Fulton and Joe Harris, Representation Theory: A First Course, GTM 129, Springer-Verlag, 1991, ISBN 0-387-97495-4, pp. 471–476.
  3. ^ Frank Warner, Foundations of Differentiable Manifolds and Lie Groups, GTM 94, Springer-Verlag, 1993, ISBN 0-387-90894-3, pp. 54–56.
  4. ^ In physics (and sometimes in mathematics), indices often start with zero instead of one. In four-dimensional spacetime, indices run from 0 to 3.
  5. ^ a b c Barrett O'Neill, Semi-Riemannian Geometry with Applications to Relativity, Academic Press, 1983, p. 86.
  6. ^ a b Robin Hartshorne, Algebraic Geometry, Springer-Verlag, 1977, ISBN 0-387-90244-9.

References[edit]

  • Donald H. Menzel. Mathematical Physics. Dover Publications, New York.
  • Richard L. Bishop and Samuel I. Goldberg, Tensor Analysis on Manifolds, Dover, 1980, ISBN 0-486-64039-6.