Linear algebra

From Wikipedia, the free encyclopedia
Not to be confused with Elementary algebra.
The three-dimensional Euclidean space R³ is a vector space, and lines and planes passing through the origin are vector subspaces in R³.

Linear algebra is the branch of mathematics concerning vector spaces and linear mappings between such spaces. It includes the study of lines, planes, and subspaces, but is also concerned with properties common to all vector spaces.

The set of points with coordinates that satisfy a linear equation forms a hyperplane in an n-dimensional space. The conditions under which a set of n hyperplanes intersect in a single point are an important focus of study in linear algebra. Such an investigation is initially motivated by a system of linear equations containing several unknowns. Such equations are naturally represented using the formalism of matrices and vectors.[1][2]

Linear algebra is central to both pure and applied mathematics. For instance, abstract algebra arises by relaxing the axioms of a vector space, leading to a number of generalizations. Functional analysis studies the infinite-dimensional version of the theory of vector spaces. Combined with calculus, linear algebra facilitates the solution of linear systems of differential equations. Techniques from linear algebra are also used in analytic geometry, engineering, physics, natural sciences, computer science, computer animation, and the social sciences (particularly in economics). Because linear algebra is such a well-developed theory, nonlinear mathematical models are sometimes approximated by linear ones.

History

The study of linear algebra first emerged from the study of determinants, which were used to solve systems of linear equations. Determinants were used by Leibniz in 1693, and subsequently, Gabriel Cramer devised Cramer's Rule for solving linear systems in 1750. Later, Gauss further developed the theory of solving linear systems by using Gaussian elimination, which was initially listed as an advancement in geodesy.[3]

The study of matrix algebra first emerged in England in the mid-1800s. In 1844 Hermann Grassmann published his “Theory of Extension” which included foundational new topics of what is today called linear algebra. In 1848, James Joseph Sylvester introduced the term matrix, which is Latin for "womb". While studying compositions of linear transformations, Arthur Cayley was led to define matrix multiplication and inverses. Crucially, Cayley used a single letter to denote a matrix, thus treating a matrix as an aggregate object. He also realized the connection between matrices and determinants, and wrote "There would be many things to say about this theory of matrices which should, it seems to me, precede the theory of determinants".[3]

In 1882, Hüseyin Tevfik Pasha wrote the book titled "Linear Algebra".[4][5] The first modern and more precise definition of a vector space was introduced by Peano in 1888;[3] by 1900, a theory of linear transformations of finite-dimensional vector spaces had emerged. Linear algebra first took its modern form in the first half of the twentieth century, when many ideas and methods of previous centuries were generalized as abstract algebra. The use of matrices in quantum mechanics, special relativity, and statistics helped spread the subject of linear algebra beyond pure mathematics. The development of computers led to increased research in efficient algorithms for Gaussian elimination and matrix decompositions, and linear algebra became an essential tool for modelling and simulations.[3]

The origin of many of these ideas is discussed in the articles on determinants and Gaussian elimination.

Educational history

In the mathematics curriculum, linear algebra first appeared in graduate textbooks in the 1940s and in undergraduate textbooks in the 1950s.[6] Following work of the School Mathematics Study Group, U.S. high schools asked 12th grade students to do "matrix algebra, formerly reserved for college" in the 1960s.[7] In France during the 1960s, educators attempted to teach linear algebra in the first year of secondary school through a curriculum on finite-dimensional vector spaces; this led to a backlash in the 1980s that removed linear algebra from the curriculum.[8] In 1993, the U.S.-based Linear Algebra Curriculum Study Group recommended that undergraduate linear algebra courses take an application-based "matrix orientation" as opposed to a theoretical orientation.[9]

Scope of study

Vector spaces

The main structures of linear algebra are vector spaces. A vector space over a field F is a set V together with two operations. Elements of V are called vectors and elements of F are called scalars. The first operation, vector addition, takes any two vectors v and w and outputs a third vector v + w. The second operation takes any scalar a and any vector v and outputs a new vector av. In the prototypical case of geometric vectors, this operation rescales the vector v by the factor a, so it is called scalar multiplication of v by a. The operations of addition and multiplication in a vector space satisfy the following axioms.[10] In the list below, let u, v and w be arbitrary vectors in V, and a and b scalars in F.

Axiom: Signification
Associativity of addition: u + (v + w) = (u + v) + w
Commutativity of addition: u + v = v + u
Identity element of addition: There exists an element 0 ∈ V, called the zero vector, such that v + 0 = v for all v ∈ V.
Inverse elements of addition: For every v ∈ V, there exists an element −v ∈ V, called the additive inverse of v, such that v + (−v) = 0
Distributivity of scalar multiplication with respect to vector addition: a(u + v) = au + av
Distributivity of scalar multiplication with respect to field addition: (a + b)v = av + bv
Compatibility of scalar multiplication with field multiplication: a(bv) = (ab)v [nb 1]
Identity element of scalar multiplication: 1v = v, where 1 denotes the multiplicative identity in F.

Elements of a general vector space V may be objects of any nature, for example, functions, polynomials, vectors, or matrices. Linear algebra is concerned with properties common to all vector spaces.

Linear transformations

As in the theory of other algebraic structures, linear algebra studies mappings between vector spaces that preserve the vector-space structure. Given two vector spaces V and W over a field F, a linear transformation (also called linear map, linear mapping or linear operator) is a map

 T:V\to W

that is compatible with addition and scalar multiplication:

 T(u+v)=T(u)+T(v), \quad T(av)=aT(v)

for any vectors u, v ∈ V and any scalar a ∈ F.

Consequently, for any vectors u, v ∈ V and scalars a, b ∈ F:

 \quad T(au+bv)=T(au)+T(bv)=aT(u)+bT(v)
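The two conditions can be checked numerically for a map given by a matrix. The following is a minimal sketch, assuming a map T from R³ to R² defined by a 2 × 3 matrix; the matrix `A`, the vectors `u` and `v`, the scalars `a` and `b`, and the helper names `apply` and `combine` are all illustrative choices, not part of the theory above.

```python
from fractions import Fraction

def apply(matrix, vector):
    """Return the matrix-vector product, i.e. T(vector) for T(x) = matrix * x."""
    return [sum(m * x for m, x in zip(row, vector)) for row in matrix]

def combine(a, u, b, v):
    """Return the linear combination a*u + b*v of two vectors."""
    return [a * ui + b * vi for ui, vi in zip(u, v)]

# Hypothetical example data: a map T from R^3 to R^2 given by a 2 x 3 matrix.
A = [[1, 2, 0],
     [0, -1, 3]]
u, v = [1, 0, 2], [4, -1, 1]
a, b = Fraction(1, 2), Fraction(-3)

left = apply(A, combine(a, u, b, v))             # T(au + bv)
right = combine(a, apply(A, u), b, apply(A, v))  # aT(u) + bT(v)
print(left == right)
```

Exact rational arithmetic is used so that the comparison is not affected by rounding. A single check of this kind illustrates the property but does not prove it.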

When a bijective linear mapping exists between two vector spaces (that is, every vector from the second space is associated with exactly one in the first), we say that the two spaces are isomorphic. Because an isomorphism preserves linear structure, two isomorphic vector spaces are "essentially the same" from the linear algebra point of view. One essential question in linear algebra is whether a mapping is an isomorphism or not; for a mapping between finite-dimensional spaces of the same dimension, this question can be answered by checking whether the determinant is nonzero. If a mapping is not an isomorphism, linear algebra is interested in finding its range (or image) and the set of elements that get mapped to zero, called the kernel of the mapping.

Linear transformations have geometric significance. For example, 2 × 2 real matrices denote standard planar mappings that preserve the origin.

Subspaces, span, and basis

Again, in analogy with theories of other algebraic objects, linear algebra is interested in subsets of vector spaces that are vector spaces themselves; these subsets are called linear subspaces. For instance, the range and kernel of a linear mapping are both subspaces, and are thus often called the range space and the nullspace; these are important examples of subspaces. Another important way of forming a subspace is to take linear combinations of a set of vectors v1, v2, …, vk:

 a_1 v_1 + a_2 v_2 + \cdots + a_k v_k,

where a1, a2, …, ak are scalars. The set of all linear combinations of vectors v1, v2, …, vk is called their span, which forms a subspace.

A linear combination of any system of vectors with all zero coefficients is the zero vector of V. If this is the only way to express the zero vector as a linear combination of v1, v2, …, vk then these vectors are linearly independent. Given a set of vectors that span a space, if any vector w is a linear combination of other vectors (and so the set is not linearly independent), then the span would remain the same if we remove w from the set. Thus, a set of linearly dependent vectors is redundant in the sense that there will be a linearly independent subset that spans the same subspace. Therefore, we are mostly interested in a linearly independent set of vectors that spans a vector space V, which we call a basis of V. Any set of vectors that spans V contains a basis, and any linearly independent set of vectors in V can be extended to a basis.[11] It turns out that if we accept the axiom of choice, every vector space has a basis;[12] nevertheless, this basis may be unnatural, and indeed, may not even be constructible. For instance, there exists a basis for the real numbers considered as a vector space over the rationals, but no explicit basis has been constructed.
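For vectors given by coordinates, linear independence can be tested by row reduction: the vectors are independent exactly when the number of pivots equals the number of vectors. The following is a minimal sketch under that assumption; the function names `rank` and `is_linearly_independent` are illustrative.

```python
from fractions import Fraction

def rank(vectors):
    """Rank of a list of coordinate vectors, found by Gaussian elimination on rows."""
    rows = [[Fraction(x) for x in v] for v in vectors]
    n_cols = len(rows[0]) if rows else 0
    pivots = 0
    for col in range(n_cols):
        # Look for a row at or below the current pivot position with a nonzero entry.
        pivot = next((r for r in range(pivots, len(rows)) if rows[r][col] != 0), None)
        if pivot is None:
            continue
        rows[pivots], rows[pivot] = rows[pivot], rows[pivots]
        for r in range(pivots + 1, len(rows)):
            factor = rows[r][col] / rows[pivots][col]
            rows[r] = [x - factor * p for x, p in zip(rows[r], rows[pivots])]
        pivots += 1
    return pivots

def is_linearly_independent(vectors):
    """Vectors are independent exactly when their rank equals their count."""
    return rank(vectors) == len(vectors)

spanning_set = [[1, 0, 0], [0, 1, 0], [1, 1, 0]]
print(is_linearly_independent(spanning_set))      # the third vector is redundant
print(rank(spanning_set) == rank(spanning_set[:2]))  # removing it keeps the span
```

In this example the third vector is the sum of the first two, so dropping it leaves a linearly independent set with the same span, which is a basis of that two-dimensional subspace.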

Any two bases of a vector space V have the same cardinality, which is called the dimension of V. The dimension of a vector space is well-defined by the dimension theorem for vector spaces. If a basis of V has a finite number of elements, V is called a finite-dimensional vector space. If V is finite-dimensional and U is a subspace of V, then dim U ≤ dim V. If U1 and U2 are subspaces of V, then

\dim(U_1 + U_2) = \dim U_1 + \dim U_2 - \dim(U_1 \cap U_2).[13]

One often restricts consideration to finite-dimensional vector spaces. A fundamental theorem of linear algebra states that all vector spaces of the same dimension are isomorphic,[14] giving an easy way of characterizing isomorphism.

Vectors as n-tuples: matrix theory

Main article: Matrix (mathematics)

A particular basis {v1, v2, …, vn} of V allows one to construct a coordinate system in V: the vector with coordinates (a1, a2, …, an) is the linear combination

 a_1 v_1 + a_2 v_2 + \cdots + a_n v_n. \,

The condition that v1, v2, …, vn span V guarantees that each vector v can be assigned coordinates, whereas the linear independence of v1, v2, …, vn assures that these coordinates are unique (i.e. there is only one linear combination of the basis vectors that is equal to v). In this way, once a basis of a vector space V over F has been chosen, V may be identified with the coordinate n-space Fⁿ. Under this identification, addition and scalar multiplication of vectors in V correspond to addition and scalar multiplication of their coordinate vectors in Fⁿ. Furthermore, if V and W are an n-dimensional and m-dimensional vector space over F, and a basis of V and a basis of W have been fixed, then any linear transformation T: V → W may be encoded by an m × n matrix A with entries in the field F, called the matrix of T with respect to these bases. Two matrices that encode the same linear transformation in different bases are called similar. Matrix theory replaces the study of linear transformations, which were defined axiomatically, by the study of matrices, which are concrete objects. This major technique distinguishes linear algebra from theories of other algebraic structures, which usually cannot be parameterized so concretely.
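As an illustration of encoding a linear transformation by a matrix, consider differentiation on polynomials of degree at most 2, with the basis {1, x, x²}. The sketch below is an assumed example, not one taken from the text; the helper names `differentiate`, `matrix_of` and `apply` are hypothetical. Each column of the matrix holds the coordinates of the image of one basis vector.

```python
def differentiate(coeffs):
    """Derivative of c0 + c1*x + c2*x^2, given and returned as coordinates [c0, c1, c2]."""
    c0, c1, c2 = coeffs
    return [c1, 2 * c2, 0]

def matrix_of(linear_map, dimension):
    """Matrix whose j-th column holds the coordinates of the image of the j-th basis vector."""
    columns = []
    for j in range(dimension):
        basis_vector = [1 if i == j else 0 for i in range(dimension)]
        columns.append(linear_map(basis_vector))
    # Transpose the list of columns into a list of rows.
    return [list(row) for row in zip(*columns)]

def apply(matrix, coords):
    """Matrix-vector product acting on a coordinate vector."""
    return [sum(m * x for m, x in zip(row, coords)) for row in matrix]

D = matrix_of(differentiate, 3)
print(D)                    # matrix of differentiation in the basis {1, x, x^2}
print(apply(D, [3, 5, 4]))  # coordinates of the derivative of 3 + 5x + 4x^2
```

Multiplying the coordinate vector of a polynomial by `D` gives the same result as differentiating the polynomial directly, which is the sense in which the matrix encodes the transformation.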

There is an important distinction between the coordinate n-space Rⁿ and a general finite-dimensional vector space V. While Rⁿ has a standard basis {e1, e2, …, en}, a vector space V typically does not come equipped with such a basis and many different bases exist (although they all consist of the same number of elements equal to the dimension of V).

One major application of the matrix theory is calculation of determinants, a central concept in linear algebra. While determinants could be defined in a basis-free manner, they are usually introduced via a specific representation of the mapping; the value of the determinant does not depend on the specific basis. It turns out that a mapping has an inverse if and only if the determinant has an inverse (every non-zero real or complex number has an inverse[15]). If the determinant is zero, then the nullspace is nontrivial. Determinants have other applications, including a systematic way of seeing if a set of vectors is linearly independent (we write the vectors as the columns of a matrix, and if the determinant of that matrix is zero, the vectors are linearly dependent). Determinants could also be used to solve systems of linear equations (see Cramer's rule), but in real applications, Gaussian elimination is a faster method.
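In practice a determinant is often computed by elimination rather than by expanding a formula: the determinant is the product of the pivots, with a sign change for every row swap. The following is a minimal sketch of that approach; the function name `determinant` and the example matrices are illustrative.

```python
from fractions import Fraction

def determinant(matrix):
    """Determinant of a square matrix via Gaussian elimination with row swaps."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    n = len(rows)
    det = Fraction(1)
    for col in range(n):
        pivot = next((r for r in range(col, n) if rows[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)  # no pivot in this column: the matrix is singular
        if pivot != col:
            rows[col], rows[pivot] = rows[pivot], rows[col]
            det = -det  # each row swap flips the sign of the determinant
        det *= rows[col][col]
        for r in range(col + 1, n):
            factor = rows[r][col] / rows[col][col]
            rows[r] = [x - factor * p for x, p in zip(rows[r], rows[col])]
    return det

print(determinant([[2, 1, -1], [-3, -1, 2], [-2, 1, 2]]))  # nonzero: invertible
print(determinant([[1, 2], [2, 4]]))                       # zero: columns are dependent
```

The second matrix has a column that is twice the other, so its determinant vanishes and its nullspace is nontrivial.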

Eigenvalues and eigenvectors

In general, the action of a linear transformation may be quite complex. Attention to low-dimensional examples gives an indication of the variety of their types. One strategy for a general n-dimensional transformation T is to find "characteristic lines" that are invariant sets under T. If v is a non-zero vector such that Tv is a scalar multiple of v, then the line through 0 and v is an invariant set under T and v is called a characteristic vector or eigenvector. The scalar λ such that Tv = λv is called a characteristic value or eigenvalue of T.

To find an eigenvector or an eigenvalue, we note that

Tv-\lambda v=(T-\lambda \, \text{I})v=0,

where I is the identity matrix. For there to be nontrivial solutions to that equation, we need det(T − λ I) = 0. The determinant is a polynomial in λ, and so the eigenvalues are not guaranteed to exist if the field is R. Thus, we often work with an algebraically closed field such as the complex numbers when dealing with eigenvectors and eigenvalues so that an eigenvalue will always exist. It would be particularly nice if, given a transformation T taking a vector space V into itself, we could find a basis for V consisting of eigenvectors. If such a basis exists, we can easily compute the action of the transformation on any vector: if v1, v2, …, vn are linearly independent eigenvectors of a mapping T of an n-dimensional space with (not necessarily distinct) eigenvalues λ1, λ2, …, λn, and if v = a1v1 + ... + an vn, then,

T(v)=T(a_1 v_1)+\cdots+T(a_n v_n)=a_1 T(v_1)+\cdots+a_n T(v_n)=a_1 \lambda_1 v_1 + \cdots +a_n \lambda_n v_n.

Such a transformation is called diagonalizable, since in the eigenbasis the transformation is represented by a diagonal matrix. Because operations like matrix multiplication, matrix inversion, and determinant calculation are simple on diagonal matrices, computations involving matrices are much simpler if we can bring the matrix to a diagonal form. Not all matrices are diagonalizable (even over an algebraically closed field).
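For a 2 × 2 real matrix the characteristic polynomial is λ² − (trace)λ + (determinant), so real eigenvalues can be found with the quadratic formula. The sketch below assumes this special case only; the function `eigenvalues_2x2`, the helper `apply`, and the example matrices are illustrative choices.

```python
import math

def eigenvalues_2x2(m):
    """Real eigenvalues of a 2 x 2 matrix, from det(M - tI) = t^2 - trace*t + det = 0."""
    (a, b), (c, d) = m
    trace, det = a + d, a * d - b * c
    disc = trace * trace - 4 * det
    if disc < 0:
        return []  # no real eigenvalues; they exist only over the complex numbers
    root = math.sqrt(disc)
    return [(trace - root) / 2, (trace + root) / 2]

def apply(m, v):
    """Matrix-vector product."""
    return [sum(x * y for x, y in zip(row, v)) for row in m]

A = [[2, 1], [1, 2]]
print(eigenvalues_2x2(A))
print(apply(A, [1, -1]), apply(A, [1, 1]))  # images of two eigenvectors of A
print(eigenvalues_2x2([[0, -1], [1, 0]]))   # a rotation by 90 degrees
```

For the matrix `A`, the vectors (1, −1) and (1, 1) are eigenvectors and form a basis of the plane, so `A` is diagonalizable. The rotation matrix has a negative discriminant, which illustrates that eigenvalues need not exist over R.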

Inner-product spaces

Besides these basic concepts, linear algebra also studies vector spaces with additional structure, such as an inner product. The inner product is an example of a bilinear form, and it gives the vector space a geometric structure by allowing for the definition of length and angles. Formally, an inner product is a map

 \langle \cdot, \cdot \rangle : V \times V \rightarrow \mathbf{F}

that satisfies the following three axioms for all vectors u, v, w in V and all scalars a in F:[16][17]

  • Conjugate symmetry:

\langle u,v\rangle =\overline{\langle v,u\rangle}.

Note that over R, this reduces to symmetry.

  • Linearity in the first argument:

\langle au,v\rangle= a \langle u,v\rangle.
\langle u+v,w\rangle= \langle u,w\rangle+ \langle v,w\rangle.

  • Positive-definiteness:

\langle v,v\rangle \geq 0 with equality only for v = 0.

We can define the length of a vector v in V by

\|v\|^2=\langle v,v\rangle,

and we can prove the Cauchy–Schwarz inequality:

|\langle u,v\rangle| \leq \|u\| \cdot \|v\|.

In particular, the quantity

\frac{|\langle u,v\rangle|}{\|u\| \cdot \|v\|} \leq 1,

and so we can call this quantity the cosine of the angle between the two vectors.

Two vectors u and v are orthogonal if \langle u, v\rangle =0. An orthonormal basis is a basis where all basis vectors have length 1 and are orthogonal to each other. Given any finite-dimensional inner product space, an orthonormal basis can be found by the Gram–Schmidt procedure. Orthonormal bases are particularly nice to deal with, since if v = a1 v1 + ... + an vn, then a_i = \langle v,v_i \rangle.
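The Gram–Schmidt procedure can be sketched for Rⁿ with the standard dot product as the inner product. The code below uses the "modified" variant, which subtracts each projection from the running remainder; that choice, the tolerance `tol`, and the names `dot` and `gram_schmidt` are assumptions made for this illustration.

```python
import math

def dot(u, v):
    """Standard inner product on R^n."""
    return sum(x * y for x, y in zip(u, v))

def gram_schmidt(vectors, tol=1e-12):
    """Return an orthonormal list of vectors spanning the same subspace as the input."""
    basis = []
    for v in vectors:
        w = list(v)
        for e in basis:
            coeff = dot(w, e)  # component of the running remainder along e
            w = [wi - coeff * ei for wi, ei in zip(w, e)]
        norm = math.sqrt(dot(w, w))
        if norm > tol:  # skip vectors that depend on the earlier ones
            basis.append([wi / norm for wi in w])
    return basis

basis = gram_schmidt([[1, 1, 0], [1, 0, 1], [0, 1, 1]])
v = [1, 2, 3]
coefficients = [dot(v, e) for e in basis]  # a_i = <v, v_i> in an orthonormal basis
print(coefficients)
```

Because the resulting basis is orthonormal, the coordinates of any vector are obtained from inner products alone, without solving a linear system. The computation uses floating-point numbers, so orthonormality holds only up to rounding error.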

The inner product facilitates the construction of many useful concepts. For instance, given a transform T, we can define its Hermitian conjugate T* as the linear transform satisfying

 \langle T u, v \rangle = \langle u, T^* v\rangle.

If T satisfies TT* = T*T, we call T normal. It turns out that normal matrices are precisely the matrices that have an orthonormal system of eigenvectors that span V.

Some useful theorems

  • A matrix is invertible, or non-singular, if and only if the linear map represented by the matrix is an isomorphism.
  • Any vector space over a field F of dimension n is isomorphic to Fⁿ as a vector space over F.
  • Corollary: Any two vector spaces over F of the same finite dimension are isomorphic to each other.
  • A linear map from a finite-dimensional vector space to itself is an isomorphism if and only if its determinant is nonzero.

Applications

Because of the ubiquity of vector spaces, linear algebra is used in many fields of mathematics, natural sciences, computer science, and social science. Below are just some examples of applications of linear algebra.

Solution of linear systems

Linear algebra provides the formal setting for the linear combination of equations used in the Gaussian method. Suppose the goal is to find and describe the solution(s), if any, of the following system of linear equations:

\begin{alignat}{7}
2x &&\; + \;&& y             &&\; - \;&& z  &&\; = \;&& 8 & \qquad (L_1) \\
-3x &&\; - \;&& y             &&\; + \;&& 2z &&\; = \;&& -11 & \qquad (L_2) \\
-2x &&\; + \;&& y &&\; +\;&& 2z  &&\; = \;&& -3 &  \qquad (L_3)
\end{alignat}

The Gaussian-elimination algorithm is as follows: eliminate x from all equations below L1, and then eliminate y from all equations below L2. This will put the system into triangular form. Then, using back-substitution, each unknown can be solved for.

In the example, x is eliminated from L2 by adding (3/2)L1 to L2. x is then eliminated from L3 by adding L1 to L3. Formally:

L_2 + \tfrac{3}{2}L_1 \rightarrow L_2
L_3 + L_1 \rightarrow L_3

The result is:

\begin{alignat}{7}
2x &&\; + && y &&\; - &&\; z &&\; = \;&& 8 & \\
&& && \frac{1}{2}y &&\; + &&\; \frac{1}{2}z &&\; = \;&& 1 & \\
&& && 2y &&\; + &&\; z &&\; = \;&& 5 & 
\end{alignat}

Now y is eliminated from L3 by adding −4L2 to L3:

L_3 - 4L_2 \rightarrow L_3

The result is:

\begin{alignat}{7}
2x &&\; + && y \;&& - &&\; z \;&& = \;&& 8 & \\
&& && \frac{1}{2}y \;&& + &&\; \frac{1}{2}z \;&& = \;&& 1 & \\
&& && && &&\; -z \;&&\; = \;&& 1 & 
\end{alignat}

This result is a system of linear equations in triangular form, and so the first part of the algorithm is complete.

The last part, back-substitution, consists of solving for the unknowns in reverse order. It can thus be seen that

z = -1 \quad (L_3)

Then, z can be substituted into L2, which can then be solved to obtain

y = 3 \quad (L_2)

Next, z and y can be substituted into L1, which can be solved to obtain

x = 2 \quad (L_1)

The system is solved.
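The elimination and back-substitution steps above can be sketched in code. The following is a minimal illustration for square systems with a unique solution, using exact rational arithmetic; the function name `solve` is hypothetical, and the row swap for a zero pivot is an addition not needed in the worked example.

```python
from fractions import Fraction

def solve(A, b):
    """Solve A x = b for square A by forward elimination and back-substitution."""
    n = len(A)
    # Augmented matrix [A | b] with exact rational entries.
    rows = [[Fraction(x) for x in row] + [Fraction(rhs)] for row, rhs in zip(A, b)]
    for col in range(n):
        pivot = next((r for r in range(col, n) if rows[r][col] != 0), None)
        if pivot is None:
            raise ValueError("matrix is singular; no unique solution")
        rows[col], rows[pivot] = rows[pivot], rows[col]
        # Eliminate the current unknown from all equations below the pivot row.
        for r in range(col + 1, n):
            factor = rows[r][col] / rows[col][col]
            rows[r] = [x - factor * p for x, p in zip(rows[r], rows[col])]
    # Back-substitution: solve for the unknowns in reverse order.
    x = [Fraction(0)] * n
    for i in reversed(range(n)):
        known = sum(rows[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (rows[i][n] - known) / rows[i][i]
    return x

A = [[2, 1, -1], [-3, -1, 2], [-2, 1, 2]]
b = [8, -11, -3]
print(solve(A, b))  # the values of x, y, z for the system L1, L2, L3
```

Applied to the system L1, L2, L3, the routine performs the same row operations as the worked example and returns x = 2, y = 3, z = −1.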

We can, in general, write any system of linear equations as a matrix equation:

Ax=b.

The solution of this system is characterized as follows: first, we find a particular solution x0 of this equation using Gaussian elimination. Then, we compute the solutions of Ax = 0; that is, we find the null space N of A. The solution set of this equation is given by x_0+N=\{x_0+n: n\in N \}. If the number of variables equals the number of equations, then we can characterize when the system has a unique solution: since N is trivial if and only if det A ≠ 0, the equation has a unique solution if and only if det A ≠ 0.[18]

Least-squares best fit line

The least squares method is used to determine the best fit line for a set of data.[19] This line will minimize the sum of the squares of the residuals.
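For a line y = mx + c, minimizing the sum of squared residuals leads to two linear equations in m and c (the normal equations), which have a closed-form solution. The sketch below assumes that standard formulation; the function name `fit_line` and the sample points are illustrative.

```python
from fractions import Fraction

def fit_line(points):
    """Return (slope, intercept) of the least-squares line through the given points."""
    n = len(points)
    sx = sum(Fraction(x) for x, _ in points)
    sy = sum(Fraction(y) for _, y in points)
    sxx = sum(Fraction(x) * x for x, _ in points)
    sxy = sum(Fraction(x) * y for x, y in points)
    denom = n * sxx - sx * sx
    if denom == 0:
        raise ValueError("all x values coincide; the line is not determined")
    # Solution of the 2 x 2 normal equations for slope and intercept.
    slope = (n * sxy - sx * sy) / denom
    intercept = (sy - slope * sx) / n
    return slope, intercept

print(fit_line([(0, 0), (1, 1), (2, 1)]))
```

The three sample points do not lie on a common line, so the fitted line passes through none of them exactly but has the smallest possible sum of squared vertical distances.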

Fourier series expansion

Fourier series are a representation of a function f: [−π, π] → R as a trigonometric series:

f(x)=\frac{a_0}{2} + \sum_{n=1}^\infty \, [a_n \cos(nx) + b_n \sin(nx)].

This series expansion is extremely useful in solving partial differential equations. In this article, we will not be concerned with convergence issues; it is nice to note that all Lipschitz-continuous functions have a converging Fourier series expansion, and nice enough discontinuous functions have a Fourier series that converges to the function value at most points.

The space of all functions that can be represented by a Fourier series forms a vector space (technically speaking, we call functions that have the same Fourier series expansion the "same" function, since two different discontinuous functions might have the same Fourier series). Moreover, this space is also an inner product space with the inner product

\langle f,g \rangle= \frac{1}{\pi} \int_{-\pi}^\pi f(x) g(x) \, dx.

The functions gn(x) = sin(nx) for n > 0 and hn(x) = cos(nx) for n ≥ 0 form an orthogonal basis for the space of Fourier-expandable functions, and each of them has length 1 except the constant function h0, for which ⟨h0, h0⟩ = 2 (this is why the constant term of the series is written a0/2). We can thus use the tools of linear algebra to find the expansion of any function in this space in terms of these basis functions. For instance, to find the coefficient ak, we take the inner product with hk:

\langle f,h_k \rangle=\frac{a_0}{2}\langle h_0,h_k \rangle + \sum_{n=1}^\infty \, [a_n \langle h_n,h_k\rangle + b_n \langle\ g_n,h_k \rangle],

and by orthonormality,  \langle f,h_k\rangle=a_k; that is,

 a_k = \frac{1}{\pi} \int_{-\pi}^\pi f(x) \cos(kx) \, dx.
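The integral for a_k can be approximated numerically. The sketch below uses the midpoint rule with an assumed number of sample points; the function name `fourier_cos_coefficient` and the test function f(x) = x² are illustrative choices. For this f, integration by parts gives a_1 = −4 and a_2 = 1.

```python
import math

def fourier_cos_coefficient(f, k, samples=20000):
    """Approximate a_k = (1/pi) * integral of f(x) cos(kx) over [-pi, pi] (midpoint rule)."""
    width = 2 * math.pi / samples
    total = 0.0
    for i in range(samples):
        x = -math.pi + (i + 0.5) * width  # midpoint of the i-th subinterval
        total += f(x) * math.cos(k * x)
    return total * width / math.pi

def square(x):
    return x * x

for k in range(3):
    print(k, fourier_cos_coefficient(square, k))
```

The computed values are approximations whose accuracy depends on `samples` and on the smoothness of f; they should be close to, but not exactly equal to, the exact coefficients.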

Quantum mechanics

Quantum mechanics draws heavily on notions from linear algebra. In quantum mechanics, the physical state of a particle is represented by a vector, and observables (such as momentum, energy, and angular momentum) are represented by linear operators on the underlying vector space. More concretely, the wave function of a particle describes its physical state and lies in the vector space L² (the functions φ: R³ → C such that \int_{-\infty}^\infty \int_{-\infty}^\infty \int_{-\infty}^{\infty} |\phi|^2 dxdydz is finite), and it evolves according to the Schrödinger equation. Energy is represented as the operator H=-\frac{\hbar^2}{2m} \nabla^2 + V(x,y,z), where V is the potential energy. H is also known as the Hamiltonian operator. The eigenvalues of H represent the possible energies that can be observed. Given a particle in some state φ, we can expand φ into a linear combination of eigenstates of H. The component of φ in each eigenstate determines the probability of measuring the corresponding eigenvalue, and the measurement forces the particle to assume that eigenstate (wave function collapse).

Geometric introduction

Many of the principles and techniques of linear algebra can be seen in the geometry of lines in a real two-dimensional plane E. When formulated using vectors and matrices, the geometry of points and lines in the plane can be extended to the geometry of points and hyperplanes in high-dimensional spaces.

Point coordinates in the plane E are ordered pairs of real numbers, (x,y), and a line is defined as the set of points (x,y) that satisfy the linear equation λ: ax+by + c =0 (where the matrix [a, b, c] is nonzero).[20] Then,

 \lambda: \begin{bmatrix} a & b & c\end{bmatrix} \begin{Bmatrix} x\\ y \\1\end{Bmatrix} = 0,

or

 A\mathbf{x}=0,

where x=(x, y, 1) is the 3 × 1 set of homogeneous coordinates associated with the point (x, y).[21]

Homogeneous coordinates identify the plane E with the z=1 plane in three-dimensional space. The x-y coordinates in E are obtained from homogeneous coordinates y=(y1, y2, y3) by dividing by the third component (if it is nonzero) to obtain y=(y1/y3, y2/y3, 1).

The linear equation, λ, has the important property, that if x1 and x2 are homogeneous coordinates of points on the line, then the point αx1 + βx2 is also on the line, for any real α and β.

Now consider two lines λ1: a1x+b1y + c1 =0 and λ2: a2x+b2y + c2 =0. The intersection of these two lines is defined by x=(x, y, 1) that satisfy the matrix equation,

\lambda_{1,2}: \begin{bmatrix} a_1 & b_1 & c_1\\ a_2 & b_2 & c_2 \end{bmatrix} \begin{Bmatrix} x\\ y \\1\end{Bmatrix} = \begin{Bmatrix}0\\0 \end{Bmatrix},

or using homogeneous coordinates,

 B\mathbf{x}=0.

The point of intersection of these two lines is given by the non-zero solutions of these equations, which are unique up to a scalar factor. In homogeneous coordinates, the solutions are multiples of the following solution:[21]

 x_1 = \begin{vmatrix} b_1 & c_1\\ b_2 & c_2\end{vmatrix}, x_2 = -\begin{vmatrix} a_1 & c_1\\ a_2 & c_2\end{vmatrix}, x_3 = \begin{vmatrix} a_1 & b_1\\ a_2 & b_2\end{vmatrix}

if the rows of B are linearly independent (i.e., λ1 and λ2 represent distinct lines). Divide through by x3 to get Cramer's rule for the solution of a set of two linear equations in two unknowns.[22] Notice that this yields a point in the z=1 plane only when the 2 × 2 submatrix associated with x3 has a non-zero determinant.
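The three determinants translate directly into a short routine for intersecting two lines. The following is a minimal sketch; the function name `intersect` and the representation of a line as a coefficient triple (a, b, c) are assumptions made for the example.

```python
from fractions import Fraction

def intersect(line1, line2):
    """Intersect two lines a*x + b*y + c = 0, each given as a triple (a, b, c)."""
    a1, b1, c1 = line1
    a2, b2, c2 = line2
    # Homogeneous solution (x1, x2, x3) from the three 2 x 2 determinants.
    x1 = b1 * c2 - c1 * b2
    x2 = -(a1 * c2 - c1 * a2)
    x3 = a1 * b2 - b1 * a2
    if x3 == 0:
        return None  # parallel or identical lines: no single point in the plane z = 1
    # Divide through by x3 to return to the plane z = 1.
    return (Fraction(x1) / x3, Fraction(x2) / x3)

print(intersect((1, 1, -2), (1, -1, 0)))  # lines x + y = 2 and x = y
print(intersect((1, 1, -2), (1, 1, 0)))   # parallel lines
```

Returning `None` for a vanishing x3 is a design choice of this sketch; in projective terms the homogeneous solution still exists in that case but lies outside the plane z = 1.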

It is interesting to consider the case of three lines, λ1, λ2 and λ3, which yield the matrix equation,

\lambda_{1,2,3}: \begin{bmatrix} a_1 & b_1 & c_1\\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3\end{bmatrix} \begin{Bmatrix} x\\ y \\1\end{Bmatrix} = \begin{Bmatrix}0\\0 \\0\end{Bmatrix}.

which in homogeneous form yields,

 C\mathbf{x}=0.

Clearly, this equation has the solution x=(0,0,0), which is not a point on the z=1 plane E. For a solution to exist in the plane E, the coefficient matrix C must have rank 2, which means its determinant must be zero. Another way to say this is that the columns of the matrix must be linearly dependent.

Introduction to linear transformations

Another way to approach linear algebra is to consider linear functions on the two-dimensional real plane E=R². Here R denotes the set of real numbers. Let x=(x, y) be an arbitrary vector in E and consider the linear function λ: E → R, given by

 \lambda: \begin{bmatrix}a & b\end{bmatrix}\begin{Bmatrix} x\\y\end{Bmatrix} = c,

or

A\mathbf{x}=c.

This transformation has the important property that if Ay=d, then

 A(\alpha\mathbf{x}+\beta \mathbf{y}) = \alpha A \mathbf{x} + \beta A\mathbf{y} = \alpha c + \beta d.

This shows that a linear combination of vectors in E maps to the same linear combination of their images in R. This is the defining characteristic of a linear map, or linear transformation.[20] For this case, where the image space is the real numbers, the map is called a linear functional.[22]

Consider the linear functional a little more carefully. Let i=(1,0) and j =(0,1) be the natural basis vectors on E, so that x=xi+yj. It is now possible to see that

 A\mathbf{x} = A(x\mathbf{i}+y\mathbf{j})=x A\mathbf{i} + y A\mathbf{j} = \begin{bmatrix}A\mathbf{i} & A\mathbf{j}\end{bmatrix}\begin{Bmatrix} x\\y\end{Bmatrix} = \begin{bmatrix}a & b\end{bmatrix}\begin{Bmatrix} x\\y\end{Bmatrix} = c.

Thus, the columns of the matrix A are the images of the basis vectors of E in R.

This is true for any pair of vectors used to define coordinates in E. Suppose we select a non-orthogonal non-unit vector basis v and w to define coordinates of vectors in E. This means a vector x has coordinates (α,β), such that x = αv + βw. Then, we have the linear functional

 \lambda: A\mathbf{x} = \begin{bmatrix} A\mathbf{v} & A\mathbf{w} \end{bmatrix}\begin{Bmatrix} \alpha \\ \beta \end{Bmatrix}  = \begin{bmatrix} d & e \end{bmatrix}\begin{Bmatrix} \alpha \\ \beta \end{Bmatrix}  =c,

where Av=d and Aw=e are the images of the basis vectors v and w. This is written in matrix form as

 \begin{bmatrix}a & b\end{bmatrix} \begin{bmatrix} v_1 & w_1 \\ v_2 & w_2 \end{bmatrix}  =\begin{bmatrix} d & e \end{bmatrix}.

Coordinates relative to a basis

This leads to the question of how to determine the coordinates of a vector x relative to a general basis v and w in E. Assume that we know the coordinates of the vectors x, v and w in the natural basis i=(1,0) and j =(0,1). Our goal is to find the real numbers α, β such that x = αv + βw, that is

 \begin{Bmatrix} x \\ y \end{Bmatrix} = \begin{bmatrix} v_1 & w_1 \\ v_2 & w_2 \end{bmatrix} \begin{Bmatrix} \alpha \\ \beta\end{Bmatrix}.

To solve this equation for α, β, we compute the linear coordinate functionals σ and τ for the basis v, w, which are given by,[21]

 \sigma = \begin{bmatrix}\sigma_1 &\sigma_2\end{bmatrix}=\frac{1}{v_1 w_2- v_2w_1}\begin{bmatrix} w_2 & - w_1\end{bmatrix},  \tau = \begin{bmatrix}\tau_1 &\tau_2\end{bmatrix}=\frac{1}{v_1 w_2- v_2w_1}\begin{bmatrix}  -v_2  & v_1\end{bmatrix},

The functionals σ and τ compute the components of x along the basis vectors v and w, respectively, that is,

\sigma \mathbf{x}=\alpha, \tau\mathbf{x}=\beta,

which can be written in matrix form as

 \begin{bmatrix} \sigma_1 & \sigma_2 \\ \tau_1 &\tau_2 \end{bmatrix} \begin{Bmatrix} x \\ y \end{Bmatrix} =\begin{Bmatrix} \alpha \\ \beta\end{Bmatrix}.

These coordinate functionals have the properties,

 \sigma\mathbf{v}=1, \sigma\mathbf{w}=0, \tau\mathbf{w}=1, \tau\mathbf{v}=0.

These equations can be assembled into the single matrix equation,

 \begin{bmatrix} \sigma_1 & \sigma_2 \\ \tau_1 &\tau_2 \end{bmatrix} \begin{bmatrix} v_1 & w_1 \\ v_2 &w_2 \end{bmatrix} = \begin{bmatrix} 1& 0\\0 & 1\end{bmatrix}.

Thus, the matrix formed by the coordinate linear functionals is the inverse of the matrix formed by the basis vectors.[20][22]
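The coordinate functionals can be computed directly from the formulas for σ and τ. The following is a minimal sketch; the function names `coordinate_functionals` and `coordinates` and the example basis are illustrative.

```python
from fractions import Fraction

def coordinate_functionals(v, w):
    """Return the row vectors sigma and tau for the basis v, w of the plane."""
    det = Fraction(v[0] * w[1] - v[1] * w[0])
    if det == 0:
        raise ValueError("v and w are linearly dependent and do not form a basis")
    sigma = [w[1] / det, -w[0] / det]
    tau = [-v[1] / det, v[0] / det]
    return sigma, tau

def coordinates(x, v, w):
    """Return (alpha, beta) such that x = alpha*v + beta*w."""
    sigma, tau = coordinate_functionals(v, w)
    alpha = sigma[0] * x[0] + sigma[1] * x[1]
    beta = tau[0] * x[0] + tau[1] * x[1]
    return alpha, beta

v, w = (2, 1), (1, 3)  # a non-orthogonal, non-unit basis chosen for illustration
print(coordinate_functionals(v, w))
print(coordinates((4, 7), v, w))
```

For this basis, the vector (4, 7) equals 1·v + 2·w, and applying σ and τ to v and w themselves reproduces the identity matrix, in line with the matrix equation above.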

Inverse image

The set of points in the plane E that map to the same image in R under the linear functional λ defines a line in E. This line is the image of the inverse map, λ⁻¹: R → E. This inverse image is the set of the points x=(x, y) that solve the equation,

 A\mathbf{x}=\begin{bmatrix}a & b\end{bmatrix}\begin{Bmatrix} x\\y\end{Bmatrix} = c.

Notice that a linear functional operates on known values for x=(x, y) to compute a value c in R, while the inverse image seeks the values for x=(x, y) that yield a specific value c.

In order to solve the equation, we first recognize that only one of the two unknowns (x,y) can be determined, so we select y to be determined (assuming b ≠ 0), and rearrange the equation

 by = c - ax.

Solve for y and obtain the inverse image as the set of points,

 \mathbf{x}(t) = \begin{Bmatrix} 0\\ c/b\end{Bmatrix} + t\begin{Bmatrix} 1\\ -a/b\end{Bmatrix}=\mathbf{p} + t\mathbf{h}  .

For convenience the free parameter x has been relabeled t.

The vector p defines the intersection of the line with the y-axis, known as the y-intercept. The vector h satisfies the homogeneous equation,

A\mathbf{h}= \begin{bmatrix}a & b\end{bmatrix} \begin{Bmatrix} 1\\ -a/b\end{Bmatrix}= 0.

Notice that if h is a solution to this homogeneous equation, then t h is also a solution.

The set of points of a linear functional that map to zero define the kernel of the linear functional. The line can be considered to be the set of points h in the kernel translated by the vector p.[20][22]
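The parametrization of the inverse image can be checked with a short computation. The sketch below assumes b ≠ 0, as in the derivation above; the function names `inverse_image` and `point_on_line` and the sample coefficients are illustrative.

```python
from fractions import Fraction

def inverse_image(a, b, c):
    """Return (p, h) so that the line a*x + b*y = c is the set of points p + t*h."""
    if b == 0:
        raise ValueError("this parametrization solves for y and needs b != 0")
    p = (Fraction(0), Fraction(c) / b)   # the y-intercept
    h = (Fraction(1), -Fraction(a) / b)  # a solution of the homogeneous equation
    return p, h

def point_on_line(p, h, t):
    """Return the point p + t*h."""
    return (p[0] + t * h[0], p[1] + t * h[1])

a, b, c = 2, 4, 8
p, h = inverse_image(a, b, c)
for t in (-1, 0, 3):
    x, y = point_on_line(p, h, t)
    print(t, a * x + b * y)  # every point on the line maps to the same value c
```

The vector `h` lies in the kernel of the functional, so moving along it does not change the image, while `p` selects which value c the line maps to.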

Generalizations and related topics

Since linear algebra is a successful theory, its methods have been developed and generalized in other parts of mathematics. In module theory, one replaces the field of scalars by a ring. The concepts of linear independence, span, basis, and dimension (which is called rank in module theory) still make sense. Nevertheless, many theorems from linear algebra become false in module theory. For instance, not all modules have a basis (those that do are called free modules), the rank of a free module is not necessarily unique, not every linearly independent subset of a module can be extended to form a basis, and not every subset of a module that spans the space contains a basis.

In multilinear algebra, one considers multivariable linear transformations, that is, mappings that are linear in each of a number of different variables. This line of inquiry naturally leads to the idea of the dual space, the vector space V∗ consisting of linear maps f: V → F where F is the field of scalars. Multilinear maps T: V^n → F can be described via tensor products of elements of V∗.
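
As a minimal sketch of these ideas, the following Python code represents elements of the dual space of F^2 as functions and forms the tensor product of two of them, which is a bilinear map. The helper names functional and tensor and the sample coefficients are assumptions chosen for illustration.

```python
def functional(coeffs):
    """Return the linear functional v -> sum(c_i * v_i), an element of the dual space."""
    def f(v):
        return sum(c * x for c, x in zip(coeffs, v))
    return f

def tensor(f, g):
    """Return the bilinear map (u, v) -> f(u) * g(v), the tensor product of f and g."""
    def T(u, v):
        return f(u) * g(v)
    return T

# Illustrative functionals on F^2 (assumed values).
f = functional([1, 2])
g = functional([3, -1])
T = tensor(f, g)

u1, u2, v = (1, 0), (0, 1), (2, 5)
u_sum = (u1[0] + u2[0], u1[1] + u2[1])
```

Linearity in the first argument can then be observed by comparing T(u_sum, v) with T(u1, v) + T(u2, v); the same holds for the second argument.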

If, in addition to vector addition and scalar multiplication, there is a bilinear vector product V × V → V, the vector space is called an algebra; for instance, associative algebras are algebras with an associative vector product (like the algebra of square matrices, or the algebra of polynomials).
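
For the algebra of square matrices, bilinearity and associativity of the product can be spot-checked on examples. The sketch below uses plain Python lists for 2 × 2 integer matrices; the helper names and the three sample matrices are illustrative assumptions, and a check on particular matrices is of course not a proof.

```python
def matmul(A, B):
    """Matrix product of A and B, both given as lists of rows."""
    return [
        [sum(A[i][k] * B[k][j] for k in range(len(B))) for j in range(len(B[0]))]
        for i in range(len(A))
    ]

def add(A, B):
    """Entrywise sum of two matrices of the same shape."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(A, B)]

def scale(s, A):
    """Scalar multiple s*A."""
    return [[s * x for x in row] for row in A]

# Sample matrices (assumed values).
A = [[1, 2], [3, 4]]
B = [[0, 1], [1, 0]]
C = [[2, 0], [1, 3]]

associative = matmul(matmul(A, B), C) == matmul(A, matmul(B, C))
additive = matmul(add(A, B), C) == add(matmul(A, C), matmul(B, C))
homogeneous = matmul(scale(3, A), B) == scale(3, matmul(A, B))
commutative = matmul(A, B) == matmul(B, A)
```

For these matrices the product is associative and bilinear but not commutative, which is consistent with the definition: an algebra is not required to have a commutative product.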

Functional analysis mixes the methods of linear algebra with those of mathematical analysis and studies various function spaces, such as Lp spaces.

Representation theory studies the actions of algebraic objects on vector spaces by representing these objects as matrices. It is interested in all the ways that this is possible, and it does so by finding subspaces invariant under all transformations of the algebra. The concept of eigenvalues and eigenvectors is especially important.
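
A very small example, given here as a sketch under stated assumptions rather than a general method: the two-element group {e, s} can be represented on R^2 by letting s act as the matrix that swaps the two coordinates. The names below are hypothetical. The lines spanned by (1, 1) and (1, −1) are invariant under the action, and they are exactly the eigenvector directions of the swap matrix, with eigenvalues 1 and −1.

```python
def apply(M, v):
    """Apply the matrix M (a list of rows) to the vector v."""
    return tuple(sum(m * x for m, x in zip(row, v)) for row in M)

# Assumed example: s is represented by the coordinate swap.
SWAP = [[0, 1], [1, 0]]

sym = (1, 1)     # spans an invariant line, eigenvalue +1
anti = (1, -1)   # spans an invariant line, eigenvalue -1

swap_sym = apply(SWAP, sym)
swap_anti = apply(SWAP, anti)

# s composed with itself is the identity element e, so applying SWAP twice
# should return the original vector.
twice = apply(SWAP, apply(SWAP, (3, 7)))
```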

Algebraic geometry considers the solutions of systems of polynomial equations.

Several topics in computer programming also make use of techniques and theorems from linear algebra.

See also

Notes

  1. ^ Strang, Gilbert (July 19, 2005), Linear Algebra and Its Applications (4th ed.), Brooks Cole, ISBN 978-0-03-010567-8 
  2. ^ Weisstein, Eric. "Linear Algebra". From MathWorld--A Wolfram Web Resource. Wolfram. Retrieved 16 April 2012. 
  3. ^ a b c d Vitulli, Marie. "A Brief History of Linear Algebra and Matrix Theory". Department of Mathematics. University of Oregon. Archived from the original on 2012-09-10. Retrieved 2014-07-08. 
  4. ^ http://www.journals.istanbul.edu.tr/tr/index.php/oba/article/download/9103/8452
  5. ^ http://archive.org/details/linearalgebra00tevfgoog
  6. ^ Tucker, Alan (1993). "The Growing Importance of Linear Algebra in Undergraduate Mathematics". College Mathematics Journal 24 (1): 3–9. doi:10.2307/2686426. 
  7. ^ Goodlad, John I.; von Stoephasius, Renata; Klein, M. Frances (1966). "The changing school curriculum". U.S. Department of Health, Education, and Welfare: Office of Education. Retrieved 9 July 2014. 
  8. ^ Dorier, Jean-Luc; Robert, Aline; Robinet, Jacqueline; Rogalski, Marc (2000). Dorier, Jean-Luc, ed. The Obstacle of Formalism in Linear Algebra. Springer. pp. 85–124. ISBN 978-0-7923-6539-6. Retrieved 9 July 2014. 
  9. ^ Carlson, David; Johnson, Charles R.; Lay, David C.; Porter, A. Duane (1993). "The Linear Algebra Curriculum Study Group Recommendations for the First Course in Linear Algebra". The College Mathematics Journal 24 (1): 41–46. doi:10.2307/2686430. 
  10. ^ Roman 2005, ch. 1, p. 27
  11. ^ Axler (2004), pp. 28–29
  12. ^ The existence of a basis is straightforward for countably generated vector spaces, and for well-ordered vector spaces, but in full generality it is logically equivalent to the axiom of choice.
  13. ^ Axler (2004), p. 33
  14. ^ Axler (2004), p. 55
  15. ^ If we restrict to integers, then only 1 and -1 have an inverse. Consequently, the inverse of an integer matrix is an integer matrix if and only if the determinant is 1 or -1.
  16. ^ P. K. Jain, Khalil Ahmad (1995). "5.1 Definitions and basic properties of inner product spaces and Hilbert spaces". Functional analysis (2nd ed.). New Age International. p. 203. ISBN 81-224-0801-X. 
  17. ^ Eduard Prugovec̆ki (1981). "Definition 2.1". Quantum mechanics in Hilbert space (2nd ed.). Academic Press. pp. 18 ff. ISBN 0-12-566060-X. 
  18. ^ Gunawardena, Jeremy. "Matrix algebra for beginners, Part I". Harvard Medical School. Retrieved 2 May 2012. 
  19. ^ Miller, Steven. "The Method of Least Squares". Brown University. Retrieved 1 May 2013. 
  20. ^ a b c d Strang, Gilbert (July 19, 2005), Linear Algebra and Its Applications (4th ed.), Brooks Cole, ISBN 978-0-03-010567-8 
  21. ^ a b c J. G. Semple and G. T. Kneebone, Algebraic Projective Geometry, Clarendon Press, London, 1952.
  22. ^ a b c d E. D. Nering, Linear Algebra and Matrix Theory, John-Wiley, New York, NY, 1963
  1. ^ This axiom is not asserting the associativity of an operation, since there are two operations in question, scalar multiplication: bv; and field multiplication: ab.

Further reading

History
  • Fearnley-Sander, Desmond, "Hermann Grassmann and the Creation of Linear Algebra", American Mathematical Monthly 86 (1979), pp. 809–817.
  • Grassmann, Hermann, Die lineale Ausdehnungslehre ein neuer Zweig der Mathematik: dargestellt und durch Anwendungen auf die übrigen Zweige der Mathematik, wie auch auf die Statik, Mechanik, die Lehre vom Magnetismus und die Krystallonomie erläutert, O. Wigand, Leipzig, 1844.
Introductory textbooks
  • Bretscher, Otto (June 28, 2004), Linear Algebra with Applications (3rd ed.), Prentice Hall, ISBN 978-0-13-145334-0 
  • Farin, Gerald; Hansford, Dianne (December 15, 2004), Practical Linear Algebra: A Geometry Toolbox, AK Peters, ISBN 978-1-56881-234-2 
  • Friedberg, Stephen H.; Insel, Arnold J.; Spence, Lawrence E. (November 11, 2002), Linear Algebra (4th ed.), Prentice Hall, ISBN 978-0-13-008451-4 
  • Hefferon, Jim (2008), Linear Algebra 
  • Anton, Howard (2005), Elementary Linear Algebra (Applications Version) (9th ed.), Wiley International 
  • Lay, David C. (August 22, 2005), Linear Algebra and Its Applications (3rd ed.), Addison Wesley, ISBN 978-0-321-28713-7 
  • Kolman, Bernard; Hill, David R. (May 3, 2007), Elementary Linear Algebra with Applications (9th ed.), Prentice Hall, ISBN 978-0-13-229654-0 
  • Leon, Steven J. (2006), Linear Algebra With Applications (7th ed.), Pearson Prentice Hall, ISBN 978-0-13-185785-8 
  • Poole, David (2010), Linear Algebra: A Modern Introduction (3rd ed.), Cengage – Brooks/Cole, ISBN 978-0-538-73545-2 
  • Ricardo, Henry (2010), A Modern Introduction To Linear Algebra (1st ed.), CRC Press, ISBN 978-1-4398-0040-9 
  • Sadun, Lorenzo (2008), Applied Linear Algebra: the decoupling principle (2nd ed.), AMS, ISBN 978-0-8218-4441-0 
  • Strang, Gilbert (July 19, 2005), Linear Algebra and Its Applications (4th ed.), Brooks Cole, ISBN 978-0-03-010567-8 
Advanced textbooks
Study guides and outlines
  • Leduc, Steven A. (May 1, 1996), Linear Algebra (Cliffs Quick Review), Cliffs Notes, ISBN 978-0-8220-5331-6 
  • Lipschutz, Seymour; Lipson, Marc (December 6, 2000), Schaum's Outline of Linear Algebra (3rd ed.), McGraw-Hill, ISBN 978-0-07-136200-9 
  • Lipschutz, Seymour (January 1, 1989), 3,000 Solved Problems in Linear Algebra, McGraw–Hill, ISBN 978-0-07-038023-3 
  • McMahon, David (October 28, 2005), Linear Algebra Demystified, McGraw–Hill Professional, ISBN 978-0-07-146579-3 
  • Zhang, Fuzhen (April 7, 2009), Linear Algebra: Challenging Problems for Students, The Johns Hopkins University Press, ISBN 978-0-8018-9125-0 

External links

Online books