Cramer's rule

In linear algebra, Cramer's rule is an explicit formula for the solution of a system of linear equations with as many equations as unknowns, valid whenever the system has a unique solution. It expresses the solution in terms of the determinants of the (square) coefficient matrix and of matrices obtained from it by replacing one column by the vector of right hand sides of the equations. It is named after Gabriel Cramer (1704–1752), who published the rule for an arbitrary number of unknowns in 1750,^[1] although Colin Maclaurin also published special cases of the rule in 1748^[2] (and possibly knew of it as early as 1729).^[3]^[4]^[5]

Cramer's rule is computationally very inefficient for systems of more than two or thee equations;^[6] its asymptotic complexity is O(n·n!) compared to elmination methods that have polynomial time complexity.^[7]^[8] Cramer's rule is also numerically unstable even for 2×2 systems.^[9]

General case

Consider a system of $n$ linear equations for $n$ unknowns, represented in matrix multiplication form as follows:

Ax=b

where the $n \times n$ matrix $A$ has a nonzero determinant, and the vector $x=(x_{1},\ldots ,x_{n})^{\mathrm {T} }$ is the column vector of the variables. Then the theorem states that in this case the system has a unique solution, whose individual values for the unknowns are given by:

x_{i}={\frac {\det(A_{i})}{\det(A)}}\qquad i=1,\ldots ,n

where $A_{i}$ is the matrix formed by replacing the $i$ -th column of $A$ by the column vector $b$ .

The rule holds for systems of equations with coefficients and unknowns in any field, not just in the real numbers. It has recently been shown that Cramer's rule can be implemented in O(n³) time,^[10] which is comparable to more common methods of solving systems of linear equations, such as Gaussian elimination (consistently requiring 2.5 times as many arithmetic operations for all matrix sizes, while exhibiting comparable numeric stability in most cases).

Proof

The proof for Cramer's rule uses just two properties of determinants: linearity with respect to any given column (taking for that column a linear combination of column vectors produces as determinant the corresponding linear combination of their determinants), and the fact that the determinant is zero whenever two columns are equal (which is implied by the basic property that the determinant is alternating in the columns).

Fix the index j of a column. Linearity means that if we consider only column j as variable (fixing the others arbitrarily), the resulting function $R n \to R$ (assuming matrix entries are in $R$ ) can be given by a matrix, with one row and n columns, that acts on column j. In fact this is precisely what Laplace expansion does, writing $det(A) = C 1 a 1, j + ... + C n a n,j$ for certain coefficients C₁, ..., C_n that depend on the columns of $A$ other than column j (the precise expression for these cofactors is not important here). The value $det(A)$ is then the result of applying the one-line matrix $L (j) = (C 1 C 2 ... C n)$ to column j of $A$ . If $L (j)$ is applied to any other column k of $A$ , then the result is the determinant of the matrix obtained from $A$ by replacing column j by a copy of column k, so the resulting determinant is 0 (the case of two equal columns).

Now consider a system of $n$ linear equations in $n$ unknowns $x_{1},\ldots ,x_{n}$ , whose coefficient matrix is $A$ , with det(A) assumed to be nonzero:

{\begin{matrix}a_{11}x_{1}+a_{12}x_{2}+\cdots +a_{1n}x_{n}&=&b_{1}\\a_{21}x_{1}+a_{22}x_{2}+\cdots +a_{2n}x_{n}&=&b_{2}\\\vdots &\vdots &\vdots \\a_{n1}x_{1}+a_{n2}x_{2}+\cdots +a_{nn}x_{n}&=&b_{n}.\end{matrix}}

If one combines these equations by taking C₁ times the first equation, plus C₂ times the second, and so forth until C_n times the last, then the coefficient of $x j$ will become $C 1 a 1, j + ... + C n a n,j = det(A)$ , while the coefficients of all other unknowns become 0; the left hand side becomes simply det(A)x_j. The right hand side is $C 1 b 1 + ... + C n b n$ , which is $L (j)$ applied to the column vector b of the right hand sides $b i$ . In fact what has been done here is multiply the matrix equation $A x = b$ on the left by $L (j)$ . Dividing by the nonzero number det(A) one finds the following equation, necessary to satisfy the system:

x_{j}={\frac {L_{(j)}\cdot \mathbf {b} }{\det(A)}}.

But by construction the numerator is the determinant of the matrix obtained from $A$ by replacing column j by b, so we get the expression of Cramer's rule as a necessary condition for a solution. The same procedure can be repeated for other values of j to find values for the other unknowns.

The only point that remains to prove is that these values for the unknowns, the only possible ones, do indeed together form a solution. But if the matrix $A$ is invertible with inverse $A -1$ , then $x = A -1 b$ will be a solution, thus showing its existence. To see that $A$ is invertible when det(A) is nonzero, consider the $n \times n$ matrix M obtained by stacking the one-line matrices $L (j)$ on top of each other for j = 1, ..., n (this gives the adjugate matrix for $A$ ). It was shown that $L (j) A = (0 ... 0 det(A) 0 ... 0)$ where $det(A)$ appears at the position j; from this it follows that $MA = det(A) I n$ . Therefore

{\frac {1}{\det(A)}}M=A^{-1},

completing the proof.

For other proofs, see below.

Finding inverse matrix

Let $A$ be an $n \times n$ matrix. Then

\mathrm {Adj} (A)A=\mathrm {det} (A)I

where Adj(A) denotes the adjugate matrix of $A$ , $det(A)$ is the determinant, and I is the identity matrix. If det(A) is invertible in R, then the inverse matrix of $A$ is

A^{-1}={\frac {1}{\operatorname {det} (A)}}\operatorname {Adj} (A).

If R is a field (such as the field of real numbers), then this gives a formula for the inverse of $A$ , provided $det(A) \neq 0$ . In fact, this formula will work whenever R is a commutative ring, provided that det(A) is a unit. If det(A) is not a unit, then $A$ is not invertible.

Applications

Explicit formulas for small systems

Consider the linear system

\left\{{\begin{matrix}a_{1}x+b_{1}y&={\color {red}c_{1}}\\a_{2}x+b_{2}y&={\color {red}c_{2}}\end{matrix}}\right.

which in matrix format is

{\begin{bmatrix}a_{1}&b_{1}\\a_{2}&b_{2}\end{bmatrix}}{\begin{bmatrix}x\\y\end{bmatrix}}={\begin{bmatrix}{\color {red}c_{1}}\\{\color {red}c_{2}}\end{bmatrix}}.

Assume $a 1 b 2 - b 1 a 2$ nonzero. Then, with help of determinants $x$ and $y$ can be found with Cramer's rule as

{\begin{aligned}x&={\begin{vmatrix}{\color {red}{c_{1}}}&b_{1}\\{\color {red}{c_{2}}}&b_{2}\end{vmatrix}}{\Big /}{\begin{vmatrix}a_{1}&b_{1}\\a_{2}&b_{2}\end{vmatrix}}={{\color {red}c_{1}}b_{2}-b_{1}{\color {red}c_{2}} \over a_{1}b_{2}-b_{1}a_{2}}\\y&={\begin{vmatrix}a_{1}&{\color {red}{c_{1}}}\\a_{2}&{\color {red}{c_{2}}}\end{vmatrix}}{\Big /}{\begin{vmatrix}a_{1}&b_{1}\\a_{2}&b_{2}\end{vmatrix}}={a_{1}{\color {red}c_{2}}-{\color {red}c_{1}}a_{2} \over a_{1}b_{2}-b_{1}a_{2}}\end{aligned}}

The rules for $3 \times 3$ matrices are similar. Given

\left\{{\begin{matrix}a_{1}x+b_{1}y+c_{1}z&={\color {red}d_{1}}\\a_{2}x+b_{2}y+c_{2}z&={\color {red}d_{2}}\\a_{3}x+b_{3}y+c_{3}z&={\color {red}d_{3}}\end{matrix}}\right.

which in matrix format is

${\begin{bmatrix}a_{1}&b_{1}&c_{1}\\a_{2}&b_{2}&c_{2}\\a_{3}&b_{3}&c_{3}\end{bmatrix}}{\begin{bmatrix}x\\y\\z\end{bmatrix}}={\begin{bmatrix}{\color {red}d_{1}}\\{\color {red}d_{2}}\\{\color {red}d_{3}}\end{bmatrix}}.$

Then the values of $x, y$ and $z$ can be found as follows:

x={\frac {\begin{vmatrix}{\color {red}d_{1}}&b_{1}&c_{1}\\{\color {red}d_{2}}&b_{2}&c_{2}\\{\color {red}d_{3}}&b_{3}&c_{3}\end{vmatrix}}{\begin{vmatrix}a_{1}&b_{1}&c_{1}\\a_{2}&b_{2}&c_{2}\\a_{3}&b_{3}&c_{3}\end{vmatrix}}},\quad y={\frac {\begin{vmatrix}a_{1}&{\color {red}d_{1}}&c_{1}\\a_{2}&{\color {red}d_{2}}&c_{2}\\a_{3}&{\color {red}d_{3}}&c_{3}\end{vmatrix}}{\begin{vmatrix}a_{1}&b_{1}&c_{1}\\a_{2}&b_{2}&c_{2}\\a_{3}&b_{3}&c_{3}\end{vmatrix}}},{\text{ and }}z={\frac {\begin{vmatrix}a_{1}&b_{1}&{\color {red}d_{1}}\\a_{2}&b_{2}&{\color {red}d_{2}}\\a_{3}&b_{3}&{\color {red}d_{3}}\end{vmatrix}}{\begin{vmatrix}a_{1}&b_{1}&c_{1}\\a_{2}&b_{2}&c_{2}\\a_{3}&b_{3}&c_{3}\end{vmatrix}}}.

Differential geometry

Cramer's rule is also extremely useful for solving problems in differential geometry. Consider the two equations $F(x,y,u,v)=0$ and $G(x,y,u,v)=0$ . When u and v are independent variables, we can define $x=X(u,v)$ and $y=Y(u,v).$

Finding an equation for ${\dfrac {\partial x}{\partial u}}$ is a trivial application of Cramer's rule.

First, calculate the first derivatives of F, G, x, and y:

{\begin{aligned}dF&={\frac {\partial F}{\partial x}}dx+{\frac {\partial F}{\partial y}}dy+{\frac {\partial F}{\partial u}}du+{\frac {\partial F}{\partial v}}dv=0\\[6pt]dG&={\frac {\partial G}{\partial x}}dx+{\frac {\partial G}{\partial y}}dy+{\frac {\partial G}{\partial u}}du+{\frac {\partial G}{\partial v}}dv=0\\[6pt]dx&={\frac {\partial X}{\partial u}}du+{\frac {\partial X}{\partial v}}dv\\[6pt]dy&={\frac {\partial Y}{\partial u}}du+{\frac {\partial Y}{\partial v}}dv.\end{aligned}}

Substituting dx, dy into dF and dG, we have:

{\begin{aligned}dF&=\left({\frac {\partial F}{\partial x}}{\frac {\partial x}{\partial u}}+{\frac {\partial F}{\partial y}}{\frac {\partial y}{\partial u}}+{\frac {\partial F}{\partial u}}\right)du+\left({\frac {\partial F}{\partial x}}{\frac {\partial x}{\partial v}}+{\frac {\partial F}{\partial y}}{\frac {\partial y}{\partial v}}+{\frac {\partial F}{\partial v}}\right)dv=0\\[6pt]dG&=\left({\frac {\partial G}{\partial x}}{\frac {\partial x}{\partial u}}+{\frac {\partial G}{\partial y}}{\frac {\partial y}{\partial u}}+{\frac {\partial G}{\partial u}}\right)du+\left({\frac {\partial G}{\partial x}}{\frac {\partial x}{\partial v}}+{\frac {\partial G}{\partial y}}{\frac {\partial y}{\partial v}}+{\frac {\partial G}{\partial v}}\right)dv=0.\end{aligned}}

Since u, v are both independent, the coefficients of du, dv must be zero. So we can write out equations for the coefficients:

{\begin{aligned}{\frac {\partial F}{\partial x}}{\frac {\partial x}{\partial u}}+{\frac {\partial F}{\partial y}}{\frac {\partial y}{\partial u}}&=-{\frac {\partial F}{\partial u}}\\[6pt]{\frac {\partial G}{\partial x}}{\frac {\partial x}{\partial u}}+{\frac {\partial G}{\partial y}}{\frac {\partial y}{\partial u}}&=-{\frac {\partial G}{\partial u}}\\[6pt]{\frac {\partial F}{\partial x}}{\frac {\partial x}{\partial v}}+{\frac {\partial F}{\partial y}}{\frac {\partial y}{\partial v}}&=-{\frac {\partial F}{\partial v}}\\[6pt]{\frac {\partial G}{\partial x}}{\frac {\partial x}{\partial v}}+{\frac {\partial G}{\partial y}}{\frac {\partial y}{\partial v}}&=-{\frac {\partial G}{\partial v}}.\end{aligned}}

Now, by Cramer's rule, we see that:

{\frac {\partial x}{\partial u}}={\frac {\begin{vmatrix}-{\frac {\partial F}{\partial u}}&{\frac {\partial F}{\partial y}}\\-{\frac {\partial G}{\partial u}}&{\frac {\partial G}{\partial y}}\end{vmatrix}}{\begin{vmatrix}{\frac {\partial F}{\partial x}}&{\frac {\partial F}{\partial y}}\\{\frac {\partial G}{\partial x}}&{\frac {\partial G}{\partial y}}\end{vmatrix}}}.

This is now a formula in terms of two Jacobians:

{\frac {\partial x}{\partial u}}=-{\frac {\left({\frac {\partial (F,G)}{\partial (u,y)}}\right)}{\left({\frac {\partial (F,G)}{\partial (x,y)}}\right)}}.

Similar formulas can be derived for ${\frac {\partial x}{\partial v}},{\frac {\partial y}{\partial u}},{\frac {\partial y}{\partial v}}.$

Integer programming

Cramer's rule can be used to prove that an integer programming problem whose constraint matrix is totally unimodular and whose right-hand side is integer, has integer basic solutions. This makes the integer program substantially easier to solve.

Ordinary differential equations

Cramer's rule is used to derive the general solution to an inhomogeneous linear differential equation by the method of variation of parameters.

Geometric interpretation

Cramer's rule has a geometric interpretation that can be considered also a proof or simply giving insight about its geometric nature. These geometric arguments work in general and not only in the case of two equations with two unknowns presented here.

Given the system of equations

{\begin{matrix}a_{11}x_{1}+a_{12}x_{2}&=b_{1}\\a_{21}x_{1}+a_{22}x_{2}&=b_{2}\end{matrix}}

it can be considered as an equation between vectors

x_{1}{\binom {a_{11}}{a_{21}}}+x_{2}{\binom {a_{12}}{a_{22}}}={\binom {b_{1}}{b_{2}}}.

The area of the parallelogram determined by ${\binom {a_{11}}{a_{21}}}$ and ${\binom {a_{12}}{a_{22}}}$ is given by the determinant of the system of equations:

{\begin{vmatrix}a_{11}&a_{12}\\a_{21}&a_{22}\end{vmatrix}}.

In general, when there are more variables and equations, the determinant of $n$ vectors of length $n$ will give the volume of the parallelepiped determined by those vectors in the $n$ -th dimensional Euclidean space.

Therefore the area of the parallelogram determined by $x_{1}{\binom {a_{11}}{a_{21}}}$ and ${\binom {a_{12}}{a_{22}}}$ has to be $x_{1}$ times the area of the first one since one of the sides has been multiplied by this factor. Now, this last parallelogram, by Cavalieri's principle, has the same area as the parallelogram determined by ${\binom {b_{1}}{b_{2}}}=x_{1}{\binom {a_{11}}{a_{21}}}+x_{2}{\binom {a_{12}}{a_{22}}}$ and ${\binom {a_{12}}{a_{22}}}$ .

Equating the areas of this last and the second parallelogram gives the equation

{\begin{vmatrix}b_{1}&a_{12}\\b_{2}&a_{22}\end{vmatrix}}={\begin{vmatrix}a_{11}x_{1}&a_{12}\\a_{21}x_{1}&a_{22}\end{vmatrix}}=x_{1}{\begin{vmatrix}a_{11}&a_{12}\\a_{21}&a_{22}\end{vmatrix}}

from which Cramer's rule follows.

Other proofs

A short proof

A short proof of Cramer's rule ^[11] can be given by noticing that $x_{1}$ is the determinant of the matrix

X_{1}={\begin{bmatrix}x_{1}&0&0&\dots &0\\x_{2}&1&0&\dots &0\\x_{3}&0&1&\dots &0\\\vdots &\vdots &\vdots &\ddots &\vdots \\x_{n}&0&0&\dots &1\end{bmatrix}}

On the other hand, assuming that our original matrix $A$ is invertible, this matrix $X_{1}$ has columns $A^{-1}b,A^{-1}v_{2},\ldots ,A^{-1}v_{n}$ , where $v_{k}$ is the $k$ -th column of the matrix $A$ . Recall that the matrix $A_{1}$ has columns $b,v_{2},\ldots ,v_{n}$ . Hence we have

x_{1}=\det(X_{1})=\det(A^{-1})\det(A_{1})={\frac {\det(A_{1})}{\det(A)}}.

The proof for other $x_{j}$ is similar.

Proof using Clifford algebra

Consider the system of three scalar equations in three unknown scalars $x_{1},x_{2},x_{3}$

{\begin{aligned}a_{11}x_{1}+a_{12}x_{2}+a_{13}x_{3}&=c_{1}\\a_{21}x_{1}+a_{22}x_{2}+a_{23}x_{3}&=c_{2}\\a_{31}x_{1}+a_{32}x_{2}+a_{33}x_{3}&=c_{3}\end{aligned}}

and assign an orthonormal vector basis $\mathbf {e} _{1},\mathbf {e} _{2},\mathbf {e} _{3}$ for ${\mathcal {G}}_{3}$ as

{\begin{aligned}a_{11}\mathbf {e} _{1}x_{1}+a_{12}\mathbf {e} _{1}x_{2}+a_{13}\mathbf {e} _{1}x_{3}&=c_{1}\mathbf {e} _{1}\\a_{21}\mathbf {e} _{2}x_{1}+a_{22}\mathbf {e} _{2}x_{2}+a_{23}\mathbf {e} _{2}x_{3}&=c_{2}\mathbf {e} _{2}\\a_{31}\mathbf {e} _{3}x_{1}+a_{32}\mathbf {e} _{3}x_{2}+a_{33}\mathbf {e} _{3}x_{3}&=c_{3}\mathbf {e} _{3}\end{aligned}}

Let the vectors

{\begin{aligned}\mathbf {a} _{1}&=a_{11}\mathbf {e} _{1}+a_{21}\mathbf {e} _{2}+a_{31}\mathbf {e} _{3}\\\mathbf {a} _{2}&=a_{12}\mathbf {e} _{1}+a_{22}\mathbf {e} _{2}+a_{32}\mathbf {e} _{3}\\\mathbf {a} _{3}&=a_{13}\mathbf {e} _{1}+a_{23}\mathbf {e} _{2}+a_{33}\mathbf {e} _{3}\end{aligned}}

Adding the system of equations, it is seen that

{\begin{aligned}\mathbf {c} &=c_{1}\mathbf {e} _{1}+c_{2}\mathbf {e} _{2}+c_{3}\mathbf {e} _{3}\\&=x_{1}\mathbf {a} _{1}+x_{2}\mathbf {a} _{2}+x_{3}\mathbf {a} _{3}\end{aligned}}

Using the exterior product, each unknown scalar $x_{k}$ can be solved as

{\begin{aligned}\mathbf {c} \wedge \mathbf {a} _{2}\wedge \mathbf {a} _{3}&=x_{1}\mathbf {a} _{1}\wedge \mathbf {a} _{2}\wedge \mathbf {a} _{3}\\\mathbf {c} \wedge \mathbf {a} _{1}\wedge \mathbf {a} _{3}&=x_{2}\mathbf {a} _{2}\wedge \mathbf {a} _{1}\wedge \mathbf {a} _{3}\\\mathbf {c} \wedge \mathbf {a} _{1}\wedge \mathbf {a} _{2}&=x_{3}\mathbf {a} _{3}\wedge \mathbf {a} _{1}\wedge \mathbf {a} _{2}\\x_{1}&={\frac {\mathbf {c} \wedge \mathbf {a} _{2}\wedge \mathbf {a} _{3}}{\mathbf {a} _{1}\wedge \mathbf {a} _{2}\wedge \mathbf {a} _{3}}}\\x_{2}&={\frac {\mathbf {c} \wedge \mathbf {a} _{1}\wedge \mathbf {a} _{3}}{\mathbf {a} _{2}\wedge \mathbf {a} _{1}\wedge \mathbf {a} _{3}}}={\frac {\mathbf {a} _{1}\wedge \mathbf {c} \wedge \mathbf {a} _{3}}{\mathbf {a} _{1}\wedge \mathbf {a} _{2}\wedge \mathbf {a} _{3}}}\\x_{3}&={\frac {\mathbf {c} \wedge \mathbf {a} _{1}\wedge \mathbf {a} _{2}}{\mathbf {a} _{3}\wedge \mathbf {a} _{1}\wedge \mathbf {a} _{2}}}={\frac {\mathbf {a} _{1}\wedge \mathbf {a} _{2}\wedge \mathbf {c} }{\mathbf {a} _{1}\wedge \mathbf {a} _{2}\wedge \mathbf {a} _{3}}}\end{aligned}}

For $n$ equations in $n$ unknowns, the solution for the $k$ -th unknown $x_{k}$ generalizes to

{\begin{aligned}x_{k}&={\frac {\mathbf {a} _{1}\wedge \cdots \wedge (\mathbf {c} )_{k}\wedge \cdots \wedge \mathbf {a} _{n}}{\mathbf {a} _{1}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{n}}}\\&=(\mathbf {a} _{1}\wedge \cdots \wedge (\mathbf {c} )_{k}\wedge \cdots \wedge \mathbf {a} _{n})(\mathbf {a} _{1}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{n})^{-1}\\&={\frac {(\mathbf {a} _{1}\wedge \cdots \wedge (\mathbf {c} )_{k}\wedge \cdots \wedge \mathbf {a} _{n})(\mathbf {a} _{1}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{n})}{(\mathbf {a} _{1}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{n})(\mathbf {a} _{1}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{n})}}\\&={\frac {(\mathbf {a} _{1}\wedge \cdots \wedge (\mathbf {c} )_{k}\wedge \cdots \wedge \mathbf {a} _{n})\cdot (\mathbf {a} _{1}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{n})}{(-1)^{\frac {n(n-1)}{2}}(\mathbf {a} _{n}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{1})\cdot (\mathbf {a} _{1}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{n})}}\\&={\frac {(\mathbf {a} _{n}\wedge \cdots \wedge (\mathbf {c} )_{k}\wedge \cdots \wedge \mathbf {a} _{1})\cdot (\mathbf {a} _{1}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{n})}{(\mathbf {a} _{n}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{1})\cdot (\mathbf {a} _{1}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{n})}}\end{aligned}}

If $a k$ are linearly independent, then the $x_{k}$ can be expressed in determinant form identical to Cramer’s Rule as

{\begin{aligned}x_{k}&={\frac {(\mathbf {a} _{n}\wedge \cdots \wedge (\mathbf {c} )_{k}\wedge \cdots \wedge \mathbf {a} _{1})\cdot (\mathbf {a} _{1}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{n})}{(\mathbf {a} _{n}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{1})\cdot (\mathbf {a} _{1}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{n})}}\\[8pt]&={\begin{vmatrix}\mathbf {a} _{1}\cdot \mathbf {a} _{1}&\cdots &\mathbf {a} _{1}\cdot (\mathbf {c} )_{k}&\cdots &\mathbf {a} _{1}\cdot \mathbf {a} _{n}\\\vdots &\ddots &\vdots &\ddots &\vdots \\\mathbf {a} _{k}\cdot \mathbf {a} _{1}&\cdots &\mathbf {a} _{k}\cdot (\mathbf {c} )_{k}&\cdots &\mathbf {a} _{k}\cdot \mathbf {a} _{n}\\\vdots &\ddots &\vdots &\ddots &\vdots \\\mathbf {a} _{n}\cdot \mathbf {a} _{1}&\cdots &\mathbf {a} _{n}\cdot (\mathbf {c} )_{k}&\cdots &\mathbf {a} _{n}\cdot \mathbf {a} _{n}\end{vmatrix}}{\begin{vmatrix}\mathbf {a} _{1}\cdot \mathbf {a} _{1}&\cdots &\mathbf {a} _{1}\cdot \mathbf {a} _{k}&\cdots &\mathbf {a} _{1}\cdot \mathbf {a} _{n}\\\vdots &\ddots &\vdots &\ddots &\vdots \\\mathbf {a} _{k}\cdot \mathbf {a} _{1}&\cdots &\mathbf {a} _{k}\cdot \mathbf {a} _{k}&\cdots &\mathbf {a} _{k}\cdot \mathbf {a} _{n}\\\vdots &\ddots &\vdots &\ddots &\vdots \\\mathbf {a} _{n}\cdot \mathbf {a} _{1}&\cdots &\mathbf {a} _{n}\cdot \mathbf {a} _{k}&\cdots &\mathbf {a} _{n}\cdot \mathbf {a} _{n}\end{vmatrix}}^{-1}\\[8pt]&={\begin{vmatrix}\mathbf {a} _{1}\\\vdots \\\mathbf {a} _{k}\\\vdots \\\mathbf {a} _{n}\end{vmatrix}}{\begin{vmatrix}\mathbf {a} _{1}&\cdots &(\mathbf {c} )_{k}&\cdots &\mathbf {a} _{n}\end{vmatrix}}{\begin{vmatrix}\mathbf {a} _{1}\\\vdots \\\mathbf {a} _{k}\\\vdots \\\mathbf {a} _{n}\end{vmatrix}}^{-1}{\begin{vmatrix}\mathbf {a} _{1}&\cdots &\mathbf {a} _{k}&\cdots &\mathbf {a} _{n}\end{vmatrix}}^{-1}\\[8pt]&={\begin{vmatrix}\mathbf {a} _{1}&\cdots &(\mathbf {c} )_{k}&\cdots &\mathbf {a} _{n}\end{vmatrix}}{\begin{vmatrix}\mathbf {a} _{1}&\cdots &\mathbf {a} _{k}&\cdots &\mathbf {a} _{n}\end{vmatrix}}^{-1}\\[8pt]&={\begin{vmatrix}a_{11}&\ldots &c_{1}&\cdots &a_{1n}\\\vdots &\ddots &\vdots &\ddots &\vdots \\a_{k1}&\cdots &c_{k}&\cdots &a_{kn}\\\vdots &\ddots &\vdots &\ddots &\vdots \\a_{n1}&\cdots &c_{n}&\cdots &a_{nn}\end{vmatrix}}{\begin{vmatrix}a_{11}&\ldots &a_{1k}&\cdots &a_{1n}\\\vdots &\ddots &\vdots &\ddots &\vdots \\a_{k1}&\cdots &a_{kk}&\cdots &a_{kn}\\\vdots &\ddots &\vdots &\ddots &\vdots \\a_{n1}&\cdots &a_{nk}&\cdots &a_{nn}\end{vmatrix}}^{-1}\end{aligned}}

where $(c) k$ denotes the substitution of vector $a k$ with vector $c$ in the $k$ -th numerator position.

Systems of vector equations: Cramer’s Rule extended

Consider the system of $n$ vector equations in $n$ unknown vectors $\mathbf {x} _{1},\ldots ,\mathbf {x} _{k},\ldots ,\mathbf {x} _{n}$

{\begin{aligned}a_{11}\mathbf {x} _{1}+\cdots +a_{1k}\mathbf {x} _{k}+\cdots +a_{1n}\mathbf {x} _{n}&=\mathbf {c} _{1}\\&\vdots \\a_{k1}\mathbf {x} _{1}+\cdots +a_{kk}\mathbf {x} _{k}+\cdots +a_{kn}\mathbf {x} _{n}&=\mathbf {c} _{k}\\&\vdots \\a_{n1}\mathbf {x} _{1}+\cdots +a_{nk}\mathbf {x} _{k}+\cdots +a_{nn}\mathbf {x} _{n}&=\mathbf {c} _{n}\end{aligned}}

where we want to solve for each unknown vector $x k$ in terms of the given scalar constants $a_{rc}$ and vector constants $c k$ .

Solving for unknown vectors

Using the Clifford algebra (or geometric algebra) of Euclidean vectors, the vectors $x k$ and $c k$ are in a vector space having $d$ dimensions spanned by a basis of $d$ orthonormal base vectors $\mathbf {e} _{1},\ldots ,\mathbf {e} _{d}$ . This $d$ -dimensional space can be extended to be a subspace of a larger $(d + n)$ -dimensional space spanned by $\mathbf {e} _{1},\ldots ,\mathbf {e} _{d},\ldots ,\mathbf {e} _{d+k},\ldots ,\mathbf {e} _{d+n}$ .

Multiply the $k$ -th equation by the $(d+k)$ th orthonormal base unit $\mathbf {e} _{d+k}$ , using the exterior product on the right, as

{\begin{aligned}\left(a_{11}\mathbf {x} _{1}+\cdots +a_{1k}\mathbf {x} _{k}+\cdots +a_{1n}\mathbf {x} _{n}\right)\wedge \mathbf {e} _{d+1}&=\mathbf {c} _{1}\wedge \mathbf {e} _{d+1}\\&\vdots \\(a_{k1}\mathbf {x} _{1}+\cdots +a_{kk}\mathbf {x} _{k}+\cdots +a_{kn}\mathbf {x} _{n})\wedge \mathbf {e} _{d+k}&=\mathbf {c} _{k}\wedge \mathbf {e} _{d+k}\\&\vdots \\(a_{n1}\mathbf {x} _{1}+\cdots +a_{nk}\mathbf {x} _{k}+\cdots +a_{nn}\mathbf {x} _{n})\wedge \mathbf {e} _{d+n}&=\mathbf {c} _{n}\wedge \mathbf {e} _{d+n}\end{aligned}}

The original system of equations in grade- $1$ vectors is now transformed into a system of equations in grade- $2$ vectors, and no parallel components have been deleted by the exterior products since they multiply on perpendicular extended base units.

Let the vectors

{\begin{aligned}\mathbf {a} _{1}&=a_{11}\mathbf {e} _{d+1}+\cdots +a_{k1}\mathbf {e} _{d+k}+\cdots +a_{n1}\mathbf {e} _{d+n}\\&\vdots \\\mathbf {a} _{k}&=a_{1k}\mathbf {e} _{d+1}+\cdots +a_{kk}\mathbf {e} _{d+k}+\cdots +a_{nk}\mathbf {e} _{d+n}\\&\vdots \\\mathbf {a} _{n}&=a_{1n}\mathbf {e} _{d+1}+\cdots +a_{kn}\mathbf {e} _{d+k}+\cdots +a_{nn}\mathbf {e} _{d+n}\end{aligned}}

Adding the transformed system of equations gives

{\begin{aligned}\mathbf {C} &=\mathbf {c} _{1}\wedge \mathbf {e} _{d+1}+\cdots +\mathbf {c} _{k}\wedge \mathbf {e} _{d+k}+\cdots +\mathbf {c} _{n}\wedge \mathbf {e} _{d+n}\\&=\mathbf {C} _{1}+\cdots +\mathbf {C} _{k}+\cdots +\mathbf {C} _{n}\\&=\mathbf {x} _{1}\wedge \mathbf {a} _{1}+\cdots +\mathbf {x} _{k}\wedge \mathbf {a} _{k}+\cdots +\mathbf {x} _{n}\wedge \mathbf {a} _{n}\end{aligned}}

which is a $2$ -vector equation. These exterior (wedge) products are equal to Clifford products since the factors are perpendicular.

For $n=3,\mathbf {x} _{1},\mathbf {x} _{2}$ , and $\mathbf {x} _{3}$ are solved by multiplying $\mathbf {a} _{2}\wedge \mathbf {a} _{3},\mathbf {a} _{1}\wedge \mathbf {a} _{3}$ , and $\mathbf {a} _{1}\wedge \mathbf {a} _{2}$ , respectively, on the right with exterior products

{\begin{aligned}\mathbf {C} \wedge \mathbf {a} _{2}\wedge \mathbf {a} _{3}&=\mathbf {x} _{1}\wedge \mathbf {a} _{1}\wedge \mathbf {a} _{2}\wedge \mathbf {a} _{3}=\mathbf {x} _{1}(\mathbf {a} _{1}\wedge \mathbf {a} _{2}\wedge \mathbf {a} _{3})\\\mathbf {C} \wedge \mathbf {a} _{1}\wedge \mathbf {a} _{3}&=\mathbf {x} _{2}\wedge \mathbf {a} _{2}\wedge \mathbf {a} _{1}\wedge \mathbf {a} _{3}=\mathbf {x} _{2}(\mathbf {a} _{2}\wedge \mathbf {a} _{1}\wedge \mathbf {a} _{3})\\\mathbf {C} \wedge \mathbf {a} _{1}\wedge \mathbf {a} _{2}&=\mathbf {x} _{3}\wedge \mathbf {a} _{3}\wedge \mathbf {a} _{1}\wedge \mathbf {a} _{2}=\mathbf {x} _{3}(\mathbf {a} _{3}\wedge \mathbf {a} _{1}\wedge \mathbf {a} _{2})\\[6pt]\mathbf {x} _{1}&=(\mathbf {C} \wedge \mathbf {a} _{2}\wedge \mathbf {a} _{3})(\mathbf {a} _{1}\wedge \mathbf {a} _{2}\wedge \mathbf {a} _{3})^{-1}={\frac {(\mathbf {C} \wedge \mathbf {a} _{2}\wedge \mathbf {a} _{3})\cdot ((-1)^{1-1}\mathbf {a} _{1}\wedge \mathbf {a} _{2}\wedge \mathbf {a} _{3})}{(\mathbf {a} _{1}\wedge \mathbf {a} _{2}\wedge \mathbf {a} _{3})^{2}}}\\[6pt]\mathbf {x} _{2}&=(\mathbf {C} \wedge \mathbf {a} _{1}\wedge \mathbf {a} _{3})(\mathbf {a} _{2}\wedge \mathbf {a} _{1}\wedge \mathbf {a} _{3})^{-1}={\frac {(\mathbf {a} _{1}\wedge \mathbf {C} \wedge \mathbf {a} _{3})\cdot \left((-1)^{2-1}\mathbf {a} _{1}\wedge \mathbf {a} _{2}\wedge \mathbf {a} _{3}\right)}{(\mathbf {a} _{1}\wedge \mathbf {a} _{2}\wedge \mathbf {a} _{3})^{2}}}\\[6pt]\mathbf {x} _{3}&=(\mathbf {C} \wedge \mathbf {a} _{1}\wedge \mathbf {a} _{2})(\mathbf {a} _{3}\wedge \mathbf {a} _{1}\wedge \mathbf {a} _{2})^{-1}={\frac {(\mathbf {a} _{1}\wedge \mathbf {a} _{2}\wedge \mathbf {C} )\cdot ((-1)^{3-1}\mathbf {a} _{1}\wedge \mathbf {a} _{2}\wedge \mathbf {a} _{3})}{(\mathbf {a} _{1}\wedge \mathbf {a} _{2}\wedge \mathbf {a} _{3})^{2}}}\end{aligned}}

In the solution of $x 1$ , and similarly for $x 2$ and $\mathbf {x} _{3},\mathbf {C} \wedge \mathbf {a} _{2}\wedge \mathbf {a} _{3}$ is a $4$ -blade having $3$ of its $4$ dimensions in the extended dimensions $\mathbf {e} _{d+k}$ , and the remaining one dimension is in the solution space of the vectors $x k$ and $c k$ . The $3$ -blade $\mathbf {a} _{1}\wedge \mathbf {a} _{2}\wedge \mathbf {a} _{3}$ is in the problem space, or the extended dimensions. The inner product $(\mathbf {C} \wedge \mathbf {a} _{2}\wedge \mathbf {a} _{3})\cdot (\mathbf {a} _{1}\wedge \mathbf {a} _{2}\wedge \mathbf {a} _{3})$ reduces, or contracts, to a $1$ -vector in the $d$ -dimensional solution space. The divisor $(\mathbf {a} _{1}\wedge \mathbf {a} _{2}\wedge \mathbf {a} _{3})^{2}$ , the square of a blade, is a scalar product that can be computed by a determinant. Since $C$ is a $2$ -vector, it commutes $\mathbf {C} \wedge \mathbf {a} _{k}=\mathbf {a} _{k}\wedge \mathbf {C}$ with the vectors $\mathbf {a} _{k}$ without sign change and is conveniently shifted into the vacant $k$ -th spot. A sign change $(-1)^{k-1}$ occurs in every even $(+)$ $k$ -th solution $x +$ , such as $x 2$ , due to commuting or shifting $a k$ right an odd number of times, in the dividend blade $\mathbf {a} _{1}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{n}$ , into its $k$ -th spot.

In general we have:

{\begin{aligned}\mathbf {x} _{k}&=(\mathbf {a} _{1}\wedge \cdots \wedge (\mathbf {C} )_{k}\wedge \cdots \wedge \mathbf {a} _{n})\cdot ((-1)^{k-1}\mathbf {a} _{1}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{n})^{-1}\\[6pt]&={\frac {(\mathbf {a} _{1}\wedge \cdots \wedge (\mathbf {C} )_{k}\wedge \cdots \wedge \mathbf {a} _{n})\cdot ((-1)^{k-1}\mathbf {a} _{1}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{n})}{(\mathbf {a} _{1}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{n})^{2}}}\\[6pt]&={\frac {(-1)^{k-1}(\mathbf {a} _{1}\wedge \cdots \wedge (\mathbf {C} )_{k}\wedge \cdots \wedge \mathbf {a} _{n})\cdot (\mathbf {a} _{1}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{n})}{(-1)^{\frac {n(n-1)}{2}}(\mathbf {a} _{n}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{1})\cdot (\mathbf {a} _{1}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{n})}}\\[6pt]&={\frac {(-1)^{k-1}(\mathbf {a} _{1}\wedge \cdots \wedge (\mathbf {C} )_{k}\wedge \cdots \wedge \mathbf {a} _{n})\cdot (\mathbf {a} _{n}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{1})}{(\mathbf {a} _{n}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{1})\cdot (\mathbf {a} _{1}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{n})}}\\[6pt]&={\frac {(\mathbf {a} _{1}\wedge \cdots \wedge (\mathbf {C} )_{k}\wedge \cdots \wedge \mathbf {a} _{n})\cdot (\mathbf {a} _{n}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{1})}{(-1)^{k-1}{\begin{vmatrix}\mathbf {a} _{1}\cdot \mathbf {a} _{1}&\cdots &\mathbf {a} _{1}\cdot \mathbf {a} _{k}&\cdots &\mathbf {a} _{1}\cdot \mathbf {a} _{n}\\\vdots &\ddots &\vdots &\ddots &\vdots \\\mathbf {a} _{k}\cdot \mathbf {a} _{1}&\cdots &\mathbf {a} _{k}\cdot \mathbf {a} _{k}&\cdots &\mathbf {a} _{k}\cdot \mathbf {a} _{n}\\\vdots &\ddots &\vdots &\ddots &\vdots \\\mathbf {a} _{n}\cdot \mathbf {a} _{1}&\cdots &\mathbf {a} _{n}\cdot \mathbf {a} _{k}&\cdots &\mathbf {a} _{n}\cdot \mathbf {a} _{n}\end{vmatrix}}}}\\[6pt]&={\frac {(\mathbf {a} _{1}\wedge \cdots \wedge (\mathbf {C} )_{k}\wedge \cdots \wedge \mathbf {a} _{n})\cdot (\mathbf {a} _{n}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{1})}{(-1)^{k-1}{\begin{vmatrix}\mathbf {a} _{1}&\cdots &\mathbf {a} _{k}&\cdots &\mathbf {a} _{n}\end{vmatrix}}^{2}}}\end{aligned}}

where $(\mathbf {C} )_{k}$ denotes replacing the $k$ -th element $a k$ with $C$ . The factor $(-1)^{k-1}$ accounts for shifting the $k$ -th vector $a k$ by $k-1$ places. The $(n+1)$ -blade $\mathbf {a} _{1}\wedge \cdots \wedge (\mathbf {C} )_{k}\wedge \cdots \wedge \mathbf {a} _{n}$ is multiplied by inner product with the reversed $n$ -blade $\mathbf {a} _{n}\wedge \cdots \wedge \mathbf {a} _{k}\wedge \cdots \wedge \mathbf {a} _{1}$ , producing a $1$ -vector in the $d$ -dimensional solution space.

Using this formula, for solving a system of $n$ vector equations having $n$ unknown vectors $\mathbf {x} _{1},\ldots ,\mathbf {x} _{k},\ldots ,\mathbf {x} _{n}$ in a $d$ -dimensional space, requires extending the space to $(d+n)$ dimensions. The extended $n$ dimensions are essentially used to hold the system of $n$ equations represented by the scalar constants $1$ -vectors $a k$ and the vector constants $1$ -vectors $c k$ . The $n$ vector constants $c k$ are grade-increased to $2$ -vectors or grade- $2$ vectors $\mathbf {c} _{k}\wedge \mathbf {e} _{d+k}=\mathbf {C} _{k}$ that are partly in the extended space. Notice the similarity of form to Cramer’s Rule for systems of scalar equations; a basis is added in both cases. The advantage of this formula is that it avoids scalar coordinates and the results are directly in terms of vectors.

The system of vector equations can also be solved in terms of coordinates, without using the geometric algebra formula above, by the usual process of expanding all the vectors in the system into their coordinate vector components. In each expanded equation, the parallel (like) components are summed into $d$ groups that form $d$ independent systems of $n$ unknown coordinates in $n$ equations. Each system solves for one dimension of coordinates. After solving the $d$ systems, the solved vectors can be reassembled from the solved coordinates. It seems that few books explicitly discuss this process for systems of vector equations. This process is the application of the abstract concept of linear independence as it applies to linearly independent dimensions of vector components or unit vectors. The linear independence concept extends to multivectors in geometric algebra, where each unique unit blade is linearly independent of the others for the purpose of solving equations or systems of equations. An equation containing a sum of $d$ linearly independent terms can be rewritten as $d$ separate independent equations, each in the terms of one dimension.

Solving for unknown scalars

Note that, instead of solving for unknown vectors $x k$ , the $x k$ may be known vectors and the vectors $a k$ may be unknown. The vectors $a 1, a 2, a 3$ could be solved as

{\begin{aligned}-\mathbf {C} \wedge \mathbf {x} _{2}\wedge \mathbf {x} _{3}&=\mathbf {a} _{1}\wedge \mathbf {x} _{1}\wedge \mathbf {x} _{2}\wedge \mathbf {x} _{3}=\mathbf {a} _{1}(\mathbf {x} _{1}\wedge \mathbf {x} _{2}\wedge \mathbf {x} _{3})\\-\mathbf {C} \wedge \mathbf {x} _{1}\wedge \mathbf {x} _{3}&=\mathbf {a} _{2}\wedge \mathbf {x} _{2}\wedge \mathbf {x} _{1}\wedge \mathbf {x} _{3}=\mathbf {a} _{2}(\mathbf {x} _{2}\wedge \mathbf {x} _{1}\wedge \mathbf {x} _{3})\\-\mathbf {C} \wedge \mathbf {x} _{1}\wedge \mathbf {x} _{2}&=\mathbf {a} _{3}\wedge \mathbf {x} _{3}\wedge \mathbf {x} _{1}\wedge \mathbf {x} _{2}=\mathbf {a} _{3}(\mathbf {x} _{3}\wedge \mathbf {x} _{1}\wedge \mathbf {x} _{2})\\\mathbf {a} _{1}&=(-\mathbf {C} \wedge \mathbf {x} _{2}\wedge \mathbf {x} _{3})(\mathbf {x} _{1}\wedge \mathbf {x} _{2}\wedge \mathbf {x} _{3})^{-1}\\&={\frac {(-\mathbf {C} \wedge \mathbf {x} _{2}\wedge \mathbf {x} _{3})\cdot ((-1)^{1-1}\mathbf {x} _{1}\wedge \mathbf {x} _{2}\wedge \mathbf {x} _{3})}{(\mathbf {x} _{1}\wedge \mathbf {x} _{2}\wedge \mathbf {x} _{3})^{2}}}\\\mathbf {a} _{2}&=(-\mathbf {C} \wedge \mathbf {x} _{1}\wedge \mathbf {x} _{3})(\mathbf {x} _{2}\wedge \mathbf {x} _{1}\wedge \mathbf {x} _{3})^{-1}\\&={\frac {(-\mathbf {x} _{1}\wedge \mathbf {C} \wedge \mathbf {x} _{3})\cdot ((-1)^{2-1}\mathbf {x} _{1}\wedge \mathbf {x} _{2}\wedge \mathbf {x} _{3})}{(\mathbf {x} _{1}\wedge \mathbf {x} _{2}\wedge \mathbf {x} _{3})^{2}}}\\\mathbf {a} _{3}&=(-\mathbf {C} \wedge \mathbf {x} _{1}\wedge \mathbf {x} _{2})(\mathbf {x} _{3}\wedge \mathbf {x} _{1}\wedge \mathbf {x} _{2})^{-1}\\&={\frac {(-\mathbf {x} _{1}\wedge \mathbf {x} _{2}\wedge \mathbf {C} )\cdot \left((-1)^{3-1}\mathbf {x} _{1}\wedge \mathbf {x} _{2}\wedge \mathbf {x} _{3}\right)}{(\mathbf {x} _{1}\wedge \mathbf {x} _{2}\wedge \mathbf {x} _{3})^{2}}}\end{aligned}}

In general, vector $a k$ may be solved as

{\begin{aligned}\mathbf {a} _{k}&=(-\mathbf {x} _{1}\wedge \cdots \wedge (\mathbf {C} )_{k}\wedge \cdots \wedge \mathbf {x} _{n})\cdot \left((-1)^{k-1}\mathbf {x} _{1}\wedge \cdots \wedge \mathbf {x} _{k}\wedge \cdots \wedge \mathbf {x} _{n}\right)^{-1}\\&={\frac {\left(-\mathbf {x} _{1}\wedge \cdots \wedge (\mathbf {C} )_{k}\wedge \cdots \wedge \mathbf {x} _{n})\cdot ((-1)^{k-1}\mathbf {x} _{1}\wedge \cdots \wedge \mathbf {x} _{k}\wedge \cdots \wedge \mathbf {x} _{n}\right)}{\left(\mathbf {x} _{1}\wedge \cdots \wedge \mathbf {x} _{k}\wedge \cdots \wedge \mathbf {x} _{n}\right)^{2}}}\\&={\frac {(-1)^{k}(\mathbf {x} _{1}\wedge \cdots \wedge (\mathbf {C} )_{k}\wedge \cdots \wedge \mathbf {x} _{n})\cdot (\mathbf {x} _{1}\wedge \cdots \wedge \mathbf {x} _{k}\wedge \cdots \wedge \mathbf {x} _{n})}{(-1)^{n(n-1)/2}(\mathbf {x} _{n}\wedge \cdots \wedge \mathbf {x} _{k}\wedge \cdots \wedge \mathbf {x} _{1})\cdot (\mathbf {x} _{1}\wedge \cdots \wedge \mathbf {x} _{k}\wedge \cdots \wedge \mathbf {x} _{n})}}\\&={\frac {(-1)^{k}(\mathbf {x} _{1}\wedge \cdots \wedge (\mathbf {C} )_{k}\wedge \cdots \wedge \mathbf {x} _{n})\cdot (\mathbf {x} _{n}\wedge \cdots \wedge \mathbf {x} _{k}\wedge \cdots \wedge \mathbf {x} _{1})}{(\mathbf {x} _{n}\wedge \cdots \wedge \mathbf {x} _{k}\wedge \cdots \wedge \mathbf {x} _{1})\cdot (\mathbf {x} _{1}\wedge \cdots \wedge \mathbf {x} _{k}\wedge \cdots \wedge \mathbf {x} _{n})}}\\&={\frac {(\mathbf {x} _{1}\wedge \cdots \wedge (\mathbf {C} )_{k}\wedge \cdots \wedge \mathbf {x} _{n})\cdot (\mathbf {x} _{n}\wedge \cdots \wedge \mathbf {x} _{k}\wedge \cdots \wedge \mathbf {x} _{1})}{(-1)^{k}{\begin{vmatrix}\mathbf {x} _{1}\cdot \mathbf {x} _{1}&\cdots &\mathbf {x} _{1}\cdot \mathbf {x} _{k}&\cdots &\mathbf {x} _{1}\cdot \mathbf {x} _{n}\\\vdots &\ddots &\vdots &\ddots &\vdots \\\mathbf {x} _{k}\cdot \mathbf {x} _{1}&\cdots &\mathbf {x} _{k}\cdot \mathbf {x} _{k}&\cdots &\mathbf {x} _{k}\cdot \mathbf {x} _{n}\\\vdots &\ddots &\vdots &\ddots &\vdots \\\mathbf {x} _{n}\cdot \mathbf {x} _{1}&\cdots &\mathbf {x} _{n}\cdot \mathbf {x} _{k}&\cdots &\mathbf {x} _{n}\cdot \mathbf {x} _{n}\end{vmatrix}}}}\\&={\frac {(\mathbf {x} _{1}\wedge \cdots \wedge (\mathbf {C} )_{k}\wedge \cdots \wedge \mathbf {x} _{n})\cdot (\mathbf {x} _{n}\wedge \cdots \wedge \mathbf {x} _{k}\wedge \cdots \wedge \mathbf {x} _{1})}{(-1)^{k}{\begin{vmatrix}\mathbf {x} _{1}&\cdots &\mathbf {x} _{k}&\cdots &\mathbf {x} _{n}\end{vmatrix}}^{2}}}\end{aligned}}

and represents transforming or projecting the system, or each vector $c k$ , onto the basis of vectors $\mathbf {x} _{1},\ldots ,\mathbf {x} _{k},\ldots ,\mathbf {x} _{n}$ which need not be orthonormal. However solving for the vectors $\mathbf {a} _{k}$ by this formula is unnecessary and unnecessarily requires $n$ vectors $\mathbf {c} _{1},\ldots ,\mathbf {c} _{k},\ldots ,\mathbf {c} _{n}$ at a time. Solving each equation is independent in this case. This has been shown to clarify the usage, as far as what not to do, unless one has an unusual need to solve a particular vector $a k$ . Instead, the following can be done in the case of projecting vectors $c k$ onto a new arbitrary basis $x k$ .

Projecting a vector onto an arbitrary basis.

Projecting any vector $c$ onto a new arbitrary basis $\mathbf {x} _{1},\ldots ,\mathbf {x} _{k},\ldots ,\mathbf {x} _{n}$ as

{\begin{aligned}\mathbf {c} &=c_{1}\mathbf {e} _{1}+\cdots +c_{k}\mathbf {e} _{k}+\cdots +c_{n}\mathbf {e} _{n}\\&=a_{1}\mathbf {x} _{1}+\cdots +a_{k}\mathbf {x} _{k}+\cdots +a_{n}\mathbf {x} _{n}\end{aligned}}

where each $x k$ is written in the form

\mathbf {x} _{k}=x_{k1}\mathbf {e} _{1}+\cdots +x_{kk}\mathbf {e} _{k}+\cdots +x_{kn}\mathbf {e} _{n}

is a system of $n$ scalar equations in $n$ unknown coordinates $a_{k}$

{\begin{aligned}a_{1}x_{11}+\cdots +a_{k}x_{k1}+\cdots +a_{n}x_{n1}&=c_{1}\\&\vdots \\a_{1}x_{1k}+\cdots +a_{k}x_{kk}+\cdots +a_{n}x_{nk}&=c_{k}\\&\vdots \\a_{1}x_{1n}+\cdots +a_{k}x_{kn}+\cdots +a_{n}x_{nn}&=c_{n}\end{aligned}}

and can be solved using the ordinary Cramer’s rule for systems of scalar equations, where the step of adding a basis can be considered as already done. For $n = 3$ , the solutions for the scalars $a_{k}$ are

{\begin{aligned}\mathbf {c} \wedge \mathbf {x} _{2}\wedge \mathbf {x} _{3}&=a_{1}\mathbf {x} _{1}\wedge \mathbf {x} _{2}\wedge \mathbf {x} _{3}\\\mathbf {c} \wedge \mathbf {x} _{1}\wedge \mathbf {x} _{3}&=a_{2}\mathbf {x} _{2}\wedge \mathbf {x} _{1}\wedge \mathbf {x} _{3}\\\mathbf {c} \wedge \mathbf {x} _{1}\wedge \mathbf {x} _{2}&=a_{3}\mathbf {x} _{3}\wedge \mathbf {x} _{1}\wedge \mathbf {x} _{2}\\a_{1}&={\frac {\mathbf {c} \wedge \mathbf {x} _{2}\wedge \mathbf {x} _{3}}{\mathbf {x} _{1}\wedge \mathbf {x} _{2}\wedge \mathbf {x} _{3}}}\\a_{2}&={\frac {\mathbf {c} \wedge \mathbf {x} _{1}\wedge \mathbf {x} _{3}}{\mathbf {x} _{2}\wedge \mathbf {x} _{1}\wedge \mathbf {x} _{3}}}={\frac {\mathbf {x} _{1}\wedge \mathbf {c} \wedge \mathbf {x} _{3}}{\mathbf {x} _{1}\wedge \mathbf {x} _{2}\wedge \mathbf {x} _{3}}}\\a_{3}&={\frac {\mathbf {c} \wedge \mathbf {x} _{1}\wedge \mathbf {x} _{2}}{\mathbf {x} _{3}\wedge \mathbf {x} _{1}\wedge \mathbf {x} _{2}}}={\frac {\mathbf {x} _{1}\wedge \mathbf {x} _{2}\wedge \mathbf {c} }{\mathbf {x} _{1}\wedge \mathbf {x} _{2}\wedge \mathbf {x} _{3}}}\end{aligned}}

For $n$ basis vectors ( $n$ equations in $n$ unknowns), the solution for the $k$ -th unknown scalar coordinate $a_{k}$ generalizes to

a_{k}={\frac {\mathbf {x} _{1}\wedge \cdots \wedge (\mathbf {c} )_{k}\wedge \cdots \wedge \mathbf {x} _{n}}{\mathbf {x} _{1}\wedge \cdots \wedge \mathbf {x} _{k}\wedge \cdots \wedge \mathbf {x} _{n}}}={\begin{vmatrix}x_{11}&\ldots &c_{1}&\cdots &x_{n1}\\\vdots &\ddots &\vdots &\ddots &\vdots \\x_{1k}&\cdots &c_{k}&\cdots &x_{nk}\\\vdots &\ddots &\vdots &\ddots &\vdots \\x_{1n}&\cdots &c_{n}&\cdots &x_{nn}\end{vmatrix}}{\begin{vmatrix}x_{11}&\ldots &x_{k1}&\cdots &x_{n1}\\\vdots &\ddots &\vdots &\ddots &\vdots \\x_{1k}&\cdots &x_{kk}&\cdots &x_{nk}\\\vdots &\ddots &\vdots &\ddots &\vdots \\x_{1n}&\cdots &x_{kn}&\cdots &x_{nn}\end{vmatrix}}^{-1}

the formula for Cramer’s rule.

The remainder of this subsection outlines some additional concepts or applications that may be important to consider when using arbitrary bases, but otherwise you may skip ahead to the next subsection.

The reciprocal basis $\mathbf {x} _{1}^{\prime },\ldots ,\mathbf {x} _{k}^{\prime },\ldots ,\mathbf {x} _{n}^{\prime }$ of the arbitrary basis $\mathbf {x} _{1},\ldots ,\mathbf {x} _{k},\ldots ,\mathbf {x} _{n}$ is such that $\mathbf {c} \cdot \mathbf {x} _{k}^{\prime }=a_{k}$ , while in general $\mathbf {c} \cdot \mathbf {x} _{k}\neq a_{k}$ . The $k$ -th reciprocal base $\mathbf {x} _{k}^{\prime }$ is

{\begin{aligned}\mathbf {c} \cdot \mathbf {x} _{k}^{\prime }=a_{k}&=(-1)^{k-1}(\mathbf {c} \wedge \mathbf {x} _{1}\wedge \cdots \wedge ()_{k}\wedge \cdots \wedge \mathbf {x} _{n})\cdot (\mathbf {x} _{1}\wedge \cdots \wedge \mathbf {x} _{k}\wedge \cdots \wedge \mathbf {x} _{n})^{-1}\\&=(-1)^{k-1}\mathbf {c} \cdot ((\mathbf {x} _{1}\wedge \cdots \wedge ()_{k}\wedge \cdots \wedge \mathbf {x} _{n})\cdot (\mathbf {x} _{1}\wedge \cdots \wedge \mathbf {x} _{k}\wedge \cdots \wedge \mathbf {x} _{n})^{-1})\\\mathbf {x} _{k}^{\prime }&=(-1)^{k-1}(\mathbf {x} _{1}\wedge \cdots \wedge ()_{k}\wedge \cdots \wedge \mathbf {x} _{n})\cdot (\mathbf {x} _{1}\wedge \cdots \wedge \mathbf {x} _{k}\wedge \cdots \wedge \mathbf {x} _{n})^{-1}\end{aligned}}

where $()_{k}$ denotes that the $k$ -th vector $x k$ is removed from the blade. In mathematics literature, the reciprocal basis $\mathbf {x} _{1}^{\prime },\ldots ,\mathbf {x} _{k}^{\prime },\ldots ,\mathbf {x} _{n}^{\prime }$ is usually written using superscript indices as $\mathbf {x} ^{1},\ldots ,\mathbf {x} ^{k},\ldots ,\mathbf {x} ^{n}$ which should not be confused as exponents or powers of the vectors. The reciprocal bases can be computed once and saved, and then any vector $c$ can be projected onto the arbitrary basis as $\mathbf {c} =(\mathbf {c} \cdot \mathbf {x} ^{k})\mathbf {x} _{k}$ with implied summation over the range of $k\in \{1,\cdots ,n\}$ .

Note that

{\begin{aligned}\mathbf {x} _{k}\cdot \mathbf {x} ^{k}&=(-1)^{k-1}\mathbf {x} _{k}\cdot ((\mathbf {x} _{1}\wedge \cdots \wedge ()_{k}\wedge \cdots \wedge \mathbf {x} _{n})\cdot (\mathbf {x} _{1}\wedge \cdots \wedge \mathbf {x} _{k}\wedge \cdots \wedge \mathbf {x} _{n})^{-1})\\&=(-1)^{k-1}(\mathbf {x} _{k}\wedge \mathbf {x} _{1}\wedge \cdots \wedge ()_{k}\wedge \cdots \wedge \mathbf {x} _{n})\cdot (\mathbf {x} _{1}\wedge \cdots \wedge \mathbf {x} _{k}\wedge \cdots \wedge \mathbf {x} _{n})^{-1}\\&=(\mathbf {x} _{1}\wedge \cdots \wedge \mathbf {x} _{k}\wedge \cdots \wedge \mathbf {x} _{n})\cdot (\mathbf {x} _{1}\wedge \cdots \wedge \mathbf {x} _{k}\wedge \cdots \wedge \mathbf {x} _{n})^{-1}\\&=1\\&=\mathbf {x} ^{k}\cdot \mathbf {x} _{k}\\\mathbf {x} ^{k}\cdot \mathbf {x} _{k}&=(-1)^{k-1}\mathbf {x} ^{k}\cdot ((\mathbf {x} ^{1}\wedge \cdots \wedge ()^{k}\wedge \cdots \wedge \mathbf {x} ^{n})\cdot (\mathbf {x} ^{1}\wedge \cdots \wedge \mathbf {x} ^{k}\wedge \cdots \wedge \mathbf {x} ^{n})^{-1})\end{aligned}}

and that for $j\neq k$

{\begin{aligned}\mathbf {x} _{j}\cdot \mathbf {x} ^{k}&=(-1)^{k-1}\mathbf {x} _{j}\cdot ((\mathbf {x} _{1}\wedge \cdots \wedge \mathbf {x} _{j}\wedge \cdots \wedge ()_{k}\wedge \cdots \wedge \mathbf {x} _{n})\cdot (\mathbf {x} _{1}\wedge \cdots \wedge \mathbf {x} _{k}\wedge \cdots \wedge \mathbf {x} _{n})^{-1})\\&=(-1)^{k-1}(\mathbf {x} _{j}\wedge \mathbf {x} _{1}\wedge \cdots \wedge \mathbf {x} _{j}\wedge \cdots \wedge ()_{k}\wedge \cdots \wedge \mathbf {x} _{n})\cdot (\mathbf {x} _{1}\wedge \cdots \wedge \mathbf {x} _{k}\wedge \cdots \wedge \mathbf {x} _{n})^{-1}\\&=(\mathbf {x} _{1}\wedge \cdots \wedge \mathbf {x} _{j}\wedge \cdots \wedge (\mathbf {x} _{j})_{k}\wedge \cdots \wedge \mathbf {x} _{n})\cdot (\mathbf {x} _{1}\wedge \cdots \wedge \mathbf {x} _{k}\wedge \cdots \wedge \mathbf {x} _{n})^{-1}\\&=0\\&=\mathbf {x} ^{k}\cdot \mathbf {x} _{j}\\\mathbf {x} ^{k}\cdot \mathbf {x} _{j}&=(-1)^{j-1}\mathbf {x} ^{k}\cdot ((\mathbf {x} ^{1}\wedge \cdots \wedge \mathbf {x} ^{k}\wedge \cdots \wedge ()^{j}\wedge \cdots \wedge \mathbf {x} ^{n})\cdot (\mathbf {x} ^{1}\wedge \cdots \wedge \mathbf {x} ^{k}\wedge \cdots \wedge \mathbf {x} ^{n})^{-1})\end{aligned}}

therefore if the $\mathbf {x} ^{k}$ are the new arbitrary bases, then the $x k$ are the reciprocal bases and we also have

\mathbf {c} =(\mathbf {c} \cdot \mathbf {x} _{k})\mathbf {x} ^{k}

with the summation convention over $k$ .

If we abandon the old basis $e k$ and old coordinates $c_{k}$ and $a_{k}$ of $c$ and refer $c$ only to the new basis $x k$ and its reciprocal $\mathbf {x} ^{k}$ , then we can newly rename coordinates for $c$ on the new bases as

{\begin{aligned}\mathbf {c} &=(\mathbf {c} \cdot \mathbf {x} ^{k})\mathbf {x} _{k}=c^{k}\mathbf {x} _{k}\\\mathbf {c} &=(\mathbf {c} \cdot \mathbf {x} _{k})\mathbf {x} ^{k}=c_{k}\mathbf {x} ^{k}\end{aligned}}

This is a coordinates naming convention that is often used implicitly such that $c^{k}=\mathbf {c} \cdot \mathbf {x} ^{k}$ and $c_{k}=\mathbf {c} \cdot \mathbf {x} _{k}$ are understood as identities. Using this coordinates naming convention we can derive the expression

\mathbf {c} \cdot \mathbf {c} =c^{k}\mathbf {x} _{k}\cdot c_{j}\mathbf {x} ^{j}=c^{k}c_{j}\mathbf {x} _{k}\cdot \mathbf {x} ^{j}

Since $\mathbf {x} _{k}\cdot \mathbf {x} ^{j}=1$ for $j=k$ and $\mathbf {x} _{k}\cdot \mathbf {x} ^{j}=0$ for $j\neq k$ (or $\mathbf {x} _{k}\cdot \mathbf {x} ^{j}=\delta _{k}^{j}$ using Kronecker delta), this expression reduces to the identity

\mathbf {c} \cdot \mathbf {c} =c^{k}c_{k}=(\mathbf {c} \cdot \mathbf {x} ^{k})(\mathbf {c} \cdot \mathbf {x} _{k}).

Since $c$ is an arbitrary vector, we can choose any two vectors $u, v$ and find the identities

{\begin{aligned}\mathbf {u} \cdot \mathbf {v} &=u^{k}v_{k}=u_{k}v^{k}\\&=(\mathbf {u} \cdot \mathbf {x} ^{k})(\mathbf {v} \cdot \mathbf {x} _{k})=(\mathbf {u} \cdot \mathbf {x} _{k})(\mathbf {v} \cdot \mathbf {x} ^{k})\end{aligned}}

Reciprocal bases.

In terms of a basis $x k$ and its reciprocal basis $\mathbf {x} ^{k}$ , the inner or dot product $u \cdot v$ of two vectors can be written four ways

{\begin{aligned}\mathbf {u} \cdot \mathbf {v} &=[(\mathbf {u} \cdot \mathbf {x} ^{j})\mathbf {x} _{j}]\cdot [(\mathbf {v} \cdot \mathbf {x} ^{k})\mathbf {x} _{k}]=u^{j}v^{k}\mathbf {x} _{j}\cdot \mathbf {x} _{k}=u^{j}v^{k}m_{jk}\\&=\left[(\mathbf {u} \cdot \mathbf {x} _{j})\mathbf {x} ^{j}\right]\cdot [(\mathbf {v} \cdot \mathbf {x} _{k})\mathbf {x} ^{k}]=u_{j}v_{k}\mathbf {x} ^{j}\cdot \mathbf {x} ^{k}=u_{j}v_{k}m^{jk}\\&=\left[(\mathbf {u} \cdot \mathbf {x} ^{j})\mathbf {x} _{j}\right]\cdot [(\mathbf {v} \cdot \mathbf {x} _{k})\mathbf {x} ^{k}]=u^{j}v_{k}m_{j}^{k}=u^{j}v_{k}\delta _{j}^{k}=u^{k}v_{k}\\&=\left[(\mathbf {u} \cdot \mathbf {x} _{j})\mathbf {x} ^{j}\right]\cdot [(\mathbf {v} \cdot \mathbf {x} ^{k})\mathbf {x} _{k}]=u_{j}v^{k}m_{k}^{j}=u_{j}v^{k}\delta _{k}^{j}=u_{k}v^{k}\end{aligned}}

In the language of tensors, $m$ is called the metric tensor of the basis, $δ$ is the Kronecker delta, an upper-indexed (superscripted) element is called contravariant, and a lower-indexed (subscripted) element is called covariant. Equating right-hand sides, we obtain the tensor contractions that are equivalent to the dot product

{\begin{aligned}u^{j}v^{k}m_{jk}&=u_{k}v^{k}=u^{j}v_{j}=\mathbf {u} \cdot \mathbf {v} \\u_{j}v_{k}m^{jk}&=u_{j}v^{j}=u^{k}v_{k}=\mathbf {u} \cdot \mathbf {v} \end{aligned}}

where in the first equation either $u^{j}m_{jk}=u_{k}$ or $v^{k}m_{jk}=v_{j}$ (index-lowering contractions), and in the second equation either $u_{j}m^{jk}=u^{k}$ or $v_{k}m^{jk}=v^{j}$ (index-raising contractions). The contraction that lowers the index on $u^{j}$ into $u_{k}$ expands to the sum

{\begin{aligned}u^{j}m_{jk}&=u^{1}\mathbf {x} _{1}\cdot \mathbf {x} _{k}+u^{2}\mathbf {x} _{2}\cdot \mathbf {x} _{k}+\cdots +u^{n}\mathbf {x} _{n}\cdot \mathbf {x} _{k}\\&=\left(u^{1}\mathbf {x} _{1}+u^{2}\mathbf {x} _{2}+\cdots +u^{n}\mathbf {x} _{n}\right)\cdot \mathbf {x} _{k}\\&=\left(u^{j}\mathbf {x} _{j}\right)\cdot \mathbf {x} _{k}=\mathbf {u} \cdot \mathbf {x} _{k}=u_{k}\end{aligned}}

Contractions are a form of inner product. Contractions such as these

{\begin{aligned}u_{k}&=\mathbf {u} \cdot \mathbf {x} _{k}=u_{j}\mathbf {x} ^{j}\cdot \mathbf {x} _{k}=u_{j}m_{k}^{j}=u_{j}\delta _{k}^{j}\\u^{k}&=\mathbf {u} \cdot \mathbf {x} ^{k}=u^{j}\mathbf {x} _{j}\cdot \mathbf {x} ^{k}=u^{j}m_{j}^{k}=u^{j}\delta _{j}^{k}\end{aligned}}

are called index renaming. Contractions involving $m$ and $\delta$ have many relations such as

{\begin{aligned}m_{1k}m^{1k}&=(\mathbf {x} _{1}\cdot \mathbf {x} _{k})(\mathbf {x} ^{1}\cdot \mathbf {x} ^{k})=(x_{1})_{k}(x^{1})^{k}=\mathbf {x} _{1}\cdot \mathbf {x} ^{1}=1\\m_{jk}m^{jk}&=n=m_{j}^{j}=m_{k}^{k}=\delta _{j}^{j}=\delta _{k}^{k}\\m_{j}^{i}m_{ik}&=(\mathbf {x} ^{i}\cdot \mathbf {x} _{j})(\mathbf {x} _{i}\cdot \mathbf {x} _{k})=(x_{j})^{i}(x_{k})_{i}=\mathbf {x} _{j}\cdot \mathbf {x} _{k}=m_{jk}\\m_{i}^{j}m^{ik}&=(\mathbf {x} ^{j}\cdot \mathbf {x} _{i})(\mathbf {x} ^{i}\cdot \mathbf {x} ^{k})=(x^{j})_{i}(x^{k})^{i}=\mathbf {x} ^{j}\cdot \mathbf {x} ^{k}=m^{jk}\end{aligned}}

When viewed as $n \times n$ matrices, $m_{jk}$ and $m^{jk}$ are inverse matrices. The matrices $m$ are symmetric, so the indices can be reversed. The contraction that computes the matrix product is

{\begin{aligned}m^{ji}m_{ik}&=(\mathbf {x} ^{j}\cdot \mathbf {x} ^{i})(\mathbf {x} _{i}\cdot \mathbf {x} _{k})=(x^{j})^{i}(x_{k})_{i}=\mathbf {x} ^{j}\cdot \mathbf {x} _{k}=m_{k}^{j}=\delta _{k}^{j}\\{}[m^{jk}]&=[m_{jk}]^{-1}\end{aligned}}

The Kronecker delta $\delta _{k}^{j}$ , viewed as a matrix, is the identity matrix. From this matrix product identity, the reciprocal bases $\mathbf {x} ^{j}$ can be computed as

{\begin{aligned}m^{ji}\mathbf {x} _{i}\cdot \mathbf {x} _{k}&=\mathbf {x} ^{j}\cdot \mathbf {x} _{k}\\m^{ji}\mathbf {x} _{i}&=\mathbf {x} ^{j}=(\mathbf {x} ^{j}\cdot \mathbf {x} ^{i})\mathbf {x} _{i}=(x^{j})^{i}\mathbf {x} _{i}\end{aligned}}

The formula $\mathbf {u} \cdot \mathbf {v} =u_{i}v^{i}=u^{i}v_{i}$ for the inner or dot product of vectors requires the terms to be products of covariant and contravariant component pairs. One of the vectors has to be expressed in terms of the reciprocal basis relative to the basis of the other vector. This requirement is satisfied when expressing vectors on an orthonormal basis that is self-reciprocal, but must be paid proper attention otherwise. The formula is often written

\mathbf {u} \cdot \mathbf {v} =\sum _{i}u_{i}v_{i},

but this is valid only if the vectors are both expressed on the same orthonormal basis $\mathbf {e} ^{k}=\mathbf {e} _{k}$ with $\mathbf {e} _{j}\cdot \mathbf {e} _{k}=\delta _{jk}$ .

The derivative operator $\nabla$ called del is often written as

\nabla =\sum _{i=1}^{n}\mathbf {e} _{i}{\frac {\partial }{\partial x_{i}}}=\mathbf {e} _{i}{\frac {\partial }{\partial x_{i}}}

where the $e i$ are an orthonormal standard basis with vectors written in the Cartesian form $\mathbf {x} =x_{j}\mathbf {e} _{j}$ . Del $\nabla$ can be treated as a vector in computations. It can also be written as

\nabla =\mathbf {x} ^{i}{\frac {\partial }{\partial r^{i}}}=\mathbf {x} _{i}{\frac {\partial }{\partial r_{i}}}

for a basis $\mathbf {x} _{i}$ and reciprocal basis $\mathbf {x} ^{i}$ , and position vector $\mathbf {r} =r^{j}\mathbf {x} _{j}=r_{j}\mathbf {x} ^{j}$ written in the tensor forms. For example, the divergence of $r$ can be computed several ways as

{\begin{aligned}\nabla \cdot \mathbf {r} &={\frac {\partial (\mathbf {x} ^{i}\cdot \mathbf {r} )}{\partial r^{i}}}={\frac {\partial r^{i}}{\partial r^{i}}}=\delta _{i}^{i}=n\\\nabla \cdot \mathbf {r} &={\frac {\partial (\mathbf {x} _{i}\cdot \mathbf {r} )}{\partial r_{i}}}={\frac {\partial r_{i}}{\partial r_{i}}}=\delta _{i}^{i}=n\\\nabla \cdot \mathbf {r} &={\frac {\partial }{\partial r_{i}}}\mathbf {x} _{i}\cdot (r^{j}\mathbf {x} _{j})={\frac {\partial }{\partial r_{i}}}r^{j}m_{ij}={\frac {\partial r_{i}}{\partial r_{i}}}=\delta _{i}^{i}=n\\\nabla \cdot \mathbf {r} &={\frac {\partial }{\partial r_{i}}}\mathbf {x} _{i}\cdot (r_{j}\mathbf {x} ^{j})={\frac {\partial }{\partial r_{i}}}r_{j}m_{i}^{j}={\frac {\partial r_{i}}{\partial r_{i}}}=\delta _{i}^{i}=n\\\nabla \cdot \mathbf {r} &={\frac {\partial }{\partial r^{i}}}\mathbf {x} ^{i}\cdot (r^{j}\mathbf {x} _{j})={\frac {\partial }{\partial r^{i}}}r^{j}m_{j}^{i}={\frac {\partial r^{i}}{\partial r^{i}}}=\delta _{i}^{i}=n\\\nabla \cdot \mathbf {r} &={\frac {\partial }{\partial r^{i}}}\mathbf {x} ^{i}\cdot (r_{j}\mathbf {x} ^{j})={\frac {\partial }{\partial r^{i}}}r_{j}m^{ij}={\frac {\partial r^{i}}{\partial r^{i}}}=\delta _{i}^{i}=n.\end{aligned}}

The derivative operator $\nabla$ can be applied further in this way as a vector, where

\nabla \mathbf {r} =\nabla \cdot \mathbf {r} +\nabla \wedge \mathbf {r}

in geometric calculus for vectors in any number of dimensions $n$ , and

\nabla \mathbf {r} =-\nabla \cdot \mathbf {r} +\nabla \times \mathbf {r}

in quaternions or vector analysis in three dimensions spanned by the orthonormal quaternion vector units $i, j$ , and $k$ .

For $n = 3$ , the product $\nabla \cdot \mathbf {r}$ is known as divergence, and the product $(\nabla \wedge \mathbf {r} )/\mathbf {I} _{3}=\nabla \times \mathbf {r}$ is known as curl. The value $\mathbf {I} _{3}=\mathbf {e} _{1}\wedge \mathbf {e} _{2}\wedge \mathbf {e} _{3}$ is the pseudoscalar of the Clifford algebra. Dividing the bivector $\nabla \wedge \mathbf {r}$ by the pseudoscalar $\mathbf {I} _{3}$ produces its spatial dual in the orthogonal vector space with the same magnitude, and oriented with sign in the expected direction for the curl vector. For a scalar field $f$ , the product $\nabla f$ is known as the gradient vector, which generalizes the scalar-valued derivative of a single-variable function to a vector-valued derivative of a multi-variable function $f$ .

In the rectilinear coordinates system (or affine or oblique coordinate system) that has been considered so far, the metric tensor $m$ has been a constant matrix containing constant ratios that relate to the amount of shearing that occurs in transforming from one rectilinear system to another. In a curvilinear coordinates system, the metric tensor $m$ may be variable and varies with the position vector $r$ . The local frame or basis $\mathbf {x} _{i}$ at $r$ can be defined as

\mathbf {x} _{i}={\frac {\partial \mathbf {r} }{\partial s^{i}}}={\frac {\partial r^{k}}{\partial s^{i}}}\mathbf {e} _{k}

where the position vector $\mathbf {r} =r^{k}\mathbf {e} _{k}$ . It can be assumed that $\mathbf {e} _{k}$ is a standard basis. Each $r^{k}$ is a function of the variables $s^{i}$ , and each $s^{i}$ is at least an implicit function of the variables $r^{k}$ such that the transformation is invertible. The basis $\mathbf {x} _{i}$ is a frame local to each position of $r$ in space, and may vary with position. The covariant metric tensor is

m_{ij}=\mathbf {x} _{i}\cdot \mathbf {x} _{j}={\frac {\partial r^{k}\mathbf {e} _{k}}{\partial s^{i}}}\cdot {\frac {\partial r^{l}\mathbf {e} _{l}}{\partial s^{j}}}={\frac {\partial r^{k}}{\partial s^{i}}}{\frac {\partial r^{l}}{\partial s^{j}}}\delta _{kl}=\sum _{k}{\frac {\partial r^{k}}{\partial s^{i}}}{\frac {\partial r^{k}}{\partial s^{j}}}

and in terms of the Jacobian matrix $\mathbf {J} ,m_{ij}$ is expressed as the matrix

{\begin{aligned}J_{ki}&={\frac {\partial r^{k}}{\partial s^{i}}}\\\mathbf {J} &=[J_{ki}]\\{}[m_{ij}]&=\left[\sum _{k}J_{ki}J_{kj}\right]=\mathbf {J} ^{\mathrm {T} }\mathbf {J} \end{aligned}}

The contravariant metric tensor $m^{ij}$ is again the matrix inverse of the covariant metric tensor

{}\left[m^{ij}\right]=[m_{ij}]^{-1}

and the contravariant or reciprocal basis is

\mathbf {x} ^{i}=m^{ij}\mathbf {x} _{j}

In a cylindrical coordinate system or spherical coordinate system, $m_{ij}$ is a diagonal matrix and $m^{ij}$ is easily found as the matrix with each element inverted.

Projecting a vector onto an orthogonal basis

Projections onto arbitrary bases $x k$ , as solved using Cramer’s rule as just above, treats projections onto orthogonal bases as only a special case. Projections onto mutually orthogonal bases can be achieved using the ordinary projection operation

a_{k}=\mathbf {c} \cdot {\frac {\mathbf {x} _{k}}{|\mathbf {x} _{k}|^{2}}}={\frac {\mathbf {c} \cdot \mathbf {x} _{k}}{\mathbf {x} _{k}\cdot \mathbf {x} _{k}}}={\frac {\mathbf {c} ^{||\mathbf {x} _{k}}\cdot \mathbf {x} _{k}}{\mathbf {x} _{k}\cdot \mathbf {x} _{k}}}

which is correct only if the $x k$ are mutually orthogonal. If the bases $\mathbf {x} _{1},\ldots ,\mathbf {x} _{k},\ldots ,\mathbf {x} _{n}$ are constrained to be mutually perpendicular (orthogonal), then the formula for Cramer’s rule becomes

{\begin{aligned}a_{k}&={\frac {\mathbf {x} _{1}\wedge \cdots \wedge (\mathbf {c} ^{||\mathbf {x} _{k}}+\mathbf {c} ^{\bot \mathbf {x} _{k}})_{k}\wedge \cdots \wedge \mathbf {x} _{n}}{\mathbf {x} _{1}\wedge \cdots \wedge \mathbf {x} _{k}\wedge \cdots \wedge \mathbf {x} _{n}}}\\&={\frac {\mathbf {x} _{1}\cdots (\mathbf {c} ^{||\mathbf {x} _{k}})_{k}\cdots \mathbf {x} _{n}}{\mathbf {x} _{1}\cdots \mathbf {x} _{k}\cdots \mathbf {x} _{n}}}={\frac {\mathbf {c} ^{||\mathbf {x} _{k}}\mathbf {x} _{1}\cdots ()_{k}\cdots \mathbf {x} _{n}}{\mathbf {x} _{k}\mathbf {x} _{1}\cdots ()_{k}\cdots \mathbf {x} _{n}}}\\&={\frac {\mathbf {c} ^{||\mathbf {x} _{k}}}{\mathbf {x} _{k}}}={\frac {\mathbf {c} ^{||\mathbf {x} _{k}}\mathbf {x} _{k}}{\mathbf {x} _{k}\mathbf {x} _{k}}}={\frac {\mathbf {c} \cdot \mathbf {x} _{k}}{\mathbf {x} _{k}\cdot \mathbf {x} _{k}}}\end{aligned}}

where $c$ has been written as a sum of vector components parallel and perpendicular to $x k$ . For any two perpendicular vectors $x j, x k$ , their exterior product $\mathbf {x} _{j}\wedge \mathbf {x} _{k}=\mathbf {x} _{j}\mathbf {x} _{k}$ equals their Clifford product. The vector component $\mathbf {c} ^{\bot \mathbf {x} _{k}}$ must be parallel to the other $\mathbf {x} _{j\neq k}$ , therefore its outermorphism is zero. The result is Cramer’s rule reduced to orthogonal projection of vector $c$ onto base $\mathbf {x} _{k}$ such that $\mathbf {c} ^{||\mathbf {x} _{k}}=a_{k}\mathbf {x} _{k}$ .

In general, the bases $\mathbf {x} _{1},\ldots ,\mathbf {x} _{k},\ldots ,\mathbf {x} _{n}$ are not necessarily mutually orthogonal and the projection to use is Cramer’s rule, generalized projection, not the dot product specific to orthogonal projection.

An orthonormal basis is identical to its reciprocal basis since

{\begin{aligned}\mathbf {x} ^{k}&=(-1)^{k-1}(\mathbf {x} _{1}\wedge \cdots \wedge ()_{k}\wedge \cdots \wedge \mathbf {x} _{n})\cdot (\mathbf {x} _{1}\wedge \cdots \wedge \mathbf {x} _{k}\wedge \cdots \wedge \mathbf {x} _{n})^{-1}\\&=(-1)^{k-1}(\mathbf {x} _{1}\cdots ()_{k}\cdots \mathbf {x} _{n})(\mathbf {x} _{n}\cdots ()_{k}\cdots \mathbf {x} _{1})\mathbf {x} _{k}(-1)^{k-1}\\&=\mathbf {x} _{k}\end{aligned}}

and $\mathbf {c} =(\mathbf {c} \cdot \mathbf {x} ^{k})\mathbf {x} _{k}$ with implied summation over the range of $k\in \{1,\cdots ,n\}$ . For an orthogonal basis, each reciprocal base is already shown to be

\mathbf {x} ^{k}={\frac {\mathbf {x} _{k}}{|\mathbf {x} _{k}|^{2}}}={\frac {\mathbf {x} _{k}}{\mathbf {x} _{k}\cdot \mathbf {x} _{k}}}={\frac {\mathbf {x} _{k}}{\mathbf {x} _{k}\mathbf {x} _{k}}}={\frac {1}{\mathbf {x} _{k}}}=\mathbf {x} _{k}^{-1}

which suggests the name reciprocal basis.

Solving a system of vector equations using SymPy

The free software SymPy, for symbolic mathematics using python, includes a Geometric Algebra Module and interactive calculator console isympy. The isympy console can be used to solve systems of vector equations using the formulas of this article. A simple example of console interaction follows to solve the system

{\begin{aligned}3\mathbf {v} _{1}+4\mathbf {v} _{2}+5\mathbf {v} _{3}&=\mathbf {c} _{1}=9\mathbf {e} _{1}+2\mathbf {e} _{2}+3\mathbf {e} _{3}\\2\mathbf {v} _{1}+3\mathbf {v} _{2}+7\mathbf {v} _{3}&=\mathbf {c} _{2}=6\mathbf {e} _{1}+5\mathbf {e} _{2}+8\mathbf {e} _{3}\\9\mathbf {v} _{1}+6\mathbf {v} _{2}+9\mathbf {v} _{3}&=\mathbf {c} _{3}=2\mathbf {e} _{1}+4\mathbf {e} _{2}+7\mathbf {e} _{3}\end{aligned}}

$isympy
>>> from sympy.galgebra.ga import *
>>> (e1,e2,e3,e4,e5,e6) = MV.setup('e*1|2|3|4|5|6',metric='[1,1,1,1,1,1]')
>>> (v1,v2,v3) = symbols('v1 v2 v3')
>>> (c1,c2,c3,C) = symbols('c1 c2 c3 C')
>>> (a1,a2,a3) = symbols('a1 a2 a3')
>>> a1 = 3*e4 + 2*e5 + 9*e6
>>> a2 = 4*e4 + 3*e5 + 6*e6
>>> a3 = 5*e4 + 7*e5 + 9*e6
>>> c1 = 9*e1 + 2*e2 + 3*e3
>>> c2 = 6*e1 + 5*e2 + 8*e3
>>> c3 = 2*e1 + 4*e2 + 7*e3
>>> C = (c1^e4) + (c2^e5) + (c3^e6)
>>> v1 = (C^a2^a3)|((-1)**(1-1)*MV.inv(a1^a2^a3))
>>> v2 = (a1^C^a3)|((-1)**(2-1)*MV.inv(a1^a2^a3))
>>> v3 = (a1^a2^C)|((-1)**(3-1)*MV.inv(a1^a2^a3))
>>> 3*v1 + 4*v2 + 5*v3
9*e_1 + 2*e_2 + 3*e_3
>>> 2*v1 + 3*v2 + 7*v3
6*e_1 + 5*e_2 + 8*e_3
>>> 9*v1 + 6*v2 + 9*v3
2*e_1 + 4*e_2 + 7*e_3

Incompatible and indeterminate cases

A system of equations is said to be incompatible or inconsistent when there are no solutions and it is called indeterminate when there is more than one solution. For linear equations, an indeterminate system will have infinitely many solutions (if it is over an infinite field), since the solutions can be expressed in terms of one or more parameters that can take arbitrary values.

Cramer's rule applies to the case where the coefficient determinant is nonzero. In the 2 × 2 case, if the coefficient determinant is zero, then the system is incompatible if the numerator determinants are nonzero, or indeterminate if the numerator determinants are zero.

For 3x3 or higher systems, the only thing one can say when the coefficient determinant equals zero is that if any of the numerator determinants are nonzero, then the system must be incompatible. However, having all determinants zero does not imply that the system is indeterminate. A simple example where all determinants vanish (equal zero) but the system is still incompatible is the 3x3 system x+y+z=1, x+y+z=2, x+y+z=3.

Notes

^ Cramer, Gabriel (1750). "Introduction à l'Analyse des lignes Courbes algébriques" (in French). Geneva: Europeana. pp. 656–659. Retrieved 2012-05-18.
^ MacLaurin, Colin (1748). A Treatise of Algebra, in Three Parts.
^ Boyer, Carl B. (1968). A History of Mathematics (2nd ed.). Wiley. p. 431.
^ Katz, Victor (2004). A History of Mathematics (Brief ed.). Pearson Education. pp. 378–379.
^ Hedman, Bruce A. (1999). "An Earlier Date for "Cramer's Rule"" (PDF). Historia Mathematica. 4(26) (4): 365–368. doi:10.1006/hmat.1999.2247 Template:Inconsistent citations{{cite journal}}: CS1 maint: postscript (link)
^ David Poole (2014). Linear Algebra: A Modern Introduction. Cengage Learning. p. 276. ISBN 978-1-285-98283-0.
^ Joe D. Hoffman; Steven Frankel (2001). Numerical Methods for Engineers and Scientists, Second Edition,. CRC Press. p. 30. ISBN 978-0-8247-0443-8.
^ Thomas S. Shores (2007). Applied Linear Algebra and Matrix Analysis. Springer Science & Business Media. p. 132. ISBN 978-0-387-48947-6.
^ Nicholas J. Higham (2002). Accuracy and Stability of Numerical Algorithms: Second Edition. SIAM. p. 132. ISBN 978-0-89871-521-7.
^ Ken Habgood, Itamar Arel (2012). "A condensation-based application of Cramerʼs rule for solving large-scale linear systems" (PDF). Journal of Discrete Algorithms. 10: 98–109. doi:10.1016/j.jda.2011.06.007.
^ Robinson, Stephen M. (1970). "A Short Proof of Cramer's Rule". Mathematics Magazine. 43: 94–95.

External links

[1] Cramer, Gabriel (1750). "Introduction à l'Analyse des lignes Courbes algébriques" (in French). Geneva: Europeana. pp. 656–659. Retrieved 2012-05-18.

[2] MacLaurin, Colin (1748). A Treatise of Algebra, in Three Parts.

[3] Boyer, Carl B. (1968). A History of Mathematics (2nd ed.). Wiley. p. 431.

[4] Katz, Victor (2004). A History of Mathematics (Brief ed.). Pearson Education. pp. 378–379.

[5] Hedman, Bruce A. (1999). "An Earlier Date for "Cramer's Rule"" (PDF). Historia Mathematica. 4(26) (4): 365–368. doi:10.1006/hmat.1999.2247 Template:Inconsistent citations{{cite journal}}: CS1 maint: postscript (link)

[Poole2014-6] David Poole (2014). Linear Algebra: A Modern Introduction. Cengage Learning. p. 276. ISBN 978-1-285-98283-0.

[HoffmanFrankel2001-7] Joe D. Hoffman; Steven Frankel (2001). Numerical Methods for Engineers and Scientists, Second Edition,. CRC Press. p. 30. ISBN 978-0-8247-0443-8.

[Shores2007-8] Thomas S. Shores (2007). Applied Linear Algebra and Matrix Analysis. Springer Science & Business Media. p. 132. ISBN 978-0-387-48947-6.

[Higham2002-9] Nicholas J. Higham (2002). Accuracy and Stability of Numerical Algorithms: Second Edition. SIAM. p. 132. ISBN 978-0-89871-521-7.

[10] Ken Habgood, Itamar Arel (2012). "A condensation-based application of Cramerʼs rule for solving large-scale linear systems" (PDF). Journal of Discrete Algorithms. 10: 98–109. doi:10.1016/j.jda.2011.06.007.

[11] Robinson, Stephen M. (1970). "A Short Proof of Cramer's Rule". Mathematics Magazine. 43: 94–95.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

v t e Linear algebra
Outline Glossary
Basic concepts	Scalar Vector Vector space Scalar multiplication Vector projection Linear span Linear map Linear projection Linear independence Linear combination Multilinear map Basis Change of basis Row and column vectors Row and column spaces Kernel Eigenvalues and eigenvectors Transpose Linear equations
Matrices	Block Decomposition Invertible Minor Multiplication Rank Transformation Cramer's rule Gaussian elimination Productive matrix
Bilinear	Orthogonality Dot product Hadamard product Inner product space Outer product Kronecker product Gram–Schmidt process
Multilinear algebra	Determinant Cross product Triple product Seven-dimensional cross product Geometric algebra Exterior algebra Bivector Multivector Tensor Outermorphism
Vector space constructions	Dual Direct sum Function space Quotient Subspace Tensor product
Numerical	Floating-point Numerical stability Basic Linear Algebra Subprograms Sparse matrix Comparison of linear algebra libraries
Category