# Tridiagonal matrix algorithm

In numerical linear algebra, the tridiagonal matrix algorithm, also known as the Thomas algorithm (named after Llewellyn Thomas), is a simplified form of Gaussian elimination that can be used to solve tridiagonal systems of equations. A tridiagonal system for n unknowns may be written as

$a_{i}x_{i-1}+b_{i}x_{i}+c_{i}x_{i+1}=d_{i},$ where $a_{1}=0$ and $c_{n}=0$ .

${\begin{bmatrix}b_{1}&c_{1}&&&0\\a_{2}&b_{2}&c_{2}&&\\&a_{3}&b_{3}&\ddots &\\&&\ddots &\ddots &c_{n-1}\\0&&&a_{n}&b_{n}\end{bmatrix}}{\begin{bmatrix}x_{1}\\x_{2}\\x_{3}\\\vdots \\x_{n}\end{bmatrix}}={\begin{bmatrix}d_{1}\\d_{2}\\d_{3}\\\vdots \\d_{n}\end{bmatrix}}.$ For such systems, the solution can be obtained in $O(n)$ operations instead of $O(n^{3})$ required by Gaussian elimination. A first sweep eliminates the $a_{i}$ 's, and then an (abbreviated) backward substitution produces the solution. Examples of such matrices commonly arise from the discretization of 1D Poisson equation and natural cubic spline interpolation.

Thomas' algorithm is not stable in general, but is so in several special cases, such as when the matrix is diagonally dominant (either by rows or columns) or symmetric positive definite; for a more precise characterization of stability of Thomas' algorithm, see Higham Theorem 9.12. If stability is required in the general case, Gaussian elimination with partial pivoting (GEPP) is recommended instead.

## Method

The forward sweep consists of the computation of new coefficients as follows, denoting the new coefficients with primes:

$c'_{i}={\begin{cases}{\cfrac {c_{i}}{b_{i}}},&i=1,\\{\cfrac {c_{i}}{b_{i}-a_{i}c'_{i-1}}},&i=2,3,\dots ,n-1\end{cases}}$ and

$d'_{i}={\begin{cases}{\cfrac {d_{i}}{b_{i}}},&i=1,\\{\cfrac {d_{i}-a_{i}d'_{i-1}}{b_{i}-a_{i}c'_{i-1}}},&i=2,3,\dots ,n.\end{cases}}$ The solution is then obtained by back substitution:

$x_{n}=d'_{n},$ $x_{i}=d'_{i}-c'_{i}x_{i+1},\quad i=n-1,n-2,\ldots ,1.$ The method above does not modify the original coefficient vectors, but must also keep track of the new coefficients. If the coefficient vectors may be modified, then an algorithm with less bookkeeping is:

For $i=2,3,\dots ,n,$ do

$w={\cfrac {a_{i}}{b_{i-1}}},$ $b_{i}:=b_{i}-wc_{i-1},$ $d_{i}:=d_{i}-wd_{i-1},$ followed by the back substitution

$x_{n}={\cfrac {d_{n}}{b_{n}}},$ $x_{i}={\cfrac {d_{i}-c_{i}x_{i+1}}{b_{i}}}\quad {\text{for }}i=n-1,n-2,\dots ,1.$ The implementation in a VBA subroutine without preserving the coefficient vectors:

Sub TriDiagonal_Matrix_Algorithm(N%, A#(), B#(), C#(), D#(), X#())
Dim i%, W#
For i = 2 To N
W = A(i) / B(i - 1)
B(i) = B(i) - W * C(i - 1)
D(i) = D(i) - W * D(i - 1)
Next i
X(N) = D(N) / B(N)
For i = N - 1 To 1 Step -1
X(i) = (D(i) - C(i) * X(i + 1)) / B(i)
Next i
End Sub


## Derivation

The derivation of the tridiagonal matrix algorithm is a special case of Gaussian elimination.

Suppose that the unknowns are $x_{1},\ldots ,x_{n}$ , and that the equations to be solved are:

{\begin{alignedat}{4}&&&b_{1}x_{1}&&+c_{1}x_{2}&&=d_{1}\\&a_{i}x_{i-1}&&+b_{i}x_{i}&&+c_{i}x_{i+1}&&=d_{i}\,,\quad i=2,\ldots ,n-1\\&a_{n}x_{n-1}&&+b_{n}x_{n}&&&&=d_{n}\,.\end{alignedat}} Consider modifying the second ($i=2$ ) equation with the first equation as follows:

$({\mbox{equation 2}})\cdot b_{1}-({\mbox{equation 1}})\cdot a_{2}$ which would give:

$(b_{2}b_{1}-c_{1}a_{2})x_{2}+c_{2}b_{1}x_{3}=d_{2}b_{1}-d_{1}a_{2}.$ Note that $x_{1}$ has been eliminated from the second equation. Using a similar tactic with the modified second equation on the third equation yields:

$(b_{3}(b_{2}b_{1}-c_{1}a_{2})-c_{2}b_{1}a_{3})x_{3}+c_{3}(b_{2}b_{1}-c_{1}a_{2})x_{4}=d_{3}(b_{2}b_{1}-c_{1}a_{2})-(d_{2}b_{1}-d_{1}a_{2})a_{3}.\,$ This time $x_{2}$ was eliminated. If this procedure is repeated until the $n^{th}$ row; the (modified) $n^{th}$ equation will involve only one unknown, $x_{n}$ . This may be solved for and then used to solve the $(n-1)^{th}$ equation, and so on until all of the unknowns are solved for.

Clearly, the coefficients on the modified equations get more and more complicated if stated explicitly. By examining the procedure, the modified coefficients (notated with tildes) may instead be defined recursively:

${\tilde {a}}_{i}=0\,$ ${\tilde {b}}_{1}=b_{1}\,$ ${\tilde {b}}_{i}=b_{i}{\tilde {b}}_{i-1}-{\tilde {c}}_{i-1}a_{i}\,$ ${\tilde {c}}_{1}=c_{1}\,$ ${\tilde {c}}_{i}=c_{i}{\tilde {b}}_{i-1}\,$ ${\tilde {d}}_{1}=d_{1}\,$ ${\tilde {d}}_{i}=d_{i}{\tilde {b}}_{i-1}-{\tilde {d}}_{i-1}a_{i}.\,$ To further hasten the solution process, ${\tilde {b}}_{i}$ may be divided out (if there's no division by zero risk), the newer modified coefficients, each notated with a prime, will be:

$a'_{i}=0\,$ $b'_{i}=1\,$ $c'_{1}={\frac {c_{1}}{b_{1}}}\,$ $c'_{i}={\frac {c_{i}}{b_{i}-c'_{i-1}a_{i}}}\,$ $d'_{1}={\frac {d_{1}}{b_{1}}}\,$ $d'_{i}={\frac {d_{i}-d'_{i-1}a_{i}}{b_{i}-c'_{i-1}a_{i}}}.\,$ This gives the following system with the same unknowns and coefficients defined in terms of the original ones above:

${\begin{array}{lcl}x_{i}+c'_{i}x_{i+1}=d'_{i}\qquad &;&\ i=1,\ldots ,n-1\\x_{n}=d'_{n}\qquad &;&\ i=n.\\\end{array}}\,$ The last equation involves only one unknown. Solving it in turn reduces the next last equation to one unknown, so that this backward substitution can be used to find all of the unknowns:

$x_{n}=d'_{n}\,$ $x_{i}=d'_{i}-c'_{i}x_{i+1}\qquad ;\ i=n-1,n-2,\ldots ,1.$ ## Variants

In some situations, particularly those involving periodic boundary conditions, a slightly perturbed form of the tridiagonal system may need to be solved:

{\begin{alignedat}{4}&a_{1}x_{n}&&+b_{1}x_{1}&&+c_{1}x_{2}&&=d_{1}\\&a_{i}x_{i-1}&&+b_{i}x_{i}&&+c_{i}x_{i+1}&&=d_{i}\,,\quad i=2,\ldots ,n-1\\&a_{n}x_{n-1}&&+b_{n}x_{n}&&+c_{n}x_{1}&&=d_{n}\,.\end{alignedat}} In this case, we can make use of the Sherman–Morrison formula to avoid the additional operations of Gaussian elimination and still use the Thomas algorithm. The method requires solving a modified non-cyclic version of the system for both the input and a sparse corrective vector, and then combining the solutions. This can be done efficiently if both solutions are computed at once, as the forward portion of the pure tridiagonal matrix algorithm can be shared.

If we indicate by:

$A={\begin{bmatrix}b_{1}&c_{1}&&&a_{1}\\a_{2}&b_{2}&c_{2}&&\\&a_{3}&b_{3}&\ddots &\\&&\ddots &\ddots &c_{n-1}\\c_{n}&&&a_{n}&b_{n}\end{bmatrix}},x={\begin{bmatrix}x_{1}\\x_{2}\\x_{3}\\\vdots \\x_{n}\end{bmatrix}},d={\begin{bmatrix}d_{1}\\d_{2}\\d_{3}\\\vdots \\d_{n}\end{bmatrix}}$ Then the system to be solved is:

$Ax=d$ In this case the coefficients $a_{1}$ and $c_{n}$ are, generally speaking, non-zero, so their presence does not allow to apply the Thomas algorithm directly. We can therefore consider $B\in \mathbb {R} ^{n\times n}$ and $u,v\in \mathbb {R} ^{n}$ as following:

$B={\begin{bmatrix}b_{1}-\gamma &c_{1}&&&0\\a_{2}&b_{2}&c_{2}&&\\&a_{3}&b_{3}&\ddots &\\&&\ddots &\ddots &c_{n-1}\\0&&&a_{n}&b_{n}-{\frac {c_{n}a_{1}}{\gamma }}\end{bmatrix}},u={\begin{bmatrix}\gamma \\0\\0\\\vdots \\c_{n}\end{bmatrix}},v={\begin{bmatrix}1\\0\\0\\\vdots \\a_{1}/\gamma \end{bmatrix}}.$ Where $\gamma \in \mathbb {R}$ is a parameter to be chosen. The matrix $A$ can be reconstructed as $A=B+uv^{\mathsf {T}}$ . The solution is then obtained in the following way: first we solve two tridiagonal systems of equations applying the Thomas algorithm:
$By=d\qquad \qquad Bq=u$ Then we reconstruct the solution $x$ using the Shermann-Morrison formula:

{\begin{aligned}x&=A^{-1}d=(B+uv^{T})^{-1}d=B^{-1}d-{\frac {B^{-1}uv^{T}B^{-1}}{1+v^{T}B^{-1}u}}d=y-{\frac {qv^{T}y}{1+v^{T}q}}\end{aligned}} The implementation in C without preserving the coefficient vectors:

typedef struct{
double A[n+2];
double B[n+2];
double C[n+2];
double D[n+2];
} COEFFICIENTS;

//Apply Thomas Alg., unknowns x,...,x[n]
void ThomasAlg(double x[n+1], COEFFICIENTS* coeff){
double u[n+1]={},v[n+1]={};
double y[n+1]={},q[n+1]={};
double* A=coeff->A, *B=coeff->B,*C=coeff->C,*D=coeff->D;
double Value=0;
double w;
int i;

u=gamma;
u[n]=C[n];

v=1;
v[n]=A/gamma;

//create matrix B
A=0;
B=B-gamma;
B[n]=B[n]-(C[n]*A[n])/gamma;
C[n]=0;

for(i=2;i<n+1;i++){
w=A[i]/B[i-1];
B[i]=B[i]-w*C[i-1];
D[i]=D[i]-w*D[i-1];
u[i]=u[i]-w*u[i-1];
}
y[n]=D[n]/B[n];
q[n]=u[n]/B[n];

for(i=n-1;i>0;i--){
y[i]=(D[i]-C[i]*y[i+1])/B[i];
q[i]=(u[i]-C[i]*q[i+1])/B[i];
}
Value=(v*y+v[n]*y[n])/(1+v*q+v[n]*q[n]);

for(i=1;i<n+1;i++){
x[i]=y[i]-q[i]*Value;
}
}


There is also another way to solve the slightly perturbed form of the tridiagonal system considered above. Let us consider two auxiliary linear systems of dimension $(n-1)\times (n-1)$ :

{\begin{aligned}\qquad \ \ \ \ \ b_{2}u_{2}+c_{2}u_{3}&=d_{2}\\a_{3}u_{2}+b_{3}u_{3}+c_{3}u_{4}&=d_{3}\\a_{i}u_{i-1}+b_{i}u_{i}+c_{i}u_{i+1}&=d_{i}\\\dots \\a_{n}u_{n-1}+b_{n}u_{n}\qquad &=d_{n}\,.\end{aligned}}\quad i=4,\ldots ,n-1\qquad \qquad {\begin{aligned}\qquad \ \ \ \ \ b_{2}v_{2}+c_{2}v_{3}&=-a_{2}\\a_{3}v_{2}+b_{3}v_{3}+c_{3}v_{4}&=0\\a_{i}u_{i-1}+b_{i}u_{i}+c_{i}u_{i+1}&=0\\\dots \\a_{n}v_{n-1}+b_{n}v_{n}\qquad &=-c_{n}\,.\end{aligned}}\quad i=4,\ldots ,n-1 For convenience, we additionally define $u_{1}=0$ and $v_{1}=1$ . We can now find the solutions $\{u_{2},u_{3}\dots ,u_{n}\}$ and $\{v_{2},v_{3}\dots ,v_{n}\}$ applying Thomas algorithm to the two auxiliary tridiagonal system.

The solution $\{x_{1},x_{2}\dots ,x_{n}\}$ can be then represented in the form:

$x_{i}=u_{i}+x_{1}v_{i}\qquad i=1,2,\dots ,n$ Indeed, multiplying each equation of the second auxiliary system by $x_{1}$ , adding with the corresponding equation of the first auxiliary system and using the representation $x_{i}=u_{i}+x_{1}v_{i}$ , we immediately see that equations number $2$ through $n$ of the original system are satisfied; it only remains to satisfy equation number $1$ . To do so, consider formula for $i=2$ and $i=n$ and substitute $x_{2}=u_{2}+x_{1}v_{2}$ and $x_{n}=u_{n}+x_{1}v_{n}$ into the first equation of the original system. This yields one scalar equation for $x_{1}$ :

$b_{1}x_{1}+c_{1}(u_{2}+x_{1}v_{2})+a_{1}(u_{n}+x_{1}v_{n})=d_{1}$ As such, we find:

$x_{1}={\frac {d_{1}-a_{1}u_{n}-c_{1}u_{2}}{b_{1}+a_{1}v_{n}+c_{1}v_{2}}}$ The implementation in Dev-C++ without preserving the coefficient vectors:

typedef struct{
double A[n+2];
double B[n+2];
double C[n+2];
double D[n+2];
} COEFFICIENTS;

//Apply Thomas Alg., unknowns x,...,x[n]
void ThomasAlg(double x[n+1],COEFFICIENTS* coeff){
double u[n+1]={},v[n+1]={};
double* A=coeff->A, *B=coeff->B,*C=coeff->C,*D=coeff->D;
double w,F[n+1]={};
F=-A;
F[n]=-C[n];
int i;
u=0;
v=1;
for(i=3;i<n+1;i++){
w=A[i]/B[i-1];
B[i]=B[i]-w*C[i-1];
D[i]=D[i]-w*D[i-1];
F[i]=F[i]-w*F[i-1];
}
u[n]=D[n]/B[n];
v[n]=F[n]/B[n];
for(i=n-1;i>1;i--){
u[i]=(D[i]-C[i]*u[i+1])/B[i];
v[i]=(F[i]-C[i]*v[i+1])/B[i];
}
x=(D-A*u[n]-C*u)/(B+A*v[n]+C*v);

for(i=2;i<n+1;i++){
x[i]=u[i]+x*v[i];
}
}


In both cases the auxiliary systems to be solved are genuinely tri-diagonal, so the overall computational complexity of solving system $Ax=d$ remains linear with the respect to the dimension of the system $n$ , that is $O(n)$ arithmetic operations.

In other situations, the system of equations may be block tridiagonal (see block matrix), with smaller submatrices arranged as the individual elements in the above matrix system (e.g., the 2D Poisson problem). Simplified forms of Gaussian elimination have been developed for these situations.

The textbook Numerical Mathematics by Quarteroni, Sacco and Saleri, lists a modified version of the algorithm which avoids some of the divisions (using instead multiplications), which is beneficial on some computer architectures.

Parallel tridiagonal solvers have been published for many vector and parallel architectures, including GPUs

For an extensive treatment of parallel tridiagonal and block tridiagonal solvers see