This is an old revision of this page, as edited by Mwtoews(talk | contribs) at 02:17, 21 July 2017(modifications to example to allow it to work on older versions of Python, and other minor modifications). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.
Revision as of 02:17, 21 July 2017 by Mwtoews(talk | contribs)(modifications to example to allow it to work on older versions of Python, and other minor modifications)
The Gauss–Seidel method is an iterative technique for solving a square system of n linear equations with unknown x:
.
It is defined by the iteration
where is the kth approximation or iteration of is the next or k + 1 iteration of , and the matrix A is decomposed into a lower triangular component , and a strictly upper triangular component U: .[2]
In more detail, write out A, x and b in their components:
Then the decomposition of A into its lower triangular component and its strictly upper triangular component is given by:
The system of linear equations may be rewritten as:
The Gauss–Seidel method now solves the left hand side of this expression for x, using previous value for x on the right hand side. Analytically, this may be written as:
However, by taking advantage of the triangular form of , the elements of x(k+1) can be computed sequentially using forward substitution:
The procedure is generally continued until the changes made by an iteration are below some tolerance, such as a sufficiently small residual.
Discussion
The element-wise formula for the Gauss–Seidel method is extremely similar to that of the Jacobi method.
The computation of xi(k+1) uses only the elements of x(k+1) that have already been computed, and only the elements of x(k) that have not yet to be advanced to iteration k+1. This means that, unlike the Jacobi method, only one storage vector is required as elements can be overwritten as they are computed, which can be advantageous for very large problems.
However, unlike the Jacobi method, the computations for each element cannot be done in parallel. Furthermore, the values at each iteration are dependent on the order of the original equations.
The Gauss–Seidel method sometimes converges even if these conditions are not satisfied.
Algorithm
Since elements can be overwritten as they are computed in this algorithm, only one storage vector is needed, and vector indexing is omitted. The algorithm goes as follows:
Inputs: A, bOutput: Choose an initial guess to the solutionrepeat until convergence
forifrom 1 untilndoforjfrom 1 untilndoifj ≠ ithenend ifend (j-loop)
end (i-loop)
check if convergence is reached
end (repeat)
Examples
An example for the matrix version
A linear system shown as is given by:
and
We want to use the equation
in the form
where:
and
We must decompose into the sum of a lower triangular component and a strict upper triangular component :
and
The inverse of is:
.
Now we can find:
Now we have and and we can use them to obtain the vectors iteratively.
First of all, we have to choose : we can only guess. The better the guess, the quicker the algorithm will perform.
We suppose:
We can then calculate:
As expected, the algorithm converges to the exact solution:
In fact, the matrix A is strictly diagonally dominant (but not positive definite).
Another example for the matrix version
Another linear system shown as is given by:
and
We want to use the equation
in the form
where:
and
We must decompose into the sum of a lower triangular component and a strict upper triangular component :
and
The inverse of is:
.
Now we can find:
Now we have and and we can use them to obtain the vectors iteratively.
First of all, we have to choose : we can only guess. The better the guess, the quicker will perform the algorithm.
We suppose:
We can then calculate:
If we test for convergence we'll find that the algorithm diverges. In fact, the matrix A is neither diagonally dominant nor positive definite.
Then, convergence to the exact solution
is not guaranteed and, in this case, will not occur.
An example for the equation version
Suppose given k equations where xn are vectors of these equations and starting point x0.
From the first equation solve for x1 in terms of For the next equations substitute the previous values of xs.
To make it clear let's consider an example.
Solving for and gives:
Suppose we choose (0, 0, 0, 0) as the initial approximation, then the first
approximate solution is given by
Using the approximations obtained, the iterative procedure is repeated until
the desired accuracy has been reached. The following are the approximated
solutions after four iterations.
The exact solution of the system is (1, 2, −1, 1).
An example using Python and NumPy
The following numerical procedure simply iterates to produce the solution vector.
importnumpyasnpITERATION_LIMIT=1000# initialize the matrixA=np.array([[10.,-1.,2.,0.],[-1.,11.,-1.,3.],[2.,-1.,10.,-1.],[0.,3.,-1.,8.]])# initialize the RHS vectorb=np.array([6.,25.,-11.,15.])print("System of equations:")foriinrange(A.shape[0]):row=["{0:3g}*x{1}".format(A[i,j],j+1)forjinrange(A.shape[1])]print("[{0}] = [{1:3g}]".format(" + ".join(row),b[i]))x=np.zeros_like(b)forit_countinrange(1,ITERATION_LIMIT):x_new=np.zeros_like(x)print("Iteration {0}: {1}".format(it_count,x))foriinrange(A.shape[0]):s1=np.dot(A[i,:i],x_new[:i])s2=np.dot(A[i,i+1:],x[i+1:])x_new[i]=(b[i]-s1-s2)/A[i,i]ifnp.allclose(x,x_new,rtol=1e-8):breakx=x_newprint("Solution: {0}".format(x))error=np.dot(A,x)-bprint("Error: {0}".format(error))