Simplex algorithm

See Nelder–Mead method for the downhill simplex or amoeba method of optimization theory

In mathematical optimization theory, the simplex algorithm or simplex method, created by the American mathematician George Dantzig in 1947, is a popular algorithm for numerically solving linear programming problems. The journal Computing in Science and Engineering listed it as one of the top 10 algorithms of the twentieth century.^[1]

The name of the algorithm is derived from the concept of a simplex and was suggested by T. S. Motzkin.^[2] Simplices are not actually used in the method, but one interpretation of it is that it operates on simplicial cones and these become simplices with an additional constraint.^[3]

Efficiency

The simplex method is remarkably efficient in practice and was a great improvement over earlier methods such as Fourier–Motzkin elimination. It has been known since the 1970s that it has polynomial-time average-case complexity under various distributions. These results on "random" matrices still didn't quite capture the desired intuition that the method works well on "typical" matrices. In 2001 Spielman and Teng introduced the notion of smoothed complexity to provide a more realistic analysis of the performance of algorithms.^[4]

However, in 1972, Klee and Minty^[5] gave an example showing that the worst-case complexity of simplex method as formulated by Dantzig is exponential time. Since then, for almost every variation on the method, it has been shown that there is a family of linear programs for which it performs badly. It is an open question if there is a variation with polynomial time, or even sub-exponential worst-case complexity.

Other algorithms for solving linear programming problems are described in the linear programming article.

Overview

The simplex algorithm operates on linear programs in standard form, that is linear programming problems of the form,

Minimize

\mathbf {c} ^{T}\mathbf {x}

Subject to

\mathbf {A} \mathbf {x} =\mathbf {b} ,\,\mathbf {x} \geq 0

with $x=(x_{1},\dots ,x_{n})$ the variables of the problem, $c=(c_{1},\dots ,c_{n})$ are the coefficients of the objective function, A a p×n matrix and $b=(b_{1},\dots ,b_{p})$ constants. There is a straightforward process to convert any linear program into one in standard form so this results in no loss of generality.

In geometric terms, the feasible region

\mathbf {A} \mathbf {x} =\mathbf {b} ,\,\mathbf {x} \geq 0

is a (possibly unbounded) convex polytope. There is a simple characterization of the extreme points or vertices of this polytope, namely $x=(x_{1},\dots ,x_{n})$ is an extreme point if and only if the column vectors $A_{i}$ , where $x_{i}\neq 0$ , are linearly independent^[6]. In this context such a point is known as a basic feasible solution (BFS).

It can be shown that for a linear program in standard form, if the objective function has a minimum value on the feasible region then it has this value on (at least) one of the extreme points.^[7] This in itself reduces the problem to a finite computation since there are finite number of extreme points, however the number of extreme points is unmanageably large for all but the smallest linear programs.^[8]

It can also be shown that if an extreme point is not a minimum point of the objective function then there is an edge containing the point so that the objective function is strictly decreasing on the edge moving away from the point.^[9] If the edge is finite then the edge connects to another extreme point where the objective function has a smaller value, otherwise the objective function is unbounded below on the edge and the linear program has no solution. The simplex algorithm applies this insight by walking along edges of the polytope to extreme points with lower and lower objective values. This continues until the minimum value is reached or an unbounded edge is visited, concluding that the problem has no solution. The algorithm always terminates because the number of vertices in the polytope is finite; moreover since we jump between vertices always in the same direction (that of the objective function), we hope that the number of vertices visited will be small.^[10]

The solution of a linear program is accomplished in two steps. In the first step, known as Phase I, a starting extreme point is found. Depending on the nature of the program this may be trivial, but in general it can be solved by applying the simplex algorithm to a modified version of the original program. The possible results of Phase I are either a basic feasible solution is found or that the feasible region is empty. In the latter case the linear program is called infeasible. In the second step, Phase II, the simplex algorithm is applied using the basic feasible solution found in Phase I as a starting point. The possible results from Phase II are either an optimum basic feasible solution or an infinite edge on which the objective function is unbounded below.

Standard form

The transformation of a linear program to one in standard form may be accomplished as follows.^[11] First, for each variable with a lower bound other than 0, a new variable is introduced representing the difference between the variable and bound. The original variable can then be eliminated by substitution. For example, given the constraint

x_{1}\geq 5,\,

a new variable, y₁, is introduced with

y_{1}=x_{1}-5,\,x_{1}=y_{1}+5.

The second equation may be used to eliminate x₁ from the linear program. In this way, all lower bound constraints may be changed to nonnegativity restrictions.

Second, for each remaining inequality constraint, a new variable, called a slack variable, is introduced to change the constraint to an equality constraint. This variable represents the difference between the two sides of the inequality and is assumed to be nonnegative. For example the inequalities

x_{2}+2x_{3}\leq 3,\,

-x_{4}+3x_{5}\geq 2\,

are replaced with

x_{2}+2x_{3}+s_{1}=3,\,

-x_{4}+3x_{5}-s_{2}=2,\,

s_{1},\,s_{2}\geq 0.\,

It is much easier to perform algebraic manipulation on inequalities in this form. In inequalities where ≥ appears such as the second one, some authors refer to the variable introduced as a surplus variable.

Third, each unrestricted variable is eliminated from the linear program. This can be done in two ways, one is by solving for the variable in one of the equations in which it appears and then eliminating the variable by substitution. The other is to replace the variable with the difference of two restricted variables. For example if z₁ is unrestricted then write

z_{1}=z_{1}^{+}-z_{1}^{-},\,

z_{1}^{+},\,z_{1}^{-}\geq 0.\,

The equation may be used to eliminate z₁ from the linear program.

When this process is complete the feasible region will be in the form

\mathbf {A} \mathbf {x} =\mathbf {b} ,\,\mathbf {x} \geq 0.

It is also useful to assume that the rank of A is the number of rows. This results in no loss of generality since otherwise either the system Ax=b has redundant equations which can be dropped, or the system is inconsistent and the linear program has no solution.^[12]

Canonical tableaux

A linear program in standard form can be represented as a tableau of the form

{\begin{bmatrix}1&-\mathbf {c} ^{T}&0\\0&\mathbf {A} &\mathbf {b} \end{bmatrix}}.

The first row defines the objective function and the remaining rows specify the constraints. (Note, different authors use different conventions as to the exact layout.) If the columns of A can be rearranged so that it contains the identity matrix of order p then the tableau is said to be in canonical form.^[13] The variables corresponding to the columns of the identity matrix are called basic variables while the remaining variables are called nonbasic or free variables. If the nonbasic variables are assumed to be 0, then the values of the basic variables are easily obtained as entries in b and this solution is a basic feasible solution.

Conversely, given a basic feasible solution, the columns corresponding the nonzero variables can be expanded to a nonsingular matrix. If the corresponding tableau is multiplied by the inverse of this matrix then the result is a tableau in canonical form.^[14]

Let

{\begin{bmatrix}1&-\mathbf {c} _{B}^{T}&-\mathbf {c} _{D}^{T}&0\\0&I&\mathbf {D} &\mathbf {b} \end{bmatrix}}

be a tableau in canonical form. Additional row-addition transformations can be applied to remove the coefficients c^T_B from the objective function. This process is called pricing out and results in a canonical tableau

{\begin{bmatrix}1&0&-{\bar {\mathbf {c} }}_{D}^{T}&z_{B}\\0&I&\mathbf {D} &\mathbf {b} \end{bmatrix}}

where z_B is the value of the objective function at the corresponding basic feasible solution. The updated coefficients, also known as relative cost coefficients, are the rates of change of the objective function with respect to the nonbasic variables.

Pivot operations

The geometrical operation of moving from a basic feasible solution to an adjacent basic feasible solution is implemented as a pivot operation. First, a nonzero pivot element is selected in a nonbasic column. The row containing this element is multiplied by its reciprocal to change this element to 1, and then multiples of the row are added to the other rows to change the other entries in the column to 0. The result is that, if the pivot element is in row r, then the column becomes the r-th column of the identity matrix. The variable for this column is now a basic variable, replacing the variable which corresponded to the r-th column of the identity matrix before the operation. In effect, the variable corresponding to the pivot column enters the set of basic variables and is called the entering variable, and the variable being replaced leaves the set of basic variables and is called the leaving variable. The tableau is still in canonical form but with the set of basic variables changed by one element.

Algorithm

Let a linear program be given by a canonical tableau. The simplex algorithm proceeds by performing successive pivot operations which each give an improved basic feasible solution; the choice of pivot element at each step is largely determined by the requirement that this pivot does improve the solution.

Entering variable selection

Since the entering variable will, in general, increase from 0 to a positive number, the value of the objective function will decrease if the derivative of the objective function with respect to this function is negative. Equivalently, the value of the objective function is decreased if the pivot column is selected so that the corresponding entry in the objective row of the tableau is positive.

If there is more than one column so that the entry in the objective row is positive then the choice of which one to add to the set of basic variables is somewhat arbitrary and several entering variable choice rules^[15] have been developed.

If all the entries in the objective row are less than or equal to 0 then no choice of entering variable can be made and the solution is in fact optimal. It is easily seen to be optimal since the objective row now corresponds to an equation of the form

z(\mathbf {x} )=z_{B}+{\text{nonnegative terms corresponding to nonbasic variables}}

Note that by changing the entering variable choice rule so that it selects a column where the entry in the objective row is negative, the algorithm is changed so that it finds the maximum of the objective function rather than the minimum.

Leaving variable selection

Once the pivot column has been selected, the choice of pivot row is largely determined by the requirement that resulting solution will be feasible. First, only positive entries in the pivot column are considered since this guarantees that the value of the entering variable will be nonnegative. If there are no positive entries in the pivot column then the entering variable can take any nonnegative value with the solution remaining feasible. In this case the objective function is unbounded below and there is no minimum.

Next, the pivot row must be selected so that all the other basic variables remain positive. A calculation shows that this occurs when the resulting value of the entering variable is at a minimum. In other words, if the pivot column is c, then the pivot row r is chosen so that

b_{r}/a_{cr}\,

is the minimum over all r so that a_cr > 0. This is called the minimum ratio test.^[16] If there is more than one row for which the minimum is achieved then a dropping variable choice rule^[17] can be used to make can be used to make the determination.

Example

Consider the linear program

Minimize

Z=-2x-3y-4z\,

Subject to

3x+2y+z\leq 10

2x+5y+3z\leq 15.

x,\,y,\,z\geq 0

With the addition of slack variables s and t, this is represented by the canonical tableau

{\begin{bmatrix}1&2&3&4&0&0&0\\0&3&2&1&1&0&10\\0&2&5&3&0&1&15\end{bmatrix}}

where columns 5 and 6 represent the basic variables s and t and the corresponding basic feasible solution is

x=y=z=0,\,s=10,\,t=15.

Columns 2, 3, and 4 can be selected as pivot columns, for this example column 4 is selected. The values of x resulting from the choice of rows 2 and 3 as pivot rows are 10/1=10 and 15/3=5 respectively. Of these the minimum is 5, so row 3 must be the pivot row. Performing the pivot produces

{\begin{bmatrix}1&-{\tfrac {2}{3}}&-{\tfrac {11}{3}}&0&0&-{\tfrac {4}{3}}&-20\\0&{\tfrac {7}{3}}&{\tfrac {1}{3}}&0&1&-{\tfrac {1}{3}}&5\\0&{\tfrac {2}{3}}&{\tfrac {5}{3}}&1&0&{\tfrac {1}{3}}&5\end{bmatrix}}.

Now columns 4 and 5 represent the basic variables z and s and the corresponding basic feasible solution is

x=y=t=0,\,z=5,\,s=5.

For the next step, there are no positive entries in the objective row and in fact

Z=-20+{\tfrac {2}{3}}x+{\tfrac {11}{3}}y+{\tfrac {4}{3}}t

so the minimum value of Z is -20.

Finding an initial canonical tableau

In general, a linear program will not be given in canonical form and an equivalent canonical tableau must be found before the simplex algorithm can start. This can be accomplished by the introduction of artificial variables. Columns of the identity matrix are added as column vectors for these variables. The new tableau is in canonical form but it is not equivalent to the original problem. So a new objective function, equal to the sum of the artificial variables, is introduced and the simplex algorithm is applied to find the minimum; the modified linear program is called the Phase I problem.^[18]

The simplex algorithm applied to the Phase I problem must terminate with a minimum value for the new objective function since, being the sum of nonnegative variables, its value is bounded below by 0. If the minimum is 0 then the artificial variables can be eliminated from the resulting canonical tableau producing a canonical tableau equivalent to the original problem. The simplex algorithm can then be applied to find the solution; this step is called Phase II. If the minimum is greater than 0 then there is no feasible solution for the Phase I problem where the artificial variables are all 0. This implies that the feasible region for the original problem is, in fact, empty and the original problem has no solution.

Example

Consider the linear program

Minimize

Z=-2x-3y-4z\,

Subject to

3x+2y+z=10\,

2x+5y+3z=15\,

x,\,y,\,z\geq 0.

This is represented by the (non-canonical) tableau

{\begin{bmatrix}1&2&3&4&0\\0&3&2&1&10\\0&2&5&3&15\end{bmatrix}}.

Introduce artificial variables u and v and objective function W=u+v, giving a new tableau

{\begin{bmatrix}1&0&0&0&0&-1&-1&0\\0&1&2&3&4&0&0&0\\0&0&3&2&1&1&0&10\\0&0&2&5&3&0&1&15\end{bmatrix}}.

Note that the equation defining the original objective function is retained in anticipation of Phase II. After pricing out this becomes

{\begin{bmatrix}1&0&5&7&4&0&0&25\\0&1&2&3&4&0&0&0\\0&0&3&2&1&1&0&10\\0&0&2&5&3&0&1&15\end{bmatrix}}.

Select column 5 as a pivot column, so the pivot row must be row 4, and the updated tableau is

{\begin{bmatrix}1&0&{\tfrac {7}{3}}&{\tfrac {1}{3}}&0&0&-{\tfrac {4}{3}}&5\\0&1&-{\tfrac {2}{3}}&-{\tfrac {11}{3}}&0&0&-{\tfrac {4}{3}}&-20\\0&0&{\tfrac {7}{3}}&{\tfrac {1}{3}}&0&1&-{\tfrac {1}{3}}&5\\0&0&{\tfrac {2}{3}}&{\tfrac {5}{3}}&1&0&{\tfrac {1}{3}}&5\end{bmatrix}}.

Now select column 3 as a pivot column, for which row 3 must be the pivot row, to get

{\begin{bmatrix}1&0&0&0&0&-1&-1&0\\0&1&0&-{\tfrac {25}{7}}&0&{\tfrac {2}{7}}&-{\tfrac {10}{7}}&-{\tfrac {130}{7}}\\0&0&1&{\tfrac {1}{7}}&0&{\tfrac {3}{7}}&-{\tfrac {1}{7}}&{\tfrac {15}{7}}\\0&0&0&{\tfrac {11}{7}}&1&-{\tfrac {2}{7}}&{\tfrac {3}{7}}&{\tfrac {25}{7}}\end{bmatrix}}.

The artificial variables are now 0 and they may be dropped giving a canonical tableau equivalent to the original problem:

{\begin{bmatrix}1&0&-{\tfrac {25}{7}}&0&-{\tfrac {130}{7}}\\0&1&{\tfrac {1}{7}}&0&{\tfrac {15}{7}}\\0&0&{\tfrac {11}{7}}&1&{\tfrac {25}{7}}\end{bmatrix}}.

This is, fortuitously, already optimal and the optimum value for the original linear program is -130/7.

Degeneracy and cycling

If the values of all basic variables are strictly greater than 0, then a pivot must result in an improvement in the objective value. When this is always the case no set of basic variables occurs twice and the simplex algorithm must terminate after a finite number of steps. Basic feasible solutions where at least one of the basic variables is 0 are called degenerate and may result in pivots for which there is no improvement in the objective value. In this case there is no actual change in the solution but only a change in the set of basic variables. When several such pivots occur in succession there is the possibility the same set of basic variables will occur more than once and the simplex algorithm will cycle without producing a result. This occurs rarely in practice, though it depends on the nature of the linear program. Special entering variable and leaving variable selection rules have been devised which prevent cycling and thus guarantee that the simplex algorithm always terminates.^[19]

Implementation

The tableau form used above to describe the algorithm lends itself to an immediate implementation in which the tableau is maintained as a rectangular (m+1)-by-(m+n+1) array. It is straightforward to avoid storing the m explicit columns of the identity matrix that will occur within the tableau by virtue of B being a subset of the columns of $[\mathbf {A} \,\mathbf {I} ]$ . This implementation is referred to as the standard simplex method. The storage and computation overhead are such that the standard simplex method is a prohibitively expensive approach to solving large linear programming problems.

In each simplex iteration, the only data required are the first row of the tableau, the (pivotal) column of the tableau corresponding to the entering variable and the right-hand-side. The latter can be updated using the pivotal column and the first row of the tableau can be updated using the (pivotal) row corresponding to the leaving variable. Both the pivotal column and pivotal row may be computed directly using the solutions of linear systems of equations involving the matrix B and a matrix-vector product using A. These observations motivate the revised simplex method, for which implementations are distinguished by their invertible representation of B.

In large linear programming problems A is typically a sparse matrix and, when the resulting sparsity of B is exploited when maintaining its invertible representation, the revised simplex method is a vastly more efficient solution procedure than the standard simplex method. Commercial simplex solvers are based on the primal (or dual) revised simplex method.

Fractional linear programming

Linear-fractional programming (LFP) is a generalization of linear programming (LP). Where the objective function of linear programs are linear functions, the objective function of a linear-fractional program is a ratio of two linear functions. In other words, a linear program is a fractional-linear program in which the denominator is the constant function having the value one everywhere. A fractional-linear program can be solved by a variant of the simplex algorithm.^[20]^[21]^[22]

References

^ Computing in Science and Engineering, volume 2, no. 1, 2000
^ Murty, Comment 2.2
^ Murty, Note 3.9
^ Spielman, Daniel; Teng, Shang-Hua (2001). "Smoothed analysis of algorithms: why the simplex algorithm usually takes polynomial time". Proceedings of the Thirty-Third Annual ACM Symposium on Theory of Computing. ACM. pp. 296–305. doi:10.1145/380752.380813. ISBN 978-1-58113-349-3. arXiv:cs/0111050 Template:Inconsistent citations{{cite book}}: CS1 maint: postscript (link)
^ Greenberg, cites: V. Klee and G.J. Minty. "How Good is the Simplex Algorithm?" In O. Shisha, editor, Inequalities, III, pages 159–175. Academic Press, New York, NY, 1972
^ Murty, Theorem 3.1
^ Murty, Theorem 3.3
^ Murty, Section 3.13 p. 143
^ Murty, Section 3.8 p. 137
^ Murty, Section 3.8 p. 137
^ Follows Murty, Section 2.2
^ Murty, p. 173
^ Murty, section 2.3.2
^ Murty, section 3.12
^ Murty p. 66
^ Murty p. 66
^ Murty p. 67
^ Murty p.60
^ Murty p. 79
^ Chapter five: Craven, B. D. (1988). Fractional programming. Sigma Series in Applied Mathematics. Vol. 4. Berlin: Heldermann Verlag. p. 145. ISBN 3-88538-404-3. MR949209. {{cite book}}: Cite has empty unknown parameter: |1= (help)
^ Template:Cite article
^ Template:Cite article

Katta G. Murty, Linear Programming, Wiley, 1983. (comprehensive textbook and reference, through ellipsiodal algorithm of Khachiyan)

External links

An Introduction to Linear Programming and the Simplex Algorithm by Spyros Reveliotis of the Georgia Institute of Technology.
Greenberg, Harvey J., Klee-Minty Polytope Shows Exponential Time Complexity of Simplex Method University of Colorado at Denver (1997) PDF download
LP-Explorer A Java-based tool which, for problems in two variables, relates the algebraic and geometric views of the tableau simplex method. Also illustrates the sensitivity of the solution to changes in the right-hand-side.
Java-based interactive simplex tool hosted by Argonne National Laboratory.
Simplex Method A good tutorial for Simplex Method with examples (also two-phase and M-method).
Complex An Open Source Application for Linear Programming that resolves a giving model and shows step to step the iterations of the simplex method. (Penalization Method and Two Phases Method).

[1] Computing in Science and Engineering, volume 2, no. 1, 2000

[2] Murty, Comment 2.2

[3] Murty, Note 3.9

[4] Spielman, Daniel; Teng, Shang-Hua (2001). "Smoothed analysis of algorithms: why the simplex algorithm usually takes polynomial time". Proceedings of the Thirty-Third Annual ACM Symposium on Theory of Computing. ACM. pp. 296–305. doi:10.1145/380752.380813. ISBN 978-1-58113-349-3. arXiv:cs/0111050 Template:Inconsistent citations{{cite book}}: CS1 maint: postscript (link)

[greenberg-5] Greenberg, cites: V. Klee and G.J. Minty. "How Good is the Simplex Algorithm?" In O. Shisha, editor, Inequalities, III, pages 159–175. Academic Press, New York, NY, 1972

[6] Murty, Theorem 3.1

[7] Murty, Theorem 3.3

[8] Murty, Section 3.13 p. 143

[9] Murty, Section 3.8 p. 137

[10] Murty, Section 3.8 p. 137

[11] Follows Murty, Section 2.2

[12] Murty, p. 173

[13] Murty, section 2.3.2

[14] Murty, section 3.12

[15] Murty p. 66

[16] Murty p. 66

[17] Murty p. 67

[18] Murty p.60

[19] Murty p. 79

[20] Chapter five: Craven, B. D. (1988). Fractional programming. Sigma Series in Applied Mathematics. Vol. 4. Berlin: Heldermann Verlag. p. 145. ISBN 3-88538-404-3. MR949209. {{cite book}}: Cite has empty unknown parameter: |1= (help)

[21] Template:Cite article

[22] Template:Cite article

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

[22]

v t e Complementarity problems and algorithms
Complementarity Problems	Linear programming (LP) Quadratic programming (QP) Linear complementarity problem (LCP) Mixed linear (MLCP) Mixed (MCP) Nonlinear (NCP)
Basis-exchange algorithms	Simplex (Dantzig) Revised simplex Criss-cross Lemke

Simplex algorithm

Efficiency

Overview

Standard form

Canonical tableaux

Pivot operations

Algorithm

Entering variable selection

Leaving variable selection

Example

Finding an initial canonical tableau

Example

Degeneracy and cycling

Implementation

Fractional linear programming

See also

References

Further reading

External links