Computing the permanent

From Wikipedia, the free encyclopedia
Jump to: navigation, search

In mathematics, the computation of the permanent of a matrix is a problem that is known to be more difficult than the computation of the determinant of a matrix despite the apparent similarity of the definitions.

The permanent is defined similarly to the determinant, as a sum of products of sets of matrix entries that lie in distinct rows and columns. However, where the determinant weights each of these products with a ±1 sign based on the parity of the set, the permanent weights them all with a + 1 sign.

While the determinant can be computed in polynomial time by Gaussian elimination, the permanent cannot. In computational complexity theory, a theorem of Valiant states that computing permanents is #P-hard, and even #P-complete for matrices in which all entries are 0 or 1.Valiant (1979) This puts the computation of the permanent in a class of problems believed to be even more difficult to compute than NP. It is known that computing the permanent is impossible for logspace-uniform ACC0 circuits.(Allender & Gore 1994)

The development of both exact and approximate algorithms for computing the permanent of a matrix is an active area of research.

Definition and naive algorithm[edit]

The permanent of an n-by-n matrix A = (ai,j) is defined as

 \operatorname{perm}(A)=\sum_{\sigma\in S_n}\prod_{i=1}^n a_{i,\sigma(i)}.

The sum here extends over all elements σ of the symmetric group Sn, i.e. over all permutations of the numbers 1, 2, ..., n. This formula differs from the corresponding formula for the determinant only in that, in the determinant, each product is multiplied by the sign of the permutation σ while in this formula each product is unsigned. The formula may be directly translated into an algorithm that naively expands the formula, summing over all permutations and within the sum multiplying out each matrix entry. This requires n! n arithmetic operations.

Ryser formula[edit]

The fastest known[1] general exact algorithm is due to H. J. Ryser (1963). Ryser’s method is based on an inclusion–exclusion formula that can be given[2] as follows: Let A_k be obtained from A by deleting k columns, let P(A_k) be the product of the row-sums of A_k, and let \Sigma_k be the sum of the values of P(A_k) over all possible A_k. Then

 \operatorname{perm}(A)=\sum_{k=0}^{n-1} (-1)^{k}\Sigma_k.

It may be rewritten in terms of the matrix entries as follows[3]

\operatorname{perm} (A) = (-1)^n \sum_{S\subseteq\{1,\dots,n\}} (-1)^{|S|} \prod_{i=1}^n \sum_{j\in S} a_{ij}.

Ryser’s formula can be evaluated using O(2^nn^2) arithmetic operations, or O(2^nn) by processing the sets S in Gray code order.

Glynn formula[edit]

Another formula that appears to be as fast as Ryser's is closely related to the polarization identity for a symmetric tensor (Glynn 2010).

It has the formula (when the characteristic of the field is not two)

\operatorname{perm}(A) = \left[\sum_\delta \left(\prod_{k=1}^m \delta_k\right) \prod_{j=1}^m \sum_{i=1}^m \delta_i a_{ij}\right] / 2^{m-1},

where the outer sum is over all 2^{m-1} vectors \delta=(\delta_1=1,\delta_2,\dots,\delta_m)\in \{\pm1\}^m.

Special cases[edit]

Planar and K3,3-free[edit]

The number of perfect matchings in a bipartite graph is counted by the permanent of the graph's biadjacency matrix, and the permanent of any 0-1 matrix can be interpreted in this way as the number of perfect matchings in a graph. For planar graphs (regardless of bipartiteness), the FKT algorithm computes the number of perfect matchings in polynomial time by changing the signs of a carefully chosen subset of the entries in the Tutte matrix of the graph, so that the Pfaffian of the resulting skew-symmetric matrix (the square root of its determinant) is the number of perfect matchings. This technique can be generalized to graphs that contain no subgraph homeomorphic to the complete bipartite graph K3,3.[4]

George Pólya had asked the question[5] of when it is possible to change the signs of some of the entries of a 01 matrix A so that the determinant of the new matrix is the permanent of A. Not all 01 matrices are "convertible" in this manner; in fact it is known (Marcus & Minc (1961)) that there is no linear map T such that \operatorname{per}\,T(A) = \det A for all n\times n matrices A. The characterization of "convertible" matrices was given by Little (1975) who showed that such matrices are precisely those that are the biadjacency matrix of bipartite graphs that have a Pfaffian orientation: an orientation of the edges such that for every even cycle C for which G\setminus C has a perfect matching, there are an odd number of edges directed along C (and thus an odd number with the opposite orientation). It was also shown that these graphs are exactly those that do not contain a subgraph homeomorphic to K_{3,3}, as above.

Computation modulo a number[edit]

Modulo 2, the permanent is the same as the determinant, as (-1) \equiv 1 \pmod 2. It can also be computed modulo 2^k in time O(n^{4k-3}) for k \ge 2. However, it is UP-hard to compute the permanent modulo any number that is not a power of 2. Valiant (1979)

There are various formulae given by Glynn (2010) for the computation modulo a prime p. Firstly there is one using symbolic calculations with partial derivatives.

Secondly for p=3 there is the following formula (Grigoriy Kogan, 1996) using the determinants of the principal submatrices of the matrix:

\operatorname{perm} (A) = (-1)^{m}\Sigma_{U\subseteq \{1,\dots,m\}} \det(A_U).\det(A_{\bar U}),

where A_U is the principal submatrix of A induced by the rows and columns of A indexed by U, and \bar U is the complement of U in \{1,\dots,m\}.

(The determinant of an empty submatrix is defined to be 1).

This formula implies the following identities over fields of Characteristic 3 (Grigoriy Kogan, 1996):

for any invertible A

\operatorname{perm}(A^{-1})\det(A)^2 = \operatorname{perm}(A) ;

for any unitary U , i.e. a square matrix U such that U^{T} U = I \, ,

\operatorname{perm}(U)^2 = \det(U+V)\det(U)

where V is the matrix whose entries are the cubes of the corresponding entries of U.

Approximate computation[edit]

When the entries of A are nonnegative, the permanent can be computed approximately in probabilistic polynomial time, up to an error of εM, where M is the value of the permanent and ε > 0 is arbitrary. In other words, there exists a fully polynomial-time randomized approximation scheme (FPRAS) (Jerrum, Vigoda & Sinclair (2001)).

The most difficult step in the computation is the construction of an algorithm to sample almost uniformly from the set of all perfect matchings in a given bipartite graph: in other words, a fully polynomial almost uniform sampler (FPAUS). This can be done using a Markov chain Monte Carlo algorithm that uses a Metropolis rule to define and run a Markov chain whose distribution is close to uniform, and whose mixing time is polynomial.

It is possible to approximately count the number of perfect matchings in a graph via the self-reducibility of the permanent, by using the FPAUS in combination with a well-known reduction from sampling to counting due to Jerrum, Valiant & Vazirani (1986). Let M(G) denote the number of perfect matchings in G. Roughly, for any particular edge e in G, by sampling many matchings in G and counting how many of them are matchings in G \setminus e, one can obtain an estimate of the ratio \rho=\frac{M(G)}{M(G\setminus e)}. The number M(G) is then \rho M(G \setminus e), where  M(G \setminus e) can be approximated by applying the same method recursively.

Notes[edit]

References[edit]

  • Allender, Eric; Gore, Vivec (1994), "A uniform circuit lower bound for the permanent", SIAM Journal on Computing 23 (5): 1026–1049, doi:10.1137/s0097539792233907 
  • Kogan, Grigoriy (1996), "Computing permanents over fields of characteristic 3: where and why it becomes difficult", 37th Annual Symposium on Foundations of Computer Science (FOCS '96)