Exponential time hypothesis

In computational complexity theory, the exponential time hypothesis is an unproven computational hardness assumption that was formulated by Impagliazzo & Paturi (1999). It states that satisfiability of 3-CNF Boolean formulas cannot be solved more quickly than exponential time in the worst case. The exponential time hypothesis, if true, would imply that P ≠ NP, but it is a stronger statement. It implies that many computational problems are equivalent in complexity, in the sense that if one of them has a subexponential time algorithm then they all do, and that many known algorithms for these problems have optimal or near-optimal time complexity.^[1]

Definition

The $k$ -SAT problem is a version of the Boolean satisfiability problem in which the input to the problem is a Boolean expression in conjunctive normal form (that is, an and of ors of variables and their negations) with at most $k$ variables per clause. The goal is to determine whether this expression can be made to be true by some assignment of Boolean values to its variables. 2-SAT has a linear time algorithm, but all known algorithms for larger $k$ take exponential time, with the base of the exponential function depending on $k$ . For instance, the WalkSAT probabilistic algorithm can solve $k$ -SAT in average time $\left(2-{\frac {2}{k}}\right)^{n}n^{O(1)},$ where $n$ is the number of variables in the given $k$ -SAT instance.^[2] For each integer $k\geq 3$ , define $s_{k}$ to be the smallest number such that $k$ -SAT can be solved in time $2^{s_{k}n+o(n)}$ . This minimum might not exist, if a sequence of better and better algorithms have correspondingly smaller exponential growth in their time bounds; in that case, define $s_{k}$ to be the infimum of the real numbers $\delta$ for which $k$ -SAT can be solved in time $O(2^{\delta n})$ . Because problems with larger $k$ cannot be easier, these numbers are ordered as $s_{3}\leq s_{4}\leq \cdots$ , and because of WalkSAT they are at most $s_{k}\leq \log _{2}\left(2-{\frac {2}{k}}\right)<1.$ The exponential time hypothesis is the conjecture that they are all nonzero, or equivalently, that the smallest of them, $s_{3}$ , is nonzero.^[1]

Some sources define the exponential time hypothesis to be the slightly weaker statement that 3-SAT cannot be solved in time $2^{o(n)}$ . If there existed an algorithm to solve 3-SAT in time $2^{o(n)}$ , then $s_{3}$ would equal zero. However, it is consistent with current knowledge that there could be a sequence of 3-SAT algorithms, each with running time $O(2^{\delta _{i}n})$ for a sequence of numbers $\delta _{i}$ tending towards zero, but where the descriptions of these algorithms are so quickly growing that a single algorithm could not automatically select and run the most appropriate one. If this were to be the case, then $s_{3}$ would equal zero even though there would be no single algorithm running in time $2^{o(n)}$ .^[3] A related variant of the exponential time hypothesis is the non-uniform exponential time hypothesis, which posits that there is no family of algorithms (one for each length of the input, in the spirit of advice) that can solve 3-SAT in time $2^{o(n)}$ .^[4]

Because the numbers $s_{3},s_{4},\dots$ form a monotonic sequence that is bounded above by one, they must converge to a limit $s_{\infty }=\lim _{k\to \infty }s_{k}.$ The strong exponential time hypothesis (SETH) is the conjecture that $s_{\infty }=1$ .^[5]

Implications

Satisfiability

It is not possible for $s_{k}$ to equal $s_{\infty }$ for any finite $k$ : as Impagliazzo, Paturi & Zane (2001) showed, there exists a constant $\alpha$ such that $s_{k}\leq s_{\infty }(1-\alpha /k)$ . Therefore, if the exponential time hypothesis is true, there must be infinitely many values of $k$ for which $s_{k}$ differs from $s_{k+1}$ .^[6]

An important tool in this area is the sparsification lemma of Impagliazzo, Paturi & Zane (2001), which shows that, for every $\varepsilon >0$ , any $k$ -CNF formula can be replaced by $O(2^{\varepsilon n})$ simpler $k$ -CNF formulas in which each variable appears only a constant number of times, and therefore in which the number of clauses is linear. The sparsification lemma is proven by repeatedly finding large sets of clauses that have a nonempty common intersection in a given formula, and replacing the formula by two simpler formulas, one of which has each of these clauses replaced by their common intersection and the other of which has the intersection removed from each clause. By applying the sparsification lemma and then using new variables to split the clauses, one may then obtain a set of $O(2^{\varepsilon n})$ 3-CNF formulas, each with a linear number of variables, such that the original $k$ -CNF formula is satisfiable if and only if at least one of these 3-CNF formulas is satisfiable. Therefore, if 3-SAT could be solved in subexponential time, one could use this reduction to solve $k$ -SAT in subexponential time as well. Equivalently, if $s_{k}>0$ for any $k>0$ , then $s_{3}>0$ as well, and the exponential time hypothesis would be true.^[7]^[6]

The limiting value $s_{\infty }$ of the sequence of numbers $s_{k}$ is at most equal to $s_{\operatorname {CNF} }$ , where $s_{\operatorname {CNF} }$ is the infimum of the numbers $\delta$ such that satisfiability of conjunctive normal form formulas without clause length limits can be solved in time $O(2^{\delta n})$ . Therefore, if the strong exponential time hypothesis is true, then there would be no algorithm for general CNF satisfiability that is significantly faster than a brute-force search over all possible truth assignments. However, if the strong exponential time hypothesis fails, it would still be possible for $s_{\operatorname {CNF} }$ to equal one.^[8]

Communication complexity

In the three-party set disjointness problem in communication complexity, three subsets of the integers in some range $[1,m]$ are specified, and three communicating parties each know two of the three subsets. The goal is for the parties to transmit as few bits to each other on a shared communications channel in order for one of the parties to be able to determine whether the intersection of the three sets is empty or nonempty. A trivial $m$ -bit communications protocol would be for one of the three parties to transmit a bitvector describing the intersection of the two sets known to that party, after which either of the two remaining parties can determine the emptiness of the intersection. However, if there exists a protocol that solves the problem with $o(m)$ communication and $2^{o(m)}$ computation, it could be transformed into an algorithm for solving $k$ -SAT in time $O(1.74^{n})$ for any fixed constant $k$ , violating the strong exponential time hypothesis. Therefore, the strong exponential time hypothesis implies either that the trivial protocol for three-party set disjointness is optimal, or that any better protocol requires an exponential amount of computation.^[8]

Structural complexity

If the exponential time hypothesis is true, then 3-SAT would not have a polynomial time algorithm, and therefore it would follow that P ≠ NP. More strongly, in this case, 3-SAT could not even have a quasi-polynomial time algorithm, so NP could not be a subset of QP. However, if the exponential time hypothesis fails, it would have no implication for the P versus NP problem. A padding argument proves the existence of NP-complete problems for which the best known running times have the form ${\textstyle O(2^{n^{c}})}$ for $c<1$ , and if the best possible running time for 3-SAT were of this form, then P would be unequal to NP (because 3-SAT is NP-complete and this time bound is not polynomial) but the exponential time hypothesis would be false.

In parameterized complexity theory, because the exponential time hypothesis implies that there does not exist a fixed-parameter-tractable algorithm for maximum clique, it also implies that W[1] ≠ FPT.^[10] It is an important open problem in this area whether this implication can be reversed: does W[1] ≠ FPT imply the exponential time hypothesis? There is a hierarchy of parameterized complexity classes called the M-hierarchy that interleaves the W-hierarchy in the sense that, for all $i$ , ${\mathsf {M}}[i]\subseteq {\mathsf {W}}[i]\subseteq {\mathsf {M}}[i+1]$ ; for instance, the problem of finding a vertex cover of size $k\log n$ in an $n$ -vertex graph with parameter $k$ is complete for M[1]. The exponential time hypothesis is equivalent to the statement that M[1] ≠ FPT, and the question of whether ${\mathsf {M}}[i]\subseteq {\mathsf {W}}[i]$ for $i>1$ is also open.^[3]

It is also possible to prove implications in the other direction, from the failure of a variation of the strong exponential time hypothesis to separations of complexity classes. As Williams (2010) shows, if there exists an algorithm $A$ that solves Boolean circuit satisfiability in time $2^{n}/f(n)$ for some superpolynomially growing function $f$ , then NEXPTIME is not a subset of P/poly. Williams shows that, if algorithm $A$ exists, and a family of circuits simulating NEXPTIME in P/poly also existed, then algorithm $A$ could be composed with the circuits to simulate NEXPTIME problems nondeterministically in a smaller amount of time, violating the time hierarchy theorem. Therefore, the existence of algorithm $A$ proves the nonexistence of the family of circuits and the separation of these two complexity classes.^[15]

Notes

^ ^a ^b Impagliazzo, Russell; Paturi, Ramamohan (1999), "The Complexity of k-SAT", Proc. 14th IEEE Conf. on Computational Complexity, pp. 237–240, doi:10.1109/CCC.1999.766282, ISBN 978-0-7695-0075-1
^ Schöning, Uwe (1999), "A probabilistic algorithm for $k$ -SAT and constraint satisfaction problems", 40th Annual Symposium on Foundations of Computer Science, FOCS '99, 17-18 October, 1999, New York, NY, USA, IEEE Computer Society, pp. 410–414, doi:10.1109/SFFCS.1999.814612
^ ^a ^b Flum, Jörg; Grohe, Martin (2006), "16. Subexponential Fixed-Parameter Tractability", Parameterized Complexity Theory, EATCS Texts in Theoretical Computer Science, Springer-Verlag, pp. 417–451, ISBN 978-3-540-29952-3
^ Chen, Yijia; Eickmeyer, Kord; Flum, Jörg (2012), "The exponential time hypothesis and the parameterized clique problem", in Thilikos, Dimitrios M.; Woeginger, Gerhard J. (eds.), Parameterized and Exact Computation – 7th International Symposium, IPEC 2012, Ljubljana, Slovenia, September 12–14, 2012, Proceedings, Lecture Notes in Computer Science, vol. 7535, Springer, pp. 13–24, doi:10.1007/978-3-642-33293-7_4
^ Calabro, Chris; Impagliazzo, Russel; Paturi, Ramamohan (2009), "The Complexity of Satisfiability of Small Depth Circuits", Parameterized and Exact Computation, 4th International Workshop, IWPEC 2009, Copenhagen, Denmark, September 10-11, 2009, Revised Selected Papers, Lecture Notes in Computer Science, vol. 5917, pp. 75–85, doi:10.1007/978-3-642-11269-0_6
^ ^a ^b ^c Impagliazzo, Russell; Paturi, Ramamohan; Zane, Francis (2001), "Which problems have strongly exponential complexity?", Journal of Computer and System Sciences, 63 (4): 512–530, CiteSeerX 10.1.1.66.3717, doi:10.1006/jcss.2001.1774
^ ^a ^b ^c Woeginger, Gerhard (2003), "Exact algorithms for NP-hard problems: A survey", Combinatorial Optimization — Eureka, You Shrink! (PDF), Lecture Notes in Computer Science, vol. 2570, Springer-Verlag, pp. 185–207, CiteSeerX 10.1.1.168.5383, doi:10.1007/3-540-36478-1_17, ISBN 978-3-540-00580-3
^ ^a ^b ^c Pătraşcu, Mihai; Williams, Ryan (2010), "On the possibility of faster SAT algorithms", Proc. 21st ACM/SIAM Symposium on Discrete Algorithms (SODA 2010) (PDF), pp. 1065–1075
^ Feige, Uriel; Kilian, Joe (1997), "On limited versus polynomial nondeterminism", Chicago Journal of Theoretical Computer Science, 1: 1–20, doi:10.4086/cjtcs.1997.001
^ ^a ^b Chen, Jianer; Huang, Xiuzhen; Kanj, Iyad A.; Xia, Ge (2006), "Strong computational lower bounds via parameterized complexity", Journal of Computer and System Sciences, 72 (8): 1346–1367, doi:10.1016/j.jcss.2006.04.007
^ Karpinski, Marek; Schudy, Warren (2010), "Faster Algorithms for Feedback Arc Set Tournament, Kemeny Rank Aggregation and Betweenness Tournament", Proc. ISAAC 2010, Part I, Lecture Notes in Computer Science, 6506: 3–14, arXiv:1006.4396, doi:10.1007/978-3-642-17517-6_3, ISBN 978-3-642-17516-9
^ Cygan, Marek; Fomin, Fedor V.; Kowalik, Lukasz; Lokshtanov, Daniel; Marx, Daniel; Pilipczuk, Marcin; Pilipczuk, Michal; Saurabh, Saket (2015), Parameterized Algorithms, Springer, p. 555, ISBN 978-3-319-21274-6
^ Lokshtanov, Daniel; Marx, Dániel; Saurabh, Saket (2011), "Known algorithms on graphs of bounded treewidth are probably optimal", Proc. 22nd ACM/SIAM Symposium on Discrete Algorithms (SODA 2011), pp. 777–789, arXiv:1007.5450, doi:10.1137/1.9781611973082.61
^ Cygan, Marek; Pilipczuk, Marcin; Pilipczuk, Michał (2016), "Known algorithms for edge clique cover are probably optimal", SIAM Journal on Computing, 45 (1): 67–83, doi:10.1137/130947076, MR 3448348
^ Williams, Ryan (2010), "Improving exhaustive search implies superpolynomial lower bounds", Proc. 42nd ACM Symposium on Theory of Computing (STOC 2010), New York, NY, USA: ACM, pp. 231–240, CiteSeerX 10.1.1.216.1299, doi:10.1145/1806689.1806723, ISBN 9781450300506

v t e Computational hardness assumptions
Number theoretic	Integer factorization Phi-hiding RSA problem Strong RSA Quadratic residuosity Decisional composite residuosity Higher residuosity
Group theoretic	Discrete logarithm Diffie-Hellman Decisional Diffie–Hellman Computational Diffie–Hellman
Pairings	External Diffie–Hellman Sub-group hiding Decision linear
Lattices	Shortest vector problem (gap) Closest vector problem (gap) Learning with errors Ring learning with errors Short integer solution
Non-cryptographic	Exponential time hypothesis Unique games conjecture Planted clique conjecture