# Conjunctive normal form

In Boolean logic, a formula is in conjunctive normal form (CNF) or clausal normal form if it is a conjunction of clauses, where a clause is a disjunction of literals. As a normal form, it is useful in automated theorem proving. It is similar to the product of sums form used in circuit theory.

All conjunctions of literals and all disjunctions of literals are in CNF, as they can be seen as conjunctions of one-literal clauses and conjunctions of a single clause, respectively. As in the disjunctive normal form (DNF), the only propositional connectives a formula in CNF can contain are and, or, and not. The not operator can only be used as part of a literal, which means that it can only precede a propositional variable.

## Examples and counterexamples

All of the following formulas are in CNF:

$\neg A \wedge (B \vee C)$
$(A \vee B) \wedge (\neg B \vee C \vee \neg D) \wedge (D \vee \neg E)$
$A \or B$
$A \wedge B.$

The last formula is in CNF because it can be seen as the conjunction of the two single-literal clauses $A$ and $B$. Incidentally, the last two formulae are also in disjunctive normal form.

The following formulas are not in CNF:

$\neg (B \vee C)$
$(A \wedge B) \vee C$
$A \wedge (B \vee (D \wedge E)).$

The above three formulas are respectively equivalent to the following three formulas that are in CNF:

$\neg B \wedge \neg C$
$(A \vee C) \wedge (B \vee C)$
$A \wedge (B \vee D) \wedge (B \vee E).$

## Conversion into CNF

Every propositional formula can be converted into an equivalent formula that is in CNF. This transformation is based on rules about logical equivalences: the double negative law, De Morgan's laws, and the distributive law.

Since all logical formulae can be converted into an equivalent formula in conjunctive normal form, proofs are often based on the assumption that all formulae are CNF. However, in some cases this conversion to CNF can lead to an exponential explosion of the formula. For example, translating the following non-CNF formula into CNF produces a formula with $2^n$ clauses:

$(X_1 \wedge Y_1) \vee (X_2 \wedge Y_2) \vee \dots \vee (X_n \wedge Y_n).$

In particular, the generated formula is:

$(X_1 \vee \cdots \vee X_{n-1} \vee X_n) \wedge (X_1 \vee \cdots \vee X_{n-1} \vee Y_n) \wedge \cdots \wedge (Y_1 \vee \cdots \vee Y_{n-1} \vee Y_n).$

This formula contains $2^n$ clauses; each clause contains either $X_i$ or $Y_i$ for each $i$.

There exist transformations into CNF that avoid an exponential increase in size by preserving satisfiability rather than equivalence.[1][2] These transformations are guaranteed to only linearly increase the size of the formula, but introduce new variables. For example, the above formula can be transformed into CNF by adding variables $Z_1,\ldots,Z_n$ as follows:

$(Z_1 \vee \cdots \vee Z_n) \wedge (\neg Z_1 \vee X_1) \wedge (\neg Z_1 \vee Y_1) \wedge \cdots \wedge (\neg Z_n \vee X_n) \wedge (\neg Z_n \vee Y_n).$

An interpretation satisfies this formula only if at least one of the new variables is true. If this variable is $Z_i$, then both $X_i$ and $Y_i$ are true as well. This means that every model that satisfies this formula also satisfies the original one. On the other hand, only some of the models of the original formula satisfy this one: since the $Z_i$ are not mentioned in the original formula, their values are irrelevant to satisfaction of it, which is not the case in the last formula. This means that the original formula and the result of the translation are equisatisfiable but not equivalent.

An alternative translation includes also the clauses $Z_i \vee \neg X_i \vee \neg Y_i$. With these clauses, the formula implies $Z_i \equiv X_i \wedge Y_i$; this formula is often regarded to "define" $Z_i$ to be a name for $X_i \wedge Y_i$.

## First-order logic

In first order logic, conjunctive normal form can be taken further to yield the clausal normal form of a logical formula, which can be then used to perform first-order resolution.

## Computational complexity

An important set of problems in computational complexity involves finding assignments to the variables of a boolean formula expressed in Conjunctive Normal Form, such that the formula is true. The k-SAT problem is the problem of finding a satisfying assignment to a boolean formula expressed in CNF in which each disjunction contains at most k variables. 3-SAT is NP-complete (like any other k-SAT problem with k>2) while 2-SAT is known to have solutions in polynomial time.

Typical problems in this case involve formulas in "3CNF": conjunctive normal form with no more than three variables per conjunct. Examples of such formulas encountered in practice can be very large, for example with 100,000 variables and 1,000,000 conjuncts.

A formula in CNF can be converted into an equisatisfiable formula in "kCNF" (for k>=3) by replacing each conjunct with more than k variables $X_1 \vee \cdots \vee X_k \vee \cdots \vee X_n$ by two conjuncts $X_1 \vee \cdots \vee X_{k-1} \vee Z$ and $\neg Z \vee X_k \cdots \vee X_n$ with $Z$ a new variable, and repeating as often as necessary.

## Converting from first-order logic

To convert first-order logic to CNF:[3]

1. Convert to negation normal form.
1. Eliminate implications: convert $x \rightarrow y$ to $\neg x \vee y$
2. Move NOTs inwards by repeatedly applying DeMorgan's Law. Specifically, replace $\neg (x \vee y)$ with $(\neg x) \wedge (\neg y)$; replace $\neg (x \wedge y)$ with $(\neg x) \vee (\neg y)$; and replace $\neg\neg x$ with $x$.
2. Standardize variables
1. For sentences like (∀x P(x)) ∨ (∃x Q(x)) which use the same variable name twice, change the name of one of the variables. This avoids confusion later when we drop the quantifiers. For example, from ∀x [∃y Animal(y) ∧ ¬Loves(x, y)] ∨ [∃y Loves(y, x)]. we obtain: ∀x [∃y Animal(y) ∧ ¬Loves(x, y)] ∨ [∃z Loves(z,x)].
3. Skolemize the statement
1. ∃x P(x) into P(A), where A is a new constant (consult linked article for more details)
4. Drop universal quantifiers
5. Distribute ORs over ANDs.

## Notes

1. ^ Tseitin (1968)
2. ^ Jackson and Sheridan (2004)
3. ^ Artificial Intelligence: A modern Approach [1995...] Russel and Norvig