Set-builder notation

$\{n\mid \exists k\in \mathbb {Z} ,n=2k\}$

The set of all even integers, expressed in set-builder notation.

In set theory and its applications to logic, mathematics, and computer science, set-builder notation is a mathematical notation for describing a set by stating the properties that its members must satisfy.^[1]

Defining sets by properties is also known as set comprehension, set abstraction or as defining a set's intension.

Sets defined by a predicate

Set-builder notation can be used to describe a set that is defined by a predicate, that is, a logical formula that evaluates to true for an element of the set, and false otherwise.^[2] In this form, set-builder notation has three parts: a variable, a colon or vertical bar separator, and a predicate. Thus there is a variable on the left of the separator, and a rule on the right of it. These three parts are contained in curly brackets:

\{x\mid \Phi (x)\}

or

\{x:\Phi (x)\}.

The vertical bar (or colon) is a separator that can be read as "such that", "for which", or "with the property that". The formula $Φ(x)$ is said to be the rule or the predicate. All values of $x$ for which the predicate holds (is true) belong to the set being defined. All values of $x$ for which the predicate does not hold do not belong to the set. Thus $\{x\mid \Phi (x)\}$ is the set of all values of $x$ that satisfy the formula $Φ$ .^[3] It may be the empty set, if no value of $x$ satisfies the formula.
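As a rough illustration, the predicate form can be mimicked in Python over a small finite universe. The universe `U` and the predicates `phi` and `psi` are hypothetical names introduced only for this sketch; the mathematical notation itself does not require a bounded universe.

```python
import math

# Hypothetical finite universe, needed only because a program must enumerate
# candidates; the mathematical notation imposes no such bound.
U = range(-10, 11)

def phi(x):
    """Predicate Φ(x): x is a perfect square."""
    return x >= 0 and math.isqrt(x) ** 2 == x

def psi(x):
    """A predicate that no integer satisfies: x squared equals 2."""
    return x * x == 2

squares = {x for x in U if phi(x)}   # the values for which Φ holds
nothing = {x for x in U if psi(x)}   # no value satisfies the predicate

print(sorted(squares))  # [0, 1, 4, 9]
print(nothing)          # set()
```

The second result shows the empty-set case: when no value satisfies the predicate, the comprehension yields a set with no members.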

Specifying the domain

A domain $E$ can be specified either on the left of the vertical bar:^[4]

\{x\in E\mid \Phi (x)\},

or by adjoining it to the predicate:

\{x\mid x\in E{\text{ and }}\Phi (x)\}\quad {\text{or}}\quad \{x\mid x\in E\land \Phi (x)\}.

The ∈ symbol here denotes set membership, while the $\land$ symbol denotes the logical "and" operator, known as logical conjunction. This notation represents the set of all values of $x$ that belong to some given set $E$ for which the predicate is true (see "Set existence axiom" below). If $\Phi (x)$ is a conjunction $\Phi _{1}(x)\land \Phi _{2}(x)$ , then $\{x\in E\mid \Phi (x)\}$ is sometimes written $\{x\in E\mid \Phi _{1}(x),\Phi _{2}(x)\}$ , using a comma instead of the symbol $\land$ .
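The two placements of the domain, and the comma as a shorthand for conjunction, can be sketched in Python as follows. The finite sets `E` and `universe` and the predicates `phi1` and `phi2` are assumptions made for this example only.

```python
E = range(0, 20)            # hypothetical finite domain
universe = range(-50, 50)   # hypothetical larger universe containing E

def phi1(x):
    """Φ1(x): x is even."""
    return x % 2 == 0

def phi2(x):
    """Φ2(x): x is greater than 10."""
    return x > 10

# Domain on the left of the bar: {x ∈ E | Φ1(x) ∧ Φ2(x)}
on_left = {x for x in E if phi1(x) and phi2(x)}

# Domain adjoined to the predicate: {x | x ∈ E ∧ Φ1(x) ∧ Φ2(x)}
adjoined = {x for x in universe if x in E and phi1(x) and phi2(x)}

# Chained `if` clauses play the role of the comma: {x ∈ E | Φ1(x), Φ2(x)}
with_comma = {x for x in E if phi1(x) if phi2(x)}

print(sorted(on_left))  # [12, 14, 16, 18]
print(on_left == adjoined == with_comma)  # True
```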

In general, it is not a good idea to consider sets without defining a domain of discourse, as the expression would then stand for the collection of all possible things for which the predicate is true. This can easily lead to contradictions and paradoxes. For example, Russell's paradox shows that the expression $\{x\mid x\notin x\}$, although seemingly well formed as a set-builder expression, cannot define a set without producing a contradiction.^[5]
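The contradiction can be spelled out in one step. Writing $R$ (a name introduced here only for illustration) for the supposed set $\{x\mid x\notin x\}$ and applying its defining predicate to $R$ itself gives

R\in R\Leftrightarrow R\notin R,

which no set can satisfy.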

In cases where the set $E$ is clear from context, it may not be explicitly specified. It is common in the literature for an author to state the domain ahead of time, and then not specify it in the set-builder notation. For example, an author may say something such as, "Unless otherwise stated, variables are to be taken to be natural numbers," though in less formal contexts where the domain can be assumed, a written mention is often unnecessary.

Examples

The following examples illustrate particular sets defined by set-builder notation via predicates. In most of them, the domain is specified on the left side of the vertical bar, while the rule is specified on the right side.

$\{x\in \mathbb {R} \mid x>0\}$ is the set of all strictly positive real numbers, which can be written in interval notation as $(0,\infty )$ .
$\{x\in \mathbb {R} \mid |x|=1\}$ is the set $\{-1,1\}$ . This set can also be defined as $\{x\in \mathbb {R} \mid x^{2}=1\}$ ; see "Equivalent predicates yield equal sets" below.
For each integer $m$ , we can define $G_{m}=\{x\in \mathbb {Z} \mid x\geq m\}=\{m,m+1,m+2,\ldots \}$ . As an example, $G_{3}=\{x\in \mathbb {Z} \mid x\geq 3\}=\{3,4,5,\ldots \}$ and $G_{-2}=\{-2,-1,0,\ldots \}$ .
$\{(x,y)\in \mathbb {R} \times \mathbb {R} \mid 0<y<f(x)\}$ is the set of pairs of real numbers such that $y$ is greater than 0 and less than $f(x)$ , for a given function $f$ . Here the Cartesian product $\mathbb {R} \times \mathbb {R}$ denotes the set of ordered pairs of real numbers.
$\{n\in \mathbb {N} \mid (\exists k)[k\in \mathbb {N} \land n=2k]\}$ is the set of all even natural numbers. The $\land$ sign stands for "and", which is known as logical conjunction. The ∃ sign stands for "there exists", which is known as existential quantification. So for example, $(\exists x)P(x)$ is read as "there exists an $x$ such that $P (x)$ ".
$\{n\mid (\exists k\in \mathbb {N} )[n=2k]\}$ is a notational variant for the same set of even natural numbers. It is not necessary to specify that $n$ is a natural number, as this is implied by the formula on the right.
$\{a\in \mathbb {R} \mid (\exists p\in \mathbb {Z} )(\exists q\in \mathbb {Z} )[q\not =0\land aq=p]\}$ is the set of rational numbers; that is, real numbers that can be written as the ratio of two integers.
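Several of the examples above have finite analogues that can be checked mechanically. The sketch below assumes small finite samples (`R_sample`, `Z_sample`, `N_sample`, all hypothetical names) standing in for the infinite sets $\mathbb {R}$, $\mathbb {Z}$ and $\mathbb {N}$, so each result is only a truncation of the set described in the text.

```python
from fractions import Fraction

# Finite samples standing in for ℝ, ℤ and ℕ (assumptions of this sketch).
R_sample = [Fraction(n, 2) for n in range(-6, 7)]   # -3, -5/2, ..., 5/2, 3
Z_sample = range(-5, 6)
N_sample = range(0, 11)

positives = {x for x in R_sample if x > 0}        # {x ∈ ℝ | x > 0}, truncated
unit_abs = {x for x in R_sample if abs(x) == 1}   # {x ∈ ℝ | |x| = 1}
unit_sq = {x for x in R_sample if x ** 2 == 1}    # {x ∈ ℝ | x² = 1}

def G(m):
    """G_m = {x ∈ ℤ | x ≥ m}, truncated to Z_sample."""
    return {x for x in Z_sample if x >= m}

# The existential quantifier is rendered with any():
# n is even iff there exists k with n = 2k.
evens = {n for n in N_sample if any(n == 2 * k for k in N_sample)}

print(unit_abs == unit_sq == {-1, 1})  # True
print(sorted(G(3)))                    # [3, 4, 5]
print(sorted(evens))                   # [0, 2, 4, 6, 8, 10]
```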

More complex expressions on the left side of the notation

An extension of set-builder notation replaces the single variable $x$ with an expression. So instead of $\{x\mid \Phi (x)\}$ , we may have $\{f(x)\mid \Phi (x)\},$ which should be read

\{f(x)\mid \Phi (x)\}=\{y\mid \exists x(y=f(x)\wedge \Phi (x))\}.

For example:

$\{2n\mid n\in \mathbb {N} \}$ , where $\mathbb {N}$ is the set of all natural numbers, is the set of all even natural numbers.
$\{p/q\mid p,q\in \mathbb {Z} ,q\not =0\}$ , where $\mathbb {Z}$ is the set of all integers, is $\mathbb {Q} ,$ the set of all rational numbers.
$\{2t+1\mid t\in \mathbb {Z} \}$ is the set of odd integers.
$\{(t,2t+1)\mid t\in \mathbb {Z} \}$ creates a set of pairs, where each pair puts an integer into correspondence with an odd integer.
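The reading rule for an expression on the left can be checked on a finite sample. In this sketch `T` is a hypothetical finite stand-in for $\mathbb {Z}$, `Y` is a hypothetical range of candidate values, and `f` is the map $t\mapsto 2t+1$ from the examples above.

```python
T = range(-3, 4)     # hypothetical finite stand-in for ℤ
Y = range(-10, 11)   # hypothetical candidates for the values y

def f(t):
    """The expression on the left of the bar: 2t + 1."""
    return 2 * t + 1

# Direct form: {f(t) | t ∈ T}
odds = {f(t) for t in T}

# Unfolded form: {y | ∃t (y = f(t) ∧ t ∈ T)}
odds_unfolded = {y for y in Y if any(y == f(t) for t in T)}

# Pairs putting each t into correspondence with an odd integer
pairs = {(t, f(t)) for t in T}

print(sorted(odds))           # [-5, -3, -1, 1, 3, 5, 7]
print(odds == odds_unfolded)  # True
```

The two forms agree here only because `Y` was chosen large enough to contain every value `f(t)`; with a smaller `Y` the unfolded form would be cut short.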

When inverse functions can be explicitly stated, the expression on the left can be eliminated through simple substitution. Consider the example set $\{2t+1\mid t\in \mathbb {Z} \}$ . Make the substitution $u=2t+1$ , which is to say $t=(u-1)/2$ , then replace $t$ in the set-builder notation to find

\{2t+1\mid t\in \mathbb {Z} \}=\{u\mid (u-1)/2\in \mathbb {Z} \}.
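A finite check of this substitution, under the assumption that both sides are truncated to matching ranges (the range bounds below are chosen for illustration so that the two truncations line up):

```python
# Left-hand side: {2t + 1 | t ∈ ℤ}, with t restricted to -5..4
lhs = {2 * t + 1 for t in range(-5, 5)}

# Right-hand side: {u | (u - 1)/2 ∈ ℤ}, with u restricted to -9..9.
# "(u - 1)/2 is an integer" is tested as "u - 1 is divisible by 2".
rhs = {u for u in range(-9, 10) if (u - 1) % 2 == 0}

print(sorted(lhs))  # [-9, -7, -5, -3, -1, 1, 3, 5, 7, 9]
print(lhs == rhs)   # True
```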

Equivalent predicates yield equal sets

Two sets are equal if and only if they have the same elements. Sets defined by set-builder notation are equal if and only if their set-builder rules, including the domain specifiers, are equivalent. That is,

\{x\in A\mid P(x)\}=\{x\in B\mid Q(x)\}

if and only if

(\forall t)[(t\in A\land P(t))\Leftrightarrow (t\in B\land Q(t))].

Therefore, in order to prove the equality of two sets defined by set-builder notation, it suffices to prove the equivalence of their predicates, including the domain qualifiers.

For example,

\{x\in \mathbb {R} \mid x^{2}=1\}=\{x\in \mathbb {Q} \mid |x|=1\}

because the two rule predicates are logically equivalent:

(x\in \mathbb {R} \land x^{2}=1)\Leftrightarrow (x\in \mathbb {Q} \land |x|=1).

This equivalence holds because, for any real number $x$ , we have $x^{2}=1$ if and only if $x$ is a rational number with $|x|=1$ . In particular, both sets are equal to the set $\{-1,1\}$ .
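The criterion can be tried out on finite domains. In the sketch below the domains `A`, `B` and `B2` and the helper `rules_equivalent` are hypothetical; integer ranges are used in place of $\mathbb {R}$ and $\mathbb {Q}$ so that every candidate can be enumerated.

```python
A = range(-5, 6)      # hypothetical finite domain for the first set
B = range(-1, 100)    # a different finite domain for the second set
B2 = range(0, 100)    # a domain that excludes -1

def P(x):
    return x * x == 1

def Q(x):
    return abs(x) == 1

def rules_equivalent(A, P, B, Q):
    """Check (t ∈ A ∧ P(t)) ⇔ (t ∈ B ∧ Q(t)) for every t in A ∪ B."""
    return all(((t in A) and P(t)) == ((t in B) and Q(t))
               for t in set(A) | set(B))

set_a = {x for x in A if P(x)}
set_b = {x for x in B if Q(x)}
set_c = {x for x in B2 if Q(x)}

print(set_a == set_b, rules_equivalent(A, P, B, Q))    # True True
print(set_a == set_c, rules_equivalent(A, P, B2, Q))   # False False
```

The second comparison shows why the domain specifiers belong to the rule: the predicates are unchanged, but the domain `B2` excludes $-1$, so the rules are no longer equivalent and the sets differ.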

Set existence axiom

In many formal set theories, such as Zermelo–Fraenkel set theory, set-builder notation is not part of the formal syntax of the theory. Instead, there is a set existence axiom scheme (in Zermelo–Fraenkel set theory, the axiom schema of specification), which states that if $E$ is a set and $Φ(x)$ is a formula in the language of set theory, then there is a set $Y$ whose members are exactly the elements of $E$ that satisfy $Φ$ :

(\forall E)(\exists Y)(\forall x)[x\in Y\Leftrightarrow x\in E\land \Phi (x)].

The set $Y$ obtained from this axiom is exactly the set described in set-builder notation as $\{x\in E\mid \Phi (x)\}$ .
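For finite sets, the content of the axiom scheme can be imitated by a small helper. The function `specification` below is a hypothetical name for this sketch, not part of any formal theory or library; it returns the members of a given set `E` that satisfy a given predicate, and the result is therefore always a subset of `E`.

```python
def specification(E, phi):
    """Return Y = {x ∈ E | phi(x)} as a frozenset (finite E only)."""
    return frozenset(x for x in E if phi(x))

def is_multiple_of_3(x):
    return x % 3 == 0

E = frozenset(range(10))
Y = specification(E, is_multiple_of_3)

print(sorted(Y))  # [0, 3, 6, 9]
print(Y <= E)     # True: Y is a subset of E

# x ∈ Y ⇔ (x ∈ E ∧ Φ(x)), checked for candidates inside and outside E
print(all((x in Y) == (x in E and is_multiple_of_3(x))
          for x in range(-5, 15)))  # True
```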

In programming languages

A similar notation available in a number of programming languages (notably Python and Haskell) is the list comprehension, which combines map and filter operations over one or more lists.

In Python, the set-builder's braces are replaced with square brackets, parentheses, or curly braces, giving list, generator, and set objects, respectively. Python uses an English-based syntax. Haskell replaces the set-builder's braces with square brackets and uses symbols, including the standard set-builder vertical bar.
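A minimal sketch of the three Python bracket styles applied to the same expression; the input list `L` is an arbitrary example.

```python
L = [3, 1, 3, 2]

as_list = [x * x for x in L]   # square brackets: a list, order and repeats kept
as_gen = (x * x for x in L)    # parentheses: a lazy generator object
as_set = {x * x for x in L}    # curly braces: a set, repeats collapsed

print(as_list)             # [9, 1, 9, 4]
print(sorted(as_set))      # [1, 4, 9]

total = sum(as_gen)        # consuming the generator evaluates its items
print(total)               # 23
```

Only the curly-brace form produces an object that behaves like a mathematical set; the list form keeps duplicates and ordering.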

The same can be achieved in Scala using sequence comprehensions, in which a "for" expression combined with the "yield" keyword returns a collection of the yielded values.^[6]

Consider these set-builder notation examples in some programming languages:

	Example 1	Example 2
Set-builder	$\{l\mid l\in L\}$	$\{(k,x)\mid k\in K\wedge x\in X\wedge P(x)\}$
Python	{l for l in L}	{(k, x) for k in K for x in X if P(x)}
Haskell	[l | l <- ls]	[(k, x) | k <- ks, x <- xs, p x]
Scala	for (l <- L) yield l	for (k <- K; x <- X if P(x)) yield (k,x)
C#	from l in L select l	from k in K from x in X where P(x) select (k,x)
SQL	SELECT l FROM L_set	SELECT k, x FROM K_set, X_set WHERE P(x)
Prolog	setof(L,member(L,Ls),Result)	setof((K,X),(member(K,Ks),member(X,Xs),call(P,X)),Result)
Erlang	[L || L <- Ls]	[{K,X} || K <- Ks, X <- Xs, p(X)]
Julia	[l for l ∈ L]	[(k, x) for k ∈ K for x ∈ X if P(x)]
Mathematica	(l |-> l) /@ L	Cases[Tuples[{K, X}], {k_, x_} /; P[x]]
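To make the Python entry for Example 2 concrete, the sketch below supplies hypothetical values for `K`, `X` and `P` (none of which are fixed by the table) and compares the comprehension with an explicit filter over the Cartesian product.

```python
from itertools import product

K = ["a", "b"]   # hypothetical set K
X = range(4)     # hypothetical set X

def P(x):
    """Hypothetical predicate: x is odd."""
    return x % 2 == 1

# Table entry: {(k, x) | k ∈ K ∧ x ∈ X ∧ P(x)}
pairs = {(k, x) for k in K for x in X if P(x)}

# The same set, written as a filter over the Cartesian product K × X
pairs_via_product = {(k, x) for (k, x) in product(K, X) if P(x)}

print(sorted(pairs))               # [('a', 1), ('a', 3), ('b', 1), ('b', 3)]
print(pairs == pairs_via_product)  # True
```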

Set-builder notation and list comprehension notation are both instances of a more general notation known as monad comprehensions, which permit map/filter-like operations over any monad with a zero element.

Notes

[1] Rosen, Kenneth (2007). Discrete Mathematics and its Applications (6th ed.). New York, NY: McGraw-Hill. pp. 111–112. ISBN 978-0-07-288008-3.

[2] Michael J Cullinan, 2012, A Transition to Mathematics with Proofs, Jones & Bartlett, pp. 44ff.

[3] Weisstein, Eric W. "Set". mathworld.wolfram.com. Retrieved 20 August 2020.

[4] "Set-Builder Notation". mathsisfun.com. Retrieved 20 August 2020.

[5] Irvine, Andrew David; Deutsch, Harry (9 October 2016) [1995]. "Russell's Paradox". Stanford Encyclopedia of Philosophy. Retrieved 6 August 2017.

[6] "Sequence Comprehensions". Scala. Retrieved 6 August 2017.
