= Set (mathematics) =

In mathematics, a set is a collection of different things; the things are called elements or members of the set and are typically mathematical objects: numbers, symbols, points in space, lines, geometric shapes, variables, or other sets. A set may be finite or infinite. There is a unique set with no elements, called the empty set; a set with a single element is a singleton.

Mathematics typically does not define precisely what constitutes a "set" or "collection", because such a definition would have to be in terms of something else previously defined. Instead, sets serve as foundational objects whose behavior is described by axioms modeled on intuition about collections, and then essentially all other mathematical objects are rigorously defined in terms of sets.

Set theory studies possible axiom systems and their consequences.
Since the first half of the 20th century, ZFC (Zermelo–Fraenkel set theory with the axiom of choice) has been the axiom system most commonly used.

== Context ==
Before the end of the 19th century, sets were not studied specifically, and were not clearly distinguished from sequences. Most mathematicians considered infinity as potential (meaning that it is the result of an endless process) and were reluctant to consider infinite sets, that is, sets whose number of members is not a natural number. Specifically, a line was not considered as the set of its points, but as a locus where points may be located.

The mathematical study of infinite sets began with Georg Cantor (1845–1918). This provided some counterintuitive facts and paradoxes. For example, the number line has an infinite number of elements that is strictly larger than the infinite number of natural numbers, and any line segment has the same number of elements as the whole space. Also, Russell's paradox implies that the phrase "the set of all sets" is self-contradictory.

Together with other counterintuitive results, this led to the foundational crisis of mathematics, which was eventually resolved with the general adoption of Zermelo–Fraenkel set theory as a robust foundation of set theory and all mathematics.

Meanwhile, sets started to be widely used in all mathematics. In particular, algebraic structures and mathematical spaces are typically defined in terms of sets. Also, many older mathematical results are restated in terms of sets. For example, Euclid's theorem is often stated as "the set of the prime numbers is infinite". This wide use of sets in mathematics was prophesied by David Hilbert when saying: "No one will drive us from the paradise that Cantor created for us."

Generally, the common usage of sets in mathematics does not require the full power of Zermelo–Fraenkel set theory. In mathematical practice, sets can be manipulated independently of the logical framework of this theory.

The object of this article is to summarize the manipulation rules and properties of sets that are commonly used in mathematics, without reference to any logical framework. For the branch of mathematics that studies sets, see Set theory; for an informal presentation of the corresponding logical framework, see Naive set theory; for a more formal presentation, see Axiomatic set theory and Zermelo–Fraenkel set theory.

== Basic notions ==

In mathematics, a set is a collection of different things. These things are called elements or members of the set and are typically mathematical objects of any kind such as numbers, symbols, points in space, lines, other geometrical shapes, variables, functions, or even other sets. A set may also be called a collection or family, especially when its elements are themselves sets; this may avoid the confusion between the set and its members, and may make reading easier. A set may be specified either by listing its elements or by a property that characterizes its elements, such as for the set of the prime numbers or the set of all students in a given class.

If $x$ is an element of a set $S$, one says that $x$ belongs to $S$ or is in $S$, and this is written as $x\in S$. The statement "$y$ is not an element of $S$" is written as $y\not\in S$, which can also be read as "$y$ is not in $S$". For example, if $\Z$ is the set of the integers, one has $-3\in\Z$ and $1.5 \not\in \Z$. Each set is uniquely characterized by its elements. In particular, two sets that have precisely the same elements are equal (they are the same set). This property, called extensionality, can be written in formula as
$A=B \iff \forall x\; (x\in A \iff x \in B).$
This implies that there is only one set with no element, the empty set (or null set) that is denoted $\varnothing$, $\empty$, or $\{\,\}$. A singleton is a set with exactly one element. If $x$ is this element, the singleton is denoted $\{x\}$. An element of a set can be itself a set. For example, the singleton $\{\empty\}$ is a set that has the empty set $\empty$ as its only element. Since these two sets do not have the same elements, $\{\empty\}$ and $\empty$ are different sets.
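As an informal illustration, extensionality and the difference between $\empty$ and $\{\empty\}$ can be sketched with Python's built-in `frozenset` type, which compares sets by membership only. The variable names are chosen for this example and are not part of any standard notation.

```python
# Built-in sets are compared by membership alone, mirroring extensionality.
A = frozenset({1, 2, 3})
B = frozenset({3, 2, 1, 1})

# A = B  <=>  for all x: (x in A <=> x in B), checked over every candidate x.
same_members = all((x in A) == (x in B) for x in A | B)

empty = frozenset()                       # the empty set
singleton_of_empty = frozenset({empty})   # the singleton whose only element is the empty set

print(A == B, same_members)
print(len(empty), len(singleton_of_empty))
print(empty == singleton_of_empty)
```

Here `empty` has no elements while `singleton_of_empty` has exactly one, so the two objects compare as different.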

A set is finite if there exists a natural number $n$ such that the first $n$ natural numbers can be put in one-to-one correspondence with the elements of the set. In this case, one says that $n$ is the number of elements of the set. A set is infinite if such an $n$ does not exist. The empty set is a finite set with $0$ elements.

The natural numbers form an infinite set, commonly denoted $\N$. Other examples of infinite sets include integer numbers ($\Z$), rational numbers ($\Q$), real numbers ($\R$), real vector spaces, curves and most sorts of mathematical spaces.

== Specifying a set ==
Extensionality implies that for specifying a set, one has either to list its elements or to provide a property that uniquely characterizes the set elements.

=== Roster notation ===
Roster or enumeration notation is a notation introduced by Ernst Zermelo in 1908 that specifies a set by listing its elements between braces, separated by commas. For example, one sees that $\{4, 2, 1, 3\}$ and $\{\text{blue, white, red}\}$ denote sets and not tuples because of the enclosing braces.

The notations $\{\,\}$ for the empty set and $\{x\}$ for a singleton are examples of roster notation.

When specifying a set, all that matters is whether each potential element is in the set or not, so a set does not change if elements are repeated or arranged in a different order. For example,
$\{1,2,3,4\}=\{4, 2, 1, 3\} = \{4, 2, 4, 3, 1, 3\}.$

When there is a clear pattern for generating all set elements, one can use an ellipsis to abbreviate the notation; for example, $\{1,2,3,\ldots,10\}$ is a shorthand for $\{1,2,3,4,5,6,7,8,9,10\}$. Ellipses in roster notation can also be used to describe some infinite sets; for example, the set of all integers can be denoted as
$\{\ldots, -3, -2, -1, 0, 1, 2, 3, \ldots\}$
or
$\{0, 1, -1, 2, -2, 3, -3, \ldots\}.$
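The following Python sketch illustrates the conventions above: set literals ignore order and repetition, and an ellipsis stands for a pattern that can be made explicit. The helper `integers_up_to` is a hypothetical name; it only produces a finite prefix of the second listing of the integers, since a program cannot enumerate an infinite set.

```python
a = {1, 2, 3, 4}
b = {4, 2, 1, 3}
c = {4, 2, 4, 3, 1, 3}   # repeated elements collapse

print(a == b == c)
print((1, 2, 3, 4) == (4, 2, 1, 3))   # tuples, by contrast, are ordered

# {1, 2, 3, ..., 10}: the pattern behind the ellipsis is made explicit.
one_to_ten = set(range(1, 11))

def integers_up_to(k):
    """Finite prefix 0, 1, -1, 2, -2, ..., k, -k of the listing of the integers."""
    listing = [0]
    for n in range(1, k + 1):
        listing += [n, -n]
    return listing

print(integers_up_to(3))
```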

=== Set-builder notation ===

Set-builder notation specifies a set as being the set of all elements that satisfy some logical formula. More precisely, if $P(x)$ is a logical formula depending on a variable $x$, which evaluates to true or false depending on the value of $x$, then
$\{x \mid P(x)\}$
or
$\{x : P(x)\}$
denotes the set of all $x$ for which $P(x)$ is true. For example, a set $F$ can be specified as follows:
$F = \{n \mid n \text{ is an integer, and } 0 \leq n \leq 19\}.$
In this notation, the vertical bar "|" is read as "such that", and the whole formula can be read as "$F$ is the set of all $n$ such that $n$ is an integer in the range from 0 to 19 inclusive".
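Set comprehensions in Python closely follow set-builder notation. The sketch below builds the set $F$ of the example; the finite range `U` is an assumption introduced here, because a comprehension needs a finite collection of candidates to filter.

```python
# Hypothetical finite universe to draw candidates from; a comprehension
# cannot range over all integers.
U = range(-100, 101)

# F = {n | n is an integer, and 0 <= n <= 19}
F = {n for n in U if 0 <= n <= 19}

print(sorted(F))
```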

Some logical formulas, such as $\color{red}{S \text{ is a set} }$ or $\color{red}{S \text{ is a set and } S\not\in S}$ cannot be used in set-builder notation because there is no set for which the elements are characterized by the formula. There are several ways for avoiding the problem. One may prove that the formula defines a set; this is often almost immediate, but may be very difficult.

One may also introduce a larger set $U$ that must contain all elements of the specified set, and write the notation as
$\{x\mid x\in U \text{ and ...}\}$
or
$\{x\in U\mid \text{ ...}\}.$

One may also define $U$ once for all and take the convention that every variable that appears on the left of the vertical bar of the notation represents an element of $U$. This amounts to saying that $x\in U$ is implicit in set-builder notation. In this case, $U$ is often called the domain of discourse or a universe.

For example, with the convention that a lower case Latin letter may represent a real number and nothing else, the expression
$\{x\mid x\not\in \Q\}$
is an abbreviation of
$\{x\in \R \mid x\not\in \Q\},$
which defines the irrational numbers.

== Subsets ==

A subset of a set $B$ is a set $A$ such that every element of $A$ is also an element of $B$.
The following are different ways of expressing the same thing:
- $A$ is a subset of $B$,
- $\forall x\; (x\in A \implies x\in B)$,
- $A$ is contained in $B$,
- $A\subseteq B$,
- $B$ is a superset of $A$,
- $B$ contains $A$,
- $B\supseteq A$.
The relationship between sets established by ⊆ is called inclusion or containment.

A set $A$ is a proper subset of a set $B$ if $A \subseteq B$ and $A\neq B$; to denote this, one writes $A\subsetneq B$, or $A\subsetneqq B$.
Likewise, one may write $B\supsetneq A$ or $B\supsetneqq A$.

The notation $A\subset B$ often means $A\subseteq B$, but some authors use $A\subset B$ to mean $A\subsetneq B$.
To avoid ambiguity, one can write $A\subseteq B$ or $A\subsetneq B$, depending on what is intended.

=== Examples ===
- The set of all humans is a proper subset of the set of all mammals.
- $\{ 1, 3 \} \subset \{ 1, 2, 3, 4 \}$
- $\{ 1, 2, 3, 4 \} \subseteq \{ 1, 2, 3, 4 \}$

=== Properties of containment ===
- Two sets are equal if and only if they contain each other: $A = B$ is equivalent to ($A \subseteq B$ and $B \subseteq A$).
- The empty set is a subset of every set: $\varnothing \subseteq A$.
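The definitions and properties above can be checked on small examples. In this sketch, `is_subset` and `is_proper_subset` are hypothetical helpers written directly from the definitions; Python's built-in operators `<=` and `<` on sets perform the same tests.

```python
def is_subset(a, b):
    """A ⊆ B: every element of a is also an element of b."""
    return all(x in b for x in a)

def is_proper_subset(a, b):
    """A ⊊ B: A ⊆ B and A ≠ B."""
    return is_subset(a, b) and a != b

A = {1, 3}
B = {1, 2, 3, 4}

print(is_subset(A, B), A <= B)          # built-in <= is the same test
print(is_proper_subset(A, B), A < B)
print(is_subset(B, B), is_proper_subset(B, B))
print(is_subset(set(), A))              # the empty set is a subset of every set
print((A == B) == (A <= B and B <= A))  # equality as mutual inclusion
```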

== Basic operations ==

There are several standard operations that produce new sets from given sets, in the same way as addition and multiplication produce new numbers from given numbers. The operations that are considered in this section are those such that all elements of the produced sets belong to a previously defined set. These operations are commonly illustrated with Euler diagrams and Venn diagrams.

The main basic operations on sets are the following ones.

=== Intersection ===

The intersection of two sets $A$ and $B$ is a set denoted $A \cap B$ whose elements are those elements that belong to both $A$ and $B$. That is,
$A \cap B=\{x\mid x\in A \land x\in B\},$
where $\land$ denotes the logical and.

Intersection is associative and commutative; this means that when performing a sequence of intersections, one may proceed in any order, without the need of parentheses for specifying the order of operations.

If $\mathcal S$ is a nonempty set of sets, its intersection, denoted
$\bigcap_{A\in \mathcal S} A,$
is the set whose elements are those elements that belong to all sets in $\mathcal S$. That is,
$\bigcap_{A\in \mathcal S} A =\{x\mid (\forall A\in \mathcal S)\; x\in A\}.$
Example: If $\mathcal S = \{X,Y\}$, then $\bigcap_{A\in \mathcal S} A = X \cap Y$.
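A minimal Python sketch of the intersection of a nonempty collection of sets, written from the set-builder formula above. The function name `intersection_of` and the sample sets are assumptions made for illustration; the function rejects an empty collection, in line with the requirement that $\mathcal S$ be nonempty.

```python
def intersection_of(family):
    """Intersection of a nonempty collection of sets."""
    family = list(family)
    if not family:
        raise ValueError("the intersection of an empty family is not defined here")
    first = family[0]
    # Keep the elements that belong to all members of the family.
    return {x for x in first if all(x in A for A in family)}

X = {1, 2, 3, 4}
Y = {2, 3, 4, 5}
Z = {3, 4, 6}

print(intersection_of([X, Y]) == X & Y)
print(intersection_of([X, Y, Z]))
print((X & Y) & Z == X & (Y & Z), X & Y == Y & X)
```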

=== Union ===

The union of two sets $A$ and $B$ is a set denoted $A \cup B$ whose elements are those elements that belong to $A$ or $B$ or both. That is,
$A \cup B=\{x\mid x\in A \lor x\in B\},$
where $\lor$ denotes the logical or.

Union is associative and commutative.

If $\mathcal S$ is a set of sets, its union, denoted
$\bigcup \mathcal S = \bigcup_{A\in \mathcal S} A,$
is the set whose elements are those elements that belong to at least one set in $\mathcal S$. That is,
$\bigcup_{A\in \mathcal S} A =\{x\mid (\exists A\in \mathcal S)\; x\in A\}.$
Example: If $\mathcal S = \{X,Y\}$, then $\bigcup_{A\in \mathcal S} A = X \cup Y$.
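The union of a collection of sets can be sketched in the same way. The name `union_of` is hypothetical; unlike the intersection, the union of an empty collection poses no difficulty and is simply the empty set.

```python
def union_of(family):
    """Union of a collection of sets: x is kept if it lies in at least one member."""
    return {x for A in family for x in A}

X = {1, 2, 3}
Y = {3, 4}
Z = {5}

print(union_of([X, Y]) == X | Y)
print(union_of([X, Y, Z]))
print(union_of([]))
```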

=== Set difference ===

The set difference of two sets $A$ and $B$ is a set, denoted $A \setminus B$ or $A - B$, whose elements are those elements that belong to $A$ but not to $B$. That is,
$A \setminus B=\{x\mid x\in A \land x\not\in B\},$
where $\land$ denotes the logical and.

When $B\subseteq A$ the difference $A \setminus B$ is also called the complement of $B$ in $A$. When all sets that are considered are subsets of a fixed universal set $U$, the complement $U \setminus A$ is often called the absolute complement of $A$.

The symmetric difference of two sets $A$ and $B$, denoted $A\,\Delta\,B$, is the set of those elements that belong to $A$ or $B$ but not to both:
$A\,\Delta\,B = (A \setminus B) \cup (B \setminus A).$
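The difference, symmetric difference and complement can be illustrated on small sets. In this sketch the universal set `U` is an arbitrary choice made for the example; Python writes $A \setminus B$ as `A - B` and $A\,\Delta\,B$ as `A ^ B`.

```python
A = {1, 2, 3, 4}
B = {3, 4, 5}

difference = {x for x in A if x not in B}   # A \ B, from the definition
symmetric = (A - B) | (B - A)               # A Δ B, from the formula above

U = set(range(10))        # hypothetical universal set containing A and B
complement_of_A = U - A   # absolute complement of A

print(difference, symmetric, complement_of_A)
```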

=== Algebra of subsets ===

The set of all subsets of a set $U$ is called the powerset of $U$, often denoted $\mathcal P(U)$. The powerset is an algebraic structure whose main operations are union, intersection, set difference, symmetric difference and absolute complement (complement in $U$).

The powerset is a Boolean ring that has symmetric difference as addition, intersection as multiplication, the empty set as additive identity, $U$ as multiplicative identity, and each subset as its own additive inverse.

The powerset is also a Boolean algebra for which the join $\lor$ is the union $\cup$, the meet $\land$ is the intersection $\cap$, and the negation is the set complement.

As for every Boolean algebra, the power set is also a partially ordered set for set inclusion. It is also a complete lattice.

The axioms of these structures induce many identities relating subsets, which are detailed in the linked articles.
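Some of these identities can be verified exhaustively on a small example. The sketch below checks a few Boolean ring and Boolean algebra laws over all subsets of a three-element set $U$; it is a finite sanity check under that assumption, not a proof, and all names are chosen for this illustration.

```python
from itertools import combinations

U = frozenset({1, 2, 3})
empty = frozenset()

# All subsets of U, that is, the elements of the powerset of U.
subsets = [frozenset(c)
           for r in range(len(U) + 1)
           for c in combinations(sorted(U), r)]

# Boolean ring: symmetric difference (^) as addition, intersection (&) as multiplication.
additive_identity = all(A ^ empty == A for A in subsets)
multiplicative_identity = all(A & U == A for A in subsets)
self_inverse = all(A ^ A == empty for A in subsets)
idempotent = all(A & A == A for A in subsets)
distributive = all(A & (B ^ C) == (A & B) ^ (A & C)
                   for A in subsets for B in subsets for C in subsets)

# Boolean algebra: one of De Morgan's laws, with complement taken in U.
de_morgan = all(U - (A | B) == (U - A) & (U - B)
                for A in subsets for B in subsets)

print(additive_identity, multiplicative_identity, self_inverse,
      idempotent, distributive, de_morgan)
```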

== Functions ==

A function $f$ from a set $A$ to a set $B$ is a rule that assigns to each element of $A$ a unique element of $B$. For example, the square function maps each real number $x$ to $x^2$.

The notation $f : A \to B$ denotes a function $f$ from $A$ to $B$. The result of applying $f$ to an element $a$ of $A$ is denoted $f(a)$; it is called the value of $f$ at $a$, or the image of $a$ under $f$. The set $A$ is called the domain of $f$, and $B$ is called the codomain of $f$.

The graph of a function $f : A \to B$ is the set of all ordered pairs $(a,f(a))$ as $a$ ranges over all elements of $A$.
It is a subset of the Cartesian product $A \times B$ defined below.
For example, the graph of the square function is a parabola in $\R \times \R = \R^2$; it contains points such as $(3,9)$ and $(-4,16)$.

Once the domain and codomain are specified, the graph of $f$ contains the same information as $f$ itself.
This point of view allows one to formally define 'function' in terms of sets.
Specifically, a function from $A$ to $B$ is a triple $(A,B,G)$ of sets with $G \subset A \times B$ such that for every element $a$ in $A$, there exists a unique element $b$ in $B$ such that $(a,b) \in G$. (For functions from $\R$ to $\R$ especially, the condition on $G$ is called the vertical line test.)
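For finite sets, the condition on the triple $(A,B,G)$ can be tested directly. The following sketch uses a hypothetical helper `is_function` and a restriction of the square function to a small domain; the sets `A` and `B` are chosen only for the example.

```python
def is_function(A, B, G):
    """Check that (A, B, G) is a function: G is a subset of A x B and each a in A
    appears as the first component of exactly one pair in G."""
    if not all(a in A and b in B for (a, b) in G):
        return False
    return all(sum(1 for (x, _) in G if x == a) == 1 for a in A)

A = {-2, -1, 0, 1, 2}
B = {0, 1, 2, 3, 4}
square_graph = {(a, a * a) for a in A}

print(is_function(A, B, square_graph))
print(is_function(A, B, square_graph | {(2, 0)}))   # 2 would have two images
print(is_function(A, B, square_graph - {(0, 0)}))   # 0 would have no image
```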

=== Indexed families ===

Intuitively, an indexed family is a set whose elements are labelled with the elements of another set, the index set. These labels allow the same element to occur several times in the family.

Formally, an indexed family is a function that has the index set as its domain. Generally, the usual functional notation $f(x)$ is not used for indexed families. Instead, the element of the index set is written as a subscript of the name of the family, such as in $a_i$.

When the index set is $\{1,2\}$, an indexed family is called an ordered pair. When the index set is the set of the first $n$ natural numbers, an indexed family is called an $n$-tuple. When the index set is the set of all natural numbers, an indexed family is called a sequence.

In all these cases, the natural order of the natural numbers allows omitting indices for explicit indexed families. For example, $(b,2,b)$ denotes the 3-tuple $A$ such that $A_1=b, A_2=2, A_3=b$.

The above notations $\textstyle \bigcup_{A\in \mathcal S} A$ and $\textstyle \bigcap_{A\in \mathcal S} A$ are commonly replaced with a notation involving indexed families, namely
$\bigcup_{i\in \mathcal I} A_i=\{x\mid (\exists i\in \mathcal I)\; x\in A_i\}$
and
$\bigcap_{i\in \mathcal I} A_i=\{x\mid (\forall i\in \mathcal I)\; x\in A_i\}.$

The formulas of the above sections are special cases of the formulas for indexed families, where $\mathcal S = \mathcal I$ and $i = A =A_i$. The formulas remain correct, even in the case where $A_i=A_j$ for some $i\neq j$, since $A=A\cup A= A\cap A$.
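An indexed family can be modelled informally as a Python dictionary from indices to values. The sketch below, with names and data chosen for illustration, computes the union and intersection of a family in which the same set occurs under two indices, and compares the result with the corresponding set of sets.

```python
# An indexed family modelled as a dict from indices to sets.
family = {1: {1, 2}, 2: {2, 3}, 3: {1, 2}}   # here A_1 = A_3

union = {x for i in family for x in family[i]}
intersection = {x for x in union if all(x in family[i] for i in family)}

# The same family with the indices forgotten: a set of sets with only two members.
members = {frozenset(A) for A in family.values()}

# The 3-tuple (b, 2, b) as a family indexed by {1, 2, 3}.
t = {1: "b", 2: 2, 3: "b"}

print(union, intersection, len(members))
print(t[1] == t[3])
```

The repeated member changes neither the union nor the intersection, which is the point made by the identity $A=A\cup A= A\cap A$.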

== External operations ==
In the section Basic operations above, all elements of sets produced by set operations belong to previously defined sets. In this section, other set operations are considered, which produce sets whose elements can be outside all previously considered sets. These operations are Cartesian product, disjoint union, set exponentiation and power set.

=== Cartesian product ===

Given sets $A$ and $B$, their Cartesian product (or simply product), denoted $A\times B$, is the set of all ordered pairs $(a,b)$ such that $a\in A$ and $b\in B$; that is,
$A\times B = \{(a,b) \mid a\in A \text{ and } b\in B\}.$
The definition makes sense even if $A=B$.

One can likewise define $A \times B \times C$ as a set of ordered triples $(a,b,c)$, and likewise for any finite number of sets.

In fact, the number of sets does not have to be finite.
Given any indexed family of sets $(A_i)_{i \in I}$, the product $\prod_{i \in I} A_i$ is the set of all indexed families of elements $(a_i)_{i \in I}$ such that $a_i \in A_i$ for each $i \in I$.
The axiom of choice implies that any product of nonempty sets is nonempty.
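For finite sets, Cartesian products can be enumerated explicitly. The sketch below builds $A\times B$ from the definition and, as an assumption of this example, represents an indexed family as a dictionary; `family_product` is a hypothetical helper built on `itertools.product`.

```python
from itertools import product

A = {1, 2}
B = {"x", "y", "z"}
A_times_B = {(a, b) for a in A for b in B}

def family_product(family):
    """Product of an indexed family (dict index -> set): the list of all
    indexed families (a_i) with a_i in A_i, each returned as a dict."""
    indices = list(family)
    return [dict(zip(indices, choice))
            for choice in product(*(family[i] for i in indices))]

family = {"i": {0, 1}, "j": {"a"}, "k": {10, 20}}

print(len(A_times_B))
print(len(family_product(family)))
```

In the finite case no axiom is needed to see that a product of nonempty sets is nonempty, and a single empty factor makes the whole product empty.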

=== Set exponentiation ===

Given two sets $E$ and $F$, the set exponentiation, denoted $F^E$, is the set that has as elements all functions from $E$ to $F$.

Equivalently, $F^E$ can be viewed as the Cartesian product of a family, indexed by $E$, of sets that are all equal to $F$. This explains the terminology and the notation, since exponentiation with integer exponents is a product where all factors are equal to the base.
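For finite sets, the elements of $F^E$ can be listed by choosing one value in $F$ for each element of $E$, which is exactly a product of copies of $F$ indexed by $E$. In this sketch, `functions` is a hypothetical helper that represents each function as a dictionary; sorting is used only to fix an enumeration order and assumes the elements are comparable.

```python
from itertools import product

def functions(E, F):
    """All functions from E to F, each represented as a dict {e: f(e)}."""
    domain = sorted(E)
    return [dict(zip(domain, values))
            for values in product(sorted(F), repeat=len(domain))]

E = {1, 2, 3}
F = {"a", "b"}
all_functions = functions(E, F)

print(len(all_functions), len(F) ** len(E))
print(functions(set(), F))   # exactly one function from the empty set
```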

=== Power set ===

The power set of a set $E$ is the set that has all subsets of $E$ as elements, including the empty set and $E$ itself. It is often denoted $\mathcal P(E)$. For example,
$\mathcal P(\{1,2,3\})=\{\emptyset, \{1\}, \{2\}, \{3\}, \{1,2\}, \{1,3\}, \{2,3\}, \{1,2,3\}\}.$

There is a natural one-to-one correspondence (bijection) between the subsets of $E$ and the functions from $E$ to $\{0,1\}$; this correspondence associates to each subset the function that takes the value $1$ on the subset and $0$ elsewhere. Because of this correspondence, the power set of $E$ is commonly identified with set exponentiation:
$\mathcal P(E)=\{0,1\}^E.$
In this notation, $\{0,1\}$ is often abbreviated as $2$, which gives
$\mathcal P(E)=2^E.$
In particular, if $E$ has $n$ elements, then $2^E$ has $2^n$ elements.
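The correspondence between subsets and functions to $\{0,1\}$ can be checked on a small example. The helpers `power_set`, `indicator` and `subset_of` below are hypothetical names introduced for this sketch.

```python
from itertools import combinations

def power_set(E):
    """All subsets of a finite set E, as a set of frozensets."""
    elems = list(E)
    return {frozenset(c)
            for r in range(len(elems) + 1)
            for c in combinations(elems, r)}

def indicator(subset, E):
    """The function E -> {0, 1} taking the value 1 exactly on the subset."""
    return {x: 1 if x in subset else 0 for x in E}

def subset_of(ind):
    """Inverse direction: the subset on which an indicator function equals 1."""
    return frozenset(x for x, value in ind.items() if value == 1)

E = {1, 2, 3}
P = power_set(E)

print(len(P), 2 ** len(E))
print(all(subset_of(indicator(A, E)) == A for A in P))
```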

=== Disjoint union ===

The disjoint union of two or more sets is similar to the union, but, if two sets have elements in common, these elements are considered as distinct in the disjoint union. This is obtained by labelling the elements by the indexes of the set they are coming from.

The disjoint union of two sets $A$ and $B$ is commonly denoted $A\sqcup B$ and is thus defined as
$A\sqcup B=\{(a,i)\mid (i=1 \land a\in A)\lor (i=2 \land a\in B)\}.$

If $A=B$ is a set with $n$ elements, then $A\cup A =A$ has $n$ elements, while $A\sqcup A$ has $2n$ elements.
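The labelling construction can be sketched directly, with the labels $1$ and $2$ as in the definition. The function name `disjoint_union` is chosen for this illustration.

```python
def disjoint_union(A, B):
    """Disjoint union of A and B: elements are tagged with 1 or 2."""
    return {(a, 1) for a in A} | {(b, 2) for b in B}

A = {1, 2, 3}

print(len(A | A))                # the ordinary union identifies common elements
print(len(disjoint_union(A, A))) # the disjoint union keeps both copies
```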

The disjoint union of two sets is a particular case of the disjoint union of an indexed family of sets, which is defined as
$\bigsqcup_{i \in \mathcal I} A_i=\{(a,i)\mid i\in \mathcal I \land a\in A_i\}.$

The disjoint union is the coproduct in the category of sets. Therefore the notation
$\coprod_{i \in \mathcal I} A_i=\{(a,i)\mid i\in \mathcal I \land a\in A_i\}$
is commonly used.

==== Internal disjoint union ====

Given an indexed family of sets $(A_i)_{i\in \mathcal I}$, there is a natural map
$\begin{align}
\bigsqcup_{i\in \mathcal I} A_i&\to \bigcup_{i\in \mathcal I} A_i\\
(a,i)&\mapsto a ,
\end{align}$
which consists in "forgetting" the indices.

This map is always surjective; it is bijective if and only if the $A_i$ are pairwise disjoint, that is, all intersections of two sets of the family are empty. In this case, $\textstyle \bigcup_{i\in \mathcal I} A_i$ and $\textstyle \bigsqcup_{i\in \mathcal I} A_i$ are commonly identified, and one says that their union is the disjoint union of the members of the family.

If a set is the disjoint union of a family of subsets, one says also that the family is a partition of the set.
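For finite families, the "forgetting" map and the disjointness criterion can be checked by counting: the map is always onto the union, and it is a bijection exactly when the disjoint union and the union have the same number of elements. The sketch below represents a family as a dictionary; all names are hypothetical.

```python
def disjoint_union_family(family):
    """Disjoint union of an indexed family (dict index -> set)."""
    return {(a, i) for i, A in family.items() for a in A}

def forget(pairs):
    """Image of the natural map (a, i) -> a."""
    return {a for (a, _) in pairs}

def is_pairwise_disjoint(family):
    return all(family[i].isdisjoint(family[j])
               for i in family for j in family if i != j)

partition = {1: {1, 2}, 2: {3}, 3: {4, 5}}   # a partition of {1, ..., 5}
overlapping = {1: {1, 2}, 2: {2, 3}}

for fam in (partition, overlapping):
    tagged = disjoint_union_family(fam)
    union = set().union(*fam.values())
    print(forget(tagged) == union,               # surjective in both cases
          len(tagged) == len(union),             # bijective only if sizes agree
          is_pairwise_disjoint(fam))
```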

== Cardinality ==

Informally, the cardinality of a set $S$, often denoted $\vert S \vert$, is the number of its members. This number is the natural number $n$ when there is a bijection between the set that is considered and the set $\{1,2,\ldots, n \}$ of the first $n$ natural numbers. The cardinality of the empty set is $0$. A set with the cardinality of a natural number is called a finite set, which applies in both cases. Otherwise, one has an infinite set.

The fact that natural numbers measure the cardinality of finite sets is the basis of the concept of natural number, and predates by several thousand years the concept of sets. A large part of combinatorics is devoted to the computation or estimation of the cardinality of finite sets.

=== Infinite cardinalities ===
The cardinality of an infinite set is commonly represented by a cardinal number, exactly as the number of elements of a finite set is represented by a natural number. The definition of cardinal numbers is too technical for this article; however, many properties of cardinalities can be dealt with without referring to cardinal numbers, as follows.

Two sets $S$ and $T$ have the same cardinality if there exists a one-to-one correspondence (bijection) between them. This is denoted $\vert S \vert=\vert T\vert$, and would be an equivalence relation on sets if a set of all sets existed.

For example, the natural numbers and the even natural numbers have the same cardinality, since multiplication by two provides such a bijection. Similarly, the interval $(-1, 1)$ and the set of all real numbers have the same cardinality, a bijection being provided by the function $x\mapsto \tan(\pi x/2)$.

Having the same cardinality as a proper subset is a characteristic property of infinite sets: a set is infinite if and only if it has the same cardinality as one of its proper subsets.
So, by the above example, the natural numbers form an infinite set.

Besides equality, there is a natural inequality between cardinalities: a set $S$ has a cardinality smaller than or equal to the cardinality of another set $T$ if there is an injection from $S$ to $T$. This is denoted $\vert S \vert \le \vert T \vert$.

The Schröder–Bernstein theorem implies that $\vert S \vert \le \vert T \vert$ and $\vert T \vert \le \vert S \vert$ imply $\vert S \vert = \vert T \vert$. Also, for nonempty $S$, one has $\vert S \vert \le \vert T \vert$ if and only if there is a surjection from $T$ to $S$. For every two sets $S$ and $T$, one has either $\vert S \vert \le \vert T \vert$ or $\vert T \vert \le \vert S \vert$. So, inequality of cardinalities is a total order.

The cardinality of the set $\N$ of the natural numbers, denoted $\vert\N\vert=\aleph_0$, is the smallest infinite cardinality. This means that if $S$ is a set of natural numbers, then either $S$ is finite or $\vert S \vert = \vert \N \vert$.

Sets with cardinality less than or equal to $\vert\N\vert=\aleph_0$ are called countable sets; these are either finite sets or countably infinite sets (sets of cardinality $\aleph_0$); some authors use "countable" to mean "countably infinite". Sets with cardinality strictly greater than $\aleph_0$ are called uncountable sets.

Cantor's diagonal argument shows that, for every set $S$, its power set (the set of its subsets) $2^S$ has a greater cardinality:
$|S|<\left|2^S \right|.$
This implies that there is no greatest cardinality.
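The idea of the diagonal argument can be illustrated on a finite set: for any function $f$ from $S$ to its power set, the "diagonal" set $\{x\in S \mid x\not\in f(x)\}$ is never a value of $f$, so $f$ is not surjective. The sketch below checks this exhaustively for a three-element set; it is an illustration under that assumption, not a proof, and all names are chosen for the example.

```python
from itertools import combinations, product

def power_set(S):
    elems = list(S)
    return [frozenset(c)
            for r in range(len(elems) + 1)
            for c in combinations(elems, r)]

def diagonal_set(S, f):
    """The set of those x in S that do not belong to f(x)."""
    return frozenset(x for x in S if x not in f[x])

S = [0, 1, 2]
subsets = power_set(S)

# Every function f: S -> P(S), represented as a dict {x: f(x)}.
all_functions = [dict(zip(S, images))
                 for images in product(subsets, repeat=len(S))]

# For each f, the diagonal set is missing from the image of f.
never_surjective = all(diagonal_set(S, f) not in f.values()
                       for f in all_functions)

print(len(all_functions), never_surjective)
```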

=== Cardinality of the real numbers ===

The cardinality of the set of the real numbers is called the cardinality of the continuum and denoted $\mathfrak c$. (The term "continuum" referred to the real line before the 20th century, when the real line was not commonly viewed as a set of numbers.)

Since, as seen above, the real line $\R$ has the same cardinality as an open interval, every subset of $\R$ that contains a nonempty open interval also has the cardinality $\mathfrak c$.

One has
$\mathfrak c = 2^{\aleph_0},$
meaning that the cardinality of the real numbers equals the cardinality of the power set of the natural numbers. In particular,
$\mathfrak c > \aleph_0.$

When published in 1878 by Georg Cantor, this result was so astonishing that it was rejected by mathematicians, and several decades were needed before its common acceptance.

It can be shown that $\mathfrak c$ is also the cardinality of the entire plane, and of any finite-dimensional Euclidean space.

The continuum hypothesis, a conjecture formulated by Georg Cantor in 1878, states that there is no set with cardinality strictly between $\aleph_0$ and $\mathfrak c$. In 1963, Paul Cohen proved that the continuum hypothesis is independent of the axioms of Zermelo–Fraenkel set theory with the axiom of choice.
This means that if the most widely used set theory is consistent (that is, not self-contradictory), then the same is true for both the set theory with the continuum hypothesis added as a further axiom, and the set theory with the negation of the continuum hypothesis added.

== Axiom of choice ==

Informally, the axiom of choice says that, given any family of nonempty sets, one can choose simultaneously an element in each of them. Formulated this way, the acceptability of this axiom raises a foundational logical question, because of the difficulty of conceiving an infinite instantaneous action. However, there are several equivalent formulations that are much less controversial and have strong consequences in many areas of mathematics. Nowadays, the axiom of choice is thus commonly accepted in mainstream mathematics.

A more formal statement of the axiom of choice is: the Cartesian product of every indexed family of nonempty sets is nonempty.

Other equivalent forms are described in the following subsections.

=== Zorn's lemma ===

Zorn's lemma is an assertion that is equivalent to the axiom of choice under the other axioms of set theory, and is easier to use in usual mathematics.

Let $S$ be a partially ordered set. A chain in $S$ is a subset that is totally ordered under the induced order. Zorn's lemma states that, if every chain in $S$ has an upper bound in $S$, then $S$ has (at least) a maximal element, that is, an element that is not smaller than another element of $S$.

In most uses of Zorn's lemma, $S$ is a set of sets, the order is set inclusion, and the upper bound of a chain is taken as the union of its members.

An example of the use of Zorn's lemma is the proof that every vector space has a basis. Here the elements of $S$ are linearly independent subsets of the vector space. The union of a chain of elements of $S$ is linearly independent, since an infinite set is linearly independent if and only if each finite subset is, and every finite subset of the union of a chain must be included in a member of the chain. So, there exists a maximal linearly independent set. This linearly independent set must span the vector space because of maximality, and is therefore a basis.

Another classical use of Zorn's lemma is the proof that every proper ideal of a ring (that is, an ideal that is not the whole ring) is contained in a maximal ideal. Here, $S$ is the set of the proper ideals containing the given ideal. The union of a chain of ideals is an ideal, since the axioms of an ideal involve a finite number of elements. The union of a chain of proper ideals is a proper ideal, since otherwise $1$ would belong to the union, and this implies that it would belong to a member of the chain.

=== Transfinite induction ===

The axiom of choice is equivalent to the fact that a well-order can be defined on every set, where a well-order is a total order such that every nonempty subset has a least element.

Simple examples of well-ordered sets are the natural numbers (with the natural order), and, for every $n$, the set of the $n$-tuples of natural numbers, with the lexicographic order.

Well-orders allow a generalization of mathematical induction, which is called transfinite induction. Given a property (predicate) $P(n)$ depending on a natural number, mathematical induction is the fact that for proving that $P(n)$ is always true, it suffices to prove that for every $n$,
$(\forall m\; (m<n \implies P(m))) \implies P(n).$
Transfinite induction is the same, replacing natural numbers by the elements of a well-ordered set.

Often, a proof by transfinite induction is easier if three cases are proved separately, the first two cases being the same as for usual induction:
- $P(0)$ is true, where $0$ denotes the least element of the well-ordered set
- $P(x) \implies P(S(x))$, where $S(x)$ denotes the successor of $x$, that is the least element that is greater than $x$
- $(\forall y;\; y < x \implies P(y)) \implies P(x)$, when $x$ is not a successor.

Transfinite induction is fundamental for defining ordinal numbers and cardinal numbers.

== See also ==
- Algebra of sets
- Alternative set theory
- Category of sets
- Class (set theory)
- Family of sets
- Fuzzy set
- Mathematical logic
- Mereology
- Principia Mathematica
- Set theory
