# Spin–statistics theorem

Jump to navigation Jump to search

In quantum mechanics, the spin–statistics theorem relates the intrinsic spin of a particle (angular momentum not due to the orbital motion) to the particle statistics it obeys. In units of the reduced Planck constant ħ, all particles that move in 3 dimensions have either integer spin or half-integer spin.[1][2]

## Background

### Quantum states and indistinguishable particles

In a quantum system, a physical state is described by a state vector. A pair of distinct state vectors are physically equivalent if their absolute value is equal, ignoring other interactions. A pair of indistinguishable particles such as this have only one state. This means that if the positions of the particles are exchanged (i.e., they undergo a permutation), this does not identify a new physical state, but rather one matching the original physical state. In fact, one cannot tell which particle is in which position.

While the physical state does not change under the exchange of the particles' positions, it is possible for the state vector to change sign as a result of an exchange. Since this does not change the absolute value of the state vector, this does not affect the physical state.

The essential ingredient in proving the spin/statistics relation is relativity, that the physical laws do not change under Lorentz transformations. The field operators transform under Lorentz transformations according to the spin of the particle that they create, by definition.

Additionally, the assumption (known as microcausality) that spacelike separated fields either commute or anticommute can be made only for relativistic theories with a time direction. Otherwise, the notion of being spacelike is meaningless. However, the proof involves looking at a Euclidean version of spacetime, in which the time direction is treated as a spatial one, as will be now explained.

Lorentz transformations include 3-dimensional rotations as well as boosts. A boost transfers to a frame of reference with a different velocity, and is mathematically like a rotation into time. By analytic continuation of the correlation functions of a quantum field theory, the time coordinate may become imaginary, and then boosts become rotations. The new "spacetime" has only spatial directions and is termed Euclidean.

### Exchange symmetry

Bosons are particles whose wavefunction is symmetric under such an exchange or permutation, so if we swap the particles the wavefunction does not change. Fermions are particles whose wavefunction is antisymmetric, so under such a swap the wavefunction gets a minus sign, meaning that the amplitude for two identical fermions to occupy the same state must be zero. This is the Pauli exclusion principle: two identical fermions cannot occupy the same state. This rule does not hold for bosons.

In quantum field theory, a state or a wavefunction is described by field operators operating on some basic state called the vacuum. In order for the operators to project out the symmetric or antisymmetric component of the creating wavefunction, they must have the appropriate commutation law. The operator

${\displaystyle \iint \psi (x,y)\phi (x)\phi (y)\,dx\,dy}$

(with ${\displaystyle \phi }$ an operator and ${\displaystyle \psi (x,y)}$ a numerical function) creates a two-particle state with wavefunction ${\displaystyle \psi (x,y)}$, and depending on the commutation properties of the fields, either only the antisymmetric parts or the symmetric parts matter.

Let us assume that ${\displaystyle x\neq y}$ and the two operators take place at the same time; more generally, they may have spacelike separation, as is explained hereafter.

If the fields commute, meaning that the following holds:

${\displaystyle \phi (x)\phi (y)=\phi (y)\phi (x)}$,

then only the symmetric part of ${\displaystyle \psi }$ contributes, so that ${\displaystyle \psi (x,y)=\psi (y,x)}$, and the field will create bosonic particles.

On the other hand, if the fields anti-commute, meaning that ${\displaystyle \phi }$ has the property that

${\displaystyle \phi (x)\phi (y)=-\phi (y)\phi (x),}$

then only the antisymmetric part of ${\displaystyle \psi }$ contributes, so that ${\displaystyle \psi (x,y)=-\psi (y,x)}$, and the particles will be fermionic.

Naively, neither has anything to do with the spin, which determines the rotation properties of the particles, not the exchange properties.

### Spin–statistics relation

The spin–statistics relation was first formulated in 1939 by Markus Fierz[3] and was rederived in a more systematic way by Wolfgang Pauli.[4] Fierz and Pauli argued their result by enumerating all free field theories subject to the requirement that there be quadratic forms for locally commuting[clarification needed] observables including a positive-definite energy density. A more conceptual argument was provided by Julian Schwinger in 1950. Richard Feynman gave a demonstration by demanding unitarity for scattering as an external potential is varied,[5] which when translated to field language is a condition on the quadratic operator that couples to the potential.[6]

### Theorem statement

The theorem states that:

• The wave function of a system of identical integer-spin particles has the same value when the positions of any two particles are swapped. Particles with wave functions symmetric under exchange are called bosons.
• The wave function of a system of identical half-integer–spin particles changes sign when two particles are swapped. Particles with wave functions antisymmetric under exchange are called fermions.

In other words, the spin–statistics theorem states that integer-spin particles are bosons, while half-integer–spin particles are fermions.

## General discussion

### A suggestive bogus argument

Consider the two-field operator product

${\displaystyle R(\pi )\phi (x)\phi (-x),}$

where R is the matrix that rotates the spin polarization of the field by 180 degrees when one does a 180-degree rotation around some particular axis. The components of ${\displaystyle \phi }$ are not shown in this notation. ${\displaystyle \phi }$ has many components, and the matrix R mixes them up with one another.

In a non-relativistic theory, this product can be interpreted as annihilating two particles at positions ${\displaystyle x}$ and ${\displaystyle -x}$ with polarizations that are rotated by ${\displaystyle \pi }$ relative to each other. Now rotate this configuration by ${\displaystyle \pi }$ around the origin. Under this rotation, the two points ${\displaystyle x}$ and ${\displaystyle -x}$ switch places, and the two field polarizations are additionally rotated by a ${\displaystyle \pi }$. So we get

${\displaystyle R(2\pi )\phi (-x)R(\pi )\phi (x),}$

which for integer spin is equal to

${\displaystyle \phi (-x)R(\pi )\phi (x)}$

and for half-integer spin is equal to

${\displaystyle -\phi (-x)R(\pi )\phi (x)}$

(proved at Spin (physics) § Rotations). Both the operators ${\displaystyle \pm \phi (-x)R(\pi )\phi (x)}$ still annihilate two particles at ${\displaystyle x}$ and ${\displaystyle -x}$. Hence we claim to have shown that, with respect to particle states:

${\displaystyle R(\pi )\phi (x)\phi (-x)={\begin{cases}\phi (-x)R(\pi )\phi (x)&{\text{ for integral spins}},\\-\phi (-x)R(\pi )\phi (x)&{\text{ for half-integral spins}}.\end{cases}}}$

So exchanging the order of two appropriately polarized operator insertions into the vacuum can be done by a rotation, at the cost of a sign in the half-integer case.

This argument by itself does not prove anything like the spin–statistics relation. To see why, consider a nonrelativistic spin-0 field described by a free Schrödinger equation. Such a field can be anticommuting or commuting. To see where it fails, consider that a nonrelativistic spin-0 field has no polarization, so that the product above is simply:

${\displaystyle \phi (-x)\phi (x).}$

In the nonrelativistic theory, this product annihilates two particles at ${\displaystyle x}$ and ${\displaystyle -x}$, and has zero expectation value in any state. In order to have a nonzero matrix element, this operator product must be between states with two more particles on the right than on the left:

${\displaystyle \langle 0|\phi (-x)\phi (x)|\psi \rangle .}$

Performing the rotation, all that we learn is that rotating the 2-particle state ${\displaystyle |\psi \rangle }$ gives the same sign as changing the operator order. This gives no additional information, so this argument does not prove anything.

### Why the bogus argument fails

To prove the spin–statistics theorem, it is necessary to use relativity, as is obvious from the consistency of the nonrelativistic spinless fermion, and the nonrelativistic spinning bosons. There are claims in the literature of proofs of the spin–statistics theorem that do not require relativity,[7][8] but they are not proofs of a theorem, as the counterexamples show, rather they are arguments for why spin–statistics is "natural", while wrong-statistics[clarification needed] is "unnatural". In relativity, the connection is required.

In relativity, there are no local fields that are pure creation operators or annihilation operators. Every local field both creates particles and annihilates the corresponding antiparticle. This means that in relativity, the product of the free real spin-0 field has a nonzero vacuum expectation value, because in addition to creating particles which are not annihilated and annihilating particles which are not subsequently created, it also includes a part that creates and annihilates "virtual" particles whose existence enters into interaction calculations – but never as scattering matrix indices or asymptotic states.

${\displaystyle G(x)=\langle 0|\phi (-x)\phi (x)|0\rangle .}$

And now the heuristic argument can be used to see that ${\displaystyle G(x)}$ is equal to ${\displaystyle G(-x)}$, which tells us that the fields cannot be anti-commuting.

## Proof

A π rotation in the Euclidean xt plane can be used to rotate vacuum expectation values of the field product of the previous section. The time rotation turns the argument of the previous section into the spin–statistics theorem.

The proof requires the following assumptions:

1. The theory has a Lorentz-invariant Lagrangian.
2. The vacuum is Lorentz-invariant.
3. The particle is a localized excitation. Microscopically, it is not attached to a string or domain wall.
4. The particle is propagating, meaning that it has a finite, not infinite, mass.
5. The particle is a real excitation, meaning that states containing this particle have a positive-definite norm.

These assumptions are for the most part necessary, as the following examples show:

1. The spinless anticommuting field shows that spinless fermions are nonrelativistically consistent. Likewise, the theory of a spinor commuting field shows that spinning bosons are too.
2. This assumption may be weakened.
3. In 2+1 dimensions, sources for the Chern–Simons theory can have exotic spins, despite the fact that the three-dimensional rotation group has only integer and half-integer spin representations.
4. An ultralocal field can have either statistics independently of its spin. This is related to Lorentz invariance, since an infinitely massive particle is always nonrelativistic, and the spin decouples from the dynamics. Although colored quarks are attached to a QCD string and have infinite mass, the spin–statistics relation for quarks can be proved in the short distance limit.
5. Gauge ghosts are spinless fermions, but they include states of negative norm.

Assumptions 1 and 2 imply that the theory is described by a path integral, and assumption 3 implies that there is a local field which creates the particle.

The rotation plane includes time, and a rotation in a plane involving time in the Euclidean theory defines a CPT transformation in the Minkowski theory. If the theory is described by a path integral, a CPT transformation takes states to their conjugates, so that the correlation function

${\displaystyle \langle 0|R\phi (x)\phi (-x)|0\rangle }$

must be positive definite at x=0 by assumption 5, the particle states have positive norm. The assumption of finite mass implies that this correlation function is nonzero for x spacelike. Lorentz invariance now allows the fields to be rotated inside the correlation function in the manner of the argument of the previous section:

${\displaystyle \langle 0|RR\phi (x)R\phi (-x)|0\rangle =\pm \langle 0|\phi (-x)R\phi (x)|0\rangle }$

Where the sign depends on the spin, as before. The CPT invariance, or Euclidean rotational invariance, of the correlation function guarantees that this is equal to G(x). So

${\displaystyle \langle 0|(R\phi (x)\phi (y)-\phi (y)R\phi (x))|0\rangle =0}$

for integer-spin fields and

${\displaystyle \langle 0|R\phi (x)\phi (y)+\phi (y)R\phi (x)|0\rangle =0}$

for half-integer–spin fields.

Since the operators are spacelike separated, a different order can only create states that differ by a phase. The argument fixes the phase to be −1 or 1 according to the spin. Since it is possible to rotate the space-like separated polarizations independently by local perturbations, the phase should not depend on the polarization in appropriately chosen field coordinates.

This argument is due to Julian Schwinger.[9]

## Consequences

### Fermionic fields

The spin–statistics theorem implies that half-integer–spin particles are subject to the Pauli exclusion principle, while integer-spin particles are not. Only one fermion can occupy a given quantum state at any time, while the number of bosons that can occupy a quantum state is not restricted. The basic building blocks of matter such as protons, neutrons, and electrons are fermions. Particles such as the photon, which mediate forces between matter particles, are bosons.

The Fermi–Dirac distribution describing fermions leads to interesting properties. Since only one fermion can occupy a given quantum state, the lowest single-particle energy level for spin-1/2 fermions contains at most two particles, with the spins of the particles oppositely aligned. Thus, even at absolute zero, a system of more than two fermions in this case still has a significant amount of energy. As a result, such a fermionic system exerts an outward pressure. Even at non-zero temperatures, such a pressure can exist. This degeneracy pressure is responsible for keeping certain massive stars from collapsing due to gravity. See white dwarf, neutron star, and black hole.

### Bosonic fields

There are a couple of interesting phenomena arising from the two types of statistics. The Bose–Einstein distribution which describes bosons leads to Bose–Einstein condensation. Below a certain temperature, most of the particles in a bosonic system will occupy the ground state (the state of lowest energy). Unusual properties such as superfluidity can result.

### Ghost fields

Ghost fields do not obey the spin–statistics relation. See Klein transformation on how to patch up a loophole in the theorem.

## Relation to representation theory of the Lorentz group

The Lorentz group has no non-trivial unitary representations of finite dimension. Thus it seems impossible to construct a Hilbert space in which all states have finite, non-zero spin and positive, Lorentz-invariant norm. This problem is overcome in different ways depending on particle spin–statistics.

For a state of integer spin the negative norm states (known as "unphysical polarization") are set to zero, which makes the use of gauge symmetry necessary.

For a state of half-integer spin the argument can be circumvented by having fermionic statistics.[10]

## Limitations: anyons in 2 dimensions

In 1982, physicist Frank Wilczek published a research paper on the possibilities of possible fractional-spin particles, which he termed anyons from their ability to take on "any" spin.[11] He wrote that they were theoretically predicted to arise in low-dimensional systems where motion is restricted to fewer than three spatial dimensions. Wilczek described their spin statistics as "interpolating continuously between the usual boson and fermion cases".[11] Evidence for the existence of anyons has been presented experimentally from 1985 through 2013,[12][13] although it is not considered definitively established that all proposed kinds of anyons exist. Anyons are related to braid symmetry and topological states of matter.

## References

1. ^ Dirac, Paul Adrien Maurice (1981-01-01). The Principles of Quantum Mechanics. Clarendon Press. p. 149. ISBN 9780198520115.
2. ^ Pauli, Wolfgang (1980-01-01). General principles of quantum mechanics. Springer-Verlag. ISBN 9783540098423.
3. ^ Markus Fierz (1939). "Über die relativistische Theorie kräftefreier Teilchen mit beliebigem Spin". Helvetica Physica Acta. 12 (1): 3–37. Bibcode:1939AcHPh..12....3F. doi:10.5169/seals-110930.
4. ^ Wolfgang Pauli (15 October 1940). "The Connection Between Spin and Statistics" (PDF). Physical Review. 58 (8): 716–722. Bibcode:1940PhRv...58..716P. doi:10.1103/PhysRev.58.716.
5. ^ Richard Feynman (1961). Quantum Electrodynamics. Basic Books. ISBN 978-0-201-36075-2.
6. ^ Wolfgang Pauli (1950). "On the Connection Between Spin and Statistics". Progress of Theoretical Physics. 5 (4): 526–543. Bibcode:1950PThPh...5..526P. doi:10.1143/ptp/5.4.526.
7. ^ Jabs, Arthur (5 April 2002). "Connecting Spin and Statistics in Quantum Mechanics". Foundations of Physics. 40 (7): 776–792. arXiv:0810.2399. Bibcode:2010FoPh...40..776J. doi:10.1007/s10701-009-9351-4.
8. ^ Horowitz, Joshua (14 April 2009). "From Path Integrals to Fractional Quantum Statistics" (PDF). Cite journal requires |journal= (help)
9. ^ Julian Schwinger (June 15, 1951). "The Quantum Theory of Fields I". Physical Review. 82 (6): 914–917. Bibcode:1951PhRv...82..914S. doi:10.1103/PhysRev.82.914.. The only difference between the argument in this paper and the argument presented here is that the operator "R" in Schwinger's paper is a pure time reversal, instead of a CPT operation, but this is the same for CP invariant free field theories which were all that Schwinger considered.
10. ^ Peskin, Michael E.; Schroeder, Daniel V. (1995). An Introduction to Quantum Field Theory. Addison-Wesley. ISBN 0-201-50397-2.
11. ^ a b Wilczek, Frank (4 October 1982). "Quantum Mechanics of Fractional-Spin Particles" (PDF). Physical Review Letters. 49 (14): 957–959. Bibcode:1982PhRvL..49..957W. doi:10.1103/PhysRevLett.49.957.
12. ^ Camino, Fernando E.; Zhou, Wei; Goldman, Vladimir J. (17 August 2005). "Realization of a Laughlin quasiparticle interferometer: Observation of fractional statistics" (PDF). Physical Review B. 72 (7): 075342. arXiv:cond-mat/0502406. Bibcode:2005PhRvB..72g5342C. doi:10.1103/PhysRevB.72.075342. Archived from the original (PDF) on 19 June 2015. Cite uses deprecated parameter |deadurl= (help), see fig. 2.B
13. ^ R. L. Willett; C. Nayak; L. N. Pfeiffer; K. W. West (12 January 2013). "Magnetic field-tuned Aharonov–Bohm oscillations and evidence for non-Abelian anyons at ν = 5/2". Physical Review Letters. 111 (18): 186401. arXiv:1301.2639. Bibcode:2013PhRvL.111r6401W. doi:10.1103/PhysRevLett.111.186401.