# Relativistic wave equations

"Relativistic quantum field equations" redirects to here.

In physics, specifically relativistic quantum mechanics (RQM) and its applications to particle physics, relativistic wave equations predict the behavior of particles at high energies and velocities comparable to the speed of light. In the context of quantum field theory (QFT), the equations determine the dynamics of quantum fields.

The solutions to the equations, universally denoted as ψ or Ψ (Greek psi), are referred to as "wavefunctions" in the context of RQM, and "fields" in the context of QFT. The equations themselves are called "wave equations" or "field equations", because they have the mathematical form of a wave equation or are generated from a Lagrangian density and the field-theoretic Euler–Lagrange equations (see classical field theory for background).

In the Schrödinger picture, the wavefunction or field is the solution to the Schrödinger equation;

$i\hbar\frac{\partial}{\partial t}\psi = \hat{H} \psi$

one of the postulates of quantum mechanics. All relativistic wave equations can be constructed by specifying various forms of the Hamiltonian operator Ĥ describing the quantum system. Alternatively, Feynman's path integral formulation uses a Lagrangian rather than a Hamiltonian operator.

More generally - the modern formalism behind relativistic wave equations is Lorentz group theory, wherein the spin of the particle has a correspondence with the representations of the Lorentz group.[1]

## History

### Early 1920s: Classical and quantum mechanics

The failure of classical mechanics applied to molecular, atomic, and nuclear systems and smaller induced the need for a new mechanics: quantum mechanics. The mathematical formulation was led by De Broglie, Bohr, Schrödinger, Pauli, and Heisenberg, and others, around the mid-1920s, and at that time was analogous to that of classical mechanics. The Schrödinger equation and the Heisenberg picture resemble the classical equations of motion in the limit of large quantum numbers and as the reduced Planck constant ħ, the quantum of action, tends to zero. This is the correspondence principle. At this point, special relativity was not fully combined with quantum mechanics, so the Schrödinger and Heisenberg formulations, as originally proposed, could not be used in situations where the particles travel near the speed of light, or when the number of each type of particle changes (this happens in real particle interactions; the numerous forms of particle decays, annihilation, matter creation, pair production, and so on).

### Late 1920s: Relativistic quantum mechanics of spin-0 and spin-½ particles

A description of quantum mechanical systems which could account for relativistic effects was sought for by many theoretical physicists; from the late 1920s to the mid-1940s.[2] The first basis for relativistic quantum mechanics, i.e. special relativity applied with quantum mechanics together, was found by all those who discovered what is frequently called the Klein–Gordon equation:

$-\hbar^2\frac{\partial^2 \psi}{\partial t^2} +(\hbar c)^2\nabla^2\psi = (mc^2)^2\psi \,,$

(1)

by inserting the energy operator and momentum operator into the relativistic energy–momentum relation:

$E^2 - (pc)^2 = (mc^2)^2\,,$

(2)

The solutions to (1) are scalar fields. The KG equation is undesirable due to its prediction of negative energies and probabilities, as a result of the quadratic nature of (2) - inevitable in a relativistic theory. This equation was initially proposed by Schrödinger, and he discarded it for such reasons, only to realize a few months later that its non-relativistic limit (what is now called the Schrödinger equation) was still of importance. Nevertheless - (1) is applicable to spin-0 bosons.[3]

Neither the non-relativistic nor relativistic equations found by Schrödinger could predict the hyperfine structure in the Hydrogen spectral series. The mysterious underlying property was spin. The first two-dimensional spin matrices (better known as the Pauli matrices) were introduced by Pauli in the Pauli equation; the Schrödinger equation with a non-relativistic Hamiltonian including an extra term for particles in magnetic fields, but this was phenomological. Weyl found a relativistic equation in terms of the Pauli matrices; the Weyl equation, for massless spin-½ fermions. The problem was resolved by Dirac in the late 1920s, when he furthered the application of equation (2) to the electron - by various manipulations he factorized the equation into the form:

$\left(\frac{E}{c} - \boldsymbol{\alpha}\cdot\mathbf{p} - \beta mc \right)\left(\frac{E}{c} + \boldsymbol{\alpha}\cdot\mathbf{p} + \beta mc \right)\psi=0 \,,$

(3A)

and one of these factors is the Dirac equation (see below), upon inserting the energy and momentum operators. For the first time, this introduced new four-dimensional spin matrices α and β in a relativistic wave equation, and explained the hyperfine structure of hydrogen. The solutions to (3A) are multi-component spinor fields, and each component satisfies (1). A remarkable result of spinor solutions is that half of the components describe a particle, while the other half describe an antiparticle; in this case the electron and positron. The Dirac equation is now known to apply for all massive spin-½ fermions. In the non-relativistic limit, the Pauli equation is recovered, while the massless case results in the Weyl equation.

Although a landmark in quantum theory, the Dirac equation is only true for spin-½ fermions, and still predicts negative energy solutions, which caused controversy at the time (in particular - not all physicists were comfortable with the "Dirac sea" of negative energy states).

### 1930s–1960s: Relativistic quantum mechanics of higher-spin particles

The natural problem became clear: to generalize the Dirac equation to particles with any spin; both fermions and bosons, and in the same equations their antiparticles (possible because of the spinor formalism introduced by Dirac in his equation, and then-recent developments in spinor calculus by van der Waerden in 1929), and ideally with positive energy solutions.[2]

This was introduced and solved by Majorana in 1932, by a deviated approach to Dirac. Majorana considered one "root" of (3A):

$\left(\frac{E}{c} + \boldsymbol{\alpha}\cdot\mathbf{p} - \beta mc \right)\psi=0 \,,$

(3B)

where ψ is a spinor field now with infinitely many components, irreducible to a finite number of tensors or spinors, to remove the indeterminacy in sign. The matrices α and β are infinite-dimensional matrices, related to infinitesimal Lorentz transformations. He did not demand that each component of to satisfy equation (2), instead he regenerated the equation using a Lorentz-invariant action, via the principle of least action, and application of Lorentz group theory.[4][5]

Majorana produced other important contributions that were unpublished, including wave equations of various dimensions (5, 6, and 16). They were anticipated later (in a more involved way) by de Broglie (1934), and Duffin, Kemmer, and Petiau (around 1938–1939), see Duffin–Kemmer–Petiau algebra. The Dirac–Fierz–Pauli formalism was more sophisticated than Majorana’s, as spinors were new mathematical tools in the early twentieth century, although Majorana’s paper of 1932 was difficult to fully understand; it took Pauli and Wigner some time to understand it, around 1940.[2]

Dirac in 1936, and Fierz and Pauli in 1939, built equations from irreducible spinors A and B, symmetric in all indices, for a massive particle of spin n + ½ for integer n (see Van der Waerden notation for the meaning of the dotted indices):

$p_{\gamma\dot{\alpha}}A_{\epsilon_1\epsilon_2\cdots\epsilon_n}^{\dot{\alpha}\dot{\beta}_1\dot{\beta}_2\cdots\dot{\beta}_n} = mcB_{\gamma\epsilon_1\epsilon_2\cdots\epsilon_n}^{\dot{\beta}_1\dot{\beta}_2\cdots\dot{\beta}_n}$

(4A)

$p^{\gamma\dot{\alpha}}B_{\gamma\epsilon_1\epsilon_2\cdots\epsilon_n}^{\dot{\beta}_1\dot{\beta}_2\cdots\dot{\beta}_n} = mcA_{\epsilon_1\epsilon_2\cdots\epsilon_n}^{\dot{\alpha}\dot{\beta}_1\dot{\beta}_2\cdots\dot{\beta}_n}$

(4B)

where p is the momentum as a covariant spinor operator. For n = 0, the equations reduce to the coupled Dirac equations and A and B together transform as the original Dirac spinor. Eliminating either A or B shows that A and B each fulfill (1).[2]

In 1941, Rarita and Schwinger focussed on spin-32 particles and derived the Rarita–Schwinger equation, including a Lagrangian to generate it, and later generalized the equations analogous to spin n + ½ for integer n. In 1945, Pauli suggested Majorana's 1932 paper to Bhabha, who returned to the general ideas introduced by Majorana in 1932. Bhabha and Lubanski proposed a completely general set of equations by replacing the mass terms in (3A) and (3B) by an arbitrary constant, subject to a set of conditions which the wavefunctions must obey.[6]

Finally, in the year 1948 (the same year as Feynman's path integral formulation was cast), Bargmann and Wigner formulated the general equation for massive particles which could have any spin, by considering the Dirac equation with a totally symmetric finite-component spinor, and using Lorentz group theory (as Majorana did): the Bargmann–Wigner equations. [7][2] In the early 1960s, a reformulation of the Bargmann–Wigner equations was made by H. Joos and Steven Weinberg. Various theorists at this time did further research in relativistic Hamiltonians for higher spin particles.[8][1][9]

### 1960s–Present

The relativistic description of spin particles has been a difficult problem in quantum theory. It is still an area of the present-day research, because the problem is only partially solved; including interactions in the equations is problematic, and paradoxical predictions (even from the Dirac equation) are still present.[5]

## Linear equations

Further information: Linear differential equation

The following equations have solutions which satisfy the superposition principle, that is, the wavefunctions are additive.

Throughout, the standard conventions of tensor index notation and Feynman slash notation are used, including Greek indices which take the values 1, 2, 3 for the spatial components and 0 for the timelike component of the indexed quantities. The wavefunctions are denoted ψ, and μ are the components of the four-gradient operator.

In matrix equations, the Pauli matrices are denoted by σμ in which μ = 0, 1, 2, 3, where σ0 is the 2 × 2 identity matrix:

$\sigma^0 = \begin{pmatrix} 1&0 \\ 0&1 \\ \end{pmatrix}$

and the other matrices have their usual representations. The expression

$\sigma^\mu \partial_\mu \equiv \sigma^0 \partial_0 + \sigma^1 \partial_1 + \sigma^2 \partial_2 + \sigma^3 \partial_3$

is a 2 × 2 matrix operator which acts on 2-component spinor fields.

The gamma matrices are denoted by γμ, in which again μ = 0, 1, 2, 3, and there are a number of representations to select from. The matrix γ0 is not necessarily the 4 × 4 identity matrix. The expression

$i\hbar \gamma^\mu \partial_\mu + mc \equiv i\hbar(\gamma^0 \partial_0 + \gamma^1 \partial_1 + \gamma^2 \partial_2 + \gamma^3 \partial_3) + mc \begin{pmatrix}1&0&0&0\\ 0&1&0&0 \\ 0&0&1&0 \\ 0&0&0&1 \end{pmatrix}$

is a 4 × 4 matrix operator which acts on 4-component spinor fields.

Note that terms such as "mc" scalar multiply an identity matrix of the relevant dimension, the common sizes are 2 × 2 or 4 × 4, and are conventionally not written for simplicity.

Particle spin quantum number s Name Equation Typical particles the equation describes
0 Klein–Gordon equation $(\hbar \partial_{\mu} + imc)(\hbar \partial^{\mu} -imc)\psi = 0$ Massless or massive spin-0 particle (such as Higgs bosons).
1/2 Weyl equation $\sigma^\mu\partial_\mu \psi=0$ Massless spin-1/2 particles.
Dirac equation $\left( i \hbar \partial\!\!\!/ - m c \right) \psi = 0$ Massive spin-1/2 particles (such as electrons).
Two-body Dirac equations $[(\gamma_1)_\mu (p_1-\tilde{A}_1)^\mu+m_1 + \tilde{S}_1]\Psi=0,$

$[(\gamma_2)_\mu (p_2-\tilde{A}_2)^\mu+m_2 + \tilde{S}_2]\Psi=0.$

Massive spin-1/2 particles (such as electrons).
Majorana equation $i \hbar \partial\!\!\!/ \psi - m c \psi_c = 0$ Massive Majorana particles.
Breit equation $i\hbar\frac{\partial \Psi}{\partial t} = \left(\sum_{i}\hat{H}_{D}(i) + \sum_{i>j}\frac{1}{r_{ij}} - \sum_{i>j}\hat{B}_{ij} \right) \Psi$ Two massive spin-1/2 particles (such as electrons) interacting electromagnetically to first order in perturbation theory.
1 Maxwell equations (in QED using the Lorenz gauge) $\partial_\mu\partial^\mu A^\nu = e \overline{\psi} \gamma^\nu \psi$ Photons, massless spin-1 particles.
Proca equation $\partial_\mu(\partial^\mu A^\nu - \partial^\nu A^\mu)+\left(\frac{mc}{\hbar}\right)^2 A^\nu=0$ Massive spin-1 particle (such as W and Z bosons).
3/2 Rarita–Schwinger equation $\epsilon^{\mu \nu \rho \sigma} \gamma^5 \gamma_\nu \partial_\rho \psi_\sigma + m\psi^\mu = 0$ Massive spin-3/2 particles.
s Bargmann–Wigner equations
$(-i\hbar \gamma^\mu \partial_\mu + mc)_{\alpha_1 \alpha_1'}\psi_{\alpha'_1 \alpha_2 \alpha_3 \cdots \alpha_{2s}} = 0$

$(-i\hbar \gamma^\mu \partial_\mu + mc)_{\alpha_2 \alpha_2'}\psi_{\alpha_1 \alpha'_2 \alpha_3 \cdots \alpha_{2s}} = 0$

$\qquad \vdots$

$(-i\hbar \gamma^\mu \partial_\mu + mc)_{\alpha_{2s} \alpha'_{2s}}\psi_{\alpha_1 \alpha_2 \alpha_3 \cdots \alpha'_{2s}} = 0$

where ψ is a rank-2s 4-component spinor.

Free particles of arbitrary spin (bosons and fermions).[10][11]

### Gauge fields

The Duffin–Kemmer–Petiau equation is an alternative equation for spin-0 and spin-1 particles:

$(i \hbar \beta^{a} \partial_a - m c) \psi = 0$

## Non-linear equations

There are equations which have solutions that do not satisfy the superposition principle.

### Spin 2

$R_{\mu \nu} - {1 \over 2}g_{\mu \nu}\,R + g_{\mu \nu} \Lambda = {8 \pi G \over c^4} T_{\mu \nu}$
The solution is a metric tensor field, rather than a wavefunction.

## References

### Notes

1. ^ a b T Jaroszewicz, P.S Kurzepa (1992). "Geometry of spacetime propagation of spinning particles". Annals of Physics (California, USA).
2. S. Esposito (2011). "Searching for an equation: Dirac, Majorana and the others". arXiv:1110.6878.
3. ^ B. R. Martin, G.Shaw (2008). Particle Physics. Manchester Physics Series (3rd ed.). John Wiley & Sons. p. 3. ISBN 978-0-470-03294-7.
4. ^ R. Casalbuoni (2006). "Majorana and the Infinite Component Wave Equations". Firenze, Italy. arXiv:hep-th/0610252v1.
5. ^ a b X. Bekaert, M.R. Traubenberg, M. Valenzuela (2009). "An infinite supermultiplet of massive higher-spin fields". arXiv:0904.2533v4.
6. ^ R.K. Loide, I. Ots, R. Saar (1997). "Bhabha relativistic wave equations".
7. ^ Bargmann, V.; Wigner, E. P. (1948). "Group theoretical discussion of relativistic wave equations". Proc. Natl. Acad. Sci. U.S.A. 34 (5): 211–23. Bibcode:1948PNAS...34..211B. doi:10.1073/pnas.34.5.211.
8. ^ E.A. Jeffery (1978). "Component Minimization of the Bargman–Wigner wavefunction". Australian Journal of Physics (Melbourne: CSIRO).
9. ^ R.F Guertin (1974). "Relativistic hamiltonian equations for any spin". Annals of Physics (Texas, USA).
10. ^ E.A. Jeffery (1978). "Component Minimization of the Bargman–Wigner wavefunction". Australian Journal of Physics (Melbourne: CSIRO).
11. ^ R.Clarkson, D.G.C. McKeon (2003). "Quantum Field Theory". pp. 61–69.