
History of Lorentz transformations

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by John of Reading (talk | contribs) at 13:44, 7 August 2020 (Typo/general fixes, replaced: betwenn → between, on November 1844 → in November 1844, ]]’s → ]]'s). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

The history of Lorentz transformations comprises the development of linear transformations forming the Lorentz group or Poincaré group preserving the Lorentz interval $-x_0^2+\cdots+x_n^2$ and the Minkowski inner product $-x_0y_0+\cdots+x_ny_n$.

In mathematics, transformations equivalent to what was later known as Lorentz transformations in various dimensions were discussed in the 19th century in relation to the theory of quadratic forms, hyperbolic geometry, Möbius geometry, and sphere geometry, which is connected to the fact that the group of motions in hyperbolic space, the Möbius group or projective special linear group, and the Laguerre group are isomorphic to the Lorentz group.

In physics, Lorentz transformations became known at the beginning of the 20th century, when it was discovered that they exhibit the symmetry of Maxwell's equations. Subsequently, they became fundamental to all of physics, because they formed the basis of special relativity in which they exhibit the symmetry of Minkowski spacetime, making the speed of light invariant between different inertial frames. They relate the spacetime coordinates of two arbitrary inertial frames of reference with constant relative speed v. In one frame, the position of an event is given by x,y,z and time t, while in the other frame the same event has coordinates x′,y′,z′ and t′.

Overview

Most general Lorentz transformations

The general quadratic form q(x) with coefficients of a symmetric matrix A, the associated bilinear form b(x,y), and the linear transformations of q(x) and b(x,y) into q(x′) and b(x′,y′) using the transformation matrix g, can be written as[1]

(Q1)

The case n=1 is the binary quadratic form introduced by Lagrange (1773) and Gauss (1798/1801), n=2 is the ternary quadratic form introduced by Gauss (1798/1801), n=3 is the quaternary quadratic form etc.

The general Lorentz transformation follows from (Q1) by setting A=A′=diag(-1,1,...,1) and det g=±1. It forms an indefinite orthogonal group called the Lorentz group O(1,n), while the case det g=+1 forms the proper Lorentz group SO(1,n). The quadratic form q(x) becomes the Lorentz interval in terms of an indefinite quadratic form of Minkowski space (being a special case of pseudo-Euclidean space), and the associated bilinear form b(x,y) becomes the Minkowski inner product:[2][3]

(1a)
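The defining condition of (1a) can be checked numerically. The following is a minimal sketch, assuming the convention x′ = g·x, so that invariance of the interval is equivalent to gᵀ·A·g = A with A = diag(-1,1,...,1). The helper names (`minkowski_metric`, `is_lorentz`, `interval`, `apply`) and the sample matrix are illustrative assumptions, not taken from the cited sources.

```python
import math

def minkowski_metric(n):
    """Return A = diag(-1, 1, ..., 1) as an (n+1) x (n+1) nested list."""
    return [[(-1.0 if i == 0 else 1.0) if i == j else 0.0
             for j in range(n + 1)] for i in range(n + 1)]

def matmul(p, q):
    """Plain matrix product of two nested lists."""
    return [[sum(p[i][k] * q[k][j] for k in range(len(q)))
             for j in range(len(q[0]))] for i in range(len(p))]

def transpose(p):
    return [list(row) for row in zip(*p)]

def is_lorentz(g, tol=1e-12):
    """Check the assumed defining condition g^T A g = A."""
    a = minkowski_metric(len(g) - 1)
    lhs = matmul(matmul(transpose(g), a), g)
    return all(abs(lhs[i][j] - a[i][j]) <= tol
               for i in range(len(g)) for j in range(len(g)))

def interval(x):
    """Lorentz interval -x0^2 + x1^2 + ... + xn^2."""
    return -x[0] ** 2 + sum(c ** 2 for c in x[1:])

def apply(g, x):
    """Compute x' = g x."""
    return [sum(g[i][j] * x[j] for j in range(len(x))) for i in range(len(g))]

# Illustrative example for n = 2: a boost along x1 followed by a rotation
# in the (x1, x2) plane. The parameter values are arbitrary.
eta, phi = 0.7, 0.4
boost = [[math.cosh(eta), -math.sinh(eta), 0.0],
         [-math.sinh(eta), math.cosh(eta), 0.0],
         [0.0, 0.0, 1.0]]
rotation = [[1.0, 0.0, 0.0],
            [0.0, math.cos(phi), -math.sin(phi)],
            [0.0, math.sin(phi), math.cos(phi)]]
g = matmul(rotation, boost)
```

Any matrix passing `is_lorentz` leaves `interval` unchanged, while a simple rescaling of one axis fails the check.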

Such general Lorentz transformations (1a) for various dimensions were used by Gauss (1818), Jacobi (1827, 1833), Lebesgue (1837), Bour (1856), Somov (1863), Hill (1882) in order to simplify computations of elliptic functions and integrals.[4][5] They were also used by Poincaré (1881), Cox (1881/82), Picard (1882, 1884), Killing (1885, 1893), Gérard (1892), Hausdorff (1899), Woods (1901, 1903), Liebmann (1904/05) to describe hyperbolic motions (i.e. rigid motions in the hyperbolic plane or hyperbolic space), which were expressed in terms of Weierstrass coordinates of the hyperboloid model satisfying the relation $-x_0^2+\cdots+x_n^2=-1$ or in terms of the Cayley–Klein metric of projective geometry using the "absolute" form $-x_0^2+\cdots+x_n^2=0$.[M 1][6][7] In addition, infinitesimal transformations related to the Lie algebra of the group of hyperbolic motions were given in terms of Weierstrass coordinates by Killing (1888–1897).

If $x_i,\ x_i'$ in (1a) are interpreted as homogeneous coordinates, then the corresponding inhomogeneous coordinates $u_s,\ u_s'$ follow by

$$\frac{x_s}{x_0}=u_s,\qquad \frac{x_s'}{x_0'}=u_s'\qquad (s=1,2\dots n)$$

so that the Lorentz transformation becomes a homography leaving invariant the equation of the unit sphere, which John Lighton Synge called "the most general formula for the composition of velocities" in terms of special relativity (the transformation matrix g stays the same as in (1a)):[8]

(1b)

Such Lorentz transformations for various dimensions were used by Gauss (1818), Jacobi (1827–1833), Lebesgue (1837), Bour (1856), Somov (1863), Hill (1882), Callandreau (1885) in order to simplify computations of elliptic functions and integrals, by Picard (1882–1884) in relation to Hermitian quadratic forms, or by Woods (1901, 1903) in terms of the Beltrami–Klein model of hyperbolic geometry. In addition, infinitesimal transformations in terms of the Lie algebra of the group of hyperbolic motions leaving invariant the unit sphere were given by Lie (1885–1893), Werner (1889) and Killing (1888–1897).

Particular forms of Lorentz transformations or relativistic velocity additions, mostly restricted to 2, 3 or 4 dimensions, have been formulated by many authors using the approaches described in the following subsections.

Lorentz transformation via imaginary orthogonal transformation

By using the imaginary quantities in x as well as (s=1,2...n) in g, the Lorentz transformation (1a) assumes the form of an orthogonal transformation of Euclidean space forming the orthogonal group O(n) if det g=±1 or the special orthogonal group SO(n) if det g=+1, the Lorentz interval becomes the Euclidean norm, and the Minkowski inner product becomes the dot product:[9]

(2a)

The cases n=1,2,3,4 of orthogonal transformations in terms of real coordinates were discussed by Euler (1771) and in n dimensions by Cauchy (1829). The case in which one of these coordinates is imaginary and the other ones remain real was alluded to by Lie (1871) in terms of spheres with imaginary radius, while the interpretation of the imaginary coordinate as being related to the dimension of time as well as the explicit formulation of Lorentz transformations with n=3 was given by Minkowski (1907) and Sommerfeld (1909).

A well-known example of this orthogonal transformation is spatial rotation in terms of trigonometric functions, which becomes a Lorentz transformation by using an imaginary angle $\phi=i\eta$, so that trigonometric functions become equivalent to hyperbolic functions:

(2b)

or in exponential form using Euler's formula $e^{i\phi}=\cos\phi+i\sin\phi$:

(2c)

Taking the coordinates and the angle as real, spatial rotation in the form (2b-1) was introduced by Euler (1771) and in the form (2c-1) by Wessel (1799). The interpretation of (2b) as a Lorentz boost (i.e. a Lorentz transformation without spatial rotation), in which the time coordinates and the angle are imaginary quantities, was given by Minkowski (1907) and Sommerfeld (1909). As shown in the next section using hyperbolic functions, (2b) becomes (3b) while (2c) becomes (3d).

Lorentz transformation via hyperbolic functions

The case of a Lorentz transformation without spatial rotation is called a Lorentz boost. The simplest case can be given, for instance, by setting n=1 in (1a):

(3a)

which resembles precisely the relations of hyperbolic functions in terms of hyperbolic angle $\eta$. Thus by adding an unchanged $x_2$-axis, a Lorentz boost or hyperbolic rotation for n=2 (being the same as a rotation around an imaginary angle in (2b) or a translation in the hyperbolic plane in terms of the hyperboloid model) is given by

(3b)

in which the rapidity $\eta$ can be composed of arbitrarily many rapidities $\eta_1,\eta_2,\dots$ as per the angle sum laws of hyperbolic sines and cosines, so that one hyperbolic rotation can represent the sum of many other hyperbolic rotations, analogous to the relation between angle sum laws of circular trigonometry and spatial rotations. Alternatively, the hyperbolic angle sum laws themselves can be interpreted as Lorentz boosts, as demonstrated by using the parameterization of the unit hyperbola:

(3c)
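The additivity of rapidities described above can be illustrated with a short sketch. It assumes the boost acts on coordinates $(x_0, x_1, x_2)$ as a hyperbolic rotation in the $(x_0, x_1)$ plane with $x_2$ unchanged. The sign of the off-diagonal terms is a convention chosen here for illustration, and the function names are hypothetical.

```python
import math

def boost(eta):
    """Hyperbolic rotation by rapidity eta in the (x0, x1) plane."""
    ch, sh = math.cosh(eta), math.sinh(eta)
    return [[ch, -sh, 0.0],
            [-sh, ch, 0.0],
            [0.0, 0.0, 1.0]]

def matmul(p, q):
    """Plain matrix product of two nested lists."""
    return [[sum(p[i][k] * q[k][j] for k in range(len(q)))
             for j in range(len(q[0]))] for i in range(len(p))]

def apply(g, x):
    """Compute x' = g x."""
    return [sum(g[i][j] * x[j] for j in range(len(x))) for i in range(len(g))]

def unit_hyperbola_point(eta):
    """Point (cosh eta, sinh eta, 0) on the unit hyperbola x0^2 - x1^2 = 1."""
    return [math.cosh(eta), math.sinh(eta), 0.0]

# Two successive boosts equal one boost with the summed rapidity.
composed = matmul(boost(0.3), boost(0.5))
single = boost(0.8)
```

Applying `boost(-0.5)` to `unit_hyperbola_point(0.3)` moves the point to `unit_hyperbola_point(0.8)`, which is the angle sum law read as a boost.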

Finally, Lorentz boost (3b) assumes a simple form by using squeeze mappings in analogy to Euler's formula in (2c):[10]

(3d)

Hyperbolic relations (a,b) on the right of (3b) were given by Riccati (1757), relations (a,b,c,d,e,f) by Lambert (1768–1770). Lorentz transformations (3b) were given by Laisant (1874), Cox (1882), Lindemann (1890/91), Gérard (1892), Killing (1893, 1897/98), Whitehead (1897/98), Woods (1903/05) and Liebmann (1904/05) in terms of Weierstrass coordinates of the hyperboloid model. Hyperbolic angle sum laws equivalent to Lorentz boost (3c) were given by Riccati (1757) and Lambert (1768–1770), while the matrix representation was given by Glaisher (1878) and Günther (1880/81). Lorentz transformations (3d-1) were given by Lindemann (1890/91) and Herglotz (1909), while formulas equivalent to (3d-2) by Klein (1871).

In line with equation (1b) one can use coordinates $[u_1,u_2]=\left[\tfrac{x_1}{x_0},\tfrac{x_2}{x_0}\right]$ inside the unit circle $u_1^2+u_2^2=1$, thus the corresponding Lorentz transformations (3b) obtain the form:

(3e)

These Lorentz transformations were given by Escherich (1874) and Killing (1898) (on the left), as well as Beltrami (1868) and Schur (1885/86, 1900/02) (on the right) in terms of Beltrami coordinates[11] of hyperbolic geometry. By using the scalar product of , the resulting Lorentz transformation can be seen as equivalent to the hyperbolic law of cosines:[12][R 1][13]

(3f)

The hyperbolic law of cosines (a) was given by Taurinus (1826) and Lobachevsky (1829/30) and others, while variant (b) was given by Schur (1900/02).

Lorentz transformation via velocity

In the theory of relativity, Lorentz transformations exhibit the symmetry of Minkowski spacetime by using a constant c as the speed of light, and a parameter v as the relative velocity between two inertial reference frames. In particular, the hyperbolic angle $\eta$ in (3b) can be interpreted as the velocity-related rapidity $\eta=\operatorname{artanh}\beta$ with $\beta=v/c$, so that $\gamma=\cosh\eta$ is the Lorentz factor, $\beta\gamma=\sinh\eta$ the proper velocity, $u'=c\tanh q$ the velocity of another object, and $u=c\tanh(q+\eta)$ the velocity-addition formula; thus (3b) becomes:

(4a)
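The relations between velocity, rapidity and Lorentz factor can be verified numerically. This is a minimal sketch, assuming a boost along the x-axis with the primed frame moving at speed v. The function names are hypothetical, and SI units are an arbitrary choice.

```python
import math

C = 299_792_458.0  # speed of light in m/s

def rapidity(v):
    """Rapidity eta = artanh(v / c)."""
    return math.atanh(v / C)

def lorentz_factor(v):
    """Lorentz factor gamma = 1 / sqrt(1 - v^2 / c^2)."""
    return 1.0 / math.sqrt(1.0 - (v / C) ** 2)

def boost_event(t, x, v):
    """Transform an event (t, x) into a frame moving with speed v along x."""
    gamma = lorentz_factor(v)
    return gamma * (t - v * x / C ** 2), gamma * (x - v * t)

def interval(t, x):
    """Lorentz interval -c^2 t^2 + x^2 in 1+1 dimensions."""
    return -(C * t) ** 2 + x ** 2
```

For v = 0.6c the sketch gives a Lorentz factor of 1.25, which equals the hyperbolic cosine of the rapidity, and `interval` is unchanged by `boost_event`.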

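Or in four dimensions, by setting $x_0=ct,\ x_1=x,\ x_2=y$ and adding an unchanged z, the familiar form follows, with $\gamma=1/\sqrt{1-v^2/c^2}$ as the Lorentz factor:

$$-c^2t^2+x^2+y^2+z^2=-c^2t'^2+x'^2+y'^2+z'^2,\qquad t'=\gamma\left(t-\frac{vx}{c^2}\right),\quad x'=\gamma(x-vt),\quad y'=y,\quad z'=z$$

(4b)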

Without relation to physics, similar transformations were used by Lipschitz (1885/86). In physics, analogous transformations were introduced by Voigt (1887) and by Lorentz (1892, 1895), who analyzed Maxwell's equations; they were completed by Larmor (1897, 1900) and Lorentz (1899, 1904), and brought into their modern form by Poincaré (1905), who gave the transformation the name of Lorentz.[14] Eventually, Einstein (1905) showed in his development of special relativity that the transformations follow from the principle of relativity and constant light speed alone by modifying the traditional concepts of space and time, without requiring a mechanical aether, in contradistinction to Lorentz and Poincaré.[15] Minkowski (1907–1908) used them to argue that space and time are inseparably connected as spacetime. Minkowski (1907–1908) and Varićak (1910) showed the relation to imaginary and hyperbolic functions. Important contributions to the mathematical understanding of the Lorentz transformation were also made by other authors such as Herglotz (1909/10), Ignatowski (1910), Noether (1910), Klein (1910) and Borel (1913–14).

Also Lorentz boosts for arbitrary directions in line with (1a) can be given as:[16]

or in vector notation

(4c)

Such transformations were formulated by Herglotz (1911) and Silberstein (1911) and others.

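In line with equation (1b), one can substitute $\left[\tfrac{u_x}{c},\tfrac{u_y}{c},\tfrac{u_x'}{c},\tfrac{u_y'}{c}\right]=\left[\tfrac{x}{ct},\tfrac{y}{ct},\tfrac{x'}{ct'},\tfrac{y'}{ct'}\right]$ in (3b) or (4a), producing the Lorentz transformation of velocities (or velocity addition formula) in analogy to Beltrami coordinates of (3e):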

(4d)

or using trigonometric and hyperbolic identities it becomes the hyperbolic law of cosines in terms of (3f):[12][R 1][13]

(4e)

and by further setting u=u′=c the relativistic aberration of light follows:[17]

(4f)
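The step from velocity addition to aberration can be sketched as follows. The code assumes a primed frame moving with speed v along the x-axis and uses units with c = 1 by default. The sign convention and the function names are assumptions made for illustration and need not match the exact layout of (4d) and (4f).

```python
import math

def transform_velocity(ux, uy, v, c=1.0):
    """Velocity components seen from a frame moving with speed v along x."""
    denom = 1.0 - ux * v / c ** 2
    inv_gamma = math.sqrt(1.0 - (v / c) ** 2)
    return (ux - v) / denom, uy * inv_gamma / denom

def aberration(alpha, v, c=1.0):
    """Direction angle in the moving frame of a light ray at angle alpha."""
    beta = v / c
    return math.acos((math.cos(alpha) - beta) / (1.0 - beta * math.cos(alpha)))

# A light ray (speed c = 1) at angle alpha keeps speed c in the moving
# frame; only its direction changes, which is the aberration of light.
alpha, v = 1.0, 0.5
ux_new, uy_new = transform_velocity(math.cos(alpha), math.sin(alpha), v)
```

For collinear motion the same function reduces to the subtraction of rapidities: a velocity `tanh(q + eta)` seen from a frame moving at `tanh(eta)` becomes `tanh(q)`.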

The velocity addition formulas were given by Einstein (1905) and Poincaré (1905/06), the aberration formula for cos(α) by Einstein (1905), while the relations to the spherical and hyperbolic law of cosines were given by Sommerfeld (1909) and Varićak (1910). These formulas resemble the equations of an ellipse of eccentricity v/c, eccentric anomaly α' and true anomaly α, first geometrically formulated by Kepler (1609) and explicitly written down by Euler (1735, 1748), Lagrange (1770) and many others in relation to planetary motions.[18][19]

Lorentz transformation via conformal, spherical wave, and Laguerre transformation

If one only requires the invariance of the light cone represented by the differential equation $-dx_0^2+\cdots+dx_p^2=0$, which is the same as asking for the most general transformation that changes spheres into spheres, the Lorentz group can be extended by adding dilations represented by the factor λ. The result is the group Con(1,p) of spacetime conformal transformations in terms of special conformal transformations and inversions producing the relation

$$-dx_0^2+\cdots+dx_p^2=\lambda\left(-dx_0'^2+\cdots+dx_p'^2\right).$$

One can switch between two representations of this group by using an imaginary sphere radius coordinate x0=iR with the interval $dx_0^2+\cdots+dx_p^2$ related to conformal transformations, or by using a real radius coordinate x0=R with the interval $-dx_0^2+\cdots+dx_p^2$ related to spherical wave transformations in terms of contact transformations preserving circles and spheres. Both representations were studied by Lie (1871) and others. It was shown by Bateman & Cunningham (1909–1910) that the group Con(1,3) is the most general one leaving invariant the equations of Maxwell's electrodynamics.

It turns out that Con(1,3) is isomorphic to the special orthogonal group SO(2,4), and contains the Lorentz group SO(1,3) as a subgroup by setting λ=1. More generally, Con(q,p) is isomorphic to SO(q+1,p+1) and contains SO(q,p) as subgroup.[20] This implies that Con(0,p) is isomorphic to the Lorentz group of arbitrary dimensions SO(1,p+1). Consequently, the conformal group in the plane Con(0,2) – known as the group of Möbius transformations – is isomorphic to the Lorentz group SO(1,3).[21][22] This can be seen using tetracyclical coordinates satisfying the form $-x_0^2+x_1^2+x_2^2+x_3^2=0$, which were discussed by Pockels (1891), Klein (1893), Bôcher (1894). The relation between Con(1,3) and the Lorentz group was noted by Bateman & Cunningham (1909–1910) and others.

A special case of Lie's geometry of oriented spheres is the Laguerre group, transforming oriented planes and lines into each other. It is generated by the Laguerre inversion introduced by Laguerre (1882) and discussed by Darboux (1887) and Smith (1900) leaving invariant $x^2+y^2+z^2-R^2$ with R as radius, thus the Laguerre group is isomorphic to the Lorentz group. A similar concept was studied by Scheffers (1899) in terms of contact transformations. Stephanos (1883) argued that Lie's geometry of oriented spheres in terms of contact transformations, as well as the special case of the transformations of oriented planes into each other (such as by Laguerre), provides a geometrical interpretation of Hamilton's biquaternions. The group isomorphism between the Laguerre group and Lorentz group was pointed out by Bateman (1910), Cartan (1912, 1915/55), Poincaré (1912/21) and others.[23][24]

Lorentz transformation via Cayley–Hermite transformation

The general transformation (Q1) of any quadratic form into itself can also be given using arbitrary parameters based on the Cayley transform $(\mathbf{I}-\mathbf{T})^{-1}\cdot(\mathbf{I}+\mathbf{T})$, where I is the identity matrix, T an arbitrary antisymmetric matrix, and by adding A as symmetric matrix defining the quadratic form (there is no primed A' because the coefficients are assumed to be the same on both sides):[25][26]

(Q2)

After Cayley (1846) introduced transformations related to sums of positive squares, Hermite (1853/54, 1854) derived transformations for arbitrary quadratic forms, whose result was reformulated in terms of matrices (Q2) by Cayley (1855a, 1855b). For instance, the choice A=diag(1,1,1) gives an orthogonal transformation which can be used to describe spatial rotations corresponding to the Euler-Rodrigues parameters [a,b,c,d] discovered by Euler (1771) and Rodrigues (1840), which can be interpreted as the coefficients of quaternions. Setting d=1, the equations have the form:

(Q3)

Also the Lorentz interval and the general Lorentz transformation in any dimension can be produced by the Cayley–Hermite formalism.[R 2][R 3][27][28] For instance, Lorentz transformation (1a) with n=1 follows from (Q2) with:

(5a)

This becomes Lorentz boost (4a or 4b) by setting , which is equivalent to the relation known from Loedel diagrams, thus (5a) can be interpreted as a Lorentz boost from the viewpoint of a "median frame" in which two other inertial frames are moving with equal speed in opposite directions.
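The role of the Cayley–Hermite parameter in 1+1 dimensions can be made concrete with a small sketch. It assumes one common way of writing the transform, g = (A+T)⁻¹·(A−T) with A = diag(-1,1) and T = [[0,a],[-a,0]]. Ordering and sign conventions differ between authors, so this need not coincide term by term with (Q2) or (5a). Under this assumption the parameter satisfies 2a/(1+a²) = v/c, i.e. a = tanh(η/2).

```python
import math

def inv2(m):
    """Inverse of a 2x2 matrix given as nested lists."""
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    return [[m[1][1] / det, -m[0][1] / det],
            [-m[1][0] / det, m[0][0] / det]]

def mul2(p, q):
    """Product of two 2x2 matrices."""
    return [[sum(p[i][k] * q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def cayley_hermite_boost(a):
    """g = (A + T)^-1 (A - T) for A = diag(-1, 1), T = [[0, a], [-a, 0]].

    Assumed convention; valid for |a| < 1.
    """
    plus = [[-1.0, a], [-a, 1.0]]     # A + T
    minus = [[-1.0, -a], [a, 1.0]]    # A - T
    return mul2(inv2(plus), minus)

def velocity_ratio(a):
    """v/c implied by the parameter a, namely 2a / (1 + a^2)."""
    return 2.0 * a / (1.0 + a * a)
```

With a = 0.25 the matrix entries equal cosh η and sinh η for η = 2·artanh(0.25), and their ratio is `velocity_ratio(0.25)`.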

Furthermore, Lorentz transformation (1a) with n=2 is given by:

(5b)

or using n=3:

(5c)

The transformation of a binary quadratic form of which Lorentz transformation (5a) is a special case was given by Hermite (1854), equations containing Lorentz transformations (5a, 5b, 5c) as special cases were given by Cayley (1855), Lorentz transformation (5a) was given (up to a sign change) by Laguerre (1882), Darboux (1887), Smith (1900) in relation to Laguerre geometry, and Lorentz transformation (5b) was given by Bachmann (1869). In relativity, equations similar to (5b, 5c) were first employed by Borel (1913) to represent Lorentz transformations.

As described in equation (3d), the Lorentz interval is closely connected to the alternative form ,[29] which in terms of the Cayley–Hermite parameters is invariant under the transformation:[M 2]

(5d)

This transformation was given by Cayley (1884), even though he didn't relate it to the Lorentz interval but rather to . As shown in the next section in equation (6d), many authors (some before Cayley) expressed the invariance of and its relation to the Lorentz interval by using the alternative Cayley–Klein parameters and Möbius transformations.

Lorentz transformation via Cayley–Klein parameters, Möbius and spin transformations

The previously mentioned Euler–Rodrigues parameters a,b,c,d (i.e. the Cayley–Hermite parameters in equation (Q3) with d=1) are closely related to the Cayley–Klein parameters α,β,γ,δ introduced by Helmholtz (1866/67), Cayley (1879) and Klein (1884) to connect Möbius transformations and rotations:[M 3]

thus (Q3) becomes:

(Q4)

The Lorentz transformation can also be expressed with variants of the Cayley–Klein parameters: one relates these parameters to a spin-matrix D, the spin transformations of a pair of complex variables and their complex conjugates, and the Möbius transformation of a single complex variable and its conjugate. When defined in terms of isometries of hyperbolic space (hyperbolic motions), the Hermitian matrix u associated with these Möbius transformations produces an invariant determinant identical to the Lorentz interval. Therefore, these transformations were described by John Lighton Synge as being a "factory for the mass production of Lorentz transformations".[30] It also turns out that the related spin group Spin(3, 1) or special linear group SL(2, C) acts as the double cover of the Lorentz group (one Lorentz transformation corresponds to two spin transformations of different sign), while the Möbius group Con(0,2) or projective special linear group PSL(2, C) is isomorphic to both the Lorentz group and the group of isometries of hyperbolic space.

In space, the Möbius/Spin/Lorentz transformations can be written as:[31][30][32][33]

(6a)
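The determinant invariance behind the spin transformation can be sketched numerically. The code assumes a common textbook layout of the Hermitian matrix, with entries $x_0\pm x_3$ on the diagonal and $x_1\mp ix_2$ off the diagonal, so that its determinant is $x_0^2-x_1^2-x_2^2-x_3^2$ (the negative of the interval in the sign convention of (1a)). This layout, the sample matrix `D` and all function names are assumptions and may differ from the arrangement in (6a).

```python
def hermitian(x0, x1, x2, x3):
    """Hermitian 2x2 matrix built from four real coordinates."""
    return [[complex(x0 + x3, 0.0), complex(x1, -x2)],
            [complex(x1, x2), complex(x0 - x3, 0.0)]]

def det2(m):
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

def mul2(p, q):
    return [[sum(p[i][k] * q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def dagger(m):
    """Conjugate transpose."""
    return [[m[0][0].conjugate(), m[1][0].conjugate()],
            [m[0][1].conjugate(), m[1][1].conjugate()]]

def spin_transform(d, x):
    """X' = D X D^dagger; preserves det X whenever det D = 1."""
    return mul2(mul2(d, x), dagger(d))

def coordinates(m):
    """Recover (x0, x1, x2, x3) from a Hermitian matrix."""
    return ((m[0][0] + m[1][1]).real / 2.0, m[1][0].real,
            m[1][0].imag, (m[0][0] - m[1][1]).real / 2.0)

# Arbitrary element of SL(2, C): (1+1j)*1 - 2*(0.5j) = 1.
D = [[1 + 1j, 2 + 0j], [0.5j, 1 + 0j]]
```

Replacing `D` by its negative yields the same transformed matrix, which illustrates the two-to-one correspondence between spin and Lorentz transformations mentioned above.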

thus:[34]

(6b)

or in line with equation (1b) one can substitute so that the Möbius/Lorentz transformations become related to the unit sphere:

(6c)

The general transformation u′ in (6a) was given by Cayley (1854), while the general relation between Möbius transformations and transformation u′ leaving invariant the generalized circle was pointed out by Poincaré (1883) in relation to Kleinian groups. The adaptation to the Lorentz interval by which (6a) becomes a Lorentz transformation was given by Klein (1889-1893, 1896/97), Bianchi (1893), Fricke (1893, 1897). Its reformulation as Lorentz transformation (6b) was provided by Bianchi (1893) and Fricke (1893, 1897). Lorentz transformation (6c) was given by Klein (1884) in relation to surfaces of second degree and the invariance of the unit sphere. In relativity, (6a) was first employed by Herglotz (1909/10).

In the plane, the transformations can be written as:[29][33]

(6d)

thus

(6e)

which includes the special case implying , reducing the transformation to a Lorentz boost in 1+1 dimensions:

(6f)

Finally, by using the Lorentz interval related to a hyperboloid, the Möbius/Lorentz transformations can be written

(6g)

The general transformation u′ and its invariant in (6d) were already used by Lagrange (1773) and Gauss (1798/1801) in the theory of integer binary quadratic forms. The invariant was also studied by Klein (1871) in connection to hyperbolic plane geometry (see equation (3d)), while the connection between u′ and the Möbius transformation was analyzed by Poincaré (1886) in relation to Fuchsian groups. The adaptation to the Lorentz interval by which (6d) becomes a Lorentz transformation was given by Bianchi (1888) and Fricke (1891). Lorentz transformation (6e) was stated by Gauss around 1800 (posthumously published 1863), as well as Selling (1873), Bianchi (1888), Fricke (1891), Woods (1895) in relation to integer indefinite ternary quadratic forms. Lorentz transformation (6f) was given by Bianchi (1886, 1894) and Eisenhart (1905). Lorentz transformation (6g) of the hyperboloid was stated by Poincaré (1881) and Hausdorff (1899).

Lorentz transformation via quaternions and hyperbolic numbers

The Lorentz transformations can also be expressed in terms of biquaternions: A Minkowskian quaternion (or minquat) q having one real part and one purely imaginary part is multiplied by biquaternion a applied as pre- and postfactor. Using an overline to denote quaternion conjugation and * for complex conjugation, its general form (on the left) and the corresponding boost (on the right) are as follows:[35][36]

(7a)

Hamilton (1844/45) and Cayley (1845) derived the quaternion transformation for spatial rotations, and Cayley (1854, 1855) gave the corresponding transformation leaving invariant the sum of four squares $x_0^2+x_1^2+x_2^2+x_3^2$. Cox (1882/83) discussed the Lorentz interval in terms of Weierstrass coordinates in the course of adapting William Kingdon Clifford's biquaternions a+ωb to hyperbolic geometry by setting $\omega^2=-1$ (alternatively, 1 gives elliptic and 0 parabolic geometry). Stephanos (1883) related the imaginary part of William Rowan Hamilton's biquaternions to the radius of spheres, and introduced a homography leaving invariant the equations of oriented spheres or oriented planes in terms of Lie sphere geometry. Buchheim (1884/85) discussed the Cayley absolute and adapted Clifford's biquaternions to hyperbolic geometry similar to Cox by using all three values of $\omega^2$. Eventually, the modern Lorentz transformation using biquaternions with $\omega=\sqrt{-1}$ as in hyperbolic geometry was given by Noether (1910) and Klein (1910) as well as Conway (1911) and Silberstein (1911).

Often connected with quaternionic systems is the hyperbolic number $\varepsilon$ with $\varepsilon^2=1$, which also allows the Lorentz transformations to be formulated:[37][38]

(7b)
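Multiplication by a hyperbolic versor acts as a boost, which a short sketch can demonstrate. It assumes hyperbolic (split-complex) numbers a + εb with ε² = +1 and identifies a with $x_0$ and b with $x_1$. The class and function names are hypothetical, and the sign of the rapidity is a convention.

```python
import math
from dataclasses import dataclass

@dataclass(frozen=True)
class Hyperbolic:
    """Number a + eps*b with eps*eps = +1."""
    a: float
    b: float

    def __mul__(self, other):
        # (a1 + eps b1)(a2 + eps b2) = (a1 a2 + b1 b2) + eps (a1 b2 + b1 a2)
        return Hyperbolic(self.a * other.a + self.b * other.b,
                          self.a * other.b + self.b * other.a)

    def modulus(self):
        """Quadratic form a^2 - b^2, preserved by versor multiplication."""
        return self.a ** 2 - self.b ** 2

def versor(eta):
    """exp(eps * eta) = cosh(eta) + eps * sinh(eta)."""
    return Hyperbolic(math.cosh(eta), math.sinh(eta))

z = Hyperbolic(2.0, 0.5)
boosted = z * versor(0.3)
```

The product of two versors is the versor of the summed rapidity, mirroring the role of $e^{i\phi}$ for ordinary rotations.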

After the trigonometric expression (Euler's formula) was given by Euler (1748), and the hyperbolic analogue as well as hyperbolic numbers by Cockle (1848) in the framework of tessarines, it was shown by Cox (1882/83) that one can identify with associative quaternion multiplication. Here, is the hyperbolic versor with , while -1 denotes the elliptic or 0 denotes the parabolic counterpart (not to be confused with the expression in Clifford's biquaternions also used by Cox, in which -1 is hyperbolic). The hyperbolic versor was also discussed by Macfarlane (1892, 1894, 1900) in terms of hyperbolic quaternions. The expression for hyperbolic motions (and -1 for elliptic, 0 for parabolic motions) also appear in "biquaternions" defined by Vahlen (1901/02, 1905).

More extended forms of complex and (bi-)quaternionic systems in terms of Clifford algebra can also be used to express the Lorentz transformations. For instance, using a system a of Clifford numbers one can transform the following general quadratic form into itself, in which the individual values of can be set to +1 or -1 at will:[39][40]

(7c)

The Lorentz interval follows if the sign of one differs from all others. The general definite form as well as the general indefinite form and their invariance under transformation (1) was discussed by Lipschitz (1885/86), while hyperbolic motions were discussed by Vahlen (1901/02, 1905) by setting in transformation (2), while elliptic motions follow with -1 and parabolic motions with 0, all of which he also related to biquaternions.

Lorentz transformation via trigonometric functions

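The following general relation connects the speed of light and the relative velocity to hyperbolic and trigonometric functions, where $\eta$ is the rapidity in (3b), $\theta$ is equivalent to the Gudermannian function $\operatorname{gd}(\eta)=2\arctan(e^{\eta})-\pi/2$, and $\vartheta$ is equivalent to the Lobachevskian angle of parallelism $\Pi(\eta)=2\arctan(e^{-\eta})$:

$$\frac{v}{c}=\tanh\eta=\sin\theta=\cos\vartheta$$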

This relation was first defined by Varićak (1910).

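a) Using $\sin\theta=\tfrac{v}{c}$ one obtains the relations $\sec\theta=\gamma$ and $\tan\theta=\beta\gamma$ (with $\beta=\tfrac{v}{c}$ and $\gamma$ the Lorentz factor), and the Lorentz boost takes the form:[41]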

(8a)

This Lorentz transformation was derived by Bianchi (1886) and Darboux (1891/94) while transforming pseudospherical surfaces, and by Scheffers (1899) as a special case of contact transformation in the plane (Laguerre geometry). In special relativity, it was used by Gruner (1921) while developing Loedel diagrams, and by Vladimir Karapetoff in the 1920s.

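b) Using $\cos\vartheta=\tfrac{v}{c}$ one obtains the relations $\csc\vartheta=\gamma$ and $\cot\vartheta=\beta\gamma$ (with $\beta=\tfrac{v}{c}$ and $\gamma$ the Lorentz factor), and the Lorentz boost takes the form:[41]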

(8b)

This Lorentz transformation was derived by Eisenhart (1905) while transforming pseudospherical surfaces. In special relativity it was first used by Gruner (1921) while developing Loedel diagrams.
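The trigonometric parameterizations in a) and b) can be checked against the hyperbolic ones. The sketch below assumes the angle θ is the Gudermannian of the rapidity and ϑ is its complement, the angle of parallelism. The function names and the sign convention of the boost are illustrative assumptions.

```python
import math

def gudermannian(eta):
    """theta = gd(eta), which satisfies sin(theta) = tanh(eta)."""
    return math.atan(math.sinh(eta))

def angle_of_parallelism(eta):
    """Pi(eta) = 2 arctan(exp(-eta)), which satisfies cos(Pi) = tanh(eta)."""
    return 2.0 * math.atan(math.exp(-eta))

def boost_trig(ct, x, theta):
    """Boost written with sec and tan of theta, where sin(theta) = v/c."""
    sec, tan = 1.0 / math.cos(theta), math.tan(theta)
    return ct * sec - x * tan, x * sec - ct * tan

def boost_hyperbolic(ct, x, eta):
    """The same boost written with cosh and sinh of the rapidity."""
    return (ct * math.cosh(eta) - x * math.sinh(eta),
            x * math.cosh(eta) - ct * math.sinh(eta))
```

Both boost functions agree when `theta = gudermannian(eta)`, since sec θ = cosh η and tan θ = sinh η in that case.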

Lorentz transformation via squeeze mappings

As already indicated in equations (3d) in exponential form or (6f) in terms of Cayley–Klein parameters, Lorentz boosts in terms of hyperbolic rotations can be expressed as squeeze mappings. Using asymptotic coordinates of a hyperbola (u,v), they have the general form (some authors alternatively add a factor of 2 or $\sqrt{2}$):[42][43]

(9a)

That this equation system indeed represents a Lorentz boost can be seen by plugging (1) into (2) and solving for the individual variables:

(9b)
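The equivalence between a squeeze mapping and a boost can be illustrated as follows. The sketch assumes asymptotic coordinates ct − x and ct + x (named `u` and `w` in the code to avoid confusion with the velocity v), without the optional factor mentioned above, and takes k as the Doppler factor. The function names are hypothetical.

```python
import math

def doppler_factor(beta):
    """k = sqrt((1 + beta) / (1 - beta)), equal to exp(eta) for beta = tanh(eta)."""
    return math.sqrt((1.0 + beta) / (1.0 - beta))

def squeeze_boost(ct, x, k):
    """Stretch one asymptotic coordinate by k and shrink the other by 1/k."""
    u, w = ct - x, ct + x            # asymptotic (light-cone) coordinates
    u_new, w_new = k * u, w / k      # squeeze mapping: the product u*w is preserved
    return (u_new + w_new) / 2.0, (w_new - u_new) / 2.0

def standard_boost(ct, x, beta):
    """Boost written with the Lorentz factor, for comparison."""
    gamma = 1.0 / math.sqrt(1.0 - beta ** 2)
    return gamma * (ct - beta * x), gamma * (x - beta * ct)
```

For β = 0.6 the factor k equals 2, and the event (ct, x) = (3, 1) is mapped to (3, −1) by both functions.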

Lorentz transformation (9a) of asymptotic coordinates has been used by Laisant (1874) and Günther (1880/81) in relation to elliptic trigonometry, by Lie (1879–81), Bianchi (1886, 1894), Darboux (1891/94) and Eisenhart (1905) as the Lie transform[42][43] of pseudospherical surfaces in terms of the Sine-Gordon equation, and by Lipschitz (1885/86) in transformation theory. From that, different forms of Lorentz transformation were derived: (9b) by Lipschitz (1885/86), Bianchi (1886, 1894), Eisenhart (1905), trigonometric Lorentz boost (8a) by Bianchi (1886, 1894) and Darboux (1891/94), and trigonometric Lorentz boost (8b) by Eisenhart (1905). Lorentz boost (9b) was rediscovered in the framework of special relativity by Hermann Bondi (1964)[44] in terms of Bondi k-calculus, in which k can be physically interpreted as the Doppler factor. Since (9b) is equivalent to (6f) in terms of Cayley–Klein parameters under a suitable identification of the parameters, it can be interpreted as the 1+1 dimensional special case of Lorentz transformation (6e) stated by Gauss around 1800 (posthumously published 1863), Selling (1873), Bianchi (1888), Fricke (1891) and Woods (1895).

Variables u, v in (9a) can be rearranged to produce another form of squeeze mapping, resulting in Lorentz transformation (5a) in terms of Cayley–Hermite parameters:

(9c)

These Lorentz transformations were given (up to a sign change) by Laguerre (1882), Darboux (1887), Smith (1900) in relation to Laguerre geometry.

On the basis of factors k or a, all previous Lorentz boosts (3b, 4a, 8a, 8b) can be expressed as squeeze mappings as well:

(9d)

Squeeze mappings in terms of were used by Darboux (1891/94) and Bianchi (1894), in terms of by Lindemann (1891) and Herglotz (1909), in terms of by Eisenhart (1905), in terms of by Bondi (1964).

Lorentz transformations in pure mathematics before 1905

Historical formulas for Lorentz boosts and velocity additions

| Author (year) | Formulas | Author (year) | Formulas |
|---|---|---|---|
| Euler (1735, 1748) | 4f | Picard (1882, 1884) | 1a, 1b |
| Riccati (1757) | 3c | Hill (1882) | 1a, 1b |
| Lambert (1770) | 3c, 3e | Klein (1884–1897) | 6a, 6c |
| Gauss (1800, 1818) | 1a, 1b, 6e | Killing (1885–1897) | 1a, 3b, 3e |
| Taurinus (1826) | 3f | Callandreau (1885) | 1b |
| Jacobi (1827, 1833) | 1a, 1b | Lipschitz (1885/86) | 4a, 7c, 9a, 9b |
| Lebesgue (1837) | 1a, 1b | Schur (1885–1900) | 3e |
| Cayley (1855) | 5a, 5b, 5c | Bianchi (1886–1894) | 8a, 9a, 9b |
| Bour (1856) | 1a, 1b | Darboux (1887) | 5a, 8a, 9a, 9b |
| Somov (1863) | 1a, 1b | Bianchi (1888) | 6e, 8a, 9a, 9c |
| Beltrami (1868) | 3e | Lindemann (1890/91) | 3b, 3d, 9d |
| Bachmann (1869) | 5b | Fricke (1891) | 6a, 6b, 6d, 6e |
| Selling (1873) | 6e | Gérard (1892) | 1a, 3b |
| Escherich (1874) | 3e | Woods (1895–1905) | 3b, 6e |
| Laisant (1874) | 3b, 9a | Whitehead (1897/98) | 3b |
| Lie (1879, 1881) | 9a, 9b | Scheffers (1899) | 8a |
| Poincaré (1881) | 1a, 6g | Hausdorff (1899) | 1a, 6g |
| Günther (1880/81) | 3b, 9a | Smith (1900) | 5a |
| Cox (1881/82) | 3b, 7b | Liebmann (1904/05) | 1a, 3b |
| Laguerre (1882) | 5a | Eisenhart (1905) | 8b, 9a, 9b |

Euler (1735-1771)

True and eccentric anomaly

Johannes Kepler (1609) geometrically formulated Kepler's equation and the relations between the mean, true, and eccentric anomaly.[M 4][45] The relation between the true anomaly z and the eccentric anomaly P was algebraically expressed by Leonhard Euler (1735/40) as follows:[M 5]

and in 1748:[M 6]

while Joseph-Louis Lagrange (1770/71) expressed them as follows[M 7]

By identifying the eccentricity with v/c, these relations resemble the relativistic aberration formulas (4f) so that the true/eccentric anomalies become angles measured in different inertial frames,[18] and the relativistic velocity addition (4d) follows by setting in Euler's formulas or in Lagrange's formulas.[19]

Orthogonal transformation

Euler (1771) demonstrated the invariance of quadratic forms in terms of sum of squares under a linear substitution and its coefficients, now known as orthogonal transformation, as well as under rotations using Euler angles. The case of two dimensions is given by[M 8]

or three dimensions[M 9]

These coefficients A,B,C,D,E,F,G,H,I were related by Euler to four arbitrary parameters p,q,r,s, which were rediscovered by Olinde Rodrigues (1840), who related them to rotation angles;[M 10] they are now called Euler–Rodrigues parameters in line with equation (Q3):[M 11]
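How four parameters generate an orthogonal transformation can be sketched in modern terms. The code uses the standard quaternion-style rotation matrix with `a` as the scalar parameter and `(b, c, d)` as the vector part. This labelling is an assumption and may be ordered differently from Euler's p,q,r,s or from (Q3).

```python
def rotation_from_parameters(a, b, c, d):
    """3x3 rotation matrix from four parameters (not all zero)."""
    n = a * a + b * b + c * c + d * d   # normalization, so any scale works
    return [
        [(a*a + b*b - c*c - d*d) / n, 2*(b*c - a*d) / n, 2*(b*d + a*c) / n],
        [2*(b*c + a*d) / n, (a*a - b*b + c*c - d*d) / n, 2*(c*d - a*b) / n],
        [2*(b*d - a*c) / n, 2*(c*d + a*b) / n, (a*a - b*b - c*c + d*d) / n],
    ]

def det3(m):
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def is_rotation(m, tol=1e-12):
    """Check that the columns are orthonormal and the determinant is +1."""
    for i in range(3):
        for j in range(3):
            dot = sum(m[k][i] * m[k][j] for k in range(3))
            if abs(dot - (1.0 if i == j else 0.0)) > tol:
                return False
    return abs(det3(m) - 1.0) <= tol
```

For example, the parameters (1, 0, 0, 1) give a quarter turn about the third axis, and any nonzero parameter set passes `is_rotation`.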

The orthogonal transformation in four dimensions was given by him as[M 12]

As shown by Minkowski (1907), the orthogonal transformation can be directly used as Lorentz transformation (2a) or (2b) by making one variable as well as six of the sixteen coefficients imaginary.

Euler's formula and rotation

The above orthogonal transformations representing Euclidean rotations can also be expressed by using Euler's formula. After this formula was derived by Euler in 1748,[M 13] in modern notation

$$e^{i\phi}=\cos\phi+i\sin\phi,$$

it was used by Caspar Wessel (1799) to describe Euclidean rotations in the complex plane:[M 14]

Replacing the real quantities by imaginary ones by setting , Wessel's transformation becomes Lorentz transformation (2c) or (3d).

Riccati (1757) – hyperbolic functions

Vincenzo Riccati introduced hyperbolic functions in 1757,[M 15][M 16] in particular he formulated the angle sum laws for hyperbolic sine and cosine:

He furthermore showed that and follow by setting and in the above formulas.

The angle sum laws for hyperbolic sine and cosine can be interpreted as hyperbolic rotations of points on a hyperbola, as in Lorentz boost (3c). (In modern publications, Riccati's additional factor r is set to unity.)

Lambert (1768–1770) – hyperbolic functions

While Riccati (1757) discussed the hyperbolic sine and cosine, Johann Heinrich Lambert (read 1767, published 1768) introduced the expression tang φ, abbreviated tφ, as the tangens hyperbolicus $\tfrac{e^{u}-e^{-u}}{e^{u}+e^{-u}}$ of a variable u, or in modern notation tφ=tanh(u):[M 17][46]

In 1770 he rewrote the addition law for the hyperbolic tangent (f) or (g) as:[M 18]

The hyperbolic relations (a,b,c,d,e,f) are equivalent to the hyperbolic relations on the right of (3b). Relations (f,g) can also be found in (3e). By setting tφ=v/c, formula (c) becomes the relative velocity between two frames, (d) the Lorentz factor, (e) the proper velocity, (f) or (g) becomes the Lorentz transformation of velocity (or relativistic velocity addition formula) for collinear velocities in (4a) and (4d).

Lambert also formulated the addition laws for the hyperbolic cosine and sine (Lambert's "cos" and "sin" actually mean "cosh" and "sinh"):

The angle sum laws for hyperbolic sine and cosine can be interpreted as hyperbolic rotations of points on a hyperbola, as in Lorentz boost (3c).

Gauss (1798–1818)

Binary quadratic forms

After the invariance of the sum of squares under linear substitutions was discussed by Euler (1771), the general expressions of a binary quadratic form and its transformation were formulated by Lagrange (1773/75) as follows[M 19]

which is equivalent to (Q1) (n=1). The theory of binary quadratic forms was considerably expanded by Carl Friedrich Gauss (1798, published 1801) in his Disquisitiones Arithmeticae. He rewrote Lagrange's formalism as follows using integer coefficients α,β,γ,δ:[M 20]

which is equivalent to (Q1) (n=1). As pointed out by Gauss, F and F′ are called "proper equivalent" if αδ-βγ=1, so that F is contained in F′ as well as F′ is contained in F. In addition, if another form F″ is contained by the same procedure in F′ it is also contained in F and so forth.[M 21]

The Lorentz interval and the Lorentz transformation (1a) (n=1) are a special case of the binary quadratic form of Lagrange and Gauss by setting (a,b,c)=(a',b',c')=(1,0,-1).

Alternatively, the transformation of coefficients (a,b,c) is identical to transformation u′ in (6d) and becomes the complete Lorentz transformation by setting

.
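Both statements can be checked with a small sketch. It assumes Gauss' convention F = ax² + 2bxy + cy² and the substitution x = αx′ + βy′, y = γx′ + δy′. The function names and sample numbers are illustrative and not taken from the source.

```python
import math

def transform_form(form, sub):
    """Coefficients (a', b', c') of a x^2 + 2 b x y + c y^2 after substituting
    x = al*x' + be*y', y = ga*x' + de*y'."""
    a, b, c = form
    al, be, ga, de = sub
    return (
        a * al * al + 2 * b * al * ga + c * ga * ga,
        a * al * be + b * (al * de + be * ga) + c * ga * de,
        a * be * be + 2 * b * be * de + c * de * de,
    )

def determinant(form):
    a, b, c = form
    return b * b - a * c

# Integer example with alpha*delta - beta*gamma = 1 ("proper" equivalence).
form = (3, 1, -2)
new_form = transform_form(form, (2, 1, 3, 2))

# The form x^2 - y^2 goes into itself under hyperbolic coefficients.
eta = 0.9  # hypothetical rapidity
ch, sh = math.cosh(eta), math.sinh(eta)
interval_form = transform_form((1.0, 0.0, -1.0), (ch, sh, sh, ch))
print(new_form, determinant(form), determinant(new_form))
```

The quantity b² − ac is unchanged whenever αδ − βγ = 1. As a function of the coefficients (a,b,c) it is itself an indefinite ternary quadratic form, which is why the induced transformation of the coefficients behaves like a Lorentz transformation in 2+1 dimensions.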

Ternary quadratic forms

Gauss (1798/1801)[M 22] also discussed ternary quadratic forms with the general expression

which is equivalent to (Q1) (n=2). Gauss called these forms definite when they have the same sign such as x2+y2+z2, or indefinite in the case of different signs such as x2+y2-z2. While discussing the classification of ternary quadratic forms, Gauss (1801) presented twenty special cases, among them these six variants:[M 23]

These are all six types of Lorentz interval in 2+1 dimensions that can be produced as special cases of a ternary quadratic form. In general, the Lorentz interval and the Lorentz transformation (1a) (n=2) correspond to an indefinite ternary quadratic form, which follows from the general ternary form by setting:

Cayley–Klein parameter

The determination of all transformations of the Lorentz interval (as a special case of an integer ternary quadratic form) into itself was explicitly worked out by Gauss around 1800 (posthumously published in 1863), for which he provided a coefficient system α,β,γ,δ:[M 24]

Gauss' result was cited by Bachmann (1869), Selling (1873), Bianchi (1888), and Leonard Eugene Dickson (1923).[47] The parameters α,β,γ,δ, when applied to spatial rotations, were later called Cayley–Klein parameters.

This is equivalent to Lorentz transformation (6e), containing Lorentz boost (6f) or (9b) as a special case with and .

Homogeneous coordinates

Gauss (1818) discussed planetary motions together with formulating elliptic functions. In order to simplify the integration, he transformed the expression

into

in which the eccentric anomaly E is connected to the new variable T by the following transformation including an arbitrary constant k, which Gauss then rewrote by setting k=1:[M 25]

The coefficients α,β,γ,... of Gauss' case k=1 are equivalent to the coefficient system in Lorentz transformations (1a) and (1b) (n=2).

Further setting , Gauss' transformation becomes Lorentz transformation (1b) (n=2).

Subsequently, he showed that these relations can be reformulated using three variables x,y,z and u,u′,u″, so that

can be transformed into

,

in which x,y,z and u,u′,u″ are related by the transformation:[M 26]

This is equivalent to Lorentz transformation (1a) (n=2) satisfying , and can be related to Gauss' previous equations in terms of homogeneous coordinates .

Taurinus (1826) – Hyperbolic law of cosines

After the addition theorem for the tangens hyperbolicus was given by Lambert (1768), hyperbolic geometry was used by Franz Taurinus (1826), and later by Nikolai Lobachevsky (1829/30) and others, to formulate the hyperbolic law of cosines:[48][49][50]

When solved for it corresponds to the Lorentz transformation in Beltrami coordinates (3f), and by defining the rapidities it corresponds to the relativistic velocity addition formula (4e)

.
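The link between the hyperbolic law of cosines and relativistic velocity composition can be sketched numerically. The code assumes the standard form cosh a = cosh b cosh c − sinh b sinh c cos α with rapidities b, c and enclosed angle α, in units with c = 1. The variable names and sample values are hypothetical.

```python
import math

def boost_x(event, v):
    """Lorentz boost of (t, x, y) along x with speed v (c = 1)."""
    t, x, y = event
    g = 1.0 / math.sqrt(1.0 - v * v)
    return (g * (t - v * x), g * (x - v * t), y)

rap_b, rap_c, alpha = 0.8, 1.1, 0.6   # hypothetical rapidities and angle
u, v = math.tanh(rap_b), math.tanh(rap_c)

# Four-velocity of a body moving with speed v at angle alpha to the x-axis.
gv = math.cosh(rap_c)
four_velocity = (gv, gv * v * math.cos(alpha), gv * v * math.sin(alpha))

# Seen from a frame moving with speed u along x, its time component is the
# Lorentz factor of the relative velocity.
gamma_rel = boost_x(four_velocity, u)[0]
law_of_cosines = (math.cosh(rap_b) * math.cosh(rap_c)
                  - math.sinh(rap_b) * math.sinh(rap_c) * math.cos(alpha))
print(gamma_rel, law_of_cosines)
```

The third side a of the hyperbolic triangle is then the rapidity of the relative velocity, since cosh a equals the Lorentz factor computed by the boost.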

Jacobi (1827, 1833/34) – Homogeneous coordinates

Following Gauss (1818), Carl Gustav Jacob Jacobi extended Gauss' transformation to 3 dimensions in 1827:[M 27]

By setting and k=1 in the (1827) formulas, transformation system (1) is equivalent to Lorentz transformation (1b) (n=3), and by setting k=1 in transformation system (2) it becomes equivalent to Lorentz transformation (1a) (n=3) producing .

Alternatively, in two papers from 1832 Jacobi started with an ordinary orthogonal transformation, and by using an imaginary substitution he arrived at Gauss' transformation (up to a sign change) in the case of 2 dimensions:[M 28]

By setting , transformation system (2) is equivalent to Lorentz transformation (1b) (n=2). Also transformation system (3) is equivalent to Lorentz transformation (1b) (n=2) up to a sign change.

Extending his previous result, Jacobi (1833) started with Cauchy's (1829) orthogonal transformation for n dimensions, and by using an imaginary substitution he formulated Gauss' transformation (up to a sign change) in the case of n dimensions:[M 29]

Transformation system (2) is equivalent to Lorentz transformation (1b) up to a sign change.

He also stated the following transformation leaving invariant the Lorentz interval:[M 30]

This is equivalent to Lorentz transformation (1a) up to a sign change.

Cauchy (1829) – Orthogonal transformation

Augustin-Louis Cauchy (1829) extended the orthogonal transformation of Euler (1771) to arbitrary dimensions[M 31]

The orthogonal transformation can be directly used as Lorentz transformation (2a) by making one of the variables as well as certain coefficients imaginary.

Lebesgue (1837) – Homogeneous coordinates

Victor-Amédée Lebesgue (1837) summarized the previous work of Gauss (1818), Jacobi (1827, 1833), and Cauchy (1829). He started with the orthogonal transformation[M 32]

In order to achieve the invariance of the Lorentz interval[M 33]

he gave the following instructions as to how the previous equations shall be modified: In equation (9) change the sign of the last term of each member. In the first n-1 equations of (10) change the sign of the last term of the left-hand side, and in the one which satisfies α=n change the sign of the last term of the left-hand side as well as the sign of the right-hand side. In all equations (11) the last term will change sign. In equations (12) the last terms of the right-hand side will change sign, and so will the left-hand side of the n-th equation. In equations (13) the signs of the last terms of the left-hand side will change, moreover in the n-th equation change the sign of the right-hand side. In equations (14) the last terms will change sign.

These instructions give Lorentz transformation (1a) in the form:

He went on to redefine the variables of the Lorentz interval and its transformation:[M 34]

Setting it is equivalent to Lorentz transformation (1b).

Hamilton (1844/45) – Quaternions

William Rowan Hamilton, in an abstract of a lecture held in November 1844 and published 1845/47, showed that spatial rotations can be formulated using his theory of quaternions by employing versors as pre- and postfactor, with α as unit vector and a as real angle:[M 35]

(1)

In a footnote added before printing, he showed that this is equivalent to Cayley's (1845) rotation formula by setting[M 36]

(2) .

Hamilton acknowledged Cayley's independent discovery and priority for first printed (February 1845) publication, but noted that he himself communicated formula (1) already in October 1844 to Charles Graves.

Formulas (1) or (2) serve as prototypes for Lorentz boost (7a), obtained by replacing versors and quaternions with hyperbolic versors and biquaternions.
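A rotation by pre- and postfactor can be sketched in a few lines. The code uses the modern half-angle convention q v q⁻¹ with q = cos(θ/2) + α sin(θ/2), which is an assumption and may differ from Hamilton's own normalization of the angle. The function names are illustrative.

```python
import math

def qmul(p, q):
    """Hamilton product of quaternions given as (w, x, y, z)."""
    pw, px, py, pz = p
    qw, qx, qy, qz = q
    return (
        pw * qw - px * qx - py * qy - pz * qz,
        pw * qx + px * qw + py * qz - pz * qy,
        pw * qy - px * qz + py * qw + pz * qx,
        pw * qz + px * qy - py * qx + pz * qw,
    )

def rotate(vec, axis, angle):
    """Rotate vec about the unit vector axis by angle via q v q^-1."""
    h = angle / 2.0
    q = (math.cos(h),) + tuple(math.sin(h) * a for a in axis)
    q_inv = (q[0], -q[1], -q[2], -q[3])  # conjugate equals inverse for a versor
    return qmul(qmul(q, (0.0,) + tuple(vec)), q_inv)[1:]

# Quarter turn of the x unit vector about the z-axis.
rotated = rotate((1.0, 0.0, 0.0), (0.0, 0.0, 1.0), math.pi / 2)
print(rotated)
```

Replacing the circular functions in the versor by hyperbolic ones, with suitably complexified units, is the step that leads from this rotation formula to the boost representation mentioned above.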

Cayley (1846–1884)

Euler–Rodrigues parameter and Cayley–Hermite transformation

The Euler–Rodrigues parameters discovered by Euler (1771) and Rodrigues (1840) leaving invariant were extended to by Arthur Cayley (1846) as a byproduct of what is now called the Cayley transform using the method of skew-symmetric coefficients.[M 37] Following Cayley's methods, a general transformation for quadratic forms into themselves in three (1853) and arbitrary (1854) dimensions was provided by Hermite (1853, 1854). Hermite's formula was simplified and brought into matrix form equivalent to (Q2) by Cayley (1855a)[M 38]

which he abbreviated in 1858, where is any skew-symmetric matrix:[M 39][51]

The Cayley–Hermite transformation becomes equivalent to the Lorentz transformation (5a) by setting Ω=diag(-1,1) and (5b) by setting Ω=diag(-1,1,1) and (5c) by setting Ω=diag(-1,1,1,1).
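The 1+1-dimensional case can be verified with exact rational arithmetic. The sketch below assumes one common way of writing the Cayley transform, g = (Ω + T)⁻¹(Ω − T) with T skew-symmetric. Cayley's own arrangement of the factors may differ, and the helper names are hypothetical.

```python
from fractions import Fraction

def matmul(p, q):
    return [[sum(p[i][k] * q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def inverse(m):
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def transpose(m):
    return [[m[j][i] for j in range(2)] for i in range(2)]

def cayley_hermite(omega, skew):
    """Assumed form g = (omega + skew)^-1 (omega - skew)."""
    plus = [[omega[i][j] + skew[i][j] for j in range(2)] for i in range(2)]
    minus = [[omega[i][j] - skew[i][j] for j in range(2)] for i in range(2)]
    return matmul(inverse(plus), minus)

omega = [[Fraction(-1), Fraction(0)], [Fraction(0), Fraction(1)]]  # diag(-1, 1)
a = Fraction(1, 2)                                                 # free parameter
skew = [[Fraction(0), a], [-a, Fraction(0)]]

g = cayley_hermite(omega, skew)
preserved = matmul(matmul(transpose(g), omega), g)  # should equal omega
print(g, preserved)
```

With a = 1/2 the result has entries 5/3 and 4/3, that is, a Lorentz boost with γ = 5/3 and βγ = 4/3, and gᵀΩg reproduces Ω exactly.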

Using the parameters of (1855a), Cayley in a subsequent paper (1855b) particularly discussed several special cases, such as:[M 40]

This becomes equivalent to the Lorentz transformation (5a) in 1+1 dimensions by setting (a,b)=(-1,1) and Lorentz boost (4a) by additionally setting .

or:[M 41]

This becomes equivalent to the Lorentz transformation (5b) by setting (a,b,c)=(-1,1,1).

or:[M 42]

This becomes equivalent to the Lorentz transformation (5c) by setting (a,b,c,d)=(-1,1,1,1).

Cayley–Klein parameter

Already in 1854, Cayley published an alternative method of transforming quadratic forms by using certain parameters α,β,γ,δ in relation to an improper homographic transformation of a surface of second order into itself:[M 43]

By setting and rewriting M and M' in terms of four different parameters he demonstrated the invariance of , and subsequently showed the relation to 4D quaternion transformations. Fricke & Klein (1897) credited Cayley by calling the above transformation the most general (real or complex) space collineation of first kind of an absolute surface of second kind into itself.[M 44] Parameters α,β,γ,δ are similar to what was later called Cayley–Klein parameters in relation to spatial rotations (which was done by Cayley in 1879[M 45] and before by Hermann von Helmholtz (1866/67)[M 46]).

Cayley's improper transformation becomes proper with some sign changes, and becomes equivalent to Lorentz transformation in (6a) by setting M=M'=1 and:

Subsequently solved for it becomes Lorentz transformation (6b).

Quaternions

In 1845, Cayley showed that the Euler-Rodrigues parameters in equation (Q3) representing rotations can be related to quaternion multiplication by pre- and postfactors (an equivalent rotation formula was also used by Hamilton (1844/45)):[M 47]

and in 1848 he used the abbreviated form[M 48]

In 1854 he showed how to transform the sum of four squares into itself:[M 49]

or in 1855:[M 50]

Cayley's quaternion transformation of the sum of four squares, abbreviated QqP, served as a model for the representation of the Lorentz transformation by Noether (1910) and Klein (1910), in which the scalar part is imaginary.

Cayley absolute and hyperbolic geometry

In 1859, Cayley found that a quadratic form or projective quadric can be used as an "absolute", serving as the basis of a projective metric (the Cayley–Klein metric).[M 51] For instance, using the absolute x2+y2+z2=0, he defined the distance of two points as follows

and he also alluded to the case of the unit sphere x2+y2+z2=1. In the hands of Klein (1871), all of this became essential for the discussion of non-Euclidean geometry (in particular the Cayley–Klein or Beltrami–Klein model of hyperbolic geometry) and associated quadratic forms and transformations, including the Lorentz interval and Lorentz transformation.

Cayley (1884) himself also discussed some properties of the Beltrami–Klein model and the pseudosphere, and formulated coordinate transformations using the Cayley-Hermite formalism:[M 2]

The form PQ-Z2 and its transformation is equivalent to and its transformation in (5d), and becomes related to the Lorentz interval by setting P=x0+x2, Q=x0-x2, Z=x1.

Cockle (1848) – Tessarines

James Cockle (1848) introduced the tessarine algebra as follows:[M 52]

.

While is the ordinary imaginary unit, the new unit led him to formulate the following relation:[M 53]

.

The real tessarine is a split-complex or hyperbolic number, a hyperbolic versor, and the multiplication leads to Lorentz boost (7b).
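How multiplication of split-complex numbers produces a boost can be sketched as follows. The class and function names are hypothetical, the unit is written j with j² = +1, and units with c = 1 are assumed.

```python
import math
from dataclasses import dataclass

@dataclass(frozen=True)
class SplitComplex:
    """Number t + j*x with j*j = +1."""
    t: float
    x: float

    def __mul__(self, other):
        return SplitComplex(self.t * other.t + self.x * other.x,
                            self.t * other.x + self.x * other.t)

    def modulus_squared(self):
        return self.t ** 2 - self.x ** 2

def hyperbolic_versor(eta):
    """cosh(eta) + j*sinh(eta), a split-complex number of unit modulus."""
    return SplitComplex(math.cosh(eta), math.sinh(eta))

event = SplitComplex(3.0, 1.0)               # hypothetical event (t, x)
boosted = hyperbolic_versor(0.5) * event     # hyperbolic rotation of the event
composed = hyperbolic_versor(0.5) * hyperbolic_versor(0.25)
print(boosted, boosted.modulus_squared())
```

Multiplication by a hyperbolic versor leaves t² − x² unchanged, and the product of two versors is the versor of the summed parameter, mirroring the additivity of rapidities.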

Hermite (1853, 1854) – Cayley–Hermite transformation

Charles Hermite (1853) extended the number theoretical work of Gauss (1801) and others (including himself) by additionally analyzing indefinite ternary quadratic forms that can be transformed into the Lorentz interval ±(x2+y2-z2), and by using Cayley's (1846) method of skew–symmetric coefficients he derived transformations leaving invariant almost all types of ternary quadratic forms.[M 54] This was generalized by him in 1854 to n dimensions:[M 55][52]

This result was subsequently expressed in matrix form by Cayley (1855), while Ferdinand Georg Frobenius (1877) added some modifications in order to include some special cases of quadratic forms that cannot be dealt with by the Cayley–Hermite transformation.[M 56][53]

This is equivalent to equation (Q2), and becomes the Lorentz transformation by setting the coefficients of the quadratic form f to diag(-1,1,...1).

For instance, the special case of the transformation of a binary quadratic form into itself was given by Hermite as follows:[M 57]

This becomes equivalent to Lorentz boost (5a) by setting (a,b,c)=(-1,0,1) and Lorentz boost (4a) by additionally setting which produces t=γ and u=βγ.

Bour (1856) – Homogeneous coordinates

Following Gauss (1818), Edmond Bour (1856) wrote the transformations:[M 58]

Setting , the transformation system (2) becomes Lorentz transformation (1b) (n=2).

Transformation system (2) is equivalent to Lorentz transformation (1a) (n=2), implying

Somov (1863) – Homogeneous coordinates

Following Gauss (1818), Jacobi (1827, 1833), and Bour (1856), Osip Ivanovich Somov (1863) wrote the transformation systems:[M 59]

Transformation system (1) is equivalent to Lorentz transformation (1b) (n=2).

Transformation system (2) is equivalent to Lorentz transformation (1a) (n=2).

Beltrami (1868) – Beltrami coordinates

Eugenio Beltrami (1868a) introduced coordinates of the Beltrami–Klein model of hyperbolic geometry, and formulated the corresponding transformations in terms of homographies:[M 60]

(where the disk radius a and the radius of curvature R are real in spherical geometry, in hyperbolic geometry they are imaginary), and for arbitrary dimensions in (1868b)[M 61]

Setting a=a0=c as speed of light and r0=v as the relative velocity, Beltrami's (1868a) formulas become the relativistic velocity addition formulas (3e or 4d), being special cases of the general velocity addition (1b). In his (1868b) formulas, one sets a=b=c and a1=v for velocity addition in arbitrary dimensions.

Bachmann (1869) – Cayley–Hermite transformation

Paul Gustav Heinrich Bachmann (1869) adapted Hermite's (1853/54) transformation of ternary quadratic forms to the case of integer transformations. He particularly analyzed the Lorentz interval and its transformation, and also alluded to the analogous result of Gauss (1800) in terms of Cayley–Klein parameters, while Bachmann formulated his result in terms of the Cayley–Hermite transformation:[M 62]

He described this transformation in 1898 in the first part of his "arithmetics of quadratic forms" as well.[54]

This is equivalent to Lorentz transformation (5b), producing the relation .

Klein (1871–1897)

Cayley absolute and non-Euclidean geometry

Elaborating on Cayley's (1859) definition of an "absolute" (Cayley–Klein metric), Felix Klein (1871) defined a "fundamental conic section" in order to discuss motions such as rotation and translation in the non-Euclidean plane,[M 63] and another fundamental form by using homogeneous coordinates x,y related to a circle with radius 2c with measure of curvature . When c is positive, the measure of curvature is negative and the fundamental conic section is real, thus the geometry becomes hyperbolic (Beltrami–Klein model):[M 64]

In (1873) he pointed out that hyperbolic geometry in terms of a surface of constant negative curvature can be related to a quadratic equation, which can be transformed into a sum of squares of which one square has a different sign, and can also be related to the interior of a surface of second degree corresponding to an ellipsoid or two-sheet hyperboloid.[M 65]

Using positive c in in line with hyperbolic geometry or directly by setting , Klein's two quadratic forms can be related to expressions and for the Lorentz interval in (3d) and (6d).

Möbius transformation, spin transformation, Cayley–Klein parameter

In (1872) while devising the Erlangen program, Klein discussed the general relation between projective metrics, binary forms and conformal geometry transforming a sphere into itself in terms of linear transformations of the complex variable x+iy.[M 66] Following Klein, these relations were discussed by Ludwig Wedekind (1875) using .[M 67] Klein (1875) then showed that all finite groups of motions follow by determining all finite groups of such linear transformations of x+iy into itself.[M 68] In (1878),[M 69] Klein classified the substitutions of with αδ-βγ=1 into hyperbolic, elliptic, and parabolic ones, and in (1882)[M 70] he added the loxodromic substitution as the combination of elliptic and hyperbolic ones. (In 1890, Robert Fricke, in his edition of Klein's lectures on elliptic functions and modular forms, referred to the analogy of this treatment to the theory of quadratic forms as given by Gauss and in particular Dirichlet.)[M 44]

In (1884) Klein related the linear fractional transformations (interpreted as rotations around the x+iy-sphere) to Cayley–Klein parameters [α,β,γ,δ], to Euler–Rodrigues parameters [a,b,c,d], and to the unit sphere by means of stereographic projection, and also discussed transformations preserving surfaces of second degree equivalent to the transformation given by Cayley (1854):[M 71]

The formulas on the left related to the unit sphere are equivalent to Lorentz transformation (6c). The formulas on the right can be related to those on the left by setting

and become equivalent to Lorentz transformation (6a) by setting

and subsequently solved for x1... it becomes Lorentz transformation (6b).

In his lecture in the winter semester of 1889/90 (published 1892–93), he discussed the hyperbolic plane by using (as in 1871) the Lorentz interval in terms of a circle with radius 2k as the basis of hyperbolic geometry, and another quadratic form to discuss the "kinematics of hyperbolic geometry" consisting of motions and congruent displacements of the hyperbolic plane into itself:[M 72]

Klein's Lorentz interval can be connected with the other interval by setting

,

by which the transformation system on the right becomes equivalent to Lorentz transformation (6d) with 2k=1, and subsequently solved for x1... it becomes equivalent to Lorentz transformation (6e).

In his lecture in the summer semester of 1890 (published 1892–93), he discussed general surfaces of second degree, including an "oval" surface corresponding to hyperbolic space and its motions:[M 73]

The transformation of the unit sphere on the right is equivalent to Lorentz transformation (6c). Plugging the values for λ,μ,λ′,μ′,... from the right into the transformations on the left, and relating them to Klein's homogeneous coordinates by leads to Lorentz transformation (6a). Subsequently solved for x1... it becomes Lorentz transformation (6b).

In (1896/97), Klein again defined hyperbolic motions and explicitly used t as time coordinate:[M 74]

This is equivalent to Lorentz transformation (6a).

Klein's work was summarized and extended by Bianchi (1888–1893) and Fricke (1893–1897), obtaining equivalent Lorentz transformations.

Conformal transformation and polyspherical coordinates

In relation to line geometry, Klein (1871/72)[M 75] used coordinates satisfying the condition . They were introduced in 1868 (belatedly published in 1872/73) by Gaston Darboux[M 76] as a system of five coordinates in R3 (later called "pentaspherical" coordinates) in which the last coordinate is imaginary. Sophus Lie (1871)[M 77] more generally used n+2 coordinates in Rn (later called "polyspherical" coordinates) satisfying in which the last coordinate is imaginary, as a means to discuss conformal transformations generated by inversions. These simultaneous publications can be explained by the fact that Darboux, Lie, and Klein corresponded with each other by letter.

When the last coordinate is defined as real, the corresponding polyspherical coordinates satisfy the form of a sphere. Initiated by lectures of Klein in 1889–1890, his student Friedrich Pockels (1891) used such real coordinates, emphasizing that all of these coordinate systems remain invariant under conformal transformations generated by inversions:[M 78]

Special cases were described by Klein (1893):[M 79]

(pentaspherical).
(tetracyclical).

Both systems were also described by Maxime Bôcher (1894) in an expanded version of a thesis supervised by Klein.[M 80]

Polyspherical coordinates indicate that the conformal group Con(0,p) is isomorphic to the Lorentz group SO(1,p+1).[55] For instance, Con(0,2) – known as Möbius group – is related to tetracyclical coordinates satisfying , which is nothing other than the Lorentz interval invariant under the Lorentz group SO(1,3).

Lie (1871–1893)

Conformal, spherical, and orthogonal transformations

In several papers between 1847 and 1850 it was shown by Joseph Liouville[M 81] that the relation λ(δx2+δy2+δz2) is invariant under the group of conformal transformations generated by inversions transforming spheres into spheres, which can be related to special conformal transformations or Möbius transformations. (The conformal nature of the linear fractional transformation of a complex variable was already discussed by Euler (1777)).[M 82][56]

Liouville's theorem was extended to all dimensions by Sophus Lie (1871a).[M 83][57] In addition, Lie described a manifold whose elements can be represented by spheres, where the last coordinate yn+1 can be related to an imaginary radius by iyn+1:[M 84]

If the second equation is satisfied, two spheres y′ and y″ are in contact. Lie then defined the correspondence between contact transformations in Rn and conformal point transformations in Rn+1: The sphere of space Rn consists of n+1 parameters (coordinates plus imaginary radius), so if this sphere is taken as the element of space Rn, it follows that Rn now corresponds to Rn+1. Therefore, any transformation (to which he counted orthogonal transformations and inversions) leaving invariant the condition of contact between spheres in Rn, corresponds to the conformal transformation of points in Rn+1.

Eventually, Lie (1871/72) pointed out that conformal point transformations consist of motions (such as rigid transformations and orthogonal transformations), similarity transformations, and inversions.[M 85]

As shown by Bateman and Cunningham (1909), the spacetime conformal group Con(1,3) of "spherical wave transformations" corresponds to the transformations of Lie's sphere geometry in which the radius indicates the fourth coordinate, while the Lorentz group SO(1,3) is a subgroup of Con(1,3).

Transforming pseudospherical surfaces

Lie (1879/80) derived an operation from Pierre Ossian Bonnet's (1867) investigations on surfaces of constant curvature, by which pseudospherical surfaces can be transformed into each other.[M 86] Lie gave explicit formulas for this operation in two papers published in 1881: If are asymptotic coordinates of two principal tangent curves and their respective angle, and is a solution of the sine-Gordon equation , then the following operation (now called Lie transform) is also a solution from which infinitely many new surfaces of the same curvature can be derived:[M 87]

In (1880/81) he wrote these relations as follows:[M 88]

In (1883/84) he showed that the combination of Lie transform O with Bianchi transform I produces Bäcklund transform B of pseudospherical surfaces: B=OIO−1. [M 89]

As shown by Bianchi (1886) and Darboux (1891/94), the Lie transform is equivalent to Lorentz transformations (9a) and (9b) in terms of null coordinates 2s=u+v and 2σ=u-v. In general, it can be shown that the Sine-Gordon equation is Lorentz invariant.
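The equivalence with a boost can be sketched numerically. The code assumes that the Lie transform acts as the scaling s → m·s, σ → σ/m on the asymptotic coordinates, which is the form usually quoted for it. The source formula is not reproduced here, and the function names are hypothetical.

```python
import math

def lie_scaling(s, sigma, m):
    """Assumed Lie transform: stretch one asymptotic coordinate, shrink the other."""
    return m * s, sigma / m

def to_uv(s, sigma):
    """Invert the null coordinates 2s = u + v, 2*sigma = u - v."""
    return s + sigma, s - sigma

eta = 0.4            # hypothetical rapidity
m = math.exp(eta)    # scaling parameter of the transform
s, sigma = 1.5, 0.25

u, v = to_uv(s, sigma)
u_new, v_new = to_uv(*lie_scaling(s, sigma, m))

# In (u, v) the scaling acts as a hyperbolic rotation by eta.
print(u_new, u * math.cosh(eta) + v * math.sinh(eta))
print(v_new, u * math.sinh(eta) + v * math.cosh(eta))
```

Because the product sσ is unchanged by the scaling, the combination u² − v² = 4sσ is invariant, which is the Lorentz interval in these coordinates.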

Lie group, hyperbolic motions, and infinitesimal transformations

In (1885/86), Lie identified the projective group of a general surface of second degree with the group of non-Euclidean motions.[M 90] In a thesis guided by Lie, Hermann Werner (1889) discussed this projective group by using the equation of a unit hypersphere as the surface of second degree (which was already given before by Killing (1887)), and also gave the corresponding infinitesimal projective transformations (Lie algebra):[M 91]

More generally, Lie (1890)[M 92] defined non-Euclidean motions in terms of two forms in which the imaginary form with denotes the group of elliptic motions (in Klein's terminology), the real form with −1 the group of hyperbolic motions, with the latter having the same form as Werner's transformation:[M 93]

Summarizing, Lie (1893) discussed the real continuous groups of the conic sections representing non-Euclidean motions, which in the case of hyperbolic motions have the form:

[M 94] or [M 95] or .[M 96]

The group of hyperbolic motions is isomorphic to the Lorentz group. The interval becomes the Lorentz interval by setting

Selling (1873–74) – Quadratic forms

Continuing the work of Gauss (1801) on definite ternary quadratic forms and Hermite (1853) on indefinite ternary quadratic forms, Eduard Selling (1873) used the auxiliary coefficients ξ,η,ζ by which a definite form and an indefinite form f can be rewritten in terms of three squares:[M 97][58]

In addition, Selling showed that auxiliary coefficients ξ,η,ζ can be geometrically interpreted as point coordinates which are in motion upon one sheet of a two-sheet hyperboloid, which is related to Selling's formalism for the reduction of indefinite forms by using definite forms.[M 98]

Selling also reproduced the Lorentz transformation given by Gauss (1800/63), to whom he gave full credit, and called it the only example of a particular indefinite ternary form known to him that has ever been discussed:[M 99]

This is equivalent to Lorentz transformation (6e), containing Lorentz boost (6f) or (9b) as a special case with and .

Laisant (1874)

Elliptic polar coordinates

Charles-Ange Laisant (1874) extended circular trigonometry to elliptic trigonometry. In his model, polar coordinates x, y of circular trigonometry are related to polar coordinates x', y' of elliptic trigonometry by the relation[M 100]

He noticed the geometrical implication that any elliptic polar system of coordinates obtained by this formula is located on the same equilateral hyperbola having its asymptotes as axes.

This is equivalent to Lorentz transformation (9a).

Equipollences

In his French translation of Giusto Bellavitis' principal work on equipollences, Laisant (1874) added a chapter related to hyperbolas. The equipollence OM and its tangent MT of a hyperbola is defined by Laisant as[M 101]

(1)

Here, OA and OB are conjugate semi-diameters of a hyperbola with OB being imaginary, both of which he related to two other conjugated semi-diameters OC and OD by the following transformation:

producing the invariant relation

.

Substituting into (1), he showed that OM retains its form

He also defined velocity and acceleration by differentiation of (1).

These relations are equivalent to several Lorentz boosts or hyperbolic rotations producing the invariant Lorentz interval in line with (3b).

Escherich (1874) – Beltrami coordinates

Gustav von Escherich (1874) discussed the plane of constant negative curvature[59] based on the Beltrami–Klein model of hyperbolic geometry by Beltrami (1868). Similar to Christoph Gudermann (1830)[M 102] who introduced axial coordinates x=tan(a) and y=tan(b) in sphere geometry in order to perform coordinate transformations in the case of rotation and translation, Escherich used hyperbolic functions x=tanh(a/k) and y=tanh(b/k)[M 103] in order to give the corresponding coordinate transformations for the hyperbolic plane, which for the case of translation have the form:[M 104]

and

This is equivalent to Lorentz transformation (3e), also equivalent to the relativistic velocity addition (4d) by setting and multiplying [x,y,x′,y′] by 1/c, and equivalent to Lorentz boost (3b) by setting . This is the relation between the Beltrami coordinates in terms of Gudermann-Escherich coordinates, and the Weierstrass coordinates of the hyperboloid model introduced by Killing (1878–1893), Poincaré (1881), and Cox (1881). Both coordinate systems were compared by Cox (1881).[M 105]
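The relation between the two coordinate systems can be sketched as follows. A point on the hyperboloid is boosted, and the resulting ratios are compared with a velocity-addition-type formula acting directly on the ratios. Sign conventions, the choice k = c = 1, and all names are assumptions made for illustration.

```python
import math

def boost(point, eta):
    """Hyperbolic rotation of hyperboloid coordinates (x0, x1, x2)."""
    x0, x1, x2 = point
    ch, sh = math.cosh(eta), math.sinh(eta)
    return (x0 * ch - x1 * sh, x1 * ch - x0 * sh, x2)

def beltrami(point):
    """Ratios (x1/x0, x2/x0), i.e. coordinates of the Beltrami-Klein type."""
    x0, x1, x2 = point
    return (x1 / x0, x2 / x0)

def translate(ratios, v):
    """The same map written directly on the ratios (velocity-addition form)."""
    u1, u2 = ratios
    gamma = 1.0 / math.sqrt(1.0 - v * v)
    d = 1.0 - u1 * v
    return ((u1 - v) / d, u2 / (gamma * d))

# Hypothetical point on the sheet x0^2 - x1^2 - x2^2 = 1.
x1, x2 = 0.3, -0.8
p = (math.sqrt(1.0 + x1 * x1 + x2 * x2), x1, x2)
eta = 0.6

via_hyperboloid = beltrami(boost(p, eta))
via_ratios = translate(beltrami(p), math.tanh(eta))
print(via_hyperboloid, via_ratios)
```

Both routes give the same pair of numbers, so the fractional-linear translation formulas are the projective image of a linear hyperbolic rotation.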

Glaisher (1878)

It was shown by James Whitbread Lee Glaisher (1878) that the hyperbolic addition laws can be written as matrix multiplication[M 106]

This is equivalent to Lorentz boost (3c).
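The matrix form of the addition laws can be checked directly. This is a minimal sketch in modern notation. The symmetric arrangement of cosh and sinh is an assumption and need not match Glaisher's original layout.

```python
import math

def hyperbolic_matrix(a):
    return [[math.cosh(a), math.sinh(a)],
            [math.sinh(a), math.cosh(a)]]

def matmul(p, q):
    return [[sum(p[i][k] * q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

a, b = 0.3, 1.1  # hypothetical hyperbolic angles
product = matmul(hyperbolic_matrix(a), hyperbolic_matrix(b))
expected = hyperbolic_matrix(a + b)
print(product, expected)
```

The product of the two matrices is the matrix of the summed argument, which restates the addition laws for cosh and sinh and, read physically, the composition of two collinear boosts.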

Killing (1878–1893)

Weierstrass coordinates

Wilhelm Killing (1878–1880) described non-Euclidean geometry by using Weierstrass coordinates (named after Karl Weierstrass who described them in lectures in 1872 which Killing attended) obeying the form

[M 107] with [M 108]

or[M 109]

where k is the reciprocal measure of curvature, denotes Euclidean geometry, elliptic geometry, and hyperbolic geometry. In (1877/78) he pointed out the possibility and some characteristics of a transformation (indicating rigid motions) preserving the above form.[M 110] In (1879/80) he wrote the corresponding transformations as a general rotation matrix[M 111]

In (1885) he wrote the Weierstrass coordinates and their transformation as follows:[M 112]

This is similar to Lorentz transformation (1a) (n=2) with

In (1885) he also gave the transformation for n dimensions:[M 113][60]

This is similar to Lorentz transformation (1a) with

In (1885) he applied his transformations to mechanics and defined four-dimensional vectors of velocity and force.[M 114] Regarding the geometrical interpretation of his transformations, Killing argued in (1885) that by setting and using p,x,y as rectangular space coordinates, the hyperbolic plane is mapped on one side of a two-sheet hyperboloid (known as hyperboloid model),[M 115][61] by which the previous formulas become equivalent to Lorentz transformations and the geometry becomes that of Minkowski space. Finally, in (1893) he wrote:[M 116]

This is equivalent to Lorentz transformation (1a) (n=2) with

and for n dimensions[M 117]

This is equivalent to Lorentz transformation (1a) with

Translation in the hyperbolic plane

The case of translation was given by Killing (1893) in the form[M 118]

This is equivalent to Lorentz boost (3b).

In 1898, Killing wrote that relation in a form similar to Escherich (1874), and derived the corresponding Lorentz transformation for the two cases where v is unchanged or u is unchanged:[M 119]

The upper transformation system is equivalent to Lorentz transformation (3e) and the velocity addition (4d) with l=c and , the system below is equivalent to Lorentz boost (3b).

Infinitesimal transformations and Lie group

After Lie (1885/86) identified the projective group of a general surface of second degree with the group of non-Euclidean motions, Killing (1887/88)[M 120] defined the infinitesimal projective transformations (Lie algebra) in relation to the unit hypersphere:

and in (1892) he defined the infinitesimal transformation for non-Euclidean motions in terms of Weierstrass coordinates:[M 121]

In (1897/98) he pointed out (1) that the corresponding group of non-Euclidean motions in terms of Weierstrass coordinates is intransitive when related to form (a) and transitive when related to form (b), and he also showed (2) the relation of Weierstrass coordinates to the notation of Killing (1887/88), Werner (1889), and Lie (1890):[M 122]

Setting denotes the group of hyperbolic motions and thus the Lorentz group.

Günther (1880/81)

Elliptic polar coordinates

Following Laisant (1874), Siegmund Günther (1880/81) demonstrated the relation between circular polar coordinates and elliptic polar coordinates as[M 123]

showing that any elliptic polar system of coordinates obtained by this formula is located on the same equilateral hyperbola having its asymptotes as axes.

This is equivalent to Lorentz transformation (9a).

Matrix multiplication

Following Glaisher (1878), he formulated the hyperbolic addition laws in matrix form as[M 124]

This is equivalent to Lorentz boost (3c).
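The matrix form of the hyperbolic addition laws can be illustrated with a short numerical check. This is only a sketch in modern notation: the sign convention of the off-diagonal entries and the helper names `boost`, `matmul`, and `close` are assumptions, not taken from Günther's paper.

```python
import math

def boost(eta):
    """2x2 hyperbolic rotation with parameter eta (modern: rapidity)."""
    return [[math.cosh(eta), math.sinh(eta)],
            [math.sinh(eta), math.cosh(eta)]]

def matmul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def close(a, b, tol=1e-12):
    """Entry-wise comparison of two 2x2 matrices."""
    return all(abs(a[i][j] - b[i][j]) < tol
               for i in range(2) for j in range(2))

eta1, eta2 = 0.3, 0.5
product = matmul(boost(eta1), boost(eta2))   # compose two hyperbolic rotations
combined = boost(eta1 + eta2)                # single rotation with summed parameter
```

Multiplying the two matrices reproduces cosh(η1+η2) and sinh(η1+η2) entry by entry, which is the addition law in matrix form.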

Poincaré (1881–1887)

Weierstrass coordinates

Henri Poincaré (1881) connected the work of Hermite (1853) and Selling (1873) on indefinite quadratic forms with non-Euclidean geometry (Poincaré already discussed such relations in an unpublished manuscript in 1880).[62] He used two indefinite ternary forms in terms of three squares and then defined them in terms of Weierstrass coordinates (without using that expression) connected by a transformation with integer coefficients:[M 125][63]

He went on to describe the properties of "hyperbolic coordinates".[M 126][61] Poincaré mentioned the hyperboloid model also in (1887).[M 127]

This is equivalent to Lorentz transformation (1a) (n=2).

Möbius transformation

Poincaré (1881a) also demonstrated the connection of his above formulas to Möbius transformations:[M 125]

This is equivalent to Lorentz transformation (6g).

Poincaré (1881b) also used the Möbius transformation in relation to Fuchsian functions and the discontinuous Fuchsian group, being a special case of the hyperbolic group leaving invariant the "fundamental circle" (Poincaré disk model and Poincaré half-plane model of hyperbolic geometry).[M 128] He then extended Klein's (1878-1882) study on the relation between Möbius transformations and hyperbolic, elliptic, parabolic, and loxodromic substitutions, and while formulating Kleinian groups (1883) he used the following transformation leaving invariant the generalized circle:[M 129]

Setting this becomes transformation u′ in (6a) and becomes the complete Lorentz transformation by setting .

In 1886, Poincaré investigated the relation between indefinite ternary quadratic forms and Fuchsian functions and groups:[M 130]

This is equivalent to transformation u′ in (6d) and becomes the complete Lorentz transformation by suitably choosing the coefficients a,b,c,... so that [X,Y,Z]=[x+z, y, -x+z].

Cox (1881–1883)

Weierstrass coordinates

Homersham Cox (1881/82) – referring to similar rectangular coordinates used by Gudermann (1830)[M 102] and George Salmon (1862)[M 131] on a sphere, and to Escherich (1874) as reported by Johannes Frischauf (1876)[M 132] in the hyperbolic plane – defined the Weierstrass coordinates (without using that expression) and their transformation:[M 133]

Replacing with , this becomes Lorentz transformation (1a) (n=2) up to a sign change in the inverse transformation.

Cox also gave the Weierstrass coordinates and their transformation in hyperbolic space:[M 134]

Replacing with , this becomes Lorentz transformation (1a) (n=3) up to a sign change in both the first as well as inverse transformation.

The case of translation was also given by him, where the y-axis remains unchanged:[M 135]

and

This is equivalent to Lorentz boost (3b).

Quaternions

Subsequently, Cox (1882/83) also described hyperbolic geometry in terms of an analogue to quaternions and Hermann Grassmann's exterior algebra. To that end, he used hyperbolic numbers (without mentioning Cockle (1848)) as a means to transfer point P to point Q in the hyperbolic plane, which he wrote in the form:[M 136]

In (1882/83a) he showed the equivalence of PQ=-cosh(θ)+ι·sinh(θ) with "quaternion multiplication",[M 137] and in (1882/83b) he described QP−1=cosh(θ)+ι·sinh(θ) as being "associative quaternion multiplication".[M 138] He also showed that the position of point P in the hyperbolic plane may be determined by three quantities in terms of Weierstrass coordinates obeying the relation z2-x2-y2=1.[M 139]

Cox's associative quaternion multiplication using the hyperbolic versor is equivalent to the Lorentz boost (7b) by setting and .

Cox went on to develop an algebra for hyperbolic space analogous to Clifford's biquaternions. While Clifford (1873) used biquaternions of the form a+ωb in which ω2=0 denotes parabolic space and ω2=1 elliptic space, Cox discussed hyperbolic space using the imaginary quantity and therefore ω2=-1.[M 140] He also obtained relations of quaternion multiplication in terms of Weierstrass coordinates:[M 141]

Hill (1882) – Homogeneous coordinates

Following Gauss (1818), George William Hill (1882) formulated the equations[M 142]

Transformation system (1) is equivalent to Lorentz transformation (1b) (n=2) with .

Transformation system (2) is equivalent to Lorentz transformation (1a) (n=2) .

Laguerre (1882) – Laguerre inversion

After previous work by Albert Ribaucour (1870),[M 143] a transformation which transforms oriented spheres into oriented spheres, oriented planes into oriented planes, and oriented lines into oriented lines, was explicitly formulated by Edmond Laguerre (1882) as "transformation by reciprocal directions" which was later called "Laguerre inversion/transformation". It can be seen as a special case of the conformal group in terms of Lie's transformations of oriented spheres. In two dimensions the transformation of oriented lines has the form (R being the radius):[M 144]

This is equivalent (up to a sign change) to Lorentz transformation (5a) in terms of Cayley–Hermite parameters (even though Laguerre didn't use the Cayley-Hermite transformation (Q2)). Lorentz boost (4a) follows with .

Picard (1882–1884) – Quadratic forms

Émile Picard (1882) analyzed the invariance of indefinite ternary Hermitian quadratic forms with integer coefficients and their relation to discontinuous groups, extending Poincaré's Fuchsian functions of one complex variable related to a circle, to "hyperfuchsian" functions of two complex variables related to a hypersphere. He formulated the following special case of an Hermitian form:[M 145][64]

Replacing the imaginary variables and coefficients with real ones, transformation system (1) is equivalent to Lorentz transformation (1a) (n=2) producing x2+y2-z2=X2+Y2-Z2 and transformation system (2) is equivalent to Lorentz transformation (1b) (n=2) producing x2+y2=X2+Y2=1.

Or in (1884a) in relation to indefinite binary Hermitian quadratic forms:[M 146]

Replacing the imaginary variables and coefficients with real ones, this is equivalent to Lorentz transformation (1a) (n=1) producing U2-V2=u2-v2.

Or in (1884b):[M 147]

Replacing the imaginary variables and coefficients with real ones, this is equivalent to Lorentz transformation (1b) (n=2) producing x2+y2=X2+Y2=1.

Or in (1884c):[M 148]

Replacing the imaginary variables and coefficients with real ones, transformation system (1) is equivalent to Lorentz transformation (1a) (n=2) producing U2+V2-W2=u2+v2-w2 and transformation system (2) is equivalent to Lorentz transformation (1b) (n=2) producing .

Stephanos (1883) – Biquaternions

Cyparissos Stephanos (1883)[M 149] showed that Hamilton's biquaternion a0+a1ι1+a2ι2+a3ι3 can be interpreted as an oriented sphere in terms of Lie's sphere geometry (1871), having the vector a1ι1+a2ι2+a3ι3 as its center and the scalar as its radius. Its norm is thus equal to the power of a point of the corresponding sphere. In particular, the norm of two quaternions N(Q1-Q2) (the corresponding spheres are in contact with N(Q1-Q2)=0) is equal to the tangential distance between two spheres. The general contact transformation between two spheres then can be given by a homography using 4 arbitrary quaternions A,B,C,D and two variable quaternions X,Y:[M 150][65][66]

(or ).

Stephanos pointed out that the special case A=0 denotes transformations of oriented planes (see Laguerre (1882)).

The Lorentz group SO(1,3) is a subgroup of the conformal group Con(1,3) in terms of Lie's transformations of oriented spheres in which the radius indicates the fourth coordinate. The Lorentz group is isomorphic to the group of Laguerre's transformation of oriented planes.

Buchheim (1884–85) – Biquaternions

Arthur Buchheim (1884, published 1885) applied Clifford's biquaternions and their operator ω to different forms of geometries (Buchheim mentions Cox (1882) as well). He defined the scalar ω2=e2 which in the case -1 denotes hyperbolic space, 1 elliptic space, and 0 parabolic space. He derived the following relations consistent with the Cayley–Klein absolute:[M 151]

By choosing e2=-1 for hyperbolic space, the Cayley absolute becomes the Lorentz interval.

Darboux (1883–1891)

Transformations of pseudospherical surfaces

Gaston Darboux (1883) represented Lie's transformation (1879/81) of pseudospheres into each other as follows:[M 152]

This becomes Lorentz boost (9a) by interpreting x, y as null coordinates.
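The role of null coordinates can be made concrete with a small check that a boost acts on them as a pure rescaling (a squeeze mapping). This is a sketch under stated assumptions: the null coordinates are taken as u = ct + x and v = ct − x, the boost uses the rapidity η, and the names `boost`, `to_null`, and `squeeze` are hypothetical and do not correspond to Darboux's variables.

```python
import math

def boost(ct, x, eta):
    """Standard Lorentz boost of (ct, x) with rapidity eta."""
    return (ct * math.cosh(eta) - x * math.sinh(eta),
            x * math.cosh(eta) - ct * math.sinh(eta))

def to_null(ct, x):
    """Null (light-cone) coordinates u = ct + x, v = ct - x."""
    return ct + x, ct - x

def squeeze(u, v, k):
    """Squeeze mapping: one null coordinate is divided by k, the other multiplied."""
    return u / k, v * k

eta = 0.7
k = math.exp(eta)                 # equals sqrt((1 + beta) / (1 - beta)), beta = tanh(eta)
ct, x = 2.0, 1.5
u, v = to_null(ct, x)
u_boosted, v_boosted = to_null(*boost(ct, x, eta))
u_squeezed, v_squeezed = squeeze(u, v, k)
```

With these conventions the boosted null coordinates coincide with the squeezed ones, and the product uv = c²t² − x² is left unchanged.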

Similar to Bianchi (1886), Darboux (1891/94) showed that the Lie transform gives rise to the following relations:[M 153]

.

Equations (1) together with transformation (2) gives Lorentz boost (9a) in terms of null coordinates. Transformation (3) is equivalent to trigonometric Lorentz boost (8a), and becomes Lorentz boost (4a) with .

Laguerre inversion

Following Laguerre (1882), Gaston Darboux (1887) presented the Laguerre inversions in four dimensions using coordinates x,y,z,R:[M 154]

This is equivalent (up to a sign change for R) to Lorentz transformation (5a) in terms of Cayley–Hermite parameters (even though Darboux didn't use the Cayley-Hermite transformation (Q2)). Lorentz boost (4a) follows with .

Darboux rewrote these equations as follows:

This is equivalent (up to a sign change for R) to a squeeze mapping in terms of Lorentz boost (9c) and (9d) where Darboux's k corresponds to a.

Callandreau (1885) – Homography

Following Gauss (1818) and Hill (1882), Octave Callandreau (1885) formulated the equations[M 155]

The transformation system is equivalent to Lorentz transformation (1b) (n=2) with .

Lipschitz (1885–86)

Boosts

Rudolf Lipschitz (1885/86) formulated transformations leaving invariant the sum of squares , which he rewrote as . This led to the problem of finding transformations leaving invariant the pairs (a=1...n) for which he gave the following solution:[M 156]

Equation system (1) represents Lorentz boost or squeeze mapping (9a), and (2) represents Lorentz boost (9b). Equation (3a) is very similar to the Doppler factor and (3b) to the standard Lorentz boost (4a). However, because of both the square root and the composition of x- and y- variables differs from (4a), whereas in relativity one uses as velocity smaller than the speed of light to obtain

Clifford algebra

More generally, Lipschitz used Clifford algebra in order to formulate the orthogonal transformation of a sum of squares into itself, for which he used real variables and constants, thus Λ becomes a real quaternion for n=3.[M 157] He went further and discussed transformations in which both variables x,y... and constants are complex, thus Λ becomes a complex quaternion (i.e. biquaternion) for n=3.[M 158] The transformation system for both real and complex quantities has the form:[M 159]

Lipschitz noticed that this corresponds to the transformations of quadratic forms given by Hermite (1854) and Cayley (1855). He then modified his equations to discuss the general indefinite quadratic form, by defining some variables and constants as real and some of them as purely imaginary:[M 160]

resulting in

By setting m=n-1 or n=m+1, the Lorentz interval and the Lorentz transformation follow.

Schur (1885/86, 1900/02) – Beltrami coordinates

Friedrich Schur (1885/86) discussed spaces of constant Riemann curvature, and by following Beltrami (1868) he used the transformation[M 161]

This is equivalent to Lorentz transformation (3e) and therefore also equivalent to the relativistic velocity addition (4d) in arbitrary dimensions by setting R=c as the speed of light and a1=v as relative velocity.

In (1900/02) he derived basic formulas of non-Euclidean geometry, including the case of translation for which he obtained the transformation similar to his previous one:[M 162]

where can have values >0, <0 or ∞.

This is equivalent to Lorentz transformation (3e) and therefore also equivalent to the relativistic velocity addition (4d) by setting a=v and .

He also defined the triangle[67]

This is equivalent to the hyperbolic law of cosines and the relativistic velocity addition (3f, b) or (4e) by setting .
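The link between the hyperbolic law of cosines and relativistic velocity composition can be verified numerically. The sketch below is written in modern notation and is not Schur's own formulation: velocities are given in units of c, the sides of the triangle are taken to be rapidities, and both function names are hypothetical.

```python
import math

def relative_speed_cosine_law(beta_u, beta_v, angle):
    """Relative speed from the hyperbolic law of cosines applied to rapidities."""
    zu, zv = math.atanh(beta_u), math.atanh(beta_v)
    cosh_w = (math.cosh(zu) * math.cosh(zv)
              - math.sinh(zu) * math.sinh(zv) * math.cos(angle))
    # clamp guards against rounding slightly below 1
    return math.tanh(math.acosh(max(1.0, cosh_w)))

def relative_speed_direct(beta_u, beta_v, angle):
    """Relative speed from the usual relativistic formula for two velocities."""
    dot = beta_u * beta_v * math.cos(angle)
    cross_sq = (beta_u * beta_v * math.sin(angle)) ** 2
    diff_sq = beta_u**2 + beta_v**2 - 2 * dot
    return math.sqrt(diff_sq - cross_sq) / (1 - dot)

beta_u, beta_v, angle = 0.6, 0.8, 1.1
w_cosine = relative_speed_cosine_law(beta_u, beta_v, angle)
w_direct = relative_speed_direct(beta_u, beta_v, angle)
```

Both routes give the same relative speed, which stays below the speed of light for any pair of subluminal velocities.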

Bianchi (1886–1893)

Transformation of pseudospherical surfaces

Luigi Bianchi (1886) investigated Lie's transformation (1880) of pseudospheres into each other, obtaining the result:[M 163]

.

Equations (1) together with transformation (2) gives Lorentz boost (9a) in terms of null coordinates. Transformation (3) and its inverse are equivalent to trigonometric Lorentz boost (8a), and becomes Lorentz boost (4b) with . Plugging equations (4) into (3) gives Lorentz boost (9b) in terms of Bondi's k factor, as well as Lorentz boost (6f) with .

In 1894, Bianchi redefined the variables u,v as asymptotic coordinates, by which the transformation obtains the form:[M 164]

.

This is equivalent to a squeeze mapping in terms of Lorentz boost (9d) where Bianchi's angle σ corresponds to θ.

Möbius and spin transformations

Related to Klein's (1871) and Poincaré's (1881-1887) work on non-Euclidean geometry and indefinite quadratic forms, Bianchi (1888) analyzed the differential Lorentz interval in terms of conic sections and hyperboloids, alluded to the linear fractional transformation of and its conjugate with parameters α,β,γ,δ in order to preserve the Lorentz interval, and gave credit to Gauss (1800/63) who obtained the same coefficient system:[M 165]

This is equivalent to Lorentz transformations (6d) and (6e), containing Lorentz boost (6f) or (9b) as a special case with and .

In 1893, Bianchi gave the coefficients in the case of four dimensions:[M 166]

This is equivalent to Lorentz transformation (6a).

Solving for Bianchi obtained:[M 166]

This is equivalent to Lorentz transformation (6b).

Lindemann (1890–91) – Weierstrass coordinates and Cayley absolute

Ferdinand von Lindemann discussed hyperbolic geometry in his (1890/91) edition of the lectures on geometry of Alfred Clebsch. Citing Killing (1885) and Poincaré (1887) in relation to the hyperboloid model in terms of Weierstrass coordinates for the hyperbolic plane and space, he set[M 167]

In addition, following Klein (1871) he employed the Cayley absolute related to surfaces of second degree, by using the following quadratic form and its transformation[M 168]

into which he put[M 169]

This is equivalent to Lorentz boost (3d) and squeeze mapping (9d) with and 2k=1 .

From that, he obtained the following Cayley absolute and the corresponding most general motion in hyperbolic space comprising ordinary rotations (a=0) or translations (α=0):[M 170]

This is equivalent to Lorentz boost (3b) with α=0 and 2k=1.

Fricke (1891–1897) – Möbius and spin transformations

Robert Fricke (1891) – following the work of his teacher Klein (1878–1882) as well as Poincaré (1881–1887) on automorphic functions and group theory – obtained the following transformation for an integer ternary quadratic form[M 171][68]

By setting q=1, the first part is equivalent to Lorentz transformation (6d) and the second part is equivalent to (6e), containing Lorentz boost (6f) or (9b) as a special case with and .

And the general case of four dimensions in 1893:[M 172]

By setting p=q=r=s=1, the first part is equivalent to Lorentz transformation (6a) and the second part to (6b).

Supported by Felix Klein, Fricke summarized his and Klein's work in a treatise concerning automorphic functions (1897). Using a sphere as the absolute, in which the interior of the sphere is denoted as hyperbolic space, they defined hyperbolic motions, and stressed that any hyperbolic motion corresponds to "circle relations" (now called Möbius transformations):[M 44]

This is equivalent to Lorentz transformation (6a).
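The connection between Möbius (spin) transformations and Lorentz transformations can be illustrated with the standard modern construction, in which an event is encoded as a 2×2 Hermitian matrix and acted upon by a complex matrix of unit determinant. This is a sketch of that textbook construction, assumed here to correspond to the article's form (6a); it is not Fricke and Klein's notation, the helper names are hypothetical, and units with c = 1 are used.

```python
def matmul(a, b):
    """Product of two 2x2 complex matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def dagger(a):
    """Conjugate transpose of a 2x2 matrix."""
    return [[a[j][i].conjugate() for j in range(2)] for i in range(2)]

def det(a):
    return a[0][0] * a[1][1] - a[0][1] * a[1][0]

def event_matrix(t, x, y, z):
    """Hermitian matrix whose determinant is t^2 - x^2 - y^2 - z^2."""
    return [[t + z, x - 1j * y],
            [x + 1j * y, t - z]]

def coordinates(X):
    """Read (t, x, y, z) back from a Hermitian matrix."""
    t = (X[0][0] + X[1][1]).real / 2
    z = (X[0][0] - X[1][1]).real / 2
    return t, X[1][0].real, X[1][0].imag, z

def interval(t, x, y, z):
    return t * t - x * x - y * y - z * z

# Arbitrary complex coefficients; d is chosen so that det(A) = 1.
a, b, c = 1 + 2j, 0.5 - 1j, 0.3j
d = (1 + b * c) / a
A = [[a, b], [c, d]]

event = (2.0, 0.3, -1.1, 0.7)
X_new = matmul(matmul(A, event_matrix(*event)), dagger(A))
event_new = coordinates(X_new)
```

Because det(A) = 1, the determinant of the event matrix, and hence the Lorentz interval, is preserved, while the transformed matrix remains Hermitian so that the new coordinates are real.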

Gérard (1892) – Weierstrass coordinates

Louis Gérard (1892) – in a thesis examined by Poincaré – discussed Weierstrass coordinates (without using that name) in the plane using the following invariant and its Lorentz transformation equivalent to (1a) (n=2):[M 173]

This is equivalent to Lorentz transformation (1a) (n=2).

He gave the case of translation as follows:[M 174]

This is equivalent to Lorentz boost (3b).

Macfarlane (1892–1900) – Hyperbolic quaternions

Alexander Macfarlane (1892, 1893) – analogous to Cockle (1848) and Cox (1882/83) – defined the hyperbolic versor in terms of hyperbolic numbers[M 175]

and in 1894 he defined the "exspherical" versor[M 176]

and used them to analyze hyperboloids of one or two sheets. This was further extended by him in (1900) in order to express trigonometry in terms of hyperbolic quaternions re, with β2=+1 and , the hyperbolic number x+yβ, and the hyperbolic versor e.[M 177]

The hyperbolic versor is the basis of Lorentz boost (7b).
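How a hyperbolic versor produces a boost can be sketched with a small hyperbolic-number type. The class `Hyperbolic`, its field names, and the identification of an event with ct + jx (j² = +1) are assumptions made for illustration; they are not Macfarlane's notation.

```python
import math
from dataclasses import dataclass

@dataclass(frozen=True)
class Hyperbolic:
    re: float   # real part
    hy: float   # coefficient of the unit j, where j*j = +1

    def __mul__(self, other):
        # (a + j b)(c + j d) = (ac + bd) + j (ad + bc), using j*j = +1
        return Hyperbolic(self.re * other.re + self.hy * other.hy,
                          self.re * other.hy + self.hy * other.re)

    def modulus_sq(self):
        """Squared modulus re^2 - hy^2, the analogue of the Lorentz interval."""
        return self.re**2 - self.hy**2

def versor(eta):
    """Hyperbolic versor cosh(eta) + j sinh(eta), of unit modulus."""
    return Hyperbolic(math.cosh(eta), math.sinh(eta))

event = Hyperbolic(2.0, 1.5)          # read as ct + j x
boosted = versor(-0.4) * event        # boost with rapidity 0.4
composed = versor(0.3) * versor(0.5)  # versors multiply by adding their parameters
```

Multiplication by a versor leaves the squared modulus unchanged, and two versors compose into a single versor with the summed parameter.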

Woods (1895–1905)

Spin transformation

In a thesis supervised by Felix Klein, Frederick S. Woods (1895) further developed Bianchi's (1888) treatment of surfaces satisfying the Lorentz interval (pseudominimal surface), and used the transformation of Gauss (1800/63) and Bianchi (1888) while discussing automorphisms of that surface:[M 178]

The expressions within the brackets are equivalent to Lorentz transformations (6e), containing Lorentz boost (6f) or (9b) as a special case with and .

Beltrami and Weierstrass coordinates

In (1901/02) he defined the following invariant quadratic form and its projective transformation in terms of Beltrami coordinates (he pointed out that this can be connected to hyperbolic geometry by setting with R as real quantity):[M 179]

This is equivalent to Lorentz transformation (1b) (n=3) with k2=-1.

Alternatively, Woods (1903, published 1905) – citing Killing (1885) – used the invariant quadratic form in terms of Weierstrass coordinates and its transformation (with for hyperbolic space):[M 180]

This is equivalent to Lorentz transformation (1a) (n=3) with k2=-1.

and the case of translation:[M 181]

This is equivalent to Lorentz boost (3b) with k2=-1.

and the loxodromic substitution for hyperbolic space:[M 182]

This is equivalent to Lorentz boost (3b) with β=0.

Whitehead (1897/98) – Universal algebra

Alfred North Whitehead (1898) discussed the kinematics of hyperbolic space as part of his study of universal algebra, and obtained the following transformation:[M 183]

This is equivalent to Lorentz boost (3b) with α=0.

Scheffers (1899) – Contact transformation

Georg Scheffers (1899) synthetically determined all finite contact transformations preserving circles in the plane, consisting of dilatations, inversions, and the following one preserving circles and lines (compare with Laguerre inversion by Laguerre (1882) and Darboux (1887)):[M 184]

This is equivalent to Lorentz transformation (8a) by the identity .

Hausdorff (1899)

Weierstrass coordinates

Felix Hausdorff (1899) – citing Killing (1885) – discussed Weierstrass coordinates in the plane using the following invariant and its transformation:[M 185]

This is equivalent to Lorentz transformation (1a) (n=2).

Möbius transformation

Hausdorff (1899) also discussed the relation of the above coordinates to conformal Möbius transformations:[M 186]

This is equivalent to Lorentz transformation (6g).

Smith (1900) – Laguerre inversion

Percey F. Smith (1900) followed Laguerre (1882) and Darboux (1887) and defined the Laguerre inversion as follows:[M 187]

This is equivalent (up to a sign change) to Lorentz transformation (5a) in terms of Cayley–Hermite parameters (even though Smith didn't use the Cayley-Hermite transformation (Q2)). Lorentz boost (4a) follows with .

Vahlen (1901/02) – Clifford algebra and Möbius transformation

Modifying Lipschitz's (1885/86) variant of Clifford numbers, Theodor Vahlen (1901/02) formulated Möbius transformations (which he called vector transformations) and biquaternions in order to discuss motions in n-dimensional non-Euclidean space, leaving the following quadratic form invariant (where j2=1 represents hyperbolic motions, j2=-1 elliptic motions, j2=0 parabolic motions):[M 188]

The group of hyperbolic motions and the Möbius group are isomorphic to the Lorentz group.

Liebmann (1904–05) – Weierstrass coordinates

Heinrich Liebmann (1904/05) – citing Killing (1885), Gérard (1892), Hausdorff (1899) – used the invariant quadratic form and its Lorentz transformation equivalent to (1a) (n=2)[M 189]

This is equivalent to Lorentz transformation (1a) (n=2).

and the case of translation:[M 190]

This is equivalent to Lorentz boost (3b).

Eisenhart (1905) – Pseudospherical surfaces

Luther Pfahler Eisenhart (1905) followed Bianchi (1886, 1894) and Darboux (1891/94) by writing Lie's transformation (1879/81) of pseudospherical surfaces:[M 191]

.

Equations (1) together with transformation (2) gives Lorentz boost (9a) in terms of null coordinates. Transformation (3) is equivalent to Lorentz boost (9b) in terms of Bondi's k factor, as well as Lorentz boost (6f) with . Transformation (4) is equivalent to trigonometric Lorentz boost (8b), and becomes Lorentz boost (4b) with . Eisenhart's angle σ corresponds to ϑ of Lorentz boost (9d).

Electrodynamics and special relativity

Voigt (1887)

Woldemar Voigt (1887)[R 4] developed a transformation in connection with the Doppler effect and an incompressible medium, being in modern notation:[69][70]

If the right-hand sides of his equations are multiplied by γ they are the modern Lorentz transformation (4b). In Voigt's theory the speed of light is invariant, but his transformations mix up a relativistic boost together with a rescaling of space-time. Optical phenomena in free space are scale, conformal (using the factor λ discussed above), and Lorentz invariant, so the combination is invariant too.[70] For instance, Lorentz transformations can be extended by using :[R 5]

.

l=1/γ gives the Voigt transformation, l=1 the Lorentz transformation. But scale transformations are not a symmetry of all the laws of nature, only of electromagnetism, so these transformations cannot be used to formulate a principle of relativity in general. It was demonstrated by Poincaré and Einstein that one has to set l=1 in order to make the above transformation symmetric and to form a group as required by the relativity principle, therefore the Lorentz transformation is the only viable choice.
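The role of the scale factor l can be explored numerically. The sketch below assumes the extended transformation has the standard form t′ = lγ(t − vx/c²), x′ = lγ(x − vt), y′ = ly, z′ = lz; the function names are hypothetical and c = 1 is used by default.

```python
import math

def scaled_lorentz(event, v, l, c=1.0):
    """Lorentz boost along x combined with a uniform rescaling by l."""
    t, x, y, z = event
    g = 1.0 / math.sqrt(1.0 - (v / c) ** 2)
    return (l * g * (t - v * x / c**2), l * g * (x - v * t), l * y, l * z)

def interval(event, c=1.0):
    t, x, y, z = event
    return (c * t) ** 2 - x**2 - y**2 - z**2

v = 0.6
gamma = 1.0 / math.sqrt(1.0 - v**2)
event = (2.0, 0.5, -1.0, 0.25)

lorentz_event = scaled_lorentz(event, v, l=1.0)
voigt_event = scaled_lorentz(event, v, l=1.0 / gamma)

# Apply the same kind of transformation with the opposite velocity.
lorentz_round_trip = scaled_lorentz(lorentz_event, -v, l=1.0)
voigt_round_trip = scaled_lorentz(voigt_event, -v, l=1.0 / gamma)
```

Under these assumptions the interval is multiplied by l², so light cones are preserved for any l. Only l = 1 returns the original event after transforming with v and then −v; with Voigt's choice the round trip rescales every coordinate by 1/γ², which illustrates why l = 1 is needed for the symmetric group structure.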

Voigt sent his 1887 paper to Lorentz in 1908,[71] and that was acknowledged in 1909:

In a paper "Über das Doppler'sche Princip", published in 1887 (Gött. Nachrichten, p. 41) and which to my regret has escaped my notice all these years, Voigt has applied to equations of the form (7) (§ 3 of this book) [namely ] a transformation equivalent to the formulae (287) and (288) [namely ]. The idea of the transformations used above (and in § 44) might therefore have been borrowed from Voigt and the proof that it does not alter the form of the equations for the free ether is contained in his paper.[R 6]

Hermann Minkowski also said in 1908 that the transformations which play the main role in the principle of relativity were first examined by Voigt in 1887. Voigt responded in the same paper by saying that his theory was based on an elastic theory of light, not an electromagnetic one. However, he concluded that some results were actually the same.[R 7]

Heaviside (1888), Thomson (1889), Searle (1896)

In 1888, Oliver Heaviside[R 8] investigated the properties of charges in motion according to Maxwell's electrodynamics. He calculated, among other things, anisotropies in the electric field of moving bodies represented by this formula:[72]

.

Consequently, Joseph John Thomson (1889)[R 9] found a way to substantially simplify calculations concerning moving charges by using the following mathematical transformation (like other authors such as Lorentz or Larmor, Thomson also implicitly used the Galilean transformation z-vt in his equation[73]):

Thereby, inhomogeneous electromagnetic wave equations are transformed into a Poisson equation.[73] Eventually, George Frederick Charles Searle[R 10] noted in (1896) that Heaviside's expression leads to a deformation of electric fields which he called "Heaviside-Ellipsoid" of axial ratio

[73]

Lorentz (1892, 1895)

In order to explain the aberration of light and the result of the Fizeau experiment in accordance with Maxwell's equations, Lorentz in 1892 developed a model ("Lorentz ether theory") in which the aether is completely motionless, and the speed of light in the aether is constant in all directions. In order to calculate the optics of moving bodies, Lorentz introduced the following quantities to transform from the aether system into a moving system (it's unknown whether he was influenced by Voigt, Heaviside, and Thomson)[R 11][74]

where x* is the Galilean transformation x-vt. Except for the additional γ in the time transformation, this is the complete Lorentz transformation (4b).[74] While t is the "true" time for observers resting in the aether, t′ is an auxiliary variable used only for calculating processes in moving systems. It is also important that Lorentz, and later Larmor, formulated this transformation in two steps: first an implicit Galilean transformation, and then the expansion into the "fictitious" electromagnetic system with the aid of the Lorentz transformation. In order to explain the negative result of the Michelson–Morley experiment, he (1892b)[R 12] introduced the additional hypothesis that intermolecular forces are also affected in a similar way, and introduced length contraction into his theory (without proof, as he admitted). The same hypothesis had already been made by George FitzGerald in 1889 based on Heaviside's work. While length contraction was a real physical effect for Lorentz, he considered the time transformation only as a heuristic working hypothesis and a mathematical stipulation.

In 1895, Lorentz further elaborated on his theory and introduced the "theorem of corresponding states". This theorem states that a moving observer (relative to the ether) in his "fictitious" field makes the same observations as a resting observer in his "real" field for velocities to first order in v/c. Lorentz showed that the dimensions of electrostatic systems in the ether and a moving frame are connected by this transformation:[R 13]

For solving optical problems Lorentz used the following transformation, in which the modified time variable was called "local time" (German: Ortszeit) by him:[R 14]

With this concept Lorentz could explain the Doppler effect, the aberration of light, and the Fizeau experiment.[75]

Larmor (1897, 1900)

In 1897, Larmor extended the work of Lorentz and derived the following transformation[R 15]

Larmor noted that if it is assumed that the constitution of molecules is electrical then the FitzGerald–Lorentz contraction is a consequence of this transformation, explaining the Michelson–Morley experiment. It's notable that Larmor was the first who recognized that some sort of time dilation is a consequence of this transformation as well, because "individual electrons describe corresponding parts of their orbits in times shorter for the [rest] system in the ratio 1/γ".[76][77] Larmor wrote his electrodynamical equations and transformations neglecting terms of higher order than (v/c)2 – when his 1897 paper was reprinted in 1929, Larmor added the following comment in which he described how they can be made valid to all orders of v/c:[R 16]

Nothing need be neglected: the transformation is exact if v/c2 is replaced by εv/c2 in the equations and also in the change following from t to t′, as is worked out in Aether and Matter (1900), p. 168, and as Lorentz found it to be in 1904, thereby stimulating the modern schemes of intrinsic relational relativity.

In line with that comment, in his book Aether and Matter published in 1900, Larmor used a modified local time t″=t′-εvx′/c2 instead of the 1897 expression t′=t-vx/c2 by replacing v/c2 with εv/c2, so that t″ is now identical to the one given by Lorentz in 1892, which he combined with a Galilean transformation for the x′, y′, z′, t′ coordinates:[R 17]

Larmor knew that the Michelson–Morley experiment was accurate enough to detect an effect of motion depending on the factor (v/c)2, and so he sought the transformations which were "accurate to second order" (as he put it). Thus he wrote the final transformations (where x′=x-vt and t″ as given above) as:[R 18]

by which he arrived at the complete Lorentz transformation (4b). Larmor showed that Maxwell's equations were invariant under this two-step transformation, "to second order in v/c" – it was later shown by Lorentz (1904) and Poincaré (1905) that they are indeed invariant under this transformation to all orders in v/c.
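The two-step construction can be checked against the modern Lorentz transformation. The following sketch reconstructs the steps from the description above; the final rescaling by ε^(1/2) and ε^(−1/2) is an assumption chosen to be consistent with the stated result, and the function names are hypothetical.

```python
import math

def lorentz(t, x, v, c=1.0):
    """Modern Lorentz transformation of (t, x) for relative speed v."""
    g = 1.0 / math.sqrt(1.0 - (v / c) ** 2)
    return g * (t - v * x / c**2), g * (x - v * t)

def larmor_two_step(t, x, v, c=1.0):
    """Galilean step, modified local time, then a final rescaling."""
    eps = 1.0 / (1.0 - (v / c) ** 2)      # Larmor's epsilon, equal to gamma squared
    x1, t1 = x - v * t, t                 # step 1: Galilean transformation
    t2 = t1 - eps * v * x1 / c**2         # step 2: modified local time t''
    # assumed final rescaling: lengths by sqrt(eps), times by 1/sqrt(eps)
    return t2 / math.sqrt(eps), x1 * math.sqrt(eps)

t, x, v = 3.0, 1.2, 0.45
t_modern, x_modern = lorentz(t, x, v)
t_larmor, x_larmor = larmor_two_step(t, x, v)
```

Under these assumptions both routes give the same transformed coordinates exactly, not merely to second order in v/c.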

Larmor gave credit to Lorentz in two papers published in 1904, in which he used the term "Lorentz transformation" for Lorentz's first order transformations of coordinates and field configurations:

p. 583: [..] Lorentz's transformation for passing from the field of activity of a stationary electrodynamic material system to that of one moving with uniform velocity of translation through the aether.
p. 585: [..] the Lorentz transformation has shown us what is not so immediately obvious [..][R 19]
p. 622: [..] the transformation first developed by Lorentz: namely, each point in space is to have its own origin from which time is measured, its "local time" in Lorentz's phraseology, and then the values of the electric and magnetic vectors [..] at all points in the aether between the molecules in the system at rest, are the same as those of the vectors [..] at the corresponding points in the convected system at the same local times.[R 20]

Lorentz (1899, 1904)

Lorentz also extended his theorem of corresponding states in 1899. First he wrote a transformation equivalent to the one from 1892 (again, x* must be replaced by x-vt):[R 21]

Then he introduced a factor ε, which he said he had no means of determining, and modified his transformation as follows (where the above value of t′ has to be inserted):[R 22]

This is equivalent to the complete Lorentz transformation (4b) when solved for x″ and t″ and with ε=1. Like Larmor, Lorentz noticed in 1899[R 23] also some sort of time dilation effect in relation to the frequency of oscillating electrons "that in S the time of vibrations be times as great as in S0", where S0 is the aether frame.[78]

In 1904 he rewrote the equations in the following form by setting l=1/ε (again, x* must be replaced by x-vt):[R 24]

Under the assumption that l=1 when v=0, he demonstrated that l=1 must be the case at all velocities, therefore length contraction can only arise in the line of motion. So by setting the factor l to unity, Lorentz's transformations now assumed the same form as Larmor's and are now completed. Unlike Larmor, who restricted himself to showing the covariance of Maxwell's equations to second order, Lorentz tried to widen its covariance to all orders in v/c. He also derived the correct formulas for the velocity dependence of electromagnetic mass, and concluded that the transformation formulas must apply to all forces of nature, not only electrical ones.[R 25] However, he didn't achieve full covariance of the transformation equations for charge density and velocity.[79] When the 1904 paper was reprinted in 1913, Lorentz therefore added the following remark:[80]

One will notice that in this work the transformation equations of Einstein’s Relativity Theory have not quite been attained. [..] On this circumstance depends the clumsiness of many of the further considerations in this work.

Lorentz's 1904 transformation was cited and used by Alfred Bucherer in July 1904:[R 26]

or by Wilhelm Wien in July 1904:[R 27]

or by Emil Cohn in November 1904 (setting the speed of light to unity):[R 28]

or by Richard Gans in February 1905:[R 29]

Poincaré (1900, 1905)

Local time

Neither Lorentz nor Larmor gave a clear physical interpretation of the origin of local time. However, Henri Poincaré in 1900 commented on the origin of Lorentz's "wonderful invention" of local time.[81] He remarked that it arose when clocks in a moving reference frame are synchronised by exchanging signals which are assumed to travel with the same speed in both directions, which leads to what is nowadays called relativity of simultaneity, although Poincaré's calculation does not involve length contraction or time dilation.[R 30] In order to synchronise the clocks here on Earth (the x*, t* frame), a light signal from one clock (at the origin) is sent to another (at x*), and is sent back. It is supposed that the Earth is moving with speed v in the x-direction (= x*-direction) in some rest system (x, t) (i.e. the luminiferous aether system for Lorentz and Larmor). The time of flight outwards is

$$\delta t_a = \frac{x^*}{c-v}$$

and the time of flight back is

$$\delta t_b = \frac{x^*}{c+v}.$$

The elapsed time on the clock when the signal is returned is $\delta t_a+\delta t_b$, and the time $t^*=(\delta t_a+\delta t_b)/2$ is ascribed to the moment when the light signal reached the distant clock. In the rest frame the time $t=\delta t_a$ is ascribed to that same instant. Some algebra gives the relation between the different time coordinates ascribed to the moment of reflection. Thus

$$t^* = t - \frac{\gamma^2 v x^*}{c^2}$$

identical to Lorentz (1892). By dropping the factor $\gamma^2$ under the assumption that $v^2/c^2 \ll 1$, Poincaré gave the result $t^* = t - vx^*/c^2$, which is the form used by Lorentz in 1895.
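The synchronisation argument above can be checked numerically. The following is a minimal Python sketch, not Poincaré's own calculation; the function name `poincare_local_time` and the sample values are illustrative assumptions, and exact fractions are used so that the comparison involves no rounding.

```python
from fractions import Fraction


def poincare_local_time(x_star, v, c):
    """Return (t, t_star, predicted) for the light-signal synchronisation."""
    dt_out = x_star / (c - v)         # outward flight time of the signal
    dt_back = x_star / (c + v)        # return flight time of the signal
    t = dt_out                        # rest-frame time of the reflection
    t_star = (dt_out + dt_back) / 2   # time ascribed by the moving clocks
    gamma_sq = 1 / (1 - (v / c) ** 2)
    predicted = t - gamma_sq * v * x_star / c ** 2
    return t, t_star, predicted


# Hypothetical sample values: x* = 3, v = 1/2, c = 1.
t, t_star, predicted = poincare_local_time(Fraction(3), Fraction(1, 2), Fraction(1))
print(t, t_star, predicted)
```

With these sample values the rest-frame time is 6, while the clock-based value t* and the expression t − γ²vx*/c² both equal 4.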

Similar physical interpretations of local time were later given by Emil Cohn (1904)[R 31] and Max Abraham (1905).[R 32]

Lorentz transformation

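On June 5, 1905 (published June 9) Poincaré formulated transformation equations which are algebraically equivalent to those of Larmor and Lorentz and gave them the modern form (4b):[R 33]

$$x' = kl(x+\varepsilon t),\quad y' = ly,\quad z' = lz,\quad t' = kl(t+\varepsilon x),\quad k = \frac{1}{\sqrt{1-\varepsilon^2}}.$$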

Apparently Poincaré was unaware of Larmor's contributions, because he only mentioned Lorentz and therefore used for the first time the name "Lorentz transformation".[82][83] Poincaré set the speed of light to unity, pointed out the group characteristics of the transformation by setting l=1, and modified/corrected Lorentz's derivation of the equations of electrodynamics in some details in order to fully satisfy the principle of relativity, i.e. making them fully Lorentz covariant.[84]

In July 1905 (published in January 1906)[R 34] Poincaré showed in detail how the transformations and electrodynamic equations are a consequence of the principle of least action; he demonstrated in more detail the group characteristics of the transformation, which he called the Lorentz group, and he showed that the combination $x^2+y^2+z^2-t^2$ is invariant. He noticed that the Lorentz transformation is merely a rotation in four-dimensional space about the origin by introducing $ct\sqrt{-1}$ as a fourth imaginary coordinate, and he used an early form of four-vectors. He also formulated the velocity addition formula (4d), which he had already derived in unpublished letters to Lorentz from May 1905:[R 35]

.
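The invariance and the group property described above can be illustrated with a short Python sketch. It assumes units with c = 1 and a boost written as x′ = k(x + εt), t′ = k(t + εx) with l = 1; the function names `boost` and `interval` and the sample numbers are illustrative assumptions, not Poincaré's notation.

```python
import math


def boost(eps):
    """Return a function applying x' = k(x + eps*t), t' = k(t + eps*x), c = 1."""
    k = 1.0 / math.sqrt(1.0 - eps * eps)

    def apply(event):
        x, y, z, t = event
        return (k * (x + eps * t), y, z, k * (t + eps * x))

    return apply


def interval(event):
    """The combination x^2 + y^2 + z^2 - t^2."""
    x, y, z, t = event
    return x * x + y * y + z * z - t * t


event = (1.0, 2.0, 3.0, 4.0)
e1, e2 = 0.3, 0.5

# Two successive boosts ...
composed = boost(e2)(boost(e1)(event))
# ... equal one boost whose parameter follows the velocity addition rule.
e12 = (e1 + e2) / (1.0 + e1 * e2)
direct = boost(e12)(event)

print(interval(event), interval(composed))
print(composed, direct)
```

The two printed intervals agree up to rounding, and the composed and direct results coincide, which is the sense in which the boosts form a group.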

Einstein (1905) – Special relativity

On June 30, 1905 (published September 1905) Einstein published what is now called special relativity and gave a new derivation of the transformation, which was based only on the principle of relativity and the principle of the constancy of the speed of light. While Lorentz considered "local time" to be a mathematical stipulation device for explaining the Michelson–Morley experiment, Einstein showed that the coordinates given by the Lorentz transformation were in fact the inertial coordinates of relatively moving frames of reference. For quantities of first order in v/c this was also done by Poincaré in 1900, while Einstein derived the complete transformation by this method. Unlike Lorentz and Poincaré, who still distinguished between real time in the aether and apparent time for moving observers, Einstein showed that the transformations concern the nature of space and time.[85][86][87]

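The notation for this transformation is equivalent to Poincaré's of 1905 and (4b), except that Einstein did not set the speed of light to unity:[R 36]

$$\tau = \beta\left(t - \frac{v}{V^2}x\right),\quad \xi = \beta(x - vt),\quad \eta = y,\quad \zeta = z,\quad \beta = \frac{1}{\sqrt{1-\left(\frac{v}{V}\right)^2}}$$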

Einstein also defined the velocity addition formula (4d, 4e):[R 37]

and the light aberration formula (4f):[R 38]
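As an illustration of how velocity addition and aberration fit together, the sketch below applies the modern textbook velocity addition to a light ray and compares the resulting direction with the textbook aberration formula cos φ′ = (cos φ − v/c)/(1 − (v/c) cos φ). The sign conventions, the function names `add_velocity` and `aberration`, and the sample values are assumptions made for this example, not Einstein's original notation.

```python
import math


def add_velocity(ux, uy, v, c=1.0):
    """Velocity components seen from a frame moving with speed v along x."""
    gamma = 1.0 / math.sqrt(1.0 - (v / c) ** 2)
    denom = 1.0 - v * ux / c ** 2
    return (ux - v) / denom, uy / (gamma * denom)


def aberration(phi, v, c=1.0):
    """Direction angle of a light ray in the moving frame."""
    beta = v / c
    return math.acos((math.cos(phi) - beta) / (1.0 - beta * math.cos(phi)))


# Hypothetical sample values: ray at 60 degrees, frame speed 0.6 c.
phi, v = math.radians(60.0), 0.6
ux, uy = add_velocity(math.cos(phi), math.sin(phi), v)

speed = math.hypot(ux, uy)        # stays equal to c = 1
phi_prime = math.atan2(uy, ux)    # direction from the velocity addition

print(speed, math.degrees(phi_prime), math.degrees(aberration(phi, v)))
```

The speed of the ray remains c, and both routes give the same aberrated angle.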

Minkowski (1907–1908) – Spacetime

The work on the principle of relativity by Lorentz, Einstein, Planck, together with Poincaré's four-dimensional approach, were further elaborated and combined with the hyperboloid model by Hermann Minkowski in 1907 and 1908.[R 39][R 40] Minkowski particularly reformulated electrodynamics in a four-dimensional way (Minkowski spacetime).[88] For instance, he wrote x, y, z, it in the form x1, x2, x3, x4. By defining ψ as the angle of rotation around the z-axis, the Lorentz transformation assumes a form (with c=1) in agreement with (2b):[R 41]

Even though Minkowski used the imaginary number iψ, he for once[R 41] directly used the hyperbolic tangent (tangens hyperbolicus) in the equation for velocity

$$-i\tan i\psi = \frac{e^{\psi}-e^{-\psi}}{e^{\psi}+e^{-\psi}} = q$$ with $\psi = \frac{1}{2}\ln\frac{1+q}{1-q}$.

Minkowski's expression can also be written as ψ=atanh(q) and was later called rapidity. He also wrote the Lorentz transformation in matrix form equivalent to (2a) (n=3):[R 42]

As a graphical representation of the Lorentz transformation he introduced the Minkowski diagram, which became a standard tool in textbooks and research articles on relativity:[R 43]

Original spacetime diagram by Minkowski in 1908.
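The statement that a Lorentz transformation is a rotation through an imaginary angle can be checked with complex arithmetic. The sketch below assumes c = 1, takes the pair (x, it) as the rotated coordinates, and fixes one particular sense of rotation; these choices and the function name `rotate_imaginary` are assumptions for illustration rather than Minkowski's exact presentation.

```python
import cmath
import math


def rotate_imaginary(x, t, psi):
    """Rotate (x1, x4) = (x, i*t) by the imaginary angle i*psi; return real (x', t')."""
    x1, x4 = complex(x), 1j * t
    angle = 1j * psi
    cos_a, sin_a = cmath.cos(angle), cmath.sin(angle)  # cosh(psi), i*sinh(psi)
    x1_new = cos_a * x1 + sin_a * x4
    x4_new = -sin_a * x1 + cos_a * x4
    return x1_new.real, (x4_new / 1j).real


psi = 0.7
x, t = 2.0, 3.0
x_rot, t_rot = rotate_imaginary(x, t, psi)

# The same result from a boost with velocity q = tanh(psi).
q = math.tanh(psi)
gamma = 1.0 / math.sqrt(1.0 - q * q)
x_boost, t_boost = gamma * (x - q * t), gamma * (t - q * x)

print((x_rot, t_rot), (x_boost, t_boost), math.atanh(q))
```

The rotated and boosted coordinates agree, and atanh(q) returns the angle ψ, in line with the rapidity relation quoted above.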

Sommerfeld (1909) – Spherical trigonometry

Using an imaginary rapidity as Minkowski did, Arnold Sommerfeld (1909) formulated a transformation equivalent to Lorentz boost (3b), and the relativistic velocity addition (4d), in terms of trigonometric functions and the spherical law of cosines:[R 44]

Bateman and Cunningham (1909–1910) – Spherical wave transformation

In line with Lie's (1871) research on the relation between sphere transformations with an imaginary radius coordinate and 4D conformal transformations, Bateman and Cunningham (1909–1910) pointed out that by setting u=ict as the imaginary fourth coordinate one can produce spacetime conformal transformations. Not only the quadratic form $\lambda\left(\delta x^2+\delta y^2+\delta z^2+\delta u^2\right)$, but also Maxwell's equations are covariant with respect to these transformations, irrespective of the choice of λ. These variants of conformal or Lie sphere transformations were called spherical wave transformations by Bateman.[R 45][R 46] However, this covariance is restricted to certain areas such as electrodynamics, whereas the totality of natural laws in inertial frames is covariant under the Lorentz group.[R 47] In particular, by setting λ=1 the 6-parameter Lorentz group SO(1,3), and with translations the 10-parameter Poincaré group, can be seen as a subgroup of the 15-parameter spacetime conformal group Con(1,3).

Bateman (1910/12)[89] also alluded to the identity between the Laguerre inversion and the Lorentz transformations. In general, the isomorphism between the Laguerre group and the Lorentz group was pointed out by Élie Cartan (1912, 1915/55),[24][R 48] Henri Poincaré (1912/21)[R 49] and others.

Herglotz (1909/10) – Möbius transformation

Following Klein (1889–1897) and Fricke & Klein (1897) concerning the Cayley absolute, hyperbolic motion and its transformation, Gustav Herglotz (1909/10) classified the one-parameter Lorentz transformations as loxodromic, hyperbolic, parabolic and elliptic. The general case (on the left) equivalent to Lorentz transformation (6a) and the hyperbolic case (on the right) equivalent to Lorentz transformation (3d) or squeeze mapping (9d) are as follows:[R 50]
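The four classes named above correspond to the standard classification of Möbius transformations z → (az + b)/(cz + d) by the squared trace of the matrix once it is normalised to determinant 1. The following sketch uses that textbook criterion; it is not taken from Herglotz's paper, and the function name `classify_mobius` and the tolerance are assumptions.

```python
import cmath
import math


def classify_mobius(a, b, c, d, tol=1e-12):
    """Classify z -> (a z + b) / (c z + d) by the normalised squared trace."""
    det = complex(a * d - b * c)
    if abs(det) < tol:
        raise ValueError("singular matrix")
    if abs(b) < tol and abs(c) < tol and abs(a - d) < tol:
        return "identity"
    tr_sq = complex(a + d) ** 2 / det  # (trace)^2 after scaling to det = 1
    if abs(tr_sq.imag) > tol or tr_sq.real < -tol:
        return "loxodromic"
    if abs(tr_sq.real - 4.0) <= tol:
        return "parabolic"
    return "elliptic" if tr_sq.real < 4.0 else "hyperbolic"


lam = 2.0 * cmath.exp(0.3j)
examples = {
    "squeeze": classify_mobius(2.0, 0.0, 0.0, 0.5),
    "shear": classify_mobius(1, 1, 0, 1),
    "rotation": classify_mobius(math.cos(0.3), -math.sin(0.3),
                                math.sin(0.3), math.cos(0.3)),
    "screw": classify_mobius(lam, 0.0, 0.0, 1.0 / lam),
}
print(examples)
```

Under the isomorphism with the Lorentz group, the hyperbolic case corresponds to a pure boost (a squeeze mapping), the elliptic case to a spatial rotation, the parabolic case to a null rotation, and the loxodromic case to a boost combined with a rotation about the same axis.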

Varićak (1910) – Hyperbolic functions

Following Sommerfeld (1909), Vladimir Varićak used hyperbolic functions in several papers starting from 1910, representing the equations of special relativity on the basis of hyperbolic geometry in terms of Weierstrass coordinates. For instance, by setting l=ct and v/c=tanh(u) with u as rapidity, he wrote the Lorentz transformation in agreement with (3b):[R 51]

and showed the relation of rapidity to the Gudermannian function and the angle of parallelism:[R 51]

He also related the velocity addition to the hyperbolic law of cosines:[R 52]

Subsequently, other authors such as E. T. Whittaker (1910) or Alfred Robb (1911, who coined the name rapidity) used similar expressions, which are still used in modern textbooks.[10]
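The relations between rapidity, the Gudermannian function, the angle of parallelism and the hyperbolic law of cosines can be checked numerically. The sketch below uses the standard definitions gd(u) = 2 arctan(tanh(u/2)) and Π(u) = 2 arctan(e^(−u)), with c = 1; the function names and the sign convention for the angle α between the two velocities are assumptions, not Varićak's notation.

```python
import math


def rapidity(beta):
    """Rapidity u with beta = v/c = tanh(u)."""
    return math.atanh(beta)


def gudermannian(u):
    return 2.0 * math.atan(math.tanh(u / 2.0))


def angle_of_parallelism(u):
    return 2.0 * math.atan(math.exp(-u))


def compose_rapidity(u1, u2, alpha):
    """Resulting rapidity from the hyperbolic law of cosines.

    alpha is the angle between the two velocities (alpha = 0: same direction).
    """
    return math.acosh(math.cosh(u1) * math.cosh(u2)
                      + math.sinh(u1) * math.sinh(u2) * math.cos(alpha))


b1, b2 = 0.6, 0.5
u1, u2 = rapidity(b1), rapidity(b2)

print(math.sin(gudermannian(u1)))                       # recovers v/c = 0.6
print(gudermannian(u1) + angle_of_parallelism(u1))      # equals pi/2
print(math.tanh(compose_rapidity(u1, u2, 0.0)))         # collinear addition
print(math.cosh(compose_rapidity(u1, u2, math.pi / 2)))  # product of Lorentz factors
```

For collinear velocities the law of cosines reduces to adding rapidities, which reproduces (b1 + b2)/(1 + b1·b2); for perpendicular velocities the Lorentz factors simply multiply.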

Ignatowski (1910)

While earlier derivations and formulations of the Lorentz transformation relied from the outset on optics, electrodynamics, or the invariance of the speed of light, Vladimir Ignatowski (1910) showed that it is possible to use the principle of relativity (and related group theoretical principles) alone, in order to derive the following transformation between two inertial frames:[R 53][R 54]

The variable n can be seen as a space-time constant whose value has to be determined by experiment or taken from a known physical law such as electrodynamics. For that purpose, Ignatowski used the above-mentioned Heaviside ellipsoid representing a contraction of electrostatic fields by x/γ in the direction of motion. It can be seen that this is only consistent with Ignatowski's transformation when $n=1/c^2$, resulting in p=γ and the Lorentz transformation (4b). With n=0, no length changes arise and the Galilean transformation follows. Ignatowski's method was further developed and improved by Philipp Frank and Hermann Rothe (1911, 1912),[R 55] with various authors developing similar methods in subsequent years.[90]
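The role of the constant n can be made concrete with a one-parameter family of transformations. The sketch below assumes the commonly quoted form x′ = p(x − qt), t′ = p(t − nqx) with p = 1/√(1 − nq²), where q is the relative velocity; this form, the function name `ignatowski` and the sample values are assumptions for illustration and are not copied from Ignatowski's paper.

```python
import math


def ignatowski(x, t, q, n):
    """Transform (x, t) for relative velocity q and space-time constant n."""
    p = 1.0 / math.sqrt(1.0 - n * q * q)
    return p * (x - q * t), p * (t - n * q * x)


x, t, q = 2.0, 3.0, 0.6

# n = 0 reproduces the Galilean transformation: x' = x - q t, t' = t.
galilean = ignatowski(x, t, q, 0.0)

# n = 1/c^2 (here c = 1) reproduces the Lorentz transformation with p = gamma.
lorentz = ignatowski(x, t, q, 1.0)

# For any n, the combination n*x^2 - t^2 is left unchanged.
n = 0.25
xp, tp = ignatowski(x, t, q, n)

print(galilean, lorentz)
print(n * x * x - t * t, n * xp * xp - tp * tp)
```

With c = 1 and q = 0.6 the Lorentz case gives p = γ = 1.25, while n = 0 leaves the time coordinate untouched.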

Noether (1910), Klein (1910) – Quaternions

Felix Klein (1908) described Cayley's (1854) 4D quaternion multiplications as "Drehstreckungen" (orthogonal substitutions in terms of rotations leaving invariant a quadratic form up to a factor), and pointed out that the modern principle of relativity as provided by Minkowski is essentially only the consistent application of such Drehstreckungen, even though he did not provide details.[R 56]

In an appendix to Klein's and Sommerfeld's "Theory of the top" (1910), Fritz Noether showed how to formulate hyperbolic rotations using biquaternions with $\omega=\sqrt{-1}$, which he also related to the speed of light by setting $\omega^2=-c^2$. He concluded that this is the principal ingredient for a rational representation of the group of Lorentz transformations equivalent to (7a):[R 57]

Besides citing quaternion related standard works such as Cayley (1854), Noether referred to the entries in Klein's encyclopedia by Eduard Study (1899) and the French version by Élie Cartan (1908).[91] Cartan's version contains a description of Study's dual numbers, Clifford's biquaternions (including the choice for hyperbolic geometry), and Clifford algebra, with references to Stephanos (1883), Buchheim (1884/85), Vahlen (1901/02) and others.

Citing Noether, Klein himself published in August 1910 the following quaternion substitutions forming the group of Lorentz transformations:[R 58]

or in March 1911[R 59]

Conway (1911), Silberstein (1911) – Quaternions

Arthur W. Conway in February 1911 explicitly formulated quaternionic Lorentz transformations of various electromagnetic quantities in terms of velocity λ:[R 60]

Ludwik Silberstein also formulated the Lorentz transformation in terms of velocity v, in November 1911[R 61] as well as in 1914:[92]

Silberstein cites Cayley (1854, 1855) and Study's encyclopedia entry (in the extended French version of Cartan in 1908), as well as the appendix of Klein's and Sommerfeld's book.

Herglotz (1911), Silberstein (1911) – Vector transformation

Gustav Herglotz (1911)[R 62] showed how to formulate the transformation equivalent to (4c) in order to allow for arbitrary velocities and coordinates v=(vx, vy, vz) and r=(x, y, z):

This was simplified using vector notation by Ludwik Silberstein (1911 on the left, 1914 on the right):[R 63]

Equivalent formulas were also given by Wolfgang Pauli (1921),[93] with Erwin Madelung (1922) providing the matrix form[94]
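For orientation, the vector form of a boost without rotation can be sketched in modern textbook notation as r′ = r + (γ − 1)(r·v)v/v² − γvt and t′ = γ(t − r·v/c²). The code below implements this modern form, which is an assumption here rather than a transcription of the notation of Herglotz, Silberstein, Pauli or Madelung; the function name `boost_vector` and the sample values are hypothetical.

```python
import math


def boost_vector(r, t, v, c=1.0):
    """Boost the event (r, t) by an arbitrary velocity vector v, without rotation."""
    v2 = sum(vi * vi for vi in v)
    if v2 == 0.0:
        return tuple(r), t
    gamma = 1.0 / math.sqrt(1.0 - v2 / c ** 2)
    rv = sum(ri * vi for ri, vi in zip(r, v))
    coeff = (gamma - 1.0) * rv / v2 - gamma * t
    r_new = tuple(ri + coeff * vi for ri, vi in zip(r, v))
    t_new = gamma * (t - rv / c ** 2)
    return r_new, t_new


def interval(r, t, c=1.0):
    return c * c * t * t - sum(ri * ri for ri in r)


r, t = (1.0, 2.0, 3.0), 4.0

# Velocity along x: reduces to the ordinary boost.
r_x, t_x = boost_vector(r, t, (0.6, 0.0, 0.0))

# Velocity in an arbitrary direction: the interval is still preserved.
r_g, t_g = boost_vector(r, t, (0.3, -0.2, 0.5))

print(r_x, t_x)
print(interval(r, t), interval(r_g, t_g))
```

For v along the x-axis the result coincides with the one-dimensional boost, with the coordinates perpendicular to v unchanged.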

These formulas were called "general Lorentz transformation without rotation" by Christian Møller (1952),[95] who in addition gave an even more general Lorentz transformation in which the Cartesian axes have different orientations, using a rotation operator . In this case, v′=(v′x, v′y, v′z) is not equal to -v=(-vx, -vy, -vz), but the relation holds instead, with the result

Borel (1913–14) – Cayley–Hermite parameter

Borel (1913) started by demonstrating Euclidean motions using the Euler–Rodrigues parameters in three dimensions, and Cayley's (1846) parameter in four dimensions. Then he demonstrated the connection to indefinite quadratic forms expressing hyperbolic motions and Lorentz transformations. In three dimensions equivalent to (5b):[R 64]

In four dimensions equivalent to (5c):[R 65]

Gruner (1921) – Trigonometric Lorentz boosts

In order to simplify the graphical representation of Minkowski space, Paul Gruner (1921) (with the aid of Josef Sauter) developed what is now called Loedel diagrams, using the following relations:[R 66]

This is equivalent to Lorentz transformation (8a) by the identity

In another paper Gruner used the alternative relations:[R 67]

This is equivalent to Lorentz boost (8b) by the identity .

Euler's gap

In pursuing the history in years before Lorentz enunciated his expressions, one looks to the essence of the concept. In mathematical terms, Lorentz transformations are squeeze mappings, the linear transformations that turn a square into a rectangle of the same area. Before Euler, the squeezing was studied as quadrature of the hyperbola and led to the hyperbolic logarithm. In 1748 Euler issued his precalculus textbook where the number e is exploited for trigonometry in the unit circle. The first volume of Introduction to the Analysis of the Infinite had no diagrams, allowing teachers and students to draw their own illustrations.

There is a gap in Euler's text where Lorentz transformations arise. A feature of the natural logarithm is its interpretation as area in hyperbolic sectors. In relativity the classical concept of velocity is replaced with rapidity, a hyperbolic angle concept built on hyperbolic sectors. A Lorentz transformation is a hyperbolic rotation which preserves differences of rapidity, just as the circular sector area is preserved with a circular rotation. Euler's gap is the lack of hyperbolic angle and hyperbolic functions, later developed by Johann H. Lambert. Euler described some transcendental functions: exponentiation and circular functions. He used the exponential series

$$e^x = \sum_{n=0}^{\infty} \frac{x^n}{n!}.$$

With the imaginary unit $i^2=-1$, and splitting the series into even and odd terms, he obtained

$$e^{ix} = \sum_{n=0}^{\infty} \frac{(ix)^{2n}}{(2n)!} + \sum_{n=0}^{\infty} \frac{(ix)^{2n+1}}{(2n+1)!} = \cos x + i\sin x.$$

This development misses the alternative:

$$e^x = \cosh x + \sinh x$$ (even and odd terms), and

$$\cosh^2 x - \sinh^2 x = 1,$$ which parametrizes the unit hyperbola.

Here Euler could have noted split-complex numbers along with complex numbers.
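The "missing alternative" can be made concrete in a few lines. The sketch below splits the exponential series into even and odd terms and then applies a hyperbolic rotation, checking that it acts as a squeeze mapping and adds hyperbolic angles; the function names, the number of series terms and the sample values are assumptions for illustration.

```python
import math


def exp_even_odd(x, terms=30):
    """Even and odd parts of the exponential series for e^x."""
    even = sum(x ** (2 * n) / math.factorial(2 * n) for n in range(terms))
    odd = sum(x ** (2 * n + 1) / math.factorial(2 * n + 1) for n in range(terms))
    return even, odd


def hyperbolic_rotation(x, t, a):
    """Rotate the point (x, t) through the hyperbolic angle a."""
    return (x * math.cosh(a) + t * math.sinh(a),
            x * math.sinh(a) + t * math.cosh(a))


even, odd = exp_even_odd(1.2)
print(even, math.cosh(1.2))          # even terms give cosh
print(odd, math.sinh(1.2))           # odd terms give sinh
print(even * even - odd * odd)       # point on the unit hyperbola

# A point at hyperbolic angle b moves to hyperbolic angle a + b.
a, b = 0.4, 0.9
x2, t2 = hyperbolic_rotation(math.cosh(b), math.sinh(b), a)
print(math.atanh(t2 / x2))

# Squeeze mapping: x + t is stretched by e^a and x - t is shrunk by e^-a.
x, t = 3.0, 1.0
xs, ts = hyperbolic_rotation(x, t, a)
print((xs + ts) / (x + t), (xs - ts) / (x - t))
```

Because one diagonal coordinate is multiplied by e^a and the other by e^(−a), their product, and hence area, is unchanged.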

For physics, one space dimension is insufficient. But to extend split-complex arithmetic to four dimensions leads to hyperbolic quaternions, and opens the door to abstract algebra's hypercomplex numbers. Reviewing the expressions of Lorentz and Einstein, one observes that the Lorentz factor is an algebraic function of velocity. For readers uncomfortable with transcendental functions cosh and sinh, algebraic functions may be more to their liking.

See also

References

Historical mathematical sources

  1. ^ Killing (1885), p. 71
  2. ^ a b Cayley (1884), section 16.
  3. ^ Klein (1896/97), p. 12
  4. ^ Kepler (1609), chapter 60. The editors of Kepler's collected papers remark (p. 482), that Kepler's relations correspond to and and
  5. ^ Euler (1735/40), § 19
  6. ^ Euler (1748a), section VIII
  7. ^ Lagrange (1770/71), section I
  8. ^ Euler (1771), pp. 84–85
  9. ^ Euler (1771), pp. 77, 85–89
  10. ^ Rodrigues (1840), p. 405
  11. ^ Euler (1771), p. 101
  12. ^ Euler (1771), pp. 89–91
  13. ^ Euler (1748b), section 138.
  14. ^ Wessel (1799), § 28.
  15. ^ Riccati (1757), p. 71
  16. ^ Günther (1880/81), pp. 7–13
  17. ^ Lambert (1761/68), pp. 309–318
  18. ^ Lambert (1770), p. 335
  19. ^ Lagrange (1773/75), section 22
  20. ^ Gauss (1798/1801), articles 157–158
  21. ^ Gauss (1798/1801), section 159
  22. ^ Gauss (1798/1801), articles 266–285
  23. ^ Gauss (1798/1801), article 277
  24. ^ Gauss (1800/1863), p. 311
  25. ^ Gauss (1818), pp. 5–10
  26. ^ Gauss (1818), pp. 9–10
  27. ^ Jacobi (1827), p. 235, 239–240
  28. ^ The orthogonal substitution and the imaginary transformation was defined in Jacobi (1832a), pp. 257, 265–267; Transformation system (2) and (3) and coefficients in Jacobi (1832b), pp. 321–325.
  29. ^ Jacobi (1833/34), pp. 7–8, 34–35, 41; Some misprints were corrected in Jacobi's collected papers, vol 3, pp. 229–230.
  30. ^ Jacobi (1833/34), p. 37. Some misprints were corrected in Jacobi's collected papers, vol 3, pp. 232–233.
  31. ^ Cauchy (1829), eq. 22, 98, 99, 101; Some misprints were corrected in Œuvres complètes, série 2, tome 9, pp. 174–195.
  32. ^ Lebesgue (1837), pp. 338–341
  33. ^ Lebesgue (1837), pp. 353–354
  34. ^ Lebesgue (1837), pp. 353–355
  35. ^ Hamilton (1844/45), p. 13
  36. ^ Hamilton (1844/45), p. 14
  37. ^ Cayley (1846)
  38. ^ Cayley (1855a), p. 288
  39. ^ Cayley (1858), p. 39
  40. ^ Cayley (1855b), p. 210
  41. ^ Cayley (1855b), p. 211
  42. ^ Cayley (1855b), pp. 212–213
  43. ^ Cayley (1854), p. 135
  44. ^ a b c Fricke & Klein (1897), §12–13
  45. ^ Cayley (1879), p. 238f.
  46. ^ Helmholtz (1866/67), p. 513
  47. ^ Cayley (1845), p. 142
  48. ^ Cayley (1848), p. 196
  49. ^ Cayley (1854), p. 211
  50. ^ Cayley (1855b), p. 312
  51. ^ Cayley (1859), sections 209–229
  52. ^ Cockle (1848), p. 437
  53. ^ Cockle (1848), p. 438
  54. ^ Hermite (1853/54a), p. 307ff.
  55. ^ Hermite (1854b), p. 64
  56. ^ Frobenius (1877)
  57. ^ Hermite (1854b), pp. 64–65
  58. ^ Bour (1856), pp. 61; 64–65
  59. ^ Somov (1863), pp. 12–14; p. 18 for differentials.
  60. ^ Beltrami (1868a), pp. 287–288; Note I; Note II
  61. ^ Beltrami (1868b), pp. 232, 240–241, 253–254
  62. ^ Bachmann (1869), p. 303
  63. ^ Klein (1871), pp. 601–602
  64. ^ Klein (1871), p. 618
  65. ^ Klein (1873), pp. 127–128
  66. ^ Klein (1872), 6
  67. ^ Wedekind (1875), 1
  68. ^ Klein (1875), §1–2
  69. ^ Klein (1878), 8.
  70. ^ Klein (1882), p. 173.
  71. ^ Klein (1884), Part I, Ch. I, §1–2; Part II, Ch. II, 10
  72. ^ Klein (1893a), p. 109ff; pp. 138–140; pp. 249–250
  73. ^ Klein (1893b); general surface: pp. 61–66, 116–119, hyperbolic space: pp. 82, 86, 143–144
  74. ^ Klein (1896/97), pp. 13–14
  75. ^ Klein (1871/72), p. 268
  76. ^ Darboux (1872/73), p. 137
  77. ^ Lie (1871), p. 208
  78. ^ Pockels (1891), pp. 197–206
  79. ^ Klein (1893c), pp. 200ff (pentaspherical), pp. 373ff (tetracyclical)
  80. ^ Bôcher (1894), pp. 30–34, 40–43
  81. ^ Liouville (1847)
  82. ^ Euler (1777), p. 140
  83. ^ Lie (1871), pp. 199–209
  84. ^ Lie (1871a), pp. 199–209
  85. ^ Lie (1871/72), p. 186
  86. ^ Lie (1879/80), Collected papers, vol. 3, p. 389
  87. ^ Lie (1879/81), Collected papers, vol. 3, p. 393
  88. ^ Lie (1880/81), Collected papers, vol. 3, pp. 477–478
  89. ^ Lie (1883/84), Collected papers, vol. 3, p. 556
  90. ^ Lie (1885/86), p. 411
  91. ^ Werner (1889), pp. 4, 28
  92. ^ Lie (1890a), p. 295
  93. ^ Lie (1890a), p. 311
  94. ^ Lie (1893), p. 474
  95. ^ Lie (1893), p. 479
  96. ^ Lie (1893), p. 481
  97. ^ Selling (1873), p. 174 and p. 179
  98. ^ Selling (1873), pp. 182–183
  99. ^ Selling (1873/74), p. 227 (see also p. 225 for citation).
  100. ^ Laisant (1874a), pp. 73–76
  101. ^ Laisant (1874b), pp. 134–135
  102. ^ a b Gudermann (1830), §1–3, §18–19
  103. ^ Escherich (1874), p. 508
  104. ^ Escherich (1874), p. 510
  105. ^ Cox (1881), p. 186
  106. ^ Glaisher (1878), p. 30
  107. ^ Killing (1877/78), p. 74; Killing (1880), p. 279
  108. ^ Killing (1880), eq. 25 on p. 283
  109. ^ Killing (1880), p. 283
  110. ^ Killing (1877/78), eq. 25 on p. 283
  111. ^ Killing (1879/80), p. 274
  112. ^ Killing (1885), pp. 18, 28–30, 53
  113. ^ Killing (1884/85), pp. 42–43; Killing (1885), pp. 73–74, 222
  114. ^ Killing (1884/85), pp. 4–5
  115. ^ Killing (1885), Note 9 on p. 260
  116. ^ Killing (1893), see pp. 144, 327–328
  117. ^ Killing (1893), pp. 314–316, 216–217
  118. ^ Killing (1893), p. 331
  119. ^ Killing (1898), p. 133
  120. ^ Killing (1887/88a), pp. 274–275
  121. ^ Killing (1892), p. 177
  122. ^ Killing (1897/98), pp. 255–256
  123. ^ Günther (1880/81), pp. 383–385
  124. ^ Günther (1880/81), p. 405
  125. ^ a b Poincaré (1881a), pp. 133–134
  126. ^ Poincaré (1881), pp. 133–134
  127. ^ Poincaré (1887), p. 206
  128. ^ Poincaré (1881b), p. 333
  129. ^ Poincaré (1883), pp. 49–50; 53–54
  130. ^ Poincaré (1886), p. 735ff.
  131. ^ Salmon (1862), section 212, p. 165
  132. ^ Frischauf (1876), pp. 86–87
  133. ^ Cox (1881), p. 186 for Weierstrass coordinates; (1881/82), pp. 193–194 for Lorentz transformation. On p. 193, the misprinted expression should read
  134. ^ Cox (1881), pp. 199, 206–207
  135. ^ Cox (1881/82), p. 194
  136. ^ Cox (1882/83a), pp. 85–86
  137. ^ Cox (1882/83a), p. 88
  138. ^ Cox (1882/83b), p. 195
  139. ^ Cox (1882/83a), p. 97
  140. ^ On pp. 104-105 he started using the term v2=-1, on p. 106 he noted that one can simply use instead of v, and on p. 112 he adopted Clifford's notation by setting ω2=-1.
  141. ^ Cox (1882/83a), pp. 108–109
  142. ^ Hill (1882), pp. 323–325
  143. ^ Ribaucour (1870)
  144. ^ Laguerre (1882), pp. 550–551.
  145. ^ Picard (1882), pp. 307–308 first transformation system; pp. 315–317 second transformation system
  146. ^ Picard (1884a), p. 13
  147. ^ Picard (1884b), p. 416
  148. ^ Picard (1884c), pp. 123–124; 163
  149. ^ Stephanos (1883), p. 590ff
  150. ^ Stephanos (1883), p. 592
  151. ^ Buchheim (1885), p. 309
  152. ^ Darboux (1883), p. 849
  153. ^ Darboux (1891/94), pp. 381–382
  154. ^ Darboux (1887)
  155. ^ Callandreau (1885), pp. A.7; A.12
  156. ^ Lipschitz (1886), pp. 90–92
  157. ^ Lipschitz (1886), pp. 75–79
  158. ^ Lipschitz (1886), pp. 134–138
  159. ^ Lipschitz (1886), p. 76; p. 137
  160. ^ Lipschitz (1886), pp. 145–147
  161. ^ Schur (1885/86), p. 167
  162. ^ Schur (1900/02), p. 290; (1909), p. 83
  163. ^ Bianchi (1886), eq. 1 can be found on p. 226, eq. (2) on p. 240, eq. (3) on pp. 240–241, and for eq. (4) see the footnote on p. 240.
  164. ^ Bianchi (1894), pp. 433–434
  165. ^ Bianchi (1888), pp. 547; 562–563 (especially footnote on p. 563); 571–572
  166. ^ a b Bianchi (1893), § 3
  167. ^ Lindemann & Clebsch (1890/91), pp. 477–478, 524
  168. ^ Lindemann & Clebsch (1890/91), pp. 361–362
  169. ^ Lindemann & Clebsch (1890/91), p. 496
  170. ^ Lindemann & Clebsch (1890/91), pp. 477–478
  171. ^ Fricke (1891), §§ 1, 6
  172. ^ Fricke (1893), pp. 706, 710–711
  173. ^ Gérard (1892), pp. 40–41
  174. ^ Gérard (1892), pp. 40–41
  175. ^ Macfarlane (1892), p. 50; Macfarlane (1893), p. 24
  176. ^ Macfarlane (1894b), pp. 16–33
  177. ^ Macfarlane (1900), pp. 172, 175
  178. ^ Woods (1895), pp. 2–3; 10–11; 34–35
  179. ^ Woods (1901/02), p. 98, 104
  180. ^ Woods (1903/05), pp. 45–46; p. 48
  181. ^ Woods (1903/05), p. 55
  182. ^ Woods (1903/05), p. 72
  183. ^ Whitehead (1898), pp. 459–460
  184. ^ Scheffers (1899), p. 158
  185. ^ Hausdorff (1899), p. 165, pp. 181–182
  186. ^ Hausdorff (1899), pp. 183–184
  187. ^ Smith (1900), p. 159
  188. ^ Vahlen (1902), pp. 586–587, 590; (1905), p. 282
  189. ^ Liebmann (1904/05), p. 168; pp. 175–176
  190. ^ Liebmann (1904/05), p. 174
  191. ^ Eisenhart (1905), p. 126

Historical relativity sources

  1. ^ a b Varićak (1912), p. 108
  2. ^ Borel (1914), pp. 39–41
  3. ^ Brill (1925)
  4. ^ Voigt (1887), p. 45
  5. ^ Lorentz (1915/16), p. 197
  6. ^ Lorentz (1915/16), p. 198
  7. ^ Bucherer (1908), p. 762
  8. ^ Heaviside (1888), p. 324
  9. ^ Thomson (1889), p. 12
  10. ^ Searle (1886), p. 333
  11. ^ Lorentz (1892a), p. 141
  12. ^ Lorentz (1892b), p. 141
  13. ^ Lorentz (1895), p. 37
  14. ^ Lorentz (1895), p. 49 for local time and p. 56 for spatial coordinates.
  15. ^ Larmor (1897), p. 229
  16. ^ Larmor (1897/1929), p. 39
  17. ^ Larmor (1900), p. 168
  18. ^ Larmor (1900), p. 174
  19. ^ Larmor (1904a), p. 583, 585
  20. ^ Larmor (1904b), p. 622
  21. ^ Lorentz (1899), p. 429
  22. ^ Lorentz (1899), p. 439
  23. ^ Lorentz (1899), p. 442
  24. ^ Lorentz (1904), p. 812
  25. ^ Lorentz (1904), p. 826
  26. ^ Bucherer (1904), p. 129; Definition of s on p. 32
  27. ^ Wien (1904), p. 394
  28. ^ Cohn (1904a), pp. 1296–1297
  29. ^ Gans (1905), p. 169
  30. ^ Poincaré (1900), pp. 272–273
  31. ^ Cohn (1904b), p. 1408
  32. ^ Abraham (1905), § 42
  33. ^ Poincaré (1905), p. 1505
  34. ^ Poincaré (1905/06), pp. 129ff
  35. ^ Poincaré (1905/06), p. 144
  36. ^ Einstein (1905), p. 902
  37. ^ Einstein (1905), § 5 and § 9
  38. ^ Einstein (1905), § 7
  39. ^ Minkowski (1907/15), pp. 927ff
  40. ^ Minkowski (1907/08), pp. 53ff
  41. ^ a b Minkowski (1907/08), p. 59
  42. ^ Minkowski (1907/08), pp. 65–66, 81–82
  43. ^ Minkowski (1908/09), p. 77
  44. ^ Sommerfeld (1909), p. 826ff.
  45. ^ Bateman (1909/10), pp. 223ff
  46. ^ Cunningham (1909/10), pp. 77ff
  47. ^ Klein (1910)
  48. ^ Cartan (1912), p. 23
  49. ^ Poincaré (1912/21), p. 145
  50. ^ Herglotz (1909/10), pp. 404–408
  51. ^ a b Varićak (1910), p. 93
  52. ^ Varićak (1910), p. 94
  53. ^ Ignatowski (1910), pp. 973–974
  54. ^ Ignatowski (1910/11), p. 13
  55. ^ Frank & Rothe (1911), pp. 825ff; (1912), p. 750ff.
  56. ^ Klein (1908), p. 165
  57. ^ Noether (1910), pp. 939–943
  58. ^ Klein (1910), p. 300
  59. ^ Klein (1911), pp. 602ff.
  60. ^ Conway (1911), p. 8
  61. ^ Silberstein (1911/12), p. 793
  62. ^ Herglotz (1911), p. 497
  63. ^ Silberstein (1911/12), p. 792; (1914), p. 123
  64. ^ Borel (1913/14), p. 39
  65. ^ Borel (1913/14), p. 41
  66. ^ Gruner (1921a)
  67. ^ Gruner (1921b)

Secondary sources

  1. ^ Bôcher (1907), chapter X
  2. ^ Ratcliffe (1994), 3.1 and Theorem 3.1.4 and Exercise 3.1
  3. ^ Naimark (1964), 2 in four dimensions
  4. ^ Musen (1970) pointed out the intimate connection of Hill's scalar development and Minkowski's pseudo-Euclidean 3D space.
  5. ^ Touma et al. (2009) showed the analogy between Gauss and Hill's equations and Lorentz transformations, see eq. 22-29.
  6. ^ Müller (1910), p. 661, in particular footnote 247.
  7. ^ Sommerville (1911), p. 286, section K6.
  8. ^ Synge (1955), p. 129 for n=3
  9. ^ Laue (1921), pp. 79–80 for n=3
  10. ^ a b Rindler (1969), p. 45
  11. ^ Rosenfeld (1988), p. 231
  12. ^ a b Pauli (1921), p. 561
  13. ^ a b Barrett (2006), chapter 4, section 2
  14. ^ Miller (1981), chapter 1
  15. ^ Miller (1981), chapter 4–7
  16. ^ Møller (1952/55), Chapter II, § 18
  17. ^ Pauli (1921), pp. 562; 565–566
  18. ^ a b Plummer (1910), pp. 258–259: After deriving the relativistic expressions for the aberration angles φ' and φ, Plummer remarked on p. 259: Another geometrical representation is obtained by assimilating φ' to the eccentric and φ to the true anomaly in an ellipse whose eccentricity is v/U = sin β.
  19. ^ a b Robinson (1990), chapters 3–4, analyzed the relation between "Kepler's formula" and the "physical velocity addition formula" in special relativity.
  20. ^ Schottenloher (2008), section 2.2
  21. ^ Kastrup (2008), section 2.4.1
  22. ^ Schottenloher (2008), section 2.3
  23. ^ Coolidge (1916), p. 370
  24. ^ a b Cartan & Fano (1915/55), sections 14–15
  25. ^ Hawkins (2013), pp. 210–214
  26. ^ Meyer (1899), p. 329
  27. ^ Klein (1928), § 2B
  28. ^ Lorente (2003), section 3.3
  29. ^ a b Klein (1928), § 2A
  30. ^ a b Synge (1956), ch. IV, 11
  31. ^ Klein (1928), § 3A
  32. ^ Penrose & Rindler (1984), section 2.1
  33. ^ a b Lorente (2003), section 4
  34. ^ Penrose & Rindler (1984), p. 17
  35. ^ Synge (1972), pp. 13, 19, 24
  36. ^ Girard (1984), pp. 28–29
  37. ^ Sobczyk (1995)
  38. ^ Fjelstad (1986)
  39. ^ Cartan & Study (1908), section 36
  40. ^ Rothe (1916), section 16
  41. ^ a b Majerník (1986), 536–538
  42. ^ a b Terng & Uhlenbeck (2000), p. 21
  43. ^ a b Pocheco (2008), section 2.
  44. ^ Bondi (1964), p. 118
  45. ^ Volk (1976), p. 366
  46. ^ Barnett (2004), pp. 22–23
  47. ^ Dickson (1923), p. 210
  48. ^ Taurinus (1826), p. 66; see also p. 272 in the translation by Engel and Stäckel (1899)
  49. ^ Bonola (1912), p. 79
  50. ^ Gray (1979), p. 242
  51. ^ Hawkins (2013), p. 214
  52. ^ Hawkins (2013), p. 212
  53. ^ Hawkins (2013), pp. 219ff
  54. ^ Bachmann (1898), pp. 101–102
  55. ^ Kastrup (2008), p. 22
  56. ^ Kastrup (2008), section 2.1
  57. ^ Kastrup (2008), section 2.3
  58. ^ Bachmann (1923), chapter 16
  59. ^ Sommerville (1911), p. 297
  60. ^ Ratcliffe (1994), § 3.6
  61. ^ a b Reynolds (1993)
  62. ^ Gray (1997)
  63. ^ Dickson (1923), pp. 220–221
  64. ^ Dickson (1923), pp. 280–281
  65. ^ Cartan & Study (1908), p. 460
  66. ^ Rothe (1916), p. 1399
  67. ^ Schur (1900/02), p. 291; (1909), p. 83
  68. ^ Dickson (1923), pp. 221, 232
  69. ^ Miller (1981), 114–115
  70. ^ a b Pais (1982), Chap. 6b
  71. ^ Ricardo Heras, "Voigt's transformations and the beginning of the relativistic revolution", arXiv:1411.2559
  72. ^ Brown (2003)
  73. ^ a b c Miller (1981), 98–99
  74. ^ a b Miller (1982), 1.4 & 1.5
  75. ^ Janssen (1995), 3.1
  76. ^ Darrigol (2000), Chap. 8.5
  77. ^ Macrossan (1986)
  78. ^ Janssen (1995), Chap. 3.3
  79. ^ Miller (1981), Chap. 1.12.2
  80. ^ Janssen (1995), Chap. 3.5.6
  81. ^ Darrigol (2005), Chap. 4
  82. ^ Pais (1982), Chap. 6c
  83. ^ Katzir (2005), 280–288
  84. ^ Miller (1981), Chap. 1.14
  85. ^ Miller (1981), Chap. 6
  86. ^ Pais (1982), Chap. 7
  87. ^ Darrigol (2005), Chap. 6
  88. ^ Walter (1999a)
  89. ^ Bateman (1910/12), pp. 358–359
  90. ^ Baccetti (2011), see references 1–25 therein.
  91. ^ Cartan & Study (1908), sections 35–36
  92. ^ Silberstein (1914), p. 156
  93. ^ Pauli (1921), p. 555
  94. ^ Madelung (1921), p. 207
  95. ^ Møller (1952/55), pp. 41–43