From Wikipedia, the free encyclopedia
  (Redirected from Spacetime continuum)
Jump to: navigation, search

In physics, spacetime is any mathematical model that combines dimensions of space and time into a single continuum. Refining the earlier notion of relative motion from Galilean relativity, which held that that the 3D shape of the universe was distinct from time, Albert Einstein's 1905 special theory of relativity described three dimensions of space and one of time with interwoven kinematics. This related descriptions by observers in relative motion with different inertial reference frames, founded on the premise of an invariant speed of light through empty space.

In 1908, Hermann Minkowski, expanded upon Einstein's work, presenting a geometric interpretation of spacetime, fusing Einstein's model into a unified vector space of spacetime events of a single four-dimensional continuum now known as Minkowski space—an example of what mathematicians call a 4‑dimensional manifold. The geometric character of this model is embedded in the notion of the spacetime interval, which measures distance on the manifold. Einstein further developed the notion of spacetime with his introduction of curved spacetime in the General theory of relativity. More modern work has also given spacetime a discrete character through quantization.

Early philosophy[edit]

During ancient Inca times, which spanned multiple centuries[when?], Incas regarded space and time as a single concept, referred to as pacha (Quechua: pacha, Aymara: pacha).[1][2] The peoples of the Andes maintain a similar understanding.[3]

Early development[edit]

Until the turn of the 20th century, the assumption had been that the 3D geometry of the universe (its description in terms of locations, shapes, distances, and directions) was distinct from time (the measurement of when events occur within the universe).

Relative motion[edit]

Vector formalism[edit]

The development of the notion of a spacetime was advanced by the mathematical formalism of vectors. The notion of vectors allowed for the characterization of spacetime event vectors on a vector space.[further explanation needed]

Spacetime event[edit]

A set of coordinates (a vector), usually labeled x, y, z and t are used to describe points in spacetime. These points are known as spacetime events, and carry additional time information not included by spatial vectors alone. These vectors carry no duration information by themselves, which prevents them from describing rate information.

This rate information can be recovered by considering a progression of spacetime events, however. A progression of events describing a particle follows a curve known as a world line.[4]:105

Reference frames[edit]
Figure 1-5. (a) Galilean diagram of two frames of reference in standard configuration. (b) spacetime diagram of two frames of reference. (c) spacetime diagram showing the path of a reflected light pulse.

In comparing measurements made by relatively moving observers in different reference frames, it is useful to work with the frames in a standard configuration. In Fig. 1‑5a, two Galilean reference frames (i.e. conventional 3-space frames) are displayed in relative motion. Frame S belongs to a first observer O, and frame S' belongs to a second observer O'.

  • The x, y, z axes of frame S are oriented parallel to the respective primed axes of frame S'.
  • Frame S' moves in the x-direction of frame S with a constant velocity v as measured in frame S.
  • The origins of frames S and S' are coincident when time t = 0 for frame S and t' = 0 for frame S'.[4]:107

Fig. 1-5b illustrates a spacetime diagram from the viewpoint of observer O. Since S and S' are in standard configuration, their origins coincide at times t = 0 in frame S and t' = 0 in frame S'. The ct' axis passes through the events in frame S' which have x' = 0. But the points with x' = 0 are moving in the x-direction of frame S with velocity v, so that they are not coincident with the ct axis at any time other than zero. Therefore, the ct' axis is tilted with respect to the ct axis by an angle θ given by

Figure 1-6. The light cone centered on an event divides the rest of spacetime into the future, the past, and elsewhere[clarification needed].

The x' axis is also tilted with respect to the x axis. To determine the angle of this tilt, we recall that the slope of the world line of a light pulse is always ±1. Fig. 1‑5c presents a spacetime diagram from the viewpoint of observer O'. Event P represents the emission of a light pulse at x' = 0, ct' = -a. The pulse is reflected from a mirror situated a distance a from the light source (event Q), and returns to the light source at x' = 0, ct' = a (event R).

The same events P, Q, R are plotted in Fig. 1‑5b in the frame of observer O. The light paths have slopes = 1 and −1 so that ΔPQR forms a right triangle. Since OP = OQ = OR, the angle between x' and x must also be θ.[4]:113–118


The vector formalism allows for formalization of the notion of dimensions, in particular notions of a spatial dimension and a temporal dimension. The most commonly considered spacetimes consist of 3 spatial dimensions, and one temporal dimension.

Newtonian spacetime[edit]

Newton's Principia Mathematica used[when?] an idea of an absolute spacetime. In this view, classical mechanics treats time as a universal quantity of measurement which is uniform throughout space and which is separate from space. Classical mechanics assumes that time has a constant rate of passage that is independent of the state of motion of an observer, or indeed of anything external.[5] Furthermore, it assumes that space is Euclidean, which is to say it assumes a similar structure to that of Euclid's geometry, which include assumptions about parallel and orthogonal lines and the existence of an origin--assumptions that were later challenged.[6]

Newtonian spacetime is not a metric space.

Formal description[edit]

Galilean transformations[edit]
Figure 2-1. Standard configuration of coordinate systems for Galilean transformations.

A basic goal is to be able to compare measurements made by observers in relative motion. Say we have an observer O in frame S who has measured the time and space coordinates of an event, assigning this event three Cartesian coordinates and the time as measured on his lattice of synchronized clocks (x, y, z, t) (see Fig. 1‑1). A second observer O' (pronounced O prime) in a different frame S' measures the same event in her coordinate system and her lattice of synchronized clocks (x', y', z', t'). Since we are dealing with inertial frames, neither observer is under acceleration, and a simple set of equations allows us to relate coordinates (x, y, z, t) to (x', y', z', t'). Given that the two coordinate systems are in standard configuration, meaning that they are aligned with parallel (x, y, z) coordinates and that t = 0 when t' = 0, the coordinate transformation is as follows:[7][8]

Figure 2-2. Galilean Spacetime and composition of velocities.

Fig. 2-2 illustrates that in Newton's theory, time is universal, not the velocity of light.[9]:36–37 Consider the following thought experiment: The red arrow illustrates a train that is moving at 0.4 c with respect to the platform. Within the train, a passenger shoots a bullet with a speed of 0.4 c in the frame of the train. The blue arrow illustrates that a person standing on the train tracks measures the bullet as traveling at 0.8 c. This is in accordance with our naive expectations.

More generally, assume that frame S' is moving at velocity v with respect to frame S. Within frame S', observer O' measures an object moving with velocity u'. What is its velocity u with respect to frame S? Since x = ut, x' = x - vt, and t = t', we can write x' = ut - vt = (u - v)t = (u - v)t'. This leads to u' = x'/t' and ultimately


which is the common-sense Galilean law for the addition of velocities.

Example: Terence and Stella are at a 100 meter race. Terence is an official at the starting blocks, while Stella is a participant. At t = t' = 0, Stella begins running at a speed of 9 m/s. At 5 s into the race, Terence, in his unprimed coordinate system, observes their mother, situated 45 m downfield from the starting blocks and 10 m to the left (45 m, 10 m, 0 m, 5 s), waving at Stella. Stella, in her primed coordinate system, observes their mother waving at her at x' = x - vt = 45 - 9×5 = 0 m, y' = y = 10 m, i.e. she observes their mother as waving from directly to her left (0 m, 10 m, 0 m, 5 s).

Four-dimensional description[edit]

In Encyclopedie, published in 1754, under the term dimension Jean le Rond d'Alembert speculated that a duration of time might be considered a fourth dimension, beyond the established three spatial dimensions.[clarification needed][10]

Another early venture was by Joseph Louis Lagrange in his Theory of Analytic Functions (1797, 1813). He said, "One may view mechanics as a geometry of four dimensions, and mechanical analysis as an extension of geometric analysis".[11]

Various authors have credited W. K. Clifford with having anticipated the spacetime concept in general relativity as far back as 1870.[clarification needed][12][13] In an 1876 publication, Clifford speculated that "curvature of space is what really happens in that phenomenon which we call the motion of matter". His speculation did not include a notion of curved spacetime, however.[14]

The idea of a unified spacetime is stated by Edgar Allan Poe in his essay on cosmology titled Eureka (1848) that "Space and duration are one". In 1895, in his novel The Time Machine, H. G. Wells wrote, "There is no difference between time and any of the three dimensions of space except that our consciousness moves along it", and that "any real body must have extension in four directions: it must have Length, Breadth, Thickness, and Duration".

Maxwell's equations[edit]

The ancient idea of the cosmos gradually was described mathematically with differential equations[further explanation needed], differential geometry[further explanation needed], and abstract algebra[further explanation needed]. These mathematical articulations blossomed in the nineteenth century as electrical technology stimulated men like Michael Faraday and James Clerk Maxwell to describe the reciprocal relations of electric and magnetic fields. Daniel Siegel phrased Maxwell's role in relativity as follows:

[...] the idea of the propagation of forces at the velocity of light through the electromagnetic field as described by Maxwell's equations—rather than instantaneously at a distance—formed the necessary basis for relativity theory.[15]:189

Maxwell used vortex models in his papers on On Physical Lines of Force, but ultimately gave up on any substance but the electromagnetic field. Pierre Duhem wrote:

[Maxwell] was not able to create the theory that he envisaged except by giving up the use of any model, and by extending by means of analogy the abstract system of electrodynamics to displacement currents.[16]

In Siegel's estimation, "this very abstract view of the electromagnetic fields, involving no visualizable picture of what is going on out there in the field, is Maxwell's legacy."[15]:191

Describing the behaviour of electric fields and magnetic fields led Maxwell to view the combination as an electromagnetic field. These fields have a value at every point of spacetime. It is the intermingling of electric and magnetic manifestations, described by Maxwell's equations, that give spacetime its structure. In particular, the rate of motion of an observer determines the electric and magnetic profiles of the electromagnetic field. The propagation of the field is determined by the electromagnetic wave equation, which requires spacetime for description.[17]:1–30

Relativistic spacetime[edit]

Measuring aether[edit]

Figure 1-2. Michelson and Morley expected that motion through the aether would cause a differential phase shift between light traversing the two arms of their apparatus. The most logical explanation of their negative result, aether dragging, was in conflict with the observation of stellar aberration.

By the mid-1800s, various experiments such as the observation of the Arago spot and differential measurements of the speed of light in air versus water were considered to have proven the wave nature of light as opposed to corpuscular theory.[18] Waves implied the existence of a medium which waved, but attempts to measure the properties of the hypothetical luminiferous aether implied by these experiments provided contradictory results. For example, the Fizeau experiment of 1851 demonstrated that the speed of light in flowing water was less than the sum of the speed of light in air plus the speed of the water by an amount dependent on the water's index of refraction. Among other issues, the dependence of the partial aether-dragging implied by this experiment on the index of refraction (which is dependent on wavelength) led to the unpalatable conclusion that aether simultaneously flows at different speeds for different colors of light.[19] The famous Michelson–Morley experiment of 1887 showed no differential influence of Earth's motions through the hypothetical aether on the speed of light, and the most likely explanation, complete aether dragging, was in conflict with the observation of stellar aberration (Fig. 1‑3).[20]

Figure 1-3. Aether dragging is inconsistent with the phenomenon of stellar aberration. Aberration occurs when the observer's velocity has a component that is perpendicular to the line traveled by the light incoming from the star. In the animation on the left, the telescope must be tilted before the star will appear in the center of the eyepiece. In the animation on the right, if the aether is dragged in the vicinity of the earth, then the telescope must be pointed directly at the star for the star to appear in the center of the eyepiece.

George Francis FitzGerald in 1889 and Hendrik Lorentz in 1892 independently proposed that material bodies traveling through the fixed aether were physically affected by their passage, contracting in the direction of motion by an amount that was exactly what was necessary to explain the negative results of the Michelson-Morley experiment. (No length changes occur in directions transverse to the direction of motion.) By 1904, Lorentz had expanded his theory such that he had arrived at equations formally identical with those that Einstein were to derive later (i.e. the Lorentz transform), but with a fundamentally different interpretation. As a theory of dynamics (the study of forces and torques and their effect on motion), his theory assumed actual physical deformations of the physical constituents of matter, and it predicted various physical effects which might be observable.[21]:163–174 For example, most physicists believed that Lorentz contraction would be detectable by such experiments as the Trouton–Noble experiment or the Experiments of Rayleigh and Brace.[22]:64 However, these gave negative results, and in his 1904 theory of the electron, Lorentz explained these negative results as an inevitable consequence of his transforms. Poincaré, correcting some errors in Lorentz's analysis, demonstrated the non-detectability of the aether, but continued to believe in the dynamical interpretation of the Lorentz transform for the rest of his life.[21]:163–174

Special relativity[edit]

Einstein's theory of special relativity, introduced in 1905, provided a complete resolution to these and other major physics puzzles[which?], and it made predictions that have since been repeatedly confirmed. Einstein performed his analyses in terms of kinematics (the study of moving bodies without reference to forces) rather than dynamics. It would appear[according to whom?] that he did not at first think geometrically about spacetime. It was Einstein's former mathematics professor, Hermann Minkowski, who was to provide a geometric interpretation of special relativity.[23]:219

Einstein was initially dismissive of the geometric interpretation of special relativity, regarding it as "überflüssige Gelehrsamkeit" (superfluous learnedness). However, the geometric interpretation of relativity was to prove vital to Einstein's later development of general relativity, and in 1916, Einstein fully acknowledged his indebtedness to Minkowski, whose interpretation greatly facilitated the transition to general relativity.[21]:151–152 The spacetime of special relativity has since come to be known as Minkowski spacetime.

Albert Einstein's 1905 special theory of relativity postulated that the speed of light through empty space has one definite value—a constant—that is independent of the motion of the light source. This constant , conventionally called the "speed of light", relates distances measured in space with distances measured in time. The magnitude of this scale factor (nearly 300,000 km in space being equivalent to 1 s in time). Einstein's equations precisely described important consequences of this fact: both the shape of space and the measurement of time simultaneously change for observers who have different relative velocities, which is to say, for observers who have different inertial frames of reference.

Einstein's theory was framed in terms of kinematics (the study of moving bodies), and showed how measurements of space and time varied for observers in different reference frames. His theory was a breakthrough advance over Lorentz's 1904 theory of electromagnetic phenomena and Poincaré's electrodynamic theory. Although these theories included equations identical to those that Einstein introduced (i.e. the Lorentz transformation), they were essentially ad hoc models proposed to explain the results of various experiments—including the famous Michelson–Morley interferometer experiment—that were extremely difficult to fit into existing paradigms.

Einstein's inspiration for the theory of special relativity, may have been inconsistencies that he perceived in the manner in which Maxwell's theory was being interpreted. Although in 1905, Einstein wrote of the Moving magnet and conductor problem as being generally understood as leading to asymmetries, these asymmetries do not appear to have been noted as a significant issue requiring explanation by any of the leading expositors of Maxwell's theory, including Hertz, Lorentz, and Maxwell himself.[22]:135–142

At ordinary, non-relativistic speeds and at ordinary, human-scale distances, there is little that humans might observe which is noticeably different from what they might observe if the world were Euclidean. It was only with the advent of sensitive scientific measurements in the mid-1800s, such as the Fizeau experiment and the Michelson–Morley experiment, that puzzling discrepancies began to be noted between observation versus predictions based on the implicit assumption of Euclidean space.[20]

In 1908, Hermann Minkowski, expanding upon Einstein's work, presented a geometric interpretation of special relativity that fused the temporal dimension of time and the three spatial dimensions of space into a single four-dimensional continuum—what mathematicians call a 4‑dimensional manifold. A key feature of this interpretation is the definition of a spacetime interval that combines distance and time. Although measurements of distance and time between events differ among observers, the spacetime interval is independent of the inertial frame of reference in which they are recorded. The resultant spacetime came to be known as Minkowski space.

The subsection headings of this Introduction all link to brief summaries.


In the context of special relativity, time cannot be separated from the three dimensions of space, because the observed rate at which time passes for an object depends on the object's velocity relative to the observer. General relativity, in addition, provides an explanation of how gravitational fields can slow the passage of time for an object as seen by an observer outside the field.

Figure 1-1. Special relativity uses the term observer in a manner distinct from the ordinary English meaning.[clarification needed]

In special relativity, an observer is a frame of reference from which a set of objects or events are being measured. This usage differs significantly from the ordinary English meaning of the term. Reference frames are inherently nonlocal constructs, and it does not make sense to speak of an observer as having a location. In Fig. 1‑1, imagine that a scientist is in control of a dense lattice of clocks, synchronized within her reference frame, that extends indefinitely throughout the three dimensions of space. Her location within the lattice is not important. She uses her latticework of clocks to determine the time and position of events taking place within its reach. The entire ensemble of clocks associated with one inertial frame of reference is referred to as an observer.[24] An ideal observer experiences no time delays between the firing of an event and its recording. In real life, there will typically be delays between the emission of a signal and its detection. However, in the data reduction following an experiment, the time when a signal is received will be corrected to reflect its actual time were it to have been recorded by an idealized lattice of clocks.

Measurement and observation[edit]

Physicists distinguish[how?] between what one measures or observes (after one has factored out signal propagation delays[how?]), versus what one sees (what one visualizes without such corrections). Failure to understand the difference between what one measures/observes versus what one sees is the source of much error among beginning students of relativity.[25][clarification needed]

Spacetime interval[edit]

Although time comes in as a fourth dimension, it is treated differently than the spatial dimensions. Minkowski space hence differs in important respects from four-dimensional Euclidean space. The fundamental reason for merging space and time into spacetime is that space and time are separately not invariant, which is to say that, under the proper conditions, different observers will disagree on the length of time between two events (because of time dilation) or the distance between the two events (because of length contraction). But special relativity provides a new invariant, called the spacetime interval,[23]:217 which combines distances in space and in time. All observers who measure time and distance carefully will find the same spacetime interval between any two events. Suppose an observer measures two events as being separated by a time and a spatial distance . Then the spacetime interval between the two events is given by

   (or for three space dimensions, )[Note 1]
Figure 1-4. Spacetime diagram illustrating two photons, A and B, originating at the same event, and a slower-than-light-speed object, C.

The equation above is simply an expression of the Pythagorean theorem, except with a minus sign between the and the terms. Note also that the spacetime interval is the quantity , not itself. The reason is that unlike distances in Euclidean geometry, intervals in Minkowski spacetime can be negative. Rather than deal with square roots of negative numbers, physicists customarily regard as a distinct symbol in itself, rather than the square of something.[23]:217

Because of the minus sign, the spacetime interval between two events can be zero. Spacetime intervals are zero when . In other words, the spacetime interval is zero for something moving at the speed of light. A photon arriving in our eye from a distant star will not have aged, despite having (from our perspective) spent years in its passage.

A spacetime diagram is typically drawn with only a single space and a single time coordinate. Fig. 1‑4 presents a spacetime diagram illustrating the world lines (i.e. paths in spacetime) of two photons, A and B, originating from the same event and going in opposite directions. In addition, C illustrates the world line of a slower-than-light-speed object. The vertical time coordinate is scaled by so that it has the same units (meters) as the horizontal space coordinate. Since photons travel at the speed of light, their world lines have a slope of ±1. In other words, every meter that a photon travels to the left or right requires approximately 3.3 nanoseconds of time.

Light cone[edit]

In Fig. 1-6, event O is centered at the origin of the spacetime diagram, and the two diagonal lines represent all events that have zero spacetime interval with respect to the origin event. These two lines form what is called the light cone, since as illustrated in Fig. 1‑7, if another dimension is added to the diagram, the appearance would be that of two cones meeting with their apexes at the origin and centered around the time axis, one cone extending into the future, the other into the past. Since the spacetime interval is an invariant, all observers will assign the same events to the light cone of a given event. This statement is equivalent to the second postulate of special relativity: all observers measure the speed of light to be .[23]:220

Figure 1-7. Light cone in 2D space plus a time dimension.

The light cone divides spacetime into separate regions. The interior of the future light cone consists of events that are separated from the origin event by more time than there is space: these events comprise the "timelike future" of the origin event. Likewise, the "timelike past" comprises the interior events of the past light cone. Since in timelike intervals, the separation in time is greater than the separation in space, timelike intervals are positive. The region exterior to the light cone consists of events that are separated from the origin event by more space than there is time. These events comprise the spacelike "Elsewhere" of the origin event. Events on the light cone itself are said to be "lightlike" (or null) separated from the origin. Because of the invariance of spacetime, all observers will agree on this division of spacetime.[23]:220

The light cone has an essential role in defining the concept of causality. Signals cannot travel faster than the speed of light. In Fig. 1‑6, it is possible for a slower-than-or-equal-to-light-speed signal to travel from the position and time of O, to the position and time of D. It is hence possible for event O to have a causal influence on event D. The future light cone contains all of the events that could be causally influenced by O. Likewise, it is possible for a slower-than-or-equal-to-light-speed signal to travel from the position and time of A, to the position and time of O. The past light cone contains all of the events that could have a causal influence on O. In contrast, in the Elsewhere region, event O cannot affect or be affected by event C, nor can event O affect or be affected by event B. There is no causal relationship between O and any events in the Elsewhere region.[26]

Causal structure[edit]

The causal structure of a spacetime describes causal relationships between pairs of points in the spacetime based on the existence of certain[which?] types of curves joining the points.

Relativity of simultaneity[edit]

Figure 1-8. Animation illustrating relativity of simultaneity.

All observers will agree that for any given event, an event within the given event's future light cone occurs after the given event. Likewise, for any given event, an event within the given event's past light cone occurs before the given event. The before-after relationship observed for timelike-separated events remains unchanged no matter what the reference frame of the observer, i.e. no matter how the observer may be moving. The situation is quite different for spacelike-separated events. Fig. 1‑6 was drawn from the reference frame of an observer moving at v = 0. From this reference frame, event C is observed to occur after event O, and event B is observed to occur before event O. From a different reference frame, the orderings of these non-causally-related events can be reversed. In particular, one notes that if two events are simultaneous in a particular reference frame, they are necessarily separated by a spacelike interval and thus are noncausally related. The observation that simultaneity is not absolute, but depends on the observer's reference frame, is termed the relativity of simultaneity.[27]

Fig. 1-8 illustrates the use of spacetime diagrams in the analysis of the relativity of simultaneity. The events in spacetime are invariant, but the coordinate frames transform as discussed above for Fig. 1‑5. The three events (A, B, C) are simultaneous from the reference frame of an observer moving at v = 0. From the reference frame of an observer moving at v = 0.3 c, the events appear to occur in the order C, B, A. From the reference frame of an observer moving at v = −0.5 c, the events appear to occur in the order A, B, C. The white line represents a plane of simultaneity being moved from the past of the observer to the future of the observer, highlighting events residing on it. The gray area is the light cone of the observer, which remains invariant.

Figure 1-9. (a) Families of invariant hyperbolae. (b) Hyperboloids of two sheets and one sheet.

The spacetime interval gives[how?] the same distance that an observer would measure if the events being measured were simultaneous to the observer. The spacetime interval hence provides a measure of "proper distance", i.e. the true distance in spacetime. Likewise, the spacetime interval gives the same measure of time as would be presented by the cumulative ticking of a clock that moves along a given world line. The spacetime interval hence provides a measure of "proper time".[23]:220–221

Relative motion and transformations[edit]

Invariant hyperbola[edit]

In Euclidean space, the set of points that are equidistant from an origin form a circle (in two dimensions) or a sphere (in three dimensions). In Minkowski spacetime, the points at some constant spacetime interval from the origin form a curve given by the equation

The example above is the equation of a hyperbola in an x–ct spacetime diagram, which is termed[according to whom?] an invariant hyperbola.

Figure 1-10. The invariant hyperbola comprises the points that can be reached from the origin in a fixed proper time by clocks traveling at different speeds.

In Fig. 1-9a, the magenta hyperbolae connect events of equal spacelike separation from the origin, while the green hyperbolae connect events of equal timelike separation from the origin. Fig. 1‑9b shows that when viewed in an extra dimension of space, the spacelike invariant hyperbolae generate hyperboloids of one sheet, while the timelike invariant hyperbolae generate hyperboloids of two sheets.

Time dilation and length contraction[edit]

Fig. 1-10 illustrates the invariant hyperbola for all events that can be reached from the origin in a proper time of 5 meters (approximately 1.67×10−8s). Different world lines represent clocks moving at different speeds. A clock that is stationary with respect to the observer has a world line that is vertical, and the elapsed time measured by the observer is the same as the proper time. For a clock traveling at 0.3c, the elapsed time measured by the observer is 5.24 meters (1.75×10−8s), while for a clock traveling at 0.7c, the elapsed time measured by the observer is 7.00 meters (2.34×10−8s). This illustrates the phenomenon known as time dilation. Clocks that travel faster take longer (in the observer frame) to tick out the same amount of proper time, and they travel further along the x–axis than they would have without time dilation.[23]:220–221 The measurement of time dilation by two observers in different inertial reference frames is mutual. If observer O measures the clocks of observer O' as running slower in his frame, observer O' in turn will measure the clocks of observer O as running slower.

Figure 1-11. In this spacetime diagram, the 1 m length of the moving rod, as measured in the primed frame, is the foreshortened distance OC when projected onto the unprimed frame.

Length contraction, like time dilation, is a manifestation of the relativity of simultaneity. Measurement of length requires measurement of the spacetime interval between two events that are simultaneous in one's frame of reference. But events that are simultaneous in one frame of reference are, in general, not simultaneous in other frames of reference.

Fig. 1-11 illustrates the motions of a 1 m rod that is traveling at 0.5 c along the x axis. The edges of the blue band represent the world lines of the rod's two endpoints. The invariant hyperbola illustrates events separated from the origin by a spacelike interval of 1 m. The endpoints O and B measured when t' = 0 are simultaneous events in the S' frame. But to an observer in frame S, events O and B are not simultaneous. To measure length, the observer in frame S measures the endpoints of the rod as projected onto the x-axis along their world lines. The projection of the rod's world sheet onto the x axis yields the foreshortened length OC.[4]:125

(not illustrated) Drawing a vertical line through A so that it intersects the x' axis demonstrates that, even as OB is foreshortened from the point of view of observer O, OA is likewise foreshortened from the point of view of observer O'. In the same way that each observer measures the other's clocks as running slow, each observer measures the other's rulers as being contracted.

Measurement versus visual appearance[edit]

Figure 1-12. Comparison of the measured length contraction of a cube versus its visual appearance.

Time dilation and length contraction are not optical illusions, but genuine effects. Measurements of these effects are not an artifact of Doppler shift, nor are they the result of neglecting to take into account the time it takes light to travel from an event to an observer.

That being said, scientists make a fundamental distinction between measurement or observation on the one hand, versus visual appearance, or what one sees.

For many years, the distinction between the two had not been generally appreciated. For example, it had generally been thought that a length contracted object passing by an observer would in fact actually be seen as length contracted. In 1959, James Terrell and Roger Penrose independently pointed out that differential time lag effects in signals reaching the observer from the different parts of a moving object result in a fast moving object's visual appearance being quite different from its measured shape. For example, a receding object would appear contracted, an approaching object would appear elongated, and a passing object would have a skew appearance that has been likened to a rotation.[28][29][30][31] A sphere in motion retains the appearance of a sphere, although images on the surface of the sphere will appear distorted.[32]

Figure 1-13. Galaxy M87 streams out a black-hole-powered jet of electrons and other sub-atomic particles traveling at nearly the speed of light.

Fig. 1‑12 illustrates a cube viewed from a distance of four times the length of its sides. At high speeds, the sides of the cube that are perpendicular to the direction of motion appear hyperbolic in shape. The cube is actually not rotated. Rather, light from the rear of the cube takes longer to reach one's eyes compared with light from the front, during which time the cube has moved to the right. This illusion has come to be known as Terrell rotation or the Terrell–Penrose effect.[Note 2]

Another example where visual appearance is at odds with measurement comes from the observation of apparent superluminal motion in various radio galaxies, BL Lac objects, quasars, and other astronomical objects that eject relativistic-speed jets of matter at narrow angles with respect to the viewer. An optical illusion results giving the appearance of faster than light travel.[33][34][35] In Fig. 1‑13, galaxy M87 streams out a high-speed jet of subatomic particles almost directly towards us, but Penrose–Terrell rotation causes the jet to appear to be moving laterally in the same manner that the appearance of the cube in Fig. 1‑12 has been stretched out.[36]

Mutual time dilation and the twin paradox[edit]

Mutual time dilation[edit]

Mutual time dilation and length contraction tend to strike beginners as inherently self-contradictory concepts. The worry is that if observer A measures observer B's clocks as running slowly, simply because B is moving at speed v relative to A, then the principle of relativity requires that observer B likewise measures A's clocks as running slowly. This is an important question that "goes to the heart of understanding special relativity."[23]:198

Basically, A and B are performing two different measurements.

In order to measure the rate of ticking of one of B's clocks, A must use two of his own clocks, the first to record the time where B's clock first ticked at the first location of B, and second to record the time where B's clock emitted its second tick at the next location of B. Observer A needs two clocks because B is moving, so a grand total of three clocks are involved in the measurement. A's two clocks must be synchronized in A's frame. Conversely, B requires two clocks synchronized in her frame to record the ticks of A's clocks at the locations where A's clocks emitted their ticks. Therefore, A and B are performing their measurements with different sets of three clocks each. Since they are not doing the same measurement with the same clocks, there is no inherent necessity that the measurements be reciprocally "consistent" such that, if one observer measures the other's clock to be slow, the other observer measures the one's clock to be fast.[23]:198–199

In regards to mutual length contraction, Fig. 1‑11 illustrates that the primed and unprimed frames are mutually rotated by a hyperbolic angle (analogous to ordinary angles in Euclidean geometry).[Note 3] Because of this rotation, the projection of a primed meter-stick onto the unprimed x-axis is foreshortened, while the projection of an unprimed meter-stick onto the primed x'-axis is likewise foreshortened.

Figure 1-14. Mutual time dilation

Fig. 1-14 reinforces previous discussions about mutual time dilation. In this figure, Events A and C are separated from event O by equal timelike intervals. From the unprimed frame, events A and B are measured as simultaneous, but more time has passed for the unprimed observer than has passed for the primed observer. From the primed frame, events C and D are measured as simultaneous, but more time has passed for the primed observer than has passed for the unprimed observer. Each observer measures the clocks of the other observer as running more slowly.[4]:124

An observer's state of motion cannot affect an observed object, but it can affect the observer's observations of the object.

In Fig. 1-14, each line drawn parallel to the x axis represents a line of simultaneity for the unprimed observer. All events on that line have the same time value of ct. Likewise, each line drawn parallel to the x' axis represents a line of simultaneity for the primed observer. All events on that line have the same time value of ct'.

Twin paradox[edit]

The twin paradox is a thought experiment involving identical twins, one of whom makes a journey into space in a high-speed rocket, returning home to find that the twin who remained on Earth has aged more. This result appears puzzling because each twin observes the other twin as moving, and so at first glance, it would appear that each should find the other to have aged less. The twin paradox sidesteps the justification for mutual time dilation presented above by avoiding the requirement for a third clock.[23]:207 Nevertheless, the paradox is understandable within special relativity.

The impression that a paradox exists stems from a misunderstanding of what special relativity states. Special relativity does not declare all frames of reference to be equivalent, only inertial frames. The traveling twin's frame is not inertial during periods when she is accelerating. Furthermore, the difference between the twins is observationally detectable: the traveling twin needs to fire her rockets to be able to return home, while the stay-at-home twin does not.[37]

Figure 1-15. Spacetime explanation of the twin paradox

Deeper analysis is needed before we can understand why these distinctions should result in a difference in the twins' ages. Consider the spacetime diagram of Fig. 1‑14. This presents the simple case of a twin going straight out along the x axis and immediately turning back. From the standpoint of the stay-at-home twin, there is nothing puzzling about the twin paradox at all. The proper time measured along the traveling twin's world line from O to C, plus the proper time measured from C to B, is less than the stay-at-home twin's proper time measured from O to A to B. More complex trajectories require the evaluation the integral of the proper times along the curve (i.e. the path integral) to calculate the total amount of proper time experienced by the traveling twin.[37]

Complications arise if the twin paradox is analyzed from the traveling twin's point of view.

For the rest of this discussion, we adopt Weiss's nomenclature, designating the stay-at-home twin as Terence and the traveling twin as Stella.[37]

We had previously noted that Stella is not in an inertial frame. Given this fact, it is sometimes stated that full resolution of the twin paradox requires general relativity.[37] Analyzed in Stella's frame, she is motionless for the entire trip. When she fires her rockets for the turnaround, she experiences a pseudo-gravitational field that exactly cancels the force of her rockets, so she remains motionless.[37]

Figs. 1-8 and 1-14 illustrate the concept of lines (planes) of simultaneity: Lines parallel to the observer's x-axis (xy-plane) represent sets of events that are simultaneous in the observer frame. In Fig. 1‑15, the blue lines connect events on Terence's world line which, from Stella's point of view, are simultaneous with events on her world line. (Terence, in turn, would observe a set of horizontal lines of simultaneity.) Throughout both the outbound and the inbound legs of Stella's journey, she measures Terence's clocks as running slower than her own. But during the turnaround (i.e. between the bold blue lines in the figure), a shift takes place in the angle of her lines of simultaneity, corresponding to a rapid skip-over of the events in Terence's world line that Stella considers to be simultaneous with her own. Therefore, at the end of her trip, Stella finds that Terence has aged more than she has.[37]

Formal description[edit]

Relativistic composition of velocities[edit]

Figure 2-3. Relativistic composition of velocities

The composition of velocities[clarification needed] is quite different[clarification needed] in relativistic spacetime. To reduce the complexity of the equations slightly, we introduce a common shorthand for the ratio of the speed of an object relative to light,

Fig. 2-3a illustrates a red train that is moving forward at a speed given by v/c = β = s/a. From the primed frame of the train, a passenger shoots a bullet with a speed given by u'/c = β' = n/m, where the distance is measured along a line parallel to the red x' axis rather than parallel to the black x axis. What is the composite velocity u of the bullet relative to the platform, as represented by the blue arrow? Referring to Fig. 2‑3b:

  1. From the platform, the composite speed of the bullet is given by u = c(s + r)/(a + b).
  2. The two yellow triangles are similar because they are right triangles that share a common angle α. In the large yellow triangle, the ratio s/a = v/c = β.
  3. The ratios of corresponding sides of the two yellow triangles are constant, so that r/a = b/s = n/m = β'. So b = u's/c and r = u'a/c.
  4. Substitute the expressions for b and r into the expression for u in step 1 to yield Einstein's formula for the addition of velocities:[9]:42–48

The relativistic formula for addition of velocities presented above exhibits several important features:

  • If u' and v are both very small compared with the speed of light, then the product vu'/c2 becomes vanishingly small, and the overall result becomes indistinguishable from the Galilean formula (Newton's formula) for the addition of velocities: u = u' + v. The Galilean formula is a special case of the relativistic formula applicable to low velocities.
  • If u' is set equal to c, then the formula yields u = c regardless of the starting value of v. The velocity of light is the same for all observers regardless their motions relative to the emitting source.[9]:49

Time dilation and length contraction[edit]

Figure 2-4. Spacetime diagrams illustrating time dilation and length contraction

We had previously discussed, in qualitative terms, time dilation and length contraction. It is straightforward to obtain quantitative expressions for these effects. Fig. 2‑4 is a composite image containing individual frames taken from two previous animations, simplified and relabeled for the purposes of this section.

To reduce the complexity of the equations slightly, we see in the literature a variety of different shorthand notations for ct :

and are common.
One also sees very frequently the use of the convention
Figure 2-5. Lorentz factor as a function of velocity

In Fig. 2-4a, segments OA and OK represent equal spacetime intervals. Time dilation is represented by the ratio OB/OK. The invariant hyperbola has the equation w = x2 + k2 where k = OK, and the red line representing the world line of a particle in motion has the equation w = x/β = xc/v. A bit of algebraic manipulation yields

The expression involving the square root symbol appears very frequently in relativity, and one over the expression is called the Lorentz factor, denoted by the Greek letter gamma :[38]

We note that if v is greater than or equal to c, the expression for becomes physically meaningless, implying that c is the maximum possible speed in nature. Next, we note that for any v greater than zero, the Lorentz factor will be greater than one, although the shape of the curve is such that for low speeds, the Lorentz factor is extremely close to one.

In Fig. 2-4b, segments OA and OK represent equal spacetime intervals. Length contraction is represented by the ratio OB/OK. The invariant hyperbola has the equation x = w2 + k2 where k = OK, and the edges of the blue band representing the world lines of the endpoints of a rod in motion have slope 1/β = c/v. Event A has coordinates (x, w) = (γk, γβk). Since the tangent line through A and B has the equation w = (x − OB)/β, we have γβk = (γk − OB)/β and

Lorentz transformations[edit]

The Galilean transformations and their consequent commonsense law of addition of velocities work well in our ordinary low-speed world of planes, cars and balls. Beginning in the mid-1800s, however, sensitive scientific instrumentation began finding anomalies that did not fit well with the ordinary addition of velocities.

To compare one set of coordinates to another in special relativity, we use the Lorentz transformations.

The Lorentz factor appears in the Lorentz transformations:

The inverse Lorentz transformations are:

When v << c, the v2/c2 and vx/c2 terms approach zero, and the Lorentz transformations approximate to the Galilean transformations.

Example: Terence and Stella are at an Earth-to-Mars space race. Terence is an official at the starting line, while Stella is a participant. At time t = t' = 0, Stella's spaceship accelerates instantaneously to a speed of 0.5 c. The distance from Earth to Mars is 300 light-seconds (about 90.0×106 km). Terence observes Stella crossing the finish-line clock at t = 600.00 s. But Stella observes the time on her ship chronometer to be t' = (t - vx/c2) = 519.62 s as she passes the finish line, and she calculates the distance between the starting and finish lines, as measured in her frame, to be 259.81 light-seconds (about 77.9x106 km). 1).

Figure 2-6. Derivation of Lorentz Transformation

There have been many dozens of derivations of the Lorentz transformations since Einstein's original work in 1905, each with its particular focus. Although Einstein's derivation was based on the invariance of the speed of light, there are other physical principles that may serve as starting points. Ultimately, these alternative starting points can be considered different expressions of the underlying principle of locality, which states that the influence that one particle exerts on another can not be transmitted instantaneously.[39]

The derivation given here and illustrated in Fig. 2‑6 is based on one presented by Bais[9]:64–66 and makes use of previous results from the Relativistic Composition of Velocities, Time Dilation, and Length Contraction sections. Event P has coordinates (w, x) in the black "rest system" and coordinates (w', x') in the red frame that is moving with velocity parameter β = v/c. How do we determine w' and x' in terms of w and x ? (Or the other way around, of course.)

It is easier at first to derive the inverse Lorentz transformation.

  1. We start by noting that there can be no such thing as length expansion/contraction in the transverse directions. y' must equal y and z' must equal z, otherwise whether a fast moving 1 m ball could fit through a 1 m circular hole would depend on the observer. The first postulate of relativity states that all inertial frames are equivalent, and transverse expansion/contraction would violate this law.[40]:27–28
  2. From the drawing, w = a + b and x = r + s 
  3. From previous results using similar triangles, we know that s/a = b/r = v/c = β.
  4. We know that because of time dilation, a = γw'
  5. Substituting equation (4) into s/a = β yields s = γw'β.
  6. Length contraction and similar triangles give us r = γx' and b = βr = βγx'
  7. Substituting the expressions for s, a, r and b into the equations in Step 2 immediately yield

The above equations are alternate expressions for the t and x equations of the inverse Lorentz transformation, as can be seen by substituting ct for w, ct' for w', and v/c for β. From the inverse transformation, the equations of the forwards transformation can be derived by solving for t' and x'.


The Lorentz transformations have a mathematical property called linearity, since x' and t' are obtained as linear combinations of x and t, with no higher powers involved. The linearity of the transformation reflects a fundamental property of spacetime that we tacitly assumed while performing the derivation, namely, that the properties of inertial frames of reference are independent of location and time. In the absence of gravity, spacetime looks the same everywhere.[9]:67 All inertial observers will agree on what constitutes accelerating and non-accelerating motion.[40]:72–73 Any one observer can use her own measurements of space and time, but there is nothing absolute about them. Another observer's conventions will do just as well.[23]:190

A result of linearity is that if two Lorentz transformations are applied sequentially, the result is also a Lorentz transformation.

Example: Terence observes Stella speeding away from him at 0.500 c, and he can use the Lorentz transformations with β = 0.500 to relate Stella's measurements to his own. Stella, in her frame, observes Ursula traveling away from her at 0.250 c, and she can use the Lorentz transformations with β = 0.250 to relate Ursula's measurements with her own. Because of the linearity of the transformations and the relativistic composition of velocities, Terence can use the Lorentz transformations with β = 0.666 to relate Ursula's measurements with his own.

Curved spacetime[edit]

In general relativity, it is assumed that spacetime is curved by the presence of matter (energy), this curvature being represented by the Riemann tensor. In special relativity, the Riemann tensor is identically zero. This is sometimes expressed by the statement Minkowski spacetime is flat.[17]:1–30

The notions of time-like, light-like and space-like intervals in special relativity can similarly be used to classify one-dimensional curves through curved spacetime. A time-like curve can be understood as one where the interval between any two infinitesimally close events on the curve is time-like, and likewise for light-like and space-like curves. Technically the three types of curves are usually defined in terms of whether the tangent vector at each point on the curve is time-like, light-like or space-like. The world line of a slower-than-light object will always be a time-like curve, the world line of a massless particle such as a photon will be a light-like curve, and a space-like curve could be the world line of a hypothetical tachyon. In the local neighborhood of any event, time-like curves that pass through the event will remain inside that event's past and future light cones, light-like curves that pass through the event will be on the surface of the light cones, and space-like curves that pass through the event will be outside the light cones. One can also define the notion of a three-dimensional "spacelike hypersurface", a continuous three-dimensional slice through the four-dimensional property[clarification needed] with the property that every curve that is contained entirely within this hypersurface is a space-like curve.[41]

General Relativity[edit]

This introductory section has focused on the spacetime of special relativity, since it is the easiest to describe. Minkowski spacetime is flat, takes no account of gravity, is uniform throughout, and serves as nothing more than a static background for the events that take place in it. The presence of gravity greatly complicates the description of spacetime. In general relativity, spacetime is no longer a static background, but actively interacts with the physical systems that it contains. Spacetime curves in the presence of matter, can propagate waves, bends light, and exhibits a host of other phenomena.[23]:221 A few of these phenomena are described in the later sections of this article.

Quantized spacetime[edit]

In general relativity, spacetime is assumed to be smooth and continuous—and not just in the mathematical sense. In the theory of quantum mechanics, there is an inherent discreteness present in physics. In attempting to reconcile these two theories, it is sometimes postulated that spacetime should be quantized at the very smallest scales. Current theory is focused on the nature of spacetime at the Planck scale. Causal sets, loop quantum gravity, string theory, causal dynamical triangulation, and black hole thermodynamics all predict a quantized spacetime with agreement on the order of magnitude. Loop quantum gravity makes precise predictions about the geometry of spacetime at the Planck scale.[42]

Spin networks provide a language to describe quantum geometry of space. Spin foam does the same job on spacetime. A spin network is a one-dimensional graph, together with labels on its vertices and edges which encodes aspects of a spatial geometry.[43]

Mathematical description[edit]

Doppler effect[edit]

The Doppler effect is the change in frequency or wavelength of a wave for a receiver and source in relative motion. For simplicity, we consider here two basic scenarios: (1) The motions of the source and/or receiver are exactly along the line connecting them (longitudinal Doppler effect), and (2) the motions are at right angles to the said line (transverse Doppler effect). We are ignoring scenarios where they move along intermediate angles.

Longitudinal Doppler effect[edit]

Figure 2-7. Galilean spacetime diagrams of classical Doppler effect. (a) Source moving away from the receiver. (b) Receiver moving away from the source.

The classical Doppler analysis deals with waves that are propagating in a medium, such as sound waves or water ripples, and which are transmitted between sources and receivers that are moving towards or away from each other. The analysis of such waves depends on whether the source, the receiver, or both are moving relative to the medium. Fig. 2‑7a illustrates a Galilean spacetime diagram of the scenario where the source is moving directly away from a receiver that is stationary with respect to the medium. The source travels at a speed of vs for a velocity parameter of βs. With the source moving, the wavelength is affected, resulting in a frequency change as perceived by the observer. In the figure, the green arrows depict the transmission of each pulse at constant speed c through the medium. If the period T0 of the pulses is 1 per unit of time, the received period is T = 1+βs, so that the observed frequency f is given by

Figure 2-8. Spacetime diagram of relativistic Doppler effect

Fig. 2‑7b illustrates a Galilean spacetime diagram of the scenario where the receiver is moving directly away from a stationary source at a speed of vr for a velocity parameter of βr. With the source stationary, the wavelength is not changed, but the transmission velocity of the waves relative to the observer is decreased. In the diagram inset, we see that period T = 1+a. The variable a is calculated by noting the similar triangle relationship (1+a)/a = a/(a−βr), so that T = 1/(1−βr) and the observed frequency f is given by

Light, unlike sound or water ripples, does not propagate through a medium, and there is no distinction between a source moving away from the receiver or a receiver moving away from the source. Fig. 2‑8 illustrates a relativistic spacetime diagram showing a source separating from the receiver with a velocity parameter β, so that the separation between source and receiver at time w is βw. Because of time dilation, w = γw'. Since the slope of the green light ray is −1, T = w+βw = γw'(1). Hence, the relativistic Doppler effect is given by[9]:58–59

Transverse Doppler effect[edit]

Figure 2-9. Transverse Doppler effect scenarios

Suppose that a source, moving in a straight line, is at its closest point to the receiver. It would appear that the classical analysis predicts that the receiver detects no Doppler shift. Due to subtleties in the analysis, that expectation is not necessarily true. Nevertheless, when appropriately defined, transverse Doppler shift is a relativistic effect that has no classical analog. The subtleties are these:[40]:94–96

  • Fig. 2-9a. If a source, moving in a straight line, is crossing the receiver's field of view, what is the frequency measurement when the source is at its closest approach to the receiver?
  • Fig. 2-9b. If a source is moving in a straight line, what is the frequency measurement when the receiver sees the source as being closest to it?
  • Fig. 2-9c. If receiver is moving in a circle around the source, what frequency does the receiver measure?
  • Fig. 2-9d. If the source is moving in a circle around the receiver, what frequency does the receiver measure?

In scenario (a), when the source is closest to the receiver, the light hitting the receiver actually comes from a direction where the source had been some time back, and it has a significant longitudinal component, making an analysis from the frame of the receiver tricky. It is easier to make the analysis from S', the frame of the source. The point of closest approach is frame-independent and represents the moment where there is no change in distance versus time (i.e. dr/dt = 0 where r is the distance between receiver and source) and hence no longitudinal Doppler shift. The source observes the receiver as being illuminated by light of frequency f', but also observes the receiver as having a time-dilated clock. In frame S, the receiver is therefore illuminated by blueshifted light of frequency

Scenario (b) is best analyzed from S, the frame of the receiver. The illustration shows the receiver being illuminated by light from when the source was closest to the receiver, even though the source has moved on. Because the source's clocks are time dilated, and since dr/dt was equal to zero at this point, the light from the source, emitted from this closest point, is redshifted with frequency

Scenarios (c) and (d) can be analyzed by simple time dilation arguments. In (c), the receiver observes light from the source as being blueshifted by a factor of , and in (d), the light is redshifted. The only seeming complication is that the orbiting objects are in accelerated motion. However, if an inertial observer looks at an accelerating clock, only the clock's instantaneous speed is important when computing time dilation. (The converse, however, is not true.)[40]:94–96 Most reports of transverse Doppler shift refer to the effect as a redshift and analyze the effect in terms of scenarios (b) or (d).[Note 4]

Energy and momentum[edit]


Figure 2-10. Relativistic spacetime momentum vector

In classical mechanics, the state of motion of a particle is characterized by its mass and its velocity. Linear momentum, the product of a particle's mass and velocity, is a vector quantity, possessing the same direction as the velocity: p = mv. It is a conserved quantity, meaning that if a closed system is not affected by external forces, its total linear momentum cannot change.

In relativistic mechanics, the momentum vector is extended to four dimensions. Added to the momentum vector is a time component that allows the spacetime momentum vector to transform like the spacetime position vector (x, t). In exploring the properties of the spacetime momentum, we start, in Fig. 2‑10a, by examining what a particle looks like at rest. In the rest frame, the spatial component of the momentum is zero, i.e. p = 0, but the time component equals mc.

We can obtain the components of the four-momentum in the moving frame by using the Lorentz transformations, or we can read it directly from the figure because we know that (mc)' = γmc and p' = −βγmc, since the red axes are rescaled by gamma. Fig. 2‑10b illustrates the situation as it appears in the moving frame. It is apparent that the space and time components of the four-momentum go to infinity as the velocity of the moving frame approaches c.[9]:84–87

Momentum of light[edit]

Figure 2-11. Energy and momentum of light in different inertial frames

Light particles, or photons, travel at the speed of c, the constant that is conventionally known as the "speed of light". This statement is not a tautology, since many modern formulations of relativity do not start with constant speed of light as a postulate. Photons therefore propagate along a light-like world line and, in appropriate units, have equal space and time components for every observer.

A consequence of Maxwell's theory of electromagnetism is that light carries energy and momentum, and that their ratio is a constant: E/p = c. Rearranging, E/c = p, and since for photons, the space and time components are equal, E/c must therefore be equated with the time component of the spacetime momentum vector.

Photons travel at the speed of light, yet have finite momentum and energy. For this to be so, the mass term in γmc must be zero, meaning that photons are massless particles. Infinity times zero is an ill-defined quantity, but E/c is well-defined.

By this analysis, if the energy of a photon equals E in the rest frame, it equals E' = (1 - β)γE in a moving frame. This result can by derived by inspection of Fig. 2‑11 or by application of the Lorentz transformations, and is consistent with the analysis of Doppler effect given previously.[9]:88

Mass-energy relationship[edit]

Consideration of the interrelationships between the various components of the relativistic momentum vector led Einstein to several famous conclusions.

  • In the low speed limit as β = v/c approaches zero, approaches 1, so the spatial component of the relativistic momentum βγmc = γmv approaches mv, the classical term for momentum. Following this perspective, γm can be interpreted as a relativistic generalization of m. Einstein proposed that the relativistic mass of an object increases with velocity according to the formula mrel = γm.
  • Likewise, comparing the time component of the relativistic momentum with that of the photon, γmc = mrelc = E/c, so that Einstein arrived at the relationship E = mrelc2. Simplified to the case of zero velocity, this is Einstein's famous equation relating energy and mass.

Another way of looking at the relationship between mass and energy is to consider the time component of the relativistic momentum vector at low velocity. A series expansion of γm yields

In other words, at low velocity, mrel = m + ½ mv2/c2. The second term is just an expression for the kinetic energy of the particle divided by c2. Mass indeed appears to be another form of energy.[9]:90–92[40]:129–130,180

Although validated daily around the globe, the concept of relativistic mass that Einstein introduced in 1905, denoted mrel, is not a concept that has served as a basis for other theoretical development, such as general relativity.

For this and other reasons, among which educational, most physicists[44] prefer a different terminology when referring to the relationship between mass and energy. "Relativistic mass" is a deprecated term. The term "mass" by itself refers to the rest mass or invariant mass, and is equal to the invariant length of the relativistic momentum vector (E, pc). Expressed as a formula,

This formula applies to all particles, massless as well as massive. For massless photons, it yields the same relationship that we had earlier established, E = ±pc.[9]:90–92

Because of the close relationship between mass and energy, the four-momentum (also called 4‑momentum) is also called the energy-momentum 4‑vector. Using an uppercase P to represent the four-momentum and a lowercase p to denote the spatial momentum, the four-momentum may be written as

or alternatively,
using the convention that [40]:129–130,180

Invariance and symmetry[edit]

Often in relativity, spacetimes that have some form of symmetry are studied. As well as helping to classify spacetimes, these symmetries usually serve as a simplifying assumption in specialized work. Some of the most popular ones include:

Conservation laws[edit]

In physics, conservation laws state that certain particular measurable properties of an isolated physical system do not change as the system evolves over time. In 1915, Emmy Noether discovered that underlying each conservation law is a fundamental symmetry of nature.[45] The fact that physical processes don't care where in space they take place (space translation symmetry) yields conservation of momentum, the fact that such processes don't care when they take place (time translation symmetry) yields conservation of energy, and so on. In this section, we examine the Newtonian views of conservation of mass, momentum and energy from a relativistic perspective.

Total momentum[edit]

Figure 2-12. Relativistic conservation of momentum

To understand how the Newtonian view of conservation of momentum needs to be modified in a relativistic context, we examine the problem of two colliding bodies limited to a single dimension.

In Newtonian mechanics, two extreme cases of this problem may be distinguished yielding mathematics of minimum complexity: (1) The two bodies rebound from each other in a completely elastic collision. (2) The two bodies stick together and continue moving as a single particle. This second case is the case of completely inelastic collision. For both cases (1) and (2), momentum, mass, and total energy are conserved. However, kinetic energy is not conserved in cases of inelastic collision. A certain fraction of the initial kinetic energy is converted to heat.

In case (2), two masses with momentums p1 = m1v1 and p2 = m2v2 collide to produce a single particle of conserved mass m = m1 + m2 traveling at the center of mass velocity of the original system, vcm = (m1v1 + m2v2)/(m1 + m2). The total momentum p = p1 + p2 is conserved.

Fig. 2‑12 illustrates the inelastic collision of two particles from a relativistic perspective. The time components E1/c and E2/c add up to total E/c of the resultant vector, meaning that energy is conserved. Likewise, the space components p1 and p2 add up to form p of the resultant vector. The four-momentum is, as expected, a conserved quantity. However, the invariant mass of the fused particle, given by the point where the invariant hyperbola of the total momentum intersects the energy axis, is not equal to the sum of the invariant masses of the individual particles that collided. Indeed, it is larger than the sum of the individual masses: m > m1 + m2.[9]:94–97

Looking at the events of this scenario in reverse sequence, we see that non-conservation of mass is a common occurrence: when an unstable elementary particle spontaneously decays into two lighter particles, total energy is conserved, but the mass is not. Part of the mass is converted into kinetic energy.[40]:134–138

Topology of a spacetime continuum[edit]

Spacetimes can be modeled as various topological spaces.[further explanation needed]. The topology may be as simple as just a collection of coordinates, or it may have additional structure, due to theoretical demands.[which?] Various assumptions[which?] contained in the definition of a spacetime are usually justified by the following considerations.[clarification needed]

Many spacetime continua have physical interpretations which most physicists would consider bizarre or unsettling. For example, a compact spacetime has closed timelike curves, which violate our usual ideas of causality (that is, future events could affect past ones). For this reason, mathematical physicists usually consider only restricted subsets of all the possible spacetimes. One way to do this is to study realistic[clarification needed] solutions of the equations of general relativity. Another way is to add some additional physically reasonable[clarification needed] but still fairly general geometric restrictions and try to prove interesting[clarification needed] things about the resulting spacetimes. The latter approach has led to results such as the Penrose–Hawking singularity theorems.[46]

Connectedness serves two main purposes. First, different observers making measurements (represented by coordinate charts) should be able to compare their observations on the non-empty intersection of the charts. If the connectedness assumption were dropped, this would not be possible. Second, for a manifold, the properties of connectedness and path-connectedness are equivalent, and one requires the existence of paths (in particular, geodesics) in the spacetime to represent the motion of particles and radiation.[47]

Every spacetime is paracompact[why?]. This property, allied with the smoothness of the spacetime, gives rise[how?] to a smooth linear connection, an important structure in general relativity.[47]


Some important theorems on constructing spacetimes from compact and non-compact manifolds include the following:

  • A compact manifold can be turned into a spacetime if, and only if, its Euler characteristic is 0. (Proof idea: the existence of a Lorentzian metric is shown to be equivalent to the existence of a nonvanishing vector field.)
  • Any non-compact 4-manifold can be turned into a spacetime.[48]


A manifold is a topological space that locally resembles Euclidean space near each point. This is similar to how a globe appears flat at small scales.[49]

For physical reasons[which?], a spacetime continuum is frequently modeled as a four-dimensional, smooth, connected Lorentzian manifold . This means the smooth Lorentz metric has signature . The metric determines the geometry of spacetime, as well as determining the geodesics of particles and light beams. About each point (event) on this manifold, coordinate charts are used to represent observers in reference frames. Usually, Cartesian coordinates are used. Moreover, for simplicity's sake, units of measurement are usually chosen such that the speed of light is equal to 1.[50]

A reference frame (observer) can be identified with one of these coordinate charts; any such observer can describe any event . Another reference frame may be identified by a second coordinate chart about . Two observers (one in each reference frame) may describe the same event but obtain different descriptions.[50]

Usually, many overlapping coordinate charts are needed to cover a manifold. Given two coordinate charts, one containing (representing an observer) and another containing (representing another observer), the intersection of the charts represents the region of spacetime in which both observers can measure physical quantities and hence compare results. The relation between the two sets of measurements is given by a non-singular coordinate transformation on this intersection. The idea of coordinate charts as local observers who can perform measurements in their vicinity also makes good physical sense, as this is how one actually collects physical data—locally.[50]

For example, two observers, one of whom is on Earth, but the other one who is on a fast rocket to Jupiter, may observe a comet crashing into Jupiter (this is the event ). In general, they will disagree about the exact location and timing of this impact, i.e., they will have different 4-tuples (as they are using different coordinate systems). Although their kinematic descriptions will differ, dynamical (physical) laws, such as momentum conservation and the first law of thermodynamics, will still hold. In fact, relativity theory requires more than this in the sense that it stipulates these (and all other physical) laws must take the same form in all coordinate systems. This introduces tensors into relativity, by which all physical quantities are represented.

Geodesics are said to be time-like, null, or space-like if the tangent vector to one point of the geodesic is of this nature. Paths of particles and light beams in spacetime are represented by time-like and null (light-like) geodesics, respectively.[50]


Spacetime is sometimes represented on orbifolds.[further explanation needed]

See also[edit]


  1. ^ There are two sign conventions in use in the relativity literature:
    s2 = c2t2 − x2 − y2 − z2
    s2 = −c2t2 + x2 + y2 + z2
    These sign conventions are associated with the metric signatures (+ − − −) and (− + + +). A minor variation is to place the time coordinate last rather than first. Since it is unclear which sign convention is in greater use, one should simply accept the fact that these variations exist.
  2. ^ Even though it has been many decades since Terrell and Penrose published their observations, popular writings continue to conflate measurement versus appearance. For example, Michio Kaku wrote in Einstein's Cosmos (W. W. Norton & Company, 2004. p. 65): "... imagine that the speed of light is only 20 miles per hour. If a car were to go down the street, it might look compressed in the direction of motion, being squeezed like an accordion down to perhaps 1 inch in length."
  3. ^ In a Cartesian plane, ordinary rotation leaves a circle unchanged. In spacetime, hyperbolic rotation preserves the hyperbolic metric.
  4. ^ The Kündig experiment is an example of an experiment that was set up to measure transverse blueshift, with a Mössbauer source set up at the center of a centrifuge rotor and an absorber at the rim.


  1. ^ "Pacha: un concepto andino de espacio y tiempo. | Manga Quespi | Revista Española de Antropología Americana" (PDF). Retrieved 2016-12-17. 
  2. ^ Steele, Paul R.; Allen, Catherine J. (2004). Handbook of Inca Mythology. Santa Barbara, California: ABC-Clio. p. 86. ISBN 1576073548. 
  3. ^ Ardener, Shirley (1997). Women and space: ground rules and social maps (2nd ed.). Oxford: Berg. p. 36. ISBN 0854967281. 
  4. ^ a b c d e Collier, Peter (2014). A Most Incomprehensible Thing: Notes Towards a Very Gentle Introduction to the Mathematics of Relativity (2nd ed.). Incomprehensible Books. ISBN 9780957389458. 
  5. ^ Rynasiewicz, Robert. "Newton's Views on Space, Time, and Motion". Stanford Encyclopedia of Philosophy. Metaphysics Research Lab, Stanford University. Retrieved 24 March 2017. 
  6. ^ Davis, Philip J. (2006). Mathematics & Common Sense: A Case of Creative Tension. Wellesley, Massachusetts: A.K. Peters. p. 86. ISBN 9781439864326. 
  7. ^ Mould, Richard A. (1994). Basic Relativity (1st ed.). Springer. p. 42. ISBN 9780387952109. Retrieved 22 April 2017. 
  8. ^ Lerner, Lawrence S. (1997). Physics for Scientists and Engineers, Volume 2 (1st ed.). Jones & Bartlett Pub. p. 1047. ISBN 9780763704605. Retrieved 22 April 2017. 
  9. ^ a b c d e f g h i j k Bais, Sander (2007). Very Special Relativity: An Illustrated Guide. Cambridge, Massachusetts: Harvard University Press. ISBN 067402611X. 
  10. ^ "ARTFL Encyclopédie Search Results". Retrieved 17 December 2016. 
  11. ^ "Archibald : Time as a fourth dimension". 2007-07-03. Retrieved 2016-12-17. 
  12. ^ Bell, Eric Temple (1945). The Development of Mathematics. McGraw-Hill Inc. pp. 359–360. ISBN 9780070043305. 
  13. ^ Weyl, Hermann (1923). Raum Zeit Materie. Berlin: Springer-Verlag. p. 101. 
  14. ^ Clifford, William Kingdon (1876). "On the Space-Theory of Matter". Proceedings of the Cambridge philosophical society. 2: 157–158. 
  15. ^ a b Flood, Raymond; McCartney, Mark; Whitaker, Andrew (2014). James Clerk Maxwell: Perspectives on his Life and Work (1st ed.). Oxford: Oxford University Press. ISBN 9780199664375. 
  16. ^ Duhem, Pierre (1991). The Aim and Structure of Physical Theory. Princeton: Princeton University Press. p. 98. ISBN 069102524X. 
  17. ^ a b Carroll, Sean M. (2 December 1997). "Lecture Notes on General Relativity". University of California, Santa Barbara. Retrieved 15 April 2017. 
  18. ^ Hughes, Stefan (2013). Catchers of the Light: Catching Space: Origins, Lunar, Solar, Solar System and Deep Space. Paphos, Cyprus: ArtDeCiel Publishing. pp. 202–233. ISBN 9781467579926. Retrieved 7 April 2017. 
  19. ^ Stachel, John (2005). "Fresnel's (Dragging) Coefficient as a Challenge to 19th Century Optics of Moving Bodies.". In Kox, A. J.; Eisenstaedt, Jean. The Universe of General Relativity. Boston: Birkhäuser. pp. 1–13. ISBN 081764380X. Archived from the original (PDF) on 2017-04-13. 
  20. ^ a b French, A.P. (1968). Special Relativity. Boca Raton, Florida: CRC Press. pp. 35–60. ISBN 0748764224. 
  21. ^ a b c Pais, Abraham (1982). ""Subtle is the Lord-- ": The Science and the Life of Albert Einstein (11th ed.). Oxford: Oxford University Press. ISBN 019853907X. 
  22. ^ a b Miller, Arthur I. (1998). Albert Einstein's Special Theory of Relativity. New York: Springer-Verlag. ISBN 0387948708. 
  23. ^ a b c d e f g h i j k l Schutz, Bernard (2004). Gravity from the Ground Up: An Introductory Guide to Gravity and General Relativity (Reprint ed.). Cambridge: Cambridge University Press. ISBN 0521455065. Retrieved 24 May 2017. 
  24. ^ Taylor, Edwin F.; Wheeler, John Archibald (1966). Spacetime Physics: Introduction to Special Relativity (13th ed.). San Francisco: Freeman. pp. 17–22. ISBN 071670336X. Retrieved 14 April 2017. 
  25. ^ Scherr, Rachel E.; Shaffer, Peter S.; Vokos, Stamatis (July 2001). "Student understanding of time in special relativity: Simultaneity and reference frames" (PDF). American Journal of Physics. 69 (S1): S24–S35. doi:10.1119/1.1371254. Retrieved 11 April 2017. 
  26. ^ Curiel, Erik; Bokulich, Peter. "Lightcones and Causal Structure". Stanford Encyclopedia of Philosophy. Metaphysics Research Lab, Stanford University. Retrieved 26 March 2017. 
  27. ^ Savitt, Steven. "Being and Becoming in Modern Physics. 3. The Special Theory of Relativity". The Stanford Encyclopedia of Philosophy. Metaphysics Research Lab, Stanford University. Retrieved 26 March 2017. 
  28. ^ Terrell, James. "Invisibility of the Lorentz Contraction". Physical Reviewdate=15 November 1959. 116 (4): 1041–1045. Bibcode:1959PhRv..116.1041T. doi:10.1103/PhysRev.116.1041. 
  29. ^ Penrose, Roger (24 October 2008). "The Apparent Shape of a Relativistically Moving Sphere". Mathematical Proceedings of the Cambridge Philosophical Society. 55 (01): 137. Bibcode:1959PCPS...55..137P. doi:10.1017/S0305004100033776. 
  30. ^ Cook, Helen. "Relativistic Distortion". Mathematics Department, University of British Columbia. Retrieved 12 April 2017. 
  31. ^ Signell, Peter. "Appearances at Relativistic Speeds". Project PHYSNET. Michigan State University, East Lansing, MI. Archived from the original (PDF) on 12 April 2017. Retrieved 12 April 2017. 
  32. ^ Kraus, Ute. "The Ball is Round". Space Time Travel: Relativity visualized. Institut für Physik Universität Hildesheim. Archived from the original on 16 April 2017. Retrieved 16 April 2017. 
  33. ^ Zensus, J. Anton; Pearson, Timothy J. (1987). Superluminal Radio Sources (1st ed.). Cambridge, New York: Cambridge University Press. p. 3. ISBN 9780521345606. 
  34. ^ Chase, Scott I. "Apparent Superluminal Velocity of Galaxies". The Original Usenet Physics FAQ. Department of Mathematics, University of California, Riverside. Retrieved 12 April 2017. 
  35. ^ Richmond, Michael. ""Superluminal" motions in astronomical sources". Physics 200 Lecture Notes. School of Physics and Astronomy, Rochester Institute of Technology. Archived from the original on 19 April 2017. Retrieved 20 April 2017. 
  36. ^ Keel, Bill. "Jets, Superluminal Motion, and Gamma-Ray Bursts". Galaxies and the Universe - WWW Course Notes. Department of Physics and Astronomy, University of Alabama. Archived from the original on 29 April 2017. Retrieved 29 April 2017. 
  37. ^ a b c d e f Weiss, Michael. "The Twin Paradox". The Physics and Relativity FAQ. Retrieved 10 April 2017. 
  38. ^ Forshaw, Jeffrey; Smith, Gavin (2014). Dynamics and Relativity. John Wiley & Sons. p. 118. ISBN 9781118933299. Retrieved 24 April 2017. 
  39. ^ Landau, L. D.; Lifshitz, E. M. (2006). The Classical Theory of Fields, Course of Theoretical Physics, Volume 2 (4th ed.). Amsterdam: Elsevier. pp. 1–24. ISBN 9780750627689. 
  40. ^ a b c d e f g Morin, David (2017). Special Relativity for the Enthusiastic Beginner. CreateSpace Independent Publishing Platform. ISBN 9781542323512. 
  41. ^ Eilstein, Helena (2002). A Collection of Polish Works on Philosophical Problems of Time and Spacetime. Dordrecht: Kluwer Academic Publishers. p. 32. ISBN 9781402006708. Retrieved 30 January 2017. 
  42. ^ Hooft, G. T. (2001). "Obstacles on the way towards the quantisation of space, time and matter—And possible resolutions" (PDF). Studies in History and Philosophy of Science Part B: Studies in History and Philosophy of Modern Physics. 32 (2): 157–180. Retrieved 14 April 2017. 
  43. ^ Baez, John C. "An Introduction to Spin Foam Models of Quantum Gravity and BF Theory". Retrieved 14 April 2017. 
  44. ^ Griffiths, David J. (2013). Revolutions in Twentieth-Century Physics. Cambridge: Cambridge University Press. p. 60. ISBN 9781107602175. Retrieved 24 May 2017. 
  45. ^ Byers, Nina. "E. Noether's Discovery of the Deep Connection Between Symmetries and Conservation Laws". Cornell University. Retrieved 13 May 2017. 
  46. ^ Senovilla, José M M; Garfinkle, David (25 June 2015). "The 1965 Penrose singularity theorem". Classical and Quantum Gravity. 32 (12): 124008. doi:10.1088/0264-9381/32/12/124008. Retrieved 14 April 2017. 
  47. ^ a b Gray, R. Dale (2015). Manifolds, Groups, Bundles, and Spacetime. pp. 103–126. ISBN 9781329408258. Retrieved 14 April 2017. 
  48. ^ Israel, S.W.; Hawking, W. (1989). General Relativity: An Einstein Centenary Survey (1st ed.). Cambridge: Cambridge University Press. p. 219. ISBN 0521299284. 
  49. ^ Rowland, Todd. "Manifold". Wolfram Mathworld. Wolfram Research. Retrieved 24 March 2017. 
  50. ^ a b c d Bär, Christian; Fredenhagen, Klaus (2009). "Lorentzian Manifolds". Quantum Field Theory on Curved Spacetimes: Concepts and Mathematical Foundations. Dordrecht: Springer. pp. 39–58. ISBN 9783642027796. Archived from the original (PDF) on 13 April 2017. Retrieved 14 April 2017. 

Further reading[edit]

External links[edit]