Cartesian coordinate system
|This article needs additional citations for verification. (June 2012)|
A Cartesian coordinate system is a coordinate system that specifies each point uniquely in a plane by a pair of numerical coordinates, which are the signed distances from the point to two fixed perpendicular directed lines, measured in the same unit of length. Each reference line is called a coordinate axis or just axis of the system, and the point where they meet is its origin, usually at ordered pair (0, 0). The coordinates can also be defined as the positions of the perpendicular projections of the point onto the two axes, expressed as signed distances from the origin.
One can use the same principle to specify the position of any point in three-dimensional space by three Cartesian coordinates, its signed distances to three mutually perpendicular planes (or, equivalently, by its perpendicular projection onto three mutually perpendicular lines). In general, n Cartesian coordinates (an element of real n-space) specify the point in an n-dimensional Euclidean space for any dimension n. These coordinates are equal, up to sign, to distances from the point to n mutually perpendicular hyperplanes.
The invention of Cartesian coordinates in the 17th century by René Descartes (Latinized name: Cartesius) revolutionized mathematics by providing the first systematic link between Euclidean geometry and algebra. Using the Cartesian coordinate system, geometric shapes (such as curves) can be described by Cartesian equations: algebraic equations involving the coordinates of the points lying on the shape. For example, a circle of radius 2 may be described as the set of all points whose coordinates x and y satisfy the equation x2 + y2 = 4.
Cartesian coordinates are the foundation of analytic geometry, and provide enlightening geometric interpretations for many other branches of mathematics, such as linear algebra, complex analysis, differential geometry, multivariate calculus, group theory, and more. A familiar example is the concept of the graph of a function. Cartesian coordinates are also essential tools for most applied disciplines that deal with geometry, including astronomy, physics, engineering, and many more. They are the most common coordinate system used in computer graphics, computer-aided geometric design, and other geometry-related data processing.
- 1 History
- 2 Definitions
- 3 Notations and conventions
- 4 Cartesian space
- 5 Cartesian formulas for the plane
- 6 Orientation and handedness
- 7 Representing a vector in the standard basis
- 8 Applications
- 9 See also
- 10 Notes
- 11 References
- 12 Further reading
- 13 External links
The idea of this system was developed in 1637 in writings by Descartes and independently by Pierre de Fermat, although Fermat also worked in three dimensions, and did not publish the discovery. Both authors used a single axis in their treatments and have a variable length measured in reference to this axis. The concept of using a pair of axes was introduced later, after Descartes' La Géométrie was translated into Latin in 1649 by Frans van Schooten and his students. These commentators introduced several concepts while trying to clarify the ideas contained in Descartes' work.
Nicole Oresme, a French cleric and friend of the Dauphin (later to become King Charles V) of the 14th Century, used constructions similar to Cartesian coordinates well before the time of Descartes and Fermat.
Choosing a Cartesian coordinate system for a one-dimensional space—that is, for a straight line—means choosing a point O of the line (the origin), a unit of length, and an orientation for the line. An orientation chooses which of the two half-lines determined by O is the positive, and which is negative; we then say that the line "is oriented" (or "points") from the negative half towards the positive half. Then each point p of the line can be specified by its distance from O, taken with a + or − sign depending on which half-line contains p.
A line with a chosen Cartesian system is called a number line. Every real number, whether integer, rational, or irrational, has a unique location on the line. Conversely, every point on the line can be interpreted as a number in an ordered continuum which includes the real numbers.
Cartesian coordinates in two dimensions
The modern Cartesian coordinate system in two dimensions (also called a rectangular coordinate system) is defined by an ordered pair of perpendicular lines (axes), a single unit of length for both axes, and an orientation for each axis. (Early systems allowed "oblique" axes, that is, axes that did not meet at right angles.) The lines are commonly referred to as the x and y-axes where the x-axis is taken to be horizontal and the y-axis is taken to be vertical. The point where the axes meet is taken as the origin for both, thus turning each axis into a number line. For a given point P, a line is drawn through P perpendicular to the x-axis to meet it at X and second line is drawn through P perpendicular to the y-axis to meet it at Y. The coordinates of P are then X and Y interpreted as numbers x and y on the corresponding number lines. The coordinates are written as an ordered pair (x, y).
The point where the axes meet is the common origin of the two number lines and is simply called the origin. It is often labeled O and if so then the axes are called Ox and Oy. A plane with x and y-axes defined is often referred to as the Cartesian plane or xy plane. The value of x is called the x-coordinate or abscissa and the value of y is called the y-coordinate or ordinate.
The choices of letters come from the original convention, which is to use the latter part of the alphabet to indicate unknown values. The first part of the alphabet was used to designate known values.
Cartesian coordinates in three dimensions
Choosing a Cartesian coordinate system for a three-dimensional space means choosing an ordered triplet of lines (axes), any two of them being perpendicular; a single unit of length for all three axes; and an orientation for each axis. As in the two-dimensional case, each axis becomes a number line. The coordinates of a point p are obtained by drawing a line through p perpendicular to each coordinate axis, and reading the points where these lines meet the axes as three numbers of these number lines.
Alternatively, the coordinates of a point p can also be taken as the (signed) distances from p to the three planes defined by the three axes. If the axes are named x, y, and z, then the x coordinate is the distance from the plane defined by the y and z axes. The distance is to be taken with the + or − sign, depending on which of the two half-spaces separated by that plane contains p. The y and z coordinates can be obtained in the same way from the (x,z) and (x,y) planes, respectively.
One can generalize the concept of Cartesian coordinates to allow axes that are not perpendicular to each other, and/or different units along each axis. In that case, each coordinate is obtained by projecting the point onto one axis along a direction that is parallel to the other axis (or, in general, to the hyperplane defined by all the other axes). In those oblique coordinate systems the computations of distances and angles is more complicated than in standard Cartesian systems, and many standard formulas (such as the Pythagorean formula for the distance) do not hold.
Notations and conventions
The Cartesian coordinates of a point are usually written in parentheses and separated by commas, as in (10,5) or (3,5,7). The origin is often labelled with the capital letter O. In analytic geometry, unknown or generic coordinates are often denoted by the letters x and y on the plane, and x, y, and z in three-dimensional space. w is often used for four-dimensional space, but the rarity of such usage precludes concrete convention here. This custom comes from an old convention of algebra, to use letters near the end of the alphabet for unknown values (such as were the coordinates of points in many geometric problems), and letters near the beginning for given quantities.
These conventional names are often used in other domains, such as physics and engineering. However, other letters may be used too. For example, in a graph showing how a pressure varies with time, the graph coordinates may be denoted t and P. Each axis is usually named after the coordinate which is measured along it; so one says the x-axis, the y-axis, the t-axis, etc.
Another common convention for coordinate naming is to use subscripts, as in x1, x2, ... xn for the n coordinates in an n-dimensional space; especially when n is greater than 3, or variable. Some authors (and many programmers) prefer the numbering x0, x1, ... xn−1. These notations are especially advantageous in computer programming: by storing the coordinates of a point as an array, instead of a record, one can use iterative commands or procedure parameters instead of repeating the same commands for each coordinate.
In mathematical illustrations of two-dimensional Cartesian systems, the first coordinate (traditionally called the abscissa) is measured along a horizontal axis, oriented from left to right. The second coordinate (the ordinate) is then measured along a vertical axis, usually oriented from bottom to top.
However, in computer graphics and image processing one often uses a coordinate system with the y axis pointing down (as displayed on the computer's screen). This convention developed in the 1960s (or earlier) from the way that images were originally stored in display buffers.
For three-dimensional systems, a convention is to portray the x-y plane horizontally, with the z axis added to represent height (positive up). Furthermore, there is a convention to orient "x" facing toward the viewer, biased either to the right or left. If a diagram (3D projection or 2D perspective drawing) shows the x and y axis horizontally and vertically, respectively, then the z axis should be shown pointing "out of the page" towards the viewer or camera. In such a 2D diagram of a 3D coordinate system, the z axis would appear as a line or ray pointing down and to the left or down and to the right, depending on the presumed viewer or camera perspective. In any diagram or display, the orientation of the three axes, as a whole, is arbitrary. However, the orientation of the axes relative to each other should always comply with the right-hand rule, unless specifically stated otherwise. All laws of physics and math assume this right-handedness, which ensures consistency. For 3D diagrams, the names "abscissa" and "ordinate" are rarely used for x and y, respectively. When they are, the z-coordinate is sometimes called the applicate.
The words abscissa, ordinate and applicate are sometimes used to refer to coordinate axes rather than values.
Quadrants and octants
The axes of a two-dimensional Cartesian system divide the plane into four infinite regions, called quadrants, each bounded by two half-axes. These are often numbered from 1st to 4th and denoted by Roman numerals: I (where the signs of the two coordinates are I (+,+), II (−,+), III (−,−), and IV (+,−). When the axes are drawn according to the mathematical custom, the numbering goes counter-clockwise starting from the upper right ("northeast") quadrant.
Similarly, a three-dimensional Cartesian system defines a division of space into eight regions or octants, according to the signs of the coordinates of the points. The convention used for naming a specific octant is to list its signs, e.g. (+ + +) or ( - + - ). The n-dimensional generalization of the quadrant and octant is the orthant, and the same naming system applies.
A Euclidean plane with a chosen Cartesian system is called a Cartesian plane. Since Cartesian coordinates are unique and non-ambiguous, the points of a Cartesian plane can be identified with all possible pairs of real numbers; that is with the Cartesian product , where is the set of all reals. In the same way one defines a Cartesian space of any dimension n, whose points can be identified with the tuples (lists) of n real numbers, that is, with .
Cartesian formulas for the plane
Distance between two points
The Euclidean distance between two points of the plane with Cartesian coordinates and is
This is the Cartesian version of Pythagoras' theorem. In three-dimensional space, the distance between points and is
which can be obtained by two consecutive applications of Pythagoras' theorem.
The Euclidean transformations or Euclidean motions are the (bijective) mappings of points of the Euclidean plane to themselves which preserve distances between points. There are four types of these mappings (also called isometries): translations, rotations, reflections and glide reflections.
Translating a set of points of the plane, preserving the distances and directions between them, is equivalent to adding a fixed pair of numbers (a,b) to the Cartesian coordinates of every point in the set. That is, if the original coordinates of a point are (x,y), after the translation they will be
If (x, y) are the Cartesian coordinates of a point, then (−x, y) are the coordinates of its reflection across the second coordinate axis (the Y axis), as if that line were a mirror. Likewise, (x, −y) are the coordinates of its reflection across the first coordinate axis (the X axis). In more generality, reflection across a line through the origin making an angle with the x-axis, is equivalent to replacing every point with coordinates (x,y) by the point with coordinates (x',y'), where
A glide reflection is the composition of a reflection across a line followed by a translation in the direction of that line. It can be seen that the order of these operations does not matter (the translation can come first, followed by the reflection).
General matrix form of the transformations
These Euclidean transformations of the plane can all be described in a uniform way by using matrices. The result of applying a Euclidean transformation to a point is given by the formula
- [Note the use of row vectors for point coordinates and that the matrix is written on the right.]
To be orthogonal, the matrix A must have orthogonal rows with same Euclidean length of one, that is,
This is equivalent to saying that A times its transpose must be the identity matrix. If these conditions do not hold, the formula describes a more general affine transformation of the plane provided that the determinant of A is not zero.
A reflection or glide reflection is obtained when,
Assuming that translation is not used transformations can be combined by simply multiplying the associated transformation matrices.
Another way to represent coordinate transformations in Cartesian coordinates is through affine transformations. In affine transformations an extra dimension is added and all points are given a value of 1 for this extra dimension. The advantage of doing this is that point translations can be specified in the final column of matrix A. In this way, all of the euclidean transformations become transactable as matrix • point multiplications. The affine transformation is given by:
- [Note the A matrix from above was transposed. The matrix is on the left and column vectors for point coordinates are used.]
Using affine transformations multiple different euclidean transformations including translation can be combined by simply multiplying the corresponding matrices.
An example of an affine transformation which is not a Euclidean motion is given by scaling. To make a figure larger or smaller is equivalent to multiplying the Cartesian coordinates of every point by the same positive number m. If (x,y) are the coordinates of a point on the original figure, the corresponding point on the scaled figure has coordinates
If m is greater than 1, the figure becomes larger; if m is between 0 and 1, it becomes smaller.
A shearing transformation will push the top of a square sideways to form a parallelogram. Horizontal shearing is defined by:
Shearing can also be applied vertically:
Orientation and handedness
In two dimensions
Fixing or choosing the x-axis determines the y-axis up to direction. Namely, the y-axis is necessarily the perpendicular to the x-axis through the point marked 0 on the x-axis. But there is a choice of which of the two half lines on the perpendicular to designate as positive and which as negative. Each of these two choices determines a different orientation (also called handedness) of the Cartesian plane.
The usual way of orienting the axes, with the positive x-axis pointing right and the positive y-axis pointing up (and the x-axis being the "first" and the y-axis the "second" axis) is considered the positive or standard orientation, also called the right-handed orientation.
A commonly used mnemonic for defining the positive orientation is the right hand rule. Placing a somewhat closed right hand on the plane with the thumb pointing up, the fingers point from the x-axis to the y-axis, in a positively oriented coordinate system.
The other way of orienting the axes is following the left hand rule, placing the left hand on the plane with the thumb pointing up.
When pointing the thumb away from the origin along an axis towards positive, the curvature of the fingers indicates a positive rotation along that axis.
Regardless of the rule used to orient the axes, rotating the coordinate system will preserve the orientation. Switching any two axes will reverse the orientation.
In three dimensions
Once the x- and y-axes are specified, they determine the line along which the z-axis should lie, but there are two possible directions on this line. The two possible coordinate systems which result are called 'right-handed' and 'left-handed'. The standard orientation, where the xy-plane is horizontal and the z-axis points up (and the x- and the y-axis form a positively oriented two-dimensional coordinate system in the xy-plane if observed from above the xy-plane) is called right-handed or positive.
The name derives from the right-hand rule. If the index finger of the right hand is pointed forward, the middle finger bent inward at a right angle to it, and the thumb placed at a right angle to both, the three fingers indicate the relative directions of the x-, y-, and z-axes in a right-handed system. The thumb indicates the x-axis, the index finger the y-axis and the middle finger the z-axis. Conversely, if the same is done with the left hand, a left-handed system results.
Figure 7 depicts a left and a right-handed coordinate system. Because a three-dimensional object is represented on the two-dimensional screen, distortion and ambiguity result. The axis pointing downward (and to the right) is also meant to point towards the observer, whereas the "middle" axis is meant to point away from the observer. The red circle is parallel to the horizontal xy-plane and indicates rotation from the x-axis to the y-axis (in both cases). Hence the red arrow passes in front of the z-axis.
Figure 8 is another attempt at depicting a right-handed coordinate system. Again, there is an ambiguity caused by projecting the three-dimensional coordinate system into the plane. Many observers see Figure 8 as "flipping in and out" between a convex cube and a concave "corner". This corresponds to the two possible orientations of the coordinate system. Seeing the figure as convex gives a left-handed coordinate system. Thus the "correct" way to view Figure 8 is to imagine the x-axis as pointing towards the observer and thus seeing a concave corner.
Representing a vector in the standard basis
A point in space in a Cartesian coordinate system may also be represented by a position vector, which can be thought of as an arrow pointing from the origin of the coordinate system to the point. If the coordinates represent spatial positions (displacements), it is common to represent the vector from the origin to the point of interest as . In two dimensions, the vector from the origin to the point with Cartesian coordinates (x, y) can be written as:
where , and are unit vectors in the direction of the x-axis and y-axis respectively, generally referred to as the standard basis (in some application areas these may also be referred to as versors). Similarly, in three dimensions, the vector from the origin to the point with Cartesian coordinates can be written as:
where is the unit vector in the direction of the z-axis.
There is no natural interpretation of multiplying vectors to obtain another vector that works in all dimensions, however there is a way to use complex numbers to provide such a multiplication. In a two dimensional cartesian plane, identify the point with coordinates (x, y) with the complex number z = x + iy. Here, i is the complex number whose square is the real number −1 and is identified with the point with coordinates (0,1), so it is not the unit vector in the direction of the x-axis (this confusion is just an unfortunate historical accident). Since the complex numbers can be multiplied giving another complex number, this identification provides a means to "multiply" vectors. In a three dimensional cartesian space a similar identification can be made with a subset of the quaternions.
Cartesian coordinates are an abstraction that have a multitude of possible applications in the real world. However, three constructive steps are involved in superimposing coordinates on a problem application. 1) Units of distance must be decided defining the spatial size represented by the numbers used as coordinates. 2) An origin must be assigned to a specific spatial location or landmark, and 3) the orientation of the axes must be defined using available directional cues for (n-1) of the n axes.
Consider as an example superimposing 3D Cartesian coordinates over all points on the Earth (i.e. geospatial 3D). What units make sense? Kilometers are a good choice, since the original definition of the kilometer was geospatial...10,000 km equalling the surface distance from Equator to North Pole. Where to place the origin? Based on symmetry, the gravitational center of the Earth suggests a natural landmark (which can be sensed via satellite orbits). Finally, how to orient X, Y and Z axis directions? The axis of Earth's spin provides a natural direction strongly associated with "up vs. down", so positive Z can adopt the direction from geocenter to North Pole. A location on the Equator is needed to define the X-axis, and the Prime Meridian stands out as a reference direction, so the X-axis takes the direction from geocenter out to [ 0 degrees longitude, 0 degrees latitude ]. Note that with 3 dimensions, and two perpendicular axes directions pinned down for X and Z, the Y-axis is determined by the first two choices. In order to obey the right hand rule, the Y-axis must point out from the geocenter to [ 90 degrees longitude, 0 degrees latitude ]. So what are the geocentric coordinates of the Empire State Building in New York City? Using [ longitude = -73.985656, latitude = 40.748433 ], Earth radius = 40,000 km / 2 pi, and transforming from spherical --> Cartesian coordinates, you can estimate the geocentric coordinates of the Empire State Building, [ x, y, z ] = [ 1330.53 km, -4635.75 km, 4155.46 km ]. GPS navigation relies on such geocentric coordinates.
In engineering projects, agreement on the definition of coordinates is a crucial foundation. One cannot assume that coordinates come predefined for a novel application, so knowledge of how to erect a coordinate system where there is none is essential to applying René Descartes' ingenious thinking.
While spatial apps employ identical units along all axes, in business and scientific apps, each axis may have different units of measurement associated with it (such as kilograms, seconds, pounds, etc.). Although four- and higher-dimensional spaces are difficult to visualize, the algebra of Cartesian coordinates can be extended relatively easily to four or more variables, so that certain calculations involving many variables can be done. (This sort of algebraic extension is what is used to define the geometry of higher-dimensional spaces.) Conversely, it is often helpful to use the geometry of Cartesian coordinates in two or three dimensions to visualize algebraic relationships between two or three of many non-spatial variables.
The graph of a function or relation is the set of all points satisfying that function or relation. For a function of one variable, f, the set of all points (x,y) where y = f(x) is the graph of the function f. For a function of two variables, g, the set of all points (x,y,z) where z = g(x,y) is the graph of the function g. A sketch of the graph of such a function or relation would consist of all the salient parts of the function or relation which would include its relative extrema, its concavity and points of inflection, any points of discontinuity and its end behavior. All of these terms are more fully defined in calculus. Such graphs are useful in calculus to understand the nature and behavior of a function or relation.
- Horizontal and vertical
- Jones diagram, which plots four variables rather than two.
- Orthogonal coordinates
- "analytic geometry". Encyclopædia Britannica (Encyclopædia Britannica Online ed.). 2008.
- Burton 2011, p. 374
- A Tour of the Calculus, David Berlinski
- Springer online reference Encyclopedia of Mathematics
- Smart 1998, Chap. 2
- Brannan, Esplen & Gray 1998, pg. 49
- Brannan, Esplen & Gray 1998, Appendix 2, pp. 377–382
- David J. Griffiths (1999). Introduction to Electrodynamics. Prentice Hall. ISBN 0-13-805326-X.
- Brennan, David A.; Esplen, Matthew F.; Gray, Jeremy J. (1998), Geometry, Cambridge: Cambridge University Press, ISBN 0-521-59787-0
- Burton, David M. (2011), The History of Mathematics/An Introduction (7th ed.), New York: McGraw-Hill, ISBN 978-0-07-338315-6
- Smart, James R. (1998), Modern Geometries (5th ed.), Pacific Grove: Brooks/Cole, ISBN 0-534-35188-3
- Descartes, René (2001). Discourse on Method, Optics, Geometry, and Meteorology. Trans. by Paul J. Oscamp (Revised ed.). Indianapolis, IN: Hackett Publishing. ISBN 0-87220-567-3. OCLC 488633510.
- Korn GA, Korn TM (1961). Mathematical Handbook for Scientists and Engineers (1st ed.). New York: McGraw-Hill. pp. 55–79. LCCN 59-14456. OCLC 19959906.
- Margenau H, Murphy GM (1956). The Mathematics of Physics and Chemistry. New York: D. van Nostrand. LCCN 55-10911.
- Moon P, Spencer DE (1988). "Rectangular Coordinates (x, y, z)". Field Theory Handbook, Including Coordinate Systems, Differential Equations, and Their Solutions (corrected 2nd, 3rd print ed.). New York: Springer-Verlag. pp. 9–11 (Table 1.01). ISBN 978-0-387-18430-2.
- Morse PM, Feshbach H (1953). Methods of Theoretical Physics, Part I. New York: McGraw-Hill. ISBN 0-07-043316-X. LCCN 52-11515.
- Sauer R, Szabó I (1967). Mathematische Hilfsmittel des Ingenieurs. New York: Springer Verlag. LCCN 67-25285.
- Cartesian Coordinate System
- Printable Cartesian Coordinates
- Cartesian coordinates, PlanetMath.org.
- MathWorld description of Cartesian coordinates
- Coordinate Converter – converts between polar, Cartesian and spherical coordinates
- Coordinates of a point Interactive tool to explore coordinates of a point