Inverse function

From Wikipedia, the free encyclopedia
  (Redirected from Inverse operation)
Jump to: navigation, search
A function f and its inverse f −1. Because f maps a to 3, the inverse f −1 maps 3 back to a.

In mathematics, an inverse function is a function that "reverses" another function: if the function f applied to an input x gives a result of y, then applying its inverse function g to y gives the result x, and vice versa. i.e., f(x) = y if and only if g(y) = x.

A function f that has an inverse is said to be invertible. When it exists, the inverse function is uniquely determined by f and is denoted by f −1, read f inverse. Superscripted "−1" does not, in general, refer to numerical exponentiation.

In some situations, for instance when f is an invertible real-valued function of a real variable, the relationship between f and f−1 can be written more compactly, in this case, f−1(f(x)) = x = f(f−1(x)), meaning f−1 composed with f, in either order, is the identity function on R.

Definitions[edit]

If f maps X to Y, then f −1 maps Y back to X.
See also: Inverse element

Let f be a function whose domain is the set X, and whose image (range) is the set Y. Then f is invertible if there exists a function g with domain Y and image X, with the property:

f(x) = y\,\,\Leftrightarrow\,\,g(y) = x.

If f is invertible, the function g is unique; in other words, there is exactly one function g satisfying this property (no more, no less). That function g is then called the inverse of f, and usually denoted as f −1.

Stated otherwise, a function is invertible if and only if its inverse relation is a function on the range Y, in which case the inverse relation is the inverse function.[1]

Not all functions have an inverse. For this rule to be applicable, each element yY must correspond to no more than one xX; a function f with this property is called one-to-one or an injection. If f and f −1 are functions on X and Y respectively, then both are bijections. The inverse of an injection that is not a bijection is a partial function, that means for some yY it is undefined.

Example: squaring and square root functions[edit]

The function f(x) = x2 may or may not be invertible, depending on what kinds of numbers are being considered (the "domain").

If the domain is the real numbers, then each possible result y corresponds to two different starting points in X: one positive and one negative (±x), and so this function is not invertible: as it is impossible to deduce from its output the sign of its input. Such a function is called non-injective or information-losing. Neither the square root nor the principal square root function is the inverse of x2 because the first is not single-valued, and the second returns x when x is negative.

If only positive numbers (and zero) are being considered, then the function is injective and invertible.

Inverses in higher mathematics[edit]

The definition given above is commonly adopted in set theory and calculus. In higher mathematics, the notation

f\colon X \to Y

means "f is a function mapping elements of a set X to elements of a set Y ". The source, X, is called the domain of f, and the target, Y, is called the codomain. The codomain contains the range of f as a subset, and is considered part of the definition of f.[2]

When using codomains, the inverse of a function f: XY is required to have domain Y and codomain X. For the inverse to be defined on all of Y, every element of Y must lie in the range of the function f. A function with this property is called onto or a surjection. Thus, a function with a codomain is invertible if and only if it is both injective (one-to-one) and surjective (onto). Such a function is called a one-to-one correspondence or a bijection, and has the property that every element yY corresponds to exactly one element xX.

Inverses and composition[edit]

If f is an invertible function with domain X and range Y, then

 f^{-1}\left( \, f(x) \, \right) = x, for every x \in X.

Using the composition of functions we can rewrite this statement as follows:

 f^{-1} \circ f = \mathrm{id}_X,

where idX is the identity function on the set X; that is, the function that leaves its argument unchanged. In category theory, this statement is used as the definition of an inverse morphism.

Considering function composition helps to understand the notation f −1. (Repeatedly) composing a function with itself is called iteration, and is denoted fn(x) if f is applied n times, starting with the value x; so f 2(x) = f (f (x)), etc. Since f −1(f (x)) = x, composing f −1 and fn yields fn−1, "undoing" the effect of one application of f.

The notation can also be linked to regular multiplication, by considering multiplication functions fx(y) = xy. Applying fx−1 to fx(y) gives y, which is the same as dividing xy by x, or multiplying by x −1.

Note on notation[edit]

The superscript notation for inverses can sometimes be confused with other uses of superscripts, especially when dealing with trigonometric and hyperbolic functions. To avoid this confusion, the notations f [−1] or with the "−1" above the f are sometimes used.[citation needed]

Whereas the notation f −1(x) might be misunderstood, f(x)−1 certainly denotes the multiplicative inverse of f(x) and has nothing to do with inversion of f.

The expression sin−1 x does not represent the multiplicative inverse to sin x,[3] but the inverse of the sine function applied to x (actually a partial inverse; see below). To avoid confusion, an inverse trigonometric function is often indicated by the prefix "arc". For instance, the inverse of the sine function is typically called the arcsine function, written as arcsin, which is, like sin, conventionally denoted in roman type and not in italics (note that software libraries of mathematical functions often use the name asin):

\sin^{-1} x = \arcsin x.

The function (sin x) −1 is the multiplicative inverse to the sine, and is called the cosecant. It is usually denoted csc x:

 \csc x = (\sin x)^{-1} = \frac{1}{\sin x}.

Hyperbolic functions behave similarly, using the prefix "ar" for their inverse functions, as in arsinh for the inverse function of sinh, and csch x for the multiplicative inverse of sinh x.

Non-example: inverse operations that lead to inverse functions[edit]

In the context of proportionality, direct variation functions represent a relationship between x and y such that the quotient of the two variables equal a constant, k. Thus, the direct variation function is as follows: y = kx. An alternative view of this equation is the slope-intercept form, where k is the slope and always positive.

The inverse variation function represents an inverted relationship between x and y when compared to their relationship in direct variation functions. This notion is not to be confused with finding the inverse function of the direct variation function. The inverse variation function simply implies that as the value of one variable increases the other variable decreases. The function for this relationship cannot be found by finding the inverse of the direct variation function because the result will yield another linear function with a slope of, which is a positive value. Instead, the product of the two variables should always produce a constant. Thus, the inverse variation function is as follows: y = k/x . As x increases, a larger number is dividing the constant k, so y is approaching 0. [4]

Non-example: percentages[edit]

Despite their familiarity, percentage changes do not have a straightforward inverse. That is, an X per cent fall is not the inverse of an X per cent rise.

Properties[edit]

Uniqueness[edit]

If an inverse function exists for a given function f, it is unique: it must be the inverse relation.

Symmetry[edit]

There is a symmetry between a function and its inverse. Specifically, if f is an invertible function with domain X and range Y, then its inverse f −1 has domain Y and range X, and the inverse of f −1 is the original function f. In symbols, for f a function with domain X and range Y, and g a function with domain Y and range X:

\begin{align}
 &\text{If }   &g \circ f = \mathrm{id}_X\text{,} \\
 &\text{then } &f \circ g = \mathrm{id}_Y\text{.}
\end{align}

This follows from the connection between function inverse and relation inverse, because inversion of relations is an involution.

This statement is an obvious consequence of the deduction that for f to be invertible it must be injective (first definition of the inverse) or bijective (second definition). The property of involutive symmetry can be concisely expressed by the following formula:

\left(f^{-1}\right)^{-1} = f .
The inverse of g ∘ f is f −1 ∘ g −1.

The inverse of a composition of functions is given by the formula

(g \circ f)^{-1} = f^{-1} \circ g^{-1}

Notice that the order of g and f have been reversed; to undo f followed by g, we must first undo g and then undo f.

For example, let f(x) = 3x and let g(x) = x + 5. Then the composition g ∘ f is the function that first multiplies by three and then adds five:

(g \circ f)(x) = 3x + 5

To reverse this process, we must first subtract five, and then divide by three:

(g \circ f)^{-1}(y) = \tfrac13(y - 5)

This is the composition (f −1 ∘ g −1)(y).

Self-inverses[edit]

If X is a set, then the identity function on X is its own inverse:

\mathrm{id}_X^{-1} = \mathrm{id}_X

More generally, a function f : XX is equal to its own inverse if and only if the composition f ∘ f is equal to idX. Such a function is called an involution.

Inverses in calculus[edit]

Single-variable calculus is primarily concerned with functions that map real numbers to real numbers. Such functions are often defined through formulas, such as:

f(x) = (2x + 8)^3 .

A function f from the real numbers to the real numbers possesses an inverse as long as it is one-to-one, i.e. as long as the graph of y = f(x) has, for each possible y value only one corresponding x value, and thus passes the horizontal line test.

The following table shows several standard functions and their inverses:

Function f(x) Inverse f −1(y) Notes
x + a y a
ax ay
mx y/m m ≠ 0
1/x 1/y x, y ≠ 0
x2 y x, y ≥ 0 only
x3 3y no restriction on x and y
xp y1/p (i.e. py) x, y ≥ 0 in general, p ≠ 0
ex lny y > 0
ax logay y > 0 and a > 0
trigonometric functions inverse trigonometric functions various restrictions (see table below)

Formula for the inverse[edit]

One approach to finding a formula for f −1, if it exists, is to solve the equation y = f(x) for x. For example, if f is the function

f(x) = (2x + 8)^3

then we must solve the equation y = (2x + 8)3 for x:

\begin{align}
      y         & = (2x+8)^3 \\
  \sqrt[3]{y}   & = 2x + 8   \\
\sqrt[3]{y} - 8 & = 2x       \\
\dfrac{\sqrt[3]{y} - 8}{2} & = x .
\end{align}

Thus the inverse function f −1 is given by the formula

f^{-1}(y) = \dfrac{\sqrt[3]{y} - 8}{2} .

Sometimes the inverse of a function cannot be expressed by a formula with a finite number of terms. For example, if f is the function

f(x) = x - \sin x ,

then f is one-to-one, and therefore possesses an inverse function f −1. The formula for this inverse has an infinite number of terms:

 f^{-1}(y) =
\displaystyle \sum_{n=1}^{\infty}
 {\frac{y^{\frac{n}{3}}}{n!}} \lim_{ \theta \to 0} \left(
 \frac{\mathrm{d}^{\,n-1}}{\mathrm{d} \theta^{\,n-1}} \left(
 \frac{ \theta }{ \sqrt[3]{ \theta - \sin( \theta )} } ^n \right)
\right)

Graph of the inverse[edit]

The graphs of y = f(x) and y = f −1(x). The dotted line is y = x.

If f is invertible, then the graph of the function

y = f^{-1}(x)

is the same as the graph of the equation

x = f(y) .

This is identical to the equation y = f(x) that defines the graph of f, except that the roles of x and y have been reversed. Thus the graph of f −1 can be obtained from the graph of f by switching the positions of the x and y axes. This is equivalent to reflecting the graph across the line y = x.

Inverses and derivatives[edit]

A continuous function f is one-to-one (and hence invertible) if and only if it is either strictly increasing or decreasing (with no local maxima or minima). For example, the function

f(x) = x^3 + x

is invertible, since the derivative f′(x) = 3x2 + 1 is always positive.

If the function f is differentiable, then the inverse f −1 will be differentiable as long as f′(x) ≠ 0. The derivative of the inverse is given by the inverse function theorem:

\left(f^{-1}\right)^\prime (y)  = \frac{1}{f'\left(f^{-1}(y)\right)} .

If we set x = f −1(y), then the formula above can be written

\frac{dx}{dy} = \frac{1}{dy / dx} .

This result follows from the chain rule (see the article on inverse functions and differentiation).

The inverse function theorem can be generalized to functions of several variables. Specifically, a differentiable multivariable function f : RnRn is invertible in a neighborhood of a point p as long as the Jacobian matrix of f at p is invertible. In this case, the Jacobian of f −1 at f(p) is the matrix inverse of the Jacobian of f at p.

Real-world examples[edit]

1. Let f be the function that converts a temperature in degrees Celsius to a temperature in degrees Fahrenheit:

 F = f(C) = \tfrac95 C + 32 ;

then its inverse function converts degrees Fahrenheit to degrees Celsius:

 C = f^{-1}(F) = \tfrac59 (F - 32) ,

since

 f^{-1}\left( \, f(C) \, \right) = f^{-1}\left( \, \tfrac95 C + 32 \, \right) = \tfrac59 \left( \left( \, \tfrac95 C + 32 \, \right) - 32 \right) =  C\text{, for every }C\text{.}

2. Suppose f assigns each child in a family its birth year. An inverse function would output which child was born in a given year. However, if the family has twins (or triplets) then the output cannot be known when the input is the common birth year. As well, if a year is given in which no child was born then a child cannot be named. But if each child was born in a separate year, and if we restrict attention to the three years in which a child was born, then we do have an inverse function. For example,

\begin{align}
 f(\text{Allan})&=2005 , \quad & f(\text{Brad})&=2007 , \quad & f(\text{Cary})&=2001 \\
 f^{-1}(2005)&=\text{Allan} , \quad & f^{-1}(2007)&=\text{Brad} , \quad & f^{-1}(2001)&=\text{Cary}
\end{align}

3. Let R be the function that leads to an x percentage rise of some quantity, and F be the function producing an x percentage fall. Applied to $100 with x = 10%, we find that applying the first function followed by the second does not restore the original value of $100, demonstrating the fact that, despite appearances, these two functions are not inverses of each other.

Generalizations[edit]

Partial inverses[edit]

The square root of x is a partial inverse to f(x) = x2.

Even if a function f is not one-to-one, it may be possible to define a partial inverse of f by restricting the domain. For example, the function

f(x) = x^2

is not one-to-one, since x2 = (−x)2. However, the function becomes one-to-one if we restrict to the domain x ≥ 0, in which case

f^{-1}(y) = \sqrt{y} .

(If we instead restrict to the domain x ≤ 0, then the inverse is the negative of the square root of y.) Alternatively, there is no need to restrict the domain if we are content with the inverse being a multivalued function:

f^{-1}(y) = \pm\sqrt{y} .
The inverse of this cubic function has three branches.

Sometimes this multivalued inverse is called the full inverse of f, and the portions (such as x and −x) are called branches. The most important branch of a multivalued function (e.g. the positive square root) is called the principal branch, and its value at y is called the principal value of f −1(y).

For a continuous function on the real line, one branch is required between each pair of local extrema. For example, the inverse of a cubic function with a local maximum and a local minimum has three branches (see the picture to the right).

The arcsine is a partial inverse of the sine function.

These considerations are particularly important for defining the inverses of trigonometric functions. For example, the sine function is not one-to-one, since

\sin(x + 2\pi) = \sin(x)

for every real x (and more generally sin(x + 2πn) = sin(x) for every integer n). However, the sine is one-to-one on the interval [−π/2, π/2], and the corresponding partial inverse is called the arcsine. This is considered the principal branch of the inverse sine, so the principal value of the inverse sine is always between −π/2 and π/2. The following table describes the principal branch of each inverse trigonometric function:

function Range of usual principal value
sin−1 π/2 ≤ sin−1(x) ≤ π/2
cos−1 0 ≤ cos−1(x) ≤ π
tan−1 π/2 < tan−1(x) < π/2
cot−1 0 < cot−1(x) < π
sec−1 0 ≤ sec−1(x) ≤ π
csc−1 π/2 ≤ csc−1(x) ≤ π/2

Left and right inverses[edit]

If f: XY, a left inverse for f (or retraction of f) is a function g: YX such that

g \circ f = \mathrm{id}_X .

That is, the function g satisfies the rule

If \displaystyle f(x) = y, then \displaystyle g(y) = x .

Thus, g must equal the inverse of f on the image of f, but may take any values for elements of Y not in the image. A function f with a left inverse is necessarily injective. In classical mathematics, every injective function f necessarily has a left inverse; however, this may fail in constructive mathematics. For instance, a left inverse of the inclusion {0,1} → R of the two-element set in the reals violates indecomposability by giving a retraction of the real line to the set {0,1} .

A right inverse for f (or section of f) is a function h: YX such that

f \circ h = \mathrm{id}_Y .

That is, the function h satisfies the rule

If \displaystyle h(y) = x, then \displaystyle f(x) = y .

Thus, h(y) may be any of the elements of X that map to y under f. A function f has a right inverse if and only if it is surjective (though constructing such an inverse in general requires the axiom of choice).

An inverse which is both a left and right inverse must be unique. Likewise, if g is a left inverse for f, then g may or may not be a right inverse for f; and if g is a right inverse for f, then g is not necessarily a left inverse for f. For example let f: R[0, ∞) denote the squaring map, such that f(x) = x2 for all x in R, and let g: [0, ∞)R denote the square root map, such that g(x) = x for all x ≥ 0. Then f(g(x)) = x for all x in [0, ∞); that is, g is a right inverse to f. However, g is not a left inverse to f, since, e.g., g(f(−1)) = 1 ≠ −1.

Preimages[edit]

If f: XY is any function (not necessarily invertible), the preimage (or inverse image) of an element yY is the set of all elements of X that map to y:

f^{-1}(y) = \left\{ x\in X : f(x) = y \right\} .

The preimage of y can be thought of as the image of y under the (multivalued) full inverse of the function f.

Similarly, if S is any subset of Y, the preimage of S is the set of all elements of X that map to S:

f^{-1}(S) = \left\{ x\in X : f(x) \in S \right\} .

For example, take a function f: RR, where f: xx2. This function is not invertible for reasons discussed above. Yet preimages may be defined for subsets of the codomain:

f^{-1}(\left\{1,4,9,16\right\}) = \left\{-4,-3,-2,-1,1,2,3,4\right\}

The preimage of a single element yY – a singleton set {y}  – is sometimes called the fiber of y. When Y is the set of real numbers, it is common to refer to f −1(y) as a level set.

See also[edit]

Notes[edit]

  1. ^ Smith, Eggen & St. Andre 2006, p. 202, Theorem 4.9
  2. ^ Smith, Eggen & St. Andre 2006, p. 179
  3. ^ Thomas 1972, pp. 304-309
  4. ^ Khan, Salman. "Direct and Inverse Variation". Retrieved 2014-05-01. 

References[edit]

  • Smith, Douglas; Eggen, Maurice; St. Andre, Richard (2006), A Transition to Advanced Mathematics (6th ed.), Thompson Brooks/Cole, ISBN 978-0-534-39900-9 
  • Thomas, Jr., George B. (1972), Calculus and Analytic Geometry Part 1: Functions of One Variable and Analytic Geometry (Alternate ed.), Addison-Wesley 

Further reading[edit]

External links[edit]