Jump to content

Implicit function theorem

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Haseldon (talk | contribs) at 06:52, 31 October 2007 (→‎See also: see also). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

In the branch of mathematics called multivariable calculus, the implicit function theorem is a tool which allows relations to be converted to functions. It does this by representing the relation as the graph of a function. There may not be a single function whose graph is the entire relation, but there may be such a function on a restriction of the domain of the relation. The implicit function theorem gives a sufficient condition to ensure that there is such a function.

The theorem states that if the equation R(x, y) = 0 (an Implicit function) satisfies some mild conditions on its partial derivatives, then one can in principle solve this equation for y, at least over some small interval. Geometrically, the graph defined by R(x,y) = 0 will overlap locally with the graph of a function y = f(x).

Example

A circle specified by the implicit function . Around point A y can be expressed as a function of x, specifically . No such function exists for point B.

Consider the unit circle. If we define the function as , then the relation cuts out the unit circle. Explicitly, the unit circle is the set . There is no way to represent the unit circle as the graph of a function because for each choice of , there are two choices of . Namely, and .

However, it is possible to represent part of the circle as a function. If we let for , then the graph of provides the upper half of the circle. Similarly, if , then the graph of gives the lower half of the circle.

It is not possible to find a function which will cut out a neighbourhood of or . Any neighbourhood of or contains both the upper and lower halves of the circle. Because functions must be single-valued, there is no way of writing both the upper and lower halves using one function . Consequently, there is no function whose graph looks like a neighbourhood of or . In these two cases, the conclusion of the implicit function theorem fails.

The purpose of the implicit function theorem is to tell us the existence of functions like and in situations where we cannot write down explicit formulas. It guarantees that and are differentiable, and it even works in situations where we do not have a formula for .

Statement of the theorem

Let f : Rn+mRm be a continuously differentiable function. We think of Rn+m as the cartesian product Rn × Rm, and we write a point of this product as (x1, ..., xn, y1, ..., ym). f is the given relation. Our goal is to construct a function g : RnRm whose graph (x1, ..., xn, g(x1, ..., xn)) is precisely the set of all (x1, ..., xn, y1, ..., ym) such that f(x1, ..., xn, y1, ..., ym) = 0.

As noted above, this may not always be possible. As such, we will fix a point (a1, ..., an, b1, ..., bm) which satisfies f(a1, ..., an, b1, ..., bm) = 0, and we will ask for a g that works near the point (a1, ..., an, b1, ..., bm). In other words, we want an open set U of Rn, an open set V of Rm, and a function g : UV such that the graph of g equals the relation f = 0 on U × V. In symbols,

To state the implicit function theorem, we need the Jacobian, also called the differential or total derivative, of . This is the matrix of partial derivatives of . Abbreviating to , the Jacobian matrix is

where is the matrix of partial derivatives in the 's and is the matrix of partial derivatives in the 's. The implicit function theorem says that if is an invertible matrix, then there are , , and as desired. Writing all the hypotheses together gives the following statement.

Let f : Rn+mRm be a continuously differentiable function, and let Rn+m have coordinates (x1,...,xn, y1, ..., ym). Fix a point (a1,...,an,b1,...,bm) = (a,b) with f(a,b)=c, where cRm. If the matrix [(∂fi/∂yj)(a,b)] is invertible, then there exists an open set U containing (a1,..., an), an open set V containing (b1,...,bm), and a differentiable function g:UV such that

Example

Lets go back to the example of the unit circle. In this case and . The matrix of partial derivatives is just a -matrix, given by

Thus, here, Y is just a number; the linear map defined by it is invertible iff . By the implicit function theorem we see that we can write the circle in the form for all points where . For and we run into trouble, as noted before.

Example

Suppose we have an m-dimensional space, parametrised by a set of coordinates . We can introduce a new coordinate system by giving m functions . These functions allow to calculate the new coordinates of a point, given the old coordinates . One might want to verify if the opposite is possible: given coordinates , can we 'go back' and calculate ? The implicit function theorem will provide an answer to this question. The (new and old) coordinates are related by , with

Now the Jacobian matrix of f at a certain point is given by

Where denotes the identity matrix, and J is the matrix of partial derivatives, evaluated at . (In the above, these blocks were denoted by X and Y.) The implicit function theorem now states that we can locally express as a function of if J is invertible. Demanding J is invertible is equivalent to , thus we see that we can go back from the primed to the unprimed coordinates if the determinant of the Jacobian J is non-zero. This statement is also known as the inverse function theorem.

Example

As a simple application of the above, consider the plane, parametrised by polar coordinates . We can go to a new coordinate system (cartesian coordinates) by defining functions and . This makes is possible given any point to find corresponding cartesian coordinates . When can we go back, and converse cartesian into polar coordinates? By the previous example, we need , with

Since , the conversion back to polar coordinates is only possible if . This is a consequence of the fact that at that point polar coordinates are not good: at the origin the value of is not well-defined.

Banach space version

Based on Grave's inverse function theorem it is possible to extend the implicit function theorem to Banach space valued mappings.

Let , , be Banach spaces. Let the mapping be Fréchet differentiable. If , , and is a Banach space isomorphism from onto . Then there exist neighborhoods of and of and a Frechet differentiable function such that and if and only if , for all .

See also