Rolle's theorem

If a real-valued function $f$ is continuous on a closed interval $[a, b]$ , differentiable on the open interval $(a, b)$ , and $f (a) = f (b)$ , then there exists a $c$ in the open interval $(a, b)$ such that $f'(c) = 0$ .

In calculus, Rolle's theorem or Rolle's lemma essentially states that any real-valued differentiable function that attains equal values at two distinct points must have at least one point, somewhere between them, at which the slope of the tangent line is zero. Such a point is known as a stationary point. It is a point at which the first derivative of the function is zero. The theorem is named after Michel Rolle.

Standard version of the theorem

If a real-valued function $f$ is continuous on a proper closed interval $[a, b]$ , differentiable on the open interval $(a, b)$ , and $f (a) = f (b)$ , then there exists at least one $c$ in the open interval $(a, b)$ such that $f'(c)=0.$

This version of Rolle's theorem is used to prove the mean value theorem, of which Rolle's theorem is indeed a special case. It is also the basis for the proof of Taylor's theorem.

History

Although the theorem is named after Michel Rolle, Rolle's 1691 proof covered only the case of polynomial functions. His proof did not use the methods of differential calculus, which at that point in his life he considered to be fallacious. The theorem was first proved by Cauchy in 1823 as a corollary of a proof of the mean value theorem.^[1] The name "Rolle's theorem" was first used by Moritz Wilhelm Drobisch of Germany in 1834 and by Giusto Bellavitis of Italy in 1846.^[2]

Examples

Half circle

For a radius $r > 0$ , consider the function $f(x)={\sqrt {r^{2}-x^{2}}},\quad x\in [-r,r].$

Its graph is the upper semicircle centered at the origin. This function is continuous on the closed interval $[- r, r]$ and differentiable in the open interval $(- r, r)$ , but not differentiable at the endpoints $- r$ and $r$ . Since $f (- r) = f (r)$ , Rolle's theorem applies, and indeed, there is a point where the derivative of $f$ is zero. The theorem applies even when the function cannot be differentiated at the endpoints because it only requires the function to be differentiable in the open interval.

Absolute value

If differentiability fails at an interior point of the interval, the conclusion of Rolle's theorem may not hold. Consider the absolute value function $f(x)=|x|,\quad x\in [-1,1].$

Then $f (-1) = f (1)$ , but there is no $c$ between −1 and 1 for which the $f'(c)$ is zero. This is because that function, although continuous, is not differentiable at $x = 0$ . The derivative of $f$ changes its sign at $x = 0$ , but without attaining the value 0. The theorem cannot be applied to this function because it does not satisfy the condition that the function must be differentiable for every $x$ in the open interval. However, when the differentiability requirement is dropped from Rolle's theorem, $f$ will still have a critical number in the open interval $(a, b)$ , but it may not yield a horizontal tangent (as in the case of the absolute value represented in the graph).

Functions with zero derivative

Rolle's theorem implies that a differentiable function whose derivative is ⁠ $0$ ⁠ in an interval is constant in this interval.

Indeed, if $a$ and $b$ are two points in an interval where a function $f$ is differentiable, then the function $g(x)=f(x)-f(a)-{\frac {f(b)-f(a)}{b-a}}(x-a)$ satisfies the hypotheses of Rolle's theorem on the interval ⁠ $[a,b]$ ⁠.

If the derivative of ⁠ $f$ ⁠ is zero everywhere, the derivative of ⁠ $g$ ⁠ is $g'(x)={\frac {f(b)-f(a)}{b-a}},$ and Rolle's theorem implies that there is ⁠ $c\in (a,b)$ ⁠ such that $0=g'(x)={\frac {f(b)-f(a)}{b-a}}.$

Hence, ⁠ $f(a)=f(b)$ ⁠ for every ⁠ $a$ ⁠ and ⁠ $b$ ⁠, and the function ⁠ $f$ ⁠ is constant.

Generalization

The second example illustrates the following generalization of Rolle's theorem:

Consider a real-valued, continuous function $f$ on a closed interval $[a, b]$ with $f (a) = f (b)$ . If for every $x$ in the open interval $(a, b)$ the right-hand limit $f'(x^{+}):=\lim _{h\to 0^{+}}{\frac {f(x+h)-f(x)}{h}}$ and the left-hand limit $f'(x^{-}):=\lim _{h\to 0^{-}}{\frac {f(x+h)-f(x)}{h}}$

exist in the extended real line $[-\infty, \infty]$ , then there is some number $c$ in the open interval $(a, b)$ such that one of the two limits $f'(c^{+})\quad {\text{and}}\quad f'(c^{-})$ is $\geq 0$ and the other one is $\leq 0$ (in the extended real line). If the right- and left-hand limits agree for every $x$ , then they agree in particular for $c$ , hence the derivative of $f$ exists at $c$ and is equal to zero.

Remarks

If $f$ is convex or concave, then the right- and left-hand derivatives exist at every inner point, hence the above limits exist and are real numbers.
This generalized version of the theorem is sufficient to prove convexity when the one-sided derivatives are monotonically increasing:^[3] $f'(x^{-})\leq f'(x^{+})\leq f'(y^{-}),\quad x<y.$

Proof of the generalized version

Since the proof for the standard version of Rolle's theorem and the generalization are very similar, we prove the generalization.

The idea of the proof is to argue that if $f (a) = f (b)$ , then $f$ must attain either a maximum or a minimum somewhere between $a$ and $b$ , say at $c$ , and the function must change from increasing to decreasing (or the other way around) at $c$ . In particular, if the derivative exists, it must be zero at $c$ .

By assumption, $f$ is continuous on $[a, b]$ , and by the extreme value theorem attains both its maximum and its minimum in $[a, b]$ . If these are both attained at the endpoints of $[a, b]$ , then $f$ is constant on $[a, b]$ and so the derivative of $f$ is zero at every point in $(a, b)$ .

Suppose then that the maximum is obtained at an interior point $c$ of $(a, b)$ (the argument for the minimum is very similar, just consider $- f$ ). We shall examine the above right- and left-hand limits separately.

For a real $h$ such that $c + h$ is in $[a, b]$ , the value $f (c + h)$ is smaller or equal to $f (c)$ because $f$ attains its maximum at $c$ . Therefore, for every $h > 0$ , ${\frac {f(c+h)-f(c)}{h}}\leq 0,$ hence $f'(c^{+}):=\lim _{h\to 0^{+}}{\frac {f(c+h)-f(c)}{h}}\leq 0,$ where the limit exists by assumption, it may be minus infinity.

Similarly, for every $h < 0$ , the inequality turns around because the denominator is now negative and we get ${\frac {f(c+h)-f(c)}{h}}\geq 0,$ hence $f'(c^{-}):=\lim _{h\to 0^{-}}{\frac {f(c+h)-f(c)}{h}}\geq 0,$ where the limit might be plus infinity.

Finally, when the above right- and left-hand limits agree (in particular when $f$ is differentiable), then the derivative of $f$ at $c$ must be zero.

(Alternatively, we can apply Fermat's stationary point theorem directly.)

Generalization to higher derivatives

We can also generalize Rolle's theorem by requiring that $f$ has more points with equal values and greater regularity. Specifically, suppose that

the function $f$ is $n - 1$ times continuously differentiable on the closed interval $[a, b]$ and the $n$ th derivative exists on the open interval $(a, b)$ , and
there are $n$ intervals given by $a 1 < b 1 \leq a 2 < b 2 \leq \dots \leq a n < b n$ in $[a, b]$ such that $f (a k) = f (b k)$ for every $k$ from 1 to $n$ .

Then there is a number $c$ in $(a, b)$ such that the $n$ th derivative of $f$ at $c$ is zero.

The requirements concerning the $n$ th derivative of $f$ can be weakened as in the generalization above, giving the corresponding (possibly weaker) assertions for the right- and left-hand limits defined above with $f (n - 1)$ in place of $f$ .

Particularly, this version of the theorem asserts that if a function differentiable enough times has $n$ roots (so they have the same value, that is 0), then there is an internal point where $f (n - 1)$ vanishes.

Proof

The proof uses mathematical induction. The case $n = 1$ is simply the standard version of Rolle's theorem. For $n > 1$ , take as the induction hypothesis that the generalization is true for $n - 1$ . We want to prove it for $n$ . Assume the function $f$ satisfies the hypotheses of the theorem. By the standard version of Rolle's theorem, for every integer $k$ from 1 to $n$ , there exists a $c k$ in the open interval $(a k, b k)$ such that $f'(c k) = 0$ . Hence, the first derivative satisfies the assumptions on the $n - 1$ closed intervals $[c 1, c 2], \dots, [c n - 1, c n]$ . By the induction hypothesis, there is a $c$ such that the $(n - 1)$ st derivative of $f'$ at $c$ is zero.

Generalizations to other fields

Rolle's theorem is a property of differentiable functions over the real numbers, which are an ordered field. As such, it does not generalize to other fields, but the following corollary does: if a real polynomial factors (has all of its roots) over the real numbers, then its derivative does as well. One may call this property of a field Rolle's property.^{[citation needed]} More general fields do not always have differentiable functions, but they do always have polynomials, which can be symbolically differentiated. Similarly, more general fields may not have an order, but one has a notion of a root of a polynomial lying in a field.

Thus Rolle's theorem shows that the real numbers have Rolle's property. Any algebraically closed field such as the complex numbers has Rolle's property. However, the rational numbers do not – for example, $x 3 - x = x (x - 1)(x + 1)$ factors over the rationals, but its derivative, $3x^{2}-1=3\left(x-{\tfrac {1}{\sqrt {3}}}\right)\left(x+{\tfrac {1}{\sqrt {3}}}\right),$ does not. The question of which fields satisfy Rolle's property was raised in Kaplansky 1972.^[4] For finite fields, the answer is that only $F 2$ and $F 4$ have Rolle's property.^[5]^[6]

For a complex version, see Voorhoeve index.

References

^ Besenyei, A. (September 17, 2012). "A brief history of the mean value theorem" (PDF).
^ See Cajori, Florian (1999). A History of Mathematics. American Mathematical Soc. p. 224. ISBN 9780821821022.
^ Artin, Emil (1964) [1931], The Gamma Function, translated by Butler, Michael, Holt, Rinehart and Winston, pp. 3–4.
^ Kaplansky, Irving (1972), Fields and Rings.^{[full citation needed]}
^ Craven, Thomas; Csordas, George (1977), "Multiplier sequences for fields", Illinois J. Math., 21 (4): 801–817, doi:10.1215/ijm/1256048929.
^ Ballantine, C.; Roberts, J. (January 2002), "A Simple Proof of Rolle's Theorem for Finite Fields", The American Mathematical Monthly, 109 (1), Mathematical Association of America: 72–74, doi:10.2307/2695770, JSTOR 2695770.

External links

"Rolle theorem", Encyclopedia of Mathematics, EMS Press, 2001 [1994]
Rolle's and Mean Value Theorems at cut-the-knot.
Mizar system proof: http://mizar.org/version/current/html/rolle.html#T2

[1] Besenyei, A. (September 17, 2012). "A brief history of the mean value theorem" (PDF).

[2] See Cajori, Florian (1999). A History of Mathematics. American Mathematical Soc. p. 224. ISBN 9780821821022.

[3] Artin, Emil (1964) [1931], The Gamma Function, translated by Butler, Michael, Holt, Rinehart and Winston, pp. 3–4.

[4] Kaplansky, Irving (1972), Fields and Rings.^{[full citation needed]}

[5] Craven, Thomas; Csordas, George (1977), "Multiplier sequences for fields", Illinois J. Math., 21 (4): 801–817, doi:10.1215/ijm/1256048929.

[6] Ballantine, C.; Roberts, J. (January 2002), "A Simple Proof of Rolle's Theorem for Finite Fields", The American Mathematical Monthly, 109 (1), Mathematical Association of America: 72–74, doi:10.2307/2695770, JSTOR 2695770.

[1]

[2]

[3]

[4]

[5]

[6]