- 1 Motivation
- 2 Keisler's textbook
- 3 Definition of derivative
- 4 Continuity
- 5 Uniform continuity
- 6 Compactness
- 7 Heine–Cantor theorem
- 8 Why is the squaring function not uniformly continuous?
- 9 Example: Dirichlet function
- 10 Limit
- 11 Limit of sequence
- 12 Extreme value theorem
- 13 Intermediate value theorem
- 14 Basic theorems
- 15 Applications
- 16 See also
- 17 Notes
- 18 References
- 19 External links
In mathematics, non-standard calculus is the modern application of infinitesimals, in the sense of non-standard analysis, to differential and integral calculus. It provides a rigorous justification for some arguments in calculus that were previously considered merely heuristic.
Calculations with infinitesimals were widely used before Karl Weierstrass sought to replace them with the (ε, δ)-definition of limit starting in the 1870s. (See history of calculus.) For almost one hundred years thereafter, mathematicians like Richard Courant viewed infinitesimals as being naive and vague or meaningless.
Contrary to such views, Abraham Robinson showed in 1960 that infinitesimals are precise, clear, and meaningful, building upon work by Edwin Hewitt and Jerzy Łoś. According to Jerome Keisler, "Robinson solved a three hundred year old problem by giving a precise treatment of infinitesimals. Robinson's achievement will probably rank as one of the major mathematical advances of the twentieth century."
To calculate the derivative of the function at x, both approaches agree on the algebraic manipulations:
This becomes a computation of the derivatives using the hyperreals if we interpret Δx as an infinitesimal and let the symbol "" be the relation "is infinitely close to".
In order to make f ' a real-valued function, we must dispense with the final term . In the standard approach using only real numbers, that is done by taking the limit as tends to zero. In the hyperreal approach, the quantity is taken to be an infinitesimal, a nonzero number that is closer to 0 than to any nonzero real. The manipulations displayed above then show that is infinitely close to 2x, so the derivative of f at x is then 2x.
Discarding the "error term" is accomplished by an application of the standard part function. Dispensing with infinitesimal error terms was historically considered paradoxical by some writers, most notably George Berkeley.
Once the hyperreal number system (an infinitesimal-enriched continuum) is in place, one has successfully incorporated a large part of the technical difficulties at the foundational level. Thus, the epsilon, delta techniques that some believe to be the essence of analysis can be implemented once and for all at the foundational level, and the students needn't be "dressed to perform multiple-quantifier logical stunts on pretense of being taught infinitesimal calculus", to quote a recent study. More specifically, the basic concepts of calculus such as continuity, derivative, and integral can be defined using infinitesimals without reference to epsilon, delta (see next section).
Keisler's Elementary Calculus: An Infinitesimal Approach defines continuity on page 125 in terms of infinitesimals, to the exclusion of epsilon, delta methods. The derivative is defined on page 45 using infinitesimals rather than an epsilon-delta approach. The integral is defined on page 183 in terms of infinitesimals. Epsilon, delta definitions are introduced on page 282.
Definition of derivative
The hyperreals can be constructed in the framework of Zermelo-Fraenkel set theory, the standard axiomatisation of set theory used elsewhere in mathematics. To give an intuitive idea for the hyperreal approach, note that, naively speaking, non-standard analysis postulates the existence of positive numbers ε which are infinitely small, meaning that ε is smaller than any standard positive real, yet greater than zero. Every real number x is surrounded by an infinitesimal "cloud" of hyperreal numbers infinitely close to it. To define the derivative of f at a standard real number x in this approach, one no longer needs an infinite limiting process as in standard calculus. Instead, one sets
where st is the standard part function, yielding the real number infinitely close to the hyperreal argument of st, and is the natural extension of to the hyperreals.
A real function f is continuous at a standard real number x if for every hyperreal x' infinitely close to x, the value f(x' ) is also infinitely close to f(x). This captures Cauchy's definition of continuity as presented in his 1821 textbook Cours d'Analyse, p. 34.
Using the notation for the relation of being infinitely close as above, the definition can be extended to arbitrary (standard or non-standard) points as follows:
A function f is microcontinuous at x if whenever , one has
Here the point x' is assumed to be in the domain of (the natural extension of) f.
The above requires fewer quantifiers than the (ε, δ)-definition familiar from standard elementary calculus:
f is continuous at x if for every ε > 0, there exists a δ > 0 such that for every x' , whenever |x − x' | < δ, one has |ƒ(x) − ƒ(x' )| < ε.
A function f on an interval I is uniformly continuous if its natural extension f* in I* has the following property (see Keisler, Foundations of Infinitesimal Calculus ('07), p. 45):
for every pair of hyperreals x and y in I*, if then .
In terms of microcontinuity defined in the previous section, this can be stated as follows: a real function is uniformly continuous if its natural extension f* is microcontinuous at every point of the domain of f*.
This definition has a reduced quantifier complexity when compared with the standard (ε, δ)-definition. Namely, the epsilon-delta definition of uniform continuity requires four quantifiers, while the infinitesimal definition requires only two quantifiers. It has the same quantifier complexity as the definition of uniform continuity in terms of sequences in standard calculus, which however is not expressible in the first-order language of the real numbers.
The hyperreal definition can be illustrated by the following three examples.
Example 1: a function f is uniformly continuous on the semi-open interval (0,1], if and only if its natural extension f* is microcontinuous (in the sense of the formula above) at every positive infinitesimal, in addition to continuity at the standard points of the interval.
Example 2: a function f is uniformly continuous on the semi-open interval [0,∞) if and only if it is continuous at the standard points of the interval, and in addition, the natural extension f* is microcontinuous at every positive infinite hyperreal point.
Example 3: similarly, the failure of uniform continuity for the squaring function
is due to the absence of microcontinuity at a single infinite hyperreal point, see below.
- The number of quantifiers in a mathematical statement gives a rough measure of the statement’s complexity. Statements involving three or more quantifiers can be difficult to understand. This is the main reason why it is hard to understand the rigorous definitions of limit, convergence, continuity and differentiability in analysis as they have many quantifiers. In fact, it is the alternation of the and that causes the complexity.
Andreas Blass wrote as follows:
- Often ... the nonstandard definition of a concept is simpler than the standard definition (both intuitively simpler and simpler in a technical sense, such as quantifiers over lower types or fewer alternations of quantifiers).
A set A is compact if and only if its natural extension A* has the following property: every point in A* is infinitely close to a point of A. Thus, the open interval (0,1) is not compact because its natural extension contains positive infinitesimals which are not infinitely close to any positive real number.
The fact that a continuous function on a compact interval I is necessarily uniformly continuous (the Heine–Cantor theorem) admits a succinct hyperreal proof. Let x, y be hyperreals in the natural extension I* of I. Since I is compact, both st(x) and st(y) belong to I. If x and y were infinitely close, then by the triangle inequality, they would have the same standard part
Since the function is assumed continuous at c, we have
and therefore f(x) and f(y) are infinitely close, proving uniform continuity of f.
Why is the squaring function not uniformly continuous?
Let f(x) = x2 defined on . Let be an infinite hyperreal. The hyperreal number is infinitely close to N. Meanwhile, the difference
is not infinitesimal. Therefore f* fails to be microcontinuous at the hyperreal point N. Thus, the squaring function is not uniformly continuous, according to the definition in uniform continuity above.
A similar proof may be given in the standard setting (Fitzpatrick 2006, Example 3.15).
Example: Dirichlet function
Consider the Dirichlet function
It is well known that the function is discontinuous at every point. Let us check this in terms of the hyperreal definition of continuity above, for instance let us show that the Dirichlet function is not continuous at π. Consider the continued fraction approximation an of π. Now let the index n be an infinite hypernatural number. By the transfer principle, the natural extension of the Dirichlet function takes the value 1 at an. Note that the hyperrational point an is infinitely close to π. Thus the natural extension of the Dirichlet function takes different values (0 and 1) at these two infinitely close points, and therefore the Dirichlet function is not continuous at π.
While the thrust of Robinson's approach is that one can dispense with the approach using multiple quantifiers, the notion of limit can be easily recaptured in terms of the standard part function st, namely
if and only if whenever the difference x − a is infinitesimal, the difference ƒ(x) − L is infinitesimal, as well, or in formulas:
- if st(x) = a then st(ƒ(x)) = L,
Limit of sequence
Given a sequence of real numbers , if we say L is the limit of the sequence and write
if for every infinite hypernatural n, we have st(xn)=L (here the extension principle is used to define xn for every hyperinteger n).
Extreme value theorem
To show that a real continuous function f on [0,1] has a maximum, let N be an infinite hyperinteger. The interval [0, 1] has a natural hyperreal extension. The function ƒ is also naturally extended to hyperreals between 0 and 1. Consider the partition of the hyperreal interval [0,1] into N subintervals of equal infinitesimal length 1/N, with partition points xi = i /N as i "runs" from 0 to N. In the standard setting (when N is finite), a point with the maximal value of ƒ can always be chosen among the N+1 points xi, by induction. Hence, by the transfer principle, there is a hyperinteger i0 such that 0 ≤ i0 ≤ N and for all i = 0, …, N (an alternative explanation is that every hyperfinite set admits a maximum). Consider the real point
where st is the standard part function. An arbitrary real point x lies in a suitable sub-interval of the partition, namely , so that st(xi) = x. Applying st to the inequality , we obtain . By continuity of ƒ we have
Hence ƒ(c) ≥ ƒ(x), for all x, proving c to be a maximum of the real function ƒ. See Keisler (1986, p. 164).
Intermediate value theorem
Let f be a continuous function on [a,b] such that f(a)<0 while f(b)>0. Then there exists a point c in [a,b] such that f(c)=0.
The proof proceeds as follows. Let N be an infinite hyperinteger. Consider a partition of [a,b] into N intervals of equal length, with partition points xi as i runs from 0 to N. Consider the collection I of indices such that f(xi)>0. Let i0 be the least element in I (such an element exists by the transfer principle, as I is a hyperfinite set; see non-standard analysis). Then the real number
is the desired zero of f. Such a proof reduces the quantifier complexity of a standard proof of the IVT.
If f is a real valued function defined on an interval [a, b], then the transfer operator applied to f, denoted by *f, is an internal, hyperreal-valued function defined on the hyperreal interval [*a, *b].
Theorem. Let f be a real-valued function defined on an interval [a, b]. Then f is differentiable at a < x < b if and only if for every non-zero infinitesimal h, the value
is independent of h. In that case, the common value is the derivative of f at x.
Note that a similar result holds for differentiability at the endpoints a, b provided the sign of the infinitesimal h is suitably restricted.
For the second theorem, we consider the Riemann integral. This integral is defined as the limit, if it exists, of a directed family of Riemann sums; these are sums of the form
We will call such a sequence of values a partition or mesh and
the width of the mesh. In the definition of the Riemann integral, the limit of the Riemann sums is taken as the width of the mesh goes to 0.
Theorem. Let f be a real-valued function defined on an interval [a, b]. Then f is Riemann-integrable on [a, b] if and only if for every internal mesh of infinitesimal width, the quantity
is independent of the mesh. In this case, the common value is the Riemann integral of f over [a, b].
One immediate application is an extension of the standard definitions of differentiation and integration to internal functions on intervals of hyperreal numbers.
An internal hyperreal-valued function f on [a, b] is S-differentiable at x, provided
exists and is independent of the infinitesimal h. The value is the S derivative at x.
Theorem. Suppose f is S-differentiable at every point of [a, b] where b − a is a bounded hyperreal. Suppose furthermore that
Then for some infinitesimal ε
To prove this, let N be a non-standard natural number. Divide the interval [a, b] into N subintervals by placing N − 1 equally spaced intermediate points:
Now the maximum of any internal set of infinitesimals is infinitesimal. Thus all the εk's are dominated by an infinitesimal ε. Therefore,
from which the result follows.
- Criticism of non-standard analysis
- Archimedes' use of infinitesimals
- Elementary Calculus: An Infinitesimal Approach
- Non-classical analysis
- Courant described infinitesimals on page 81 of Differential and Integral Calculus, Vol I, as "devoid of any clear meaning" and "naive befogging". Similarly on page 101, Courant described them as "incompatible with the clarity of ideas demanded in mathematics", "entirely meaningless", "fog which hung round the foundations", and a "hazy idea".
- Elementary Calculus: An Infinitesimal Approach
- Katz, Mikhail; Tall, David (2011), Tension between Intuitive Infinitesimals and Formal Mathematical Analysis, Bharath Sriraman, Editor. Crossroads in the History of Mathematics and Mathematics Education. The Montana Mathematics Enthusiast Monographs in Mathematics Education 12, Information Age Publishing, Inc., Charlotte, NC, arXiv:1110.5747
- Kevin Houston, How to Think Like a Mathematician, ISBN 978-0-521-71978-0
- Blass, Andreas (1978), "Review: Martin Davis, Applied nonstandard analysis, and K. D. Stroyan and W. A. J. Luxemburg, Introduction to the theory of infinitesimals, and H. Jerome Keisler, Foundations of infinitesimal calculus", Bull. Amer. Math. Soc. 84 (1): 34–41, p. 37.
- Fitzpatrick, Patrick (2006), Advanced Calculus, Brooks/Cole
- H. Jerome Keisler: Elementary Calculus: An Approach Using Infinitesimals. First edition 1976; 2nd edition 1986. (This book is now out of print. The publisher has reverted the copyright to the author, who has made available the 2nd edition in .pdf format available for downloading at http://www.math.wisc.edu/~keisler/calc.html.)
- H. Jerome Keisler: Foundations of Infinitesimal Calculus, available for downloading at http://www.math.wisc.edu/~keisler/foundations.html (10 jan '07)
- Blass, Andreas (1978), "Review: Martin Davis, Applied nonstandard analysis, and K. D. Stroyan and W. A. J. Luxemburg, Introduction to the theory of infinitesimals, and H. Jerome Keisler, Foundations of infinitesimal calculus", Bull. Amer. Math. Soc. 84 (1): 34–41
- Online version of "Elementary Calculus: An Approach Using Infinitesimals"
- An online calculus text using infinitesimals