Generalizations of the derivative
|Part of a series of articles about|
This article may require cleanup to meet Wikipedia's quality standards. The specific problem is: too much bold; see WP:MOSBOLD (April 2015) (Learn how and when to remove this template message)
In mathematics, the derivative is a fundamental construction of differential calculus and admits many possible generalizations within the fields of mathematical analysis, combinatorics, algebra, and geometry.
- 1 Derivatives in analysis
- 2 Difference operator, q-analogues and time scales
- 3 Derivatives in algebra
- 4 Derivatives in geometry
- 5 Other generalizations
- 6 See also
- 7 Notes
Derivatives in analysis
In real, complex, and functional analysis, derivatives are generalized to functions of several real or complex variables and functions between topological vector spaces. An important case is the variational derivative in the calculus of variations. Repeated application of differentiation leads to derivatives of higher order and differential operators.
The derivative is often met for the first time as an operation on a single real function of a single real variable. One of the simplest settings for generalizations is to vector valued functions of several variables (most often the domain forms a vector space as well). This is the field of multivariable calculus.
In one-variable calculus, we say that a function is differentiable at a point x if the limit
exists. Its value is then the derivative ƒ'(x). A function is differentiable on an interval if it is differentiable at every point within the interval. Since the line is tangent to the original function at the point the derivative can be seen as a way to find the best linear approximation of a function. If one ignores the constant term, setting , L(z) becomes an actual linear operator on R considered as a vector space over itself.
This motivates the following generalization to functions mapping Rm to Rn: ƒ is differentiable at x if there exists a linear operator A(x) (depending on x) such that
Although this definition is perhaps not as explicit as the above, if such an operator exists, then it is unique, and in the one-dimensional case coincides with the original definition. (In this case the derivative is represented by a 1-by-1 matrix consisting of the sole entry f'(x).) Note that, in general, we concern ourselves mostly with functions being differentiable in some open neighbourhood of rather than at individual points, as not doing so tends to lead to many pathological counterexamples.
An n by m matrix, of the linear operator A(x) is known as Jacobian matrix Jx(ƒ) of the mapping ƒ at point x. Each entry of this matrix represents a partial derivative, specifying the rate of change of one range coordinate with respect to a change in a domain coordinate. Of course, the Jacobian matrix of the composition g°f is a product of corresponding Jacobian matrices: Jx(g°f) =Jƒ(x)(g)Jx(ƒ). This is a higher-dimensional statement of the chain rule.
For real valued functions from Rn to R (scalar fields), the total derivative can be interpreted as a vector field called the gradient. An intuitive interpretation of the gradient is that it points "up": in other words, it points in the direction of fastest increase of the function. It can be used to calculate directional derivatives of scalar functions or normal directions.
Several linear combinations of partial derivatives are especially useful in the context of differential equations defined by a vector valued function Rn to Rn. The divergence gives a measure of how much "source" or "sink" near a point there is. It can be used to calculate flux by divergence theorem. The curl measures how much "rotation" a vector field has near a point.
For vector-valued functions from R to Rn (i.e., parametric curves), one can take the derivative of each component separately. The resulting derivative is another vector valued function. This is useful, for example, if the vector-valued function is the position vector of a particle through time, then the derivative is the velocity vector of the particle through time.
The convective derivative takes into account changes due to time dependence and motion through space along vector field.
Higher-order derivatives and differential operators
One can iterate the differentiation process, that is, apply derivatives more than once, obtaining derivatives of second and higher order. A more sophisticated idea is to combine several derivatives, possibly of different orders, in one algebraic expression, a differential operator. This is especially useful in considering ordinary linear differential equations with constant coefficients. For example, if f(x) is a twice differentiable function of one variable, the differential equation
may be rewritten in the form
is a second order linear constant coefficient differential operator acting on functions of x. The key idea here is that we consider a particular linear combination of zeroth, first and second order derivatives "all at once". This allows us to think of the set of solutions of this differential equation as a "generalized antiderivative" of its right hand side 4x − 1, by analogy with ordinary integration, and formally write
Higher derivatives can also be defined for functions of several variables, studied in multivariable calculus. In this case, instead of repeatedly applying the derivative, one repeatedly applies partial derivatives with respect to different variables. For example, the second order partial derivatives of a scalar function of n variables can be organized into an n by n matrix, the Hessian matrix. One of the subtle points is that the higher derivatives are not intrinsically defined, and depend on the choice of the coordinates in a complicated fashion (in particular, the Hessian matrix of a function is not a tensor). Nevertheless, higher derivatives have important applications to analysis of local extrema of a function at its critical points. For an advanced application of this analysis to topology of manifolds, see Morse theory.
As in the case of functions of one variable, we can combine first and higher order partial derivatives to arrive at a notion of a partial differential operator. Some of these operators are so important that they have their own names:
- The Laplace operator or Laplacian on R3 is a second-order partial differential operator Δ given by the divergence of the gradient of a scalar function of three variables, or explicitly as
Analogous operators can be defined for functions of any number of variables.
- The d'Alembertian or wave operator is similar to the Laplacian, but acts on functions of four variables. Its definition uses the indefinite metric tensor of Minkowski space, instead of the Euclidean dot product of R3:
Analysis on fractals
Laplacians and differential equations can be defined on fractals.
In addition to n-th derivatives for any natural number n, there are various ways to define derivatives of fractional or negative orders, which are studied in fractional calculus. The -1 order derivative corresponds to the integral, whence the term differintegral.
The Schwarzian derivative describes how a complex function is approximated by a fractional-linear map, in much the same way that a normal derivative describes how a function is approximated by a linear map.
The Wirtinger derivatives are a set of differential operators that permit the construction of a differential calculus for complex functions that is entirely analogous to the ordinary differential calculus for functions of real variables.
In functional analysis, the functional derivative defines the derivative with respect to a function of a functional on a space of functions. This is an extension of the directional derivative to an infinite dimensional vector space.
The Fréchet derivative allows the extension of the directional derivative to a general Banach space. The Gâteaux derivative extends the concept to locally convex topological vector spaces. Fréchet differentiability is a strictly stronger condition than Gâteaux differentiability, even in finite dimensions. Between the two extremes is the quasi-derivative.
In measure theory, the Radon–Nikodym derivative generalizes the Jacobian, used for changing variables, to measures. It expresses one measure μ in terms of another measure ν (under certain conditions).
On a function space, the linear operator which assigns to each function its derivative is an example of a differential operator. General differential operators include higher order derivatives. By means of the Fourier transform, pseudo-differential operators can be defined which allow for fractional calculus.
Analogues of derivatives in fields of positive characteristic
The Carlitz derivative is an operation similar to usual differentiation have been devised with the usual context of real or complex numbers changed to local fields of positive characteristic in the form of formal Laurent series with coefficients in some finite field Fq (it is known that any local field of positive characteristic is isomorphic to a Laurent series field).
Along with suitably defined analogs to the exponential function, logarithms and others the derivative can be used to develop notions of smoothness, analycity, integration, Taylor series as well as a theory of differential equations.
Difference operator, q-analogues and time scales
- The q-derivative of a function is defined by the formula
For x nonzero, if f is a differentiable function of x then in the limit as q → 1 we obtain the ordinary derivative, thus the q-derivative may be viewed as its q-deformation. A large body of results from ordinary differential calculus, such as binomial formula and Taylor expansion, have natural q-analogues that were discovered in the 19th century, but remained relatively obscure for a big part of the 20th century, outside of the theory of special functions. The progress of combinatorics and the discovery of quantum groups have changed the situation dramatically, and the popularity of q-analogues is on the rise.
- The difference operator of difference equations is another discrete analog of the standard derivative.
- The q-derivative, the difference operator and the standard derivative can all be viewed as the same thing on different time scales. For example, taking , we may have
Hahn difference is not only a generalization of q-derivative but also an extension of forward difference.
- Also note that the q-derivative is nothing but a special case of the familiar derivative. Take . Then we have,
Derivatives in algebra
A derivation is a linear map on a ring or algebra which satisfies the Leibniz law (the product rule). Higher derivatives and algebraic differential operators can also be defined. They are studied in a purely algebraic setting in differential Galois theory and the theory of D-modules, but also turn up in many other areas, where they often agree with less algebraic definitions of derivatives.
The notion of derivation applies to noncommutative as well as commutative rings, and even to non-associative algebraic structures, such as Lie algebras.
In commutative algebra, Kähler differentials are universal derivations of a commutative ring or module. They can be used to define an analogue of exterior derivative from differential geometry that applies to arbitrary algebraic varieties, instead of just smooth manifolds.
Many abstract data types in mathematics and computer science can be described as the algebra generated by a transformation that maps structures based on the type back into the type. For example, the type T of binary trees containing values of type A can be represented as the algebra generated by the transformation 1+A×T2→T. The "1" represents the construction of an empty tree, and the second term represents the construction of a tree from a value and two subtrees. The "+" indicates that a tree can be constructed either way.
The derivative of such a type is the type that describes the context of a particular substructure with respect to its next outer containing structure. Put another way, it is the type representing the "difference" between the two. In the tree example, the derivative is a type that describes the information needed, given a particular subtree, to construct its parent tree. This information is a tuple that contains a binary indicator of whether the child is on the left or right, the value at the parent, and the sibling subtree. This type can be represented as 2×A×T, which looks very much like the derivative of the transformation that generated the tree type.
Derivatives in geometry
Main types of derivatives in geometry is Lie derivatives along a vector field, exterior differential, and covariant derivatives.
In differential topology, a vector field may be defined as a derivation on the ring of smooth functions on a manifold, and a tangent vector may be defined as a derivation at a point. This allows the abstraction of the notion of a directional derivative of a scalar function to general manifolds. For manifolds that are subsets of Rn, this tangent vector will agree with the directional derivative defined above.
On the exterior algebra of differential forms over a smooth manifold, the exterior derivative is the unique linear map which satisfies a graded version of the Leibniz law and squares to zero. It is a grade 1 derivation on the exterior algebra.
The Lie derivative is the rate of change of a vector or tensor field along the flow of another vector field. On vector fields, it is an example of a Lie bracket (vector fields form the Lie algebra of the diffeomorphism group of the manifold). It is a grade 0 derivation on the algebra.
In differential geometry, the covariant derivative makes a choice for taking directional derivatives of vector fields along curves. This extends the directional derivative of scalar functions to sections of vector bundles or principal bundles. In Riemannian geometry, the existence of a metric chooses a unique preferred torsion-free covariant derivative, known as the Levi-Civita connection. See also gauge covariant derivative for a treatment oriented to physics.
The exterior covariant derivative extends the exterior derivative to vector valued forms.
In geometric calculus, the geometric derivative satisfies a weaker form of the Leibniz rule. It specializes the Frechet derivative to the objects of geometric algebra. Geometric calculus is a powerful formalism that has been shown to encompass the similar frameworks of differential forms and differential geometry.
It may be possible to combine two or more of the above different notions of extension or abstraction of the original derivative. For example, in Finsler geometry, one studies spaces which look locally like Banach spaces. Thus one might want a derivative with some of the features of a functional derivative and the covariant derivative.
Multiplicative calculus replaces addition with multiplication, and hence rather than dealing with the limit of a ratio of differences, it deals with the limit of an exponentiation of ratios. This allows the development of the geometric derivative and bigeometric derivative. Moreover, just like the classical differential operator has a discrete analog, the difference operator, there are also discrete analogs of these multiplicative derivatives.
- Arithmetic derivative
- Dini derivative
- Non-Newtonian calculus
- Non-classical analysis
- Symmetric derivative
- Kochubei, Anatoly N. (2009). Analysis in Positive Characteristic. New York: Cambridge University Press. ISBN 978-0-521-50977-0.
- David Hestenes, Garrett Sobczyk: Clifford Algebra to Geometric Calculus, a Unified Language for mathematics and Physics (Dordrecht/Boston:G.Reidel Publ.Co., 1984, ISBN 90-277-2561-6