Symmetry of second derivatives

In mathematics, the symmetry of second derivatives (also called the equality of mixed partials) refers to the possibility of interchanging the order of taking partial derivatives of a function

f\left(x_{1},\,x_{2},\,\ldots ,\,x_{n}\right)

of $n$ variables without changing the result under certain conditions (see below). The symmetry is the assertion that the second-order partial derivatives satisfy the identity

{\frac {\partial }{\partial x_{i}}}\left({\frac {\partial f}{\partial x_{j}}}\right)\ =\ {\frac {\partial }{\partial x_{j}}}\left({\frac {\partial f}{\partial x_{i}}}\right)

so that they form an $n\times n$ symmetric matrix, known as the function's Hessian matrix. Sufficient conditions for the above symmetry to hold are established by a result known as Schwarz's theorem, Clairaut's theorem, or Young's theorem.^[1]^[2]

In the context of partial differential equations it is called the Schwarz integrability condition.

Formal expressions of symmetry

In symbols, the symmetry may be expressed as:

{\frac {\partial }{\partial x}}\left({\frac {\partial f}{\partial y}}\right)\ =\ {\frac {\partial }{\partial y}}\left({\frac {\partial f}{\partial x}}\right)\qquad {\text{or}}\qquad {\frac {\partial ^{2}\!f}{\partial x\,\partial y}}\ =\ {\frac {\partial ^{2}\!f}{\partial y\,\partial x}}.

Another notation is:

\partial _{x}\partial _{y}f=\partial _{y}\partial _{x}f\qquad {\text{or}}\qquad f_{yx}=f_{xy}.

In terms of composition of the differential operator $D i$ which takes the partial derivative with respect to $x i$ :

D_{i}\circ D_{j}=D_{j}\circ D_{i}

.

From this relation it follows that the ring of differential operators with constant coefficients, generated by the $D i$ , is commutative; but this is only true as operators over a domain of sufficiently differentiable functions. It is easy to check the symmetry as applied to monomials, so that one can take polynomials in the $x i$ as a domain. In fact smooth functions are another valid domain.

History

The result on the equality of mixed partial derivatives under certain conditions has a long history. The list of unsuccessful proposed proofs started with Euler's, published in 1740,^[3] although already in 1721 Bernoulli had implicitly assumed the result with no formal justification.^[4] Clairaut also published a proposed proof in 1740, with no other attempts until the end of the 18th century. Starting then, for a period of 70 years, a number of incomplete proofs were proposed. The proof of Lagrange (1797) was improved by Cauchy (1823), but assumed the existence and continuity of the partial derivatives ${\tfrac {\partial ^{2}f}{\partial x^{2}}}$ and ${\tfrac {\partial ^{2}f}{\partial y^{2}}}$ .^[5] Other attempts were made by P. Blanchet (1841), Duhamel (1856), Sturm (1857), Schlömilch (1862), and Bertrand (1864). Finally in 1867 Lindelöf systematically analyzed all the earlier flawed proofs and was able to exhibit a specific counterexample where mixed derivatives failed to be equal.^[6]^[7]

Six years after that, Schwarz succeeded in giving the first rigorous proof.^[8] Dini later contributed by finding more general conditions than those of Schwarz. Eventually a clean and more general version was found by Jordan in 1883 that is still the proof found in most textbooks. Minor variants of earlier proofs were published by Laurent (1885), Peano (1889 and 1893), J. Edwards (1892), P. Haag (1893), J. K. Whittemore (1898), Vivanti (1899) and Pierpont (1905). Further progress was made in 1907-1909 when E. W. Hobson and W. H. Young found proofs with weaker conditions than those of Schwarz and Dini. In 1918, Carathéodory gave a different proof based on the Lebesgue integral.^[7]

Schwarz's theorem

In mathematical analysis, Schwarz's theorem (or Clairaut's theorem on equality of mixed partials)^[9] named after Alexis Clairaut and Hermann Schwarz, states that for a function $f\colon \Omega \to \mathbb {R}$ defined on a set $\Omega \subset \mathbb {R} ^{n}$ , if $\mathbf {p} \in \mathbb {R} ^{n}$ is a point such that some neighborhood of $\mathbf {p}$ is contained in $\Omega$ and $f$ has continuous second partial derivatives on that neighborhood of $\mathbf {p}$ , then for all $i$ and $j$ in $\{1,2\ldots ,\,n\},$

{\frac {\partial ^{2}}{\partial x_{i}\,\partial x_{j}}}f(\mathbf {p} )={\frac {\partial ^{2}}{\partial x_{j}\,\partial x_{i}}}f(\mathbf {p} ).

The partial derivatives of this function commute at that point.

One easy way to establish this theorem (in the case where $n=2$ , $i=1$ , and $j=2$ , which readily entails the result in general) is by applying Green's theorem to the gradient of $f.$

An elementary proof for functions on open subsets of the plane is as follows (by a simple reduction, the general case for the theorem of Schwarz easily reduces to the planar case).^[10] Let $f(x,y)$ be a differentiable function on an open rectangle $\Omega$ containing a point $(a,b)$ and suppose that $df$ is continuous with continuous $\partial _{x}\partial _{y}f$ and $\partial _{y}\partial _{x}f$ over $\Omega .$ Define

{\begin{aligned}u\left(h,\,k\right)&=f\left(a+h,\,b+k\right)-f\left(a+h,\,b\right),\\v\left(h,\,k\right)&=f\left(a+h,\,b+k\right)-f\left(a,\,b+k\right),\\w\left(h,\,k\right)&=f\left(a+h,\,b+k\right)-f\left(a+h,\,b\right)-f\left(a,\,b+k\right)+f\left(a,\,b\right).\end{aligned}}

These functions are defined for $\left|h\right|,\,\left|k\right|<\varepsilon$ , where $\varepsilon >0$ and $\left[a-\varepsilon ,\,a+\varepsilon \right]\times \left[b-\varepsilon ,\,b+\varepsilon \right]$ is contained in $\Omega .$

By the mean value theorem, for fixed $h$ and $k$ non-zero, $\theta ,\theta ',\phi ,\phi '$ can be found in the open interval $(0,1)$ with

{\begin{aligned}w\left(h,\,k\right)&=u\left(h,\,k\right)-u\left(0,\,k\right)=h\,\partial _{x}u\left(\theta h,\,k\right)\\&=h\,\left[\partial _{x}f\left(a+\theta h,\,b+k\right)-\partial _{x}f\left(a+\theta h,\,b\right)\right]\\&=hk\,\partial _{y}\partial _{x}f\left(a+\theta h,\,b+\theta ^{\prime }k\right)\\w\left(h,\,k\right)&=v\left(h,\,k\right)-v\left(h,\,0\right)=k\,\partial _{y}v\left(h,\,\phi k\right)\\&=k\left[\partial _{y}f\left(a+h,\,b+\phi k\right)-\partial _{y}f\left(a,\,b+\phi k\right)\right]\\&=hk\,\partial _{x}\partial _{y}f\left(a+\phi ^{\prime }h,\,b+\phi k\right).\end{aligned}}

Since $h,\,k\neq 0$ , the first equality below can be divided by $hk$ :

{\begin{aligned}hk\,\partial _{y}\partial _{x}f\left(a+\theta h,\,b+\theta ^{\prime }k\right)&=hk\,\partial _{x}\partial _{y}f\left(a+\phi ^{\prime }h,\,b+\phi k\right),\\\partial _{y}\partial _{x}f\left(a+\theta h,\,b+\theta ^{\prime }k\right)&=\partial _{x}\partial _{y}f\left(a+\phi ^{\prime }h,\,b+\phi k\right).\end{aligned}}

Letting $h,\,k$ tend to zero in the last equality, the continuity assumptions on $\partial _{y}\partial _{x}f$ and $\partial _{x}\partial _{y}f$ now imply that

{\frac {\partial ^{2}}{\partial x\partial y}}f\left(a,\,b\right)={\frac {\partial ^{2}}{\partial y\partial x}}f\left(a,\,b\right).

This account is a straightforward classical method found in many text books, for example in Burkill, Apostol and Rudin.^[10]^[11]^[12]

Although the derivation above is elementary, the approach can also be viewed from a more conceptual perspective so that the result becomes more apparent.^[13]^[14]^[15]^[16]^[17] Indeed the difference operators $\Delta _{x}^{t},\,\,\Delta _{y}^{t}$ commute and $\Delta _{x}^{t}f,\,\,\Delta _{y}^{t}f$ tend to $\partial _{x}f,\,\,\partial _{y}f$ as $t$ tends to 0, with a similar statement for second order operators.^[a] Here, for $z$ a vector in the plane and $u$ a directional vector ${\tbinom {1}{0}}$ or ${\tbinom {0}{1}}$ , the difference operator is defined by

\Delta _{u}^{t}f(z)={f(z+tu)-f(z) \over t}.

By the fundamental theorem of calculus for $C^{1}$ functions $f$ on an open interval $I$ with $(a,b)\subset I$

\int _{a}^{b}f^{\prime }(x)\,dx=f(b)-f(a).

Hence

|f(b)-f(a)|\leq (b-a)\,\sup _{c\in (a,b)}|f^{\prime }(c)|

.

This is a generalized version of the mean value theorem. Recall that the elementary discussion on maxima or minima for real-valued functions implies that if $f$ is continuous on $[a,b]$ and differentiable on $(a,b)$ , then there is a point $c$ in $(a,b)$ such that

{f(b)-f(a) \over b-a}=f^{\prime }(c).

For vector-valued functions with $V$ a finite-dimensional normed space, there is no analogue of the equality above, indeed it fails. But since $\inf f^{\prime }\leq f^{\prime }(c)\leq \sup f^{\prime }$ , the inequality above is a useful substitute. Moreover, using the pairing of the dual of $V$ with its dual norm, yields the following inequality:

\|f(b)-f(a)\|\leq (b-a)\,\sup _{c\in (a,b)}\|f^{\prime }(c)\|

.

These versions of the mean valued theorem are discussed in Rudin, Hörmander and elsewhere.^[19]^[20]

For $f$ a $C^{2}$ function on an open set in the plane, define $D_{1}=\partial _{x}$ and $D_{2}=\partial _{y}$ . Furthermore for $t\neq 0$ set

\Delta _{1}^{t}f(x,y)=[f(x+t,y)-f(x,y)]/t,\,\,\,\,\,\,\Delta _{2}^{t}f(x,y)=[f(x,y+t)-f(x,y)]/t

.

Then for $(x_{0},y_{0})$ in the open set, the generalized mean value theorem can be applied twice:

\left|\Delta _{1}^{t}\Delta _{2}^{t}f(x_{0},y_{0})-D_{1}D_{2}f(x_{0},y_{0})\right|\leq \sup _{0\leq s\leq 1}\left|\Delta _{1}^{t}D_{2}f(x_{0},y_{0}+ts)-D_{1}D_{2}f(x_{0},y_{0})\right|\leq \sup _{0\leq r,s\leq 1}\left|D_{1}D_{2}f(x_{0}+tr,y_{0}+ts)-D_{1}D_{2}f(x_{0},y_{0})\right|.

Thus $\Delta _{1}^{t}\Delta _{2}^{t}f(x_{0},y_{0})$ tends to $D_{1}D_{2}f(x_{0},y_{0})$ as $t$ tends to 0. The same argument shows that $\Delta _{2}^{t}\Delta _{1}^{t}f(x_{0},y_{0})$ tends to $D_{2}D_{1}f(x_{0},y_{0})$ . Hence, since the difference operators commute, so do the partial differential operators $D_{1}$ and $D_{2}$ , as claimed.^[21]^[22]^[23]^[24]^[25]

Remark. By two applications of the classical mean value theorem,

\Delta _{1}^{t}\Delta _{2}^{t}f(x_{0},y_{0})=D_{1}D_{2}f(x_{0}+t\theta ,y_{0}+t\theta ^{\prime })

for some $\theta$ and $\theta ^{\prime }$ in $(0,1)$ . Thus the first elementary proof can be reinterpreted using difference operators. Conversely, instead of using the generalized mean value theorem in the second proof, the classical mean valued theorem could be used.

Proof of Clairaut's theorem using iterated integrals

The properties of repeated Riemann integrals of a continuous function $F$ on a compact rectangle $[a, b] \times [c, d]$ are easily established.^[26] The uniform continuity of $F$ implies immediately that the functions $g(x)=\int _{c}^{d}F(x,y)\,dy$ and $h(y)=\int _{a}^{b}F(x,y)\,dx$ are continuous.^[27] It follows that

\int _{a}^{b}\int _{c}^{d}F(x,y)\,dy\,dx=\int _{c}^{d}\int _{a}^{b}F(x,y)\,dx\,dy

;

moreover it is immediate that the iterated integral is positive if $F$ is positive.^[28] The equality above is a simple case of Fubini's theorem, involving no measure theory. Titchmarsh (1939) proves it in a straightforward way using Riemann approximating sums corresponding to subdivisions of a rectangle into smaller rectangles.

To prove Clairaut's theorem, assume $f$ is a differentiable function on an open set $U$ , for which the mixed second partial derivatives $f yx$ and $f xy$ exist and are continuous. Using the fundamental theorem of calculus twice,

\int _{c}^{d}\int _{a}^{b}f_{yx}(x,y)\,dx\,dy=\int _{c}^{d}f_{y}(b,y)-f_{y}(a,y)\,dy=f(b,d)-f(a,d)-f(b,c)+f(a,c).

Similarly

\int _{a}^{b}\int _{c}^{d}f_{xy}(x,y)\,dy\,dx=\int _{a}^{b}f_{x}(x,d)-f_{x}(x,c)\,dx=f(b,d)-f(a,d)-f(b,c)+f(a,c).

The two iterated integrals are therefore equal. On the other hand, since $f xy (x, y)$ is continuous, the second iterated integral can be performed by first integrating over $x$ and then afterwards over $y$ . But then the iterated integral of $f yx - f xy$ on $[a, b] \times [c, d]$ must vanish. However, if the iterated integral of a continuous function function $F$ vanishes for all rectangles, then $F$ must be identically zero; for otherwise $F$ or $- F$ would be strictly positive at some point and therefore by continuity on a rectangle, which is not possible. Hence $f yx - f xy$ must vanish identically, so that $f yx = f xy$ everywhere.^[29]^[30]^[31]^[32]^[33]

Sufficiency of twice-differentiability

A weaker condition than the continuity of second partial derivatives (which is implied by the latter) which suffices to ensure symmetry is that all partial derivatives are themselves differentiable.^[34] Another strengthening of the theorem, in which existence of the permuted mixed partial is asserted, was provided by Peano in a short 1890 note on Mathesis:

If $f:E\to \mathbb {R}$ is defined on an open set $E\subset \mathbb {R} ^{2}$ ; $\partial _{1}f(x,\,y)$ and $\partial _{2,1}f(x,\,y)$ exist everywhere on $E$ ; $\partial _{2,1}f$ is continuous at $\left(x_{0},\,y_{0}\right)\in E$ , and if $\partial _{2}f(x,\,y_{0})$ exists in a neighborhood of $x=x_{0}$ , then $\partial _{1,2}f$ exists at $\left(x_{0},\,y_{0}\right)$ and $\partial _{1,2}f\left(x_{0},\,y_{0}\right)=\partial _{2,1}f\left(x_{0},\,y_{0}\right)$ .^[35]

Distribution theory formulation

The theory of distributions (generalized functions) eliminates analytic problems with the symmetry. The derivative of an integrable function can always be defined as a distribution, and symmetry of mixed partial derivatives always holds as an equality of distributions. The use of formal integration by parts to define differentiation of distributions puts the symmetry question back onto the test functions, which are smooth and certainly satisfy this symmetry. In more detail (where f is a distribution, written as an operator on test functions, and φ is a test function),

\left(D_{1}D_{2}f\right)[\phi ]=-\left(D_{2}f\right)\left[D_{1}\phi \right]=f\left[D_{2}D_{1}\phi \right]=f\left[D_{1}D_{2}\phi \right]=-\left(D_{1}f\right)\left[D_{2}\phi \right]=\left(D_{2}D_{1}f\right)[\phi ].

Another approach, which defines the Fourier transform of a function, is to note that on such transforms partial derivatives become multiplication operators that commute much more obviously.^[a]

Requirement of continuity

The symmetry may be broken if the function fails to have differentiable partial derivatives, which is possible if Clairaut's theorem is not satisfied (the second partial derivatives are not continuous).

The function f(x, y), as shown in equation (1), does not have symmetric second derivatives at its origin.

An example of non-symmetry is the function (due to Peano)^[36]^[37]

f(x,\,y)={\begin{cases}{\frac {xy\left(x^{2}-y^{2}\right)}{x^{2}+y^{2}}}&{\mbox{ for }}(x,\,y)\neq (0,\,0),\\0&{\mbox{ for }}(x,\,y)=(0,\,0).\end{cases}}

(1)

This can be visualized by the polar form $f(r\cos(\theta ),r\sin(\theta ))={\frac {r^{2}\sin(4\theta )}{4}}$ ; it is everywhere continuous, but its derivatives at (0, 0) cannot be computed algebraically. Rather, the limit of difference quotients shows that $f_{x}(0,0)=f_{y}(0,0)=0$ , so the graph $z=f(x,y)$ has a horizontal tangent plane at (0, 0), and the partial derivatives $f_{x},f_{y}$ exist and are everywhere continuous. However, the second partial derivatives are not continuous at (0, 0), and the symmetry fails. In fact, along the x-axis the y-derivative is $f_{y}(x,0)=x$ , and so:

f_{yx}(0,0)=\lim _{\varepsilon \to 0}{\frac {f_{y}(\varepsilon ,0)-f_{y}(0,0)}{\varepsilon }}=1.

In contrast, along the y-axis the x-derivative $f_{x}(0,y)=-y$ , and so $f_{xy}(0,0)=-1$ . That is, $f_{yx}\neq f_{xy}$ at (0, 0), although the mixed partial derivatives do exist, and at every other point the symmetry does hold.

The above function, written in a cylindrical coordinate system, can be expressed as

f(r,\,\theta )={\frac {r^{2}\sin {4\theta }}{4}},

showing that the function oscillates four times when traveling once around an arbitrarily small loop containing the origin. Intuitively, therefore, the local behavior of the function at (0, 0) cannot be described as a quadratic form, and the Hessian matrix thus fails to be symmetric.

In general, the interchange of limiting operations need not commute. Given two variables near (0, 0) and two limiting processes on

f(h,\,k)-f(h,\,0)-f(0,\,k)+f(0,\,0)

corresponding to making h → 0 first, and to making k → 0 first. It can matter, looking at the first-order terms, which is applied first. This leads to the construction of pathological examples in which second derivatives are non-symmetric. This kind of example belongs to the theory of real analysis where the pointwise value of functions matters. When viewed as a distribution the second partial derivative's values can be changed at an arbitrary set of points as long as this has Lebesgue measure 0. Since in the example the Hessian is symmetric everywhere except (0, 0), there is no contradiction with the fact that the Hessian, viewed as a Schwartz distribution, is symmetric.

In Lie theory

Consider the first-order differential operators D_i to be infinitesimal operators on Euclidean space. That is, D_i in a sense generates the one-parameter group of translations parallel to the x_i-axis. These groups commute with each other, and therefore the infinitesimal generators do also; the Lie bracket

[D_i, D_j] = 0

is this property's reflection. In other words, the Lie derivative of one coordinate with respect to another is zero.

Application to differential forms

The Clairaut-Schwarz theorem is the key fact needed to prove that for every $C^{\infty }$ (or at least twice differentiable) differential form $\omega \in \Omega ^{k}(M)$ , the second exterior derivative vanishes: $d^{2}\omega :=d(d\omega )=0$ . This implies that every differentiable exact form (i.e., a form $\alpha$ such that $\alpha =d\omega$ for some form $\omega$ ) is closed (i.e., $d\alpha =0$ ), since $d\alpha =d(d\omega )=0$ .^[38]

In the middle of the 18th century, the theory of differential forms was first studied in the simplest case of 1-forms in the plane, i.e. $A\,dx+B\,dy$ , where $A$ and $B$ are functions in the plane. The study of 1-forms and the differentials of functions began with Clairaut's papers in 1739 and 1740. At that stage his investigations were interpreted as ways of solving ordinary differential equations. Formally Clairaut showed that a 1-form $\omega =A\,dx+B\,dy$ on an open rectangle is closed, i.e. $d\omega =0$ , if and only $\omega$ has the form $df$ for some function $f$ in the disk. The solution for $f$ can be written by Cauchy's integral formula

f(x,y)=\int _{x_{0}}^{x}A(x,y)\,dx+\int _{y_{0}}^{y}B(x,y)\,dy;

while if $\omega =df$ , the closed property $d\omega =0$ is the identity $\partial _{x}\partial _{y}f=\partial _{y}\partial _{x}f$ . (In modern language this is one version of the Poincaré lemma.)^[39]

Notes

^ ^a ^b These can also be rephrased in terms of the action of operators on Schwartz functions on the plane. Under Fourier transform, the difference and differential operators are just multiplication operators.^[18]

^ "Young's Theorem" (PDF). University of California Berkeley. Archived from the original (PDF) on 2006-05-18. Retrieved 2015-01-02.
^ Allen 1964, pp. 300–305.
^ Euler 1740.
^ Sandifer 2007, pp. 142–147, footnote: Comm. Acad. Sci. Imp. Petropol. 7 (1734/1735) 1740, 174-189, 180-183; Opera Omnia, 1.22, 34-56..
^ Minguzzi 2015.
^ Lindelöf 1867.
^ ^a ^b Higgins 1940.
^ Schwarz 1873.
^ James 1966, p. ^{[page needed]}.
^ ^a ^b Burkill 1962, pp. 154–155
^ Apostol 1965.
^ Rudin 1976.
^ Hörmander 2015, pp. 7, 11. This condensed account is possibly the shortest.
^ Dieudonné 1960, pp. 179–180.
^ Godement 1998b, pp. 287–289.
^ Lang 1969, pp. 108–111.
^ Cartan 1971, pp. 64–67.
^ Hörmander 2015, Chapter VII.
^ Hörmander 2015, p. 6.
^ Rudin 1976, p. ^{[page needed]}.
^ Hörmander 2015, p. 11.
^ Dieudonné 1960.
^ Godement 1998a.
^ Lang 1969.
^ Cartan 1971.
^ Titchmarsh 1939, p. ^{[page needed]}.
^ Titchmarsh 1939, pp. 23–25.
^ Titchmarsh 1939, pp. 49–50.
^ Spivak 1965, p. 61.
^ McGrath 2014.
^ Aksoy & Martelli 2002.
^ Axler 2020, pp. 142–143.
^ Marshall, Donald E., Theorems of Fubini and Clairaut (PDF), University of Washington
^ Hubbard & Hubbard 2015, pp. 732–733.
^ Rudin 1976, pp. 235–236.
^ Hobson 1921, pp. 403–404.
^ Apostol 1974, pp. 358–359.
^ Tu 2010.
^ Katz 1981.

References

Aksoy, A.; Martelli, M. (2002), "Mixed Partial Derivatives and Fubini's Theorem", College Mathematics Journal of MAA, 33 (2): 126–130, doi:10.1080/07468342.2002.11921930, S2CID 124561972
Allen, R. G. D. (1964). Mathematical Analysis for Economists. New York: St. Martin's Press. ISBN 9781443725224.
Apostol, Tom M. (1965), Mathematical analysis: a modern approach to advanced calculus, London: Addison-Wesley, OCLC 901554874
Apostol, Tom M. (1974), Mathematical Analysis, Addison-Wesley, ISBN 9780201002881
Axler, Sheldon (2020), Measure, integration & real analysis, Graduate Texts in Mathematics, vol. 282, Springer, ISBN 9783030331436
Bourbaki, Nicolas (1952), "Chapitre III: Mesures sur les espaces localement compacts", Eléments de mathématique, Livre VI: Intégration (in French), Hermann et Cie
Burkill, J. C. (1962), A First Course in Mathematical Analysis, Cambridge University Press, ISBN 9780521294683 (reprinted 1978)
Cartan, Henri (1971), Calcul Differentiel (in French), Hermann, ISBN 9780395120330
Clairaut, A. C. (1739), "Recherches générales sur le calcul intégral", Mémoires de l'Académie Royale des Sciences: 425–436
Clairaut, A. C. (1740), "Sur l'integration ou la construction des equations différentielles du premier ordre", Mémoires de l'Académie Royale des Sciences, 2: 293–323
Dieudonné, J. (1937), "Sur les fonctions continues numérique définies dans une produit de deux espaces compacts", Comptes Rendus de l'Académie des Sciences de Paris, 205: 593–595
Dieudonné, J. (1960), Foundations of Modern Analysis, Pure and Applied Mathematics, vol. 10, Academic Press, ISBN 9780122155505
Dieudonné, J. (1976), Treatise on analysis. Vol. II., Pure and Applied Mathematics, vol. 10-II, translated by I. G. Macdonald, Academic Press, ISBN 9780122155024
Euler, Leonhard (1740). "De infinitis curvis eiusdem generis seu methodus inveniendi aequationes pro infinitis curvis eiusdem generis" [On infinite(ly many) curves of the same type, that is, a method of finding equations for infinite(ly many) curves of the same type]. Commentarii Academiae Scientiarum Petropolitanae (in Latin). 7: 174–189, 180–183 – via The Euler Archive, maintained by the University of the Pacific.
Gilkey, Peter; Park, JeongHyeong; Vázquez-Lorenzo, Ramón (2015), Aspects of differential geometry I, Synthesis Lectures on Mathematics and Statistics, vol. 15, Morgan & Claypool, ISBN 9781627056632
Godement, Roger (1998a), Analyse mathématique I, Springer
Godement, Roger (1998b), Analyse mathématique II, Springer
Higgins, Thomas James (1940). "A note on the history of mixed partial derivatives". Scripta Mathematica. 7: 59–62. Archived from the original on 2017-04-19. Retrieved 2017-04-19.
Hobson, E. W. (1921), The theory of functions of a real variable and the theory of Fourier's series. Vol. I., Cambridge University Press
Hörmander, Lars (2015), The Analysis of Linear Partial Differential Operators I: Distribution Theory and Fourier Analysis, Classics in Mathematics (2nd ed.), Springer, ISBN 9783642614972
Hubbard, John; Hubbard, Barbara (2015). Vector Calculus, Linear Algebra and Differential Forms (5th ed.). Matrix Editions. ISBN 9780971576681.
James, R. C. (1966). Advanced Calculus. Belmont, CA: Wadsworth.
Jordan, Camille (1893), Cours d'analyse de l'École polytechnique. Tome I. Calcul différentiel (Les Grands Classiques Gauthier-Villars), Éditions Jacques Gaba]
Katz, Victor J. (1981), "The history of differential forms from Clairaut to Poincaré", Historia Mathematica, 8 (2): 161–188, doi:10.1016/0315-0860(81)90027-6
Lang, Serge (1969), Real Analysis, Addison-Wesley, ISBN 0201041790
Lindelöf, L. L. (1867), "Remarques sur les différentes manières d'établir la formule d² z/dx dy = d² z/dy dx", Acta Societatis Scientiarum Fennicae, 8: 205–213
Loomis, Lynn H. (1953), An introduction to abstract harmonic analysis, D. Van Nostrand, hdl:2027/uc1.b4250788
McGrath, Peter J. (2014), "Another proof of Clairaut's theorem", Amer. Math. Monthly, 121 (2): 165–166, doi:10.4169/amer.math.monthly.121.02.165, S2CID 12698408
Minguzzi, E. (2015). "The equality of mixed partial derivatives under weak differentiability conditions". Real Analysis Exchange. 40: 81–98. arXiv:1309.5841. doi:10.14321/realanalexch.40.1.0081. S2CID 119315951.
Nachbin, Leopoldo (1965), Elements of approximation theory, Notas de Matemática, vol. 33, Rio de Janeiro: Fascículo publicado pelo Instituto de Matemática Pura e Aplicada do Conselho Nacional de Pesquisas
Rudin, Walter (1976), Principles of Mathematical Analysis, International Series in Pure & Applied Mathematics, McGraw-Hill, ISBN 0-07-054235-X
Sandifer, C. Edward (2007), "Mixed partial derivatives are equal", The Early Mathematics of Leonard Euler, Vol. 1, Mathematics Association of America, ISBN 9780883855591
Schwarz, H. A. (1873), "Communication", Archives des Sciences Physiques et Naturelles, 48: 38–44
Spivak, Michael (1965), Calculus on manifolds. A modern approach to classical theorems of advanced calculus, W. A. Benjamin
Tao, Terence (2006), Analysis II (PDF), Texts and Readings in Mathematics, vol. 38, Hindustan Book Agency, doi:10.1007/978-981-10-1804-6, ISBN 8185931631
Titchmarsh, E. C. (1939), The Theory of Functions (2nd ed.), Oxford University Press
Tu, Loring W. (2010), An Introduction to Manifolds (2nd ed.), New York: Springer, ISBN 978-1-4419-7399-3