# Variation of parameters

(Redirected from Method of variation of parameters)

In mathematics, variation of parameters, also known as variation of constants, is a general method to solve inhomogeneous linear ordinary differential equations.

For first-order inhomogeneous linear differential equations it is usually possible to find solutions via integrating factors or undetermined coefficients with considerably less effort, although those methods leverage heuristics that involve guessing and don't work for all inhomogeneous linear differential equations.

Variation of parameters extends to linear partial differential equations as well, specifically to inhomogeneous problems for linear evolution equations like the heat equation, wave equation, and vibrating plate equation. In this setting, the method is more often known as Duhamel's principle, named after Jean-Marie Duhamel(1797-1872) who first applied the method to solve the inhomogeneous heat equation. Sometimes variation of parameters itself is called Duhamel's principle and vice versa.

## History

The method of variation of parameters was introduced by the Swiss-born mathematician Leonhard Euler (1707–1783) and completed by the Italian-French mathematician Joseph-Louis Lagrange (1736–1813).[1] A forerunner of the method of variation of a celestial body's orbital elements appeared in Euler's work in 1748, while he was studying the mutual perturbations of Jupiter and Saturn.[2] In his 1749 study of the motions of the earth, Euler obtained differential equations for the orbital elements;[3] and in 1753 he applied the method to his study of the motions of the moon.[4] Lagrange first used the method in 1766.[5] Between 1778 and 1783, Lagrange further developed the method both in a series of memoirs on variations in the motions of the planets[6] and in another series of memoirs on determining the orbit of a comet from three observations.[7] (It should be noted that Euler and Lagrange applied this method to nonlinear differential equations and that, instead of varying the coefficients of linear combinations of solutions to homogeneous equations, they varied the constants of the unperturbed motions of the celestial bodies.[8]) During 1808-1810, Lagrange gave the method of variation of parameters its final form in a series of papers.[9] The central result of his study was the system of planetary equations in the form of Lagrange, which described the evolution of the Keplerian parameters (orbital elements) of a perturbed orbit.

In his description of evolving orbits, Lagrange set a reduced two-body problem as an unperturbed solution, and presumed that all perturbations come from the gravitational pull which the bodies other than the primary exert at the secondary (orbiting) body. Accordingly, his method implied that the perturbations depend solely on the position of the secondary, but not on its velocity. In the 20th century, celestial mechanics began to consider interactions which depend on both positions and velocities (relativistic corrections, atmospheric drag, inertial forces). Therefore, the method of variation of parameters used by Lagrange was extended to the situation with velocity-dependent forces.[10]

## Description of method

Given an ordinary non-homogeneous linear differential equation of order n

${\displaystyle y^{(n)}(x)+\sum _{i=0}^{n-1}a_{i}(x)y^{(i)}(x)=b(x).\quad \quad {\rm {(i)}}}$

Let ${\displaystyle y_{1}(x),\ldots ,y_{n}(x)}$ be a fundamental system of solutions of the corresponding homogeneous equation

${\displaystyle y^{(n)}(x)+\sum _{i=0}^{n-1}a_{i}(x)y^{(i)}(x)=0.\quad \quad {\rm {(ii)}}}$

Then a particular solution to the non-homogeneous equation is given by

${\displaystyle y_{p}(x)=\sum _{i=1}^{n}c_{i}(x)y_{i}(x)\quad \quad {\rm {(iii)}}}$

where the ${\displaystyle c_{i}(x)}$ are differentiable functions which are assumed to satisfy the conditions

${\displaystyle \sum _{i=1}^{n}c_{i}'(x)y_{i}^{(j)}(x)=0\,\mathrm {,} \quad j=0,\ldots ,n-2.\quad \quad {\rm {(iv)}}}$

Starting with (iii), repeated differentiation combined with repeated use of (iv) gives

${\displaystyle y_{p}^{(j)}(x)=\sum _{i=1}^{n}c_{i}(x)y_{i}^{(j)}(x)\,\mathrm {,} \quad j=0,\ldots ,n-1\,\mathrm {.} \quad \quad {\rm {(v)}}}$

One last differentiation gives

${\displaystyle y_{p}^{(n)}(x)=\sum _{i=1}^{n}c_{i}'(x)y_{i}^{(n-1)}(x)+\sum _{i=1}^{n}c_{i}(x)y_{i}^{(n)}(x)\,\mathrm {.} \quad \quad {\rm {(vi)}}}$

By substituting (iii) into (i) and applying (v) and (vi) it follows that

${\displaystyle \sum _{i=1}^{n}c_{i}'(x)y_{i}^{(n-1)}(x)=b(x).\quad \quad {\rm {(vii)}}}$

The linear system (iv and vii) of n equations can then be solved using Cramer's rule yielding

${\displaystyle c_{i}'(x)={\frac {W_{i}(x)}{W(x)}},\,\quad i=1,\ldots ,n}$

where ${\displaystyle W(x)}$ is the Wronskian determinant of the fundamental system and ${\displaystyle W_{i}(x)}$ is the Wronskian determinant of the fundamental system with the i-th column replaced by ${\displaystyle (0,0,\ldots ,b(x)).}$

The particular solution to the non-homogeneous equation can then be written as

${\displaystyle \sum _{i=1}^{n}y_{i}(x)\,\int {\frac {W_{i}(x)}{W(x)}}\ \mathrm {d} x.}$

## Examples

### First order equation

${\displaystyle y'+p(x)y=q(x)}$

The general solution of the corresponding homogeneous equation (written below) is the complementary solution to our original (inhomogeneous) equation:

${\displaystyle y'+p(x)y=0}$.

This homogeneous differential equation can be solved by different methods, for example separation of variables:

${\displaystyle {\frac {d}{dx}}y+p(x)y=0}$
${\displaystyle {\frac {dy}{dx}}=-p(x)y}$
${\displaystyle {dy \over y}=-{p(x)dx},}$
${\displaystyle \int {\frac {1}{y}}\,dy=-\int p(x)\,dx}$
${\displaystyle \ln |y|=-\int p(x)\,dx+C_{0}}$
${\displaystyle y=\pm e^{-\int p(x)\,dx+C_{0}}=C_{0}e^{-\int p(x)\,dx}}$

The complementary solution to our original equation is therefore:

${\displaystyle y_{c}=C_{0}e^{-\int p(x)\,dx}}$

${\displaystyle y'+p(x)y=q(x)}$

Using the method variation of parameters, the particular solution is formed by multiplying the complementary solution by an unknown function C(x):

${\displaystyle y_{p}=C(x)e^{-\int p(x)\,dx}}$

By substituting the particular solution into the non-homogeneous equation, we can find C(x):

${\displaystyle C'(x)e^{-\int p(x)\,dx}-C(x)p(x)e^{-\int p(x)\,dx}+p(x)C(x)e^{-\int p(x)\,dx}=q(x)}$
${\displaystyle C'(x)e^{-\int p(x)\,dx}=q(x)}$
${\displaystyle C'(x)=q(x)e^{\int p(x)\,dx}}$
${\displaystyle C(x)=\int q(x)e^{\int p(x)\,dx}\,dx+C_{1}}$

We only need a single particular solution, so we arbitrarily select ${\displaystyle C_{1}=0}$ for simplicity. Therefore the particular solution is:

${\displaystyle y_{p}=e^{-\int p(x)\,dx}\int q(x)e^{\int p(x)\,dx}\,dx}$

The final solution of the differential equation is:

${\displaystyle y=y_{c}+y_{p}}$
${\displaystyle y=e^{-\int p(x)\,dx}\int q(x)e^{\int p(x)\,dx}\,dx+C_{0}e^{-\int p(x)\,dx}}$

### Specific second order equation

Let us solve

${\displaystyle y''+4y'+4y=\cosh {x}.}$

We want to find the general solution to the differential equation, that is, we want to find solutions to the homogeneous differential equation

${\displaystyle y''+4y'+4y=0.}$

The characteristic equation is:

${\displaystyle \lambda ^{2}+4\lambda +4=(\lambda +2)^{2}=0}$

Since ${\displaystyle \lambda =-2}$ is a repeated root, we have to introduce a factor of x for one solution to ensure linear independence: u1 = e−2x and u2 = xe−2x. The Wronskian of these two functions is

${\displaystyle W={\begin{vmatrix}e^{-2x}&xe^{-2x}\\-2e^{-2x}&-e^{-2x}(2x-1)\\\end{vmatrix}}=-e^{-2x}e^{-2x}(2x-1)+2xe^{-2x}e^{-2x}=e^{-4x}.}$

Because the Wronskian is non-zero, the two functions are linearly independent, so this is in fact the general solution for the homogeneous differential equation (and not a mere subset of it).

We seek functions A(x) and B(x) so A(x)u1 + B(x)u2 is a general solution of the non-homogeneous equation. We need only calculate the integrals

${\displaystyle A(x)=-\int {1 \over W}u_{2}(x)b(x)\,\mathrm {d} x,\;B(x)=\int {1 \over W}u_{1}(x)b(x)\,\mathrm {d} x}$

Recall that for this example

${\displaystyle b(x)=\cosh {x}}$

That is,

${\displaystyle A(x)=-\int {1 \over e^{-4x}}xe^{-2x}\cosh {x}\,\mathrm {d} x=-\int xe^{2x}\cosh {x}\,\mathrm {d} x=-{1 \over 18}e^{x}(9(x-1)+e^{2x}(3x-1))+C_{1}}$
${\displaystyle B(x)=\int {1 \over e^{-4x}}e^{-2x}\cosh {x}\,\mathrm {d} x=\int e^{2x}\cosh {x}\,\mathrm {d} x={1 \over 6}e^{x}(3+e^{2x})+C_{2}}$

where ${\displaystyle C_{1}}$ and ${\displaystyle C_{2}}$ are constants of integration.

### General second order equation

We have a differential equation of the form

${\displaystyle u''+p(x)u'+q(x)u=f(x)}$

and we define the linear operator

${\displaystyle L=D^{2}+p(x)D+q(x)}$

where D represents the differential operator. We therefore have to solve the equation ${\displaystyle Lu(x)=f(x)}$ for ${\displaystyle u(x)}$, where ${\displaystyle L}$ and ${\displaystyle f(x)}$ are known.

We must solve first the corresponding homogeneous equation:

${\displaystyle u''+p(x)u'+q(x)u=0}$

by the technique of our choice. Once we've obtained two linearly independent solutions to this homogeneous differential equation (because this ODE is second-order) — call them u1 and u2 — we can proceed with variation of parameters.

Now, we seek the general solution to the differential equation ${\displaystyle u_{G}(x)}$ which we assume to be of the form

${\displaystyle u_{G}(x)=A(x)u_{1}(x)+B(x)u_{2}(x).}$

Here, ${\displaystyle A(x)}$ and ${\displaystyle B(x)}$ are unknown and ${\displaystyle u_{1}(x)}$ and ${\displaystyle u_{2}(x)}$ are the solutions to the homogeneous equation. (Observe that if ${\displaystyle A(x)}$ and ${\displaystyle B(x)}$ are constants, then ${\displaystyle Lu_{G}(x)=0}$.) Since the above is only one equation and we have two unknown functions, it is reasonable to impose a second condition. We choose the following:

${\displaystyle A'(x)u_{1}(x)+B'(x)u_{2}(x)=0.}$

Now,

{\displaystyle {\begin{aligned}u_{G}'(x)&=\left(A(x)u_{1}(x)+B(x)u_{2}(x)\right)'\\&=\left(A(x)u_{1}(x)\right)'+\left(B(x)u_{2}(x)\right)'\\&=A'(x)u_{1}(x)+A(x)u_{1}'(x)+B'(x)u_{2}(x)+B(x)u_{2}'(x)\\&=A'(x)u_{1}(x)+B'(x)u_{2}(x)+A(x)u_{1}'(x)+B(x)u_{2}'(x)\\&=A(x)u_{1}'(x)+B(x)u_{2}'(x)&&A'(x)u_{1}(x)+B'(x)u_{2}(x)=0\end{aligned}}}

Differentiating again (omitting intermediary steps)

${\displaystyle u_{G}''(x)=A(x)u_{1}''(x)+B(x)u_{2}''(x)+A'(x)u_{1}'(x)+B'(x)u_{2}'(x).}$

Now we can write the action of L upon uG as

${\displaystyle Lu_{G}=A(x)Lu_{1}(x)+B(x)Lu_{2}(x)+A'(x)u_{1}'(x)+B'(x)u_{2}'(x).}$

Since u1 and u2 are solutions, then

${\displaystyle Lu_{G}=A'(x)u_{1}'(x)+B'(x)u_{2}'(x).}$

We have the system of equations

${\displaystyle {\begin{pmatrix}u_{1}(x)&u_{2}(x)\\u_{1}'(x)&u_{2}'(x)\end{pmatrix}}{\begin{pmatrix}A'(x)\\B'(x)\end{pmatrix}}={\begin{pmatrix}0\\f\end{pmatrix}}.}$

Expanding,

${\displaystyle {\begin{pmatrix}A'(x)u_{1}(x)+B'(x)u_{2}(x)\\A'(x)u_{1}'(x)+B'(x)u_{2}'(x)\end{pmatrix}}={\begin{pmatrix}0\\f\end{pmatrix}}.}$

So the above system determines precisely the conditions

${\displaystyle A'(x)u_{1}(x)+B'(x)u_{2}(x)=0.}$
${\displaystyle A'(x)u_{1}'(x)+B'(x)u_{2}'(x)=Lu_{G}=f.}$

We seek A(x) and B(x) from these conditions, so, given

${\displaystyle {\begin{pmatrix}u_{1}(x)&u_{2}(x)\\u_{1}'(x)&u_{2}'(x)\end{pmatrix}}{\begin{pmatrix}A'(x)\\B'(x)\end{pmatrix}}={\begin{pmatrix}0\\f\end{pmatrix}}}$

we can solve for (A′(x), B′(x))T, so

${\displaystyle {\begin{pmatrix}A'(x)\\B'(x)\end{pmatrix}}={\begin{pmatrix}u_{1}(x)&u_{2}(x)\\u_{1}'(x)&u_{2}'(x)\end{pmatrix}}^{-1}{\begin{pmatrix}0\\f\end{pmatrix}}={\frac {1}{W}}{\begin{pmatrix}u_{2}'(x)&-u_{2}(x)\\-u_{1}'(x)&u_{1}(x)\end{pmatrix}}{\begin{pmatrix}0\\f\end{pmatrix}},}$

where W denotes the Wronskian of u1 and u2. (We know that W is nonzero, from the assumption that u1 and u2 are linearly independent.) So,

{\displaystyle {\begin{aligned}A'(x)&=-{1 \over W}u_{2}(x)f(x),\;B'(x)={1 \over W}u_{1}(x)f(x)\\A(x)&=-\int {1 \over W}u_{2}(x)f(x)\,\mathrm {d} x,\;B(x)=\int {1 \over W}u_{1}(x)f(x)\,\mathrm {d} x\end{aligned}}}

While homogeneous equations are relatively easy to solve, this method allows the calculation of the coefficients of the general solution of the inhomogeneous equation, and thus the complete general solution of the inhomogeneous equation can be determined.

Note that ${\displaystyle A(x)}$ and ${\displaystyle B(x)}$ are each determined only up to an arbitrary additive constant (the constant of integration). Adding a constant to ${\displaystyle A(x)}$ or ${\displaystyle B(x)}$ does not change the value of ${\displaystyle Lu_{G}(x)}$ because the extra term is just a linear combination of u1 and u2, which is a solution of ${\displaystyle L}$ by definition.

## References

1. ^ See:
2. ^ Euler, L. (1748) "Recherches sur la question des inégalités du mouvement de Saturne et de Jupiter, sujet proposé pour le prix de l'année 1748, par l’Académie Royale des Sciences de Paris" [Investigations on the question of the differences in the movement of Saturn and Jupiter; this subject proposed for the prize of 1748 by the Royal Academy of Sciences (Paris)] (Paris, France: G. Martin, J.B. Coignard, & H.L. Guerin, 1749).
3. ^ Euler, L. (1749) "Recherches sur la précession des équinoxes, et sur la nutation de l’axe de la terre," Histoire [or Mémoires ] de l'Académie Royale des Sciences et Belles-lettres (Berlin), pages 289-325 [published in 1751].
4. ^ Euler, L. (1753) Theoria motus lunae: exhibens omnes ejus inaequalitates ... [The theory of the motion of the moon: demonstrating all of its inequalities ... ] (Saint Petersburg, Russia: Academia Imperialis Scientiarum Petropolitanae [Imperial Academy of Science (St. Petersburg)], 1753).
5. ^ Lagrange, J.-L. (1766) “Solution de différens problèmes du calcul integral,” Mélanges de philosophie et de mathématique de la Société royale de Turin, vol. 3, pages 179-380.
6. ^ See:
7. ^ See:
8. ^ Michael Efroimsky (2002) "Implicit gauge symmetry emerging in the N-body problem of celestial mechanics," page 3.
9. ^ See:
• Lagrange, J.-L. (1808) “Sur la théorie des variations des éléments des planètes et en particulier des variations des grands axes de leurs orbites,” Mémoires de la première Classe de l’Institut de France. Reprinted in: Joseph-Louis Lagrange with Joseph-Alfred Serret, ed., Oeuvres de Lagrange (Paris, France: Gauthier-Villars, 1873), vol. 6, pages 713-768.
• Lagrange, J.-L. (1809) “Sur la théorie générale de la variation des constantes arbitraires dans tous les problèmes de la méchanique,” Mémoires de la première Classe de l’Institut de France. Reprinted in: Joseph-Louis Lagrange with Joseph-Alfred Serret, ed., Oeuvres de Lagrange (Paris, France: Gauthier-Villars, 1873), vol. 6, pages 771-805.
• Lagrange, J.-L. (1810) “Second mémoire sur la théorie générale de la variation des constantes arbitraires dans tous les problèmes de la méchanique, ... ,” Mémoires de la première Classe de l’Institut de France. Reprinted in: Joseph-Louis Lagrange with Joseph-Alfred Serret, ed., Oeuvres de Lagrange (Paris, France: Gauthier-Villars, 1873), vol. 6, pages 809-816.
10. ^ See: