Jump to content

Sturm's theorem: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
Tbsmith (talk | contribs)
No edit summary
Line 1: Line 1:
In [[mathematics]], '''Sturm's theorem''' is a symbolic procedure to determine the number of distinct [[real number|real]] [[root (mathematics)|root]]s of a [[polynomial]]. It was named for [[Jacques Charles François Sturm]], though it had actually been discovered by [[Joseph Fourier|Jean Baptiste Fourier]]; Fourier's paper, delivered on the eve of the French Revolution, was forgotten for many years.{{Fact|date=October 2007}}
In [[mathematics]], '''Sturm's theorem''' is a symbolic procedure to determine the number of distinct [[real number|real]] [[root (mathematics)|root]]s of a [[polynomial]]. It was named for [[Jacques Charles François Sturm]], though it had actually been discovered by [[Joseph Fourier|Jean Baptiste Fourier]]; Fourier's paper, delivered on the eve of the [[French Revolution]], was forgotten for many years.{{Fact|date=October 2007}}


Whereas the [[fundamental theorem of algebra]] readily yields the number of real or complex roots of a polynomial, counted according to their multiplicities, Sturm's theorem deals with real roots and disregards their multiplicities.
Whereas the [[fundamental theorem of algebra]] readily yields the number of real or complex roots of a polynomial, counted according to their multiplicities, Sturm's theorem deals with real roots and disregards their multiplicities.

Revision as of 11:29, 1 February 2009

In mathematics, Sturm's theorem is a symbolic procedure to determine the number of distinct real roots of a polynomial. It was named for Jacques Charles François Sturm, though it had actually been discovered by Jean Baptiste Fourier; Fourier's paper, delivered on the eve of the French Revolution, was forgotten for many years.[citation needed]

Whereas the fundamental theorem of algebra readily yields the number of real or complex roots of a polynomial, counted according to their multiplicities, Sturm's theorem deals with real roots and disregards their multiplicities.

Sturm chains

A Sturm chain or Sturm sequence is a finite sequence of polynomials

of decreasing degree with these following properties:

  • is square free;
  • if , then ;
  • if , then ;
  • does not change its sign.

To obtain a Sturm chain Sturm himself proposed to choose the intermediary results when applying Euclid's algorithm to p and its derivative:

That is, successively take the remainders with polynomial division and change their signs. Since for , the algorithm terminates. The final polynomial, pm, is the greatest common divisor of p and its derivative. Since p only has simple roots, pm will be a constant. The Sturm chain then is

Statement

Let σ(ξ) be the number of sign changes (zeroes are not counted) in the sequence

where p is a square-free polynomial. Sturm's theorem then states that for two real numbers a < b, the number of distinct roots in the half-open interval (a,b] is σ(a)−σ(b).

Applications

This can be used to compute the total number of real roots a polynomial has (to use, for example, as an input to a numerical root finder) by choosing a and b appropriately. For example, a bound due to Cauchy says that all real roots of a polynomial with coefficients ai are in the interval [−M,M], where

Alternatively, we can use the fact that for large x, the sign of

is , whereas is .

In this way, simply counting the sign changes in the leading coefficients in the Sturm chain readily gives the number of distinct real roots of a polynomial.

We can also determine the multiplicity of a given root, say ξ, with the help of Sturm's theorem. Indeed, suppose we know a and b bracketing ξ, with σ(a)−σ(b) = 1. Then, ξ has multiplicity m precisely when ξ is a root with multiplicity m−1 of Xr (since it is the GCD of X and its derivative).

Generalized Sturm chains

Let ξ be in the compact interval [a,b]. A generalized Sturm chain over [a,b] is a finite sequence of real polynomials (X0,X1,…,Xr) such that:

  1. X(a)X(b) ≠ 0
  2. sign(Xr) is constant on [a,b]
  3. If Xi(ξ) = 0 for 1 ≤ ir−1, then Xi−1(ξ)Xi+1(ξ) < 0.

One can check that each Sturm chain is indeed a generalized Sturm chain.

See also

  • C code from Graphic Gems by D.G. Hook and P.R. McAree.