Aberth method

The Aberth method, or Aberth–Ehrlich method or Ehrlich–Aberth method, named after Oliver Aberth^[1] and Louis W. Ehrlich,^[2] is a root-finding algorithm developed in 1967 for simultaneous approximation of all the roots of a univariate polynomial.

This method converges cubically, an improvement over the Durand–Kerner method, another algorithm for approximating all roots at once, which converges quadratically.^[1]^[2] (However, both algorithms converge linearly at multiple zeros.^[3])

This method is used in MPSolve, which is the reference software for approximating all roots of a polynomial to an arbitrary precision.

Description[edit]

Let $p(x)=p_{n}x^{n}+p_{n-1}x^{n-1}+\cdots +p_{1}x+p_{0}$ be a univariate polynomial of degree $n$ with real or complex coefficients. Then there exist complex numbers $z_{1}^{*},\,z_{2}^{*},\dots ,z_{n}^{*}$ , the roots of $p(x)$ , that give the factorization:

p(x)=p_{n}\cdot (x-z_{1}^{*})\cdot (x-z_{2}^{*})\cdots (x-z_{n}^{*}).

Although those numbers are unknown, upper and lower bounds for their absolute values are computable from the coefficients of the polynomial. Now one can pick $n$ distinct numbers in the complex plane—randomly or evenly distributed—such that their absolute values are within the same bounds. (Also, if the zeros are symmetrical, the starting points must not be exactly symmetrical along the same axis, as this can prevent convergence.)^[1] A set of such numbers is called an initial approximation of the set of roots of $p(x)$ . This approximation can be iteratively improved using the following procedure.

Let $z_{1},\dots ,z_{n}\in \mathbb {C}$ be the current approximations of the zeros of $p(x)$ . Then offset numbers $w_{1},\dots ,w_{n}\in \mathbb {C}$ are computed as

w_{k}={\frac {\frac {p(z_{k})}{p'(z_{k})}}{1-{\frac {p(z_{k})}{p'(z_{k})}}\cdot \sum _{j\neq k}{\frac {1}{z_{k}-z_{j}}}}},

where $p'(z_{k})$ is the polynomial derivative of $p$ evaluated in the point $z_{k}$ .

The next set of approximations of roots of $p(x)$ is then $z_{1}-w_{1},\dots ,z_{n}-w_{n}$ . One can measure the quality of the current approximation by the values of the polynomial or by the size of the offsets.

Conceptually, this method uses an electrostatic analogy, modeling the approximated zeros as movable negative point charges, which converge toward the true zeros, represented by fixed positive point charges.^[1] A direct application of Newton's method to each approximated zero will often cause multiple starting points to incorrectly converge to the same root. The Aberth method avoids this by also modeling the repulsive effect the movable charges have on each other. In this way, when a movable charge has converged on a zero, their charges will cancel out, so that other movable charges are no longer attracted to that location, encouraging them to converge to other "unoccupied" zeros. (Stieltjes also modeled the positions of zeros of polynomials as solutions to electrostatic problems.)

Inside the formula of the Aberth method one can find elements of Newton's method and the Durand–Kerner method. Details for an efficient implementation, esp. on the choice of good initial approximations, can be found in Bini (1996).^[3]

The updates of the roots may be executed as a simultaneous Jacobi-like iteration where first all new approximations are computed from the old approximations or as a sequential Gauss–Seidel-like iteration that uses each new approximation from the time it is computed.

A very similar method is the Newton-Maehly method. It computes the zeros one after another, but instead of an explicit deflation it divides by the already acquired linear factors on the fly. The Aberth method is like the Newton-Maehly method for computing the last root while pretending you have already found the other ones.^[4]

Derivation from Newton's method[edit]

The iteration formula is the univariate Newton iteration for the function

F(x)={\frac {p(x)}{\prod _{j=1;\,j\neq k}^{n}(x-z_{j})}}

If the values $z_{1},\dots ,z_{n}$ are already close to the roots of $p(x)$ , then the rational function $F(x)$ is almost linear with a dominant root close to $z_{k}$ and poles at $z_{1},\dots ,z_{k-1},z_{k+1},\dots ,z_{n}$ that direct the Newton iteration away from the roots of p(x) that are close to them. That is, the corresponding basins of attraction get rather small, while the root close to $z_{k}$ has a wide region of attraction.

The Newton step ${\tfrac {F(x)}{F'(x)}}$ in the univariate case is the reciprocal value to the logarithmic derivative

{\begin{aligned}{\frac {F'(x)}{F(x)}}&={\frac {d}{dx}}\ln |F(x)|\\&={\frac {d}{dx}}{\big (}\ln |p(x)|-\sum _{j=1;\,j\neq k}^{n}\ln |x-z_{j}|{\big )}\\&={\frac {p'(x)}{p(x)}}-\sum _{j=1;\,j\neq k}^{n}{\frac {1}{x-z_{j}}}\end{aligned}}

Thus, the new approximation is computed as

z_{k}'=z_{k}-{\frac {F(z_{k})}{F'(z_{k})}}=z_{k}-{\frac {1}{{\frac {p'(z_{k})}{p(z_{k})}}-\sum _{j=1;\,j\neq k}^{n}{\frac {1}{z_{k}-z_{j}}}}}\,,

which is the update formula of the Aberth–Ehrlich method.

Literature[edit]

^ ^a ^b ^c ^d Aberth, Oliver (1973). "Iteration methods for finding all zeros of a polynomial simultaneously". Math. Comp. 27 (122). Mathematics of Computation, Vol. 27, No. 122: 339–344. doi:10.2307/2005621. JSTOR 2005621. Because of the obvious analogy from electrostatics, this field may be called the field of a unit plus charge ... To avoid this, we assign a unit minus charge at each sampling point. The idea here is that when a sampling point z, is near a simple zero, the field from the minus charge at z, should counteract that from the plus charge at the zero, preventing a second sampling point from converging to this zero.
^ ^a ^b Ehrlich, Louis W. (1967). "A modified Newton method for polynomials". Comm. ACM. 10 (2): 107–108. doi:10.1145/363067.363115.
^ ^a ^b Bini, Dario Andrea (1996). "Numerical computation of polynomial zeros by means of Aberth's method". Numerical Algorithms. 13 (2): 179–200. Bibcode:1996NuAlg..13..179B. doi:10.1007/BF02207694. S2CID 23899456.
^ Bauer, F.L.; Stoer, J. (1962). "Algorithm 105: Newton Maehly". Comm. ACM. 5 (7): 387–388. doi:10.1145/368273.368423.

Description[edit]

Derivation from Newton's method[edit]

Literature[edit]

See also[edit]