# Householder's method

In mathematics, and more specifically in numerical analysis, Householder's methods are a class of root-finding algorithms that are used for functions of one real variable with continuous derivatives up to some order d + 1. Each of these methods is characterized by the number d, which is known as the order of the method. The algorithm is iterative and has a rate of convergence of d + 1.

These methods are named after the American mathematician Alston Scott Householder.

## Method

Householder's method is a numerical algorithm for solving the nonlinear equation f(x) = 0. In this case, the function f has to be a function of one real variable. The method consists of a sequence of iterations

${\displaystyle x_{n+1}=x_{n}+d\;{\frac {\left(1/f\right)^{(d-1)}(x_{n})}{\left(1/f\right)^{(d)}(x_{n})}}}$

beginning with an initial guess x0.[1]

If f is a d + 1 times continuously differentiable function and a is a zero of f but not of its derivative, then, in a neighborhood of a, the iterates xn satisfy:[citation needed]

${\displaystyle |x_{n+1}-a|\leq K\cdot {|x_{n}-a|}^{d+1}}$, for some ${\displaystyle K>0.\!}$

This means that the iterates converge to the zero if the initial guess is sufficiently close, and that the convergence has order d + 1.

Despite their order of convergence, these methods are not widely used because the gain in precision is not commensurate with the rise in effort for large d. The Ostrowski index expresses the error reduction in the number of function evaluations instead of the iteration count.[2]

• For polynomials, the evaluation of the first d derivatives of f at xn using the Horner method has an effort of d + 1 polynomial evaluations. Since n(d + 1) evaluations over n iterations give an error exponent of (d + 1)n, the exponent for one function evaluation is ${\displaystyle {\sqrt[{d+1}]{d+1}}}$, numerically 1.4142, 1.4422, 1.4142, 1.3797 for d = 1, 2, 3, 4, and falling after that.
• For general functions the derivative evaluation using the Taylor arithmetic of automatic differentiation requires the equivalent of (d + 1)(d + 2)/2 function evaluations. One function evaluation thus reduces the error by an exponent of ${\displaystyle {\sqrt[{3}]{2}}\approx 1.2599}$ for Newton's method, ${\displaystyle {\sqrt[{6}]{3}}\approx 1.2009}$ for Halley's method and falling towards 1 or linear convergence for the higher order methods.

## Motivation

### First approach

An approximate idea of Householder's method derives from the geometric series. Let the real-valued, continuously differentiable function f(x) have a simple zero at x = a, that is a root f(a) = 0 of multiplicity one, which is equivalent to ${\displaystyle f'(a)\neq 0}$. Then 1/f(x) has a singularity at a, specifically a simple pole (also of multiplicity one), and close to a the behavior of 1/f(x) is dominated by 1/(xa). Approximately, one gets

${\displaystyle {\frac {1}{f(x)}}={\frac {1}{f(x)-f(a)}}={\frac {x-a}{f(x)-f(a)}}\cdot {\frac {-1}{a(1-x/a)}}\approx {\frac {-1}{af'(a)}}\cdot \sum _{k=0}^{\infty }\left({\frac {x}{a}}\right)^{k}.}$

Here ${\displaystyle f'(a)\neq 0}$ because a is a simple zero of f(x). The coefficient of degree d has the value ${\displaystyle C\,a^{-d}}$. Thus, one can now reconstruct the zero a by dividing the coefficient of degree d − 1 by the coefficient of degree d. Since this geometric series is an approximation to the Taylor expansion of 1/f(x), one can get estimates of the zero of f(x) – now without prior knowledge of the location of this zero – by dividing the corresponding coefficients of the Taylor expansion of 1/f(x) or, more generally, 1/f(b+x). From that one gets, for any integer d, and if the corresponding derivatives exist,

${\displaystyle a\approx b+{\frac {(1/f)^{(d-1)}(b)}{(d-1)!}}\;{\frac {d!}{(1/f)^{(d)}(b)}}=b+d\;{\frac {(1/f)^{(d-1)}(b)}{(1/f)^{(d)}(b)}}.}$

### Second approach

Suppose x = a is a simple root. Then near x = a, (1/f)(x) is a meromorphic function. Suppose we have the Taylor expansion:

${\displaystyle (1/f)(x)=\sum _{d=0}^{\infty }{\frac {(1/f)^{(d)}(b)}{d!}}(x-b)^{d}.}$

By König's theorem, we have:

${\displaystyle a-b=\lim _{d\rightarrow \infty }{\frac {\frac {(1/f)^{(d-1)}(b)}{(d-1)!}}{\frac {(1/f)^{(d)}(b)}{d!}}}=\lim _{d\rightarrow \infty }d{\frac {(1/f)^{(d-1)}(b)}{(1/f)^{(d)}(b)}}.}$

This suggests that Householder's iteration might be a good convergent iteration. The actual proof of the convergence is also based on this idea.

## The methods of lower order

Householder's method of order 1 is just Newton's method, since:

${\displaystyle {\begin{array}{rl}x_{n+1}=&x_{n}+1\,{\frac {\left(1/f\right)(x_{n})}{\left(1/f\right)^{(1)}(x_{n})}}\\[.7em]=&x_{n}+{\frac {1}{f(x_{n})}}\cdot \left({\frac {-f'(x_{n})}{f(x_{n})^{2}}}\right)^{-1}\\[.7em]=&x_{n}-{\frac {f(x_{n})}{f'(x_{n})}}.\end{array}}}$

For Householder's method of order 2 one gets Halley's method, since the identities

${\displaystyle \textstyle (1/f)'(x)=-{\frac {f'(x)}{f(x)^{2}}}\ }$

and

${\displaystyle \textstyle \ (1/f)''(x)=-{\frac {f''(x)}{f(x)^{2}}}+2{\frac {f'(x)^{2}}{f(x)^{3}}}}$

result in

${\displaystyle {\begin{array}{rl}x_{n+1}=&x_{n}+2\,{\frac {\left(1/f\right)'(x_{n})}{\left(1/f\right)''(x_{n})}}\\[1em]=&x_{n}+{\frac {-2f(x_{n})\,f'(x_{n})}{-f(x_{n})f''(x_{n})+2f'(x_{n})^{2}}}\\[1em]=&x_{n}-{\frac {f(x_{n})f'(x_{n})}{f'(x_{n})^{2}-{\tfrac {1}{2}}f(x_{n})f''(x_{n})}}\\[1em]=&x_{n}+h_{n}\;{\frac {1}{1+{\frac {1}{2}}(f''/f')(x_{n})\,h_{n}}}.\end{array}}}$

In the last line, ${\displaystyle h_{n}=-{\tfrac {f(x_{n})}{f'(x_{n})}}}$ is the update of the Newton iteration at the point ${\displaystyle x_{n}}$. This line was added to demonstrate where the difference to the simple Newton's method lies.

The third order method is obtained from the identity of the third order derivative of 1/f

${\displaystyle \textstyle (1/f)'''(x)=-{\frac {f'''(x)}{f(x)^{2}}}+6{\frac {f'(x)\,f''(x)}{f(x)^{3}}}-6{\frac {f'(x)^{3}}{f(x)^{4}}}}$

and has the formula

${\displaystyle {\begin{array}{rl}x_{n+1}=&x_{n}+3\,{\frac {\left(1/f\right)''(x_{n})}{\left(1/f\right)'''(x_{n})}}\\[1em]=&x_{n}-{\frac {6f(x_{n})\,f'(x_{n})^{2}-3f(x_{n})^{2}f''(x_{n})}{6f'(x_{n})^{3}-6f(x_{n})f'(x_{n})\,f''(x_{n})+f(x_{n})^{2}\,f'''(x_{n})}}\\[1em]=&x_{n}+h_{n}{\frac {1+{\frac {1}{2}}(f''/f')(x_{n})\,h_{n}}{1+(f''/f')(x_{n})\,h_{n}+{\frac {1}{6}}(f'''/f')(x_{n})\,h_{n}^{2}}}\end{array}}}$

and so on.

## Example

The first problem solved by Newton with the Newton-Raphson-Simpson method was the polynomial equation ${\displaystyle y^{3}-2y-5=0}$. He observed that there should be a solution close to 2. Replacing y = x + 2 transforms the equation into

${\displaystyle 0=f(x)=-1+10x+6x^{2}+x^{3}}$.

The Taylor series of the reciprocal function starts with

${\displaystyle {\begin{array}{rl}1/f(x)=&-1-10\,x-106\,x^{2}-1121\,x^{3}-11856\,x^{4}-125392\,x^{5}\\&-1326177\,x^{6}-14025978\,x^{7}-148342234\,x^{8}-1568904385\,x^{9}\\&-16593123232\,x^{10}+O(x^{11})\end{array}}}$

The result of applying Householder's methods of various orders at x = 0 is also obtained by dividing neighboring coefficients of the latter power series. For the first orders one gets the following values after just one iteration step: For an example, in the case of the 3rd order, ${\displaystyle x_{1}=0.0+106/1121=0.09455842997324}$.

d x1
1 0.100000000000000000000000000000000
2 0.094339622641509433962264150943396
3 0.094558429973238180196253345227475
4 0.094551282051282051282051282051281
5 0.094551486538216154140615031261961
6 0.094551481438752142436492263099118
7 0.094551481543746895938379484125813
8 0.094551481542336756233561913325371
9 0.094551481542324837086869382419375
10 0.094551481542326678478801765822984

As one can see, there are a little bit more than d correct decimal places for each order d. The first one hundred digits of the correct solution are 0.0945514815423265914823865405793029638573061056282391803041285290453121899834836671462672817771577578.

Let's calculate the ${\displaystyle x_{2},x_{3},x_{4}}$ values for some lowest order,

${\displaystyle f=-1+10x+6x^{2}+x^{3}}$
${\displaystyle f^{\prime }=10+12x+3x^{2}}$
${\displaystyle f^{\prime \prime }=12+6x}$
${\displaystyle f^{\prime \prime \prime }=6}$

And using following relations,

1st order; ${\displaystyle x_{i+1}=x_{i}-f(x_{i})/f^{\prime }(x_{i})}$
2nd order; ${\displaystyle x_{i+1}=x_{i}+(-2ff^{\prime })/(2{f^{\prime }}^{2}-ff^{\prime \prime })}$
3rd order; ${\displaystyle x_{i+1}=x_{i}-{\frac {6f{f^{\prime }}^{2}-3f^{2}f^{\prime \prime }}{6{f^{\prime }}^{3}-6ff^{\prime }f^{\prime \prime }+f^{2}f^{\prime \prime \prime }}}}$
x 1st (Newton) 2nd (Halley) 3rd order 4th order
x1 0.100000000000000000000000000000000 0.094339622641509433962264150943395 0.094558429973238180196253345227475 0.09455128205128
x2 0.094568121104185218165627782724844 0.094551481540164214717107966227500 0.094551481542326591482567319958483
x3 0.094551481698199302883823703544266 0.094551481542326591482386540579303 0.094551481542326591482386540579303
x4 0.094551481542326591496064847153714 0.094551481542326591482386540579303 0.094551481542326591482386540579303
x5 0.094551481542326591482386540579303
x6 0.094551481542326591482386540579303

## Derivation

An exact derivation of Householder's methods starts from the Padé approximation of order d + 1 of the function, where the approximant with linear numerator is chosen. Once this has been achieved, the update for the next approximation results from computing the unique zero of the numerator.

The Padé approximation has the form

${\displaystyle f(x+h)={\frac {a_{0}+h}{b_{0}+b_{1}h+\cdots +b_{d-1}h^{d-1}}}+O(h^{d+1}).}$

The rational function has a zero at ${\displaystyle h=-a_{0}}$.

Just as the Taylor polynomial of degree d has d + 1 coefficients that depend on the function f, the Padé approximation also has d + 1 coefficients dependent on f and its derivatives. More precisely, in any Padé approximant, the degrees of the numerator and denominator polynomials have to add to the order of the approximant. Therefore, ${\displaystyle b_{d}=0}$ has to hold.

One could determine the Padé approximant starting from the Taylor polynomial of f using Euclid's algorithm. However, starting from the Taylor polynomial of 1/f is shorter and leads directly to the given formula. Since

${\displaystyle (1/f)(x+h)=(1/f)(x)+(1/f)'(x)h+\cdots +(1/f)^{(d-1)}(x){\frac {h^{d-1}}{(d-1)!}}+(1/f)^{(d)}(x){\frac {h^{d}}{d!}}+O(h^{d+1})}$

has to be equal to the inverse of the desired rational function, we get after multiplying with ${\displaystyle a_{0}+h}$ in the power ${\displaystyle h^{d}}$ the equation

${\displaystyle 0=b_{d}=a_{0}(1/f)^{(d)}(x){\frac {1}{d!}}+(1/f)^{(d-1)}(x){\frac {1}{(d-1)!}}}$.

Now, solving the last equation for the zero ${\displaystyle h=-a_{0}}$ of the numerator results in

${\displaystyle {\begin{array}{rl}h=&-a_{0}={\frac {{\frac {1}{(d-1)!}}(1/f)^{(d-1)}(x)}{{\frac {1}{d!}}(1/f)^{(d)}(x)}}\\[1em]=&d\,{\frac {(1/f)^{(d-1)}(x)}{(1/f)^{(d)}(x)}}\end{array}}}$.

This implies the iteration formula

${\displaystyle x_{n+1}=x_{n}+d\;{\frac {\left(1/f\right)^{(d-1)}(x_{n})}{\left(1/f\right)^{(d)}(x_{n})}}}$.

## Relation to Newton's method

Householder's method applied to the real-valued function f(x) is the same as Newton's method applied to the function g(x):

${\displaystyle x_{n+1}=x_{n}-{\frac {g(x_{n})}{g'(x_{n})}}}$

with

${\displaystyle g(x)=\left|(1/f)^{(d-1)}\right|^{-1/d}\,.}$

In particular, d = 1 gives Newton's method unmodified and d = 2 gives Halley's method.

## References

1. ^ Householder, Alston Scott (1970). The Numerical Treatment of a Single Nonlinear Equation. McGraw-Hill. p. 169. ISBN 0-07-030465-3.
2. ^ Ostrowski, A. M. (1966). Solution of Equations and Systems of Equations. Pure and Applied Mathematics. 9 (Second ed.). New York: Academic Press.