Estrin's scheme

In numerical analysis, Estrin's scheme (after Gerald Estrin), also known as Estrin's method, is an algorithm for numerical evaluation of polynomials.

The Horner scheme is one of the most commonly used algorithms for evaluating polynomials and, unlike Estrin's scheme, it is optimal in the sense that it minimizes the number of multiplications and additions required to evaluate an arbitrary polynomial. On a modern processor architecture that allows out-of-order execution, instructions that do not depend on each other's results may run in parallel. In the Horner scheme, however, each multiplication and addition depends on the result of the previous one, so the operations cannot execute in parallel. Estrin's scheme is one method that attempts to overcome this serialization while keeping the operation count reasonably close to optimal.
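The serial dependency in the Horner scheme can be seen in a minimal Python sketch. The function name horner and the coefficient ordering (lowest degree first) are assumptions made for illustration only.

```python
def horner(coeffs, x):
    """Evaluate C0 + C1*x + ... + Cn*x**n, with coeffs = [C0, C1, ..., Cn]."""
    result = 0
    # Work from the highest-degree coefficient down to C0.
    for c in reversed(coeffs):
        # Each step reads the result of the previous step,
        # so the multiply-add operations form one serial chain.
        result = result * x + c
    return result
```

A polynomial of degree n takes n dependent multiply-add steps this way, none of which can start before the previous one has finished.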

Description of the algorithm

Given an arbitrary polynomial P_n(x) = C₀ + C₁x + C₂x² + C₃x³ + ... + C_nxⁿ, one can isolate sub-expressions of the form (A + Bx) and powers of x of the form x^(2^k), that is, x², x⁴, x⁸, and so on.

Rewritten using Estrin's scheme, this becomes P_n(x) = (C₀ + C₁x) + (C₂ + C₃x)x² + ((C₄ + C₅x) + (C₆ + C₇x)x²)x⁴ + ...

Each power x², x⁴, x⁸, ... can be evaluated once, by squaring the previous one, and kept until it is no longer required. As the expression shows, many of the sub-expressions are independent of one another and may be evaluated in parallel.

The sub-expressions of the form (A + Bx) can be evaluated using a native multiply–accumulate instruction on some architectures, an advantage that is shared with the Horner scheme.
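The description above can be sketched in Python as follows. This is one possible formulation, not a reference implementation: the function name estrin, the lowest-degree-first coefficient order, and the choice to carry an unpaired term up unchanged are all assumptions made for illustration. Python itself runs the steps sequentially; the sketch only shows which operations are independent.

```python
def estrin(coeffs, x):
    """Evaluate C0 + C1*x + ... + Cn*x**n, with coeffs = [C0, C1, ..., Cn]."""
    terms = list(coeffs)
    if not terms:
        return 0
    power = x  # takes the values x, x**2, x**4, ... from one level to the next
    while len(terms) > 1:
        n = len(terms)
        # Combine neighbours (A, B) into A + B*power.
        # The pairs within one level do not depend on each other.
        paired = [terms[i] + terms[i + 1] * power for i in range(0, n - 1, 2)]
        if n % 2:
            # A leftover term has no partner and is carried up unchanged.
            paired.append(terms[-1])
        terms = paired
        power = power * power  # square once per level
    return terms[0]
```

Each pass roughly halves the number of terms, so the number of dependent levels grows with the logarithm of the number of coefficients rather than linearly as in the Horner scheme.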

Examples

Take P_n(x) to mean the nth-order polynomial of the form P_n(x) = C₀ + C₁x + C₂x² + C₃x³ + ... + C_nxⁿ.

Written with Estrin's scheme we have:

P₃(x) = (C₀ + C₁x) + (C₂ + C₃x)x²

P₄(x) = (C₀ + C₁x) + (C₂ + C₃x)x² + C₄x⁴

P₅(x) = (C₀ + C₁x) + (C₂ + C₃x)x² + (C₄ + C₅x)x⁴

P₆(x) = (C₀ + C₁x) + (C₂ + C₃x)x² + ((C₄ + C₅x) + C₆x²)x⁴

P₇(x) = (C₀ + C₁x) + (C₂ + C₃x)x² + ((C₄ + C₅x) + (C₆ + C₇x)x²)x⁴

P₈(x) = (C₀ + C₁x) + (C₂ + C₃x)x² + ((C₄ + C₅x) + (C₆ + C₇x)x²)x⁴ + C₈x⁸

P₉(x) = (C₀ + C₁x) + (C₂ + C₃x)x² + ((C₄ + C₅x) + (C₆ + C₇x)x²)x⁴ + (C₈ + C₉x)x⁸

...
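As a worked illustration, the grouping given for P₇ can be written out level by level in Python. The function name estrin_p7 and the intermediate variable names are hypothetical and chosen only for this sketch; the coefficients are passed as a list c = [C₀, ..., C₇].

```python
def estrin_p7(c, x):
    """Evaluate P7(x) for c = [C0, ..., C7], following the grouping shown for P7."""
    x2 = x * x    # x**2, computed once and reused
    x4 = x2 * x2  # x**4, computed once

    # Level 1: four (A + B*x) terms, all independent of one another.
    a = c[0] + c[1] * x
    b = c[2] + c[3] * x
    d = c[4] + c[5] * x
    e = c[6] + c[7] * x

    # Level 2: two independent combinations using x**2.
    low = a + b * x2
    high = d + e * x2

    # Level 3: the final combination using x**4.
    return low + high * x4
```

For c = [1, 2, 3, 4, 5, 6, 7, 8] and x = 2 this returns 1793, the same value as direct summation of the terms. The longest chain of dependent multiply-add steps here has length three, whereas the Horner scheme needs seven dependent steps for the same polynomial.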

References

Jean-Michel Muller, Elementary Functions: Algorithms And Implementation, 2nd edition, Springer Verlag, page 58.
G. Estrin, Organization of computer systems - The fixed plus variable structure computer, in Proc. Western Joint Comput. Conf., May 1960, pp. 33-40.