# Simpson's rule

Simpson's rule can be derived by approximating the integrand f (x) (in blue) by the quadratic interpolant P(x) (in red).
An animation showing how Simpson's rule approximates the function with a parabola and the reduction in error with decreased step size
An animation showing how Simpson's rule approximation improves with more strips.

In numerical integration, Simpson's rules are several approximations for definite integrals, named after Thomas Simpson (1710–1761).

The most basic of these rules, called Simpson's 1/3 rule, or just Simpson's rule, reads

${\displaystyle \int _{a}^{b}f(x)\,dx\approx {\tfrac {b-a}{6}}\left[f(a)+4f\left({\tfrac {a+b}{2}}\right)+f(b)\right].}$

In German and some other languages, it is named after Johannes Kepler who derived it in 1615 after seeing it used for wine barrels (barrel rule, Keplersche Fassregel). The approximate equality in the rule becomes exact if f is a polynomial up to quadratic degree.

If the 1/3 rule is applied to n equal subdivisions of the integration range [a,b], one obtains the composite Simpson's rule. Points inside the integration range are given alternating weights 4/3 and 2/3.

Simpson's 3/8 rule, also called Simpson's second rule requests one more function evaluation inside the integration range, and is exact if f is a polynomial up to cubic degree.

Simpson's 1/3 and 3/8 rules are two special cases of closed Newton–Cotes formulas.

In naval architecture and ship stability estimation, there also exists Simpon's third rule, which has no special importance in general numerical analysis, see Simpson's rules (ship stability).

## Simpson's 1/3 rule

### Derivations

One derivation replaces the integrand ${\displaystyle f(x)}$ by the quadratic polynomial (i.e. parabola) ${\displaystyle P(x)}$ which takes the same values as ${\displaystyle f(x)}$ at the end points ${\displaystyle a}$ and ${\displaystyle b}$ and the midpoint ${\displaystyle m=(a+b)/2}$. One can use Lagrange polynomial interpolation to find an expression for this polynomial,

${\displaystyle P(x)=f(a){\tfrac {(x-m)(x-b)}{(a-m)(a-b)}}+f(m){\tfrac {(x-a)(x-b)}{(m-a)(m-b)}}+f(b){\tfrac {(x-a)(x-m)}{(b-a)(b-m)}}.}$

Using integration by substitution one can show that[1]

${\displaystyle \int _{a}^{b}P(x)\,dx={\tfrac {b-a}{6}}\left[f(a)+4f\left({\tfrac {a+b}{2}}\right)+f(b)\right].}$

Introducing the step size ${\displaystyle h=(b-a)/2}$ this is also commonly written as

${\displaystyle \int _{a}^{b}P(x)\,dx={\tfrac {h}{3}}\left[f(a)+4f\left({\tfrac {a+b}{2}}\right)+f(b)\right].}$

Because of the ${\displaystyle 1/3}$ factor Simpson's rule is also referred to as Simpson's 1/3 rule (see below for generalization).

#### Averaging the midpoint and the trapezoidal rules

Another derivation constructs Simpson's rule from two simpler approximations: the midpoint rule

${\displaystyle M=(b-a)f\left({\tfrac {a+b}{2}}\right)}$

and the trapezoidal rule

${\displaystyle T={\tfrac {1}{2}}(b-a)(f(a)+f(b)).}$

The errors in these approximations are

${\displaystyle {\tfrac {1}{24}}(b-a)^{3}f''(a)+O((b-a)^{4})\quad {\text{and}}\quad -{\tfrac {1}{12}}(b-a)^{3}f''(a)+O((b-a)^{4}),}$

respectively, where ${\displaystyle O((b-a)^{4})}$ denotes a term asymptotically proportional to ${\displaystyle (b-a)^{4}}$. The two ${\displaystyle O((b-a)^{4})}$ terms are not equal; see Big O notation for more details. It follows from the above formulas for the errors of the midpoint and trapezoidal rule that the leading error term vanishes if we take the weighted average

${\displaystyle {\tfrac {2M+T}{3}}.}$

This weighted average is exactly Simpson's rule.

Using another approximation (for example, the trapezoidal rule with twice as many points), it is possible to take a suitable weighted average and eliminate another error term. This is Romberg's method.

#### Undetermined coefficients

The third derivation starts from the ansatz

${\displaystyle {\tfrac {1}{b-a}}\int _{a}^{b}f(x)\,dx\approx \alpha f(a)+\beta f\left({\tfrac {a+b}{2}}\right)+\gamma f(b).}$

The coefficients α, β and γ can be fixed by requiring that this approximation be exact for all quadratic polynomials. This yields Simpson's rule.

### Error

The error in approximating an integral by Simpson's rule for ${\displaystyle n=2}$ is

${\displaystyle -{\tfrac {1}{90}}\left({\tfrac {b-a}{2}}\right)^{5}f^{(4)}(\xi ),}$

where ${\displaystyle \xi }$ (the Greek letter xi) is some number between ${\displaystyle a}$ and ${\displaystyle b}$.[2]

The error is asymptotically proportional to ${\displaystyle (b-a)^{5}}$. However, the above derivations suggest an error proportional to ${\displaystyle (b-a)^{4}}$. Simpson's rule gains an extra order because the points at which the integrand is evaluated are distributed symmetrically in the interval ${\displaystyle [a,\ b]}$.

Since the error term is proportional to the fourth derivative of ${\displaystyle f}$ at ${\displaystyle \xi }$, this shows that Simpson's rule provides exact results for any polynomial ${\displaystyle f}$ of degree three or less, since the fourth derivative of such a polynomial is zero at all points.

If the second derivative ${\displaystyle f''}$ exists and is convex in the interval ${\displaystyle (a,\ b)}$:

${\displaystyle (b-a)f\left({\tfrac {a+b}{2}}\right)+{\tfrac {1}{3}}\left({\tfrac {b-a}{2}}\right)^{3}f''\left({\tfrac {a+b}{2}}\right)\leq \int _{a}^{b}f(x)\,dx\leq {\tfrac {b-a}{6}}\left[f(a)+4f\left({\tfrac {a+b}{2}}\right)+f(b)\right].}$

### Composite Simpson's rule

If the interval of integration ${\displaystyle [a,b]}$ is in some sense "small", then Simpson's rule with ${\displaystyle n=2}$ subintervals will provide an adequate approximation to the exact integral. By small, what we really mean is that the function being integrated is relatively smooth over the interval ${\displaystyle [a,b]}$. For such a function, a smooth quadratic interpolant like the one used in Simpson's rule will give good results.

However, it is often the case that the function we are trying to integrate is not smooth over the interval. Typically, this means that either the function is highly oscillatory, or it lacks derivatives at certain points. In these cases, Simpson's rule may give very poor results. One common way of handling this problem is by breaking up the interval ${\displaystyle [a,b]}$ into ${\displaystyle n>2}$ small subintervals. Simpson's rule is then applied to each subinterval, with the results being summed to produce an approximation for the integral over the entire interval. This sort of approach is termed the composite Simpson's rule.

Suppose that the interval ${\displaystyle [a,b]}$ is split up into ${\displaystyle n}$ sub-intervals, with ${\displaystyle n}$ an even number. Then, the composite Simpson's rule is given by

{\displaystyle {\begin{aligned}\int _{a}^{b}f(x)\,dx&\approx {\frac {h}{3}}\sum _{j=1}^{n/2}{\bigg [}f(x_{2j-2})+4f(x_{2j-1})+f(x_{2j}){\bigg ]}\\{}&={\frac {h}{3}}{\bigg [}f(x_{0})+2\sum _{j=1}^{n/2-1}f(x_{2j})+4\sum _{j=1}^{n/2}f(x_{2j-1})+f(x_{n}){\bigg ]},\end{aligned}}}

where ${\displaystyle x_{j}=a+jh}$ for ${\displaystyle j=0,1,...,n-1,n}$ with ${\displaystyle h=(b-a)/n}$; in particular, ${\displaystyle x_{0}=a}$ and ${\displaystyle x_{n}=b}$. This composite rule with ${\displaystyle n=2}$ corresponds with the regular Simpson's Rule of the preceding section.

The error committed by the composite Simpson's rule is

${\displaystyle -{\frac {h^{4}}{180}}(b-a)f^{(4)}(\xi ),}$

where ${\displaystyle \xi }$ is some number between ${\displaystyle a}$ and ${\displaystyle b}$ and ${\displaystyle h=(b-a)/n}$ is the "step length".[3] The error is bounded (in absolute value) by

${\displaystyle {\tfrac {h^{4}}{180}}(b-a)\max _{\xi \in [a,b]}|f^{(4)}(\xi )|.}$

This formulation splits the interval ${\displaystyle [a,b]}$ in subintervals of equal length. In practice, it is often advantageous to use subintervals of different lengths, and concentrate the efforts on the places where the integrand is less well-behaved. This leads to the adaptive Simpson's method.

## Simpson's 3/8 rule

Simpson's 3/8 rule, also called Simpson's second rule, is another method for numerical integration proposed by Thomas Simpson. It is based upon a cubic interpolation rather than a quadratic interpolation. Simpson's 3/8 rule is as follows:

${\displaystyle \int _{a}^{b}f(x)\,dx\approx {\tfrac {3h}{8}}\left[f(a)+3f\left({\tfrac {2a+b}{3}}\right)+3f\left({\tfrac {a+2b}{3}}\right)+f(b)\right]={\tfrac {(b-a)}{8}}\left[f(a)+3f\left({\tfrac {2a+b}{3}}\right)+3f\left({\tfrac {a+2b}{3}}\right)+f(b)\right],}$

where b − a = 3h. The error of this method is:

${\displaystyle -{\tfrac {(b-a)^{5}}{6480}}f^{(4)}(\xi ),}$

where ${\displaystyle \xi }$ is some number between ${\displaystyle a}$ and ${\displaystyle b}$. Thus, the 3/8 rule is about twice as accurate as the standard method, but it uses one more function value. A composite 3/8 rule also exists, similarly as above.[4]

A further generalization of this concept for interpolation with arbitrary-degree polynomials are the Newton–Cotes formulas.

### Composite Simpson's 3/8 rule

Dividing the interval ${\displaystyle [a,b]}$ into ${\displaystyle n}$ subintervals of length ${\displaystyle h=(b-a)/n}$ and introducing the nodes ${\displaystyle x_{i}=a+ih}$ we have

{\displaystyle {\begin{aligned}\int _{a}^{b}f(x)\,dx&\approx {\tfrac {3h}{8}}\left[f(x_{0})+3f(x_{1})+3f(x_{2})+2f(x_{3})+3f(x_{4})+3f(x_{5})+2f(x_{6})+\cdots +3f(x_{n-2})+3f(x_{n-1})+f(x_{n})\right].\\&={\frac {3h}{8}}\left[f(x_{0})+3\sum _{i\neq 3k}^{n-1}f(x_{i})+2\sum _{j=1}^{n/3-1}f(x_{3j})+f(x_{n})\right]\qquad {\text{For: }}k\in \mathbb {N} _{0}\end{aligned}}}

While the remainder for the rule is shown as:

${\displaystyle -{\frac {h^{4}}{80}}(b-a)f^{(4)}(\xi ),}$[4]

We can only use this if ${\displaystyle n}$ is a multiple of three.

## Alternative extended Simpson's rule

This is another formulation of a composite Simpson's rule: instead of applying Simpson's rule to disjoint segments of the integral to be approximated, Simpson's rule is applied to overlapping segments, yielding:[5]

${\displaystyle \int _{a}^{b}f(x)\,dx\approx {\tfrac {h}{48}}{\bigg [}17f(x_{0})+59f(x_{1})+43f(x_{2})+49f(x_{3})+48\sum _{i=4}^{n-4}f(x_{i})+49f(x_{n-3})+43f(x_{n-2})+59f(x_{n-1})+17f(x_{n}){\bigg ]}.}$

The formula above is obtained by combining the original composite Simpson's rule with the one consisting of using Simpson's 3/8 rule in the extreme subintervals and the standard 3-point rule in the remaining subintervals. The result is then obtained by taking the mean of the two formulas.

### Simpson's rules in the case of narrow peaks

In the task of estimation of full area of narrow peak-like functions, Simpson's rules are much less efficient than trapezoidal rule. Namely, composite Simpson's 1/3 rule requires 1.8 times more points to achieve the same accuracy[6] as trapezoidal rule. Composite Simpson's 3/8 rule is even less accurate. Integral by Simpson's 1/3 rule can be represented as a sum of 2/3 of integral by trapezoidal rule with step h and 1/3 of integral by rectangle rule with step 2h. Averaging of Simpson's 1/3 rule composite sums with properly shifted frames produces the following rules:

${\displaystyle \int _{a}^{b}f(x)\,dx\approx {\tfrac {h}{24}}{\bigg [}-f(x_{-1})+12f(x_{0})+25f(x_{1})+24\sum _{i=2}^{n-2}f(x_{i})+25f(x_{n-1})+12f(x_{n})-f(x_{n+1}){\bigg ]}}$

where two points outside of the integrated region are exploited and

${\displaystyle \int _{a}^{b}f(x)\,dx\approx {\tfrac {h}{24}}{\bigg [}9f(x_{0})+28f(x_{1})+23f(x_{2})+24\sum _{i=3}^{n-3}f(x_{i})+23f(x_{n-2})+28f(x_{n-1})+9f(x_{n}){\bigg ]}}$

These rules are very much similar to Press's alternative extended Simpson's rule. Coefficients within the major part of the region being integrated equal one, differences are only at the edges. These three rules can be associated with Euler-MacLaurin formula with the first derivative term and named Euler-MacLaurin integration rules.[6] They differ only in how the first derivative at the region end is calculated.

## Composite Simpson's rule for irregularly spaced data

For some applications, the integration interval ${\displaystyle I=[a,b]}$ needs to be divided into uneven intervals – perhaps due to uneven sampling of data, or missing or corrupted data points. Suppose we divide the interval ${\displaystyle I}$ into even number ${\displaystyle N}$ of subintervals of widths ${\displaystyle h_{k}}$. Then the composite Simpson's rule is given by[7][8]

${\displaystyle \int _{a}^{b}f(x)\,\mathrm {d} x=\sum _{i=0}^{N/2-1}\left(\alpha _{i}f_{2i+2}+\beta _{i}f_{2i+1}+\eta _{i}f_{2i}\right),}$

where ${\displaystyle f_{k}=f\left(a+\sum _{i=0}^{k-1}h_{i}\right)}$ are the function values at the ${\displaystyle k}$th sampling point on the interval ${\displaystyle I}$, and the coefficients ${\displaystyle \alpha _{i},\,\beta _{i},}$ and ${\displaystyle \eta _{i}}$ are given by

${\displaystyle \alpha _{i}={\frac {2h_{2i+1}^{3}-h_{2i}^{3}+3h_{2i}h_{2i+1}^{2}}{6h_{2i+1}(h_{2i+1}+h_{2i})}},}$
${\displaystyle \beta _{i}={\frac {h_{2i+1}^{3}+h_{2i}^{3}+3h_{2i+1}h_{2i}(h_{2i+1}+h_{2i})}{6h_{2i+1}h_{2i}}},{\text{and}}}$
${\displaystyle \eta _{i}={\frac {2h_{2i}^{3}-h_{2i+1}^{3}+3h_{2i+1}h_{2i}^{2}}{6h_{2i}(h_{2i+1}+h_{2i})}}.}$

In case of odd number ${\displaystyle N}$ of subintervals, the above formula are used up to the second to last interval, and the last interval is handled separately by adding the following to the result:

${\displaystyle \alpha f_{N}+\beta f_{N-1}-\eta f_{N-2},}$

where

${\displaystyle \alpha ={\frac {2h_{N-1}^{2}+3h_{N-1}h_{N-2}}{6(h_{N-2}+h_{N-1})}},}$
${\displaystyle \beta ={\frac {h_{N-1}^{2}+3h_{N-1}h_{N-2}}{6h_{N-2}}},\,{\text{and}}}$
${\displaystyle \eta ={\frac {h_{N-1}^{3}}{6h_{N-2}(h_{N-2}+h_{N-1})}}.}$