# Buffon's needle problem

In probability theory, Buffon's needle problem is a question first posed in the 18th century by Georges-Louis Leclerc, Comte de Buffon:

Suppose we have a floor made of parallel strips of wood, each the same width, and we drop a needle onto the floor. What is the probability that the needle will lie across a line between two strips?

Buffon's needle was the earliest problem in geometric probability to be solved; it can be solved using integral geometry. The solution for the sought probability p, in the case where the needle length l is not greater than the width t of the strips, is

$p={\frac {2}{\pi }}\cdot {\frac {l}{t}}.$ This can be used to design a Monte Carlo method for approximating the number π, although that was not the original motivation for de Buffon's question.

## Solution

The problem in more mathematical terms is: Given a needle of length l dropped on a plane ruled with parallel lines t units apart, what is the probability that the needle will lie across a line upon landing?

Let x be the distance from the center of the needle to the closest parallel line, and let θ be the acute angle between the needle and one of the parallel lines.

The uniform probability density function (PDF) of x between 0 and t/2 is

$f_{X}(x)={\begin{cases}{\dfrac {2}{t}}&:\ 0\leq x\leq {\dfrac {t}{2}}\\[4px]0&:{\text{elsewhere.}}\end{cases}}$ Here, x = 0 represents a needle that is centered directly on a line, and x = t/2 represents a needle that is perfectly centered between two lines. The uniform PDF assumes the needle is equally likely to fall anywhere in this range, but could not fall outside of it.

The uniform probability density function of θ between 0 and π/2 is

$f_{\Theta }(\theta )={\begin{cases}{\dfrac {2}{\pi }}&:\ 0\leq \theta \leq {\dfrac {\pi }{2}}\\[4px]0&:{\text{elsewhere.}}\end{cases}}$ Here, θ = 0 represents a needle that is parallel to the marked lines, and θ = π/2 radians represents a needle that is perpendicular to the marked lines. Any angle within this range is assumed an equally likely outcome.

The two random variables, x and θ, are independent, so the joint probability density function is the product

$f_{X,\Theta }(x,\theta )={\begin{cases}{\dfrac {4}{t\pi }}&:\ 0\leq x\leq {\dfrac {t}{2}},\ 0\leq \theta \leq {\dfrac {\pi }{2}}\\[4px]0&:{\text{elsewhere.}}\end{cases}}$ The needle crosses a line if

$x\leq {\frac {l}{2}}\sin \theta .$ Now there are two cases.

### Case 1: Short needle (l ≤ t)

Integrating the joint probability density function gives the probability that the needle will cross a line:

$P=\int _{\theta =0}^{\frac {\pi }{2}}\int _{x=0}^{{\frac {l}{2}}\sin \theta }{\frac {4}{t\pi }}\,dx\,d\theta ={\frac {2l}{t\pi }}.$ ### Case 2: Long needle (l > t)

Suppose l > t. In this case, integrating the joint probability density function, we obtain:

$\int _{\theta =0}^{\frac {\pi }{2}}\int _{x=0}^{m(\theta )}{\frac {4}{t\pi }}\,dx\,d\theta ,$ where m(θ) is the minimum between l/2 sin θ and t/2.

Thus, performing the above integration, we see that, when l > t, the probability that the needle will cross at least one line is

$P={\frac {2l}{t\pi }}-{\frac {2}{t\pi }}\left({\sqrt {l^{2}-t^{2}}}+t\arcsin {\frac {t}{l}}\right)+1$ or

$P={\frac {2}{\pi }}\arccos {\frac {t}{l}}+{\frac {2}{\pi }}\cdot {\frac {l}{t}}\left(1-{\sqrt {1-\left({\frac {t}{l}}\right)^{2}}}\right).$ In the second expression, the first term represents the probability of the angle of the needle being such that it will always cross at least one line. The right term represents the probability that the needle falls at an angle where its position matters, and it crosses the line.

Alternatively, notice that whenever θ has a value such that l sin θt, that is, in the range 0 ≤ θ ≤ arcsin t/l, the probability of crossing is the same as in the short needle case. However if l sin θ > t, that is, arcsin t/l < θπ/2 the probability is constant and is equal to 1.

{\begin{aligned}P&=\left(\int _{\theta =0}^{\arcsin {\frac {t}{l}}}\int _{x=0}^{{\frac {l}{2}}\sin \theta }{\frac {4}{t\pi }}\right)+\left(\int _{\arcsin {\frac {t}{l}}}^{\frac {\pi }{2}}{\frac {2}{\pi }}\right)\\[6px]&={\frac {2l}{t\pi }}-{\frac {2}{t\pi }}\left({\sqrt {l^{2}-t^{2}}}+t\arcsin {\frac {t}{l}}\right)+1\end{aligned}} ## Using elementary calculus

The following solution for the "short needle" case, while equivalent to the one above, has a more visual flavor, and avoids iterated integrals.

We can calculate the probability P as the product of two probabilities: P = P1 · P2, where P1 is the probability that the center of the needle falls close enough to a line for the needle to possibly cross it, and P2 is the probability that the needle actually crosses the line, given that the center is within reach.

Looking at the illustration in the above section, it is apparent that the needle can cross a line if the center of the needle is within l/2 units of either side of the strip. Adding l/2 + l/2 from both sides and dividing by the whole width t, we obtain P1 = l/t. The red and blue needles are both centered at x. The red one falls within the gray area, contained by an angle of 2θ on each side, so it crosses the vertical line; the blue one does not. The proportion of the circle that is gray is what we integrate as the center x goes from 0 to 1

Now, we assume that the center is within reach of the edge of the strip, and calculate P2. To simplify the calculation, we can assume that $l=2$ .

Let x and θ be as in the illustration in this section. Placing a needle's center at x, the needle will cross the vertical axis if it falls within a range of 2θ radians, out of π radians of possible orientations. This represents the gray area to the left of x in the figure. For a fixed x, we can express θ as a function of x: θ(x) = arccos(x). Now we can let x range from 0 to 1, and integrate:

{\begin{aligned}P_{2}&=\int _{0}^{1}{\frac {2\theta (x)}{\pi }}\,dx\\[6px]&={\frac {2}{\pi }}\int _{0}^{1}\cos ^{-1}(x)\,dx\\[6px]&={\frac {2}{\pi }}\cdot 1={\frac {2}{\pi }}.\end{aligned}} Multiplying both results, we obtain P = P1 · P2 = l/t · 2/π = 2l/ as above.

There is an even more elegant and simple method of calculating the "short needle case". The end of the needle farthest away from any one of the two lines bordering its region must be located within a horizontal (perpendicular to the bordering lines) distance of l cos θ (where θ is the angle between the needle and the horizontal) from this line in order for the needle to cross it. The farthest this end of the needle can move away from this line horizontally in its region is t. The probability that the farthest end of the needle is located no more than a distance l cos θ away from the line (and thus that the needle crosses the line) out of the total distance t it can move in its region for 0 ≤ θπ/2 is given by

{\begin{aligned}P&={\frac \int _{0}^{\frac {\pi }{2}}l\cos \theta \,d\theta }\int _{0}^{\frac {\pi }{2}}t\,d\theta }}\\[6px]&={\frac {l}{t}}\cdot {\frac \int _{0}^{\frac {\pi }{2}}\cos \theta \,d\theta }\int _{0}^{\frac {\pi }{2}}d\theta }}\\[6px]&={\frac {l}{t}}\cdot {\frac {1}{\,{\frac {\pi }{2}}\,}}\\[6px]&={\frac {2l}{t\pi }}.\end{aligned}} ## Without integrals

The short-needle problem can also be solved without any integration, in a way that explains the formula for p from the geometric fact that a circle of diameter t will cross the distance t strips always (i.e. with probability 1) in exactly two spots. This solution was given by Joseph-Émile Barbier in 1860 and is also referred to as "Buffon's noodle".

## Estimating π An experiment to find π. Matches with the length of 9 squares have been thrown 17 times between rows with the width of 9 squares. 11 of the matches have landed at random across the drawn lines marked by the green points.2l · n/th = 2 × 9 × 17/9 × 11 ≈ 3.1 ≈ π. A Python 3 based simulation using Matplotlib to sketch Buffon's needle experiment with the parameters t = 5.0, l = 2.6. Observe the calculated value of π (y-axis) approaching 3.14 as the number of tosses (x-axis) approaches infinity.

In the first, simpler case above, the formula obtained for the probability P can be rearranged to

$\pi ={\frac {2l}{tP}}.$ Thus, if we conduct an experiment to estimate P, we will also have an estimate for π.

Suppose we drop n needles and find that h of those needles are crossing lines, so P is approximated by the fraction h/n. This leads to the formula:

$\pi \approx {\frac {2l\cdot n}{th}}.$ In 1901, Italian mathematician Mario Lazzarini performed Buffon's needle experiment. Tossing a needle 3,408 times, he obtained the well-known approximation 355/113 for π, accurate to six decimal places. Lazzarini's "experiment" is an example of confirmation bias, as it was set up to replicate the already well-known approximation of 355/113 (in fact, there is no better rational approximation with fewer than five digits in the numerator and denominator, see also Milü), yielding a more accurate "prediction" of π than would be expected from the number of trials, as follows: 

Lazzarini chose needles whose length was 5/6 of the width of the strips of wood. In this case, the probability that the needles will cross the lines is 5/3π. Thus if one were to drop n needles and get x crossings, one would estimate π as

$\pi \approx {\frac {5}{3}}\cdot {\frac {n}{x}}$ So if Lazzarini was aiming for the result 355/113, he needed n and x such that

${\frac {355}{113}}={\frac {5}{3}}\cdot {\frac {n}{x}},$ or equivalently,

$x={\frac {113n}{213}}.$ To do this, one should pick n as a multiple of 213, because then 113n/213 is an integer; one then drops n needles, and hopes for exactly x = 113n/213 successes. If one drops 213 needles and happens to get 113 successes, then one can triumphantly report an estimate of π accurate to six decimal places. If not, one can just do 213 more trials and hope for a total of 226 successes; if not, just repeat as necessary. Lazzarini performed 3,408 = 213 × 16 trials, making it seem likely that this is the strategy he used to obtain his "estimate".

The above description of strategy might even be considered charitable to Lazzarini. A statistical analysis of intermediate results he reported for fewer tosses leads to a very low probability of achieving such close agreement to the expected value all through the experiment. This makes it very possible that the "experiment" itself was never physically performed, but based on numbers concocted from imagination to match statistical expectations, but too well, as it turns out.

Dutch science journalist Hans van Maanen argues, however, that Lazzarini's article was never meant to be taken too seriously as it would have been pretty obvious for the readers of the magazine (aimed at school teachers) that the apparatus that Lazzarini said to have built cannot possibly work as described.

## Laplace's extension (short needle case)

Now consider the case where the plane contains two sets of parallel lines orthogonal to one another, creating a standard perpendicular grid. We aim to find the probability that the needle intersects at least one line on the grid. Let a and b be the sides of the rectangle that contains the midpoint of the needle whose length is l. Since this is the short needle case, l < a, l < b. Let (x,y) mark the coordinates of the needle's midpoint and let φ mark the angle formed by the needle and the x-axis. Similar to the examples described above, we consider x, y, φ to be independent uniform random variables over the ranges 0 ≤ xa, 0 ≤ yb, π/2φπ/2.

To solve such a problem, we first compute the probability that the needle crosses no lines, and then we take its compliment. We compute this first probability by determining the volume of the domain where the needle crosses no lines and then divide that by the volume of all possibilities, V. We can easily see that V = πab.

Now let V* be the volume of possibilities where the needle does not intersect any line. Developed by J.V. Uspensky,

$V^{*}=\int _{-{\frac {\pi }{2}}}^{\frac {\pi }{2}}F(\varphi )\,d\varphi$ where F(φ) is the region where the needle does not intersect any line given an angle φ. To determine F(φ), let's first look at the case for the horizontal edges of the bounding rectangle. The total side length is a and the midpoint must not be within l/2 cos φ of either endpoint of the edge. Thus, the total allowable length for no intersection is a − 2(l/2 cos φ) or simply just al cos φ. Equivalently, for the vertical edges with length b, we have b ± l sin φ. The ± accounts for the cases where φ is positive or negative. Taking the positive case and then adding the absolute value signs in the final answer for generality, we get

$F(\varphi )=(a-l\cos \varphi )(b-l\sin \varphi )=ab-bl\cos \varphi -al|\sin \varphi |+{\tfrac {1}{2}}l^{2}|\sin 2\varphi |.$ Now we can compute the following integral:

$V^{*}=\int _{-{\frac {\pi }{2}}}^{\frac {\pi }{2}}F(\varphi )\,d\varphi =\pi ab-2bl-2al+l^{2}.$ Thus, the probability that the needle does not intersect any line is

${\frac {V^{*}}{V}}={\frac {\pi ab-2bl-2al+l^{2}}{\pi ab}}=1-{\frac {2l(a+b)-l^{2}}{\pi ab}}.$ And finally, if we want to calculate the probability, P, that the needle does intersect at least one line, we need to subtract the above result from 1 to compute its compliment, yielding

$P={\frac {2l(a+b)-l^{2}}{\pi ab}}$ .

## Comparing estimators of π

As mentioned above, Buffon's needle experiment can be used to estimate π. This fact holds for Laplace's extension too since π shows up in that answer as well. The following question then naturally arises and is discussed by E.F. Schuster in 1974. Is Buffon's experiment or Laplace's a better estimator of the value of π? Since in Laplace's extension there are two sets of parallel lines, we compare N drops when there is a grid (Laplace), and 2N drops in Buffon's original experiment.

Let A be the event that the needle intersects a horizontal line (parallel to the x-axis)

$x={\begin{cases}1&:{\text{intersection occurs}}\\0&:{\text{no intersection}}\end{cases}}$ and let B be the event that the needle intersects a vertical line (parallel to the y-axis)

$y={\begin{cases}1&:{\text{intersection occurs}}\\0&:{\text{no intersection}}\end{cases}}$ For simplicity in the algebraic formulation ahead, let a = b = t = 2l such that the original result in Buffon's problem is P(A) = P(B) = 1/π. Furthermore, let N = 100 drops.

Now let us examine P(AB) for Laplace's result, that is, the probability the needle intersects both a horizontal and a vertical line. We know that

$P(AB)=1-P(AB')-P(A'B)-P(A'B').$ From the above section, P(AB′), or the probability that the needle intersects no lines is

$P(A'B')=1-{\frac {2l(a+b)-l^{2}}{\pi ab}}=1-{\frac {2l(4l)-l^{2}}{4l^{2}\pi }}=1-{\frac {7}{4\pi }}.$ We can solve for P(AB) and P(AB′) using the following method:

{\begin{aligned}P(A)&={\frac {1}{\pi }}=P(AB)+P(AB')\\[4px]P(B)&={\frac {1}{\pi }}=P(AB)+P(A'B).\end{aligned}} Solving for P(AB) and P(AB′) and plugging that into the original definition for P(AB) a few lines above, we get

$P(AB)=1-2\left({\frac {1}{\pi }}-P(AB)\right)-\left(1-{\frac {7}{4\pi }}\right)={\frac {1}{4\pi }}$ Although not necessary to the problem, it is now possible to see that P(AB) = P(AB′) = 3/4π. With the values above, we are now able to determine which of these estimators is a better estimator for π. For the Laplace variant, let be the estimator for the probability that there is a line intersection such that

${\hat {p}}={\frac {1}{100}}\sum _{n=1}^{100}{\frac {x_{n}+y_{n}}{2}}$ .

We are interested in the variance of such an estimator to understand the usefulness or efficiency of it. To compute the variance of , we first compute Var(xn + yn) where

$\operatorname {Var} (x_{n}+y_{n})=\operatorname {Var} (x_{n})+\operatorname {Var} (y_{n})+2\operatorname {Cov} (x_{n},y_{n}).$ Solving for each part individually,

{\begin{aligned}\operatorname {Var} (x_{n})=\operatorname {Var} (y_{n})&=\sum _{i=1}^{2}p_{i}{\bigl (}x_{i}-\mathbb {E} (x_{i}){\bigr )}^{2}\\[6px]&=P(x_{i}=1)\left(1-{\frac {1}{\pi }}\right)^{2}+P(x_{i}=0)\left(0-{\frac {1}{\pi }}\right)^{2}\\[6px]&={\frac {1}{\pi }}\left(1-{\frac {1}{\pi }}\right)^{2}+\left(1-{\frac {1}{\pi }}\right)\left(-{\frac {1}{\pi }}\right)^{2}={\frac {1}{\pi }}\left(1-{\frac {1}{\pi }}\right).\\[12px]\operatorname {Cov} (x_{n},y_{n})&=\mathbb {E} (x_{n}y_{n})-\mathbb {E} (x_{n})\mathbb {E} (y_{n})\end{aligned}} We know from the previous section that

$\mathbb {E} (x_{n}y_{n})=P(AB)={\frac {1}{4\pi }}$ yielding

$\operatorname {Cov} (x_{n},y_{n})={\frac {1}{4\pi }}-{\frac {1}{\pi }}\cdot {\frac {1}{\pi }}={\frac {\pi -4}{4\pi ^{2}}}<0$ Thus,

$\operatorname {Var} (x_{n}+y_{n})={\frac {1}{\pi }}\left(1-{\frac {1}{\pi }}\right)+{\frac {1}{\pi }}\left(1-{\frac {1}{\pi }}\right)+2\left({\frac {\pi -4}{4\pi ^{2}}}\right)={\frac {5\pi -8}{2\pi ^{2}}}$ Returning to the original problem of this section, the variance of estimator is

$\operatorname {Var} ({\hat {p}})={\frac {1}{200^{2}}}(100)\left({\frac {5\pi -8}{2\pi ^{2}}}\right)\approx 0.000\,976.$ Now let us calculate the number of drops, M, needed to achieve the same variance as 100 drops over perpendicular lines. If M < 200 then we can conclude that the setup with only parallel lines is more efficient than the case with perpendicular lines. Conversely if M is equal to or more than 200, than Buffon's experiment is equally or less efficient, respectively. Let be the estimator for Buffon's original experiment. Then,

${\hat {q}}={\frac {1}{M}}\sum _{m=1}^{M}x_{m}$ and

$\operatorname {Var} ({\hat {q}})={\frac {1}{M^{2}}}(M)\operatorname {Var} (x_{m})={\frac {1}{M}}\cdot {\frac {1}{\pi }}\left(1-{\frac {1}{\pi }}\right)\approx {\frac {0.217}{M}}$ Solving for M,

${\frac {0.217}{M}}=0.000\,976\implies M\approx 222.$ Thus, it takes 222 drops with only parallel lines to have the same certainty as 100 drops in Laplace's case. This isn't actually surprising because of the observation that Cov(xn,yn) < 0. Because xn and yn are negatively correlated random variables, they act to reduce the total variance in the estimator that is an average of the two of them. This method of variance reduction is known as the antithetic variates method.