Sum of normally distributed random variables
| This article does not cite any references or sources. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed. (December 2009) |
In probability theory, calculation of the sum of normally distributed random variables is an instance of the arithmetic of random variables, which can be quite complex based on the probability distributions of the random variables involved and their relationships.
Contents |
[edit] Independent random variables
If X and Y are independent random variables that are normally distributed (not necessarily jointly so), then their sum is also normally distributed. i.e., if
and
and
are independent, then
This means that the sum of two independent normally distributed random variables is normal, with its mean being the sum of the two means, and its variance being the sum of the two variances (i.e., the square of the standard deviation is the sum of the squares of the standard deviations).
Note that the result that the sum is normally distributed requires the assumption of independence, not just uncorrelatedness; two separately (not jointly) normally distributed random variables can be uncorrelated without being independent, in which case their sum can be non-normally distributed (see Normally distributed and uncorrelated does not imply independent#A symmetric example). The result about the mean holds in all cases, while the result for the variance requires uncorrelatedness, but not independence.
[edit] Proofs
[edit] Proof using characteristic functions
of the sum of two independent random variables
and
is just the product of the two separate characteristic functions:
and
of
and
.
The characteristic function of the normal distribution with expected value
and variance
is
So
This is the characteristic function of the normal distribution with expected value
and variance 
Finally, recall that no two distinct distributions can both have the same characteristic function, so the distribution of
must be just this normal distribution.
[edit] Proof using convolutions
For random variables
and
, the distribution
of
equals the convolution of
and
:
Given that
and
are normal densities,
Substituting into the convolution:
The expression in the integral is a normal density distribution on
, and so the integral evaluates to 1. The desired result follows:
[edit] Geometric proof
First consider the normalized case when
and
,
so that their PDFs are
and
Let
. Then the CDF for
will be
This integral is over the half-plane which lies under the line
.
The key observation is that the function
is radially symmetric. So we rotate the coordinate plane about the origin, choosing new coordinates
such that the line
is described by the equation
where
is determined geometrically. Because of the radial symmetry, we have
, and the CDF for
is
This is easy to integrate; we find that the CDF for
is
To determine the value
, note that we rotated the plane so that the line
now runs vertically with
-intercept equal to
. So
is just the distance from the origin to the line
along the perpendicular bisector, which meets the line at its nearest point to the origin, in this case
. So the distance is
, and the CDF for
is
, i.e., 
Now, if
are any real constants (not both zero!) then the probability that
is found by the same integral as above, but with the bounding line
. The same rotation method works, and in this more general case we find that the closest point on the line to the origin is located a (signed) distance
away, so that
The same argument in higher dimensions shows that if
for 
then
Now we are essentially done, because
So in general, if
for 
then
[edit]
In the event that the variables X and Y are jointly normally distributed random variables, then X + Y is still normally distributed (see Multivariate normal distribution) and the mean is the sum of the means. However, the variances are not additive due to the correlation. Indeed,
where ρ is the correlation. In particular, whenever ρ < 0, then the standard deviation is less than the sum of the standard deviations of X and Y. This is perhaps the simplest demonstration of the principle of diversification.
Extensions of this result can be made for more than two random variables, using the covariance matrix.
[edit] Proof
In this case, one needs to consider
As above, one makes the substitution 
This integral is more complicated to simplify analytically, but can be done easily using a symbolic mathematics program. The probability distribution ƒZ(z) is given in this case by
where
If one considers instead Z = X − Y, then one obtains
which also can be rewritten with
The standard deviations of each distribution are obvious by comparison with the standard normal distribution.












![\begin{align}
f_Z(z) &= \int_{-\infty}^\infty \frac{1}{\sqrt{2\pi}\sigma_Y} e^{-{(z-x-\mu_Y)^2 \over 2\sigma_Y^2}} \frac{1}{\sqrt{2\pi}\sigma_X} e^{-{(x-\mu_X)^2 \over 2\sigma_X^2}} dx \\
&= \int_{-\infty}^\infty \frac{1}{\sqrt{2\pi}\sqrt{\sigma_X^2+\sigma_Y^2}} \exp \left[ - { (z-(\mu_X+\mu_Y))^2 \over 2(\sigma_X^2+\sigma_Y^2) } \right] \frac{1}{\sqrt{2\pi}\frac{\sigma_X\sigma_Y}{\sqrt{\sigma_X^2+\sigma_Y^2}}} \exp \left[ - \frac{\left(x-\frac{\sigma_X^2(z-\mu_Y)+\sigma_Y^2\mu_X}{\sigma_X^2+\sigma_Y^2}\right)^2}{2\left(\frac{\sigma_X\sigma_Y}{\sqrt{\sigma_X^2+\sigma_Y^2}}\right)^2} \right] dx \\
&= \frac{1}{\sqrt{2\pi(\sigma_X^2+\sigma_Y^2)}} \exp \left[ - { (z-(\mu_X+\mu_Y))^2 \over 2(\sigma_X^2+\sigma_Y^2) } \right] \int_{-\infty}^\infty \frac{1}{\sqrt{2\pi}\frac{\sigma_X\sigma_Y}{\sqrt{\sigma_X^2+\sigma_Y^2}}} \exp \left[ - \frac{\left(x-\frac{\sigma_X^2(z-\mu_Y)+\sigma_Y^2\mu_X}{\sigma_X^2+\sigma_Y^2}\right)^2}{2\left(\frac{\sigma_X\sigma_Y}{\sqrt{\sigma_X^2+\sigma_Y^2}}\right)^2} \right] dx \\
\end{align}](http://upload.wikimedia.org/wikipedia/en/math/4/2/c/42cf96163b172a8dc89ae347754e0e50.png)
![f_Z(z) = \frac{1}{\sqrt{2\pi(\sigma_X^2+\sigma_Y^2)}} \exp \left[ - { (z-(\mu_X+\mu_Y))^2 \over 2(\sigma_X^2+\sigma_Y^2) } \right]](http://upload.wikimedia.org/wikipedia/en/math/0/2/2/022851c15a3e9c9c734b16ffae3f4e94.png)

,







for 


for 

![\frac{1}{2 \pi \sigma_x \sigma_y \sqrt{1-\rho^2}} \iint_{x\,y} \exp \left[ -\frac{1}{2(1-\rho^2)} \left(\frac{x^2}{\sigma_x^2} + \frac{y^2}{\sigma_y^2} - \frac{2 \rho x y}{\sigma_x\sigma_y}\right)\right] \delta(z - (x+y))\, \operatorname{d}x\,\operatorname{d}y.](http://upload.wikimedia.org/wikipedia/en/math/c/e/2/ce22f00c0171e54b8ea5c9e749856bef.png)



