= Generalized chi-squared distribution =

Characteristic function: $\frac{\exp\left[it \left( m + \sum_j \frac{w_j \lambda_j}{1-2i w_j t} \right)-\frac{s^2 t^2}{2}\right]}{\prod_j \left(1-2i w_j t \right)^{k_j/2}}$
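As a minimal sketch, the characteristic function above can be evaluated numerically with Python's standard `cmath` module. The function name `gx2_char` and its argument order are illustrative assumptions, not part of any established library; `w`, `k` and `lam` hold the weights $w_j$, degrees of freedom $k_j$ and non-centralities $\lambda_j$.

```python
import cmath


def gx2_char(t, w, k, lam, s, m):
    """Evaluate the characteristic function at a real argument t.

    w, k, lam: equal-length sequences of weights, degrees of freedom and
    non-centralities. s, m: coefficient of the normal term and the offset.
    (Hypothetical helper written for illustration.)
    """
    exponent = 1j * t * m - (s * t) ** 2 / 2
    denom = 1.0 + 0j
    for wj, kj, lj in zip(w, k, lam):
        base = 1 - 2j * wj * t  # real part is 1, so the principal power is continuous in t
        exponent += 1j * t * wj * lj / base
        denom *= base ** (kj / 2)
    return cmath.exp(exponent) / denom
```

With a single unit weight, $k=2$ and all other parameters zero, the expression reduces to $1/(1-2it)$, the characteristic function of a chi-squared variable with 2 degrees of freedom, which makes a convenient sanity check.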

In probability theory and statistics, the generalized chi-squared distribution (or generalized chi-square distribution) is the distribution of a quadratic function of a multinormal variable (normal vector), or of a linear combination of different normal variables and squares of normal variables. Equivalently, it is the distribution of a linear combination of independent noncentral chi-square variables and a normal variable. The same term is sometimes used for several other generalizations; some of them, such as the gamma distribution, are special cases of the family discussed here.

==Definition==
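The generalized chi-squared variable may be described in multiple ways. One is to write it as a weighted sum of independent noncentral chi-square variables ${\chi'}^2$ and a standard normal variable $z$:

$\tilde{\chi} = \sum_j w_j {\chi'}^2(k_j, \lambda_j) + s z + m.$

[...] $\frac{\cdots}{\left(1-\frac{w_j}{w_*} \right)^{k_j/2}}.$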

Here, $w_*$ is the largest positive or negative weight if we are looking at the upper or lower tail respectively, and $k_*$ and $\lambda_*$ are its corresponding degrees of freedom and non-centrality. $f_{{\chi'}^2(k_*,\lambda_*)}$ and $F_{{\chi'}^2(k_*,\lambda_*)}$ are the PDF and CDF of the non-central chi-square distribution with parameters $k_*$ and $\lambda_*$. $Q$ is the Marcum Q-function.

In the far tails, these expressions simplify further, and the simplified form is the same for the PDF $f(x)$ and the tail CDF $p(x)$ (the CDF at a point in the lower tail, or the complementary CDF at a point in the upper tail):
$f(x) \approx p(x) \approx
\begin{cases}
\left(\tfrac{x}{w_*}\right)^{\tfrac{k_*-2}{2}} e^{-x/(2w_*)}, & \text{if } \lambda_*=0, \\[1ex]
\left(\tfrac{x}{w_*}\right)^{\tfrac{k_*-3}{4}} e^{-x/(2w_*) + \sqrt{\lambda_* x/w_*}}, & \text{if } \lambda_*>0.
\end{cases}$

Here again, $w_*$ is the largest positive or negative weight if we are looking at the upper or lower tail respectively.
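The far-tail expressions above can be transcribed into a short function. This is only a sketch of the formulas as printed; the name `far_tail` and its signature are assumptions made for illustration.

```python
import math


def far_tail(x, w_star, k_star, lam_star=0.0):
    """Far-tail expression for the generalized chi-squared PDF / tail CDF.

    x: evaluation point in the tail governed by the weight w_star, so that
    x / w_star is positive. k_star, lam_star: degrees of freedom and
    non-centrality of that term. (Hypothetical helper for illustration.)
    """
    u = x / w_star
    if u <= 0:
        raise ValueError("x must lie in the tail governed by w_star")
    if lam_star == 0:
        return u ** ((k_star - 2) / 2) * math.exp(-u / 2)
    return u ** ((k_star - 3) / 4) * math.exp(-u / 2 + math.sqrt(lam_star * u))
```

For the lower tail, both `x` and `w_star` are negative, so the ratio `x / w_star` is still positive and the same code path applies.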

==Applications==

===In model fitting and selection===
If a predictive model is fitted by least squares, but the residuals have either autocorrelation or heteroscedasticity, then alternative models can be compared (in model selection) by relating changes in the sum of squares to an asymptotically valid generalized chi-squared distribution.

===Classifying normal vectors using Gaussian discriminant analysis===
If $\boldsymbol{x}$ is a normal vector, its log likelihood is a quadratic form of $\boldsymbol{x}$, and is hence distributed as a generalized chi-squared. The log likelihood ratio that $\boldsymbol{x}$ arises from one normal distribution versus another is also a quadratic form, and is therefore also distributed as a generalized chi-squared.

In Gaussian discriminant analysis, samples from multinormal distributions are optimally separated by using a quadratic classifier, a boundary that is a quadratic function (e.g. the curve defined by setting the likelihood ratio between two Gaussians to 1). The classification error rates of different types (false positives and false negatives) are integrals of the normal distributions within the quadratic regions defined by this classifier. Since this is mathematically equivalent to integrating a quadratic form of a normal vector, the result is an integral of a generalized chi-squared variable.
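To illustrate why the log likelihood ratio is a quadratic form, the following sketch works out the one-dimensional case, where the ratio between $N(\mu_1, \sigma_1^2)$ and $N(\mu_2, \sigma_2^2)$ equals $a x^2 + b x + c$. The function names are hypothetical, and the restriction to one dimension is a simplifying assumption; the multivariate case replaces the scalars with matrices and vectors.

```python
import math


def log_normal_pdf(x, mu, sigma):
    """Log density of a univariate normal with mean mu and std. deviation sigma."""
    return -0.5 * math.log(2 * math.pi) - math.log(sigma) - (x - mu) ** 2 / (2 * sigma ** 2)


def llr_quadratic(mu1, sigma1, mu2, sigma2):
    """Coefficients (a, b, c) such that
    log N(x; mu1, sigma1^2) - log N(x; mu2, sigma2^2) = a*x**2 + b*x + c.
    """
    a = 1 / (2 * sigma2 ** 2) - 1 / (2 * sigma1 ** 2)
    b = mu1 / sigma1 ** 2 - mu2 / sigma2 ** 2
    c = (mu2 ** 2 / (2 * sigma2 ** 2) - mu1 ** 2 / (2 * sigma1 ** 2)
         + math.log(sigma2 / sigma1))
    return a, b, c
```

The classifier boundary is the set of points where this quadratic equals zero, and the error rates are the probabilities that the quadratic takes the wrong sign under each of the two distributions.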

===In signal processing===
The following application arises in the context of Fourier analysis in signal processing, renewal theory in probability theory, and multi-antenna systems in wireless communication. The common factor of these areas is that the sum of exponentially distributed variables is of importance (or identically, the sum of squared magnitudes of circularly-symmetric centered complex Gaussian variables).

If $Z_i$ are $k$ independent, circularly-symmetric complex Gaussian random variables with mean 0 and variance $\sigma_i^2$, then the random variable

$\tilde{Q} = \sum_{i=1}^k |Z_i|^2$

has a generalized chi-squared distribution of a particular form. The difference from the standard chi-squared distribution is that the $Z_i$ are complex and can have different variances, and the difference from the more general generalized chi-squared distribution is that the relevant scaling matrix $A$ is diagonal. If $\sigma_i^2=\mu$ for all $i$, then $\tilde{Q}$, scaled down by $\mu/2$ (i.e. multiplied by $2/\mu$), has a chi-squared distribution, $\chi^2(2k)$, also known as an Erlang distribution. If the $\sigma_i^2$ have distinct values for all $i$, then $\tilde{Q}$ has the pdf
$f(x; k,\sigma_1^2,\ldots,\sigma_k^2) = \sum_{i=1}^k \frac{e^{-\frac x {\sigma_i^2}}}{\sigma_i^2 \prod_{j=1, j\neq i}^k \left(1- \frac{\sigma_j^2}{\sigma_i^2}\right)} \quad\text{for } x\geq0.$
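A minimal sketch of the density above for pairwise distinct variances, using only the standard library. The function name `pdf_distinct` is an assumption for illustration; the code does not check that the variances are in fact distinct, and repeated values would cause a division by zero.

```python
import math


def pdf_distinct(x, variances):
    """Density of sum_i |Z_i|^2 when all variances sigma_i^2 are distinct.

    variances: sequence of the values sigma_i^2 (assumed pairwise distinct).
    (Hypothetical helper for illustration.)
    """
    if x < 0:
        return 0.0
    total = 0.0
    for i, vi in enumerate(variances):
        prod = 1.0
        for j, vj in enumerate(variances):
            if j != i:
                prod *= 1 - vj / vi
        total += math.exp(-x / vi) / (vi * prod)
    return total
```

For a single variable the sum has one term and the density reduces to that of an exponential distribution with mean $\sigma_1^2$.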

If there are sets of repeated variances among $\sigma_i^2$, assume that they are divided into $M$ sets, each representing a certain variance value. Denote $\mathbf{r}=(r_1, r_2, \dots, r_M)$ to be the number of repetitions in each group. That is, the $m$th set contains $r_m$ variables that have variance $\sigma^2_m.$ Then $\tilde{Q}$ is a linear combination of independent $\chi^2$-distributed random variables with different degrees of freedom:

$\tilde{Q} = \sum_{m=1}^M \frac{\sigma^2_m}{2} Q_m, \quad Q_m \sim \chi^2(2r_m) \, .$
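As a sketch, the representation above can be sampled with the standard library alone. It relies on the fact that $\frac{\sigma^2_m}{2}\chi^2(2r_m)$ is a gamma variable with shape $r_m$ and scale $\sigma^2_m$, which `random.gammavariate` draws directly. The function name `sample_q` is an assumption for illustration.

```python
import random


def sample_q(r, variances, rng):
    """Draw one sample of Q-tilde = sum_m (sigma_m^2 / 2) * Q_m.

    r: repetition counts r_m. variances: the values sigma_m^2.
    rng: a random.Random instance. Each term (sigma_m^2 / 2) * chi^2(2 r_m)
    is drawn as a gamma variable with shape r_m and scale sigma_m^2.
    (Hypothetical helper for illustration.)
    """
    return sum(rng.gammavariate(rm, vm) for rm, vm in zip(r, variances))


# Example: average many draws and compare with the mean sum_m r_m * sigma_m^2.
rng = random.Random(0)
draws = [sample_q((2, 1), (1.0, 3.0), rng) for _ in range(20000)]
sample_mean = sum(draws) / len(draws)
```

The sample mean should be close to $\sum_m r_m \sigma^2_m$, which is 5 for the parameters used in the example.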

The pdf of $\tilde{Q}$ is

$f(x; \mathbf{r}, \sigma^2_1, \dots \sigma^2_M) = \prod_{m=1}^M \frac{1}{\sigma^{2r_m}_m} \sum_{k=1}^M \sum_{l=1}^{r_k} \frac{\Psi_{k,l,\mathbf{r}}}{(r_k-l)!} (-x)^{r_k-l} e^{-\frac{x}{\sigma^2_k}}, \quad \text{ for }x\geq0 ,$

where

$\Psi_{k,l,\mathbf{r}} = (-1)^{r_k-1} \sum_{\mathbf{i} \in
\Omega_{k,l}} \prod_{j \neq k} \binom{i_j + r_j-1}{i_j}
\left(\frac 1 {\sigma^2_j}\!-\!\frac{1}{\sigma^2_k} \right)^{-(r_j + i_j)},$

with $\mathbf{i}=[i_1,\ldots,i_M]^T$ from the set $\Omega_{k,l}$ of
all partitions of $l-1$ (with $i_k=0$) defined as

$\Omega_{k,l} = \left\{ [i_1,\ldots,i_M]\in \mathbb{Z}^M; \sum_{j=1}^M i_j \!= l-1, i_k=0, i_j\geq 0 \text{ for all } j \right\}.$
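The density for repeated variances can be sketched directly from the definitions of $\Omega_{k,l}$ and $\Psi_{k,l,\mathbf{r}}$ above. The function names are assumptions for illustration, indices are 0-based in the code, and the brute-force enumeration of $\Omega_{k,l}$ is chosen for clarity rather than efficiency.

```python
from itertools import product
from math import comb, exp, factorial


def omega(k, l, M):
    """Yield every index vector in Omega_{k,l}: nonnegative integers
    i_1..i_M with i_k = 0 that sum to l - 1 (k is 0-based here)."""
    others = [j for j in range(M) if j != k]
    for combo in product(range(l), repeat=len(others)):
        if sum(combo) == l - 1:
            i = [0] * M
            for j, v in zip(others, combo):
                i[j] = v
            yield i


def psi(k, l, r, var):
    """Coefficient Psi_{k,l,r}; var holds the distinct values sigma_m^2."""
    total = 0.0
    for i in omega(k, l, len(r)):
        term = 1.0
        for j in range(len(r)):
            if j != k:
                term *= comb(i[j] + r[j] - 1, i[j]) * (1 / var[j] - 1 / var[k]) ** (-(r[j] + i[j]))
        total += term
    return (-1) ** (r[k] - 1) * total


def pdf_repeated(x, r, var):
    """Density of Q-tilde at x >= 0 for repetition counts r and variances var."""
    if x < 0:
        return 0.0
    scale = 1.0
    for rm, vm in zip(r, var):
        scale /= vm ** rm
    s = 0.0
    for k in range(len(r)):
        for l in range(1, r[k] + 1):
            s += psi(k, l, r, var) / factorial(r[k] - l) * (-x) ** (r[k] - l) * exp(-x / var[k])
    return scale * s
```

With a single group ($M=1$) the set $\Omega_{1,l}$ is empty unless $l=1$, and the density reduces to that of a gamma distribution with shape $r_1$ and scale $\sigma^2_1$.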

==See also==
- Noncentral chi-squared distribution
- Chi-squared distribution
