FastICA: Difference between revisions

Revision as of 23:21, 18 March 2013

FastICA is an efficient and popular algorithm for independent component analysis invented by Aapo Hyvärinen at Helsinki University of Technology. The algorithm is based on a fixed-point iteration scheme that maximizes non-Gaussianity as a measure of statistical independence. It can also be derived as an approximate Newton iteration.

Algorithm

Preprocess the data

Before the FastICA algorithm can be applied, the input vector data $\mathbf {x}$ should be centered and whitened.

Centering the data

The input data $\mathbf {x}$ is centered by computing the mean of each component of $\mathbf {x}$ and subtracting that mean. This has the effect of making each component have zero mean. Thus:

$$\mathbf {x} \leftarrow \mathbf {x} -E\left\{\mathbf {x} \right\}$$
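As an illustration only, centering can be sketched in Python with NumPy. The sketch assumes the data is stored as a matrix with one sample per column, and the function name `center` is hypothetical rather than part of any library.

```python
import numpy as np

def center(X):
    """Subtract the mean of each component (row) from X.

    X is assumed to have shape (N, M): N components, M samples (one per column).
    Returns the centered data and the mean that was removed.
    """
    mean = X.mean(axis=1, keepdims=True)  # sample estimate of E{x}, shape (N, 1)
    return X - mean, mean

# Small demonstration on synthetic data with a non-zero mean.
rng = np.random.default_rng(0)
X = rng.uniform(size=(3, 1000)) + 5.0
Xc, mean = center(X)
```

Keeping the removed mean is a design choice of this sketch: it allows the estimated components to be shifted back later if needed.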

Whitening the data

Whitening the data involves linearly transforming the data so that the new components are uncorrelated and have variance one. If ${\widetilde {\mathbf {x} }}$ is the whitened data, then the covariance matrix of the whitened data is the identity matrix:

$$E\left\{{\widetilde {\mathbf {x} }}{\widetilde {\mathbf {x} }}^{T}\right\}=\mathbf {I}$$

This can be done using eigenvalue decomposition of the covariance matrix of the data: $E\left\{\mathbf {x} \mathbf {x} ^{T}\right\}=\mathbf {E} \mathbf {D} \mathbf {E} ^{T}$, where $\mathbf {E}$ is the matrix of eigenvectors and $\mathbf {D}$ is the diagonal matrix of eigenvalues. Once the eigenvalue decomposition is done, the whitened data is:

$$\mathbf {x} \leftarrow \mathbf {D} ^{-1/2}\mathbf {E} ^{T}\mathbf {x}$$
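A minimal NumPy sketch of this whitening step follows. It assumes centered data with one sample per column and a full-rank covariance matrix (all eigenvalues strictly positive). The function name `whiten` and the synthetic mixing matrix `A` are illustrative assumptions.

```python
import numpy as np

def whiten(X):
    """Whiten centered data X of shape (N, M) via eigendecomposition.

    Returns the whitened data and the whitening matrix K = D^{-1/2} E^T.
    Assumes the sample covariance has strictly positive eigenvalues.
    """
    M = X.shape[1]
    cov = X @ X.T / M                    # sample estimate of E{x x^T}
    eigvals, E = np.linalg.eigh(cov)     # cov = E diag(eigvals) E^T
    K = (E / np.sqrt(eigvals)).T         # scale columns of E, then transpose
    return K @ X, K

# Demonstration: correlated data becomes uncorrelated with unit variance.
rng = np.random.default_rng(0)
A = rng.normal(size=(3, 3))                      # hypothetical mixing matrix
X = A @ rng.uniform(-1.0, 1.0, size=(3, 5000))
X = X - X.mean(axis=1, keepdims=True)            # center first
Xw, K = whiten(X)
cov_w = Xw @ Xw.T / Xw.shape[1]                  # should be close to the identity
```

`np.linalg.eigh` is used because the covariance matrix is symmetric, so its eigenvalues are real and its eigenvectors orthonormal.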

Single Independent Component Calculation

The iterative algorithm finds the direction for the weight vector $\mathbf {w}$ that maximizes the non-Gaussianity of the projection $\mathbf {w} ^{T}\mathbf {x}$ of the data $\mathbf {x}$. The function $g(\cdot )$ is the derivative of a nonquadratic nonlinearity $f(\cdot )$. Hyvärinen states that good choices for $f$ (shown with their derivatives $g$ and second derivatives ${g}'$) are:

$${\begin{aligned}f(u)&=\log \cosh(u);\quad g(u)=\tanh(u);\quad {g}'(u)=1/\cosh ^{2}(u)\\f(u)&=-e^{-u^{2}/2};\quad g(u)=ue^{-u^{2}/2};\quad {g}'(u)=(1-u^{2})e^{-u^{2}/2}\end{aligned}}$$

The first function is a good general-purpose choice, while the second is highly robust.
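The two nonlinearities and their derivatives can be written down directly in NumPy. The function names below are hypothetical. The identity $1/\cosh ^{2}(u)=1-\tanh ^{2}(u)$ is used to avoid overflow in `cosh` for large arguments.

```python
import numpy as np

# log-cosh family: f(u) = log cosh(u)
def f_logcosh(u):
    return np.log(np.cosh(u))

def g_logcosh(u):
    return np.tanh(u)

def dg_logcosh(u):
    return 1.0 - np.tanh(u) ** 2      # equals 1 / cosh(u)^2

# Gaussian family: f(u) = -exp(-u^2 / 2)
def f_gauss(u):
    return -np.exp(-u ** 2 / 2.0)

def g_gauss(u):
    return u * np.exp(-u ** 2 / 2.0)

def dg_gauss(u):
    return (1.0 - u ** 2) * np.exp(-u ** 2 / 2.0)

def central_diff(func, u, h=1e-5):
    """Numerical derivative, used only to sanity-check the formulas."""
    return (func(u + h) - func(u - h)) / (2.0 * h)

u = np.linspace(-3.0, 3.0, 61)
```

Comparing `central_diff(f_logcosh, u)` with `g_logcosh(u)`, and likewise for the other pairs, is a quick way to confirm that each $g$ is the derivative of its $f$.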

1. Randomize the initial weight vector $\mathbf {w}$
2. Let $\mathbf {w} ^{+}\leftarrow E\left\{\mathbf {x} g(\mathbf {w} ^{T}\mathbf {x} )\right\}-E\left\{g'(\mathbf {w} ^{T}\mathbf {x} )\right\}\mathbf {w}$
3. Let $\mathbf {w} \leftarrow \mathbf {w} ^{+}/\|\mathbf {w} ^{+}\|$
4. If not converged, go back to 2
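The iteration above can be sketched in NumPy as follows, with expectations replaced by sample averages and $g=\tanh$. The function name `fastica_one_unit`, the iteration cap, the tolerance, and the convergence test (old and new $\mathbf {w}$ pointing in the same direction up to sign) are assumptions of this sketch, since the text does not fix a convergence criterion. The demonstration uses two unit-variance Laplace sources mixed by a rotation, so the data is already approximately white.

```python
import numpy as np

def fastica_one_unit(X, rng, max_iter=200, tol=1e-8):
    """Estimate one unmixing direction from whitened data X of shape (N, M)."""
    N, M = X.shape
    w = rng.normal(size=N)                # step 1: random initial weight vector
    w /= np.linalg.norm(w)
    for _ in range(max_iter):
        wx = w @ X                        # projections w^T x, shape (M,)
        g = np.tanh(wx)
        dg = 1.0 - g ** 2
        # step 2: w+ = E{x g(w^T x)} - E{g'(w^T x)} w, using sample means
        w_new = (X * g).mean(axis=1) - dg.mean() * w
        w_new /= np.linalg.norm(w_new)    # step 3: normalize
        # step 4 (assumed criterion): direction unchanged up to sign
        converged = abs(abs(w_new @ w) - 1.0) < tol
        w = w_new
        if converged:
            break
    return w

rng = np.random.default_rng(0)
S = rng.laplace(scale=1.0 / np.sqrt(2.0), size=(2, 20000))  # unit-variance sources
theta = 0.6
R = np.array([[np.cos(theta), -np.sin(theta)],
              [np.sin(theta),  np.cos(theta)]])             # orthogonal mixing
X = R @ S
w = fastica_one_unit(X, rng)
alignment = np.abs(w @ R)   # one entry should be near 1 if a source was found
```

The sign of the recovered direction is arbitrary, which is why the convergence test and the alignment check both use absolute values.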

Multiple Component Extraction

The single-unit iterative algorithm estimates only one of the independent components. To estimate more, the algorithm must be repeated and the projection vectors decorrelated. Although Hyvärinen provides several ways of decorrelating the results, the simplest multiple-unit algorithm follows. Note that $\mathbf {1}$ denotes a column vector of 1's with dimension M. Writing the sample averages as matrix products, rather than as expectations, makes explicit which dimension is averaged out.

Algorithm FastICA

Input: $C$, the number of desired components.

Input: $\mathbf {X} \in \mathbb {R} ^{N\times M}$, a matrix in which each column represents an N-dimensional sample, where $C<N$.

Output: $\mathbf {W} \in \mathbb {R} ^{C\times N}$, the un-mixing matrix, in which each row projects $\mathbf {X}$ onto an independent component.

Output: $\mathbf {S} \in \mathbb {R} ^{C\times M}$, the independent components matrix, with M columns, each representing a sample with C dimensions.

for p in 1 to C:
    $\mathbf {w_{p}} \leftarrow$ Random vector of length N
    while $\mathbf {w_{p}}$ changes
        $\mathbf {w_{p}} \leftarrow {\frac {1}{M}}\mathbf {X} g(\mathbf {w_{p}} ^{T}\mathbf {X} )^{T}-{\frac {1}{M}}g'(\mathbf {w_{p}} ^{T}\mathbf {X} )\mathbf {1} \mathbf {w_{p}}$
        $\mathbf {w_{p}} \leftarrow \mathbf {w_{p}} -\sum _{j=1}^{p-1}(\mathbf {w_{p}} ^{T}\mathbf {w_{j}} )\mathbf {w_{j}}$
        $\mathbf {w_{p}} \leftarrow {\frac {\mathbf {w_{p}} }{\|\mathbf {w_{p}} \|}}$
Output: $\mathbf {W} ={\begin{bmatrix}\mathbf {w_{1}} ^{T}\\\vdots \\\mathbf {w_{C}} ^{T}\end{bmatrix}}$
Output: $\mathbf {S} =\mathbf {W} \mathbf {X}$
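The pseudocode above translates fairly directly into NumPy. The following is a sketch under stated assumptions, not a reference implementation: it takes $g=\tanh$, expects $\mathbf {X}$ to be already centered and whitened, and replaces "while $\mathbf {w_{p}}$ changes" with a hypothetical tolerance test and iteration cap. The name `fastica_deflation` and the synthetic demonstration data are illustrative.

```python
import numpy as np

def fastica_deflation(X, C, rng, max_iter=200, tol=1e-8):
    """Extract C components one at a time from whitened X of shape (N, M).

    Returns W (C, N), whose rows are the unmixing vectors, and S = W X (C, M).
    """
    N, M = X.shape
    W = np.zeros((C, N))
    for p in range(C):
        w = rng.normal(size=N)                     # random vector of length N
        w /= np.linalg.norm(w)
        for _ in range(max_iter):
            wx = w @ X                             # w_p^T X, shape (M,)
            g = np.tanh(wx)
            dg = 1.0 - g ** 2
            w_new = X @ g / M - dg.mean() * w      # fixed-point update
            w_new -= W[:p].T @ (W[:p] @ w_new)     # decorrelate from w_1..w_{p-1}
            w_new /= np.linalg.norm(w_new)         # normalize
            done = abs(abs(w_new @ w) - 1.0) < tol # assumed stopping rule
            w = w_new
            if done:
                break
        W[p] = w
    return W, W @ X

# Demonstration: mix three Laplace sources, whiten, and extract C = 2 components.
rng = np.random.default_rng(0)
S_true = rng.laplace(size=(3, 20000))
A = rng.normal(size=(3, 3))                        # hypothetical mixing matrix
X = A @ S_true
X -= X.mean(axis=1, keepdims=True)                 # centering
d, E = np.linalg.eigh(X @ X.T / X.shape[1])
Xw = (E / np.sqrt(d)).T @ X                        # whitening: D^{-1/2} E^T x
W, S = fastica_deflation(Xw, C=2, rng=rng)
corr = np.abs(np.corrcoef(S, S_true)[:2, 2:])      # |correlation| with true sources
```

Each row of `corr` should contain one entry close to 1, identifying which true source the corresponding estimated component matches. The order and sign of the recovered components are not determined by the algorithm.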

References

Hyvärinen, A.; Oja, E. (2000). Independent Component Analysis: Algorithms and Applications. Neural Networks, 13(4-5), 411-430.
Hyvärinen, A. (1999). Fast and Robust Fixed-Point Algorithms for Independent Component Analysis. IEEE Transactions on Neural Networks, 10(3), 626-634.

@@ Line 45: / Line 45: @@
 # If not converged, go back to 2
-=== FastICA for several units ===
+=== Multiple Component Extraction ===
 The single unit iterative algorithm only estimates one of the independent components, to estimate more the algorithm must repeated, and the projection vectors decorated. Although Hyvärinen provides several ways of decorating results the simplest multiple unit algorithm follows. Note that \mathbf{1} indicates a column vector of 1's with dimension M. This notation is simpler than the expectation since it indicates the dimension that has been removed.