Gabor transform

The Gabor transform, named after Dennis Gabor, is a special case of the short-time Fourier transform. It is used to determine the sinusoidal frequency and phase content of local sections of a signal as it changes over time. The function to be transformed is first multiplied by a Gaussian function, which can be regarded as a window function, and the resulting function is then transformed with a Fourier transform to derive the time-frequency analysis.[1] The window function means that the signal near the time being analyzed will have higher weight. The Gabor transform of a signal x(t) is defined by this formula:

${\displaystyle G_{x}(t,f)=\int _{-\infty }^{\infty }e^{-\pi (\tau -t)^{2}}e^{-j2\pi f\tau }x(\tau )\,d\tau }$
Magnitude of Gaussian function.

The Gaussian function has infinite range and it is impractical for implementation. However, a level of significance can be chosen (for instance 0.00001) for the distribution of the Gaussian function.

${\displaystyle {\begin{cases}e^{-{\pi }a^{2}}\geq 0.00001;&\left|a\right|\leq 1.9143\\e^{-{\pi }a^{2}}<0.00001;&\left|a\right|>1.9143\end{cases}}}$

Outside these limits of integration (${\displaystyle \left|a\right|>1.9143}$) the Gaussian function is small enough to be ignored. Thus the Gabor transform can be satisfactorily approximated as

${\displaystyle G_{x}(t,f)=\int _{-1.9143+t}^{1.9143+t}e^{-\pi (\tau -t)^{2}}e^{-j2\pi f\tau }x(\tau )\,d\tau }$

This simplification makes the Gabor transform practical and realizable.

The window function width can also be varied to optimize the time-frequency resolution tradeoff for a particular application by replacing the ${\displaystyle {-{\pi }(\tau -t)^{2}}}$ with ${\displaystyle {-{\pi }\alpha (\tau -t)^{2}}}$ for some chosen alpha.

Inverse Gabor transform

The Gabor transform is invertible. The original signal can be recovered by the following equation[dubious ]

${\displaystyle x(t)=\int _{-\infty }^{\infty }\int _{-\infty }^{\infty }G_{x}(\tau ,f)e^{j2\pi tf}\,df\,d\tau }$

Compare this inversion formula with property No. 5 below.

Properties of the Gabor transform

The Gabor transform has many properties like those of the Fourier transform. These properties are listed in the following tables.

Signal Gabor transform Remarks
${\displaystyle x(t)\,}$ ${\displaystyle G_{x}(t,f)=\int _{-\infty }^{\infty }e^{-\pi (\tau -t)^{2}}e^{-j2\pi f\tau }x(\tau )\,d\tau }$
1 ${\displaystyle a\cdot x(t)+b\cdot y(t)\,}$ ${\displaystyle a\cdot G_{x}(t,f)+b\cdot G_{y}(t,f)\,}$ Linearity property
2 ${\displaystyle x(t-t_{0})\,}$ ${\displaystyle G_{x}(t-t_{0},f)e^{-j2\pi ft_{0}}\,}$ Shifting property
3 ${\displaystyle x(t)e^{j2\pi f_{0}t}\,}$ ${\displaystyle G_{x}(t,f-f_{0})\,}$ Modulation property
Remarks
1 ${\displaystyle \int _{-\infty }^{\infty }\left|G_{x}(t,f)\right|^{2}\,df=\int _{-\infty }^{\infty }e^{-2\pi (\tau -t)^{2}}\left|x(\tau )\right|^{2}d\tau \approx \int _{u-1.9143}^{u+1.9143}e^{-2\pi (\tau -u)^{2}}\left|x(\tau )\right|^{2}d\tau }$ Power integration property
2 ${\displaystyle \int _{-\infty }^{\infty }\int _{-\infty }^{\infty }G_{x}(t,f)G_{y}^{*}(t,f)\,df\,dt=\int _{-\infty }^{\infty }x(\tau )y^{*}(\tau )\,d\tau }$ Energy sum property
3 ${\displaystyle {\begin{cases}\displaystyle \int _{-\infty }^{\infty }\left|G_{x}(t,f)\right|^{2}dft_{0}\\[12pt]\displaystyle \int _{-\infty }^{\infty }\left|G_{x}(t,f)\right|^{2}\,dtf_{0}\end{cases}}}$ Power decay property
4 ${\displaystyle \int _{-\infty }^{\infty }G_{x}(t,f)e^{j2\pi ktf}\,df=e^{-\pi (k-1)^{2}t^{2}}x(kt)}$ Integration property
5 ${\displaystyle \int _{-\infty }^{\infty }G_{x}(t,f)e^{j2\pi tf}\,df=x(t)}$ Recovery property

Application and example

Time/frequency distribution.

The main application of the Gabor transform is used in time–frequency analysis. Take the following equation as an example. The input signal has 1 Hz frequency component when t ≤ 0 and has 2 Hz frequency component when t > 0

${\displaystyle x(t)={\begin{cases}\cos(2\pi t)&{\text{for }}t\leq 0,\\\cos(4\pi t)&{\text{for }}t>0.\end{cases}}}$

But if the total bandwidth available is 5 Hz, other frequency bands except x(t) are wasted. Through time–frequency analysis by applying the Gabor transform, the available bandwidth can be known and those frequency bands can be used for other applications and bandwidth is saved. The right side picture shows the input signal x(t) and the output of the Gabor transform. As was our expectation, the frequency distribution can be separated into two parts. One is t ≤ 0 and the other is t > 0. The white part is the frequency band occupied by x(t) and the black part is not used. Note that for each point in time there is both a negative (upper white part) and a positive (lower white part) frequency component.

Discrete Gabor-transformation

A discrete version of Gabor representation

${\displaystyle y(t)=\sum _{m=-\infty }^{\infty }\sum _{n=-\infty }^{\infty }C_{nm}\cdot g_{nm}(t)}$

with ${\displaystyle g_{nm}(t)=s(t-m\tau _{0})\cdot e^{j\Omega nt}}$

can be derived easily by discretizing the Gabor-basis-function in these equations. Hereby the continuous parameter t is replaced by the discrete time k. Furthermore the now finite summation limit in Gabor representation has to be considered. In this way, the sampled signal y(k) is split into M time frames of length N. According to ${\displaystyle \Omega \leq {\tfrac {2\pi }{\tau _{0}}}}$, the factor Ω for critical sampling is ${\displaystyle \Omega ={\tfrac {2\pi }{N}}}$

Similar to the DFT (discrete Fourier transformation) a frequency domain split into N discrete partitions is obtained. An inverse transformation of these N spectral partitions then leads to N values y(k)for the time window, which consists of N sample values. For overall M time windows with N sample values, each signal y(k) contains K=N ${\displaystyle \cdot }$ M sample values: (the discrete Gabor representation)

${\displaystyle y(k)=\sum _{m=0}^{M-1}\sum _{n=0}^{N-1}C_{nm}\cdot g_{nm}(k)}$

with ${\displaystyle g_{nm}(k)=s(k-mN)\cdot e^{j\Omega nk}}$

According to the equation above, the N${\displaystyle \cdot }$M coefficients ${\displaystyle C_{nm}}$ correspond to the number of sample values K of the signal.

For over-sampling ${\displaystyle \Omega }$ is set to ${\displaystyle \Omega \leq {\tfrac {2\pi }{N}}={\tfrac {2\pi }{N^{\prime }}}}$ with N' > N, which results in N' > N summation coefficients in the second sum of the discrete Gabor representation. In this case, the number of obtained Gabor-coefficients would be M${\displaystyle \cdot }$N'>K. Hence, more coefficients than sample values are available and therefore a redundant representation would be achieved.