Multidimensional transform

From Wikipedia, the free encyclopedia
Jump to: navigation, search

In mathematical analysis and applications, multidimensional transforms are used to analyze the frequency content of signals in a domain of two or more dimensions.

Multidimensional Fourier transform[edit]

One of the more popular multidimensional transforms is the Fourier transform, which converts a signal from a time/space domain representation to a frequency domain representation.[1] The discrete-domain multidimensional Fourier transform (FT) can be computed as follows:

 F(w_1,w_2,\dots,w_m) = \sum_{n_1=-\infty}^\infty \sum_{n_2=-\infty}^\infty \cdots \sum_{n_m=-\infty}^\infty f(n_1,n_2,\dots,n_m) e^{-j w_1 n_1 -j w_2 n_2 \cdots -j w_m n_m}

where F stands for the multidimensional Fourier transform, m stands for multidimensional dimension. Define f as a multidimensional discrete-domain signal. The inverse multidimensional Fourier transform is given by

 f(n_1,n_2,\dots,n_m) = \left(\frac{1}{2 \pi}\right)^m \int_{- \pi}^{\pi} \cdots \int_{-\pi}^{\pi} F(w_1,w_2,\ldots,w_m) e^{j w_1 n_1 +j w_2 n_2 + \cdots+j w_m n_m} \, dw_1 \cdots \,dw_m

The multidimensional Fourier transform for continuous-domain signals is defined as follows:[1]

F(\Omega_1,\Omega_2,\ldots,\Omega_m) = \int_{-\infty}^{\infty} \cdots \int_{-\infty}^{\infty} 
f(t_1,t_2,\ldots,t_m) e^{-j \Omega_1 t_1-j \Omega_2 t_2 \cdots -j \Omega_m t_m} \, dt_1 \cdots \,dt_m

A fast Fourier transform (FFT) is an algorithm to compute the discrete Fourier transform (DFT) and its inverse. An FFT computes the DFT and produces exactly the same result as evaluating the DFT definition directly; the only difference is that an FFT is much faster. (In the presence of round-off error, many FFT algorithms are also much more accurate than evaluating the DFT definition directly).There are many different FFT algorithms involving a wide range of mathematics, from simple complex-number arithmetic to group theory and number theory. See more in FFT.

The multidimensional discrete Fourier transform (DFT) is a sampled version of the discrete-domain FT by evaluating it at sample frequencies that are uniformly spaced.[2] The N1 × N2 × ... NM DFT is given by:

  Fx(K_1,K_2,\ldots,K_n)= \sum_{n_1=0}^{N_1-1} \cdots \sum_{n_m}^{N_m-1} fx(n_1,n_2,\ldots,n_N) e^{-j \frac{2 \pi}{N_1} n_1 K_1  -j \frac{2 \pi}{N_2} n_2 K_2 \cdots -j \frac{2 \pi}{N_m} n_m K_m}

for 0 ≤ KiNi − 1, i = 1, 2, ..., m.

The inverse multidimensional DFT equation is

 fx(n_1,n_2,\ldots,n_m)= \frac{1}{N_1 \cdots N_m} \sum_{K_1=0}^{N_1-1} \cdots \sum_{K_m}^{N_m-1} Fx(K_1,K_2, \ldots ,K_m) e^{j \frac{2 \pi}{N_1} n_1 K_1 +j \frac{2 \pi}{N_2}  n_2 K_2\cdots+j \frac{2 \pi}{N_m} n_m K_m}

for 0 ≤ n1, n2, ... , nmN(1, 2, ... , m) – 1.

Multidimensional discrete cosine transform[edit]

The discrete cosine transform (DCT) is used in a wide range of applications such as data compression, feature extraction, Image reconstruction, multi-frame detection and so on. The multidimensional DCT is given by:

 Fx(K_1,K_2,\ldots,K_r ) = \sum_{n_1=0}^{N_1-1} \sum_{n_2=0}^{N_2-1} \cdots \sum_{n_r=0}^{N_r-1} fx(n_1,n_2,\ldots,n_r) \cos { \frac{ \pi (2n_1+1) K_1}{2N_1}} \cdots \cos { \frac{ \pi (2n_r+1) K_r}{2N_r}}

for ki = 0, 1, ..., Ni − 1, i = 1, 2, ..., r.


The DCT and DFT are often used in signal processing[3] and image processing, and they are also used to efficiently solve partial differential equations by spectral methods. The DFT can also be used to perform other operations such as convolutions or multiplying large integers. The DFT and DCT have seen wide usage across a large number of fields, we only sketch a few examples below.

Image processing[edit]

Two-dimensional DCT frequencies from the JPEG DCT

The DCT is used in JPEG image compression, MJPEG, MPEG, DV, Daala, and Theora video compression. There, the two-dimensional DCT-II of NxN blocks are computed and the results are quantized and entropy coded. In this case, N is typically 8 and the DCT-II formula is applied to each row and column of the block. The result is an 8x8 transform coefficient array in which the: (0,0) element (top-left) is the DC (zero-frequency) component and entries with increasing vertical and horizontal index values represent higher vertical and horizontal spatial frequencies, as shown in the picture on the right.

In image processing, one can also analyze and describe unconventional cryptographic methods based on 2D DCTs, for inserting non-visible binary watermarks into the 2D image plane,[4] and According to different orientations, the 2-D directional DCT-DWT hybrid transform can be applied in denoising ultrasound images.[5] 3-D DCT can also be used to transform video data or 3-D image data in watermark embedding schemes in transform domain.[6][7]

Spectral analysis[edit]

When the DFT is used for spectral analysis, the {xn} sequence usually represents a finite set of uniformly spaced time-samples of some signal x(t) where t represents time. The conversion from continuous time to samples (discrete-time) changes the underlying Fourier transform of x(t) into a discrete-time Fourier transform (DTFT), which generally entails a type of distortion called aliasing. Choice of an appropriate sample-rate (see Nyquist rate) is the key to minimizing that distortion. Similarly, the conversion from a very long (or infinite) sequence to a manageable size entails a type of distortion called leakage, which is manifested as a loss of detail (aka resolution) in the DTFT. Choice of an appropriate sub-sequence length is the primary key to minimizing that effect. When the available data (and time to process it) is more than the amount needed to attain the desired frequency resolution, a standard technique is to perform multiple DFTs, for example to create a spectrogram. If the desired result is a power spectrum and noise or randomness is present in the data, averaging the magnitude components of the multiple DFTs is a useful procedure to reduce the variance of the spectrum (also called a periodogram in this context); two examples of such techniques are the Welch method and the Bartlett method; the general subject of estimating the power spectrum of a noisy signal is called spectral estimation.

A final source of distortion (or perhaps illusion) is the DFT itself, because it is just a discrete sampling of the DTFT, which is a function of a continuous frequency domain. That can be mitigated by increasing the resolution of the DFT. That procedure is illustrated at Sampling the DTFT.

  • The procedure is sometimes referred to as zero-padding, which is a particular implementation used in conjunction with the fast Fourier transform (FFT) algorithm. The inefficiency of performing multiplications and additions with zero-valued "samples" is more than offset by the inherent efficiency of the FFT.
  • As already noted, leakage imposes a limit on the inherent resolution of the DTFT. So there is a practical limit to the benefit that can be obtained from a fine-grained DFT.

Partial differential equations[edit]

Discrete Fourier transforms are often used to solve partial differential equations, where again the DFT is used as an approximation for the Fourier series (which is recovered in the limit of infinite N). The advantage of this approach is that it expands the signal in complex exponentials einx, which are eigenfunctions of differentiation: d/dx einx = in einx. Thus, in the Fourier representation, differentiation is simple—we just multiply by i n. (Note, however, that the choice of n is not unique due to aliasing; for the method to be convergent, a choice similar to that in the trigonometric interpolation section above should be used.) A linear differential equation with constant coefficients is transformed into an easily solvable algebraic equation. One then uses the inverse DFT to transform the result back into the ordinary spatial representation. Such an approach is called a spectral method.

DCTs are also widely employed in solving partial differential equations by spectral methods, where the different variants of the DCT correspond to slightly different even/odd boundary conditions at the two ends of the array.

Image processing for arts surface analysis by FFT[edit]

One very important factor is that we must apply a non-destructive method to obtain those rare valuables information (from the HVS viewing point, is focused in whole colorimetric and spatial information) about works of art and zero-damage on them. We can understand the arts by looking at a color change or by measuring the surface uniformity change. Since the whole image will be very huge, so we use a double raised cosine window to truncate the image:[8]

 w(x,y)=\frac{1}{4} \left(1 + \cos {\frac{x \pi}{N}}\right)\left(1 + \cos {\frac{y \pi}{N}}\right)

where N is the image dimension and x, y are the coordinates from the center of image spans from 0 to N/2. The author wanted to compute an equal value for spatial frequency such as:[8]

 A_m {(f)}^2= \left[\sum_{i=-f}^f \operatorname{FFT}(-f,i)^2+ \sum_{i=-f}^f \operatorname{FFT}(f,i)^2+ \sum_{i=-f+1}^{f-1} \operatorname{FFT}(i,-f)^2+ \sum_{i=-f+1}^{f-1} \operatorname{FFT}(i,f)^2 \right]

where "FFT" denotes the fast Fourier transform, and f is the spatial frequency spans from 0 to N/2 – 1. The proposed FFT-based imaging approach is diagnostic technology to ensure a long life and stable to culture arts. This is a simple, cheap which can be used in museums without affecting their daily use. But this method doesn’t allow a quantitative measure of the corrosion rate.

See also[edit]


  1. ^ a b Smith,W. Handbook of Real-Time Fast Fourier Transforms:Algorithms to Product Testing, Wiley_IEEE Press, edition 1, pages 73–80, 1995
  2. ^ Dudgeon and Mersereau, Multidimensional Digital Signal Processing,2nd edition,1995
  3. ^ Tan Xiao, Shao-hai Hu, Yang Xiao. 2-D DFT-DWT Application to Multidimensional Signal Processing. ICSP2006 Proceedings, 2006 IEEE
  4. ^ Peter KULLAI, Pavol SABAKAI, JozefHUSKAI. Simple Possibilities of 2D DCT Application in Digital Monochrome Image Cryptography. Radioelektronika, 17th International Conference, IEEE, 2007, pp. 1–6
  5. ^ Xin-ling Wen, Yang Xiao. The 2-D Directional DCT-DWT Hybrid Transform and Its Application in Denoising Ultrasound Image. Signal Processing. ICSP 2008. 9th International Conference, Page(s): 946–949
  6. ^ Jinwei Wang, Shiguo Lian, Zhongxuan Liu, Zhen Ren, Yuewei Dai, Haila Wang. Image Watermarking Scheme Based on 3-D DCT.Industrial Electronics and Applications, 2006 1ST IEEE Conference, pp. 1–6
  7. ^ Jin Li, Moncef Gabbouj, Jarmo Takala, Hexin Chen. Direct 3-D DCT-to-DCT Resizing Algorithm for Video Coding. Image and Signal Processing and Analysis, 2009. ISPA 2009. Proceedings of 6th International Symposium pp. 105–110
  8. ^ a b Angelini, E., Grassin, S. ; Piantanida, M. ; Corbellini, S. ; Ferraris, F. ; Neri, A. ; Parvis, M. FFT-based imaging processing for cultural heritage monitoring Instrumentation and Measurement Technology Conference (I2MTC), 2010 IEEE