Sample rate conversion
|This article needs additional citations for verification. (April 2011)|
Sample-rate conversion is the process of changing the sampling rate of a discrete signal to obtain a new discrete representation of the underlying continuous signal. Application areas include image scaling, and audio/visual systems, where different sampling-rates may be used for engineering, economic, or historical reasons.
For example, Compact Disc Digital Audio and Digital Audio Tape systems use different sampling rates, and American television, European television, and movies all use different frame rates. Sample rate conversion prevents changes in speed and pitch that would otherwise occur when transferring recorded material between such systems.
Within specific domains or for specific conversions, the following alternative terms for sample-rate conversion are also used: sampling-frequency conversion, resampling, upsampling, downsampling, interpolation, decimation, upscaling, downscaling. The term multi-rate digital signal processing is sometimes used to refer to systems that incorporate sample-rate conversion.
Conceptual approaches to sample-rate conversion include: converting to an analog continuous signal, then re-sampling at the new rate, or calculating the values of the new samples directly from the old samples. The latter approach is more satisfactory since it introduces less noise and distortion. Two possible implementation methods are as follows:
- If the ratio of the two sample-rates is (or can be approximated by)[nb 1] a fixed, rational number L/M: generate an intermediate signal by inserting L–1 0s between each of the original samples. Low-pass filter this signal at half of the lower of the two rates. Select every Mth sample from the filtered output, to obtain the result.
- Treat the samples as geometric points and create any needed new points by interpolation. Choosing an interpolation method is a trade-off between implementation complexity and conversion quality (according to application requirements). Commonly used are: ZOH (for film/video frames), cubic (for image processing) and windowed sinc function (for audio).
The two methods are mathematically identical: picking an interpolation function in the second scheme is equivalent to picking the impulse response of the filter in the first scheme. Linear interpolation is equivalent to a triangular impulse response; windowed sinc approximates a brick-wall filter (it approaches the desirable "brick wall" filter as the number of points increase). The length of the impulse response of the filter in method 1 corresponds to the number of points used in interpolation in method 2.
In method 1, a slow pre-computation (such as the Remez algorithm) can be used to obtain an optimal (per application requirements) filter design. Method 2 will work in more general cases, e.g. where the ratio of sample-rates is not rational, or two real-time streams must be accommodated, or the sample-rates are time-varying.
Film and television
Movies (shot at 24 frames per second) are converted to television (roughly 50 or 60 fields[nb 2] per second). To convert a 24 frame/sec movie to 60 field/sec television, for example, alternate movie frames are shown 2 and 3 times, respectively. For 50 Hz systems such as PAL each frame is shown twice. Since 50 is not exactly 2×24, the movie will run 50/48 = 4% faster, and the audio pitch will be 4% higher, an effect known as PAL speed-up. This is often accepted for simplicity, but more complex methods are possible that preserve the running time and pitch. Every twelfth frame can be repeated 3 times rather than twice, or digital interpolation (see above) can be used in a video scaler.
Audio on Compact Disc has a sampling rate of 44.1 kHz; to transfer it to Digital Audio Tape, which uses 48 kHz, method 1 above can be used with L=160, M=147 (since 48000/44100 = 160/147). For the reverse conversion, the values of L and M are swapped. Per above, in both cases, the low-pass filter should be to 22.05 kHz.
Sample rate conversion in multiple dimensions:
Techniques and processing that may involve sample-rate conversion:
- For example, the irrational ratio 21/12, corresponding to one equal-temperament semitone, might be approximated by 196/185 (roughly 0.99994 semitones).
- A field is half of an interlaced frame – just the odd or even lines.
- Oppenheim, Alan V.; Schafer, Ronald W.; Buck., John R. (1989). Discrete-time signal processing, Volume 2. Englewood Cliffs: Prentice-hall.
- Lyons, Richard (2010). "10. Sample Rate Conversion". Understanding Digital Signal Processing. Prentice Hall. ISBN 978-0137027415.
In satellite and medical image processing, sample rate conversion is necessary for image enhancement, scale change, and image rotation
- Antoniou, Andreas (2006). Digital Signal Processing. McGraw-Hill. p. 830. ISBN 0-07-145424-1.
An alternative and more satisfactory approach...
- Partch, Harry (2009). Genesis of a Music (Second ed.). Da Capo Press. p. 101. ISBN 978-0306801068.
The ratios of the "equal semitone" are, with progressive accuracy: 18/17, 89/84, 196/185
- Rajamani, K.; Yhean-Sen Lai; Furrow, C. W. (2000). "An efficient algorithm for sample rate conversion from CD to DAT" (PDF). IEEE Signal Processing Letters 7 (10): 288. doi:10.1109/97.870683.