Comparison of analog and digital recording

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

Sound can be recorded and stored and played using either digital or analog techniques. Both techniques introduce errors and distortions in the sound, and these methods can be systematically compared. Musicians and listeners have argued over the superiority of digital versus analog sound recordings. Arguments for analog systems include the absence of fundamental error mechanisms which are present in digital audio systems, including aliasing and quantization noise.[1] Advocates of digital point to the high levels of performance possible with digital audio, including excellent linearity in the audible band and low levels of noise and distortion.[2]:7

Two prominent differences in performance between the two methods are the bandwidth and the signal-to-noise ratio (S/N). The bandwidth of the digital system is determined, according to the Nyquist frequency, by the sample rate used. The bandwidth of an analog system is dependent on the physical capabilities of the analog circuits. The S/N of a digital system may be limited by the bit depth of the digitization process, but the electronic implementation of conversion circuits introduces additional noise. In an analog system, other natural analog noise sources exist, such as flicker noise and imperfections in the recording medium. Other performance differences are specific to the systems under comparison, such as the ability for more transparent filtering algorithms in digital systems[3] and the harmonic saturation and speed variations of analog systems.

Dynamic range[edit]

The dynamic range of an audio system is a measure of the difference between the smallest and largest amplitude values that can be represented in a medium. Digital and analog differ in both the methods of transfer and storage, as well as the behavior exhibited by the systems due to these methods.

The dynamic range of digital audio systems can exceed that of analog audio systems. Consumer analog cassette tapes have a dynamic range of 60 to 70 dB. Analog FM broadcasts rarely have a dynamic range exceeding 50 dB.[4] The dynamic range of a direct-cut vinyl record may surpass 70 dB. Analog studio master tapes can have a dynamic range of up to 77 dB.[5] A theoretical LP made out of perfect diamond has an atomic feature size of about 0.5 nanometer, which, with a groove size of 8 micron, yields a dynamic range of 110 dB, while a theoretical vinyl LP is expected to have a dynamic range of 70 dB,[6] with measurements indicating performance in the 60 to 70 dB range.[7] Typically, a 16 bit analog-to-digital converter may have a dynamic range of between 90 and 95 dB,[8]:132 whereas the signal-to-noise ratio (roughly the equivalent of dynamic range, noting the absence of quantization noise but presence of tape hiss) of a professional reel-to-reel 1/4 inch tape recorder would be between 60 and 70 dB at the recorder's rated output.[8]:111

The benefits of using digital recorders with greater than 16 bit accuracy can be applied to the 16 bits of audio CD. Stuart stresses that with the correct dither, the resolution of a digital system is theoretically infinite, and that it is possible, for example, to resolve sounds at -110 dB (below digital full-scale) in a well-designed 16 bit channel.[9]:3

Compression[edit]

Despite the lower dynamic range and signal-to-noise ratios a vinyl or tape record can achieve in theory (60-80 dB versus 90-96 dB for CD recordings), vinyl records may still be preferred for their greater dynamic range in practice because of aggressive dynamic range compression used for CD audio material (see Loudness war), however unless the vinyl release specifically notes a vinyl mastering credit it is safe to assume it uses the same dynamically-challenged master as the digital versions.[citation needed]

Overload conditions[edit]

There are some differences in the behaviour of analog and digital systems when high level signals are present, where there is the possibility that such signals could push the system into overload. With high level signals, analog magnetic tape approaches saturation, and high frequency response drops in proportion to low frequency response. While undesirable, the audible effect of this can be reasonably unobjectionable.[10] In contrast, digital PCM recorders show non-benign behaviour in overload;[11]:65 samples that exceed the peak quantization level are simply truncated, clipping the waveform squarely, which introduces distortion in the form of large quantities of higher-frequency harmonics. In principle, PCM digital systems have the lowest level of nonlinear distortion at full signal amplitude. The opposite is usually true of analog systems, where distortion tends to increase at high signal levels. A study by Manson (1980) considered the requirements of a digital audio system for high quality broadcasting. It concluded that a 16 bit system would be sufficient, but noted the small reserve the system provided in ordinary operating conditions. For this reason, it was suggested that a fast-acting signal limiter or 'soft clipper' be used to prevent the system from becoming overloaded.[12]

With many recordings, high level distortions at signal peaks may be audibly masked by the original signal, thus large amounts of distortion may be acceptable at peak signal levels. The difference between analog and digital systems is the form of high-level signal error. Some early analog-to-digital converters displayed non-benign behaviour when in overload, where the overloading signals were 'wrapped' from positive to negative full-scale. Modern converter designs based on sigma-delta modulation may become unstable in overload conditions. It is usually a design goal of digital systems to limit high-level signals to prevent overload.[11]:65 To prevent overload, a modern digital system may compress input signals so that digital full-scale cannot be reached[13]:4

Physical degradation[edit]

Error correction allows digital formats to tolerate significant media deterioration though digital media is not immune to data loss. Consumer CD-R compact discs have a limited and variable lifespan dues to inherent and manufacturing quality issues.[14]

Unlike analog duplication, digital copies are exact replicas that can be duplicated indefinitely and without generation loss.[citation needed]

Noise[edit]

For electronic audio signals, sources of noise include mechanical, electrical and thermal noise in the recording and playback cycle. The amount of noise that a piece of audio equipment adds to the original signal can be quantified. Mathematically, this can be expressed by means of the signal to noise ratio (SNR or S/N). Sometimes the maximum possible dynamic range of the system is quoted instead.[citation needed]

With digital systems, the quality of reproduction depends on the analog-to-digital and digital-to-analog conversion steps, and does not depend on the quality of the recording medium, provided it is adequate to retain the digital values without error. Digital mediums capable of bit-perfect storage and retrieval have been commonplace for some time, since they were generally developed for software storage which has no tolerance for error. In the case of the Compact Disc, however, the error recovery system used traded some reliability for more space on the disk to allow more recording time. Still, it is extremely common for them to read with no errors unless they have been mishandled or have degraded due to poor manufacturing methods.[citation needed]

The process of analog-to-digital conversion will, according to theory, always introduce quantization distortion. This distortion can be rendered as uncorrelated quantization noise through the use of dither. The magnitude of this noise or distortion is determined by the number of quantization levels. In binary systems this is determined by and typically stated in terms of the number of bits. Each additional bit adds approximately 6 dB in possible SNR, e.g. 24 x 6 = 144 dB for 24 bit quantization, 126 dB for 21-bit, and 120 dB for 20-bit. The 16-bit digital system of Red Book audio CD has 216= 65,536 possible signal amplitudes, theoretically allowing for an SNR of 98 dB.[2]:49

Rumble[edit]

Rumble is a form of noise characteristic caused by imperfections in the bearings of turntables, the platter tends to have a slight amount of motion besides the desired rotation—the turntable surface also moves up-and-down and side-to-side slightly. This additional motion is added to the desired signal as noise, usually of very low frequencies, creating a rumbling sound during quiet passages. Very inexpensive turntables sometimes used ball bearings which are very likely to generate audible amounts of rumble. More expensive turntables tend to use massive sleeve bearings which are much less likely to generate offensive amounts of rumble. Increased turntable mass also tends to lead to reduced rumble. A good turntable should have rumble at least 60 dB below the specified output level from the pick-up.[15]:79–82 Because they have no moving parts in the signal path, digital systems are not subject to rumble.

Wow and flutter[edit]

Wow and flutter are a change in frequency of an analog device and are the result of mechanical imperfections, with wow being a slower rate form of flutter. Wow and flutter are most noticeable on signals which contain pure tones. For LP records, the quality of the turntable will have a large effect on the level of wow and flutter. A good turntable will have wow and flutter values of less than 0.05%, which is the speed variation from the mean value.[15] Wow and flutter can also be present in the recording, as a result of the imperfect operation of the recorder. Owing to their use of precision crystal oscillators for their timebase, digital systems are not subject to wow and flutter.

Frequency response[edit]

For digital systems, the upper limit of the frequency response is determined by the sampling frequency. The choice of sample rate used in a digital system is based on the Nyquist-Shannon sampling theorem. This states that a sampled signal can be reproduced exactly as long as it is sampled at a frequency greater than twice the bandwidth of the signal. Therefore, a sampling rate of 40 kHz would be theoretically enough to capture all the information contained in a signal having frequency bandwidth up to 20 kHz. The sampling theorem assumes ideal filters, however, which cannot exist in reality, so practical sampling uses "guard bands" (higher than necessary sample rates) to reduce aliasing.[citation needed]

The bandwidth of the standard for audio CDs is sufficiently wide to cover the entire human hearing range, which roughly extends from 20 Hz to 20 kHz. Commercial and industrial digital recorders may record higher frequencies, while consumer and telecommunications systems inferior to the CD record a more restricted frequency range.[citation needed]

High quality open-reel machines can extend from 10 Hz to above 20 kHz. The linearity of the response may be indicated by providing information on the level of the response relative to a reference frequency. For example, a system component may have a response given as 20 Hz to 20 kHz +/- 3 dB relative to 1 kHz. Some analog tape manufacturers specify frequency responses up to 20 kHz, but these measurements may have been made at lower signal levels.[15] Compact cassettes may have a response extending up to 15 kHz at full (0 dB) recording level.[16] At lower levels usually -10 dB, cassettes typically rolls-off at around 20 kHz for most machines, due to the nature of the tape media caused by self-erasure (which worsens the linearity of the response).

The frequency response for a conventional LP player might be 20 Hz - 20 kHz +/- 3 dB. Unlike the audio CD, vinyl records and cassettes do not require a cut-off in response above 20 kHz. The low frequency response of vinyl records is restricted by rumble noise,(described above), as well as the physical and electrical characteristics of the entire pickup arm and transducer assembly. The high frequency response of vinyl depends on the cartridge. CD4 records contained frequencies up to 50 kHz, while some high-end turntable cartridges have frequency responses of 120 kHz while having flat frequency response over the audible band (e.g. 20 Hz to 15 kHz +/-0.3 dB).[17] In addition, frequencies of up to 122 kHz have been experimentally cut on LP records.[18]

In comparison, the CD system offers a frequency response of 0 Hz–20 kHz ±0.5 dB, with a superior dynamic range over the entire audible frequency spectrum.[2]:108

With vinyl records, there will be some loss in fidelity on each playing of the disc. This is due to the wear of the stylus in contact with the record surface. A good quality stylus, matched with a correctly set up pick-up arm, should cause minimal surface wear. Magnetic tapes, both analog and digital, wear from friction between the tape and the heads, guides, and other parts of the tape transport as the tape slides over them. The brown residue deposited on swabs during cleaning of a tape machine's tape path is actually particles of magnetic coating shed from tapes. Tapes can also suffer creasing, stretching, and frilling of the edges of the plastic tape base, particularly from low-quality or out-of-alignment tape decks. When a CD is played, there is no physical contact involved, and the data is read optically using a laser beam. Therefore, no such media deterioration takes place, and the CD will, with proper care, sound exactly the same every time it is played (discounting aging of the player and CD itself); however, this is a benefit of the optical system, not of digital recording, and the Laserdisc format enjoys the same non-contact benefit with analog optical signals. Recordable CDs slowly degrade with time, called disc rot, even if they are not played, and are stored properly.[19] A new compact disc was released called M-DISC which is said to last 1000 years. These discs are recordable and have a layer developed from stone.[vague] They come in CD, DVD and Blu-ray formats and various storage sizes. They can be used for music, movies, and data for computers, etc.

Aliasing[edit]

Technical difficulty arises with digital sampling in that all high frequency signal content above the Nyquist frequency must be removed prior to sampling, which, if not done, will result in these ultrasonic frequencies "folding over" into frequencies which are in the audible range, producing a kind of distortion called aliasing. The difficulty is that designing a brick-wall anti-aliasing filter, a filter which would precisely remove all frequency content exactly above or below a certain cutoff frequency, is impractical.[20] Instead, a sample rate is usually chosen which is above the theoretical requirement. This solution is called oversampling, and allows a less aggressive and lower-cost anti-aliasing filter to be used.

Unlike digital audio systems, analog systems do not require filters for bandlimiting. These filters act to prevent aliasing distortions in digital equipment. Early digital systems may have suffered from a number of signal degradations related to the use of analog anti-aliasing filters, e.g., time dispersion, nonlinear distortion, temperature dependence of filters etc.[21]:8 Even with sophisticated anti-aliasing filters used in the recorder, it is still demanding for the player not to introduce more distortion.

Hawksford highlighted the advantages of digital converters that oversample.[21]:18 Using an oversampling design and a modulation scheme called sigma-delta modulation (SDM), analog anti-aliasing filters can effectively be replaced by a digital filter.[20] This approach has several advantages. The digital filter can be made to have a near-ideal transfer function, with low in-band ripple, and no aging or thermal drift.

Sampling rates[edit]

CD quality audio is sampled at 44.1 kHz (Nyquist frequency = 22.05 kHz) and at 16 bits. Sampling the waveform at higher frequencies and allowing for a greater number of bits per sample allows noise and distortion to be reduced further. DAT can sample audio at up to 48 kHz, while DVD-Audio can be 96 or 192 kHz and up to 24 bits resolution. With any of these sampling rates, signal information is captured above what is generally considered to be the human hearing range.

Work done in 1981 by Muraoka et al.[22] showed that music signals with frequency components above 20 kHz were only distinguished from those without by a few of the 176 test subjects.[23] Later papers, however, by a number of different authors, have led to a greater discussion of the value of recording frequencies above 20 kHz. Such research led some to the belief that capturing these ultrasonic sounds could have some audible benefit. Audible differences were reported between recordings with and without ultrasonic responses. Dunn (1998) examined the performance of digital converters to see if these differences in performance could be explained.[24] He did this by examining the band-limiting filters used in converters and looking for the artifacts they introduce.

A perceptual study by Nishiguchi et al. (2004) concluded that "no significant difference was found between sounds with and without very high frequency components among the sound stimuli and the subjects... however, [Nishiguchi et al] can still neither confirm nor deny the possibility that some subjects could discriminate between musical sounds with and without very high frequency components."[25]

Additionally, in blind tests conducted by Bob Katz, recounted in his book Mastering Audio: The Art and the Science, he found that listening subjects could not discern any audible difference between sample rates with optimum A/D conversion and filter performance. He posits that the primary reason for any aural variation between sample rates is due largely to poor performance of low-pass filtering prior to conversion, and not variance in ultrasonic bandwidth. These results suggest that the main benefit to using higher sample rates is that it pushes consequential phase distortion out of the audible range and that, under ideal conditions, higher sample rates may not be necessary.[26]

Digital errors[edit]

Quantization[edit]

An illustration of quantization of a sampled audio waveform using 4 bits.
An illustration of quantization of a sampled audio waveform using 4 bits.

A signal is recorded digitally by an analog-to-digital converter, which measures the amplitude of an analog signal at regular intervals, which are specified by the sample rate, and then stores these sampled numbers in computer hardware. The fundamental problem with numbers on computers is that the range of values that can be represented is finite, which means that during sampling, the amplitude of the audio signal must be rounded. This process is called quantization, and these small errors in the measurements are manifested aurally as a form of low level distortion.

Analog systems do not have discrete digital levels in which the signal is encoded. Consequently, the original signal can be preserved to an accuracy limited only by the intrinsic noise-floor and maximum signal level of the media and the playback equipment, i.e., the dynamic range of the system. This form of distortion, sometimes called granular or quantization distortion, has been pointed to as a fault of some digital systems and recordings.[27][9]:6 Knee & Hawksford drew attention to the deficiencies in some early digital recordings, where the digital release was said to be inferior to the analog version.

The range of possible values that can be represented numerically by a sample is defined by the number of binary digits used. This is called the resolution, and is usually referred to as the bit depth in the context of PCM audio. The quantization noise level is directly determined by this number, decreasing exponentially as the resolution increases (or linearly in dB units), and with an adequate number of true bits of quantization, random noise from other sources will dominate and completely mask the quantization noise. The Redbook CD standard uses 16 bits, which keep the quantization noise 96 dB below maximum amplitude, far below a discernible level with almost any source material.[28]

Quantization in analog media[edit]

Since analog media is composed of molecules, the smallest microscopic structure represents the smallest quantization unit of the recorded signal. Natural dithering processes, like random thermal movements of molecules, the nonzero size of the reading instrument, and other averaging effects, make the practical limit larger than that of the smallest molecular structural feature. A theoretical LP composed of perfect diamond, with a groove size of 8 micron and feature size of 0.5 nanometer, has a quantization that is similar to a 16-bit digital sample.[6]

Dither as a solution[edit]

An illustration of dither used in image processing.
An illustration of dither used in image processing. A random deviation has been inserted before reducing the palette to only 16 colors, which is analogous to the effect of dither on an audio signal.

It is possible to make quantization noise more audibly benign by applying dither. To do this, a noise-like signal is added to the original signal before quantization. Dither makes the digital system behave as if it has an analog noise-floor. Optimal use of dither (triangular probability density function dither in PCM systems) has the effect of making the rms quantization error independent of signal level,[11]:143 and allows signal information to be retained below the least significant bit of the digital system.[9]:3

Dither algorithms also commonly have an option to employ some kind of noise shaping, which pushes the frequency of much of the dither noise to areas that are less audible to human ears. This has no statistical benefit, but rather it raises the S/N of the audio that is apparent to the listener.

Proper application of dither combats quantization noise effectively, and is commonly applied during mastering before final bit depth reduction,[26] and also at various stages of DSP.

Jitter[edit]

One aspect that may degrade the performance of a digital system is jitter. This is the phenomenon of variations in time from what should be the correct spacing of discrete samples according to the sample rate. This can be due to timing inaccuracies of the digital clock. Ideally a digital clock should produce a timing pulse at exactly regular intervals. Other sources of jitter within digital electronic circuits are data-induced jitter, where one part of the digital stream affects a subsequent part as it flows through the system, and power supply induced jitter, where DC ripple on the power supply output rails causes irregularities in the timing of signals in circuits powered from those rails.

The accuracy of a digital system is dependent on the sampled amplitude values, but it is also dependent on the temporal regularity of these values. This temporal dependency is inherent to digital recording and playback and has no analog equivalent, though analog systems have their own temporal distortion effects (pitch error and wow-and-flutter).

Periodic jitter produces modulation noise and can be thought of as being the equivalent of analog flutter.[29] Random jitter alters the noise floor of the digital system. The sensitivity of the converter to jitter depends on the design of the converter. It has been shown that a random jitter of 5 ns (nanoseconds) may be significant for 16 bit digital systems.[29] For a more detailed description of jitter theory, refer to Dunn (2003).[11]

Jitter can degrade sound quality in digital audio systems. In 1998, Benjamin and Gannon researched the audibility of jitter using listening tests.[11]:34 They found that the lowest level of jitter to be audible was around 10 ns (rms). This was on a 17 kHz sine wave test signal. With music, no listeners found jitter audible at levels lower than 20 ns. A paper by Ashihara et al. (2005) attempted to determine the detection thresholds for random jitter in music signals. Their method involved ABX listening tests. When discussing their results, the authors of the paper commented that:

'So far, actual jitter in consumer products seems to be too small to be detected at least for reproduction of music signals. It is not clear, however, if detection thresholds obtained in the present study would really represent the limit of auditory resolution or it would be limited by resolution of equipment. Distortions due to very small jitter may be smaller than distortions due to non-linear characteristics of loudspeakers. Ashihara and Kiryu [8] evaluated linearity of loudspeaker and headphones. According to their observation, headphones seem to be more preferable to produce sufficient sound pressure at the ear drums with smaller distortions than loudspeakers.' [30]

Signal processing[edit]

After initial recording, it is common for the audio signal to be altered in some way, such as with the use of compression, equalization, delays and reverb. With analog, this comes in the form of outboard hardware components, and with digital, the same is accomplished with plug-ins that are utilized in the user's DAW.

A comparison of analog and digital filtering shows technical advantages to both methods, and there are several points that are relevant to the recording process.

Analog hardware[edit]

An illustration of phase shift.
Phase shift: the sinusoidal wave in red has been delayed in time equal to the angle , shown as the sinusoidal wave in blue.

Many analog units possess unique characteristics that are desirable.

When altering a signal with a filter, the outputted signal may differ in time from the signal at the input, which is called a change in phase. Many equalizers exhibit this behavior, with the amount of phase shift differing in some pattern, and centered around the band that is being adjusted. This phase distortion can create the perception of a "ringing" sound around the filter band, or other coloration. Although this effect alters the signal in a way other than a strict change in frequency response, this coloration can sometimes have a positive effect on the perception of the sound of the audio signal.

Digital filters[edit]

Digital filters can be made to objectively perform better than analog components,[3][31] because the variables involved can be precisely specified in the calculations.

One prime example is the invention of the linear phase equalizer, which has inherent phase shift that is homogeneous across the frequency spectrum. Digital delays can also be perfectly exact, provided the delay time is some multiple of the time between samples, and so can the summing of a multitrack recording, as the sample values are merely added together.

A practical advantage of digital processing is the more convenient recall of settings. Plug-in parameters can be stored on the computer hard disk, whereas parameter details on an analog unit must be written down or otherwise recorded if the unit needs to be reused. This can be cumbersome when entire mixes must be recalled manually using an analog console and outboard gear. When working digitally, all parameters can simply be stored in a DAW project file and recalled instantly. Most modern professional DAWs also process plug-ins in real time, which means that processing can be largely non-destructive until final mix-down.

Analog modeling[edit]

Many plug-ins exist now that incorporate some kind of analog modeling. There are some engineers that endorse them and feel that they compare equally in sound to the analog processes that they imitate. Digital models also carry some benefits over their analog counterparts, such as the ability to remove noise from the algorithms and add modifications to make the parameters more flexible. On the other hand, other engineers also feel that the modeling is still inferior to the genuine outboard components and still prefer to mix "outside the box".[32]

Sound quality[edit]

Subjective evaluation[edit]

Subjective evaluation attempts to measure how well an audio component performs according to the human ear. The most common form of subjective test is a listening test, where the audio component is simply used in the context for which it was designed. This test is popular with hi-fi reviewers, where the component is used for a length of time by the reviewer who then will describe the performance in subjective terms. Common descriptions include whether the component has a 'bright' or 'dull' sound, or how well the component manages to present a 'spatial image'.

Another type of subjective test is done under more controlled conditions and attempts to remove possible bias from listening tests. These sorts of tests are done with the component hidden from the listener, and are called blind tests. To prevent possible bias from the person running the test, the blind test may be done so that this person is also unaware of the component under test. This type of test is called a double-blind test. This sort of test is often used to evaluate the performance of digital audio codecs.

There are critics of double-blind tests who see them as not allowing the listener to feel fully relaxed when evaluating the system component, and can therefore not judge differences between different components as well as in sighted (non-blind) tests. Those who employ the double-blind testing method may try to reduce listener stress by allowing a certain amount of time for listener training.[33]

Early digital recordings[edit]

Early digital audio machines had disappointing results, with digital converters introducing errors that the ear could detect.[34] Record companies released their first LPs based on digital audio masters in the late 1970s. CDs became available in the early 1980s. At this time analog sound reproduction was a mature technology.

There was a mixed critical response to early digital recordings released on CD. Compared to vinyl record, it was noticed that CD was far more revealing of the acoustics and ambient background noise of the recording environment.[35] For this reason, recording techniques developed for analog disc, e.g., microphone placement, needed to be adapted to suit the new digital format.[35]

Some analog recordings were remastered for digital formats. Analog recordings made in natural concert hall acoustics tended to benefit from remastering.[36] The remastering process was occasionally criticised for being poorly handled. When the original analog recording was fairly bright, remastering sometimes resulted in an unnatural treble emphasis.[36]

Super Audio CD and DVD-Audio[edit]

The Super Audio CD (SACD) format was created by Sony and Philips, who were also the developers of the earlier standard audio CD format. SACD uses Direct Stream Digital (DSD), which works quite differently from the PCM format discussed in this article. Instead of using a greater number of bits and attempting to record a signal's precise amplitude for every sample cycle, a DSD recorder uses a technique called sigma-delta modulation. Using this technique, the audio data is stored as a sequence of fixed amplitude (i.e. 1- bit) values at a sample rate of 2.884 MHz, which is 64 times the 44.1 kHz sample rate used by CD. At any point in time, the amplitude of the original analog signal is represented by the relative preponderance of 1's over 0's in the data stream. This digital data stream can therefore be converted to analog by the simple expedient of passing it through a relatively benign analog low-pass filter. The competing DVD-Audio format uses standard, linear PCM at variable sampling rates and bit depths, which at the very least match and usually greatly surpass those of a standard CD Audio (16 bits, 44.1 kHz).

In the popular Hi-Fi press, it had been suggested that linear PCM "creates [a] stress reaction in people", and that DSD "is the only digital recording system that does not [...] have these effects".[37] This claim appears to originate from a 1980 article by Dr John Diamond entitled Human Stress Provoked by Digitalized Recordings.[38] The core of the claim that PCM (the only digital recording technique available at the time) recordings created a stress reaction rested on "tests" carried out using the pseudoscientific technique of applied kinesiology, for example by Dr Diamond at an AES 66th Convention (1980) presentation with the same title.[39] Diamond had previously used a similar technique to demonstrate that rock music (as opposed to classical) was bad for your health due to the presence of the "stopped anapestic beat".[40] Dr Diamond's claims regarding digital audio were taken up by Mark Levinson, who asserted that while PCM recordings resulted in a stress reaction, DSD recordings did not.[41][42][43] A double-blind subjective test between high resolution linear PCM (DVD-Audio) and DSD did not reveal a statistically significant difference.[44] Listeners involved in this test noted their great difficulty in hearing any difference between the two formats.

Analog warmth[edit]

Some audio enthusiasts prefer the sound of vinyl records over that of a CD. Founder and editor Harry Pearson of The Absolute Sound journal says that "LPs are decisively more musical. CDs drain the soul from music. The emotional involvement disappears". Dub producer Adrian Sherwood has similar feelings about the analog cassette tape, which he prefers because of its warm sound.[45]

Those who favour the digital format point to the results of blind tests, which demonstrate the high performance possible with digital recorders.[46] The assertion is that the 'analog sound' is more a product of analog format inaccuracies than anything else. One of the first and largest supporters of digital audio was the classical conductor Herbert von Karajan, who said that digital recording was "definitely superior to any other form of recording we know". He also pioneered the unsuccessful Digital Compact Cassette and conducted the first recording ever to be commercially released on CD: Richard Strauss's Eine Alpensinfonie.

Hybrid systems[edit]

While the words analog audio usually imply that the sound is described using a continuous time/continuous amplitudes approach in both the media and the reproduction/recording systems, and the words digital audio imply a discrete time/discrete amplitudes approach, there are methods of encoding audio that fall somewhere between the two, e.g. continuous time/discrete levels and discrete time/continuous levels.

While not as common as "pure analog" or "pure digital" methods, these situations do occur in practice. Indeed, all analog systems show discrete (quantized) behaviour at the microscopic scale,[47] and asynchronously operated class-D amplifiers even consciously incorporate continuous time, discrete amplitude designs. Continuous amplitude, discrete time systems have also been used in many early analog-to-digital converters, in the form of sample-and-hold circuits. The boundary is further blurred by digital systems which statistically aim at analog-like behavior, most often by utilizing stochastic dithering and noise shaping techniques. While vinyl records and common compact cassettes are analog media and use quasi-linear physical encoding methods (e.g. spiral groove depth, tape magnetic field strength) without noticeable quantization or aliasing, there are analog non-linear systems that exhibit effects similar to those encountered on digital ones, such as aliasing and "hard" dynamic floors (e.g. frequency modulated hi-fi audio on videotapes, PWM encoded signals).

Although those "hybrid" techniques are usually more common in telecommunications systems than in consumer audio, their existence alone blurs the distinctive line between certain digital and analog systems, at least for what regards some of their alleged advantages or disadvantages.

There are many benefits to using digital recording over analog recording because “numbers are more easily manipulated than are grooves on a record or magnetized particles on a tape”.[48] Because numerical coding represents the sound waves perfectly, the sound can be played back without background noise.

See also[edit]

References[edit]

  1. ^ Liversidge, Anthony (February 1995). "Analog versus digital: has vinyl been wrongly dethroned by the music industry?". Omni. Vol. 17 no. 5.
  2. ^ a b c Maes, Jan; Vercammen, Marc, eds. (2001). Digital Audio Technology: A guide to CD, MiniDisc, SACD, DVD(A), MP3 and DAT (4 ed.). Focal Press. ISBN 0240516540. A 16-bit system, therefore, gives a theoretical signal-to-noise ratio of 98 dB...
  3. ^ a b "Chapter 21: Filter Comparison". dspguide.com. Retrieved 2012-09-13.
  4. ^ Mark Garrison (23 Sep 2011). "Encyclopedia of Home Recording: Dynamic Range".
  5. ^ "State-of-the-Art Audio Transfer". The Audio Archive. Retrieved 2018-05-14. Signal-to-Noise NAB (1/4-inch two-track 2.0 mm track, RMS, A-weighted) 30 ips - 75 dB
  6. ^ a b Jim Lesurf (May 18, 2000). "The 'digital' defects of the long-playing record". University of St. Andrews. Retrieved September 22, 2017.
  7. ^ Michael Fremer (January 6, 1999). "Pass Aleph Ono phono preamplifier". Retrieved 2018-05-14.
  8. ^ a b Metzler, Bob (2005). The Audio Measurement Handbook (2 ed.). Audio Precision, USA. Retrieved 2008-03-09.
  9. ^ a b c Stuart, J. "Coding High Quality Digital Audio" (PDF). Meridian Audio Ltd. Retrieved 2008-03-09This article is substantially the same as Stuart's 2004 JAES article "Coding for High-Resolution Audio Systems", Journal of the Audio Engineering Society, Volume 52 Issue 3 pp. 117–144; March 2004.
  10. ^ Elsea, Peter (1996). "Analog Recording of Sound". Electronic Music Studios at the University of California, Santa Cruz. Retrieved 2008-03-09.
  11. ^ a b c d e Dunn, Julian (2003). "Measurement Techniques for Digital Audio: Audio Precision Application Note #5". Audio Precision, Inc. Archived from the original on 2007-03-20. Retrieved 2008-03-09.
  12. ^ Manson, W. (1980). "Digital Sound: studio signal coding resolution for broadcasting" (PDF). BBC Research Department, Engineering Division. p. 8.
  13. ^ Jones, Wayne; Wolfe, Michael; Tanner, Theodore C. Jr.; Dinu, Daniel (March 2003). Testing Challenges in Personal Computer Audio Devices. 114th AES Convention. Archived from the original on 2008-03-07. Retrieved 2008-03-09.
  14. ^ "CD-R Unreadable in Less Than Two Years". myce.com. Retrieved 2007-02-01.
  15. ^ a b c Driscoll, R. (1980). Practical Hi-Fi Sound, 'Analogue and digital', pages 61–64; 'The pick-up, arm and turntable', pages 79–82. Hamlyn. ISBN 0-600-34627-7.
  16. ^ Stark, C. (1989). "High-fidelity concepts and systems". Macropaedia article 'Sound'. 27 (15 ed.). Encyclopædia Britannica. p. 625.
  17. ^ Technics EPC-100CMK4
  18. ^ "mastering". Positive-feedback.com. Retrieved 2012-08-15.
  19. ^ Byers, Fred R (October 2003). "Care and Handling of CDs and DVDs" (PDF). Council on Library and Information Resources. Retrieved 27 July 2014.
  20. ^ a b Thompson, Dan. Understanding Audio. Berklee Press, 2005, ch. 14.
  21. ^ a b Hawksford, Malcolm (September 1991). Introduction to Digital Audio Images of Audio, (PDF). Proceedings of the 10th International AES Conference. London. Archived from the original (PDF) on 2007-09-29. Retrieved 2008-03-09.
  22. ^ Muraoka, Teruo; Iwahara, Makoto; Yamada, Yasuhiro (1981). "Examination of Audio-Bandwidth Requirements for Optimum Sound Signal Transmission". Journal of the Audio Engineering Society. 29 (1/2): 2–9.
  23. ^ Kaoru, A.; Shogo, K (2001). Detection threshold for tones above 22 kHz. 110th AEC Convention. Audio Engineering SocietyPaper 5401
  24. ^ Dunn, Julian (1998). "Anti-alias and anti-image filtering: The benefits of 96kHz sampling rate formats for those who cannot hear above 20kHz" (PDF). Nanophon Limited. Retrieved 27 July 2014.
  25. ^ Nishiguchi, Toshiyuki; Iwaki, Masakazu; Ando, Akio (2004). Perceptual Discrimination between Musical Sounds with and without Very High Frequency Components. NHK Laboratories Note No. 486 (Report). NHK. Archived from the original on October 16, 2015. Retrieved August 15, 2012.
  26. ^ a b Katz, Bob (October 3, 2007). Mastering Audio: The Art and the Science (2nd ed.). Focal Press. ISBN 978-0240808376.
  27. ^ Knee, Anthony B.; Hawksford, Malcolm J. (February 1995). Evaluation of Digital Systems and Digital Recording Using Real Time Audio Data. 98th AES Convention. p. 3.
  28. ^ Hass, Jeffrey (2013). "Chapter 5: Principles of Digital Audio". Center for Electronic and Computer Music. Indiana University.
  29. ^ a b Rumsey, F.; Watkinson, J (1995). "Sections 2.5 and 6". The Digital Interface Handbook (2 ed.). Focal Press. pp. 37, 154–160.
  30. ^ Ashihara, Kaoru; Kiryu, Shogo; Koizumi, Nobuo; Nishimura, Akira; Ohga, Juro; Sawaguchi, Masaki; Yoshikawa, Shokichiro (2005). "Detection threshold for distortions due to jitter on digital audio". Acoustical Science and Technology. 26 (1): 50–54. doi:10.1250/ast.26.50. Archived from the original on 2009-08-12. Retrieved 2014-01-31.
  31. ^ John Eargle, Chris Foreman. Audio Engineering for Sound Reinforcement, The Advantages of Digital Transmission and Signal Processing. Retrieved 2012-09-14.
  32. ^ "Secrets Of The Mix Engineers: Chris Lord-Alge". Retrieved 2012-09-13.
  33. ^ Toole, Floyd (1994). "Section 11.7: Experimental Procedure". In Borwick, John. The Loudspeaker and Headphone Handbook (2 ed.). Focal Press. pp. 481–488. ISBN 0-240-51371-1.
  34. ^ Watkinson, J. (1994). "Section 1.2: What is digital audio? What can we hear?". An Introduction to Digital Audio. Focal Press. pp. 3, 26. ISBN 0-240-51378-9.
  35. ^ a b Greenfield, E.; et al. (1986). March, Ivan, ed. The Penguin Guide to Compact Discs, Cassettes and LPs. Penguin Books, England.
  36. ^ a b Greenfield, E.; et al. (1990). "Preface". In March, Ivan. The Penguin Guide to Compact Discs. Penguin Books, England. pp. viii–ix. ISBN 0-14-046887-0.
  37. ^ Hawksford, M. (2001). SDM versus LPCM: The Debate Continues (PDF). 110th AES Convention. Archived from the original (PDF) on 2006-05-13paper 5397
  38. ^ "Digital stress". The Diamond Center. 2003 [1980]. Archived from the original on 2004-08-12. Retrieved 17 July 2013.
  39. ^ Diamond, John; Lagadec, Roger (December 1985). "More on -Human Stress Provoked by Digitalized Recordings- and Reply". JAES. AES. 33 (12): 968. Retrieved 2013-08-16.
  40. ^ Fuller, John Grant (1981). Are the Kids All Right?: The Rock Generation and Its Hidden Death Wish. pp. 130–135. ISBN 0812909704.
  41. ^ Levinson, Mark. "Re-vitalizing Audio Industry:Music and Health" (PDF). Archived from the original (PDF) on 2014-03-23.
  42. ^ Levinson, Mark. "Mark Levinson: CD vs. SACD and LP". Redrosemusic.com. Archived from the original on 2012-02-29. Retrieved 2013-08-16.
  43. ^ Phillips, Wes (2005-07-05). "Mark Levinson & the Bobcat". Stereophilia. Stereophile.com. Retrieved 2013-08-16.
  44. ^ Blech, Dominik; Yang, Min-Chi (8–11 May 2004). DVD-Audio versus SACD: Perceptual Discrimination of Digital Audio Coding Formats (PDF). AES Convention:116. Berlin: Audio Engineering Society. Archived from the original on 27 September 2007. Retrieved 27 July 2014.
  45. ^ James Paul (2003-09-26). "Last night a mix tape saved my life | Music | The Guardian". London: Arts.guardian.co.uk. Retrieved 2012-08-15.
  46. ^ "ABX Testing article". Boston Audio Society. 1984-02-23. Retrieved 2012-08-15.
  47. ^ Lesurf, Jim. "Analog or Digital?". The Scots Guide to Electronics. St-andrews.ac.uk. Retrieved 2012-08-15.
  48. ^ Rudolph, Thomas E.; Leonard, Vincent A. (2001). Recording in the Digital World. Berklee Press Publications. p. 3. ISBN 0634013246.

Bibliography[edit]

External links[edit]