Direct Stream Digital

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

Direct Stream Digital
DSD logo
Media typeAudio recording process
Disc format
Read mechanismDSD
Write mechanismSuper Audio CD
StandardISO/IEC 14496-3
UsageAudio recording
Extended from1999
Extended topresent

Direct Stream Digital (DSD) is a trademark used by Sony and Philips for their system of digitally recreating audible signals for the Super Audio CD (SACD).

DSD uses pulse-density modulation encoding - a technology to store audio signals on digital storage media which are used for the SACD. The signal is stored as delta-sigma modulated digital audio, which is a sequence of single-bit values at a sampling rate of 2.8224 MHz (64 times the CD audio sampling rate of 44.1 kHz, but only at 1⁄32768 of its 16-bit resolution). Noise shaping occurs by use of the 64-times oversampled signal to reduce noise and distortion caused by the inaccuracy of quantization of the audio signal to a single bit. Therefore, it is a topic of discussion whether it is possible to eliminate distortion in one-bit delta-sigma conversion.[1]


DSD is a method of storing a delta-sigma signal before applying a decimation process that converts the signal to a PCM signal. Delta-sigma conversion was first described by C.C. Cutler in 1954,[2] but was not named as such until a 1962 paper by Inose et al. Decimation did not initially exist and oversampled data was sent as-is. The proposal to decimate oversampled delta-sigma data before converting it into PCM audio was made by D. J. Goodman in 1969.[3]

DSD technology was later developed and commercialized by Sony and Philips, the designers of the audio CD. However, in 2005, Philips later sold its DSD tool division to Sonic Studio.[4][better source needed]

Major label support

DVD-Audio was endorsed by the Warner Music Group, while the SACD format was endorsed by Sony and Universal Music Group, with an especially high-profile by UMG imprint Virgin Records.[5][6] Despite this, in 2011, The Warner Premium Sound series of albums was released by Warner Music Group, marking the first time the label released titles in a SACD format, with recording in DSD.[7] The series grew to ten rock and pop albums, with Super Audio CD/CD hybrid discs containing both an SACD layer and a standard CD layer.[8]

Sony did not promote SACD actively in North America, with the result that DVD-Audio gained competitive traction in the market. Elsewhere, such as in Europe or Japan, SACD gained more of a foothold.[9] Examples include the German Stockfisch Records, which releases vinyl editions of albums and DSD-recordings, released as hybrid SACDs.[10]

Independent label use

Many music companies that specialize in Super Audio CD products therefore use DSD encoding. A number of independent record labels have also worked directly with Sony to focus on DSD products or the DSD recording process.

DMP Digital Music Products was an early user of the SACD digital audio format.[11] In 1997 their release Alto by Joe Beck & Ali Ryerson was the first commercial recording captured with Sony's Direct Stream Digital recording technology. The label's Just Jobim by Manfredo Fest in 1998 was the first project captured with the new Meitner DSD conversion technology. In 2000, DMP released the world's first multi-channel SACD—Sacred Feast by Gaudeamus.[12]

The majority of Telarc International Corporation's releases were on (generally hybrid) SACD, and are DSD recordings.[13][failed verification] Telarc often worked with early audiophile company Soundstream, and re-released many of its original Soundstream recordings in SACD format.[14] Soundstream, which made the first digital recording in the United States, recorded in 16 bit PCM at a sample rate of 50 kHz via its own proprietary digital recorder. This 50 kHz PCM format was converted to DSD for release on Telarc SACD.

The record label Mobile Fidelity had engineers who decided to adopt the Super Audio CD over the DVD-Audio disc as a high resolution digital format after listening tests and technical evaluations. On the label's Hybrid SACD releases, the SACD layer is a direct DSD recording of the analog master tape, while the CD layer is a digital down conversion of the DSD, with Super Bit Mapping applied. Post-2001, CD-only are sourced from DSD, but omit the SACD layer.[15]

In 2007, Blue Coast Records was founded in California for the purpose of recording and releasing music recorded with the DSD format, primarily focusing on jazz and acoustic artists.[16]

On August 28, 2013, the Acoustic Sounds label launched, which sells mainstream albums from major record labels that were produced with Direct Stream Digital or PCM audio formats.[17][18] On September 4, 2013, Acoustic Sounds announced an agreement with Sony Music Entertainment to provide the company's new digital download service with albums that have been produced or remastered in Direct Stream Digital format.[19]

The format is used on albums such as Pop, Songs & Death in 2009,[20] and the remastered The Rolling Stones album Their Satanic Majesties Request in 2002.[21][22]

DSD technique[edit]

Comparison with PCM

SACD audio is stored in DSD, which differs from the conventional PCM used by the compact disc or conventional computer audio systems.

A DSD recorder uses delta-sigma modulation. DSD is 1-bit with a 2.8224 MHz sampling rate. The output from a DSD recorder is a bitstream. The long-term average of this signal is proportional to the original signal. DSD uses noise shaping techniques to push quantization noise up to inaudible ultrasonic frequencies. In principle, the retention of the bitstream in DSD lets the SACD player use a basic (one-bit) DAC design with a low-order analog filter. The SACD format can deliver a dynamic range of 120 dB from 20 Hz to 20 kHz and an extended frequency response up to 100 kHz—though most current players list an upper limit of 80–90 kHz.

Most professional audiologists accept that the upper limit of human adult hearing is 20 kHz[23] and that high frequencies are the first to be affected by hearing loss.[24]

The process of creating a DSD signal is conceptually similar to taking a one-bit delta-sigma analog-to-digital (A/D) converter and removing the decimator, which converts the 1-bit bitstream into multi-bit PCM. Instead, the 1-bit signal is recorded directly and, in theory, only requires a lowpass filter to reconstruct the original analog waveform. In reality it is a little more complex, and the analogy is incomplete in that 1-bit sigma-delta converters are these days rather unusual, one reason being that a one-bit signal cannot be dithered properly: most modern sigma-delta converters are multi-bit.

Because of the nature of sigma-delta converters, one cannot make a direct comparison between DSD and PCM. An approximation is possible, though, and would place DSD in some aspects comparable to a PCM format that has a bit depth of 24 bits and a sampling frequency of 88200 Hz.

Because it has been extremely difficult to carry out DSP operations (for example performing EQ, balance, panning and other changes in the digital domain) in a one-bit environment, and because of the prevalence of solely PCM studio equipment such as Pro Tools, the vast majority of SACDs—especially rock and contemporary music, which rely on multitrack techniques—are in fact mixed in PCM (or mixed analog and recorded on PCM recorders) and then converted to DSD for SACD mastering.

To address some of these issues, a new studio format has been developed, usually referred to as DSD-wide, which retains the high sample rate of standard DSD, but uses an 8-bit, rather than single-bit digital word length, yet still relies heavily on the noise shaping principle. DSD-wide is PCM with noise shaping—and is sometimes disparagingly referred to as "PCM-narrow"—but has the added benefit of making DSP operations in the studio a great deal more practical. The main difference is that "DSD-wide" still retains 2.8224 MHz (64fs) sampling frequency while the highest frequency in which PCM is being edited is 384 kHz (8fs). The "DSD-wide" signal is down-converted to regular DSD for SACD mastering. As a result of this technique and other developments there are now a few digital audio workstations (DAWs) that operate, or can operate, in the DSD domain, notably Pyramix and some SADiE systems.

Another format for DSD editing is Digital eXtreme Definition (DXD), a PCM format with 24-bit resolution sampled at 352.8 kHz (or alternatively 384 kHz). DXD was initially developed for the Merging Pyramix workstation and introduced together with their Sphynx 2, AD/DA converter in 2004. This combination meant that it was possible to record and edit directly in DXD,[25] and that the sample only converts to DSD once before publishing to SACD. This offers an advantage to the user as the noise created by converting DSD rises dramatically above 20 kHz, and more noise is added each time a signal is converted back to DSD during editing.

Note that high-resolution PCM (DVD-Audio, HD DVD and Blu-ray Disc) and DSD (SACD) may still technically differ at high frequencies. A reconstruction filter is typically used in PCM decoding systems, much the same way that bandwidth-limiting filters are normally used in PCM encoding systems. Any error or unwanted artifact introduced by such filters typically affects the end-result. A claimed advantage of DSD is that product designers commonly choose to have no filtering, or modest filtering. Instead DSD leads to constant high levels of noise at these frequencies. The dynamic range of DSD decreases quickly at frequencies over 20 kHz due to the use of strong noise shaping techniques that push the noise out of the audio band, resulting in a rising noise floor just above 20 kHz. The dynamic range of PCM, on the other hand, is the same at all frequencies. However, almost all present-day DAC chips employ some kind of sigma-delta conversion of PCM files that results in the same noise spectrum as DSD signals. All SACD players employ an optional low-pass filter set at 50 kHz for compatibility and safety reasons, suitable for situations where amplifiers or loudspeakers cannot deliver an undistorted output if noise above 50 kHz is present in the signal.

Double-rate DSD (DSD128)[edit]

Referred to as DSD128 because the sample rate is 128 times that of CD. Since its establishment content creators have started to make 5.6 MHz DSD128 recordings available, such as the audiophile label Opus3.[26] Additionally a 44.1 kHz variant at 5.6448 MHz has been supported by multiple hardware devices such as the exaSound e20 Mk II DAC.[27] The Korg MR-1000 1-bit digital recorder samples at 5.6448 MHz, twice the SACD rate.

Quad-rate DSD (DSD256)[edit]

Referred to as DSD256 because the sample rate is 256 times that of CD. The Pyramix Virtual Studio Digital Audio Workstation allows for recording, editing and mastering all DSD formats, being DSD64 (SACD resolution), DSD128 (Double-DSD) and DSD256 (Quad-DSD). A 44.1 kHz variant of 12.288 MHz has been established. The exaSound e20 DAC was the first commercially available device capable of DSD256 playback at sampling rates of 11.2896/12.288 MHz.[28][29] The Merging Technologies Horus AD/DA Converter offers sample rates up to 11.2 MHz, or four times the SACD rate.

Octuple-rate DSD (DSD512)[edit]

Referred to as DSD512 because the sample rate is 512 times that of CD. It has a sample rate of 22.5792 MHz (512 times that of CD), or alternatively 24.576 MHz (512 times 48 kHz). Hardware such as the Amanero Combo384 DSD output adapter, and exaU2I USB to I²S interface, and software such as JRiver Media Center, foobar2000 with SACD plugin (up to DSD256 only), Roon,[30] HQPlayer and Neutron Music Player are all able to handle DSD files of this advanced sampling rate fully natively.

DSD playback options[edit]

Sony developed DSD for SACD, and many disk players support SACD. Since the format is digital, there are other ways to play back a DSD stream; the development of these alternatives has enabled companies to offer high-quality music downloads in DSD.[31]

DSD disc format[edit]

Some professional audio recorders (from Korg, Tascam, and others) can record in DSD format. Transferring this signal to a recordable DVD with the appropriate tools, such as the AudioGate software bundled with Korg MR-1/2/1000/2000 recorders, renders a DSD Disc. Such discs can be played back in native DSD only on certain Sony VAIO laptops and PlayStation 3 systems.[32] HQPlayer from February 16, 2011, version 2.6.0 beta includes support for direct/native playback from DSD Interchange File Format (DSDIFF) and DSD stream files (DSF) to ASIO devices with DSD support. Moreover, Sony produces two SACD players, the SCD-XA5400ES and the SCD-XE800, that fully support the DSD-disc format. Only the DSF format is supported. DSF is a stereo-only, simplified form of DFF, the format used for SACD mastering and 5.1-channel downloads. However, since most personal computers have only PCM audio hardware, DSD discs must be transcoded to PCM on the fly with the proper software plug-ins with questionable quality benefits compared to native high resolution PCM sources like DVD or Blu-ray Disc Audio.

In June 2012, Pioneer launched a series of SACD players compatible with DSD-disc. The PD-30 and PD-50.

In January 2013, TEAC announced a DSD-disc compatible player, the PD-501HR.[33]

DSD over USB[edit]

An alternative to burning DSD files onto disks for eventual playback is to transfer the (non-encrypted) files from a computer to audio hardware over a digital link such as USB.

The USB audio 2.0 specification defined several formats for the more common PCM approach to digital audio, but did not define a format for DSD.

In 2012, representatives from many companies and others developed a standard to represent and detect DSD audio within PCM frames; the standard, commonly known as "DSD over PCM", or "DoP", is suitable for other digital links that use PCM.[34] The 1.1 revision added protocol support for higher DSD sample rates without requiring an increase the underlying PCM sample rate. Many manufacturers now offer DACs that support DoP.

DSD vs. PCM[edit]

There has been much controversy between proponents of DSD and PCM over which encoding system is superior. In 2001, Lipshitz and Vanderkooy stated that one-bit converters, as employed by DSD, are unsuitable for high-end applications due to their high distortion.[35] In 2002, Philips published a paper arguing the contrary.[36] Lipshitz and Vanderkooy's paper was further criticized by Angus.[37] Lipshitz and Vanderkooy[38] responded to the criticisms.[39] Stuart also defined sigma delta modulation a "totally unsuitable choice" for high resolution digital audio.[40]

Conventional implementation of DSD has an intrinsic high distortion.[41] Distortion can be alleviated to some degree by using multibit digital to analogue converter. Historically, state-of-the-art ADCs were based around sigma-delta modulation designs. Oversampling converters are frequently used in linear PCM formats, where the ADC output is subject to bandlimiting and dithering.[42] Many modern converters use oversampling and a multi-bit design.[clarification needed]

Comparisons of DSD and PCM recordings with the same origin, number of channels and similar bandwidth and noise have yielded contradictory results. A study conducted at the Erich-Thienhaus Institute in Detmold, Germany found that in double-blind tests "hardly any of the subjects could make a reproducible distinction between the two encoding systems."[43] In contrast, a 2014 study conducted at the Tokyo University of the Arts found that listeners could distinguish PCM (192 kHz/24 bits) from either DSD (2.8 MHz) or DSD (5.6 MHz) (but not between the two DSD samplings), preferring the sound of DSD over PCM: "For example, Drums stimulus of DSD (5.6 MHz) has p = 0.001 when compared against PCM (192 kHz/24 bit) in overall preference. This suggests that DSD version was statistically significantly preferred over the PCM version."[44]

DSD met with relatively little success in the consumer market, even though the SACD was actually more successful than its direct competitor, the PCM-based DVD-Audio. Direct manipulation of recorded DSD data is difficult due to limited availability of appropriate software. The advent of new high-resolution PCM standards, such as DXD, further restricted its market niche. DSD, however, is still used as an archival format for studio applications, and it's seen as a possible replacement for analog tapes.

See also[edit]


  1. ^ Lipshitz, Stanley P.; Vanderkooy, John (May 12, 2001). "Why 1-Bit Sigma-Delta Conversion is Unsuitable for High-Quality Applications" (PDF). Audio Engineering Society. Retrieved July 6, 2011.
  2. ^ US patent 2967962, Cutler, Cassius C., "Transmission systems employing quantization", issued March 8, 1960 
  3. ^ "Data Converter Architectures: Chapter 3" (PDF). Retrieved October 27, 2018.
  4. ^ "Sonic Studio Takes Over Philips ProTech DSD i SACD Products [HFR]". HighFidelityReviews. July 9, 2005. Retrieved October 27, 2018.
  5. ^ Davis, Nick. "Genesis on SACD: Nick Davis interview" (Interview). Interviewed by Christian Gerhardts. Retrieved November 3, 2011.
  6. ^ "Genesis Reviews | SACD". Retrieved October 27, 2018.
  7. ^ Marchese, Joe (July 15, 2011). "Eagles, Deep Purple, Yes! Out-Of-Print DVD-Audios Coming To SACD". The Second Disc. Retrieved November 3, 2011.
  8. ^ "Ultradisc UHR SACD". Mobile Fidelity. Retrieved October 27, 2018.
  9. ^ "March 30". March 30, 2008. Retrieved October 27, 2018. In Europe and some other parts of the world this was done using the SACD format but not in North America, like with other reissues (such as the Depeche Mode albums).
  10. ^ "A new format: The Stockfisch DMM-CD/SACD". Stockfisch Records. February 5, 2016. Retrieved October 27, 2018.
  11. ^ Jung, Tom (June 27, 2004). "Tom Jung of DMP: Making Musical Sense". Stereophile Magazine (Interview). Interviewed by David Lander. Retrieved October 27, 2018.
  12. ^ "Gaudeamus – Sacred Feast". Discogs. Retrieved October 27, 2018.
  13. ^ "Official site". Archived from the original on January 14, 1998. Retrieved January 6, 2015.
  14. ^ Renner, Jack (Fall 1992), "The Roots of Telarc", Telarc newsletter
  15. ^ "Enjoy the Music". Retrieved November 12, 2010.
  16. ^ Rick Clark (May 1, 2004). "5.1 Recording Vibes With ESE". Mix. Archived from the original on December 3, 2013. Retrieved November 19, 2013.
  17. ^ Brockhouse, Gordon (August 28, 2013). "Acoustic Sounds Launches Hi-Res Download Service". MarketNews. Archived from the original on September 4, 2013. Retrieved October 27, 2018.
  18. ^ Snider, Mike (August 28, 2013). "Kansas firm raises bar on high-res music downloads". USA TODAY.
  19. ^ Lavorgna, Michael (September 4, 2013). "Acoustic Sounds Announces DSD Deal with Sony Music!". AudioStream. Retrieved October 27, 2018.
  20. ^ "Wheatus' new record downloadable in DSD". July 24, 2009. Archived from the original on September 29, 2018. Retrieved October 27, 2018.
  21. ^ Walsh, Christopher (August 24, 2002). "The Rolling Stones Remastered - Super Audio CDs". Billboard. p. 27.
  22. ^ "Rolling Stones Turn To 'Super Audio' For Reissues". Billboard. May 30, 2002. Retrieved October 27, 2018.
  23. ^ Rosen, Stuart (2011). Signals and Systems for Speech and Hearing (2nd ed.). BRILL. p. 163. For auditory signals and human listeners, the accepted range is 20Hz to 20kHz, the limits of human hearing
  24. ^ Bitner-Glindzicz, M (2002). "Hereditary deafness and phenotyping in humans". British Medical Bulletin. 63 (1): 73–94. doi:10.1093/bmb/63.1.73. PMID 12324385.
  25. ^ "Digital HD Audio Formats". Archived from the original on June 21, 2015. Retrieved October 27, 2014.
  26. ^ Hiro (April 7, 2013). "Audiophile label Opus3 starts offering 5.6448MHz DSD downloads". Pure Super Audio. Retrieved October 27, 2018.
  27. ^ "exaSound e20 Mk II".
  28. ^ "e20 Mk II - the first high-end DAC capable of DSD256".
  29. ^ "e20 Mk III - the first high-end DAC capable of 12.288 MHz (DSD256)".
  30. ^ "DSD512 ...upsampled free demo from [Resolved with build 218]". April 17, 2017. Thanks @wizardofoz, managed to download the 'Wechseldominante - Babel DSD512\Wechseldominante - Babel (DSD512).dff' and playback on my Holo Spring DAC. Direct DSD512 playback is far less computer intensive if it doesn't do any form of conversion. I managed to playback DSD512 without any issue.
  31. ^ Robinson, David W. (April 2012). "The Higher End: From an Editor's Notebook - Downloadable DSD". Positive Feedback Online. Retrieved May 14, 2012.
  32. ^ "PS3 SACD FAQ". Archived from the original on October 19, 2018. Retrieved August 8, 2010.
  33. ^ "CES 2013 Announcement".
  34. ^ Koch, Andreas; et al. (March 30, 2012). "Method for transferring DSD Audio over PCM Frames Version 1.1". Archived from the original on May 15, 2012. Retrieved May 14, 2012.
  35. ^ Audio Engineering Society Convention Paper 5395.
  36. ^ Convention Paper 5616. Archived March 13, 2019, at the Wayback Machine
  37. ^ Angus-Whiteoak, Jamie A S. "The Effect of Idle Tone Structure on Effective Dither in Delta-Sigma Modulation Systems", "Session X: SIGNAL PROCESSING FORUM - PART 1". Audio Engineering Society 112th Convention, Munich, April, 2002, #5619. (Professor of Audio Technology) Professor James Angus BSc, PhD (Kent), FIOA, MAES Publications Research Projects
  38. ^ "John Vanderkooy". Member Profile. Audio Engineering Society. Retrieved January 23, 2021.
  39. ^ Vanderkooy, John; Lipshitz, Stanley P. (April 1, 2002). "Towards a Better Understanding of 1-Bit Sigma-Delta Modulators - Part 3". Audio Engineering Society. Retrieved January 23, 2021. "Session X: SIGNAL PROCESSING FORUM - PART 1". Audio Engineering Society 112th Convention, Munich, April, 2002..
  40. ^ Stuart, J. Robert (March 2004). "Coding for High-Resolution Audio Systems" (PDF). J. Audio Eng. Soc. 52 (3): 139–142. S2CID 18072620.
  41. ^ Hawksford, M. (2001). "SDM versus LPCM: The Debate Continues" (PDF). AES Convention: 5397. Archived from the original (PDF) on May 13, 2006.
  42. ^ (Hawksford 1995).
  43. ^ Blech, Dominic; Yang, Min-Chi (2004). "DVD-Audio versus SACD: Perceptual Discrimination of Digital Audio Coding Formats" (PDF). Erich-Thienhaus-Institute (Tonmeisterinstitut). University of Music Detmold, Germany. Archived from the original (PDF) on September 27, 2007. Retrieved July 27, 2014.
  44. ^ Marui, A., Kamekawa, T., Endo, K. and Sato, E., 2014, April. Subjective evaluation of high resolution recordings in PCM and DSD audio formats. In Audio Engineering Society Convention 136. Audio Engineering Society.

External links[edit]