
High-Efficiency Advanced Audio Coding

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by 77.96.157.117 (talk) at 11:43, 25 May 2020 (Added Rocbox, as this supports playback of AAC-HE, and may add support to devices that the native firmware previously did not permit this format ~~~~). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

High-Efficiency Advanced Audio Coding
Filename extensions:
  MPEG/3GPP Container
  Apple Container
    • .m4a, .m4b, .m4p, .m4r, .m4v
  ADTS Stream (not raw; contains headers)
    • .aac
Internet media type: audio/aac, audio/aacp, audio/3gpp, audio/3gpp2, audio/mp4
Developed by: ISO
Type of format: Audio compression format
Contained by: MPEG-4 Part 14, 3GP and 3G2, ISO base media file format, Audio Data Interchange Format (ADIF), Audio Data Transport Stream (ADTS)
Extended from: AAC
Standard: ISO/IEC 14496-3
Hierarchical structure of AAC profile, AAC-HE profile and AAC-HE v2 profile, and compatibility between them. The AAC-HE profile decoder is fully capable of decoding any AAC profile stream. Similarly, the AAC-HE v2 decoder can handle all AAC-HE profile streams as well as all AAC profile streams. Based on the MPEG-4 Part 3 technical specification.[1]
Evolution from MPEG-2 AAC-LC (Low Complexity) Profile and MPEG-4 AAC-LC Object Type to AAC-HE v2 Profile.[2]
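
The profile hierarchy described above can be sketched as a subset check on coding tools. This is a minimal illustration, not part of any standard API: the names `PROFILE_TOOLS` and `can_fully_decode` are hypothetical, and the tool sets simply restate the AAC-LC, SBR and PS layering given in this article.

```python
# Hypothetical model of the profile hierarchy: each profile is the set of
# coding tools it uses (AAC-LC core, SBR, Parametric Stereo).
PROFILE_TOOLS = {
    "AAC": frozenset({"AAC-LC"}),
    "AAC-HE": frozenset({"AAC-LC", "SBR"}),
    "AAC-HE v2": frozenset({"AAC-LC", "SBR", "PS"}),
}


def can_fully_decode(decoder_profile: str, stream_profile: str) -> bool:
    """Return True if the decoder implements every tool the stream uses."""
    return PROFILE_TOOLS[stream_profile] <= PROFILE_TOOLS[decoder_profile]


if __name__ == "__main__":
    for decoder in PROFILE_TOOLS:
        for stream in PROFILE_TOOLS:
            print(f"{decoder} decoder, {stream} stream: "
                  f"{can_fully_decode(decoder, stream)}")
```

Modelling profiles as tool sets rather than as a simple ordering keeps the reason for compatibility visible: a decoder handles a stream in full when the stream's tools are a subset of its own.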

High-Efficiency Advanced Audio Coding (AAC-HE) is an audio coding format for lossy data compression of digital audio defined as an MPEG-4 Audio profile in ISO/IEC 14496-3. It is an extension of Low Complexity AAC (AAC-LC) optimized for low-bitrate applications such as streaming audio. The usage profile AAC-HE v1 uses spectral band replication (SBR) to enhance the modified discrete cosine transform (MDCT) compression efficiency in the frequency domain.[3] The usage profile AAC-HE v2 couples SBR with Parametric Stereo (PS) to further enhance the compression efficiency of stereo signals.

AAC-HE is used in digital radio standards like HD Radio,[4] DAB+ and Digital Radio Mondiale.

History

The progenitor of AAC-HE was developed by Coding Technologies by combining MPEG-2 AAC-LC with a proprietary mechanism for spectral band replication (SBR), to be used by XM Radio for their satellite radio service. Subsequently, Coding Technologies submitted their SBR mechanism to MPEG as a basis of what ultimately became AAC-HE.

AAC-HE v1 was standardized as a profile of MPEG-4 Audio in 2003 by MPEG and published as part of the ISO/IEC 14496-3:2001/Amd 1:2003[5] specification.

The AAC-HE v2 profile was standardized in 2006 as per ISO/IEC 14496-3:2005/Amd 2:2006.[1][6]

Parts of the AAC-HE specification had previously been standardized and published by various bodies in 3GPP TS 26.401,[7] ETSI TS 126 401 V6.1.0,[8] ISO/IEC 14496-3:2001/Amd 1:2003 and ISO/IEC 14496-3:2001/Amd 2:2004.[9]

At the time, Coding Technologies had already begun using the trade names AAC+ and aacPlus for what is now known as AAC-HE v1, and aacPlus v2 and eAAC+ for what is now known as AAC-HE v2.

Perceived quality

Testing indicates that material decoded from 64 kbit/s AAC-HE does not quite match the audio quality of material decoded from MP3 at 128 kbit/s using high quality encoders.[10][11][12][13] Once bitrate distribution and RMSD are taken into account, the test results in a tie between mp3PRO, AAC-HE and Ogg Vorbis.

Further controlled testing by 3GPP during their revision 6 specification process indicates that AAC-HE and AAC-HE v2 provide "Good" audio quality for music at low bit rates (e.g., 24 kbit/s).

In 2011, a public listening test[14] compared the two best-rated AAC-HE encoders at the time with Opus and Ogg Vorbis. At 64 kbit/s, Opus was statistically superior to all other contenders. Apple's implementation of AAC-HE ranked second and was statistically superior to both Ogg Vorbis and Nero AAC-HE, which tied for third place.

MPEG-2 and MPEG-4 AAC-LC decoders without SBR support will decode only the AAC-LC part of the audio, producing output at half the sampling frequency and therefore with reduced audio bandwidth. As a result, the high-end, or treble, portion of the audio signal is usually missing from the decoded output.
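
The halving described above can be made concrete with a little arithmetic. The sketch below assumes, as stated in the text, that the AAC-LC core runs at half the full output sampling frequency, and it uses the Nyquist limit (half the sampling frequency) as the upper bound on representable audio bandwidth. The function names are illustrative only.

```python
def core_sample_rate(full_output_rate_hz: int) -> int:
    """Sampling frequency seen by a decoder that ignores SBR (assumed half)."""
    return full_output_rate_hz // 2


def nyquist_limit(sample_rate_hz: int) -> float:
    """Upper bound on audio bandwidth for a given sampling frequency."""
    return sample_rate_hz / 2


if __name__ == "__main__":
    full_rate = 44100  # example output rate of an SBR-capable decoder
    legacy_rate = core_sample_rate(full_rate)
    print("With SBR:   ", full_rate, "Hz, bandwidth up to", nyquist_limit(full_rate), "Hz")
    print("Without SBR:", legacy_rate, "Hz, bandwidth up to", nyquist_limit(legacy_rate), "Hz")
```

For a 44.1 kHz stream, a decoder without SBR support would therefore output 22.05 kHz audio whose content cannot extend beyond about 11 kHz.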

Support

Encoding

Orban Opticodec-PC Streaming and File Encoders were the first commercially available encoders supporting AAC-LC/AAC-HE, in 2003. They are now deprecated and have been replaced by StreamS Encoders from StreamS/Modulation Index, which add many features, including support for xAAC-HE/Unified Speech and Audio Coding. The StreamS encoders are in use at some of the largest content providers and are widely regarded as an industry standard for live encoding.

Sony has supported AAC-HE encoding since SonicStage version 4.

iTunes 9 supports AAC-HE encoding and playback.[15][16]

Nero has released a free-of-charge command line AAC-HE encoder, Nero AAC Codec,[17] and also supports AAC-HE inside the Nero software suite.

Sorenson Media’s Squeeze Compression Suite includes an AAC-HEv1 encoder and is available for Mac OS X as well as Windows.

The 3GPP consortium released source code of a reference AAC-HEv2 encoder that appears to offer competitive quality.[18]

Die Plattenkiste and Winamp Pro also support ripping music to AAC-HE. Using a transcoding plugin for Winamp's media library, any file can be transcoded to AAC-HE.[19]

XLD, an OS X audio encoding program, offers encoding from any of its supported formats to AAC-HE.

Nokia PC Suite may encode audio files to eAAC+ format before transmitting them to a mobile phone.

AAC-HE v1 and v2 encoders are provided by the Fraunhofer FDK AAC library in Android 4.1 and later versions.[20]

Decoding

AAC-HE is supported in the open source FAAD/FAAD2 decoding library and all players incorporating it, such as VLC media player, Winamp, foobar2000, Audacious Media Player, SonicStage and Die Plattenkiste.

The Nero AAC Codec supports decoding HE and HEv2 AAC.

AAC-HE is also used by AOL Radio and Pandora Radio clients to deliver high-fidelity music at low bitrates.

iTunes 9.2 and iOS 4 include full decoding of AAC-HE v2 parametric stereo streams.

  • iTunes 9 through 9.1, iPhone OS 3.1 and Fall 2009 iPods support playback of AAC-HE version 1 only, with no parametric stereo.
  • Older versions of Apple iTunes, iPod Touch, and iPhone will play AAC-HE files at reduced fidelity because they ignore the spectral-band replication and parametric stereo information, instead playing them as though they were standard AAC-LC files without the high-frequency, or "treble," information that is only present in the SBR part of the signal.[21] These will report the track length as twice its actual length.[citation needed]

Dolby released Dolby Pulse decoders and encoders in September 2008. AAC-HE v2 is the core of Dolby Pulse, so files and streams encoded in Dolby Pulse will play back on AAC, AAC-HE v1 and v2 decoders. Conversely, files and streams encoded in AAC, AAC-HE v1 or v2 will play back on Dolby Pulse decoders.

Dolby Pulse provides the following additional capabilities beyond AAC-HE v2:

  • Ability to intelligently generate and insert reversible loudness normalization and dynamic range metadata into the encoded file/stream; this metadata can then be used to optimize the playback experience based on application and/or device.
  • Ability to insert custom metadata into the encoded file, and extract this metadata on playback

Dolby has additionally released a PC decoder as an SDK suitable for integration into PC applications requiring Dolby Pulse, AAC-HE or AAC playback capabilities.

AAC-HE v2 decoders are provided in all versions of Android.[20] Decoding is handled by Fraunhofer FDK AAC since Android version 4.1.

Clients

Application | Platform | Description
AIMP | Windows | A Winamp-like alternative music player.[22]
Adobe Flash Player | Windows, OS X, Chrome OS, Linux | Browser plug-in.[23][24] Supports AAC+ from any RTMP source. Live streams wrapped in an ADTS container are not natively supported and have to be re-wrapped (e.g. Icecast KH can serve streams in a .flv container, which is compatible with Flash).[a]
Amarok | Windows, Linux | Open-source music player.
Audacious Media Player | Windows, Linux | Open-source music player.
Deadbeef | Linux, Android | Open-source music player.
Die Plattenkiste | Windows | Freeware internet radio application (in German).
foobar2000 | Windows | Freeware music player.
FStream | OS X, iOS | Internet radio application.
GuguRadio | iOS | Internet radio application.
Internet Radio Player | Android | Internet radio player.
Internet Radio Box | iOS | Internet radio application.
iTunes | Windows, OS X | Freeware music player. Pre-installed on Mac computers.
JetAudio | Windows, Android | Shareware media player.
MediaHuman Audio Converter | Windows, OS X | Freeware audio converter. (Supports conversion of MP3, AAC, AIFF, WAV etc.)
MPlayer | Windows, OS X, Linux | Open-source media player.
mpv | Windows, OS X, Linux | Open-source media player.
Rockbox | Various portable media devices | Alternate firmware for various portable media players, such as Apple iPod and Creative Zen.
QuickTime X | OS X | Media player pre-installed on OS X Snow Leopard or later.
RealPlayer | Windows, OS X, Linux, Android | Freemium media player. (AAC-HE v2 will only play in mono.)[26]
Rhythmbox | Linux | Open-source music player.
Snowtape | OS X | Shareware internet radio application.
streamWriter | Windows | Open-source internet radio application.
StreamS HiFi Radio | iOS | Paidware internet radio player.
TuneIn Radio | iOS, Android, Windows Phone, BlackBerry | Internet radio player.
VLC media player | Windows, OS X, Linux, iOS, Android | Open-source media player.
Winamp | Windows, OS X, Android | Freeware media player.
XiiaLive | Android, iOS | Internet radio player.
Kodi | Windows, Linux, OS X, Android | Open-source media player.
Media Player Classic | Windows | Open-source media player.

Promotion aspects

Commercial trademarks and labeling

AAC-HE is marketed under the trademark aacPlus by Coding Technologies and under the trademark Nero Digital by Nero AG. Sony Ericsson, Nokia and Samsung use AAC+ to label support for AAC-HE v1 and eAAC+ to label support for AAC-HE v2 on their phones. Motorola uses AAC+ to indicate AAC-HE v1 and "AAC+ Enhanced" to indicate AAC-HE v2.[citation needed]

Licensing and patents

Companies holding patents for AAC-HE have formed a patent pool administered by Via Licensing Corporation[27] to provide a single point of license for product makers.

Patent licenses are required for end-product companies that make hardware or software products that include AAC-HE encoders and/or decoders.[28] Unlike the MP3 format before April 23, 2017,[29] content owners are not required to pay license fees to distribute content in AAC-HE.

Standards

AAC-HE profile was first standardized in ISO/IEC 14496-3:2001/Amd 1:2003.[5] AAC-HE v2 profile (AAC-HE with Parametric Stereo) was first specified in ISO/IEC 14496-3:2005/Amd 2:2006.[1][6][30] The Parametric Stereo coding tool used by AAC-HE v2 was standardized in 2004 and published as ISO/IEC 14496-3:2001/Amd 2:2004.[9][7]

The current version of the MPEG-4 Audio (including AAC-HE standards) is published in ISO/IEC 14496-3:2009.

Enhanced aacPlus is a required audio compression format in 3GPP technical specifications for 3G UMTS multimedia services and should be supported in IP Multimedia Subsystem (IMS), Multimedia Messaging Service (MMS), Multimedia Broadcast/Multicast Service (MBMS) and Transparent end-to-end Packet-switched Streaming Service (PSS).[31][32][33][34] AAC-HE version 2 was standardized under the name Enhanced aacPlus by 3GPP for 3G UMTS multimedia services in September 2004 (3GPP TS 26.401).[35]

AAC-HE and AAC-HE v2 audio coding for DVB applications is standardized by TS 101 154.[36][37] AacPlus v2 by Coding Technologies[38] is also standardized by the ETSI as TS 102 005 for Satellite services to Handheld devices (DVB-SH) below 3 GHz.

In December 2007, Brazil started broadcasting a terrestrial DTV standard called International ISDB-Tb, which uses H.264 video with AAC-LC audio on the main program (single or multi) and H.264 video with AAC-HEv2 audio in the 1Seg mobile sub-program.

Versions

The following table summarizes the different versions of AAC-HE:

Version | Common trade names | Codec feature | Standards
AAC-HE v1 | aacPlus v1, eAAC, AAC+, CT-aacPlus | AAC-LC + SBR | ISO/IEC 14496-3:2001/Amd 1:2003
AAC-HE v2 | aacPlus v2, eAAC+, AAC++, Enhanced AAC+ | AAC-LC + SBR + PS | ISO/IEC 14496-3:2005/Amd 2:2006
xAAC-HE | aacPlus v2, eAAC+, AAC++, Enhanced AAC+ | AAC-LC + SBR + PS + USAC | ISO/IEC 23003-3:2012/Amd 2:2012[39]

Notes

  1. ^ To deliver streaming audio, AAC data is most likely carried in either the Audio Data Interchange Format (ADIF) or via Audio Data Transport Stream (ADTS). You can parse these containers and create FLV audio tags in order to use the audio file with Data Generation Mode.[25]
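
Parsing an ADTS stream, as mentioned in the note above, starts with reading the fixed-size frame header. The following is a minimal sketch of such a header reader, based on the commonly documented ADTS bit layout (12-bit syncword, 2-bit profile, 4-bit sampling frequency index, 3-bit channel configuration, 13-bit frame length). The class and function names are hypothetical, and the sketch does not build FLV tags or validate the optional CRC.

```python
from dataclasses import dataclass

# Sampling frequencies selected by the 4-bit sampling_frequency_index field.
SAMPLE_RATES = [96000, 88200, 64000, 48000, 44100, 32000,
                24000, 22050, 16000, 12000, 11025, 8000, 7350]


@dataclass
class AdtsHeader:
    audio_object_type: int  # 2-bit profile field plus one (2 means AAC-LC)
    sample_rate_hz: int
    channel_config: int
    frame_length: int       # header plus payload, in bytes
    has_crc: bool


def parse_adts_header(data: bytes) -> AdtsHeader:
    """Decode the first 7 bytes of an ADTS frame."""
    if len(data) < 7:
        raise ValueError("an ADTS header needs at least 7 bytes")
    if data[0] != 0xFF or (data[1] & 0xF0) != 0xF0:
        raise ValueError("ADTS syncword 0xFFF not found")
    sf_index = (data[2] >> 2) & 0x0F
    if sf_index >= len(SAMPLE_RATES):
        raise ValueError("reserved sampling frequency index")
    return AdtsHeader(
        audio_object_type=((data[2] >> 6) & 0x03) + 1,
        sample_rate_hz=SAMPLE_RATES[sf_index],
        channel_config=((data[2] & 0x01) << 2) | (data[3] >> 6),
        frame_length=((data[3] & 0x03) << 11) | (data[4] << 3) | (data[5] >> 5),
        has_crc=(data[1] & 0x01) == 0,  # protection_absent == 0 means a CRC follows
    )


if __name__ == "__main__":
    # Hand-built example header: AAC-LC, 44.1 kHz, stereo, 200-byte frame, no CRC.
    example = bytes([0xFF, 0xF1, 0x50, 0x80, 0x19, 0x1F, 0xFC])
    print(parse_adts_header(example))
```

A re-wrapping tool would typically read `frame_length` to find the start of the next frame and repeat this step across the stream.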

References

  1. ^ a b c ISO/IEC JTC1/SC29/WG11/N7016 (2005-01-11), Text of ISO/IEC 14496-3:2001/FPDAM 4, Audio Lossless Coding (ALS), new audio profiles and BSAC extensions, archived from the original (DOC) on 2014-05-12, retrieved 2009-10-09
  2. ^ Fraunhofer IIS, MPEG-4 Audio and Video Technology (PDF), retrieved 2009-10-15[dead link]
  3. ^ Herre, J.; Dietz, M. (2008). "MPEG-4 high-efficiency AAC coding [Standards in a Nutshell]". IEEE Signal Processing Magazine. 25 (3): 137–142. doi:10.1109/MSP.2008.918684.
  4. ^ "Receiving NRSC-5". theori.io. Archived from the original on 20 August 2017. Retrieved 14 April 2018.
  5. ^ a b ISO (2003). "Bandwidth extension, ISO/IEC 14496-3:2001/Amd 1:2003". ISO. Archived from the original on 2012-01-04. Retrieved 2009-10-13.
  6. ^ a b ISO (2006). "Audio Lossless Coding (ALS), new audio profiles and BSAC extensions, ISO/IEC 14496-3:2005/Amd 2:2006". ISO. Archived from the original on 2012-01-04. Retrieved 2009-10-13.
  7. ^ a b 3GPP (2004-09-30). "3GPP TS 26.401 V6.0.0 (2004-09), General Audio Codec audio processing functions; Enhanced aacPlus General Audio Codec; General Description (Release 6)" (DOC). 3GPP. Archived from the original on 2006-08-19. Retrieved 2009-10-13.
  8. ^ 3GPP (2005-01-04). "ETSI TS 126 401 V6.1.0 (2004-12) - Universal Mobile Telecommunications System (UMTS); General audio codec audio processing functions; Enhanced aacPlus general audio codec; General description (3GPP TS 26.401 version 6.1.0 Release 6)". 3GPP. Retrieved 2009-10-13.
  9. ^ a b ISO (2004). "Parametric coding for high-quality audio, ISO/IEC 14496-3:2001/Amd 2:2004". ISO. Archived from the original on 2012-01-04. Retrieved 2009-10-13.
  10. ^ "Results of 64kbit/s Listening Test". archive.org. 23 June 2007. Archived from the original on 23 June 2007. Retrieved 3 May 2018.
  11. ^ "Multiformat Listening Test @ 48 kbps - FINISHED". www.hydrogenaud.io. Archived from the original on 8 July 2014. Retrieved 3 May 2018.
  12. ^ "80 kbps personal listening test (summer 2005)". www.hydrogenaud.io. Archived from the original on 8 July 2014. Retrieved 3 May 2018.
  13. ^ "MP3 – WMA – AAC – OGG – qualité à 96 kbps (évaluation) - Traitement Audio - Video & Son - FORUM HardWare.fr". forum.hardware.fr. Archived from the original on 15 July 2012. Retrieved 3 May 2018.
  14. ^ "Hydrogen audio 2011 multiformat listening test unofficial results page". people.xiph.org. Archived from the original on 25 July 2012. Retrieved 3 May 2018.
  15. ^ "Archived copy". Archived from the original on 2011-03-29. Retrieved 2011-03-29.
  16. ^ "iTunes". Apple. Archived from the original on 29 March 2011. Retrieved 3 May 2018.
  17. ^ "Nero AAC Codec". Archived from the original on 2009-12-11. Retrieved 2009-11-23.
  18. ^ Bouvigne, Gabriel (2006-03-20). "48kbps AAC public test results". MP3'Tech. Archived from the original on 2008-07-24. Retrieved 2008-09-05.
  19. ^ "Free Download Winamp Transcoder 2.0". www.free-codecs.com. Archived from the original on 20 August 2008. Retrieved 3 May 2018.
  20. ^ a b "Supported Media Formats". Google. Archived from the original on 2012-03-11. Retrieved 2013-10-10.
  21. ^ "iPod touch: Supported file formats". Apple Support. Retrieved 2019-04-07.
  22. ^ "AIMP". www.aimp.ru. Archived from the original on 8 November 2014. Retrieved 3 May 2018.
  23. ^ "Adobe Flash Player". www.adobe.com. Archived from the original on 23 July 2008. Retrieved 3 May 2018.
  24. ^ "Adobe bringing HD video, high quality audio to Flash using H.264, AAC (iPhone Flash support?) – MacDailyNews - Welcome Home". macdailynews.com. Archived from the original on 21 June 2015. Retrieved 3 May 2018.
  25. ^ "Playing Icecast streaming audio in Flash Player - Adobe Developer Connection". www.adobe.com. Archived from the original on 16 March 2015. Retrieved 3 May 2018.
  26. ^ "Archived copy". Archived from the original on 2015-03-18. Retrieved 2014-10-19.
  27. ^ Via Licensing. "Licensing Programs". Archived from the original on 2017-05-13. Retrieved 2017-05-11.
  28. ^ Via Licensing. "AAC Licensing FAQ". Archived from the original on 2017-05-22. Retrieved 2017-05-11.
  29. ^ Thomson. "Thomson/FhG MP3 Licensing". Archived from the original on 2017-01-17.
  30. ^ Mihir Mody (2005-06-06). "Audio compression gets better and more complex". Embedded.com. Retrieved 2009-10-13.[permanent dead link]
  31. ^ ETSI (2009-04) ETSI TS 126 234 V8.2.0 (2009-04); 3GPP TS 26.234; Transparent end-to-end Packet-switched Streaming Service (PSS); Protocols and codecs Archived 2008-12-01 at the Wayback Machine Page 58. Retrieved on 2009-06-02.
  32. ^ ETSI (2009-01) ETSI TS 126 140 V8.0.0 (2009-01); 3GPP TS 26.140; Multimedia Messaging Service (MMS); Media formats and codes Archived 2008-12-06 at the Wayback Machine Page 11. Retrieved on 2009-06-02.
  33. ^ ETSI (2009-01) ETSI TS 126 141 V8.0.0 (2009-01); 3GPP TS 26.141; IP Multimedia System (IMS) Messaging and Presence; Media formats and codecs Archived 2008-10-07 at the Wayback Machine Page 10. Retrieved on 2009-06-02.
  34. ^ 3GPP (2009). "ETSI TS 126 346 V8.3.0 (2009-06); 3GPP TS 26.346; Multimedia Broadcast/Multicast Service (MBMS); Protocols and codecs". ETSI. p. 85. Archived from the original on 2008-10-04. Retrieved 2009-10-13.
  35. ^ 3GPP (2004). "3GPP TS 26.401 - General audio codec audio processing functions; Enhanced aacPlus general audio codec; General description". 3GPP. Archived from the original on 2008-10-04. Retrieved 2009-10-13.
  36. ^ ETSI TS 101 154 v1.5.1: Specification for the use of Video and Audio Coding in Broadcasting Applications based on the MPEG-2 Transport Stream
  37. ^ ETSI (2009-03-31). "TS 101 154 version 1.9.1 - Digital Video Broadcasting (DVB); Specification for the use of Video and Audio Coding in Broadcasting Applications based on the MPEG-2 Transport Stream". ETSI. Archived from the original on 2013-04-14. Retrieved 2009-10-13.
  38. ^ "Archived copy" (PDF). Archived from the original (PDF) on 2006-10-26. Retrieved 2007-01-29.
  39. ^ "xHE-AAC". Fraunhofer Institute for Integrated Circuits IIS. Archived from the original on 30 December 2017. Retrieved 3 May 2018.