AES47 describes a standardised method for packing AES3 professional digital audio streams over Asynchronous Transfer Mode networks. The AES47 method is published by the International Electrotechnical Commission as IEC 62365.
Many professional audio systems are now combined with telecommunication and IT technologies to provide new functionality, flexibility and connectivity over both local and wide area networks. AES47 was developed to provide a standardised method of transporting the existing standard for digital audio (AES3) over current telecommunication interconnection standards that provide a quality of service required by many professional low latency, uncompressed live audio uses. It may be used directly between specialist audio devices or in combination with telecommunication and computer equipment with suitable network interfaces and utilises the same physical structured cable used as standard by those networks.
Common network protocols like Ethernet use large packet sizes, which produce an unacceptably large minimum latency. Asynchronous transfer mode divides data into 48-byte "cells" which provide acceptably low latency.
The original work was carried out at the British Broadcasting Corporation’s R&D department and published as "White Paper 074", which established that this approach provides the necessary performance for professional media production. AES47 was originally published in 2002 and was republished with minor revisions in February 2006. Amendment 1 to AES47 was published in February 2009, adding code points in the ATM Adaptation Layer Parameters Information Element to signal that the time to which each audio sample relates can be identified as specified in AES53.
The change in thinking from traditional ATM network design is not to necessarily use ATM to pass IP traffic (apart from management traffic) but to use AES47 in parallel with standard Ethernet structures to deal with extremely high performance secure media streams.
AES47 has been developed to allow the simultaneous transport and switched distribution of a large number of AES3 linear audio streams at different sample frequencies. AES47 can support any of the standard AES3 sample rates and word size. AES11 Annex D (the November 2005 printing or version of AES11-2003) shows an example method to provide isochronous timing relationships for distributed AES3 structures over asynchronous networks such as AES47 where reference signals may be locked to common timing sources such as GPS. AES53 specifies how timing markers within AES47 can be used to associate an absolute time stamp with individual audio samples as described in AES47 Amendment 1.
An additional standard has been published by the Audio Engineering Society to extend AES3 digital audio carried as AES47 streams to enable this to be transported over standard physical Ethernet hardware. This additional standard is known as AES51-2006.
For minimum latency, AES47 uses "raw" ATM cells, ATM adaptation layer 0. Each ATM virtual circuit negotiates the parameters of a stream at connection time. In addition to the same rate and number of channels (which may be more than the 2 supported by AES3), the negotiation covers the number of bits per sample and the presence of an optional data byte. The total must be 1, 2, 3, 4 or 6 bytes per sample, so it evenly divides the ATM cell size. AES3 uses 4 bytes per sample (24 bits of sample plus the optional data byte), but AES47 supports additional formats.
The optional data byte contains four "ancillary" bits corresponding to the AES3 VUCP bits. However, the P (parity) bit is replaced by a B bit which is set on the first sample of each audio block, and clear at all other times. This serves the same function as the B (or Z) synchronization preamble.
The other half of the data byte contains three "data protection" bits for error control and a sequencing bit. The concatenation of the sequencing bits from all samples in a cell (combined little-endian) form a sequencing word of 8, 12, 16, or 24 bits. Only the first 12 bits are defined.
The first four bits of the sequencing word are a sequencing number, used to detect dropped cells. This increments by 1 for each cell transmitted.
The second four bits are for error detection, with bit 7 being an even parity bit for the first byte.
The third four bits, if present, are a second sequencing number which can be used to align multiple virtual circuits.
- IEC 62365 preview
- Chambers, C.J. (September 2003). The development of ATM network technology for live production infrastructure (Technical report). BBC Research & Development. WHP 074.
- Audio Engineering Society standards web site
- Rumsey, Francis; Watkinson, John (September 2003). "5.4 AES47: Audio Over ATM". Digital Interface Handbook (3rd ed.). Focal press. ISBN 0-240-51909-4.