AES3
AES3
(Redirected from AES-EBU embedded timecode)
Jump to navigation
Jump to search
AES3 (also known as AES/EBU) is a standard for the exchange of digital audio signals between professional audio devices. An AES3 signal can carry two channels of PCM audio over several transmission media including balanced lines, unbalanced lines, and optical fiber.[1]
AES3 was jointly developed by the Audio Engineering Society (AES) and the European Broadcasting Union (EBU). The standard was first published in 1985 and was revised in 1992 and 2003. AES3 has been incorporated into the International Electrotechnical Commission's standard IEC 60958, and is available in a consumer-grade variant known as S/PDIF.
AES3 was jointly developed by the Audio Engineering Society (AES) and the European Broadcasting Union (EBU). The standard was first published in 1985 and was revised in 1992 and 2003. AES3 has been incorporated into the International Electrotechnical Commission's standard IEC 60958, and is available in a consumer-grade variant known as S/PDIF.
Contents
History and development
Variants using different physical connections are specified in IEC 60958. These are essentially consumer versions of AES3 for use within the domestic high fidelity environment using connectors more commonly found in the consumer market. These variants are commonly known as S/PDIF.
Hardware connections
The AES3 standard parallels part 4 of the international standard IEC 60958. Of the physical interconnection types defined by IEC 60958, two are in common use.IEC 60958 type I
XLR connectors, used for IEC 60958 type I connections.
| Cable end | Device end | |
|---|---|---|
| Input | XLR male plug | XLR female jack |
| Output | XLR female plug | XLR male jack |
IEC 60958 type II
IEC 60958 Type II defines an unbalanced electrical or optical interface for consumer electronics applications. The precursor of the IEC 60958 Type II specification was the Sony/Philips Digital Interface, or S/PDIF. Both were based on the original AES/EBU work. S/PDIF and AES3 are interchangeable at the protocol level, but at the physical level they specify different electrical signaling levels and impedances, which may be significant in some applications.BNC Connector
BNC connector, used for AES-3id connections.
Other formats
AES3 digital audio format can also be carried over an Asynchronous Transfer Mode network. The standard for packing AES3 frames into ATM cells is AES47. AES67 described digital audio transport over an IP network but does not include any of the ancillary data associated with AES3. A mechanism for carrying this additional data is described in SMPTE 2110-31.For information on the synchronization of digital audio structures, see the AES11 standard. The ability to insert unique identifiers into an AES3 bit stream is covered by the AES52 standard.
Protocol
Simple representation of the protocol for both AES3 and S/PDIF
- The low-level protocol for data transmission in AES3 and S/PDIF is largely identical, and the following discussion applies for S/PDIF, except as noted.
Each bit occupies one time slot. Each audio sample (of up to 24 bits) is combined with four flag bits and a synchronisation preamble which is four time slots long to make a subframe of 32 time slots. The 32 time slots of each subframe are assigned as follows:
| Time slot | Name | Description |
|---|---|---|
| 0–3 | Preamble | A synchronisation preamble (biphase mark code violation) for audio blocks, frames, and subframes. |
| 4–7 | Auxiliary sample (optional) | A low-quality auxiliary channel used as specified in the channel status word, notably for producer talkback or recording studio-to-studio communication. |
| 8–27, or 4–27 | Audio sample | One sample stored with most significant bit (MSB) last. If the auxiliary sample is used, bits 4–7 are not included. Data with smaller sample bit depths always have MSB at bit 27 and are zero-extended towards the least significant bit (LSB). |
| 28 | Validity (V) | Unset if the audio data are correct and suitable for D/A conversion. During the presence of defective samples, the receiving equipment may be instructed to mute its output. It is used by most CD players to indicate that concealment rather than error correction is taking place. |
| 29 | User data (U) | Forms a serial data stream for each channel (with 1 bit per frame), with a format specified in the channel status word. |
| 30 | Channel status (C) | Bits from each frame of an audio block are collated giving a 192-bit channel status word. Its structure depends on whether AES3 or S/PDIF is used. |
| 31 | Parity (P) | Even parity bit for detection of errors in data transmission. Excludes preamble; Bits 4–31 have an even number of ones. |
Synchronisation preamble
This is a specially coded preamble that identify the subframe and its position within the audio block. They are not normal BMC-encoded data bits, although they do still have zero DC bias.Three preambles are possible :
- X (or M) : 111000102 if previous time slot was 0, 000111012 if it was 1. (Equivalently, 100100112 NRZI encoded.) Marks a word for channel A (left), other than at the start of an audio block.
- Y (or W) : 111001002 if previous time slot was 0, 000110112 if it was 1. (Equivalently, 100101102 NRZI encoded.) Marks a word for channel B (right).
- Z (or B) : 111010002 if previous time slot was 0, 000101112 if it was 1. (Equivalently, 100111002 NRZI encoded.) Marks a word for channel A (left) at the start of an audio block.
The 8-bit preambles are transmitted in time allocated to the first four time slots of each subframe (time slots 0 to 3). Any of the three marks the beginning of a subframe. X or Z marks the beginning of a frame, and Z marks the beginning of an audio block.
| 0 | 1 | 2 | 3 | | 0 | 1 | 2 | 3 | Time slots _____ _ _____ _ / \_____/ \_/ \_____/ \_/ \ Preamble X _____ _ ___ ___ / \___/ \___/ \_____/ \_/ \ Preamble Y _____ _ _ _____ / \_/ \_____/ \_____/ \_/ \ Preamble Z ___ ___ ___ ___ / \___/ \___/ \___/ \___/ \ All 0 bits BMC encoded _ _ _ _ _ _ _ _ / \_/ \_/ \_/ \_/ \_/ \_/ \_/ \_/ \ All 1 bits BMC encoded | 0 | 1 | 2 | 3 | | 0 | 1 | 2 | 3 | Time slotsIn two-channel AES3, the preambles form a pattern of ZYXYXYXY…, but it is straightforward to extend this structure to additional channels (more subframes per frame), each with a Y preamble, as is done in the MADI protocol.
Channel status word
As stated before there is one channel status bit in each subframe, making one 192 bit word for each channel in each block. This 192 bit word is usually presented as 192/8 = 24 bytes. The contents of the channel status word are completely different between the AES3 and S/PDIF standards, although they agree that the first channel status bit (byte 0 bit 0) distinguishes between the two. In the case of AES3, the standard describes in detail how the bits have to be used. Here is a summary of the channel status word:[1]- byte 0: basic control data: sample rate, compression, emphasis
- bit 0: A value of 1 indicates this is AES3 channel status data. 0 indicates this is S/PDIF data.
- bit 1: A value of 0 indicates this is linear audio PCM data. A value of 1 indicates other (usually non-audio) data.
- bits 2–4: Indicates the type of signal preemphasis applied to the data. Generally set to 1002 (none).
- bit 5: A value of 0 indicates that the source is locked to some (unspecified) external time sync. A value of 1 indicates an unlocked source.
- Bits 6–7: Sample rate. These bits are redundant when real-time audio is transmitted (the receiver can observe the sample rate directly), but are useful if AES3 data is recorded or otherwise stored. Options are unspecified, 48 kHz (the default), 44.1 kHz, and 32 kHz.
- byte 1: indicates if the audio stream is stereo, mono or some other combination.
- bits 0–3: Indicates the relationship of the two channels; they might be unrelated audio data, a stereo pair, duplicated mono data, music and voice commentary, a stereo sum/difference code.
- bits 4–7: Used to indicate the format of the user channel word.
- byte 2: audio word length
- bits 0–2: Aux bits usage. This indicates how the aux bits (time slots 4–7) are used. Generally set to 0002 (unused) or 0012 (used for 24-bit audio data).
- bits 3–5: Word length. Specifies the sample size, relative to the 20- or 24-bit maximum. Can specify 0, 1, 2 or 4 missing bits. Unused bits are filled with 0, but audio processing functions such as mixing will generally fill them in with valid data without changing the effective word length.
- bits 6–7: Unused
- byte 3: used only for multichannel applications
- byte 4: Additional sample rate information.
- bits 0–1: indicate the grade of the sample rate reference, per AES11.
- bit 2: reserved
- bits 3–6: Extended sample rate. This indicates other sample rates, not representable in byte 0 bits 6–7. Values are assigned for 24, 96, and 192 kHz, as well as 22.05, 88.2, and 176.4 kHz.
- bit 7: This "sampling frequency scaling flag", if set, indicates that the sample rate is multiplied by 1/1.001 to match NTSC video frame rates.
- byte 5: reserved
- bytes 6–9: Four ASCII characters for indicating channel origin. Widely used in large studios.
- bytes 10–13: Four ASCII characters indicating channel destination, to control automatic switchers. Less often used.
- bytes 14–17: 32-bit sample address, incrementing block-to-block by 192 (because there are 192 frames per block). At 48 kHz, this wraps every 24h51m18.485333s.
- bytes 18–21: as above, but offset to indicate samples since midnight.[4]
- byte 22: contains information about the reliability of the channel status word.
- bits 0–3: reserved
- bit 4: if set, bytes 0–5 (signal format) are unreliable.
- bit 5: if set, bytes 6–13 (channel labels) are unreliable.
- bit 6: if set, bytes 14–17 (sample address) are unreliable.
- bit 7: if set, bytes 18–21 (timestamp) are unreliable.
- byte 23: CRC. This byte is used to detect corruption of the channel status word, as might be caused by switching mid-block. (Generator polynomial is x8+x4+x3+x2+1, preset to 1.)
Embedded timecode
SMPTE timecode timestamp data can be embedded within AES3 digital audio signals. It can be used for synchronization and for logging and identifying audio content. According to John Ratcliff's Timecode: A user's guide, it is embedded as a 32-bit binary word in bytes 18 to 21 of the channel status data.[5]See also
- ADAT Lightpipe
- AES-2id
- MADI, an extension with many more channels.
References
In 1977, stimulated by the growing need for standards in digital audio, the AES Digital Audio Standards Committee was formed.
Bytes 18 to 21, Bits 0 to 7: Time of day sample address code. Value (each Byte): 32-bit binary value representing the first sample of current block. LSBs are transmitted first. Default value shall be logic "0". Note: This is the time-of-day laid down during the source encoding of the signal and shall remain unchanged during subsequent operations. A value of all zeros for the binary sample address code shall, for the purposes of transcoding to real time, or to time codes in particular, be taken as midnight (i.e., 00 h, 00 mm, 00 s, 00 frame). Transcoding of the binary number to any conventional time code requires accurate sampling frequency information to provide the sample accurate time.
- Ratcliff, John (1999). Timecode: A user's guide. Focal Press. pp. 226, 228. ISBN 0-240-51539-0.
Further reading
- European Broadcasting Union, Specification of the Digital Audio Interface (The AES/EBU interface) Tech 3250-E third edition (2004)
- Watkinson, John (2001). The Art of Digital Audio Third Edition. Focal Press. ISBN 0-240-51587-0.
- Watkinson, John (August 1989). "The AES/EBU Digital Audio Interface". UK Conference: AES/EBU Interface. EBU-02.
- Emmett, John (1995). "Engineering Guidelines: The EBU/AES Digital Audio Interface" (PDF). EBU.
- AES-2id-2006: AES information document for digital audio engineering — Guidelines for the use of the AES3 interface. (Downloaded from the AES standards web site; see external links)
- Mark Yonge (June–July 2005). "AES3 Channel Status Revisited" (PDF). Line Up (101): 20–22. Archived from the original (PDF) on 2015-05-01. Retrieved 2013-09-01.
- "AES3 / AES-EBU channel status byte settings".
Коментарі
Дописати коментар