Jump to content

Container format

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Reim (talk | contribs) at 00:37, 17 May 2010. The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

A container or wrapper format is a meta-file format whose specification describes how data and metadata are stored (not coded). A program able to identify and open a container file might not be able to decode the contained data. This can be caused by the opening program lacking the required decoding algorithm, or the meta-data not providing enough information.

By definition, a container format could wrap any kind of data. Though there are a few examples of such file formats (e.g. Microsoft Windows's DLL files), most container formats are specialized for the specific requirements of the data. For example, a popular family of containers is found among multimedia file formats. Since audio and video streams can be coded and decoded with many different algorithms, a container format can be used to provide a single file to the user.

Among the earliest cross-platform container formats are Distinguished Encoding Rules (DER), and the 1985 Interchange File Format (IFF).

Multimedia container formats

The container file is used to identify and interleave the different data types. Simpler container formats can contain different types of audio codecs, while more advanced container formats can support multiple audio and video streams, subtitles, chapter-information, and meta-data (tags) — along with the synchronization information needed to play back the various streams together. In most cases, the file header, most of the metadata and the synchro chunks are specified by the container format (for example container formats exist for optimized, low-quality, internet video streaming which, for example, differs from high-quality DVD streaming requirements).

The constituent parts of a container format have various names; they are often called "chunks", as in RIFF and PNG, while they are called "packets" in MPEG-TS (from the communications term), and they are called "segments" in JPEG. The main content of a chunk is called the "data" or the "payload". Most container formats have chunks in sequence, each with a header, while TIFF unusually instead stores offsets. Modular chunks make it easy to recover other chunks in case of file corruption or dropped frames or bit slip, while offsets result in framing errors in cases of bit slip.

Some containers are exclusive to audio:

  • AIFF (IFF file format, widely used on Mac OS platform)
  • WAV (RIFF file format, widely used on Windows platform)
  • XMF (Extensible Music Format)

Other containers are exclusive to still images:

  • FITS (Flexible Image Transport System) is a wrapper file format for still images, raw data, and associated metadata.
  • TIFF (Tagged Image File Format) is a wrapper file format for still images and associated metadata.

Other flexible containers can hold many types of audio and video, as well as other media. The most popular multi-media containers are:

There are many other container formats, such as NUT, MXF, GXF, ratDVD, SVI, VOB and DivX Media Format

Single coding formats

In addition to pure container formats, which specify only the wrapper but not the coding, a number of file formats specify both a storage layer and the coding, as part of modular design and forward compatibility.

Examples include JPEG File Interchange Format (JFIF) for containing JPEG data, and Portable Network Graphics (PNG).

In principle, coding can be changed while the storage layer is retained; for example, Multiple-image Network Graphics (MNG) uses the PNG container format but provides animation, while JPEG Network Graphics (JNG) puts JPEG encoded data in a PNG container; in both cases however, the different formats have different magic numbers – the format specifies the coding, though a MNG can contain both PNG-encoded images and JPEG-encoded images.

Issues

The differences between various container formats arise from five main issues:

  1. Popularity; how widely supported a container is.
  2. Overhead. This is the difference in file-size between two files with the same content in a different container.
  3. Support for advanced codec functionality. Older formats such as AVI do not support new codec features like B-frames, VBR audio, VFR natively. The format may be "hacked" to add support, but this creates compatibility problems.
  4. Support for advanced content, such as chapters, subtitles, meta-tags, user-data.
  5. Support of streaming media

See also

Similar packaging structures

References