Closed captioning (CC) and subtitling are both processes of displaying text on a television, video screen, or other visual display to provide additional or interpretive information. Both are essentially the same and are typically used as a transcription of the audio portion of a program as it occurs (either verbatim or in edited form), sometimes including descriptions of non-speech elements. Other uses have been to provide a textual translation of a presentation's primary audio language into another language; such text is usually burned into the video ("open") rather than selectable by the viewer ("closed"). HTML5 defines subtitles as a "transcription or translation of the dialogue ... when sound is available but not understood" by the viewer (for example, dialogue in a foreign language) and captions as a "transcription or translation of the dialogue, sound effects, relevant musical cues, and other relevant audio information ... when sound is unavailable or not clearly audible" (for example, when audio is muted or the viewer is deaf or hard of hearing).
The term "closed" (versus "open") indicates that the captions are not visible until activated by the viewer, usually via the remote control or menu option. "Open", "burned-in", "baked on", or "hard-coded" captions are visible to all viewers.
Most of the world does not distinguish captions from subtitles. In the United States and Canada, however, these terms do have different meanings. "Subtitles" assume the viewer can hear but cannot understand the language or accent, or the speech is not entirely clear, so they transcribe only dialogue and some on-screen text. "Captions" aim to describe to the deaf and hard of hearing all significant audio content: spoken dialogue and non-speech information such as the identity of speakers and, occasionally, their manner of speaking, along with any significant music or sound effects, using words or symbols. The term closed caption has also come to refer to the North American EIA-608 encoding that is used with NTSC-compatible video.
The United Kingdom, Ireland, and most other countries do not distinguish between subtitles and closed captions, and use "subtitles" as the general term; the equivalent of "captioning" is usually referred to as "subtitles for the hard of hearing". Their presence is indicated on screen by a notation that says "Subtitles", or previously "Subtitles 888" or just "888" (the latter two refer to the conventional teletext channel for captions), which is why the term subtitle is also used to refer to the Ceefax-based Teletext encoding that is used with PAL-compatible video. The term subtitle has been replaced with caption in a number of PAL markets that still use Teletext, such as Australia and New Zealand, which purchase large amounts of imported US material, much of which already has the US CC logo superimposed over its start. In New Zealand, broadcasters superimpose an ear logo with a line through it that represents "Subtitles for the hard of hearing", even though they are currently referred to as captions. In the UK, modern digital television services have subtitles for the majority of programs, so it is no longer necessary to highlight which have captioning and which do not.
Closed captioning was first demonstrated at the First National Conference on Television for the Hearing Impaired in Nashville, Tennessee, in 1971. A second demonstration of closed captioning was held at Gallaudet College (now Gallaudet University) on February 15, 1972, where ABC and the National Bureau of Standards demonstrated closed captions embedded within a normal broadcast of The Mod Squad.
The closed captioning system was successfully encoded and broadcast in 1973 with the cooperation of PBS station WETA. As a result of these tests, the FCC in 1976 set aside line 21 for the transmission of closed captions. PBS engineers then developed the caption editing consoles that would be used to caption prerecorded programs.
Real-time captioning, a process for captioning live broadcasts, was developed by the National Captioning Institute in 1982. In real-time captioning, court reporters trained to write at speeds of over 225 words per minute give viewers instantaneous access to live news, sports and entertainment. As a result, the viewer sees the captions within two to three seconds of the words being spoken.
Major US producers of captions are WGBH-TV, VITAC, CaptionMax and the National Captioning Institute. In the UK and Australasia, Red Bee Media, itfc and Independent Media Support are the major vendors.
The National Captioning Institute was created in 1979 in order to get the cooperation of the commercial television networks.
The first regularly scheduled use of closed captioning on American television occurred on March 16, 1980. Sears had developed and sold the Telecaption adapter, a decoding unit that could be connected to a standard television set. The first programs seen with captioning were a Disney's Wonderful World presentation of the film Son of Flubber on NBC, an ABC Sunday Night Movie airing of Semi-Tough, and Masterpiece Theatre on PBS.
Legislative development in the U.S.
Until the passage of the Television Decoder Circuitry Act of 1990, television captioning was performed by a set-top box manufactured by Sanyo Electric and marketed by the National Captioning Institute (NCI). (At that time a set-top decoder cost about as much as a TV set itself, approximately $200.) Through discussions with the manufacturer it was established that the appropriate circuitry integrated into the television set would be less expensive than the stand-alone box, and Ronald May, then a Sanyo employee, provided expert witness testimony on behalf of Sanyo and Gallaudet University in support of the passage of the bill. On January 23, 1991, the Television Decoder Circuitry Act of 1990 was passed by the US Congress. This Act gave the Federal Communications Commission (FCC) power to enact rules on the implementation of closed captioning. It required all analog television receivers with screens of 13 inches or greater, either sold or manufactured, to have the ability to display closed captioning by July 1, 1993.
Also in 1990, the Americans with Disabilities Act (ADA) was passed to ensure equal opportunity for persons with disabilities. The ADA prohibits discrimination against persons with disabilities in public accommodations or commercial facilities. Title III of the ADA requires that public facilities, such as hospitals, bars, shopping centers and museums (but not movie theaters), provide access to verbal information on televisions, films or slide shows.
The Telecommunications Act of 1996 expanded on the Decoder Circuitry Act to place the same requirements on digital television receivers by July 1, 2002. All TV programming distributors in the U.S. are required to provide closed captions for Spanish language video programming as of January 1, 2010.
A bill, H.R. 3101, the Twenty-First Century Communications and Video Accessibility Act of 2010, was passed by the United States House of Representatives in July 2010. A similar bill, S. 3304, with the same name was passed by the United States Senate on August 5, 2010, by the House of Representatives on September 28, 2010, and was signed by President Barack Obama on October 8, 2010. The Act requires, in part, that ATSC-decoding set-top box remotes have a button to turn the closed captioning in the output signal on or off. It also requires broadcasters to provide captioning for television programs redistributed on the Internet.
On February 20, 2014, the FCC unanimously approved the implementation of quality standards for closed captioning, addressing accuracy, timing, completeness, and placement. This is the first time the FCC has addressed quality issues in captions.
Legislative development in Australia
The government of Australia provided seed funding in 1981 for the establishment of the Australian Caption Centre (ACC) and the purchase of equipment. Captioning by the ACC commenced in 1982 and a further grant from the Australian government enabled the ACC to achieve and maintain financial self-sufficiency. The ACC, now known as Media Access Australia, sold its commercial captioning division to Red Bee Media in December 2005. Red Bee Media continues to provide captioning services in Australia today.
Funding development in New Zealand
In 1981, TVNZ held a telethon to raise funds for Teletext encoding equipment used for the creation and editing of text-based broadcast services for the deaf. The service came into use in 1984, with caption creation and importing paid for as part of the public broadcasting fee until the creation of the taxpayer-funded NZ On Air. That fund is used to provide captioning for NZ On Air content and TVNZ news shows, and to convert EIA-608 US captions to the preferred EBU STL format, for TV One, TV2 and TV3 only; archived captions are available to FOUR and select Sky programming. During the second half of 2012, TV3 and FOUR began providing non-Teletext DVB image-based captions on their HD service and used the same format on the satellite service. This has since caused major timing issues related to server load, and the loss of captions from most SD DVB-S receivers, such as the ones Sky Television provides to its customers. As of April 2, 2013, only the Teletext page 801 caption service remained in use, with the informational Teletext non-caption content being discontinued.
Closed captions were created for deaf or hard of hearing individuals to assist in comprehension. They can also be used as a tool by those learning to read, learning to speak a non-native language, or in an environment where the audio is difficult to hear or is intentionally muted. Captions can also be used by viewers who simply wish to read a transcript along with the program audio.
In the United States, the National Captioning Institute noted that English as a foreign or second language (ESL) learners were the largest group buying decoders in the late 1980s and early 1990s before built-in decoders became a standard feature of US television sets. This suggested that the largest audience of closed captioning was people whose native language was not English. In the United Kingdom, of 7.5 million people using TV subtitles (closed captioning), 6 million have no hearing impairment.
Closed captions are also used in public environments, such as bars and restaurants, where patrons may not be able to hear over the background noise, or where multiple televisions are displaying different programs. In addition, online videos may be captioned automatically by algorithms that process their audio content, and the output often contains chains of compounding errors. When a video is accurately transcribed, the published closed captions serve a useful purpose, and the content is available for search engines to index and make available to users on the internet.
Some television sets can be set to automatically turn captioning on when the volume is muted.
Television and video
For live programs, spoken words comprising the television program's soundtrack are transcribed by a human operator (a speech-to-text reporter) using stenotype or stenomask machines, whose phonetic output is instantly translated into text by a computer and displayed on the screen. This technique was developed in the 1970s as an initiative of the BBC's Ceefax teletext service. In collaboration with the BBC, a university student took on the research project of writing the first phonetics-to-text conversion program for this purpose. Sometimes the captions of live broadcasts, such as news bulletins, sports events, and live entertainment shows, fall behind by a few seconds. The delay arises because the system cannot know what a person is going to say next, so the captions appear only after the sentence has been spoken. Automatic computer speech recognition now works well when trained to recognize a single voice, and so since 2003 the BBC has done live subtitling by having someone re-speak what is being broadcast. Live captioning is also a form of real-time text. Meanwhile, sports events on channels like ESPN use court reporters working with a special (steno) keyboard and individually constructed "dictionaries".
In some cases, the transcript is available beforehand and captions are simply displayed during the program after being edited. For programs that have a mix of pre-prepared and live content, such as news bulletins, a combination of the above techniques is used.
For prerecorded programs, commercials, and home videos, audio is transcribed and captions are prepared, positioned, and timed in advance.
For all types of NTSC programming, captions are "encoded" into line 21 of the vertical blanking interval – a part of the TV picture that sits just above the visible portion and is usually unseen. For ATSC (digital television) programming, three streams are encoded in the video: two are backward compatible "line 21" captions, and the third is a set of up to 63 additional caption streams encoded in EIA-708 format.
Captioning is modulated and stored differently in PAL and SECAM 625 line 25 frame countries, where teletext is used rather than EIA-608, but the methods of preparation and the line 21 field used are similar. For home Betamax and VHS videotapes, this line 21 field must be shifted down because of the greater number of VBI lines used in 625 line PAL countries, though only a small minority of European PAL VHS machines support this (or any) format for closed caption recording. Like all teletext fields, teletext captions cannot be stored by a standard 625 line VHS recorder (due to the lack of field shifting support), but they are available on all professional S-VHS recordings because all fields are recorded. Recorded Teletext caption fields also suffer from a higher number of caption errors due to the increased number of bits and a low SNR, especially on low bandwidth VHS. This is why Teletext captions used to be stored on floppy disk, separately from the analogue master tape. DVDs have their own system for subtitles and/or captions, which are digitally inserted in the data stream and encoded on playback into video field lines.
For older televisions, a set-top box or other decoder is usually required. In the US, since the passage of the Television Decoder Circuitry Act, manufacturers of most television receivers sold have been required to include closed captioning display capability. High-definition TV sets, receivers, and tuner cards are also covered, though the technical specifications are different (high-definition display screens, as opposed to high-definition TVs, may lack captioning). Canada has no similar law, but receives the same sets as the US in most cases.
During transmission, single byte errors can be replaced by a white space, which can appear at the beginning of the program. More byte errors during EIA-608 transmission can affect the screen momentarily by defaulting to a real-time mode such as the "roll up" style, typing random letters on screen, and then reverting to normal. Uncorrectable byte errors within the teletext page header will cause whole captions to be dropped. Because EIA-608 carries only two characters per video frame, it sends captions ahead of time and stores them in a second buffer to await a display command, whereas Teletext sends them in real time.
The use of capitalization varies between caption providers. Most providers capitalize all words, while providers such as WGBH and non-US providers prefer to use mixed case letters.
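The error handling and the two-characters-per-frame limit described above can be illustrated with a short sketch. It relies on two facts not stated in this article: each Line 21 byte carries 7 data bits plus an odd-parity bit, and NTSC video runs at about 29.97 frames per second. The function names are hypothetical, basic characters are treated as plain ASCII, and control codes are ignored, so this is a simplified illustration rather than a conforming decoder.

```python
NTSC_FRAME_RATE = 30000 / 1001   # about 29.97 frames per second
BYTES_PER_FRAME = 2              # two caption bytes per frame in one field


def add_odd_parity(char7: int) -> int:
    """Return an 8-bit byte: 7 data bits plus an odd-parity bit in bit 7."""
    if not 0 <= char7 <= 0x7F:
        raise ValueError("expected a 7-bit value")
    ones = bin(char7).count("1")
    # Set the top bit only when the data bits hold an even number of ones,
    # so the complete byte always contains an odd number of ones.
    return char7 | 0x80 if ones % 2 == 0 else char7


def has_odd_parity(byte: int) -> bool:
    """True when the 8-bit value contains an odd number of one bits."""
    return bin(byte & 0xFF).count("1") % 2 == 1


def decode_bytes(data: bytes, replacement: str = " ") -> str:
    """Decode caption bytes, substituting `replacement` on parity failure."""
    return "".join(
        chr(b & 0x7F) if has_odd_parity(b) else replacement for b in data
    )


def seconds_to_send(char_count: int) -> float:
    """Time needed to transmit `char_count` characters in one field."""
    frames = -(-char_count // BYTES_PER_FRAME)  # ceiling division
    return frames / NTSC_FRAME_RATE
```

Under these assumptions a 60-character caption needs 30 frames, or roughly one second of airtime, which is one way to see why pop-on captions have to be sent ahead of the moment they are displayed.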
There are two main styles of line 21 closed captioning:
- Roll-up or scroll-up or paint-on or scrolling: Real-time words sent in paint-on or scrolling mode appear from left to right, up to one line at a time; when a line is filled in roll-up mode, the whole line scrolls up to make way for a new line, and the line on top is erased. The lines usually appear at the bottom of the screen, but can actually be placed on any of the 14 screen rows to avoid covering graphics or action. This method is used when captioning video in real-time such as for live events, where a sequential word-by-word captioning process is needed or a pre-made intermediary file isn't available. This method is signaled on EIA-608 by a two byte caption command or in Teletext by replacing rows for a roll-up effect and duplicating rows for a paint-on effect. This allows for real-time caption line editing.
- Pop-on or pop-up or block: A caption appears on any of the 14 screen rows as complete sentences, which can be followed by additional captions. This method is used when captions come from an intermediary file (such as the Scenarist or EBU STL file formats) for pre-taped television and film programming, commonly produced at captioning facilities, and can be aided by digital scripts or voice recognition software. If used for live events, this method would require a video delay to avoid a large delay in the captions' appearance on screen, which occurs with Teletext encoded live subtitles.
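The roll-up behaviour described above can be modelled in a few lines. The sketch below is an illustration only: the class name `RollUpWindow` is hypothetical, and the defaults of 3 visible rows and 32 characters per row are assumptions about a typical Line 21 configuration, not values taken from this article.

```python
from collections import deque


class RollUpWindow:
    """Toy model of a roll-up caption window fed one word at a time."""

    def __init__(self, rows: int = 3, width: int = 32):
        self.width = width
        # A bounded deque discards the oldest (top) line automatically
        # when a new bottom line is appended to a full window.
        self.lines = deque([""], maxlen=rows)

    def write(self, word: str) -> None:
        current = self.lines[-1]
        candidate = word if not current else current + " " + word
        if len(candidate) <= self.width:
            # The word still fits on the bottom line.
            self.lines[-1] = candidate
        else:
            # Bottom line is full: scroll up and start a new line.
            self.lines.append(word)

    def visible(self) -> list:
        """Return the rows currently on screen, top to bottom."""
        return list(self.lines)
```

Feeding the words of a sentence into a narrow window, for example `RollUpWindow(rows=2, width=10)`, shows earlier lines disappearing from the top as new text arrives. A pop-on caption, by contrast, would be assembled off screen and shown in one step.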
TVNZ Access Services and Red Bee Media for BBC and Australia example:
I got the machine ready.
ENGINE STARTING (speeding away)
UK IMS for ITV and Sky example:
(man) I got the machine ready. (engine starting)
US WGBH Access Services example:
MAN: I got the machine ready. (engine starting)
US National Captioning Institute example:
I GOT THE MACHINE READY.
US other provider example:
I GOT THE MACHINE READY. [engine starting]
US in-house real time roll-up example:
>> Man: I GOT THE MACHINE READY. [engine starting]
non-US in-house real time roll-up example:
I got the machine ready. ENGINE STARTING
For real time captioning done outside of captioning facilities, the following syntax is used.
- '>>' (two prefixed greater-than signs) indicates a change in single speaker
- (sometimes appended with the speaker's name in alternate case followed by a colon)
- '>>>' (three prefixed greater-than signs) indicates a change in news story or multiple speakers
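A minimal sketch of how software might interpret these markers follows. The `Segment` type, the `parse_line` function and the rule used to spot a speaker name (a short run of letters followed by a colon) are all hypothetical choices made for illustration; real caption text is less regular, and this heuristic will misread some lines that merely contain a colon.

```python
import re
from typing import NamedTuple, Optional


class Segment(NamedTuple):
    kind: str               # "story", "speaker" or "continuation"
    speaker: Optional[str]  # name after the marker, if one was given
    text: str


# Two or three leading '>' signs, an optional "Name:" prefix, then the text.
_MARKER = re.compile(r"^(>{2,3})\s*(?:([A-Za-z][A-Za-z .'-]{0,30}?):\s*)?(.*)$")


def parse_line(line: str) -> Segment:
    """Classify one line of real-time caption text."""
    stripped = line.strip()
    match = _MARKER.match(stripped)
    if match is None:
        # No marker: the previous speaker is still talking.
        return Segment("continuation", None, stripped)
    marker, name, text = match.groups()
    kind = "story" if marker == ">>>" else "speaker"
    return Segment(kind, name, text)
```

For instance, `parse_line(">> Man: I GOT THE MACHINE READY.")` yields a `speaker` segment whose speaker is `Man`, while a line with no marker is treated as a continuation of the previous speaker.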
Styles of syntax that are used by various captioning producers:
- Capitals indicate main on-screen dialogue and the name of the speaker
- (legacy EIA-608 home caption decoder fonts had no descenders for the lowercase Latin alphabet)
- (outside North America capitals with background coloration indicate a song title or sound effect description)
- (outside North America capitals with black or no background coloration indicates when a word is stressed or emphasized)
- Descenders indicate background sound description and off-screen dialogue
- (most modern caption producers such as WGBH-TV now use mixed case for both on-screen and off-screen dialogue)
- '-' (a prefixed dash) indicates a change in single speaker (used by CaptionMax)
- Words in italics indicate when a word is stressed or emphasized and when real world names are quoted
- Text coloration indicates captioning credits and sponsorship
- (occasionally used for a karaoke effect for music videos on MTV or VH-1)
- (in Ceefax/Teletext countries indicates a change in single speaker in place of '>>')
- (some Teletext countries use coloration to indicate when a word is stressed or emphasized)
- (coloration is limited to white, green, blue, cyan, red, yellow and magenta)
- (UK order of use for text is white, green, cyan, yellow and backgrounds is black, red, blue, magenta, white )
- (US order of use for text is white, yellow, cyan, green and backgrounds is black, blue, red, magenta, white )
- Square brackets or parentheses indicate a song title or sound effect description
- Parentheses indicate speaker's vocal pitch e.g., (man), (woman), (boy) or (girl)
- (outside North America parentheses indicate a silent on-screen action)
- A pair of eighth notes are used to bracket a line of lyrics to indicate singing
- (a pair of eighth notes on a line of no text are used during a section of instrumental music)
- (outside North America a single number sign is used on a line of lyrics to indicate singing)
- (an additional musical notation character is appended to the end of the last line of lyrics to indicate the song's end)
- (the eighth note is unsupported by Ceefax/Teletext, so a number sign, which resembles a musical sharp, is substituted)
There were many shortcomings in the original Line 21 specification from a typographic standpoint, since, for example, it lacked many of the characters required for captioning in languages other than English. Since that time, the core Line 21 character set has been expanded to include quite a few more characters, handling most requirements for languages common in North and South America such as French, Spanish, and Portuguese, though those extended characters are not required in all decoders and are thus unreliable in everyday use. The problem has been almost eliminated with a market-specific full set of Western European characters and a privately adopted Norpak extension for South Korean and Japanese markets. The full EIA-708 standard for digital television has worldwide character set support, but there has been little use of it due to EBU Teletext dominating DVB countries, which has its own extended character sets.
Captions are often edited to make them easier to read and to reduce the amount of text displayed onscreen. This editing can range from very minor, with only a few occasional unimportant lines omitted, to severe, where virtually every line spoken by the actors is condensed. The measure used to guide this editing is words per minute, commonly varying from 180 to 300, depending on the type of program. Offensive words are also captioned, but if the program is censored for TV broadcast, the broadcaster might not have arranged for the captioning to be edited or censored as well. The "TV Guardian", a television set-top box, is available to parents who wish to censor offensive language in programs: the video signal is fed into the box, and if it detects an offensive word in the captioning, the audio signal is bleeped or muted for that period of time.
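The words-per-minute measure can be computed directly from a caption's text and its display times. The following sketch uses hypothetical function names and counts whitespace-separated tokens as words; captioning facilities may count differently, and the 180 words-per-minute default is simply the lower end of the range quoted above.

```python
def words_per_minute(text: str, start_s: float, end_s: float) -> float:
    """Reading rate implied by showing `text` from start_s to end_s seconds."""
    duration = end_s - start_s
    if duration <= 0:
        raise ValueError("caption must be displayed for a positive duration")
    return len(text.split()) * 60.0 / duration


def needs_editing(text: str, start_s: float, end_s: float,
                  limit_wpm: float = 180.0) -> bool:
    """True when the caption exceeds the target reading rate."""
    return words_per_minute(text, start_s, end_s) > limit_wpm
```

A five-word caption displayed for two seconds works out to 150 words per minute and passes the default limit; the same caption squeezed into one second reaches 300 and would be a candidate for condensing.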
The Line 21 data stream can consist of data from several data channels multiplexed together. Odd field 1 can have four data channels: two separate synchronized captions (CC1, CC2) with caption related text, such as web site URLs (T1, T2). Even field 2 can have five additional data channels: two separate synchronized captions (CC3, CC4) with caption related text (T3, T4), and Extended Data Services (XDS) for Now/Next EPG details. XDS data structure is defined in CEA-608.
As CC1 and CC2 share bandwidth, if there is a lot of data in CC1, there will be little room for CC2 data, so the field is generally used only for the primary audio captions. Similarly, CC3 and CC4 share the second (even) field of line 21. Since some early caption decoders supported only single field decoding of CC1 and CC2, captions for SAP in a second language were often placed in CC2. This led to bandwidth problems, however, and the current U.S. Federal Communications Commission (FCC) recommendation is that bilingual programming should have the second caption language in CC3. Many Spanish television networks, such as Univision and Telemundo, provide English subtitles for many of their Spanish programs in CC3. Canadian broadcasters use CC3 for French translated SAPs, a practice also followed in South Korea and Japan.
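The channel layout above can be written down as a small lookup table, which makes the bandwidth-sharing rule easy to check. The names `LINE21_SERVICES`, `field_for` and `shares_bandwidth` are hypothetical and exist only for this illustration.

```python
# Services carried in each field of line 21, as listed in the text above.
LINE21_SERVICES = {
    1: ("CC1", "CC2", "T1", "T2"),
    2: ("CC3", "CC4", "T3", "T4", "XDS"),
}


def field_for(service: str) -> int:
    """Return the field (1 or 2) that carries the named service."""
    for field, services in LINE21_SERVICES.items():
        if service in services:
            return field
    raise KeyError(f"unknown Line 21 service: {service}")


def shares_bandwidth(a: str, b: str) -> bool:
    """Two services compete for bandwidth when they ride in the same field."""
    return field_for(a) == field_for(b)
```

With this table, `shares_bandwidth("CC1", "CC2")` is true while `shares_bandwidth("CC1", "CC3")` is false, which matches the recommendation to place a second caption language in CC3 rather than CC2.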
Ceefax and Teletext can have a larger number of captions for other languages due to the use of multiple VBI lines. However, only European countries used a second subtitle page for second language audio tracks where either the NICAM dual mono or Zweikanalton were used.
HDTV interoperability issues
The US ATSC digital television system originally specified two different kinds of closed captioning datastream standards: the original analog-compatible format (available by Line 21) and the more modern digital-only CEA-708 format, both delivered within the video stream. The US FCC mandates that broadcasters deliver (and generate, if necessary) both datastream formats, with the CEA-708 format merely a conversion of the Line 21 format. The Canadian CRTC has not mandated that broadcasters either broadcast both datastream formats or broadcast exclusively in one format. To avoid large conversion costs, most broadcasters and networks just provide EIA-608 captions along with a transcoded CEA-708 version encapsulated within CEA-708 packets.
Incompatibility issues with digital TV
Many viewers find that when they acquire a digital television or set-top box they are unable to view closed caption (CC) information, even though the broadcaster is sending it and the TV is able to display it.
Originally, CC information was included in the picture ("line 21") via a composite video input, but there is no equivalent capability in the HDTV 720p/1080i interconnects (such as DVI, HDMI or component video) between the display and a "source". A "source", in this case, can be a DVD player or a terrestrial or cable digital television receiver. When CC information is encoded in the MPEG-2 data stream, only the device that decodes the MPEG-2 data (a source) has access to the closed caption information; there is no standard for transmitting the CC information to a display monitor separately. Thus, if there is CC information, the source device needs to overlay the CC information on the picture prior to transmitting to the display over the interconnect's video output.
Many source devices do not have the ability to overlay CC information, because controlling the CC overlay can be complicated. For example, the Motorola DCT-5xxx and -6xxx cable set-top receivers have the ability to decode CC information located on the MPEG-2 stream and overlay it on the picture, but turning CC on and off requires turning off the unit and going into a special setup menu (it is not on the standard configuration menu and it cannot be controlled using the remote). Historically, DVD players, VCRs and set-top tuners did not need to do this overlaying since they simply passed this information on to the TV, and they are not mandated to perform this overlaying.
Many modern digital television receivers can be directly connected to cables, but often cannot receive scrambled channels that the user is paying for. Thus, the lack of a standard way of sending CC information between components, along with the lack of a mandate to add this information to a picture, results in CC being unavailable to many hard-of-hearing and deaf users.
The EBU Ceefax based teletext systems are the source for closed captioning signals, thus when teletext is embedded into DVB-T or DVB-S the closed captioning signal is included. However, for DVB-T and DVB-S, it is not necessary for a teletext page signal to also be present (ITV1, for example, does not carry analogue teletext signals on Sky Digital, but does carry the embedded version, accessible from the "Services" menu of the receiver, or more recently by turning them off/on from a mini menu accessible from the "help" button).
In New Zealand, captions use an EBU Ceefax-based teletext system on DVB broadcasts via satellite and cable television, with the exception of MediaWorks New Zealand channels, which completely switched to DVB RLE subtitles in 2012 on both Freeview satellite and UHF broadcasts. This decision was based on the TVNZ practice of using this format only on DVB UHF broadcasts (aka Freeview HD). It made TVs connected by composite video incapable of decoding the captions on their own. These pre-rendered subtitles also use classic caption-style opaque backgrounds with an overly large font size, and obscure the picture more than the more modern partially transparent backgrounds.
The CEA-708 specification provides for dramatically improved captioning:
- An enhanced character set with more accented letters and non-Latin letters, and more special symbols
- Viewer-adjustable text size (called the "caption volume control" in the specification), allowing individuals to adjust their TVs to display small, normal, or large captions
- More text and background colors, including both transparent and translucent backgrounds to optionally replace the big black block
- More text styles, including edged or drop shadowed text rather than the letters on a solid background
- More text fonts, including monospaced and proportional spaced, serif and sans-serif, and some playful cursive fonts
- Higher bandwidth, to allow more data per minute of video
- More language channels, to allow the encoding of more independent caption streams
As of 2009, however, most closed captioning for DTV environments is done using tools designed for analog captioning (working to the CEA-608 NTSC spec rather than the CEA-708 DTV spec). The captions are then run through transcoders made by companies like EEG Enterprises or Evertz, which convert the analog Line 21 caption format to the digital format. This means that none of the CEA-708 features are used unless they were also contained in CEA-608.
DVDs, BDs, & HD DVDs
NTSC DVDs may carry closed captions in data packets of the MPEG-2 video streams inside the Video-TS folder. Once played out of the analog outputs of a set top DVD player, the caption data is converted to the Line 21 format. It is output by the player to the composite video (or an available RF connector) for a connected TV's built-in decoder or a set-top decoder as usual. It cannot be output on S-Video or component video outputs due to the lack of a colorburst signal on line 21. (Regardless of this, if the DVD player is in interlaced rather than progressive mode, closed captioning will be displayed on the TV over component video input if the TV captioning is turned on and set to CC1.) When viewed on a personal computer, caption data can be viewed by software that can read and decode the caption data packets in the MPEG-2 streams of the DVD-Video disc. Windows Media Player in Windows Vista (before Windows 7) supported only closed caption channels 1 and 2 (not 3 or 4). Apple's DVD Player cannot read and decode Line 21 caption data recorded on a DVD made from an over-the-air broadcast, although it can display some movie DVD captions.
In addition to Line 21 closed captions, video DVDs may also carry subtitles, which are generally rendered from the EIA-608 captions as a bitmap overlay that can be turned on and off via a set top DVD player or DVD player software, just like the textual captions. This type of captioning is usually carried in a subtitle track labeled either "English for the hearing impaired" or, more recently, "SDH" (Subtitled for the Deaf and Hard of hearing). Many popular Hollywood DVD-Videos can carry both subtitles and closed captions (see Stepmom DVD by Columbia Pictures). On some DVDs, the Line 21 captions may contain the same text as the subtitles; on others, only the Line 21 captions include the additional non-speech information (even sometimes song lyrics) needed for deaf and hard of hearing viewers. European Region 2 DVDs do not carry Line 21 captions, and instead list the subtitle languages available. English is often listed twice: once as the representation of the dialogue alone, and once as a second subtitle set which carries additional information for the deaf and hard of hearing audience. (Many deaf/HOH subtitle files on DVDs are reworkings of original teletext subtitle files.)
HD DVD and Blu-ray disc media cannot carry any VBI data such as Line 21 closed captioning due to the design of the DVI-based High-Definition Multimedia Interface (HDMI) specifications, which were extended only for synchronized digital audio, replacing older analog standards such as VGA, S-Video, component video and SCART. Both Blu-ray disc and HD DVD can use either PNG bitmap subtitles or 'advanced subtitles' to carry SDH type subtitling, the latter being an XML-based textual format which includes font, styling and positioning information as well as a Unicode representation of the text. Advanced subtitling can also include additional media accessibility features such as "descriptive audio".
There are several competing technologies used to provide captioning for movies in theaters. Cinema captioning falls into the categories of 'open' and 'closed.' The definition of "closed" captioning in this context is different from television, as it refers to any technology that allows as few as one member of the audience to view the captions.
Open captioning in a film theater can be accomplished through burned-in captions, projected text or bitmaps, or (rarely) a display located above or below the movie screen. Typically, this display is a large LED sign. In a digital theater, open caption display capability is built into the digital projector. Closed caption capability is also available, with third-party closed caption devices able to plug into the digital cinema server.
Probably the best-known closed captioning option for film theaters is the Rear Window Captioning System from the National Center for Accessible Media. Upon entering the theater, viewers requiring captions are given a panel of flat translucent glass or plastic on a gooseneck stalk, which can be mounted in front of the viewer's seat. In the back of the theater is an LED display that shows the captions in mirror image. The panel reflects captions for the viewer, but is nearly invisible to surrounding patrons. The panel can be positioned so that the viewer watches the movie through the panel and captions appear either on or near the movie image. A company called Cinematic Captioning Systems has a similar reflective system called Bounce Back. A major problem for distributors has been that these systems are each proprietary, and require separate distributions to the theater to enable them to work. Proprietary systems also incur license fees.
For film projection systems, Digital Theater Systems, the company behind the DTS surround sound standard, has created a digital captioning device called the DTS-CSS or Cinema Subtitling System. It is a combination of a laser projector which places the captioning (words, sounds) anywhere on the screen and a thin playback device with a CD that holds many languages. If the Rear Window Captioning System is used, the DTS-CSS player is also required for sending caption text to the Rear Window sign located in the rear of the theater.
Special effort has been made to build accessibility features into digital projection systems (see digital cinema). Through SMPTE, standards now exist that dictate how open and closed captions, as well as hearing-impaired and visually impaired narrative audio, are packaged with the rest of the digital movie. This eliminates the proprietary caption distributions required for film, and the associated royalties. SMPTE has also standardized the communication of closed caption content between the digital cinema server and third-party closed caption systems (the CSP/RPL protocol). As a result, new, competitive closed caption systems for digital cinema are now emerging that will work with any standards-compliant digital cinema server. These newer closed caption devices include cup-holder-mounted electronic displays and wireless glasses which display caption text in front of the wearer's eyes. Bridge devices are also available to enable the use of Rear Window systems. As of mid-2010, the remaining challenge to the wide introduction of accessibility in digital cinema is the industry-wide transition to SMPTE DCP, the standardized packaging method for very high quality, secure distribution of digital movies.
Captioning systems have also been adopted by some stadiums, typically through dedicated portions of their main scoreboards. These screens display captions of the public address announcer and other spoken content, such as that contained within in-game segments, public service announcements, and lyrics of songs played in-stadium. In some facilities, these systems were added as a result of discrimination lawsuits. Following a lawsuit under the Americans with Disabilities Act, FedEx Field added caption screens in 2006. University of Phoenix Stadium added dedicated caption displays in 2013 after a similar lawsuit, in which special "deaf seating" areas with screen-mounted captioning, and later the use of smartphones, were declared insufficient due to the small size of the text.
Some stadiums use on-site captioners, while others outsource the work to external providers who caption remotely. A prominent provider of in-arena captioning systems is Good Sport Captioning, founded by Patti White of St. Louis. White had worked as a stenographer at a courthouse near where Busch Stadium was being constructed, and reached a deal with the team to provide in-stadium captioning upon the stadium's 2006 opening, working from her home. White later formed Good Sport Captioning to provide remote captioning for other teams and venues.
Closed captioning of video games is becoming more common. One of the first video game companies to feature closed captioning was Bethesda Softworks in their 1990 release of Hockey League Simulator and The Terminator 2029. Infocom also offered Zork Grand Inquisitor in 1997. Many games since then have at least offered subtitles for spoken dialog during cut scenes, and many include significant in-game dialog and sound effects in the captions as well; for example, with subtitles turned on in the Metal Gear Solid series of stealth games, not only are subtitles available during cut scenes, but any dialog spoken during real-time gameplay will be captioned as well, allowing players who can't hear the dialog to know what enemy guards are saying and when the main character has been detected. Also, in many of developer Valve's video games (such as Half-Life 2 or Left 4 Dead), when closed captions are activated, dialog and nearly all sound effects either made by the player or from other sources (e.g. gunfire, explosions) will be captioned.
Video games do not offer Line 21 captioning, which is decoded and displayed by the television itself, but rather a built-in subtitle display, more akin to that of a DVD. The game systems themselves have no role in the captioning either: each game must have its subtitle display programmed individually.
Reid Kimball, a game designer who is hearing impaired, is attempting to educate game developers about closed captioning for games. Kimball started the Games[CC] group to closed-caption games and to serve as a research and development team to aid the industry. He designed the Dynamic Closed Captioning system, writes articles, and speaks at developer conferences. Games[CC]'s first closed captioning project, Doom3[CC], was nominated for an award as Best Doom3 Mod of the Year for IGDA's Choice Awards 2006 show.
Online video streaming
Internet video streaming service YouTube offers captioning services in videos. The author of the video can upload a SubViewer (*.SUB), SubRip (*.SRT) or *.SBV file. As a beta feature, the site also added the ability to automatically transcribe and generate captioning on videos, with varying degrees of success based upon the content of the video. However, the automatic captioning is often inaccurate on videos with background music and exaggerated emotion in speaking. On June 30, 2010, YouTube announced a new "YouTube Ready" designation for professional caption vendors in the United States. The initial list included twelve companies who passed a caption quality evaluation administered by the Described and Captioned Media Project, have a website and a YouTube channel where customers can learn more about their services, and have agreed to post rates for the range of services that they offer for YouTube content.
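As an illustration of what an uploaded SubRip (*.SRT) file contains, the following is a minimal sketch of a parser, assuming the commonly used layout of a cue number, a line of the form `start --> end`, and one or more text lines, with cues separated by blank lines. The names `Cue` and `parse_srt` and the sample text are hypothetical and are not part of any YouTube interface.

```python
from __future__ import annotations

import re
from dataclasses import dataclass

# SubRip timestamps look like 00:01:02,500 (hours:minutes:seconds,milliseconds).
_TIME = re.compile(r"(\d+):(\d{2}):(\d{2})[,.](\d{3})")


@dataclass
class Cue:
    index: int
    start_ms: int
    end_ms: int
    text: str


def _to_ms(stamp: str) -> int:
    """Convert one SubRip timestamp to milliseconds."""
    match = _TIME.fullmatch(stamp.strip())
    if match is None:
        raise ValueError(f"bad SubRip timestamp: {stamp!r}")
    hours, minutes, seconds, millis = (int(g) for g in match.groups())
    return ((hours * 60 + minutes) * 60 + seconds) * 1000 + millis


def parse_srt(data: str) -> list[Cue]:
    """Parse SubRip text into cues; cues are separated by blank lines."""
    cues = []
    normalized = data.replace("\r\n", "\n").strip()
    for block in re.split(r"\n\s*\n", normalized):
        lines = block.split("\n")
        if len(lines) < 2:
            continue  # skip empty or malformed blocks
        start, _, end = lines[1].partition("-->")
        text = "\n".join(lines[2:])
        cues.append(Cue(int(lines[0]), _to_ms(start), _to_ms(end), text))
    return cues


# Hypothetical sample: one non-speech caption and one line of dialogue.
SAMPLE = """1
00:00:01,000 --> 00:00:03,500
[door slams]

2
00:00:04,000 --> 00:00:06,000
Who's there?
"""

cues = parse_srt(SAMPLE)
```

Storing times as integer milliseconds keeps later arithmetic (shifting or merging cues) free of floating-point rounding.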
Flash video also supports captions via the Distribution Format Exchange Profile (DFXP) of the W3C Timed Text format. The latest Flash authoring software adds free player skins and caption components that enable viewers to turn captions on and off during playback from a webpage. Previous versions of Flash relied on the third-party Captionate component and skin to caption Flash video. Custom Flash players designed in Flex can be tailored to support the Timed Text exchange profile, Captionate .XML, or SAMI files (see Hulu captioning). This is the preferred method for most US broadcast and cable networks that are mandated by the U.S. Federal Communications Commission to provide captioned on-demand content. Media encoding firms generally use software such as MacCaption to convert EIA-608 captions to this format.
Windows Media Video can support closed captions in both video-on-demand and live streaming scenarios. Typically, Windows Media captions use the SAMI file format, but the video can also carry embedded closed caption data.
QuickTime video supports raw 608 caption data via a proprietary Closed Caption Track, which consists of EIA-608 byte pairs wrapped in a QuickTime packet container, with different IDs for the two line 21 fields. These captions can be turned on and off and appear in the same style as TV closed captions, with all the standard formatting (pop-on, roll-up, paint-on), and can be positioned and split anywhere on the video screen. QuickTime Closed Caption tracks can be viewed in the Mac or Windows versions of QuickTime Player, iTunes (via QuickTime), iPod Nano, iPod Classic, iPod Touch, iPhone, and iPad.
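To give a sense of what a Timed Text caption document looks like, here is a minimal sketch that builds one with Python's standard `xml.etree.ElementTree` module. It assumes the namespace of the final W3C TTML recommendation (`http://www.w3.org/ns/ttml`); older players written against draft versions of DFXP may expect a different namespace. The function names `format_clock` and `build_timed_text` are hypothetical.

```python
import xml.etree.ElementTree as ET

# Assumption: the final TTML namespace; draft-era DFXP players may differ.
TTML_NS = "http://www.w3.org/ns/ttml"
XML_NS = "http://www.w3.org/XML/1998/namespace"


def format_clock(ms: int) -> str:
    """Format milliseconds as a clock-time value, hh:mm:ss.mmm."""
    hours, rem = divmod(ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    seconds, millis = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d}.{millis:03d}"


def build_timed_text(cues, lang="en"):
    """Build a tt/body/div document with one <p> per (start, end, text) cue."""
    ET.register_namespace("", TTML_NS)  # serialize TTML as the default namespace
    tt = ET.Element(f"{{{TTML_NS}}}tt")
    tt.set(f"{{{XML_NS}}}lang", lang)
    body = ET.SubElement(tt, f"{{{TTML_NS}}}body")
    div = ET.SubElement(body, f"{{{TTML_NS}}}div")
    for start_ms, end_ms, text in cues:
        p = ET.SubElement(
            div,
            f"{{{TTML_NS}}}p",
            begin=format_clock(start_ms),
            end=format_clock(end_ms),
        )
        p.text = text
    return ET.tostring(tt, encoding="unicode")


# Hypothetical cues: (start in ms, end in ms, caption text).
document = build_timed_text([(1000, 3500, "[door slams]"), (4000, 6000, "Who's there?")])
```

Styling and positioning, which the full format also supports, are left out of this sketch.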
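The byte pairs mentioned above can be illustrated with a short sketch. In EIA-608, each byte carries seven data bits plus an odd-parity bit in the high position. The code below checks parity, strips it, and keeps only pairs of printable characters. It is a simplification under stated assumptions: it treats the basic character set as ASCII (the standard remaps a few code points to accented letters), it skips control codes rather than interpreting them, and it says nothing about the QuickTime container itself. All function names are hypothetical.

```python
def has_odd_parity(byte: int) -> bool:
    """True if the 8-bit value contains an odd number of 1 bits."""
    return bin(byte & 0xFF).count("1") % 2 == 1


def add_odd_parity(value: int) -> int:
    """Set the high bit where needed so the byte has odd parity."""
    value &= 0x7F
    return value if has_odd_parity(value) else value | 0x80


def decode_pairs(raw: bytes) -> str:
    """Extract printable characters from a run of 608 byte pairs."""
    out = []
    for i in range(0, len(raw) - 1, 2):
        pair = (raw[i], raw[i + 1])
        if not all(has_odd_parity(b) for b in pair):
            continue  # drop pairs that fail the parity check
        first, second = pair[0] & 0x7F, pair[1] & 0x7F
        if first < 0x20:
            continue  # null padding, control codes and other non-text pairs
        for value in (first, second):
            if 0x20 <= value <= 0x7E:
                out.append(chr(value))
    return "".join(out)


# A control pair (0x14 0x20), the characters "HI", then null padding.
raw = bytes(add_odd_parity(b) for b in (0x14, 0x20, 0x48, 0x49, 0x00, 0x00))
text = decode_pairs(raw)
```

A real decoder would also track the control codes to reproduce the pop-on, roll-up and paint-on modes and the on-screen positioning.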
A captioned telephone is a telephone that displays real-time captions of the current conversation. The captions are typically displayed on a screen embedded into the telephone base.
Media monitoring services
In the United States especially, most media monitoring services capture and index closed captioning text from news and public affairs programs, allowing them to search the text for client references. The use of closed captioning for television news monitoring was pioneered by Universal Press Clipping Bureau (Universal Information Services) in 1992, and later in 1993 by Tulsa-based NewsTrak of Oklahoma (later known as Broadcast News of Mid-America, acquired by video news release pioneer Medialink Worldwide Incorporated in 1997). US patent 7,009,657 describes a "method and system for the automatic collection and conditioning of closed caption text originating from multiple geographic locations" as used by news monitoring services.
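The capture-and-index idea can be sketched in a few lines. This is only an illustrative toy, not the method described in the patent or used by any named service: it builds an inverted index from caption text and returns the segments that contain every word of a client name. The segment labels and function names are hypothetical.

```python
import re
from collections import defaultdict

_WORD = re.compile(r"[a-z0-9']+")


def build_index(segments):
    """Map each lowercase word to the set of segment ids that contain it."""
    index = defaultdict(set)
    for seg_id, text in segments.items():
        for word in _WORD.findall(text.lower()):
            index[word].add(seg_id)
    return index


def find_mentions(index, phrase):
    """Return sorted ids of segments containing every word of the phrase."""
    words = _WORD.findall(phrase.lower())
    if not words:
        return []
    hits = set.intersection(*(index.get(w, set()) for w in words))
    return sorted(hits)


# Hypothetical caption segments keyed by station and time; captions are
# often transmitted in upper case, so matching is case-insensitive.
segments = {
    "KXYZ 18:00": "ACME MOTORS ANNOUNCED A RECALL TODAY",
    "KXYZ 18:05": "AND NOW THE WEATHER",
}
index = build_index(segments)
mentions = find_mentions(index, "Acme Motors")
```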
Software programs are now available that automatically generate closed captioning of conversations. Examples of such conversations include discussions in conference rooms, classroom lectures, and religious services.
In April 2010, Sony Creative Software released the Vegas Pro 9.0d update to its professional non-linear editor Vegas Pro, which implemented basic support for importing, editing, and delivering CEA-608 closed captions. Vegas Pro 10, released on October 11, 2010, added several enhancements to the closed captioning support. TV-like CEA-608 closed captioning can now be displayed as an overlay when played back in the Preview and Trimmer windows, making it easy to check placement, edits, and timing of caption information. CEA-708 style closed captioning is automatically created when the CEA-608 data is created. Line 21 closed captioning is now supported, as well as HD-SDI closed captioning capture and print from AJA and Blackmagic Design cards. Line 21 support provides a workflow for existing legacy media. Other improvements include increased support for multiple closed captioning file types, as well as the ability to export closed caption data for DVD Architect, YouTube, RealPlayer, QuickTime, and Windows Media Player.
In mid-2009, Apple released Final Cut Pro version 7 and began supporting the insertion of closed caption data into SD and HD tape masters via FireWire and compatible video capture cards. Until then, it was not possible for video editors to insert both CEA-608 and CEA-708 caption data into their tape masters. The typical workflow involved first printing the SD or HD video to a tape and sending it to a professional closed caption service company that had a stand-alone closed caption hardware encoder.
This new closed captioning workflow known as e-Captioning involves making a proxy video from the non-linear system to import into a third-party non-linear closed captioning software. Once the closed captioning software project is completed, it must export a closed caption file compatible with the non-linear editing system. In the case of Final Cut Pro 7, three different file formats can be accepted: a .SCC file (Scenarist Closed Caption file) for Standard Definition video, a QuickTime 608 Closed Caption track (a special 608 coded track in the .mov file wrapper) for Standard Definition video, and finally a QuickTime 708 Closed Caption track (a special 708 coded track in the .mov file wrapper) for High Definition video output.
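For illustration, a Scenarist Closed Caption (.SCC) file is plain text. The sketch below assumes the commonly seen layout: a header line `Scenarist_SCC V1.0`, followed by lines that each hold a timecode and a series of four-digit hexadecimal words, where each word is one caption byte pair with parity bits included. The function name `parse_scc` and the sample content are hypothetical, and the sketch does not reflect how Final Cut Pro itself reads these files.

```python
SCC_HEADER = "Scenarist_SCC V1.0"


def parse_scc(text):
    """Return a list of (timecode, [(byte1, byte2), ...]) entries."""
    lines = [line.strip() for line in text.splitlines()]
    lines = [line for line in lines if line]  # drop blank separator lines
    if not lines or lines[0] != SCC_HEADER:
        raise ValueError("missing Scenarist_SCC header")
    entries = []
    for line in lines[1:]:
        timecode, *words = line.split()
        pairs = []
        for word in words:
            if len(word) != 4:
                raise ValueError(f"bad caption word: {word!r}")
            # Each four-digit hex word is two bytes, parity bits included.
            pairs.append((int(word[:2], 16), int(word[2:], 16)))
        entries.append((timecode, pairs))
    return entries


# Hypothetical sample: a doubled control pair, two characters, null padding.
SAMPLE = "Scenarist_SCC V1.0\n\n00:00:01:00\t9420 9420 c849 8080\n"
entries = parse_scc(SAMPLE)
```

The timecode is kept as a string here because converting it to a time offset depends on the frame rate and on whether drop-frame timecode is in use.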
Alternatively, Matrox video systems devised another mechanism for inserting closed caption data by allowing the video editor to include CEA-608 and CEA-708 in a discrete audio channel on the video editing timeline. This allows real-time preview of the captions while editing and is compatible with Final Cut Pro 6 and 7.
Other non-linear editing systems support closed captioning only indirectly, in Standard Definition line 21. Video files on the editing timeline must be composited with a line 21 VBI graphic layer, known in the industry as a "blackmovie", that contains the closed caption data. Alternatively, video editors working with the DV25 and DV50 FireWire workflows must encode their DV .avi or .mov file with VAUX data that includes CEA-608 closed caption data.
The current and most familiar logo for closed captioning consists of two Cs (for "closed captioned") inside a television screen. It was created by WGBH. The other logo, trademarked by the National Captioning Institute, is that of a simple geometric rendering of a television set merged with the tail of a speech balloon; two such versions exist: one with a tail on the left, the other with a tail on the right.
- Same Language Subtitling
- Synchronized Accessible Media Interchange (SAMI) file format
- Sign language on television
- Subtitle (captioning)
- Synchronized Multimedia Integration Language (SMIL) file format
- Realtime Captioning... The VITAC Way by Amy Bowlen and Kathy DiLorenzo (no ISBN)
- Closed Captioning: Subtitling, Stenography, and the Digital Convergence of Text with Television by Gregory J. Downey (ISBN 978-0-8018-8710-9)
- The Closed Captioning Handbook by Gary D. Robson (ISBN 0-240-80561-5)
- Alternative Realtime Careers: A Guide to Closed Captioning and CART for Court Reporters by Gary D. Robson (ISBN 1-881859-51-7)
- A New Civil Right: Telecommunications Equality for Deaf and Hard of Hearing Americans by Karen Peltz Strauss (ISBN 978-1-56368-291-9)
- Enabling The Disabled by Michael Karagosian (no ISBN)
- Closed Captioning of Video Programming – 47 C.F.R. 79.1—From the Federal Communications Commission Consumer & Governmental Affairs Bureau
- FCC Consumer Facts on Closed Captioning
- Closed Captioning at DMOZ
- Closed Captioned TV: A Resource for ESL Literacy Education—From the Education Resources Information Center Clearinghouse for ESL Literacy Education, Washington D.C.
- Bill Kastner: The Man Behind Closed Captioning