Automatic content recognition

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Bgwhite (talk | contribs) at 07:12, 17 May 2016 (WP:CHECKWIKI error fix #94. Stray ref tag. Do general fixes and cleanup if needed. -, typo(s) fixed: in it's → in its using AWB). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Automatic content recognition (ACR) is an identification technology to recognize content played on a media device or present in a media file. Devices containing ACR support enable users quickly obtain additional information about the content they have just experienced without any user based input or search efforts. For example, developers of the application can then provide personalized complementary content to viewers.[1]

How it works

To start the recognition, a short audio clip is recorded by the device and sent to the identification service. The identification service uses a reference database that stores fingerprints of the works to be identified. The database also contains information about the content and associated information, including complementary media. If the fingerprint of the recorded audio sample is matched, the identification service returns the corresponding metadata to the client.[2]

Fingerprint & Watermarking

Audio based ACR is commonly used in the market. The two leading methodology are acoustic fingerprinting and watermarking. There are alternative approaches that involve a focus on Video fingerprinting but augment the accuracy and scalability with other content recognition solutions running in parallel and in series.

Acoustic fingerprinting generates unique fingerprints from the content itself. Fingerprinting techniques work regardless of content format, codec, bitrate and compression techniques.[citation needed] This makes it possible to use across networks and channels. Therefore, it is widely used for interactive TV, second screen application and content monitoring sectors.[3][non-primary source needed] Popular apps like Shazam, YouTube, Facebook,[4] Thetake, Wechat and Weibo are using audio fingerprinting methodology to recognize the contented played from a TV and trigger additional features like votes, lottery, topic or purchase.

In contrast to fingerprinting, digital watermarking requires inserting digital tags containing information about the content into the content itself, prior to distribution. For example, a broadcast encoder might insert a watermark every few seconds that could be used to identify to broadcast channel, program id, and time stamp. The watermark is normally inaudible or invisible to the users. Terminal devices like phones or tablets read the watermarks instead of actually recognizing the played content.[5] Watermarking technology is utilized in media protection field to trace where the illegal copies originate.

It is expected by Next/Market Insights that 2.5 billion devices will be integrated with ACR technology to provide synchronized live and on-demand video watching experience.[6]

The disadvantages of audio fingerprinting and digital watermarking are both in cost, reporting inaccuracy or fading of the audio and tags. As well as the inherent intrusive nature of digital watermarking where each asset is tagged in the production stage. There are companies and technologies today that do not require any manual intensive process, nor do they have any issues with longevity, scalability or accuracy. The hybrid combination of ACR technologies can allow for metadata reporting on the scale of millions of unique streams and billions of hours of video via both cloud and client-side based solutions, allowing analytics to be drawn directly from whatever is on the screen at the time not detected or limited to what is watermarked. Both options that will fail in the new TV Everywhere approach the likes of Comcast and others are adopting.[7]

History

ACR technology was applied in TV content by Shazam in 2011 which captured the attention from TV industries. Shazam was previously a music recognition service which recognizes music from a sound recording. By utilizing its own fingerprint technology to identify live channels and videos, Shazam extended their business for TV. In 2012 DIRECTV partnered with Viggle which is a TV loyalty vendor to provide interactive viewing experience on the second screen. In 2015 ACR technology is spread widely to even more applications and smart TVs. Now, social applications and TV manufacturers like Facebook, Twitter, Google, Wechat, Weibo, LG, Samsung TV have already used ACR technology either developed by themselves or integrated from third party ACR providers.[citation needed] In 2016 there are more applications and mobile OS embedded with automatic content recognition services on the market like Peach, Omusic and Mi OS to enhance the music discover experiences.[8][9][10]

Application

Content identification

ACR technology helps audiences easily retrieve information about the content they watched. For smart TVs and applications with ACR technology embedded the audience can check the name of the song which is played or descriptions of the movie they watched. In addition to that, the identified video and music content can be linked to internet content providers for on-demand viewing, third parties for additional background information, or complementary media.

Content enhancement

Because devices can be "aware" of content being watched or listened to, second screen devices can feed users complementary content beyond what is presented on the primary viewing screen. ACR technology can not only identify the content, but also it can identify the precise location within the content. Thus, additional information can be presented to the user. ACR can enable a variety of interactive features such as polls, coupons, lottery or purchase of goods based on timestamp.[11]

Audience measurement

Real-time audience measurement metrics are now achievable by applying ACR technology into smart TVs, set top boxes and mobile devices such as smart phones and tables. This measurement data is highly essential to quantify audience consumption to set advertising pricing policies.

Broadcast monitoring

For advertisers and content owners, it is vital to know when and where their content has been played. Traditionally agencies or advertisers have to manually audit the presentation. At scale it only can be checked through a statistical sampling method. ACR technology enables automatic monitoring of the content played in TV. Information like the time of play, duration, frequency can be achieved without any manual effort.[12][non-primary source needed]

The alternative approaches are video based automated content recognition technologies. These are a suite of technologies that revolve around the convergence of video and TV Everywhere[13] which will render the audio and digital watermarking methods incapable of handling the millions of unique streams going out and billions of hours of footage to be reviewed with metadata extracted or enriched in relation to the content in real-time. Where Acoustic fingerprint fails in its sensitivity and reliance on very limited to no distortion of the content[14] and where Digital watermarking relies on intrusive frame by frame production stage imprinting on every piece of content[15] it cannot logically or cost effectively scale to the amount of video being generated - both of which fade in ability to accurately detect presence of reference data.[16] The utilization of Video fingerprinting and specifically hybrid approaches that utilize other ACR technologies such as Optical character recognition, Automated speech recognition, and more with highly scalable Big data architecture will allow for the massive amount of analytics to be harnessed by Hybrid Automated Content Recognition which in real-time delivers actionable analytics that will change how users interact with content forever.[17] An example of this approach can be found in companies using these hybrid approaches such as IDenTV and the "Evolution of Video Content Recognition" via real-time Video fingerprinting, OCR, ASR, facial recognition, logo learning and more. IDenTV

ACR technology providers

ACR service providers include ACRCloud, Civolution, Digimarc, Doreso, Gracenote, Rovi, Shazam, IDenTV

See also

References

  1. ^ "Automatic Content Recognition (ACR)". Gartner. Retrieved 24 June 2015.
  2. ^ "Automated content recognition creating content aware ecosystems" (PDF). Civolution. Civolution. Retrieved 24 June 2015.
  3. ^ Brink, Kyle. "A Primer on Automated Content". Viggle. Viggle. Retrieved 22 June 2015.
  4. ^ "Facebook Automatic Content Recognition". Starcom MediaVest Group. SMG. Retrieved 6 July 2015.
  5. ^ Brink, Kyle. "SVP of Product Development". A Primer on Automated Content Recognition. Viggle. Retrieved 22 June 2015.
  6. ^ "ACR Technology To Drive Social TV As It Reaches 2.5 Billion Devices by 2017". NEXT / MARKET INSIGHTS. Retrieved 24 June 2015.
  7. ^ "The Evolution of Video Content Recognition". Evolving Video Content Recognition. Retrieved 16 May 2016. The aforementioned legacy technologies are not capable of handling the Big Data that will follow so new methods of Automated Content Recognition Technologies must be put into place for the analytics to be actionable and in real-time.
  8. ^ "ACRCloud Powers Song Recognition For Hottest New Social Network, Peach". Music Industry News Network. Music Industry News Network. Retrieved 3 March 2016.
  9. ^ Victoria, Ho. "Xiaomi will help you name that song you can't stop humming". Mashable. Mashable. Retrieved 3 March 2016.
  10. ^ "ACRCloud Powers The Launch Of Taiwan's First Music/Humming Recognition Service For Omusic". Music Industry News Network. Retrieved 3 March 2016.
  11. ^ Wolf, Michael. "Three Ways Automatic Content Recognition Will Change TV". Forbes. Retrieved 20 June 2015.
  12. ^ "Automated Content Recognition creating content aware ecosystems" (PDF). csimagazine. Civolution. Retrieved 24 June 2015.
  13. ^ Ramachandran, Shalini; Vranica, Suzanne (2015-10-20). "Comcast Seeks to Harness Trove of TV Data". Wall Street Journal. ISSN 0099-9660. Retrieved 2016-05-16.
  14. ^ "Facebook Announces Its ContentID Attempt... Using Audible Magic | Techdirt". Techdirt. Retrieved 2016-05-16.
  15. ^ "The Disadvantages of a Watermark". smallbusiness.chron.com. Retrieved 2016-05-16.
  16. ^ "Facebook Announces Its ContentID Attempt... Using Audible Magic | Techdirt". Techdirt. Retrieved 2016-05-16.
  17. ^ Wolf, Michael. "Three Ways Automatic Content Recognition Will Change TV". Forbes. Retrieved 2016-05-16.

Evolution of Video Content Recognition