Automatic content recognition

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

Automatic content recognition (ACR) is an identification technology to recognize content played on a media device or present in a media file. Devices containing ACR support enable users to quickly obtain additional information about the content they have just experienced without any user-based input or search efforts. For example, developers of the application can then provide personalized complementary content to viewers.[1]

How it works[edit]

To start the recognition, a short media clip (audio, video, or both) is selected. This clip could be selected from within a media file or recorded by a device. Through algorithms such as fingerprinting, information from the actual perceptual content is taken and compared to a database of reference fingerprints, each reference fingerprint corresponding to a known recorded work.[2] The database may contain metadata about the work and associated information, including complementary media. If the fingerprint of the media clip is matched, the identification software returns the corresponding metadata to the client application.[3]

Fingerprints and watermarking[edit]

Audio based ACR is commonly used in the market. The two leading methodologies are acoustic fingerprinting and watermarking. Another common approach uses video fingerprinting.

Acoustic fingerprinting generates unique fingerprints from the content itself. Fingerprinting techniques work regardless of content format, codec, bitrate and compression techniques.[4] This makes it possible to use across networks and channels. Therefore, it is widely used for interactive TV, second screen application and content monitoring sectors.[5][6][dead link] Popular apps like Shazam, YouTube, Facebook,[7] Thetake, Wechat and Weibo are using audio fingerprinting methodology to recognize the content played from a TV and trigger additional features like votes, lotteries, topics or purchases.

In contrast to fingerprinting, digital watermarking requires inserting digital tags containing information about the content into the content itself prior to distribution. For example, a broadcast encoder might insert a watermark every few seconds that could be used to identify to broadcast channel, program id, and time stamp. The watermark is normally inaudible or invisible to the users. Terminal devices like phones or tablets read the watermarks instead of actually recognizing the played content.[8] Watermarking technology is utilized in media protection field to trace where illegal copies originate.[9]

It is expected by Next/Market Insights that 2.5 billion devices will be integrated with ACR technology to provide synchronized live and on-demand video watching experience.[10]

History[edit]

ACR technology was applied in TV content by Shazam in 2011 which captured the attention from TV industries. Shazam was previously a music recognition service which recognizes music from a sound recording. By utilizing its own fingerprint technology to identify live channels and videos, Shazam extended their business for TV. In 2012 DIRECTV partnered with Viggle which is a TV loyalty vendor to provide interactive viewing experience on the second screen. In 2013 LG partnered with Cognitive Networks (later purchased by Vizio and renamed Inscape), an ACR vendor, to provide ACR driven interaction.[11] In 2015 ACR technology is spread widely to even more applications and smart TVs. Now, social applications and TV manufacturers like Facebook, Twitter, Google, Wechat, Weibo, LG, Samsung, Vizio TV have already used ACR technology either developed by themselves or integrated from third party ACR providers.[citation needed] In 2016 there are more applications and mobile OS embedded with automatic content recognition services on the market like Peach, Omusic and Mi OS to enhance the music discover experiences.[12][13][14]

Applications[edit]

Content identification[edit]

ACR technology helps audiences easily retrieve information about the content they watched. For smart TVs and applications with ACR technology embedded the audience can check the name of the song which is played or descriptions of the movie they watched. In addition to that, the identified video and music content can be linked to internet content providers for on-demand viewing, third parties for additional background information, or complementary media.

Content enhancement[edit]

Because devices can be "aware" of content being watched or listened to, second screen devices can feed users complementary content beyond what is presented on the primary viewing screen. ACR technology can not only identify the content, but also it can identify the precise location within the content. Thus, additional information can be presented to the user. ACR can enable a variety of interactive features such as polls, coupons, lottery or purchase of goods based on timestamp.[15]

Audience measurement[edit]

Real-time audience measurement metrics are now achievable by applying ACR technology into smart TVs, set top boxes and mobile devices such as smart phones and tables. This measurement data is highly essential to quantify audience consumption to set advertising pricing policies.

Broadcast monitoring[edit]

For advertisers and content owners, it is vital to know when and where their content has been played. Traditionally agencies or advertisers have to manually audit the presentation. At scale it only can be checked through a statistical sampling method. ACR technology enables automatic monitoring of the content played in TV. Information like the time of play, duration, frequency can be achieved without any manual effort.[16][17] Many people have expressed some concern[weasel words] however on the information that these smart TVs are sending out to the companies collecting this data. However there is an option in almost every set to disable this feature.[18]

The alternative approaches are video based automated content recognition technologies. These are a suite of technologies that revolve around the convergence of video and TV Everywhere[19] which will render the audio and digital watermarking methods incapable of handling the millions of unique streams going out and billions of hours of footage to be reviewed with metadata extracted or enriched in relation to the content in real-time. Where acoustic fingerprint fails in its reliance on a database of reference fingerprints. Digital watermarking relies on intrusive frame by frame production stage imprinting on every piece of content.[20] The effectiveness of these techniques have been challenged based on their presumed inability to effectively scale to the amount of video being generated.[21] In practice for monetization and other user based ACR applications the reference database or presence of watermarks only has to cover those videos that are targets of monetization. For example, a video that is hosted on YouTube and viewed only once does not need to be present in a world wide ACR database or be impressed with a watermark.

ACR technology providers[edit]

ACR service providers include ACRCloud, Audible Magic, Axwave, Digimarc, Gracenote, Kantar Media, and Shazam.

See also[edit]

References[edit]

  1. ^ "Automatic Content Recognition (ACR)". Gartner. Retrieved 24 June 2015. 
  2. ^ "ACR(Automatic Content Recognition)". Retrieved 27 February 2017. 
  3. ^ "Automated content recognition creating content aware ecosystems" (PDF). Civolution. Civolution. Retrieved 24 June 2015. 
  4. ^ "Panako: a scalable acoustic fingerprinting system handling time-scale and pitch modification". Universiteit Gent. Retrieved 27 February 2017. 
  5. ^ Main, Sami. "Nielsen Is Bringing Real-Time Interactive Ads to Smart TVs to Keep Streaming Audiences Engaged". Adweek. Retrieved 2018-01-11. 
  6. ^ Brink, Kyle. "A Primer on Automated Content". Viggle. Viggle. Retrieved 22 June 2015. 
  7. ^ "Facebook Automatic Content Recognition". Starcom MediaVest Group. SMG. Retrieved 6 July 2015. 
  8. ^ Brink, Kyle. "SVP of Product Development". A Primer on Automated Content Recognition. Viggle. Retrieved 22 June 2015. 
  9. ^ Solana, Anna. "How these hidden video watermarks can help spot piracy, doctored images | ZDNet". ZDNet. Retrieved 2018-01-11. 
  10. ^ "ACR Technology To Drive Social TV As It Reaches 2.5 Billion Devices by 2017". NEXT / MARKET INSIGHTS. Retrieved 24 June 2015. 
  11. ^ "LG partners with Cognitive Networks to make Smart TVs smarter and more interactive". engadget. Retrieved 23 August 2016. 
  12. ^ "ACRCloud Powers Song Recognition For Hottest New Social Network, Peach". Music Industry News Network. Music Industry News Network. Retrieved 3 March 2016. 
  13. ^ Victoria, Ho. "Xiaomi will help you name that song you can't stop humming". Mashable. Mashable. Retrieved 3 March 2016. 
  14. ^ "ACRCloud Powers The Launch Of Taiwan's First Music/Humming Recognition Service For Omusic". Music Industry News Network. Retrieved 3 March 2016. 
  15. ^ Wolf, Michael. "Three Ways Automatic Content Recognition Will Change TV". Forbes. Retrieved 20 June 2015. 
  16. ^ "Automated Content Recognition creating content aware ecosystems" (PDF). csimagazine. Civolution. Retrieved 24 June 2015. 
  17. ^ "Roku Privacy Policy (Section I-B-4 and IV-E)". roku.com. Roku. Retrieved 30 October 2017. 
  18. ^ "Samsung, LG, and Vizio smart TVs are recording—and sharing data about—everything you watch Consumer Reports investigates the information brokers who want to turn your viewing habits into cash". Consumer Reports. Retrieved 27 February 2017. 
  19. ^ Ramachandran, Shalini; Vranica, Suzanne (2015-10-20). "Comcast Seeks to Harness Trove of TV Data". Wall Street Journal. ISSN 0099-9660. Retrieved 2016-05-16. 
  20. ^ "The Disadvantages of a Watermark". smallbusiness.chron.com. Retrieved 2016-05-16. 
  21. ^ "Facebook Announces Its ContentID Attempt... Using Audible Magic | Techdirt". Techdirt. Retrieved 2016-05-16.