Adobe Voco

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

Adobe Voco will be an audio editing and generating prototype software by Adobe that enables novel editing and generation of audio. Dubbed "Photoshop-for-voice",[1] it was first previewed at the Adobe MAX event in November 2016. The technology shown at Adobe MAX was a preview that could potentially be incorporated into Adobe Creative Cloud. It was later revealed that Voco was never meant to be released and was meant to be a research prototype.[2][3], mainly due to the concerns of voice manipulation and legal issues surrounding Adobe.

Technical details[edit]

As the demo showed, the software takes approximately 20 minutes of the desired target's speech and then generated sound-alike voice with even phonemes that were not present in the target example material. Adobe has stated Voco will lower the cost of audio production.[1] With the introduction of Adobe Voco and the similarly capable WaveNet, produced by DeepMind.[3]

Concerns[edit]

Ethical and security concerns have been raised over the ability to alter an audio recording to include words and phrases the original speaker never spoke, and the potential risk to voiceprint biometrics.[1]

There are also concerns that it may be used in conjunction with:

Alternatives[edit]

Adobe's lack of publicized progress has opened opportunities for other companies to build alternative products to VOCO, such as LyreBird.[6]

WaveNet is a similar but open-source research project at London-based artificial intelligence firm DeepMind, developed independently around the same time as Adobe Voco.

References[edit]

  1. ^ a b c "sapic". BBC.com. BBC. 2016-11-07. Retrieved 2016-07-05.
  2. ^ https://community.adobe.com/t5/audition/beta-testing-voco/td-p/8635263?page=1
  3. ^ a b "Is Adobe VoCo dead ?". Adobe Blog. 2018-01-27. Retrieved 2020-06-17.
  4. ^ Rodgers, Julian. "Adobe Voco - Should We Be Afraid?". Production Expert. Pro Tools. Retrieved 14 December 2018.
  5. ^ Thies, Justus (2016). "Face2Face: Real-time Face Capture and Reenactment of RGB Videos". Proc. Computer Vision and Pattern Recognition (CVPR), IEEE. Retrieved 2016-06-18.
  6. ^ "Lyrebird - Create a digital copy of voice". lyrebird.ai. Archived from the original on 2018-04-24. Retrieved 2018-03-27.