User:20 STS grp1/sandbox
An intelligent virtual assistant (IVA) or intelligent personal assistant (IPA) is a software agent that can perform tasks or services for an individual based on commands or questions. Sometimes the term "chatbot" is used to refer to virtual assistants generally or specifically accessed by online chat. In some cases, online chat programs are exclusively for entertainment purposes. Some virtual assistants are able to interpret human speech and respond via synthesized voices. Users can ask their assistants questions, control home automation devices and media playback via voice, and manage other basic tasks such as email, to-do lists, and calendars with verbal (spoken?) commands.[1] A similar concept, however with differences, lays under the dialogue systems.[2]
As of 2017, the capabilities and usage of virtual assistants are expanding rapidly, with new products entering the market and a strong emphasis on both email and voice user interfaces. Apple and Google have large installed bases of users on smartphones. Microsoft has a large installed base of Windows-based personal computers, smartphones and smart speakers. Amazon has a large install base for smart speakers.[3] Conversica has over 100 million engagements via its email and sms interface Intelligent Virtual Assistants for business.
History
[edit]Radio Rex was the first voice activated toy released in 1911.[4]It was a dog made that would come out of its house when you called its name.
In 1952 Bell Labs presented “Audrey”, the Automatic Digit Recognition machine. It occupied a six- foot-high relay rack, consumed substantial power, had streams of cables and exhibited the myriad maintenance problems associated with complex vacuum-tube circuitry. It could recognize the fundamental units of speech, phonemes. It was limited to accurate recognition of digits spoken by designated talkers. It could therefore be used for voice dialing, but in most cases push-button dialing was cheaper and faster, rather than speaking the consecutive digits.[5]
Another early tool which was enabled to perform digital speech recognition was the IBM Shoebox voice-activated calculator, presented to the general public during the 1962 Seattle World's Fair after its initial market launch in 1961. This early computer, developed almost 20 years before the introduction of the first IBM Personal Computer in 1981, was able to recognize 16 spoken words and the digits 0 to 9.
The first natural language processing computer program or the chatbot ELIZA was developed by MIT professor Joseph Weizenbaum in the 1960s. Created to "demonstrate that the communication between man and machine was superficial"[6]. ELIZA used pattern matching and substitution methodology into scripted responses to simulate conversation, which gave an illusion of understanding on the part of the program.
Weizenbaum's own secretary reportedly asked Weizenbaum to leave the room so that she and ELIZA could have a real conversation. Weizenbaum was surprised by this, later writing: "I had not realized ... that extremely short exposures to a relatively simple computer program could induce powerful delusional thinking in quite normal people.[7]
This gave name to the ELIZA effect, the tendency to unconsciously assume computer behaviors are analogous to human behaviors; that is, anthropomorphisation a phenomen well present in Virtual Assistants.
The next milestone in the development of voice recognition technology was achieved in the 1970s at the Carnegie Mellon University in Pittsburgh, Pennsylvania with substantial support of the United States Department of Defense and its DARPA agency, funded five years of a Speech Understanding Research program, aiming to reach a minimum vocabulary of 1,000 words. Companies and academia including IBM, Carnegie Mellon University (CMU) and Stanford Research Institute took part in the program.
The result was "Harpy", it mastered about 1000 words, the vocabulary of a three-year-old and it could understand sentences. It could process speech that followed pre-programmed vocabulary, pronunciation, and grammar structures to determine which sequences of words made sense together, and thus reducing speech recognition errors.
In 1986 Tangora was an upgrade of the Shoebox, it was a voice recognizing typewriter. Named after the world’s fastest typist at the time, it had a vocabulary of 20,000 words and used prediction to decide the most likely result based on what was said in the past. IBM’s approach was based on a hidden Markov model, which adds statistics to digital signal processing techniques. The method makes it possible to predict the most likely phonemes to follow a given phoneme. Still each speaker had to individually train the typewriter to recognize his or her voice, and pause between each word.
In 1997 Dragon’s Naturally Speaking software could recognize and transcribe natural human speech without pauses between each word into a document at a rate of 100 words per minute. A version of Naturally Speaking is still available for download and it is still used today, for instance, by many doctors in the US and the UK to document their medical records.
The 1990s digital speech recognition technology became a feature of the personal computer with IBM, Philips and Lemout & Hauspie fighting for customers. Much later the market launch of the first smartphone IBM Simon in 1994 laid the foundation for smart virtual assistants as we know them today.
In 2001 Colloquis publicly launched SmarterChild, on platforms like AIM and MSN Messenger. While entirely text-based SmarterChild was able to play games, check the weather, look up facts, and converse with users to an extent.[8]
The first modern digital virtual assistant installed on a smartphone was Siri, which was introduced as a feature of the iPhone 4S on October 4, 2011.[9] Apple Inc. developed Siri following the 2010 acquisition of Siri Inc., a spin-off of SRI International, which is a research institute financed by DARPA and the United States Department of Defense.[10] Its aim was to aid in tasks such as sending a text message, making phone calls, checking the weather or setting up an alarm. Over time, it has developed to provide you recommendations to restaurants, search the internet and provide directions.
In November 2014, Amazon announced Alexa alongside the Echo.
In April 2017 Amazon released a service for building conversational interfaces for any type of virtual assistant or interface.
Method of interaction
[edit]Virtual assistants make work via:
- Text, including: online chat (especially in an instant messaging app or other app), SMS Text, e-mail or other text-based communication channel, for example Conversica's Intelligent Virtual Assistants for business.[11]
- Voice, for example with Amazon Alexa[12] on the Amazon Echo device, Siri on an iPhone, or Google Assistant on Google-enabled/Android mobile devices
- By taking and/or uploading images, as in the case of Samsung Bixby on the Samsung Galaxy S8
Some virtual assistants are accessible via multiple methods, such as Google Assistant via chat on the Google Allo and Google Messages app and via voice on Google Home smart speakers.
Virtual assistants use natural language processing (NLP) to match user text or voice input to executable commands. Many continually learn using artificial intelligence techniques including machine learning.
To activate a virtual assistant using the voice, a wake word might be used. This is a word or groups of words such as "Hey Siri", "OK Google" or "Hey Google", "Alexa", and "Hey Microsoft".[13]
Devices and objects where found
[edit]Virtual assistants may be integrated into many types of platforms or, like Amazon Alexa, across several of them:
- Into devices like smart speakers such as Amazon Echo, Google Home and Apple HomePod
- In instant messaging apps on both smartphones and via the Web, e.g. Facebook's M (virtual assistant) on both Facebook and Facebook Messenger apps or via the Web
- Built into a mobile operating system (OS), as are Apple's Siri on iOS devices and BlackBerry Assistant on BlackBerry 10 devices, or into a desktop OS such as Cortana on Microsoft Windows OS
- Built into a smartphone independent of the OS, as is Bixby on the Samsung Galaxy S8 and Note 8.[14]
- Within instant messaging platforms, assistants from specific organizations, such as Aeromexico's Aerobot on Facebook Messenger or Wechat Secretary on WeChat
- Within mobile apps from specific companies and other organizations, such as Dom from Domino's Pizza[15]
- In appliances,[16] cars,[17] and wearable technology.[18]
- Previous generations of virtual assistants often worked on websites, such as Alaska Airlines' Ask Jenn,[19] or on interactive voice response (IVR) systems such as American Airlines' IVR by Nuance.[20]
Services
[edit]Virtual assistants can provide a wide variety of services. These include:[21]
- Provide information such as weather, facts from e.g. Wikipedia or IMDb, set an alarm, make to-do lists and shopping lists
- Play music from streaming services such as Spotify and Pandora; play radio stations; read audiobooks
- Play videos, TV shows or movies on televisions, streaming from e.g. Netflix
- Conversational commerce (see below)
- Assist public interactions with government (see Artificial intelligence in government)
- Complement and/or replace customer service by humans.[22] One report estimated that an automated online assistant produced a 30% decrease in the work-load for a human-provided call centre.[23]
Conversational commerce
[edit]Conversational commerce is e-commerce via various means of messaging, including via voice assistants[24] but also live chat on e-commerce Web sites, live chat on messaging apps such as WeChat, Facebook Messenger and WhatsApp[25] and chatbots on messaging apps or Web sites.
Virtual Assistant can work with customer support team of a business to provide 24x7 support to customers. It provides quick responses, which enhances a customer's experience.
Third-party services
[edit]Amazon enables Alexa "Skills" and Google "Actions", essentially apps that run on the assistant platforms.
Virtual assistant privacy
[edit]Virtual assistants have a variety of privacy concerns associated with them. Features such as activation by voice pose a threat, as such features requires the device to always be listening.[26] Modes of privacy such as the virtual security button have been proposed to create a multilayer authentication for virtual assistants.[27]
Privacy policy of prominent Virtual Assistants
[edit]Google Assistant
[edit]Google Assistant does not store your data without your permission. To store the audio, you can go to Voice & Audio Activity (VAA) and turn on this feature. Your audio files are sent to cloud and used by Google to improve the performance of Google Assistant, but only if you have turned on the VAA feature.[28]
Amazon's Alexa
[edit]Amazon’s Virtual Assistant Alexa only listens to your conversation when you use its wake word (like Alexa, Amazon, Echo). It starts recording the conversation after the call of a wake word. It stops listening after 8 seconds of silence. It sends the recorded conversation to the cloud. You can delete your recording from the cloud by visiting ‘Alexa Privacy’ in ‘Alexa’. You can stop Alexa from listening to your conversations using ‘mute’ feature of Alexa, after muting the device, it cannot listen to you even if you use the wake words (like Alexa).[29]
Apple's Siri
[edit]Apple does not record your audios to improve Siri, it uses transcripts instead. It only sends data which is important for analysis, for instance, if you ask Siri to read your message it won’t send the message to the cloud, the machine will directly read the message without server’s interference. Users can opt out anytime if they don’t want Siri to send the transcripts on cloud.[30]
Presumed and observed interest for the consumer
[edit]It can be interesting to understand the presumed added value of Virtual Assistants as a new possible interaction between man and computers. And to compare it with its perceived interest by individuals.
Presumed added value as allowing a new way of interactions
[edit]Added value of the Virtual Assistants can come among others from the following :
- Voice communication can sometimes represent the optimal man-machine communication :
- It is convenient: there are some sectors where voice is the only way of possible communication, and more generally, it allows to free-up both hands and vision potentially for doing another activity in parallel, or helps also disabled people.
- It is faster: Voice is more efficient than writing on a keyboard: we can speak up to 200 words per minute opposed to 60 in case of writing on a keyboard. It is also more natural thus requiring less effort (reading a text however can reach 700 words per minute).[31]
- Virtual Assistants save a lot of time by automation: they can take appointments, or read the news while the consumer does something else. It is also possible to ask the Virtual Assistant to schedule meetings, hence helping to organize time. The designers of new digital schedulers explained the ambition they had that these calendars schedule lives to make the consumer use his time more efficiently, through machine learning processes, and complete organization of work time and free time. As an example when the consumer expresses the desire of scheduling a break, the VA will schedule it at an optimal moment for this purpose (for example at a time of the week where he is less productive), with the additional long term objective of being able to schedule and organize the free time of the consumer, to assure him optimal work efficiency.[32]
Perceived interest
[edit]- According to a recent study (2019), the two reasons for using Virtual Assistants for consumers are perceived usefulness and perceived enjoyment. The first result of this study is that both perceived usefulness and perceived enjoyment have an equivalent very strong influence for the consumer willingness to use a Virtual Assistant.
- The second result of this study is that :
- Provided content quality has a very strong influence on perceived usefulness and a strong influence on perceived enjoyment.
- Visual attractiveness has a very strong influence on perceived enjoyment.
- Automation has a strong influence on perceived usefulness.
This study helps to show on a more general scale the key factors of the integration of artificial intelligence services by individuals.[33]
Controversies
[edit]Artificial Intelligence controversies
[edit]- Virtual Assistants spur the filter bubble: As for social medias, Virtual Assistants’s algorithms are trained to show pertinent data and discard others based on previous activities of the consumer: The pertinent data is the one which will interest or please the consumer. As a result, he becomes isolated from data that disagree with his viewpoints, effectively isolating him into his own intellectual bubble, and reinforcing his opinions. This phenomena was known to reinforce fake news and echo chambers.[34]
- Virtual Assistants are also sometimes criticized for being overrated. In particular, A. Casilli points out that the AI of Virtual Assistants are neither intelligent nor artificial for two reasons :
- Not intelligent because all they do is being the assistant of the human, and only by doing tasks that a human could do easily, and in a very limited specter of actions: find, class, and present information, offers or documents. Also, Virtual Assistants are neither able to make decisions on their own nor to anticipate things.
- And not artificial because they would be impossible without human labelization through micro working. [35]
Ethic implications
[edit]Antonio A. Casilli, a french sociologist, criticized in 2019 the artificial intelligence and in particular the virtual assistants in the following way:
At a first level the fact that the consumer provides free data for the training and improvement of the virtual assistant, often without knowing it, is ethically disturbing.
But at a second level, it might be even more ethically disturbing to know how these AIs are trained with those data.
These artificial intelligences are trained through Neuronal Networks, which require a huge amount of labelled data. But the data need to be labelled through a human process. This explains the rise of microwork in the last decade. That is, using remotely some unqualified people worldwide doing some repetitive and very simple tasks for a few cents such as listening to Virtual Assistant heard data, and writing down what was said. Microwork has been criticized about for the job insecurity it causes, and for the total lack of regulation: average salary was 1,38 dollar/hours in 2010 [36], and it provides neither healthcare nor retirement benefits, sick pay, minimum wage. Hence, Virtual Assistants and their designers are controversial for spuring job insecurity, and the AIs they propose are still human in the way that they would be impossible without the microwork of millions of human workers.[37]
Developer platforms
[edit]Notable developer platforms for virtual assistants include:
- Amazon Lex was opened to developers in April 2017. It involves natural language understanding technology combined with automatic speech recognition and had been introduced in November 2016.[38]
- Google provides the Actions on Google and Dialogflow platforms for developers to create "Actions" for Google Assistant[39]
- Apple provides SiriKit for developers to create extensions for Siri
- IBM's Watson, while sometimes spoken of as a virtual assistant is in fact an entire artificial intelligence platform and community powering some virtual assistants, chatbots. and many other types of solutions.[40][41]
Previous generations
[edit]In previous generations of text chat-based virtual assistants, the assistant was often represented by an avatar (a.k.a. interactive online character or automated character) — this was known as an embodied agent.
Comparison of notable assistants
[edit]Intelligent personal assistant | Developer | Free software | Free and open-source hardware | HDMI out | External I/O | IOT | Chromecast integration | Smart phone app | Always on | Unit to unit voice channel | Skill language |
---|---|---|---|---|---|---|---|---|---|---|---|
Alexa (a.k.a. Echo) | Amazon.com | No | No | No | No class="table-no" |style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes | No class="table-no" |style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes | ? | ? | |||
Alice | Yandex | No class="table-no" |data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes | No class="table-no" |style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | — | ? | |||||||
AliGenie | Alibaba Group | No | No class="table-no" |data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes | No class="table-no" |style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | — | ? | ||||||
Assistant | Speaktoit | No class="table-no" |data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | — | No | No class="table-no" |style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes | No class="table-no" |data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | — | ? | |||||
Bixby | Samsung Electronics | No class="table-no" |data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | — | No | No class="table-no" |style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | — | ? | ||||||
BlackBerry Assistant | BlackBerry Limited | No class="table-no" |data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | — | No | No class="table-no" |style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes | No class="table-no" |data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | — | ? | |||||
Braina | Brainasoft | No class="table-no" |data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | — | No | No class="table-no" |style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes | No class="table-no" |data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | — | ? | |||||
Clova | Naver Corporation | No class="table-no" |data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes | No class="table-no" |style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | — | ? | |||||||
Cortana | Microsoft | No class="table-no" |data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes | No class="table-no" |style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | — | ? | |||||||
Duer | Baidu[42] | ||||||||||
Evi | Amazon.com True Knowledge | No class="table-no" |data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | — | No | No class="table-no" |style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes | No class="table-no" |data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | — | ? | |||||
Google Assistant | No class="table-no" |data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | — | ? | |||||||||
Google Now | No class="table-no" |data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | — | ? | |||||||||
M (discontinued)[43] | |||||||||||
Mycroft[44] | Yes| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes | Python | |||||||||
SILVIA | Cognitive Code | No class="table-no" |data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | — | No | No class="table-no" |style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes | No class="table-no" |data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | — | ? | |||||
Siri | Apple Inc. | No | No class="table-no" |data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes | No class="table-no" |style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | — | ? | ||||||
Viv | Samsung Electronics | No class="table-no" |data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | —| style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes | No class="table-no" |style="background:#9EFF9E;color:black;vertical-align:middle;text-align:center;" class="table-yes"|Yes | No class="table-no" |data-sort-value="" style="background: var(--background-color-interactive, #ececec); color: var(--color-base, #2C2C2C); vertical-align: middle; text-align: center; " class="table-na" | — | ? | ||||||
Xiaowei | Tencent | ? |
Economic relevance
[edit]For individuals
[edit]Digital experiences enabled by virtual assistants are considered to be among the major recent technological advances and most promising consumer trends. Experts claim that digital experiences will achieve a status-weight comparable to ‘real’ experiences, if not become more sought-after and prized.[45] The trend is verified by a high number of frequent users and the substantial growth of worldwide user numbers of virtual digital assistants. In mid-2017, the number of frequent users of digital virtual assistants is estimated to be around 1 bn worldwide.[46] In addition, it can be observed that virtual digital assistant technology is no longer restricted to smartphone applications, but present across many industry sectors (incl. automotive, telecommunications, retail, healthcare and education).[47] In response to the significant R&D expenses of firms across all sectors and an increasing implementation of mobile devices, the market for speech recognition technology is predicted to grow at a CAGR of 34.9% globally over the period of 2016 to 2024 and thereby surpass a global market size of US$7.5 billion by 2024.[47] According to an Ovum study, the "native digital assistant installed base" is projected to exceed the world's population by 2021, with 7.5 billion active voice AI–capable devices.[48] According to Ovum, by that time "Google Assistant will dominate the voice AI–capable device market with 23.3% market share, followed by Samsung's Bixby (14.5%), Apple's Siri (13.1%), Amazon's Alexa (3.9%), and Microsoft's Cortana (2.3%)."[48]
Taking into consideration the regional distribution of market leaders, North American companies (e.g. Nuance Communications, IBM, eGain) are expected to dominate the industry over the next years, due to the significant impact of BYOD (Bring Your Own Device) and enterprise mobility business models. Furthermore, the increasing demand for smartphone-assisted platforms are expected to further boost the North American Intelligent Virtual Assistant (IVA) industry growth. Despite its smaller size in comparison to the North American market, the intelligent virtual assistant industry from the Asia-Pacific region, with its main players located in India and China is predicted to grow at an annual growth rate of 40% (above global average) over the 2016-2024 period.[47]
Economic opportunity for enterprises
[edit]Virtual assistants should not be only seen as a gadget for individuals, as they could have a real economic utility for enterprises. As an example, a virtual assistant can take the role of an always available assistant with an encyclopedic knowledge. And which can organize meetings, check inventories, verify informations. Virtual Assistants are all the more important that their integration in small and middle-sized enterprises often consists in an easy first step through the more global adaptation and use of Internet of Things (IoT). Indeed IoT technlologies are first perceived by small and medium-sized enterprises as technologies of critical importance, but too complicated, risky or costly to be used.[49]
Security
[edit]In May 2018, researchers from the University of California, Berkeley, published a paper that showed audio commands undetectable for the human ear could be directly embedded into music or spoken text, thereby manipulating virtual assistants into performing certain actions without the user taking note of it.[50] The researchers made small changes to audio files, which cancelled out the sound patterns that speech recognition systems are meant to detect. These were replaced with sounds that would be interpreted differently by the system and command it to dial phone numbers, open websites or even transfer money.[50] The possibility of this has been known since 2016,[50] and affects devices from Apple, Amazon and Google.[51]
In addition to unintentional actions and voice recording, another security and privacy risk associated with intelligent virtual assistants is malicious voice commands: An attacker who impersonates a user and issues malicious voice commands to, for example, unlock a smart door to gain unauthorized entry to a home or garage or order items online without the user's knowledge. Although some IVAs provide a voice-training feature to prevent such impersonation, it can be difficult for the system to distinguish between similar voices. Thus, a malicious person who is able to access an IVA-enabled device might be able to fool the system into thinking that he or she is the real owner and carry out criminal or mischievous acts.[52]
See also
[edit]- Applications of artificial intelligence
- Chatbot
- Conversational user interface
- Computer facial animation
- Expert system
- Home network
- Intelligent agent
- Knowledge Navigator
- Microsoft Office Assistant
- Natural language processing
- Simulated reality
- Software agent
- Wizard (software)
References
[edit]- ^ Hoy, Matthew B. (2018). "Alexa, Siri, Cortana, and More: An Introduction to Voice Assistants". Medical Reference Services Quarterly. 37 (1): 81–88. doi:10.1080/02763869.2018.1404391. PMID 29327988. S2CID 30809087.
- ^ Klüwer, Tina. "From chatbots to dialog systems." Conversational agents and natural language interaction: Techniques and Effective Practices. IGI Global, 2011. 1-22.
- ^ Daniel B. Kline (2017-01-30). "Alexa, How Big Is Amazon's Echo?". The Motley Fool.
- ^ https://www.cnet.com/news/google-finding-its-voice/
- ^ Moskvitch, Katia. "The machines that learned to listen". www.bbc.com. Retrieved 2020-05-05.
- ^ Epstein, J; Klinkenberg, W. D (2001-05-01). "From Eliza to Internet: a brief history of computerized assessment". Computers in Human Behavior. 17 (3): 295–314. doi:10.1016/S0747-5632(01)00004-8. ISSN 0747-5632.
- ^ Weizenbaum, Joseph (1976). Computer power and human reason : from judgment to calculation. Oliver Wendell Holmes Library Phillips Academy. San Francisco : W. H. Freeman.
- ^ "Smartphone: your new personal assistant - Orange Pop". 2017-07-10. Archived from the original on 2017-07-10. Retrieved 2020-05-05.
- ^ Darren Murph (2011-10-04). "iPhone 4S hands-on!". Engadget.com. Retrieved 2017-12-10.
- ^ "Feature: Von IBM Shoebox bis Siri: 50 Jahre Spracherkennung - WELT" [From IBM Shoebox to Siri: 50 years of speech recognition]. Die Welt (in German). Welt.de. 2012-04-20. Retrieved 2017-12-10.
- ^ https://www.bloomberg.com/press-releases/2018-10-30/conversica-raises-31-million-in-series-c-funding-to-fuel-expansion-of-conversational-ai-for-business
- ^ Herrera, Sebastian. "Amazon Extends Alexa's Reach Into Wearables". WSJ. Retrieved 2019-09-26.
- ^ "S7617 - Developing Your Own Wake Word Engine Just Like 'Alexa' and 'OK Google'". GPU Technology Conference. Retrieved July 17, 2017.
- ^ Lynn La (2017-02-27). "Everything Google Assistant can do on the Pixel". CNET. Retrieved 2017-12-10.
- ^ Morrison, Maureen (2014-10-05). "Domino's Pitches Voice-Ordering App in Fast-Food First | CMO Strategy". AdAge. Retrieved 2017-12-10.
- ^ Dan O'Shea (2017-01-04). "LG introduces smart refrigerator with Amazon Alexa-enabled grocery ordering". Retail Dive. Retrieved 2017-12-10.
{{cite web}}
: CS1 maint: multiple names: authors list (link) CS1 maint: numeric names: authors list (link) - ^ Samuel Gibbs (2017-02-07). "Amazon's Alexa escapes the Echo and gets into cars | Technology". The Guardian. Retrieved 2017-12-10.
- ^ "What is Google Assistant, how does it work, and which devices offer it?". Pocket-lint. 2017-10-06. Retrieved 2017-12-10.
- ^ ""Ask Jenn", Alaska Airlines website". Alaskaair.com. 2017-01-02. Retrieved 2017-12-10.
- ^ AT&T Tech Channel (2013-06-26). "American Airlines (US Airways) - First US Airline to Deploy Natural Language Speech" (video), Nuance Enterprise on YouTube. YouTube.com. Retrieved 2017-12-10.
YouTube title: Airline Information System, 1989 - AT&T Archives - speech recognition
- ^ Taylor Martin; David Priest (2017-09-10). "The complete list of Alexa commands so far". CNET. Retrieved 2017-12-10.
- ^ Kongthon, Alisa; Sangkeettrakarn, Chatchawal; Kongyoung, Sarawoot; Haruechaiyasak, Choochart (2009-01-01). Implementing an Online Help Desk System Based on Conversational Agent. MEDES '09. New York, NY, USA: ACM. pp. 69:450–69:451. doi:10.1145/1643823.1643908. ISBN 9781605588292. S2CID 1046438.
{{cite book}}
:|journal=
ignored (help) - ^ Anthony O'Donnell (2010-06-03). "Aetna's new "virtual online assistant"". Insurance & Technology. Archived from the original on 2010-06-07.
- ^ "How to prepare your products and brand for conversational commerce". 6 March 2018.
- ^ Taylor, Glenn. "Retail's Big Opportunity: 87% Of U.S. Consumers Grasp The Power Of Conversational Commerce - Retail TouchPoints".
- ^ Zhang, Guoming; Yan, Chen; Ji, Xiaoyu; Zhang, Tianchen; Zhang, Taimin; Xu, Wenyuan (2017). "DolphinAttack". Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security - CCS '17. pp. 103–117. arXiv:1708.09537. doi:10.1145/3133956.3134052. ISBN 9781450349468. S2CID 2419970.
- ^ Lei, Xinyu; Tu, Guan-Hua; Liu, Alex X.; Li, Chi-Yu; Xie, Tian (2017). "The Insecurity of Home Digital Voice Assistants - Amazon Alexa as a Case Study". arXiv:1712.03327 [cs.CR].
- ^ "Doing more to protect your privacy with the Assistant". Google. 2019-09-23. Retrieved 2020-02-27.
- ^ www.amazon.com https://www.amazon.com/gp/help/customer/display.html?nodeId=GVP69FUJ48X9DK8V. Retrieved 2020-02-27.
{{cite web}}
: Missing or empty|title=
(help) - ^ "Improving Siri's privacy protections". Apple Newsroom. Retrieved 2020-02-27.
- ^ Minker, W.; Néel, F. (2002). "Développement des technologies vocales". Le Travail Humain. 65 (3): 261. doi:10.3917/th.653.0261. ISSN 0041-1868.
- ^ Wajcman, Judy (2018). "The Digital Architecture of time Management". Science, Technology, & Human Values. 44 (2): 315–337. doi:10.1177/0162243918795041. S2CID 149648777.
{{cite journal}}
: CS1 maint: url-status (link) - ^ Yang, Heetae; Lee, Hwansoo (2018-06-26). "Understanding user behavior of virtual personal assistant devices". Information Systems and E-Business Management. 17 (1): 65–87. doi:10.1007/s10257-018-0375-1. ISSN 1617-9846. S2CID 56838915.
- ^ Tisseron, Serge (2019). "La famille sous écoute". L'École des parents. n° 632 (3): 16. doi:10.3917/epar.632.0016. ISSN 0424-2238. S2CID 199344092.
{{cite journal}}
:|volume=
has extra text (help) - ^ Casilli, Antonio A. (2019). En attendant les robots. Enquête sur le travail du clic. Editions Seuil. ISBN 978-2-02-140188-2. OCLC 1083583353.
- ^ Horton, John Joseph; Chilton, Lydia B. (2010). "The labor economics of paid crowdsourcing". Proceedings of the 11th ACM Conference on Electronic Commerce - EC '10. New York, New York, USA: ACM Press: 209. doi:10.1145/1807342.1807376. ISBN 978-1-60558-822-3. S2CID 18237602.
- ^ Casilli, Antonio A. (2019). En attendant les robots. Enquête sur le travail du clic. Editions Seuil. ISBN 978-2-02-140188-2. OCLC 1083583353.
- ^ "Amazon Lex, the technology behind Alexa, opens up to developers". TechCrunch. 2017-04-20. Retrieved 2017-12-10.
- ^ "Actions on Google | Google Developers". Retrieved 2017-12-10.
- ^ "Watson - Stories of how AI and Watson are transforming business and our world". Ibm.com. Retrieved 2017-12-10.
- ^ Memeti, Suejb; Pllana, Sabri (January 2018). "PAPA: A parallel programming assistant powered by IBM Watson cognitive computing technology". Journal of Computational Science. 26: 275–284. doi:10.1016/j.jocs.2018.01.001.
- ^ "Baidu unveils 3 smart speakers with its Duer digital assistant". 8 January 2018.
- ^ Newton, Casey (14 January 2018). "Facebook is shutting down M, its personal assistant service that combined humans and AI". The Verge. Vox Media. Retrieved 8 January 2018.
- ^ Janakiram MSV (20 August 2015). "Meet Mycroft, The Open Source Alternative To Amazon Echo". Forbes. Retrieved 27 October 2016.
- ^ "5 Consumer Trends for 2017". TrendWatching. 2016-10-31. Retrieved 2017-12-10.
- ^ Felix Richter (2016-08-26). "Chart: Digital Assistants - Always at Your Service". Statista. Retrieved 2017-12-10.
- ^ a b c "Virtual Assistant Industry Statistics « Global Market Insights, Inc". Gminsights.wordpress.com. 2017-01-30. Retrieved 2017-12-10.
- ^ a b "Virtual digital assistants to overtake world population by 2021". ovum.informa.com. Retrieved 2018-05-11.
- ^ Jones, Nory B.; Graham, C. Matt (February 2018). "Can the IoT Help Small Businesses?". Bulletin of Science, Technology & Society. 38 (1–2): 3–12. doi:10.1177/0270467620902365. ISSN 0270-4676. S2CID 214031256.
- ^ a b c "Alexa and Siri Can Hear This Hidden Command. You Can't". The New York Times. 2018-05-10. ISSN 0362-4331. Retrieved 2018-05-11.
- ^ "As voice assistants go mainstream, researchers warn of vulnerabilities". CNET. 2018-05-10. Retrieved 2018-05-11.
- ^ Chung, H.; Iorga, M.; Voas, J.; Lee, S. (2017). "Alexa, Can I Trust You?". Computer. 50 (9): 100–104. doi:10.1109/MC.2017.3571053. ISSN 0018-9162. PMC 5714311. PMID 29213147.