Google Translate

From Wikipedia, the free encyclopedia
  (Redirected from Google Translator)
Jump to: navigation, search
Google Translate
Google Translate Icon.png
Type of site
Machine translation
Available in 103 languages, see below
Owner Google
Website translate.google.com
Commercial Yes
Registration Optional
Users Over 200+ million people daily
Launched April 28, 2006; 11 years ago (2006-04-28) (as statistical machine translation)[1]
November 15, 2016; 6 months ago (2016-11-15) (as neural machine translation)[2]
Current status Active

Google Translate is a free multilingual machine translation service developed by Google, to translate text, speech, images, sites, or real-time video from one language into another. It offers a web interface, mobile apps for Android and iOS, and an API that helps developers build browser extensions and software applications. Google Translate supports over 100 languages at various levels[3] and as of May 2013, serves over 200 million people daily.[4]

In November 2016, Google announced that Google Translate would switch to a neural machine translation engine - Google Neural Machine Translation (GNMT) - which translates "whole sentences at a time, rather than just piece by piece. It uses this broader context to help it figure out the most relevant translation, which it then rearranges and adjusts to be more like a human speaking with proper grammar". Originally only enabled for a few languages in 2016, GNMT is gradually being used for more languages.

Features[edit]

Google Translate can translate multiple forms of text and media, including text, speech, images, sites, or real-time video, from one language to another.[5][6]

It supports over 100 languages at various levels[3] and as of May 2013, serves over 200 million people daily.[4]

For some languages, Google Translate can pronounce translated text, highlight corresponding words and phrases in the source and target text, and act as a simple dictionary for single-word input. If "Detect language" is selected, text in an unknown language can be automatically identified.[7]

If a user enters a URL in the source text, Google Translate will produce a hyperlink to a machine translation of the website.[8]

For some languages, text can be entered via an on-screen keyboard, handwriting recognition, or speech recognition.[9][10]

Browser integration[edit]

Google Translate is available in some browsers as an extension which can run the translation engine.

A number of Firefox extensions exist for Google services, and likewise for Google Translate, which allow right-click command access to the translation service.[11]

An extension for Google's Chrome browser also exists;[12] in February 2010, Google Translate was integrated into the Chrome browser by default, for optional automatic webpage translation.[13]

Mobile apps[edit]

Google Translate
Google Translate Icon.png
Developer(s) Google Inc.
Stable release(s) [±]
Android 5.9.0 / May 12, 2017; 10 days ago (2017-05-12)[14]
iOS 5.9.0 / April 24, 2017; 28 days ago (2017-04-24)[15]
Platform
Size 13.69 MB (Android)
64.5 MB (iOS)
Type Machine translation
Website translate.google.com

The Google Translate app[3] for Android and iOS supports more than 90 languages and can translate 37 languages via photo, 32 via voice in "conversation mode", and 27 via real-time video in "augmented reality mode".[16]

The Android app was released in January 2010, while an HTML5 web application was released for iOS users in August 2008,[17] followed by a native app on February 8, 2011.[18]

An early 2011 version supported Conversation Mode when translating between English and Spanish (in alpha testing). This interface within Google Translate allows users to communicate fluidly with a nearby person in another language. In October 2011 it was expanded to 14 languages.[19]

The 'Camera input' functionality allows users to take a photograph of a document, signboard, etc. Google Translate recognises the text from the image using optical character recognition (OCR) technology and gives the translation. Camera input is not available for all languages.

In January 2015, the application gained the ability to translate text in real time using the device's camera, as a result of Google's acquisition of the Word Lens app.[20] The speed and quality of real-time video translation (augmented reality) feature were further enhanced in July 2015 with the release of a new implementation that utilizes convolutional neural networks.[21][22]

API[edit]

In May 2011, Google announced that the Google Translate API for software developers had been deprecated and would cease functioning.[23] The Translate API page stated the reason as "substantial economic burden caused by extensive abuse" with an end date set for December 1, 2011.[24] In response to public pressure, Google announced in June 2011 that the API would continue to be available as a paid service.[25]

Because the API was used in numerous third-party websites and apps, the original decision to deprecate it led some developers to criticize Google and question the viability of using Google APIs in their products.[26][27]

Supported languages[edit]

The following languages are supported in Google Translate.[28]

Languages in development[edit]

These languages are not yet supported by Google Translate, but are available in the Translate Community.[29]

Method of translation[edit]

In April 2006, Google Translate launched with a statistical machine translation engine.[1]

Google Translate does not apply grammatical rules, since its algorithms are based on statistical analysis rather than traditional rule-based analysis. The system's original creator, Franz Josef Och, has criticized the effectiveness of rule-based algorithms in favor of statistical approaches.[30] It is based on a method called statistical machine translation, and more specifically, on research by Och who won the DARPA contest for speed machine translation in 2003. Och was the head of Google's machine translation group until leaving to join Human Longevity, Inc. in July 2014.[31]

According to Och, a solid base for developing a usable statistical machine translation system for a new pair of languages from scratch would consist of a bilingual text corpus (or parallel collection) of more than 150-200 million words, and two monolingual corpora each of more than a billion words.[30] Statistical models from these data are then used to translate between those languages.

To acquire this huge amount of linguistic data, Google used United Nations documents.[32] The UN typically publishes documents in all six official UN languages, which has produced a very large 6-language corpus.

Google Translate does not translate from one language to another (L1 → L2). Instead, it often translates first to English and then to the target language (L1 → EN → L2).[33]

When Google Translate generates a translation, it looks for patterns in hundreds of millions of documents to help decide on the best translation. By detecting patterns in documents that have already been translated by human translators, Google Translate makes intelligent guesses as to what an appropriate translation should be.[34]

Before October 2007, for languages other than Arabic, Chinese and Russian, Google Translate was based on SYSTRAN, a software engine which is still used by several other online translation services such as Babel Fish (now defunct). Since October 2007, Google Translate has used proprietary, in-house technology based on statistical machine translation instead.[35][36]

Google Neural Machine Translation[edit]

In September 2016, a research team at Google announced the development of the Google Neural Machine Translation system (GNMT) to increase fluency and accuracy in Google Translate[2][37] and in November announced that Google Translate would switch to GNMT.

Google Translate's new neural machine translation system uses a large end-to-end artificial neural network capable of deep learning,[2][38] in particular, long short-term memory networks.[39][40][41][42] GNMT improves the quality of translation because it uses an example based (EBMT) machine translation method in which the system "learns from millions of examples."[38] It translates "whole sentences at a time, rather than just piece by piece. It uses this broader context to help it figure out the most relevant translation, which it then rearranges and adjusts to be more like a human speaking with proper grammar".[2] GNMT's "proposed architecture" of "system learning" was first tested on over a hundred languages supported by Google Translate.[38] With the end-to-end framework, "the system learns over time to create better, more natural translations."[2] The GNMT network is capable of interlingual machine translation, which encodes the "semantics of the sentence rather than simply memorizing phrase-to-phrase translations",[38][43] and the system did not invent its own universal language, but uses "the commonality found inbetween many languages".[44] GNMT was first enabled for eight languages: to and from English and Chinese, French, German, Japanese, Korean, Portuguese, Spanish and Turkish.[2][37] In March 2017, it was enabled for Hindi, Russian and Vietnamese languages,[45] followed by Indonesian, Bengali, Gujarati, Kannada, Malayalam, Marathi, Punjabi, Tamil and Telugu languages in April.[46]

GNMT is an improvement on Google Translate in that it is capable of translating directly from one language to another (L1 → L2) instead often first translating to English, for example, and then to the target language (L1 → EN → L2).[43] The GNMT system is "capable of Zero-Shot Translation - translating between a language pair (for example, Japanese to Korean) which the "system has never explicitly seen before."[38] Previously, Google Translate translated to English and then to the target language (L1 → EN → L2) not directly from one language to another (L1 → L2).[43]

Limitations[edit]

Some languages produce better results than others. Google Translate performs well especially when English is the target language and the source language is from the European Union due to the prominence of translated EU parliament notes. A 2010 analysis indicated that French to English translation is relatively accurate.[47] However, if the source text is shorter, rule-based machine translations often perform better; this effect is particularly evident in Chinese to English translations. While edits of translations may be submitted, in Chinese specifically one is not able to edit sentences as a whole. Instead, one must edit sometimes arbitrary sets of characters, leading to incorrect edits.[47]

Texts written in the Greek, Devanagari, Cyrillic and Arabic scripts can be transliterated automatically from phonetic equivalents written in the Latin alphabet. The browser version of Google Translate provides the read phonetically option for Japanese to English conversion. The same option is not available on the paid API version.

Accent of English that the "text-to-speech" audio of Google Translate of each country uses
  British English (female)
  American English (female)
  Oceania accent (female)
  No Google translate service

Many of the more popular languages have a "text-to-speech" audio function that is able to read back a text in that language, up to a few dozen words or so. In the case of pluricentric languages, the accent depends on the region: for English, in the Americas, most of the Asia-Pacific and West Asia the audio uses a female General American accent, whereas in Europe, Hong Kong, Malaysia, Singapore, Guyana and all other parts of the world a female British English accent is used, except for a special Oceania accent used in Australia, New Zealand and Norfolk Island; for Spanish, in the Americas a Latin American Spanish accent is used, while in the other parts of the world a Castilian Spanish accent is used; Portuguese uses a São Paulo accent in the world, except for Portugal, where their native accent is used. Some less widely spoken languages use the open-source eSpeak synthesizer for their speech.[citation needed]

Open-source licenses and components[edit]

Language WordNet[48] License
Albanian Albanet CC-BY 3.0/GPL 3
Arabic Arabic Wordnet CC-BY-SA 3
Catalan Multilingual Central Repository CC-BY-3.0
Chinese Chinese Wordnet Wordnet
Danish Dannet Wordnet
English Princeton Wordnet Wordnet
Finnish FinnWordnet Wordnet
French WOLF (WOrdnet Libre du Français) CeCILL-C
Galician Multilingual Central Repository CC-BY-3.0
Hebrew Hebrew Wordnet Wordnet
Hindi IIT Bombay Wordnet Indo Wordnet
Indonesian Wordnet Bahasa MIT
Italian MultiWordnet CC-BY-3.0
Japanese Japanese Wordnet Wordnet
Javanese Javanese Wordnet Wordnet
Malay Wordnet Bahasa MIT
Norwegian Norwegian Wordnet Wordnet
Persian Persian Wordnet Free to Use
Polish plWordnet Wordnet
Portuguese OpenWN-PT CC-BY-SA-3.0
Spanish Multilingual Central Repository CC-BY-3.0
Thai Thai Wordnet Wordnet
Nepali Nepali Wordnet Wordnet

Reviews[edit]

Shortly after launching the translation service, Google won an international competition for English–Arabic and English–Chinese machine translation.[49]

Translation mistakes and oddities[edit]

Since Google Translate uses statistical matching to translate, translated text can often include apparently nonsensical and obvious errors,[50] sometimes swapping common terms for similar but nonequivalent common terms in the other language,[51] or inverting sentence meaning. Novelty websites like Bad Translator and Translation Party have utilized the service to produce humorous text by translating back and forth between multiple languages, similar to the children's game Chinese whispers.[52]

If the app tries to translate Monty Python's "The Funniest Joke in the World" into English, the service returns the message "[FATAL ERROR]".[53]

Translate Community[edit]

Translate Community is a platform intended to improve the Google Translate service. Volunteers can select up to five languages to help improve translation; users can verify translated phrases and translate phrases in their languages to and from English, helping to improve the accuracy of translating more rare and complex phrases.[54] In August 2016 the Google Crowdsource app was released, which also offered translation tasks.[55]

See also[edit]

References[edit]

  1. ^ a b Orch, Franz (April 28, 2006). "Statistical machine translation live". Google Research Blog. Google. Retrieved December 1, 2016. 
  2. ^ a b c d e f Turovsky, Barak (November 15, 2016). "Found in translation: More accurate, fluent sentences in Google Translate". The Keyword Google Blog. Google. Retrieved March 23, 2017. 
  3. ^ a b c "Languages - Google Translate". Google. Retrieved October 15, 2016. 
  4. ^ a b Shankland, Stephen (May 18, 2013). "Google Translate now serves 200 million people daily". CNET. CBS Interactive. Retrieved December 1, 2016. 
  5. ^ "About - Google Translate". Google. Retrieved December 1, 2016. 
  6. ^ "Google Translate Help". Google Translate Help. Google. Retrieved December 1, 2016. 
  7. ^ "Translate written words". Google Translate Help. Google. Retrieved December 1, 2016. 
  8. ^ "Translate text messages, webpages, or documents". Google Translate Help. Google. Retrieved December 1, 2016. 
  9. ^ "Translate with handwriting or virtual keyboard". Google Translate Help. Google. Retrieved December 1, 2016. 
  10. ^ "Translate by speech". Google Translate Help. Google. Retrieved December 1, 2016. 
  11. ^ "Search Add-ons :: Add-ons for Firefox". Mozilla. Retrieved August 7, 2009. 
  12. ^ "Google Translate". Chrome Web Store. Google. Retrieved December 1, 2016. 
  13. ^ Brinkmann, Martin (February 7, 2010). "Google Translate Integrated In Google Chrome 5". Ghacks.net. Retrieved December 1, 2016. 
  14. ^ "Google Translate APKs - APKMirror". APKMirror. Android Police. Retrieved May 20, 2017. 
  15. ^ "Google Translate on the App Store". App Store. Apple Inc. Retrieved May 20, 2017. 
  16. ^ Setalvad, Ariha (July 29, 2015). "Google Translate adds 20 new languages to video text translation". The Verge. Vox Media. Retrieved March 23, 2017. 
  17. ^ Hutchison, Allen (August 7, 2008). "Google Translate now for iPhone". Google Mobile Blog. Google. Retrieved March 23, 2017. 
  18. ^ Zhu, Wenzhang (February 8, 2011). "Introducing the Google Translate app for iPhone". Official Google Blog. Google. Retrieved March 23, 2017. 
  19. ^ Ryan Kim (October 13, 2011). "Google Translate conversation mode expands to 14 languages". GigaOM. 
  20. ^ Turovsky, Barak (January 14, 2015). "Hallo, hola, olá to the new, more powerful Google Translate app". Official Google Blog. Google. Retrieved March 23, 2017. 
  21. ^ Turovsky, Barak (July 29, 2015). "See the world in your language with Google Translate". Official Google Blog. Google. Retrieved March 23, 2017. 
  22. ^ Good, Otavio (July 29, 2015). "How Google Translate squeezes deep learning onto a phone". Google Research Blog. Google. Retrieved March 23, 2017. 
  23. ^ Feldman, Adam (May 26, 2011). "Spring cleaning for some of our APIs". Official Google Code Blog. Google. Archived from the original on May 28, 2011. Retrieved December 11, 2016. 
  24. ^ "Google Translate API (Deprecated)". Google Code. Google. Archived from the original on August 22, 2011. Retrieved December 11, 2016. 
  25. ^ Feldman, Adam (June 3, 2011). "Spring cleaning for some of our APIs". Official Google Code Blog. Google. Retrieved December 11, 2016. 
  26. ^ Burnette, Ed (May 27, 2011). "Google pulls the rug out from under web service API developers, nixes Google Translate and 17 others". ZDNet. CBS Interactive. Retrieved December 11, 2016. 
  27. ^ Wong, George (May 27, 2011). "Google gets rid of APIs for Translate and other services". UberGizmo. Retrieved December 11, 2016. 
  28. ^ "See which features work with each language". Google. Retrieved January 11, 2017. 
  29. ^ "Translate Community: Help us improve Google Translate!". 
  30. ^ a b Och, Franz (September 12, 2005). "Statistical Machine Translation: Foundations and Recent Advances" (PDF). Google. Retrieved December 1, 2016. 
  31. ^ "Franz Och, Ph.D., Expert in Machine Learning and Machine Translation, Joins Human Longevity, Inc. as Chief Data Scientist" (Press release). La Jolla, CA: Human Longevity, Inc. July 29, 2014. Retrieved January 15, 2015. 
  32. ^ "Google seeks world of instant translations". ABC News. Retrieved November 8, 2015. 
  33. ^ Boitet, Christian; Blanchon, Hervé; Seligman, Mark; Bellynck, Valérie. "MT on and for the Web" (PDF). Retrieved December 1, 2016. 
  34. ^ "Inside Google Translate". Google. Archived from the original on August 22, 2010. Retrieved December 11, 2016. 
  35. ^ Chitu, Alex (October 22, 2007). "Google Switches to Its Own Translation System". Unofficial Google Blog. Retrieved March 23, 2017. 
  36. ^ Schwartz, Barry (October 23, 2007). "Google Translate Drops Systran For Home Brewed Translation". Search Engine Land. Retrieved March 23, 2017. 
  37. ^ a b Le, Quoc; Schuster, Mike (September 27, 2016). "A Neural Network for Machine Translation, at Production Scale". Google Research Blog. Google. Retrieved December 1, 2016. 
  38. ^ a b c d e Schuster, Mike; Johnson, Melvin; Thorat, Nikhil (November 22, 2016). "Zero-Shot Translation with Google's Multilingual Neural Machine Translation System". Google Research Blog. Google. Retrieved March 23, 2017. 
  39. ^ Sepp Hochreiter; Jürgen Schmidhuber (1997). "Long short-term memory". Neural Computation. 9 (8): 1735–1780. doi:10.1162/neco.1997.9.8.1735. PMID 9377276. 
  40. ^ Felix A. Gers; Jürgen Schmidhuber; Fred Cummins (2000). "Learning to Forget: Continual Prediction with LSTM". Neural Computation. 12 (10): 2451–2471. doi:10.1162/089976600300015015. 
  41. ^ Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation (26 Sep 2016): Yonghui Wu, Mike Schuster, Zhifeng Chen, Quoc V. Le, Mohammad Norouzi, Wolfgang Macherey, Maxim Krikun, Yuan Cao, Qin Gao, Klaus Macherey, Jeff Klingner, Apurva Shah, Melvin Johnson, Xiaobing Liu, Łukasz Kaiser, Stephan Gouws, Yoshikiyo Kato, Taku Kudo, Hideto Kazawa, Keith Stevens, George Kurian, Nishant Patil, Wei Wang, Cliff Young, Jason Smith, Jason Riesa, Alex Rudnick, Oriol Vinyals, Greg Corrado, Macduff Hughes, Jeffrey Dean. https://arxiv.org/abs/1609.08144
  42. ^ "An Infusion of AI Makes Google Translate More Powerful Than Ever." Cade Metz, WIRED, Date of Publication: 09.27.16. https://www.wired.com/2016/09/google-claims-ai-breakthrough-machine-translation/
  43. ^ a b c Boitet, Christian; Blanchon, Hervé; Seligman, Mark; Bellynck, Valérie (2010). "MT on and for the Web" (PDF). Retrieved December 1, 2016. 
  44. ^ McDonald, Chris (January 7, 2017). "Ok slow down". Medium. Retrieved March 23, 2017. 
  45. ^ Davenport, Corbin (March 6, 2017). "Google Translate now uses neural machine translation for some languages". Android Police. Retrieved April 26, 2017. 
  46. ^ Hager, Ryne (April 25, 2017). "Google adds Indonesian and eight new Indian languages to its neural machine translation". Android Police. Retrieved April 26, 2017. 
  47. ^ a b Ethan Shen, Comparison of online machine translation tools, archived from the original on February 10, 2011, retrieved December 15, 2010 
  48. ^ "Inside Google Translate". Google. Archived from the original on January 1, 2014. Retrieved December 11, 2016. 
  49. ^ Nielsen, Michael. Reinventing discovery: the new era of networked science. Princeton, NJ: Princeton University Press. p. 125. ISBN 978-0-691-14890-8. 
  50. ^ Gomes, Lee (July 22, 2010). "Google Translate Tangles With Computer Learning". Forbes. Retrieved March 23, 2017. 
  51. ^ Weinberg, Nathan (September 10, 2007). "Google Translates Ivan the Terrible as "Abraham Lincoln"". Blog News Channel. Archived from the original on September 12, 2007. Retrieved March 23, 2017. 
  52. ^ Kincaid, Jason (August 7, 2009). "Translation Party: Tapping Into Google Translate's Untold Creative Genius". TechCrunch. AOL. Retrieved March 23, 2017. 
  53. ^ Freeze, Christopher. "JokeWarfare". Instagram. Retrieved 12 April 2017. 
  54. ^ "Translate Community FAQ". Google. 
  55. ^ Whitwam, Ryan (August 29, 2016). "New Google Crowdsource app asks you to help with translation and text transcription a few seconds at a time". Android Police. Retrieved March 23, 2017. 

External links[edit]