Apertium

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Wei2912 (talk | contribs) at 09:26, 12 May 2015 (→‎History: Replace broken link with citations.). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Apertium
Stable release: 3.2[1] / September 21, 2010
Written in: C++
Operating system: POSIX compatible
Available in: Multi-lingual
Type: Machine translation
License: GNU General Public License
Website: www.apertium.org

Apertium is a rule-based machine translation platform. It is free software, released under the terms of the GNU General Public License.

History

Apertium originated as one of the machine translation engines in the OpenTrad project, which was funded by the Spanish government. It was originally designed to translate between closely related languages, although it has since been expanded to handle more divergent language pairs. Creating a new machine translation system only requires developing linguistic data (dictionaries and rules) in well-specified XML formats.
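
The idea that a translation system is driven by linguistic data rather than by code can be sketched as follows. This is a minimal illustration only: the XML layout, the element names (`dictionary`, `entry`, `source`, `target`) and the helper functions are hypothetical and are not Apertium's actual dictionary format. Marking unknown words with an asterisk is a convention of this sketch.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified Spanish-to-Catalan word list.
# The element names are illustrative and do not follow Apertium's real schema.
TOY_DICTIONARY = """
<dictionary>
  <entry><source>gato</source><target>gat</target></entry>
  <entry><source>casa</source><target>casa</target></entry>
  <entry><source>perro</source><target>gos</target></entry>
</dictionary>
"""


def load_dictionary(xml_text):
    """Parse the toy XML and return a source-word -> target-word mapping."""
    root = ET.fromstring(xml_text)
    return {e.findtext("source"): e.findtext("target") for e in root.iter("entry")}


def translate_words(words, table):
    """Look up each word; words missing from the table are marked with '*'."""
    return [table.get(word, "*" + word) for word in words]


if __name__ == "__main__":
    table = load_dictionary(TOY_DICTIONARY)
    print(translate_words(["perro", "azul"], table))
```

Adding a word to this toy system means adding an `entry` element, with no change to the Python code, which is the point of keeping the linguistic data separate from the engine.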

Language data developed for it (in collaboration with the Universidade de Vigo, the Universitat Politècnica de Catalunya and the Universitat Pompeu Fabra) currently supports, in stable versions, Aragonese, Asturian, Basque, Breton, Bulgarian, Catalan, Danish, English, Esperanto, French, Galician, Icelandic, Italian, Macedonian, Norwegian (Bokmål and Nynorsk), Occitan, Portuguese, Romanian, Spanish, Swedish and Welsh. A full list is available below. Several companies are also involved in the development of Apertium, including Prompsit Language Engineering, Imaxin Software and Eleka Ingeniaritza Linguistikoa.

Apertium is a shallow-transfer machine translation system, which uses finite state transducers for all of its lexical transformations, and hidden Markov models for part-of-speech tagging or word category disambiguation. Constraint Grammar taggers are also used for some language pairs (e.g. Breton-French).
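
How a hidden Markov model resolves word category ambiguity can be sketched with a standard Viterbi decoder. The example below is not Apertium's tagger: the tag set, the probabilities and the three-word vocabulary are invented for illustration, and the function names are hypothetical. It shows the ambiguous word "books" being tagged as a noun or a verb depending on the preceding tag.

```python
import math

# Toy first-order HMM. All tags and probabilities are made up for illustration.
TAGS = ["DET", "NOUN", "VERB"]
START = {"DET": 0.6, "NOUN": 0.3, "VERB": 0.1}
TRANS = {
    "DET": {"DET": 0.05, "NOUN": 0.9, "VERB": 0.05},
    "NOUN": {"DET": 0.1, "NOUN": 0.2, "VERB": 0.7},
    "VERB": {"DET": 0.5, "NOUN": 0.4, "VERB": 0.1},
}
# "books" is ambiguous: it can be emitted by NOUN or by VERB.
EMIT = {
    "DET": {"the": 1.0},
    "NOUN": {"books": 0.5, "clerk": 0.5},
    "VERB": {"books": 1.0},
}


def log(p):
    """Log-probability, with log(0) mapped to negative infinity."""
    return math.log(p) if p > 0 else float("-inf")


def viterbi(words):
    """Return the most probable tag sequence for a list of words."""
    if not words:
        return []
    # Each column maps tag -> (best log-probability so far, previous tag).
    columns = [
        {t: (log(START[t]) + log(EMIT[t].get(words[0], 0.0)), None) for t in TAGS}
    ]
    for word in words[1:]:
        column = {}
        for tag in TAGS:
            emit = log(EMIT[tag].get(word, 0.0))
            column[tag] = max(
                (columns[-1][prev][0] + log(TRANS[prev][tag]) + emit, prev)
                for prev in TAGS
            )
        columns.append(column)
    # Backtrack from the best final tag to recover the full path.
    tag = max(TAGS, key=lambda t: columns[-1][t][0])
    path = [tag]
    for column in reversed(columns[1:]):
        tag = column[tag][1]
        path.append(tag)
    return list(reversed(path))


if __name__ == "__main__":
    print(viterbi(["the", "books"]))
    print(viterbi(["the", "clerk", "books"]))
```

After a determiner the noun reading of "books" is the more probable one, while after a noun the verb reading wins; the decoder picks the tag sequence that maximizes the product of transition and emission probabilities over the whole sentence.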

The project has taken part in the 2009[2], 2010[3], 2011[4], 2012[5], 2013[6] and 2014[7] editions of Google Summer of Code and the 2010[8], 2011[9], 2012[10], 2013[11] and 2014[12] editions of Google Code-in.

Language pairs

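The table below lists the currently stable language pairs. Each row gives the pairs for one language, with cells in the order of the header row; the cell for the row language itself is omitted. An arrow shows the translation direction: → from the row language to the column language, ← from the column language to the row language, and ⇄ in both directions.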

Arabic Aragonese Asturian Basque Breton Bulgarian Catalan Danish English Esperanto French Galician Icelandic Indonesian Italian Macedonian Malaysian Maltese North Sami Norwegian (Bokmål) Norwegian (Nynorsk) Occitan Portuguese Romanian Spanish Swedish Welsh
Arabic No No No No No No No No No No No No No No No No Yes (←) No No No No No No No No No
Aragonese No No No No No No No No No No No No No No No No No No No No No No No Yes (⇄) No No
Asturian No No No No No No No No No No No No No No No No No No No No No No No Yes (⇄) No No
Basque No No No No No No No No No No No No No No No No No No No No No No No Yes (→) No No
Breton No No No No No No No No No Yes (→) No No No No No No No No No No No No No No No No
Bulgarian No No No No No No No No No No No No No No Yes (⇄) No No No No No No No No No No No
Catalan No No No No No No No Yes (⇄) Yes (→) Yes (⇄) No No No Yes (←) No No No No No No Yes (⇄) Yes (⇄) No Yes (⇄) No No
Danish No No No No No No No No No No No No No No No No No No No No No No No No Yes (←) No
English No No No No No No Yes (⇄) No Yes (⇄) No Yes (⇄) Yes (←) No No Yes (←) No No No No No No No No Yes (⇄) No Yes (←)
Esperanto No No No No No No Yes (←) No Yes (⇄) Yes (←) No No No No No No No No No No No No No Yes (←) No No
French No No No No Yes (←) No Yes (⇄) No No Yes (→) No No No No No No No No No No No No No Yes (⇄) No No
Galician No No No No No No No No Yes (⇄) No No No No No No No No No No No No Yes (⇄) No Yes (⇄) No No
Icelandic No No No No No No No No Yes (→) No No No No No No No No No No No No No No No No No
Indonesian No No No No No No No No No No No No No No No Yes (→) No No No No No No No No No No
Italian No No No No No No Yes (→) No No No No No No No No No No No No No No No No No No No
Macedonian No No No No No Yes (⇄) No No Yes (→) No No No No No No No No No No No No No No No No No
Malaysian No No No No No No No No No No No No No Yes (←) No No No No No No No No No No No No
Maltese Yes (→) No No No No No No No No No No No No No No No No No No No No No No No No No
North Sami No No No No No No No No No No No No No No No No No No Yes (→) No No No No No No No
Norwegian (Bokmål) No No No No No No No No No No No No No No No No No No Yes (←) Yes (⇄) No No No No No No
Norwegian (Nynorsk) No No No No No No No No No No No No No No No No No No No Yes (⇄) No No No No No No
Occitan No No No No No No Yes (⇄) No No No No No No No No No No No No No No No No Yes (⇄) No No
Portuguese No No No No No No Yes (⇄) No No No No Yes (⇄) No No No No No No No No No No No Yes (⇄) No No
Romanian No No No No No No No No No No No No No No No No No No No No No No No Yes (←) No No
Spanish No Yes (⇄) Yes (⇄) Yes (←) No No Yes (⇄) No Yes (⇄) Yes (→) Yes (⇄) No No No No No No No No No No Yes (⇄) Yes (⇄) Yes (←) No No
Swedish No No No No No No No Yes (→) No No No No No No No No No No No No No No No No No No
Welsh No No No No No No No No Yes (→) No No No No No No No No No No No No No No No No No

References

  • Corbí-Bellot, M. et al. (2005) "An open-source shallow-transfer machine translation engine for the romance languages of Spain" in Proceedings of the European Association for Machine Translation, 10th Annual Conference, Budapest 2005, pp. 79–86
  • Armentano-Oller, C. et al. (2006) "Open-source Portuguese-Spanish machine translation" in Lecture Notes in Computer Science 3960 [Computational Processing of the Portuguese Language, Proceedings of the 7th International Workshop on Computational Processing of Written and Spoken Portuguese, PROPOR 2006], pp. 50–59
  1. ^ http://sourceforge.net/projects/apertium/files/apertium/3.2/
  2. ^ "Accepted organizations for Google Summer of Code 2009".
  3. ^ "Accepted organizations for Google Summer of Code 2010".
  4. ^ "Accepted organizations for Google Summer of Code 2011".
  5. ^ "Accepted organizations for Google Summer of Code 2012".
  6. ^ "Accepted organizations for Google Summer of Code 2013".
  7. ^ "Accepted organizations for Google Summer of Code 2014".
  8. ^ "Accepted organizations for Google Code-in 2010".
  9. ^ "Accepted organizations for Google Code-in 2011".
  10. ^ "Accepted organizations for Google Code-in 2012".
  11. ^ "Accepted organizations for Google Code-in 2013".
  12. ^ "Accepted organizations for Google Code-in 2014".