Jump to content

JMdict

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by 126.25.73.40 (talk) at 00:47, 7 October 2016. The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

JMdict is a large machine-readable multilingual Japanese dictionary. As of August 2016, it contained JapaneseEnglish translations for more than 180,000 entries, representing around 200,000 unique headword–reading combinations.[1] Because the dictionary files are free to use (with attribution), they have been widely adopted on the Internet and are used in many computer and smartphone applications. This project is considered a standard Japanese–English reference on the Internet and is used by the Unihan Database and several other Japanese–English projects.

History

The JMdict/EDICT project was started by Jim Breen in 1991 with the creation of "EDICT" (a flat-text file in EUC-JP encoding), which was later expanded to a UTF-8-encoded XML file in 1999 as "JMdict". The XML format allows for multiple surface forms of lexemes and multiple readings, as well as cross-references and annotations. It permits glosses in other languages and contains French, German, Russian, etc. translations for many entries. The original EDICT format is still being generated for systems that rely on that format.

An expanded version (EDICT2), which reflects the structure of the XML entries, is produced and is used by several systems including the server for WWWJDIC, Jim Breen's own online dictionary search tool. Versions of JMdict are also produced in the XML format used by Apple's "Dict" application and in the EPWING/JIS X 4081 format used by many Japanese electronic dictionary systems.

Since 1991, JMdict/EDICT has been updated and expanded by many contributors. Since 2000, the EDICT project has been managed by the Electronic Dictionary Research and Development Group (EDRDG).[2] In 2010 maintenance of the dictionary was moved to an online database system. The dictionary is managed by an editorial board including Jim Breen and six other editors.[3]

EDICT has inspired other projects, including the CEDICT Chinese dictionary project started by Paul Denisowski in 1997.[4]

Word class abbreviations

EDICT has a set of word class abbreviations[5] that are used to disambiguate classes with similar word endings.

References

  1. ^ Breen, Jim. "The EDICT Dictionary File". Electronic Dictionary Research and Development Group. Retrieved 8 October 2014.
  2. ^ "Electronic Dictionary Research and Development Group File". Retrieved 20 June 2011.
  3. ^ Breen, Jim. "JMdict Editorial Board". Monash University. Retrieved 8 October 2014.
  4. ^ [1]
  5. ^ EDICT abbreviation list: http://www.csse.monash.edu.au/~jwb/jmdict_dtd_h.html