JMdict
JMdict is a large machine-readable multilingual Japanese dictionary. As of August 2016, it contained Japanese–English translations for more than 180,000 entries, representing around 200,000 unique headword–reading combinations.[1] Because the dictionary files are free to use (with attribution), they have been widely adopted on the Internet and are used in many computer and smartphone applications. This project is considered a standard Japanese–English reference on the Internet and is used by the Unihan Database and several other Japanese–English projects.
History
The JMdict/EDICT project was started by Jim Breen in 1991 with the creation of "EDICT" (a flat-text file in EUC-JP encoding), which was later expanded to a UTF-8-encoded XML file in 1999 as "JMdict". The XML format allows for multiple surface forms of lexemes and multiple readings, as well as cross-references and annotations. It permits glosses in other languages and contains French, German, Russian, etc. translations for many entries. The original EDICT format is still being generated for systems that rely on that format.
An expanded version (EDICT2), which reflects the structure of the XML entries, is produced and is used by several systems including the server for WWWJDIC, Jim Breen's own online dictionary search tool. Versions of JMdict are also produced in the XML format used by Apple's "Dict" application and in the EPWING/JIS X 4081 format used by many Japanese electronic dictionary systems.
Since 1991, JMdict/EDICT has been updated and expanded by many contributors. Since 2000, the EDICT project has been managed by the Electronic Dictionary Research and Development Group (EDRDG).[2] In 2010 maintenance of the dictionary was moved to an online database system. The dictionary is managed by an editorial board including Jim Breen and six other editors.[3]
EDICT has inspired other projects, including the CEDICT Chinese dictionary project started by Paul Denisowski in 1997.[4]
Word class abbreviations
EDICT has a set of word class abbreviations[5] that are used to disambiguate classes with similar word endings.
Abbreviation | Explanation |
---|---|
MA | martial arts term |
X | rude or X-rated term (not displayed in educational software) |
abbr | abbreviation |
adj-i | adjective (keiyoushi) |
adj-na | adjectival nouns or quasi-adjectives (keiyodoshi) |
adj-no | nouns which may take the genitive case particle `no' |
adj-pn | pre-noun adjectival (rentaishi) |
adj-t | `taru' adjective |
adj-f | noun or verb acting prenominally |
adj | former adjective classification (being removed) |
adv | adverb (fukushi) |
adv-to | adverb taking the `to' particle |
arch | archaism |
astron | astronomy terminology |
ateji | ateji (phonetic) reading |
aux | auxiliary |
aux-v | auxiliary verb |
aux-adj | auxiliary adjective |
Buddh | Buddhist term |
chem | chemistry term |
chn | children's language |
col | colloquialism |
comp | computer terminology |
conj | conjunction |
ctr | counter |
derog | derogatory |
eK | exclusively kanji |
ek | exclusively kana |
exp | Expressions (phrases, clauses, etc.) |
fam | familiar language |
fem | female term or language |
food | food term |
geom | geometry term |
gikun | gikun (meaning as reading) or jukujikun (special kanji reading) |
hon | honorific or respectful (sonkeigo) language |
hum | humble (kenjougo) language |
iK | word containing irregular kanji usage |
id | idiomatic expression |
ik | word containing irregular kana usage |
int | interjection (kandoushi) |
io | irregular okurigana usage |
iv | irregular verb |
ling | linguistics terminology |
m-sl | manga slang |
male | male term or language |
male-sl | male slang |
math | mathematics |
mil | military |
n | noun (common) (futsuumeishi) |
n-adv | adverbial noun (fukushitekimeishi) |
n-suf | noun, used as a suffix |
n-pref | noun, used as a prefix |
n-t | noun (temporal) (jisoumeishi) |
num | numeric |
oK | word containing out-dated kanji |
obs | obsolete term |
obsc | obscure term |
ok | out-dated or obsolete kana usage |
on-mim | onomatopoeic or mimetic word |
pn | pronoun |
poet | poetical term |
pol | polite (teineigo) language |
pref | prefix |
proverb | proverb |
prt | particle |
physics | physics terminology |
rare | rare |
sens | sensitive |
sl | slang |
suf | suffix |
uK | word usually written using kanji alone |
uk | word usually written using kana alone |
v1 | Ichidan verb |
v2a-s | Nidan verb with 'u' ending (archaic) |
v4h | Yondan verb with `hu/fu' ending (archaic) |
v4r | Yondan verb with `ru' ending (archaic) |
v5 | Godan verb (not completely classified) |
v5aru | Godan verb - -aru special class |
v5b | Godan verb with `bu' ending |
v5g | Godan verb with `gu' ending |
v5k | Godan verb with `ku' ending |
v5k-s | Godan verb - Iku/Yuku special class |
v5m | Godan verb with `mu' ending |
v5n | Godan verb with `nu' ending |
v5r | Godan verb with `ru' ending |
v5r-i | Godan verb with `ru' ending (irregular verb) |
v5s | Godan verb with `su' ending |
v5t | Godan verb with `tsu' ending |
v5u | Godan verb with `u' ending |
v5u-s | Godan verb with `u' ending (special class) |
v5uru | Godan verb - Uru old class verb (old form of Eru) |
v5z | Godan verb with `zu' ending |
vz | Ichidan verb - zuru verb (alternative form of -jiru verbs) |
vi | intransitive verb |
vk | Kuru verb - special class |
vn | irregular nu verb |
vr | irregular ru verb, plain form ends with -ri |
vs | noun or participle which takes the aux. verb suru |
vs-c | su verb - precursor to the modern suru |
vs-s | suru verb - special class |
vs-i | suru verb - irregular |
kyb | Kyoto-ben |
osb | Osaka-ben |
ksb | Kansai-ben |
ktb | Kantou-ben |
tsb | Tosa-ben |
thb | Touhoku-ben |
tsug | Tsugaru-ben |
kyu | Kyuushuu-ben |
rkb | Ryuukyuu-ben |
nab | Nagano-ben |
vt | transitive verb |
vulg | vulgar expression or word |
References
- ^ Breen, Jim. "The EDICT Dictionary File". Electronic Dictionary Research and Development Group. Retrieved 8 October 2014.
- ^ "Electronic Dictionary Research and Development Group File". Retrieved 20 June 2011.
- ^ Breen, Jim. "JMdict Editorial Board". Monash University. Retrieved 8 October 2014.
- ^ [1]
- ^ EDICT abbreviation list: http://www.csse.monash.edu.au/~jwb/jmdict_dtd_h.html
External links
- JMdict/EDICT: Japanese–English Dictionary Project
- Official web interface for EDICT
- RomajiDesu: JMDict-based Japanese–English dictionary with a friendly interface
- JEDict: JMDict-based Japanese–English dictionary for OS X
- imiwa?: JMDict-based Japanese–English dictionary for iOS
- Edict for Kindle: can be used as the default dictionary on Kindle Paperwhites, iPads and Android devices.