List of dictionaries by number of words

This is a list of dictionaries considered authoritative or complete, ordered by the approximate number of words, or headwords, they include. These figures do not account for entries with separate senses for different word classes (such as noun and adjective) or for homographs. Although it is possible to count the number of entries in a dictionary, it is not possible to count the number of words in a language.[1][2] In compiling a dictionary, a lexicographer decides whether the evidence of use is sufficient to justify an entry. This decision is not the same as determining whether the word exists.[citation needed]

The green background means a given dictionary is the largest in a given language.

| Language | Approx. no. of words | Dictionary | Notes |
|---|---|---|---|
| Korean | 1,100,373 | 우리말샘 (Woori Mal Saem, 2017) | Online open dictionary including dialects of South and North Korea.[3] |
| Finnish | 800,000 | RedFox Pro | Online dictionary. The free version has over 300,000 Finnish words and the Pro version has over 800,000. The dictionary aggregates other dictionaries, such as technical ones,[4] and the largest set comes from WordNet.[5] Even this dictionary includes essentially no inflected forms. |
| Turkish | 616,767 | Büyük Türkçe Sözlük | Large online dictionary of the Turkish Language Association.[6] |
| Swedish | 600,000 | Svenska Akademiens ordbok, Swedish Academy | After letters A through T were completed, SAOB included 470,000 words; it included 600,000 words when the alphabet was completed in 2017. Svenska Akademiens ordlista (SAOL), which includes only commonly used words, currently includes about 126,000 words after adding 13,500 and removing 9,000 in its latest edition, SAOL 14, plus an additional 200,000 still-encountered words from earlier editions.[7][8] |
| Icelandic | 560,000 | Orðabók Háskólans | 43,000 basic words and 519,000 compound words, of which more than half are attested only once or never get into print (“instant combinations”).[9] |
| English | 520,000 | English Wiktionary | Contains 520,000 gloss entries and 928,989 total entries.[10] |
| Italian | 500,000 | Grande Dizionario Hoepli Italiano[11][12] | The number of "sayable and writable" word-forms is estimated at over 2 million.[13] |
| Japanese | 500,000 | Nihon Kokugo Daijiten | [14] |
| Lithuanian | 500,000 | Lietuvių kalbos žodynas (Academic Dictionary of Lithuanian) | 22,000 pages in 20 volumes, with quotations from all kinds of writing and dialect records between 1547 and 2001. Accessible online at www.lkz.lt.[15] |
| English | 470,000 | Webster's Third New International Dictionary and Addenda Section | Contains 470,000 entries.[16] |
| Dutch | 400,000 | Woordenboek der Nederlandsche Taal | The 43 volumes of the WNT (including three supplements) consist of 49,255 pages, describing Dutch words from 1500 to 1976.[17] |
| Tamil | 380,000 | Sorkuvai | An online open dictionary run by the Tamil Nadu government.[18] |
| Chinese | 370,000 | Hanyu Da Cidian | [19] |
| English | 350,000 | The American Heritage Dictionary of the English Language, Third Edition | The introductions to the 4th and 5th editions mention that more than 10,000 words have been added, so the total for the 5th edition would be more than 370,000 words.[20][failed verification] |
| Finnish | 350,000 | Suomen murteiden sanakirja (in progress) | Suomen murteiden sanakirja (SMS) will include 350,000 words from different dialects, with well-documented definitions, based on the archives (Suomen murteiden sana-arkisto) of 400,000 words, with over 8 million definitions.[21][22] |
| Persian | 343,466 | Dehkhoda Dictionary, 1998, ISBN 9789640396025 | The original series consisted of 3 million records (فیش fish or برگه bargeh in Persian), with up to 100 records for each word or proper noun, until Dehkhoda's death in November 1955. It currently contains 343,466 entries which, according to the latest digital release of the dictionary by Tehran University Press (version 3.0), are based on an ever-growing library of over 2,300 volumes in lexicology and various other scientific fields.[23][24] |
| Norwegian | 330,000 | Norsk Ordbok | The finished dictionary has about 330,000 headwords, whereas the corpus it is built upon contains about 500,000 words in total.[25] |
| German | 330,000 | Deutsches Wörterbuch | 330,000 words in use since the mid-fifteenth century. Duden's Großes Wörterbuch der deutschen Sprache contains over 200,000 contemporary words.[26][27] |
| Norwegian | 300,000 | Tanums store rettskrivningsordbok (10th edition) | [28] |
| Gujarati | 281,377 | Bhagavadgomandal | 2.81 lakh (281,000) words and their meanings in 9 volumes. Also serves as an encyclopedia with almost 8.22 lakh (822,000) words.[29] |
| Urdu | 264,000 | Urdu Lughat | [30] |
| Ukrainian | 253,000 | Великий орфографічний словник сучасної української лексики | Contains 253,000 entries.[31][32] |
| Czech | 250,000 | Příruční slovník jazyka českého | Nine volumes of this dictionary were printed between 1935 and 1957. They contain about 250,000 words, with their meanings and example usage from literature. The dictionary is available online.[33] |
| Portuguese | 250,000 | Houaiss Dictionary of the Portuguese Language | A total of 250,000 words in the language, with the largest dictionary having 171,000 words.[34] |
| Serbo-Croatian | 241,000 | Dictionary of Serbo-Croatian Literary and Vernacular Language | This dictionary is incomplete. So far, 20 of the planned 40 volumes have been published, containing 241,000 headwords. When complete, the dictionary will have around 500,000 headwords.[35] |
| Belarusian | 223,000 | Большой словарь белорусского языка | [36] |
| English | 207,016 | WordNet 3.1 | As of November 2012, WordNet's latest online version is 3.1. The database contains 155,327 words organized in 175,979 synsets, for a total of 207,016 word-sense pairs.[37] |
| Finnish | 201,000 | Nykysuomen sanakirja, 1961 | Nykysuomen sanakirja can be translated as The Dictionary of Modern Finnish or The Dictionary of Contemporary Finnish, but the language can be quite dated; the dictionary reflects the language only as it was up to 1961. Although it has been republished, it has not been updated. The dictionary contains over 201,000 headwords in six volumes.[38] For modern language, The New Dictionary of Modern Finnish is more relevant. |
| Danish | 200,000 | Ordbog over det danske Sprog | Dictionary maintained by the Society for Danish Language and Literature. Covers Danish language use from 1700 to 1950.[39] The society also maintains a sister dictionary, Den Danske Ordbog, covering language use since 1950. |
| Russian | 200,000 | Толковый словарь живого великорусского языка | Tolkovyi slovar' zhivogo velikorusskogo iazyka.[40] |
| Slovak | 200,000 | Slovník slovenského jazyka, 1959–1968; Slovník súčasného slovenského jazyka A–G, H–L, M–N, 2006, 2011, 2015 | Figure based on information about the number of words in the Slovak language published by Jazykovedný ústav Ľ. Štúra SAV. |
| Hindi | 183,175 | Wiktionary, Hindi language version | A free dictionary that gives everyone the right to edit, in which 183,175 words from many languages are available.[41] |
| Romanian | 180,000 | dexonline | Online dictionary. A project, launched in 2001, to digitise 67 general, specialty and archaic dictionaries. As of 2013, it contained over 180,000 unique words and 576,000 definitions. |
| English | 171,476 | Oxford English Dictionary, Second Edition | The dictionary has 273,000 headwords: 171,476 in current use, 47,156 obsolete, and around 9,500 derivative words included as subentries. It contains 157,000 combinations and derivatives in bold type and 169,000 phrases and combinations in bold italic type, making a total of over 600,000 word-forms.[42][43] One count puts the English vocabulary at about 1 million words, but that count presumably includes words such as Latin species names, prefixed and suffixed words, scientific terminology, jargon, foreign words of extremely limited English use, and technical acronyms.[44][45][46] |
| Kazakh | 166,000 | 15 томдық "Қазақ тілінің түсіндірме сөздігі" | Explanatory dictionary of the Kazakh language, in 15 volumes.[47] |
| Russian | 150,000 | Большой академический словарь русского языка | Great Academic Dictionary of the Russian Language.[48] |
| Belarusian | 150,000 | Слоўнік беларускай мовы | [49] |
| Polish | 140,000 | Wielki słownik ortograficzny PWN | The PWN great orthographic dictionary contains new words, proper nouns and the latest spelling changes. |
| French | 135,000 | Trésor de la Langue Française informatisé | ATILF[50] (Analyse et Traitement Informatique de la Langue Française; Computer Processing and Analysis of the French Language). 135,000 (Larousse Dictionnaire de français, published by Editions Larousse).[51][52] |
| Ukrainian | 134,058 | Словник української мови (The Dictionary of the Ukrainian language) | The dictionary was finished in the late 1970s to early 1980s.[53][54] |
| Russian | 130,000 | Большой толковый словарь русского языка | Great Explanatory Dictionary of the Russian Language.[55] |
| Indonesian | 127,036 | Kamus Besar Bahasa Indonesia, 5th edition, 2016 | |
| Eastern Armenian | 125,000 | Ժամանակակից հայոց լեզվի բացատրական բառարան | Žamanakakic’ hayoc’ lezvi bac’atrakan baṙaran.[56] |
| Tamil | 124,405 | University of Madras Tamil Lexicon | The dictionary includes 124,405 separate entries.[57] |
| Arabic | 120,000 | Tāj al-ʿArūs Min Jawāhir al-Qāmūs | The dictionary includes 120,000 entries filling 40 volumes; a single entry comprises dozens of words. |
| Bulgarian | 119,200 | Dictionary of the Bulgarian Language (monolingual academic explanatory dictionary), (Многотомен) Речник на българския език in Bulgarian, in 15+ volumes | This dictionary covers vocabulary from the last 150 years of the Bulgarian language and is compiled and edited by linguists (primarily native lexicographers and lexicologists) from the Institute for the Bulgarian Language (part of the Bulgarian Academy of Sciences). It includes basic, commonly used, literary, colloquial, dialectal, archaic and obsolete Bulgarian words, as well as some specialized terminology. The latest (15th) volume, published in 2015, ends with headwords beginning with the (Bulgarian Cyrillic) letter Р.[58] |
| Turkish | 114,767 | Güncel Türkçe Sözlük | Online dictionary of the Turkish Language Association.[59] |
| Belarusian | 112,462 | Skarnik | As of August 2019, this Belarusian-Russian online dictionary contains 112,462 words.[60] |
| Slovene | 110,180 | Slovar slovenskega knjižnega jezika, Second edition, 2014 | The official dictionary of modern Slovene is Slovar slovenskega knjižnega jezika (SSKJ; Standard Slovene Dictionary). It was published in five volumes by Državna Založba Slovenije between 1970 and 1991 and contains more than 100,000 entries and subentries with accentuation, part-of-speech labels, common collocations, and various qualifiers. An electronic version of the dictionary was published in the 1990s and is available online.[61] |
| Finnish | 102,174 | Kielitoimiston sanakirja, 2018 | Online dictionary. The Institute for the Languages of Finland (a governmental institute) has selected the core vocabulary, and many headwords are not included.[62] |
| Afrikaans | 100,000 | Handwoordeboek van die Afrikaanse Taal (HAT), 2015 | The new 6th edition contains 3,228 new keywords and 5,365 meanings.[63] |
| Polish | 100,000 | Słownik języka polskiego PWN | The PWN Polish dictionary contains about 100,000 articles and 145,000 definitions.[64] |
| French | 100,000 | Dictionnaire Le Grand Robert de la langue française, 2019 | Contains 100,000 words and 350,000 definitions.[65] |
| German | 100,000 | Österreichisches Wörterbuch, 2018 | Official dictionary of the German language in the Republic of Austria.[66] |
| Spanish | 93,000 | Diccionario de la lengua española de la Real Academia Española, 23rd edition, 2014 | [67] |
| Spanish | 90,000 | Diccionario de uso del español, 2007 | Contains 90,000 keywords and 190,000 meanings. |
| Dutch | 90,000 | Van Dale, 14th edition, 2005 | [68] |
| Catalan | 88,500 | Gran Diccionari de la llengua catalana (Great Dictionary of the Catalan language, includes the definitions in the Diccionari de la llengua catalana) | Contains 88,500 headwords and 172,000 definitions.[69] |
| Chinese | 85,568 | Zhonghua Zihai | Number of different characters in use over three millennia of written history. The Hanyu Da Cidian defines some 370,000 words.[70][71][72] |
| Malaysian | 82,900 | Kamus Dewan, 4th Edition, 2005 | |
| Chechen | 70,000 | Словарь Чеченского языка | |
| Romanian | 67,000 | Dicționarul explicativ al limbii române | Published by the Romanian Academy. |
| Tamazight | 60,000 | Grand dictionnaire Français-Tamazight | Written by Abdelhafed Idres. |
| Galician | 59,999 | Dicionario da Real Academia Galega (Dictionary of the Royal Galician Academy) | [73] |
| Western Armenian | 56,000 | Հայոց լեզուի նոր բառարան | Hayoc’ lezowi nor baṙaran.[74] |
| Tatar | 56,000 | Татарско-русский словарь Ш.Н. Асылгараева, Ф.А. Ганиева, М.З. Закиева, К.М. Миннуллина, Д.Б. Рамазанова | Tatar-Russian dictionary of Sh.N. Asylgaraev, F.A. Ganiev, M.Z. Zakiyev, K.M. Minnullin, D.B. Ramazanova.[75] |
| Turkmen | 50,000 | Türkmen diliniň düşündirişli sözlügi | Turkmen Explanatory Dictionary.[76] |
| Azerbaijani | 44,750 | Azərbaycan dilinin izahlı lüğəti | Azerbaijani Explanatory Dictionary.[77] |
| Bashkir | 40,000 | Башкирско-русский словарь Ураксин З.Г. | Bashkir-Russian dictionary, Uraksin Z. G.[78] |
| Chuvash | 40,000 | Чувашско-русский словарь Скворцова М. И. | Chuvash-Russian dictionary, Skvortsova M. I.[79] |
| Dargwa | 40,000 | Даргинско-русский словарь Юсупова Х. А. | Dargwa-Russian dictionary of Yusupov H. A.[80] |
| Classical Latin | 39,589 | Oxford Latin Dictionary | Includes 39,589 Classical Latin entries, including borrowings from Greek, Gaulish, other Italic dialects, Sanskrit, and others. There are about 10,000 adjectives, 2,123 adverbs, 46 conjunctions, 77 interjections, 17,450 nouns, 26 particles, 39 prepositions, 17 pronouns, and 5,986 verbs. The remaining entries are references to other entries (such as alternate spellings or archaic versions), prefixes, suffixes, and terms left untranslated by the original editors.[81] |
| Avar | 36,000 | Аварско-русский словарь Гимбатова М. М. | Avar-Russian dictionary of Gimbatov M. M. |
| Arabic | 32,300 | Lexicon of the Modern Arabic Language | |
| Quechua | 20,000 | Diccionario Quechua-Español Lira Jorge | Quechua-Spanish dictionary, Lira Jorge.[82] |
| Esperanto | 16,780 | Plena Ilustrita Vortaro de Esperanto (Complete Illustrated Dictionary of Esperanto) | 46,890 lexical units.[83] |

References

  1. "How many words are there in the English language?". Oxford English Dictionary. Retrieved 14 August 2018.
  2. "The Biggest Vocabulary?". The Economist. 23 June 2010. Retrieved 14 August 2018.
  3. "우리말샘 - 사전 통계". 우리말샘. Retrieved 2017-11-09.[dead link]
  4. "RedFox Sanakirja: Sisältöpäivitykset" (in Finnish). Redfox languages Oy. Retrieved 2019-08-08.
  5. "Suomen suurin sanakirja julkaistiin ilmaisena netissä" (in Finnish). Yle. Retrieved 2019-08-08.
  6. "Büyük Türkçe Sözlük". Turkish Language Association. Retrieved 27 November 2017.
  7. "Engelsk har næppe flere ord end dansk". videnskab.dk. Retrieved 2016-08-14.
  8. SAOL. Svenska Akademien. Retrieved 14 August 2016.
  9. "Hvað eru til mörg orð í íslensku?". University of Iceland. Retrieved 7 August 2016.
  10. "Statistics". en.wiktionary.org. Retrieved 2020-05-29.
  11. "Grande dizionario Hoepli italiano". Retrieved 24 March 2020.
  12. "Dizionario di italiano". Retrieved 24 March 2020.
  13. "Quante parole ci sono nel dizionario italiano?". Retrieved 23 February 2017.
  14. "NIHON KOKUGO DAIJITEN". Indiana University. Retrieved 7 August 2016.
  15. "Dictionary of the Lithuanian Language". 2002. Archived from the original on 2017-08-11. Retrieved April 19, 2018.
  16. "How many words are there in English?". www.merriam-webster.com. Retrieved 2018-05-15.
  17. "Woordenboek der Nederlandsche Taal". Retrieved 9 August 2017.
  18. "Government to launch Tamil word bank". The Hindu. Special Correspondent. 2019-02-05. ISSN 0971-751X. Retrieved 2020-05-10.
  19. 汉语大词典_百度百科. Retrieved 17 June 2019.
  20. Soukhanov, Anne H. (1992). The American Heritage Dictionary of the English Language. ISBN 0395448956.
  21. "Suomen murteiden sanakirja" (in Finnish). Institute for the Languages of Finland. Retrieved 2019-08-08.
  22. "Suomen murteiden sana-arkisto: Pääkokoelma" (in Finnish). Institute for the Languages of Finland. Retrieved 2019-08-08.
  23. https://icps.ut.ac.ir/talif-f.html
  24. https://en.wikipedia.org/wiki/Dehkhoda_Dictionary
  25. "Antall ord i norsk". Språkrådet (in Norwegian). Retrieved 2017-09-29.
  26. "Großes Wörterbuch der deutschen Sprache". Berlin-Brandenburg Academy of Sciences and Humanities. Retrieved 16 August 2016.
  27. "Deutsches Wörterbuch von Jacob Grimm und Wilhelm Grimm (The German Dictionary of the Brothers Grimm)". Berlin-Brandenburg Academy of Sciences and Humanities & Heidelberg Academy of Sciences and Humanities. Retrieved 16 August 2016. Also available online.
  28. "Antall ord i norsk". Språkrådet (in Norwegian). Retrieved 2017-09-29.
  29. "What is Bhagavadgomanal?". bhagavadgomandalonline.com. Pravin Prakashan Pvt. Ltd. Retrieved 2018-03-04.
  30. "New Urdu dictionary heavily influenced by TV news". tribune. Retrieved 11 February 2020.
  31. "Великий зведений орфографічний словник сучасної української лексики". 2003.
  32. "Великий зведений орфографічний словник сучасної української лексики".
  33. "Příruční slovník jazyka českého". www.ujc.cas.cz.
  34. "Portuguese words you need to know to build your Portuguese vocabulary". www.mondly.com.
  35. Bogutović, Dragan (10 August 2018). "Rečnik SANU: Još pola veka do slova "Š"". Večernje novosti. Retrieved 30 December 2019.
  36. [1]
  37. https://wordnet.princeton.edu
  38. Kolehmainen, Taru: "Viisikymmentä vuotta Nykysuomen sanakirjan alullepanosta" (in Finnish). Institute for the Languages of Finland. Retrieved 2019-08-08. Kielikello 10/1977.
  39. Ordbog over det danske Sprog, Society for Danish Language and Literature
  40. Толковый словарь живого великорусского языка. Современное написание. В 4-х томах.
  41. hi.wiktionary.org https://hi.wiktionary.org/wiki/%E0%A4%B5%E0%A4%BF%E0%A4%B6%E0%A5%87%E0%A4%B7:Statistics. Retrieved 2019-05-11.
  42. Robert McCrum, William Cran, & Robert MacNeil. The Story of English. New York: Penguin, 1992: 1
  43. Oxford English Dictionary, Second Edition, Volume 1. Oxford University Press, 1989.
  44. Algeo 1999.
  45. "How many words are there in the English language?". Oxford English Dictionary. Retrieved 6 August 2016.
  46. Bas Aarts; Sylvia Chalker; Edmund Weiner (16 January 2014). The Oxford Dictionary of English Grammar. OUP Oxford. pp. 436–. ISBN 978-0-19-107900-9.
  47. https://www.inform.kz/kz/15-tomdyk-kazak-adebi-tili-sozdiginin-tusaukeseri-otedi_a2428253
  48. [2]
  49. [3]
  50. "Presentation - Site du laboratoire ATILF". www.atilf.fr. Retrieved 2016-10-28.
  51. "French Dictionary". Larousse. Retrieved 28 October 2016.
  52. "TLFi : Trésor de la Langue Française informatisé - Site du laboratoire ATILF". www.atilf.fr (in French). Retrieved 2016-10-28.
  53. [4]
  54. [5]
  55. [6]
  56. [7]
  57. [8]. TheHindu.com. Retrieved 11 July 2017.
  58. "Online edition of the 15 volumes of The (Explanatory) Dictionary of the Bulgarian Language (in Bulgarian)". many publishers, primarily The Academic Publishing House of The Institute for the Bulgarian Language. Retrieved October 29, 2016.
  59. "Güncel Türkçe Sözlük". Turkish Language Association. Retrieved 20 July 2020.
  60. https://www.skarnik.by/belrus
  61. http://www.mladinska.com/sskj/o_drugi_izdaji_sskj
  62. "Kielitoimiston sanakirja" (in Finnish). Institute for the Languages of Finland. Retrieved 2019-08-08.
  63. "takealot.com". m.takealot.com. Retrieved 2020-07-04.
  64. [9]
  65. "Dictionnaire Le Grand Robert de la langue française" (in French). Edition Le Robert. Retrieved 5 August 2019.
  66. "Österreichisches Wörterbuch, 43., aktualisierte Auflage, Buchhandelsausgabe mit Nutzerschlüssel | öbv Österreichischer Bundesverlag Schulbuch GmbH & Co. KG, Wien". www.oebv.at. Retrieved 2020-07-04.
  67. "Presentación" (in Spanish). Real Academia Española. Retrieved 6 August 2016.
  68. "Structure and history of the Dutch language". Free University of Berlin, Department for Dutch Linguistics. September 14, 2009. Retrieved June 26, 2011.
  69. "Gran Diccionari de la llengua catalana | enciclopèdia.cat". www.enciclopedia.cat. Retrieved 2020-06-29.
  70. Shouhui Zhao, Dongbo Zhang, The Totality of Chinese Characters – A Digital Perspective. Archived July 16, 2011, at the Wayback Machine.
  71. Daniel G. Peebles, SCML: A Structural Representation for Chinese Characters, May 29, 2007
  72. Victor H. Mair, Who Has the Biggest Dictionary?, October 9, 2008. Retrieved 25 February 2017.
  73. [10]
  74. [11]
  75. [12]
  76. [13]
  77. [14]
  78. [15]
  79. [16]
  80. [17]
  81. Oxford Latin Dictionary, Oxford University Press, 1968.
  82. https://futatraw.ourproject.org/descargas/DicQuechuaBolivia.pdf
  83. Blahuš, Marek. A Spell Checker for Esperanto. Brno: Masaryk University, Faculty of Informatics, 2008. 40 pp. Bachelor thesis. Text in English. Supervisor RNDr. Petr Sojka, Ph.D. Available online: [18] p. 17