Wikipedia:Typo Team/moss
![]() | This page has a backlog that requires the attention of willing editors. Please remove this notice if and when the backlog is cleared. |
The moss project seeks to find and remove the furry green typos that have been growing on Wikipedia articles. It uses software written by User:Beland to automatically find misspellings, mistakes in English grammar, violations of the Wikipedia:Manual of Style, and confusing or broken wiki markup.
QUICK LINK TO THE BEST PAGE FOR NEW PARTICIPANTS
About misspellings[edit]
How the lists are made[edit]
The moss spell checker is run against a recent set of database dumps, which are generated on the 1st and 20th of every month (but take a few days to process). All the articles in the English Wikipedia are examined. The following are ignored:
- Text inside references, templates, tables, quotation marks, sections like "External links" and "Works", and some other weird places.
- Capitalized words (which are presumed to be correctly-spelled proper nouns)
- Words that appear in titles in the English Wiktionary (which has definitions of all words in all languages, excluding proper nouns and systematic words like chemical names and large numbers)
- Words that appear in titles in the English Wikipedia (which explains some things that don't appear in the dictionary)
- Words that appear in titles in the Wikispecies (which has many technical words that don't appear in the dictionary or encyclopedia)
Many mistakes are not (yet) caught:
- Improper addition of 's (possessives are not added to Wiktionary, so these are excluded systematically)
- Incorrect capitalization
- Incorrect multi-word phrases
- Wrong word used in context
- Non-English language words not tagged with {{lang}} or where an English misspelling happens to be the same as a word in another language. (These are counted as correct spellings if they are in the English Wiktionary, which lists words in all languages – only the definitions are restricted to English.)
- Other situations listed in #False negatives below
2020 statistics[edit]
- See also: Older statistics
In the year from March 2019 to March 2020, moss volunteers fixed over 94,000 typos! The most impressive progress is in the T1 category (single-letter misspellings), where we eliminated about half from the English Wikipedia. During this period we also started fixing missing spaces (focusing on those around punctuation) and those have dropped by about one-fifth. As we make progress, clear misspellings are increasingly mixed in with unclear cases; I'll be doing some more work on separation algorithms to keep the typo reports useful, so you'll probably see some more changes to typo classifications. Thanks to everyone who has been helping out! -- Beland (talk) 16:54, 28 April 2020 (UTC)
Reporting symbol | Explanation | Change from 2019-03-01 to 2020-02-20 | Instances, 2020-04-01 dump (9f6d726) | Instances, 2020-04-20 dump (5ff589d) | Instances, 2020-05-01 dump (1a96ded) | Instances, 2020-05-20 dump (e511f74) | Instances, 2020-06-01 dump (509f79a) | Instances, 2020-06-20 dump (825ceb4) | Instances, 2020-07-01 dump (db9db23) | Instances, 2020-07-20 dump (caa619f) | Instances, 2020-08-01 dump (cf76e8c) | Instances, 2020-08-20 dump (f104e58) | Instances, 2020-09-01 dump (4654d88) | Instances, 2020-09-20 dump (a26ccca) | Instances, 2020-10-01 dump (686f5db) | Instances, 2020-10-20 dump (4f90810) | Instances, 2020-11-01 dump (ac54580) | Instances, 2020-11-20 dump (6dbd61d) | Instances, 2020-12-01 dump (917bcc8) | Instances, 2020-12-20 dump (0b3409d) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
TS | Missing or extra whitespace or dash (or new compound) | -39368 (-21%) | 145297 | 144673 | 331658** | 330624 | 328249 | 325399 | 324179 | 322282 | 321801 | 318621 | 317183 | 315825 | 314747 | 312110 | 310537 | 309386 | 308280 | 308977 |
T1 | Edit distance 1 from common English word | -36192 (-48%) | 41090 | 41081 | 39967 | 39452 | 38783 | 38379 | 38436 | 38271 | 37803 | 36783 | 35976 | 34036 | 33539 | 33764 | 32347 | 33097 | 33559 | 33427 |
T2 | Edit distance 2 from common English word | -7560 (-10%) | 64526 | 63263 | 60690 | 60321 | 59589 | 58603 | 58649 | 58521 | 58200 | 58085 | 57845 | 57329 | 57152 | 57487 | 57387 | 57511 | 57386 | 57348 |
T3 | Edit distance 3 from common English word | -5276 (-7%) | 74396 | 73255 | 70516 | 70039 | 68887 | 68192 | 68149 | 68020 | 67769 | 67788 | 67482 | 67226 | 67025 | 67101 | 67002 | 67213 | 67298 | 67399 |
R | Regular word (A-Z only) not near a common English word | -3525 (-3%) | 97726 | 96916 | 94793 | 93855 | 93252 | 91537 | 91489 | 91746 | 91521 | 91729 | 91513 | 91613 | 91339 | 91813 | 92329 | 93246 | 93377 | 93493 |
I | Definitely not English (International) due to accents or mixed with punctuation (other than hyphen) | -22196 (-24%) | 72151 | 69118 | 65842 | 64827 | 63630 | 61844 | 61888 | 61782 | 61899 | 62113 | 61916 | 62003 | 62049 | 62274 | 62287 | 62390 | 62234 | 62471 |
W | Not in English Wiktionary, in non-English Wiktionary | -6764 (-8%) | 75913 | 74351 | 86935 | 85604 | 83173 | 81894 | 81946 | 82173 | 81943 | 82170 | 81912 | 81968 | 81792 | 81256 | 81052 | 81224 | 81131 | 81192 |
L | Probable Romanization (transLiteration) | +81 (+2%) | 4435 | 4486 | 4266 | 4199 | 4120 | 4122 | 4104 | 4113 | 4137 | 4140 | 4151 | 4164 | 4165 | 4207 | 4203 | 4234 | 4240 | 4260 |
ME | Probable coMpound, English (with and without dash) | +976 (+2%) | 52269 | 48761 | 47187 | 47153 | 46830 | 46856 | 46967 | 47163 | 47052 | 47170 | 47009 | 47070 | 47066 | 47045 | 47023 | 47193 | 47142 | 47302 |
MI | Probable coMpound, non-English (International) in English Wiktionary (both A-Z and non-ASCII characters, with and without dash) | -18475 (-9%) | 177646 | 176929 | 171484 | 169592 | 166216 | 164828 | 165140 | 165351 | 165605 | 166016 | 166208 | 166499 | 166572 | 167349 | 167961 | 169044 | 168953 | 169409 |
MW | Probable coMpound, found in non-English Wiktionary | -5544 (-11%) | 46113 | 45103 | 43501 | 42931 | 40436 | 41383 | 41325 | 41440 | 41173 | 41234 | 40990 | 40956 | 40795 | 40353 | 40272 | 40454 | 40411 | 40338 |
ML | Probable coMpound, transLiteration | -124 (-3%) | 3909 | 3874 | 3707 | 3663 | 3672 | 3575 | 3589 | 3593 | 3628 | 3639 | 3658 | 3717 | 3724 | 3779 | 3769 | 3825 | 3830 | 3822 |
C | Chemistry words | -176 (-9%) | 1782 | 7564 | 7530 | 7644 | 7640 | 7655 | 7658 | 7659 | 7660 | 7662 | 7654 | 7644 | 7659 | 7661 | 7665 | 7659 | 7674 | 7700 |
N | A-Z plus numbers and hyphens | -1391 (-5%) | 25209 | 23813 | 22650 | 22511 | 22290 | 22020 | 22052 | 22053 | 21971 | 22009 | 21960 | 21923 | 21879 | 21856 | 21885 | 21898 | 21893 | 21943 |
Z | Decimal fraction missing leading Zero | - | 47* | 0* | 11405** | 11418 | 11414 | 11398 | 11402 | 11421 | 11455 | 11530 | 11546 | 11578 | 11598 | 11669 | 11683 | 11703 | 11728 | 11762 |
P | Patterns (e.g. rhyme schemes) | -20 (-43%) | 27 | 28 | 7 | 9 | 7 | 7 | 3 | 2 | 2 | 4 | 5 | 4 | 5 | 5 | 4 | 5 | 5 | 5 |
H | HTML/XML/SGML tag | -539 (-15%) | 3010 | 2886 | 2938 | 2903 | 2904 | 2848 | 2693 | 2697 | 2680 | 2747 | 2757 | 2729 | 2565 | 2569 | 2542 | 2538 | 2540 | 2572 |
HB | Known bad HTML tag, like <font> | -1080 (-7%) | 14465 | 14121 | 12903 | 13928 | 12919 | 14733 | 14022 | 11428 | 11670 | 11198 | 10191 | 8860 | 8756 | 8842 | 9725 | 11088 | 10164 | 10556 |
HL | Bad HTML-like linking, like <http://...> | -98 (-19%) | 414 | 418 | 377 | 394 | 394 | 421 | 408 | 425 | 420 | 413 | 373 | 359 | 356 | 329 | 324 | 315 | 318 | 328 |
U | URL | -94 (-7%, from 2019-03-20) | 1179 | 1152 | 1118 | 1134 | 1117 | 1122 | 1129 | 1124 | 1120 | 1124 | 1124 | 1103 | 1101 | 1099 | 1091 | 1096 | 1050 | 1055 |
BC | Bad characters | -12678 (-6%, from 2019-09-01) | 192230 | 190482 | 186651 | 186517 | 185572 | 178698 | 175325 | 166116 | 159095 | 124158 | 112959 | 112755 | 112695 | 112633 | 112479 | 110608 | 110025 | 109808 |
BW | Bad words | -6542 (-5%, from 2019-09-20) | 113682 | 106327 | 381288** | 380259 | 378710 | 374982 | 375107 | 375206 | 375431 | 375306 | 374622 | 374740 | 374560 | 375010 | 375008 | 375557 | 374989 | 375663 |
Total | -39115 (-3%, from 2019-09-20) | 1207516 instances | 1188601 instances | 1647413** instances | 1638977 instances | 1619804 instances | 1600496 instances | 1595660 instances | 1582586 instances | 1574035 instances | 1535639 instances | 1519034 instances | 1514101 instances | 1511139 instances | 1510211 instances | 1508575 instances | 1511284 instances | 1508227 instances | 1510830 instances | |
Parse failure | Mismatched punctuation | -5145 (-3%) | 154084 articles + 40705 MOS:STRAIGHT violations | 153033 articles + 40838 MOS:STRAIGHT violations | 214365 articles + 37697 MOS:STRAIGHT violations | 214463 articles + 37667 MOS:STRAIGHT violations | 214101 articles + 37607 MOS:STRAIGHT violations | 214465 articles + 37767 MOS:STRAIGHT violations | 214732 articles + 37849 MOS:STRAIGHT violations | 215081 articles + 37993 MOS:STRAIGHT violations | 215447 articles + 38067 MOS:STRAIGHT violations | 215915 articles + 38169 MOS:STRAIGHT violations | 216227 articles + 38210 MOS:STRAIGHT violations | 216472 articles + 38205 MOS:STRAIGHT violations | 216738 articles + 38213 MOS:STRAIGHT violations | 216991 articles + 38246 MOS:STRAIGHT violations | 217192 articles + 38338 MOS:STRAIGHT violations | 217660 articles + 38498 MOS:STRAIGHT violations | 217861 articles + 38625 MOS:STRAIGHT violations | 218207 articles + 38789 MOS:STRAIGHT violations |
- red = Probably need to fix
- yellow = Unsorted
- blue = Probably OK (but may need to verify)
- bold = actively working on fixing
* Identification of Z was broken
** Affected by major bug fix for counting inter-word typos (e.g. involving punctuation)
2021 statistics[edit]
Dump (moss version) | Parse failures (articles + articles with MOS:STRAIGHT violations) | TOTAL (instances) | BC | BW | C | H | HB | HL | I | L | ME | MI | ML | MW | N | P | R | T1 | T2 | T3 | TS | U | W | Z |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
2021-01-01 (b4af24a) | 218317 + 38841 | 1505808 | 108661 | 375875 | 7705 | 2550 | 10726 | 311 | 62583 | 4262 | 47274 | 169504 | 3841 | 40131 | 21954 | 4 | 93373 | 32968 | 56903 | 66819 | 306445 | 1054 | 81112 | 11753 |
2021-01-20 (a249b2d) | 218455 + 38930 | 1506940 | 108030 | 376079 | 7679 | 2616 | 11036 | 298 | 62746 | 4298 | 47044 | 170234 | 3885 | 39960 | 21959 | 4 | 93467 | 33598 | 56688 | 66688 | 306776 | 1042 | 81049 | 11764 |
2021-02-01 (8279235) | 218833 + 38960 | 1506004 | 107000 | 375979 | 7677 | 2595 | 11729 | 298 | 62829 | 4305 | 47053 | 171005 | 3888 | 39771 | 21971 | 2 | 93726 | 33237 | 56822 | 66707 | 305573 | 1035 | 81079 | 11723 |
2021-02-20 (2f00c51) | 218991 + 39035 | 1504064 | 106534 | 375909 | 7682 | 2602 | 11697 | 275 | 62942 | 4342 | 47036 | 171313 | 3897 | 39732 | 22009 | 3 | 93959 | 32705 | 56529 | 66617 | 304463 | 1020 | 81041 | 11757 |
2021-03-01 (248159a) | 219198 + 39155 | 1494162 | 106421 | 376305 | 7669 | 2624 | 9291 | 281 | 62978 | 4328 | 46830 | 169666 | 3876 | 39189 | 21936 | 4 | 92221 | 32762 | 56197 | 66069 | 302377 | 1020 | 80338 | 11780 |
2021-03-20 (57aaae7) | 219556 + 39371 | 1492923 | 106284 | 375853 | 7695 | 2610 | 9965 | 278 | 63055 | 4331 | 47064 | 170453 | 3880 | 39172 | 21998 | 2 | 92721 | 32523 | 56052 | 66087 | 299751 | 1002 | 80305 | 11842 |
2021-04-01 (d47c725) | 219692 + 39478 | 1484879 | 105670 | 375757 | 7697 | 2620 | 8857 | 205 | 62842 | 4309 | 46966 | 170369 | 3884 | 38886 | 21964 | 0 | 92575 | 32160 | 55810 | 65706 | 296009 | 995 | 79736 | 11862 |
Instructions for editors[edit]
Just like a regular spell checker, sometimes a word that's highlighted is really a misspelling and should be changed, but sometimes it is a correct spelling that needs to be added to the spell checker's dictionary (which in this case is the English Wiktionary and Wikispecies). For the below lists, here's how you can help:
- For spelling mistakes: Click on the links to the individual Wikipedia articles, and edit them to correct the misspelling. Make sure this is actually a misspelling, and not a technical term that needs to be better explained, or an alternate spelling (possibly from a different regional variety of English).
- For non-English words (including words from Old English and Middle English, since they are pronounced differently): Edit the article and use the {{lang}} or {{transl}} templates to mark all non-English passages. Template contents are ignored, so they will not show up in the next report. If you can define the word, it would still be helpful to add the non-English word to the English Wiktionary or the same-language Wiktionary if you speak that language. As of the March 20, 2019 dump, only words not found in any Wiktionary are reported by moss as misspellings. (The "home" Wiktionary for Old and Middle English words is the modern English one.)
- If you don't know which language is being used, you can tag it with {{which lang}}. If you add a "reason=" parameter, that will change the pop-up tooltip text readers will see when they hover over "what language is this?". If you have a guess as to which language it might be, or any other question or comment, you can leave that here to help future editors. If you use this tag, you can delete the article from the moss listing; the article will be added to Category:Articles with unidentified words instead, and ignored by future runs of moss until the mystery is solved.
- For languages that don't have a code (often happens with historical languages), use "mis" and add an HTML comment indicating the language. For example: {{lang|mis|sharbe do kin ratz}}<!-- Old Runish -->
- For incorrect spellings in direct quotes:
- These shouldn't be picked up by the spell checker, as text in double quotes "" is ignored. The article probably has incorrect punctuation.
- Regardless of punctuation problems, you can add {{sic}} around the word or phrase. See Wikipedia:Manual of Style#Quotations for guidance.
- For correct spellings that belong in the dictionary: Click on the word to add it to the English Wiktionary. Remember the word might not be English (though the definition must be) and be sure to check capitalization!
- For correct spellings already in the dictionary: Delete from the list or strike through; these have been added in the meantime since the database dump by other editors. They do not automatically turn red as internal Wikipedia links do.
- For correct spellings not appropriate for Wiktionary:
- For DNA sequences, add {{DNA sequence}} around it.
- For species, add the whole name to Wikispecies:Wikispecies:Requested articles#From_Wikipedia and it will be suppressed from future runs.
- For proper nouns and (including non-English titles) that aren't capitalized, put inside a {{proper name}} tag.
- Use <code></code> or similar tags for computer programs; see Wikipedia:WikiProject_Computer_science/Manual_of_style#Code_samples.
- For terms that are only relevant to one Wikipedia article (and for which the article makes clear the definition) consider creating a redirect to the article. As long as the "typo" word is in the title (as a whole word), it won't show up as a mistake in future spell checks.
- {{IPA}} or {{respell}} can be used for word pronunciations. See Wikipedia:Manual of Style/Pronunciation for details.
- For bird calls: Treat these as foreign-language words or words-as-words and put them in italics, following MOS:ITALICS. Put the call inside {{not a typo}} so it won't show up on moss spell check reports. (It doesn't matter if the double apostrophes that make the italics go inside or outside the template.)
- Anything else, add {{not a typo}} around it (for example, nonsense series of letters used as examples in puzzles).
- Correct or incorrect, when finished delete or
strike outthe entry for the word from the lists on this page (or subpages), so work won't be duplicated. It is preferred to delete the entry for sections that rotate through specific letters, and strikethrough for sections where the whole thing gets updated (to prevent duplicating work done while the dumps were being processed, which can take more than a week). - If an article or section has generally bad grammar, and you don't have time to fix the whole thing, just add {{copyedit}} at the top of the article or {{copyedit|section}} at the top of the affected section. If it's just a sentence or two, {{copy edit inline}} or {{incomprehensible inline}} can go at the end of the problem passage.
- If you see errors being reported from footnotes or bibliographies, check to make sure the section is titled with a standard name following MOS:APPENDIX conventions. Standard end-matter sections like "References" and "Further reading" and "Works" are ignored.
- If it helps to leave a message on the article's talk page asking if the word is correct or incorrect, you can use Template:Typo help like this when editing the bottom of the talk page (leave the section header blank; it will automatically be added):
- {{subst:typo help|PUT WORD HERE}} -- ~~~~
- NEW: If you are uncertain whether a word is spelt correctly or not, you can add {{typo help inline}} immediately after it. If you add a "reason=" parameter, that will change the pop-up tooltip text readers will see when they hover over "check spelling". You can add a specific question or comment that may help identification. If you use this tag, you can delete the article from the moss listing; the article will be added to Category:Articles with unidentified words instead, and ignored by future runs of moss until the mystery is solved.
Don't worry if you miss something; it will reappear in a future report if there are still mistakes.
Suggested edit summaries[edit]
If you want to help publicize this project, you can copy-and-paste these into your edit summary, if appropriate.
For Wikipedia edits:
- Fix misspelling found by [[Wikipedia:Typo Team/moss]] – you can help!
- Tag non-English text found by [[Wikipedia:Typo Team/moss]] – you can help!
- Tag correct text as {{not a typo}} for automated spell checkers (including [[Wikipedia:Typo Team/moss]])
- Fix mismatched quote marks found by [[Wikipedia:Typo Team/moss]] – you can help!
For Wiktionary edits:
- Add word identified by [[w:Wikipedia:Typo Team/moss]] – you can help!
Wiktionary cheat sheet[edit]
Need to add a word to Wiktionary? The Wiktionary cheat sheet has copy-and-paste templates that make it easy for the types of words commonly encountered here, even if you've never done it before.
Misspellings - lists of things to fix[edit]
Likely misspellings by article (main listing)[edit]
The most efficient list to work on if all you want to do is fix misspellings. These listings try to list all the typos from a given article, so they can be fixed all at once. It also tries to only show typos that legitimately need fixing. It's not perfect, so a few words found need to be added to Wiktionary or tagged as not English, not a typo, etc. Only a few letters are updated on each run, to avoid stale listings as the whole list takes far longer than two weeks to work through. (This also avoids duplicating recent work when listings are refreshed.)
See subpages due to length:
- Wikipedia:Typo Team/moss/before A - Completed 2020-04-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/A - Completed 2020-05-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/B - Completed 2020-06-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/C - Completed 2020-07-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/D - Completed 2020-08-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/E - Completed 2020-08-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/F - Completed 2020-09-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/G - Completed 2020-09-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/H - Completed 2020-10-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/I - Completed 2020-10-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/J - Completed 2021-01-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/K - Completed 2021-02-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/L - Typos ready for fixing from 2021-03-01 dump
- Wikipedia:Typo Team/moss/M - Typos ready for fixing from 2021-04-01 dump
- Wikipedia:Typo Team/moss/N - Completed 2019-05-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/O - Completed 2019-05-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/P - Completed 2019-06-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/Q - Completed 2020-05-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/R - Completed 2019-07-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/S - Completed 2019-07-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/T - Completed 2019-08-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/U - Completed 2019-08-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/V - Completed 2019-08-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/W - Completed 2019-08-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/X - Completed 2019-03-20 dump, currently empty
- Wikipedia:Typo Team/moss/Y - Completed 2019-05-01 dump, currently empty
- Wikipedia:Typo Team/moss/Z - Completed 2019-03-20 dump, currently empty
- Wikipedia:Typo Team/moss/after Z - Completed 2019-03-20 dump, currently empty
Notes:
- For more cases that require investigation, see Category:Articles with unidentified words.
- Due to length and an increased number of false positives, typo reports for dumps 2020-05-20 and later don't include T2, T3, and TS+BRACKET.
Likely misspellings by frequency (a-m)[edit]
The best list to work on if you want to eliminate all instances of a specific typo. Only typos that are very close to known words are shown. The algorithm is not perfect, so some of these may still be words that need to be added to Wiktionary. For each run, only words from half of the alphabet are shown, to avoid duplicate work from when new dumps are being processed.
Legitimate misspellings are candidates for Wikipedia:Lists of common misspellings. If there is an obvious correction, adding that to Wikipedia:Lists of common misspellings/For machines will help editors who use automated tools to fix cases faster.
- 30 -
wikt:decomissioned - 2EC, Badstuestræde 7, British Rail Classes 101 and 102, Californium neutron flux multiplier, Costa Victoria ... find all - 22 -
wikt:iqtas - Assaf dynasty, Buhturids, Iltutmish, Razia Sultana, Toghrul III ... find all - 20 -
wikt:kurals - Aram (Kural book), Impact of the Tirukkural, Inbam (Kural book), M. Gopala Krishna Iyer, Parimelalhagar ... find all - 19 -
wikt:aforomentioned - List of Afghanistan One Day International cricket records, List of Afghanistan Twenty20 International cricket records, List of Australia One Day International cricket records, List of Australia Twenty20 International cricket records, List of Bangladesh Twenty20 International cricket records ... find all - 14 -
wikt:localitites - Aistala, Anulia, Cooper's Camp, Gopalpur, Nadia, Habibpur, Nadia ... find all - 14 - wikt:deuls - Bardhaman, Bengal temple architecture, Chapari, Gumut, Bankura, Kanki, Purulia ... find all
- 14 -
wikt:assiastant - Revoir un printemps, The Imperial (Flipmode Squad album) ... find all 13 - wikt:firts - 2000–01 Azadegan League, List of Afghanistan Twenty20 International cricket records, List of Australia Twenty20 International cricket records, List of Bangladesh Twenty20 International cricket records, List of England Twenty20 International cricket records ... find all- 12 - wikt:efuses - IBM eFUSE, Programmable ROM, Semiconductor device fabrication, Trusted execution environment ... find all
- 12 -
wikt:dircted - Information theory, Maurizio Micheli ... find all - 12 -
wikt:againt - 1985–86 FC Basel season, Daniel Guijo-Velasco, Dmitry Valent, Finnish Anarchist Association, Marcelo Salas ... find all 11 - wikt:decleared - 2021 Croix-des-Bouquets jailbreak, Air Busan, Dhalkebar, Floriana F.C., Frédéric Prinz von Anhalt ... find all- 11 -
wikt:cakwe - Betawi cuisine, Breakfast, Bubur ayam, Indonesian cuisine, Kue ... find all - 11 -
wikt:annouced - Andras Kemeny, Bronagh Waugh, Derwent Power Station, Dragon Ball Z: Ultimate Battle 22, Loui Batley ... find all - 10 -
wikt:inclduing - Art Gallery of Ballarat, Bob Dylan and the Band 1974 Tour, European multilateral defence procurement, External relations of Jersey, Healthcare in Jersey ... find all - 10 -
wikt:inagural - Ariel Martínez, Aylestone Road, Georgia Championship Wrestling, Lucha Underground, MLL–PLL merger ... find all - 10 - wikt:daees - List of Dai of the Dawoodi Bohra, Sulaymani ... find all
- 10 -
wikt:compariot - Anna Blinkova, Irina-Camelia Begu, Karolína Muchová career statistics, Paula Badosa, Shelby Rogers ... find all - 9 -
wikt:currenty - Ahmad Syaikhu, BNSF Railway (Metra), Bobby Adhityo Rizaldi, Frogmore, Hampshire, MSC Adams ... find all - 9 -
wikt:belived - 2016 in hip hop music, Charles Henderson (Nevada politician), Dildarnagar, Ghazipur, Fugitive Doctor, Jaleshwar Mahadev Temple ... find all - 9 -
wikt:alonside - Arlete Bombe, Frank Philip Bowden, Fury UK, Madi Diaz, Milanka Brooks ... find all - 8 -
wikt:fradulent - 1992 Ghanaian parliamentary election, Drumlougher, Dutch childcare benefits scandal, Government of Cristina Cifuentes, Government of Ángel Garrido ... find all - 8 -
wikt:euxenic - Banded iron formation, Glossary of environmental science, Great Oxidation Event, Onate Formation ... find all - 8 -
wikt:eigthteen - Soviet frigate Bodryy, Soviet frigate Deyatelnyy, Soviet frigate Doblestnyy, Soviet frigate Dostoynyy, Soviet frigate Druzhnyy ... find all - 8 -
wikt:diamter - Acacia aulacophylla, Acacia imitans, Acacia kenneallyi, Acacia pachyphylla, Acacia pravissima ... find all - 8 -
wikt:constrast - Brønderslev, Iamnot, Lorentz–Heaviside units, Madkhalism, Mariano Araneta ... find all - 8 - wikt:cabaya - Kebaya, Malaysian cultural outfits ... find all
- 7 -
wikt:madake - Baren (printing tool), Phyllostachys bambusoides, Shakuhachi, Ōita Prefecture ... find all - 7 -
wikt:idirect - Ip.access, O3b mPOWER, Robert T. Dail, Sevsat, Very-small-aperture terminal ... find all - 7 - wikt:garhs - Garhwal division, Lunahar, Panchagarh District, Sangram Shah, Tehri Garhwal district ... find all
- 7 - wikt:eview - ANU Press, International HL7 Implementations, Java Caps, Transactive energy ... find all
- 7 -
wikt:entirerly - Szwederowo district, Bydgoszcz, Transformers Autobots, Transformers Decepticons, Transformers Revenge of the Fallen: Decepticons, Transformers: War for Cybertron (Nintendo DS)... find all - 7 - wikt:econtact - Antti Sakari Saario, Arne Eigenfeldt, Biosignal, Eldad Tsabary, Gordon Fitzell ... find all
- 7 -
wikt:earthern - Coronation of Mindon Min, Isle of Dogs, Nagapattinam, Panshet Dam, Sadar (festival)... find all - 7 -
wikt:disseration - Annie Cuyt, Jason Josephson Storm, Selenne Bañuelos, Stephen Fuchs, The Myth of Disenchantment ... find all - 7 -
wikt:desedents - Chaunsa, Dewaitha, Mania, Dildarnagar, Qasimabad Estate ... find all - 7 -
wikt:conclussive - 1916 Copa de Honor Cousenier, 1933 Copa Beccar Varela Final, 1936 Copa Aldao, 1937 Copa Aldao, 1939 Copa Aldao ... find all - 7 -
wikt:avaiable - Henry Horton State Park, Isa Pa with Feelings, Level crossing, The Bassmachine, Thorsborne Trail... find all - 7 -
wikt:announed - Audi Q3, Google Workspace, MS Cruise Sardegna, Paya, Inc., SAE J2954 ... find all - 6 -
wikt:momument - Memorial to Heroes of the Marine Engine Room, Military Intelligence Service (United States), Military history of Canada, Organ in the Aa-kerk in Groningen, Wazirabad... find all - 6 -
wikt:mistakely - Class of the Titans, Nojor, Tracy Strauss, Une passion dans le désert, Ó Siadhail ... find all - 6 -
wikt:memeber - AFC member nations major tournament records, AssadUllah Shah, Growing Up Hip Hop, List of My Hero Academia characters, Margot Mayo ... find all - 6 - wikt:jalsas - Rose Garden Palace, Tikatuli ... find all
- 6 -
wikt:industralist - Govindram Seksaria, List of Old Harrovians, Pandora (TV series), Sonja Ziemann, Telford ... find all - 6 -
wikt:indepepdent - 1985 Memphis State Tigers football team, 1987 Memphis State Tigers football team, 1989 Memphis State Tigers football team, 1991 Memphis State Tigers football team, 1993 Memphis State Tigers football team ... find all - 6 - wikt:immersely - Adjoint representation, Lie group ... find all
- 6 -
wikt:hactares - Calvim, Canca, Goa, Marna, Goa, Marra, Goa, Muara Angke Wildlife Reserve ... find all - 6 - wikt:frutis - Economy of Ferizaj, List of non-marine molluscs of Georgia ... find all
- 6 - wikt:filal - Aloor, Kerala, St. Joseph's Church, Aloor ... find all
- 6 - wikt:fgly - Aldehyde tag, Formylglycine-generating enzyme ... find all
- 6 -
wikt:excuding - Benares State, Blackburn Bus Company, Chainpur Estate, Durgavati Canal, Narayan dynasty ... find all - 6 - wikt:evtc - CVVTCS, Nissan VQ engine ... find all
- 6 - wikt:eufs - Solid-state drive, System on a chip, Universal Flash Storage ... find all
- 6 -
wikt:esablished - Go Topless Day, Kingaroy, Mojave Road, Mojave Road (Los Angeles), Santa Fe Trail Historical Park ... find all - 6 - wikt:erosed - Melica glabrescens, Melica hyalina, Melica lilloi, Melica transsilvanica, Melica violacea ... find all
- 6 - wikt:eport - George R. Jensen Jr., Television Bureau of Advertising ... find all
- 6 -
wikt:employess - EDO MBM Technology Ltd v Campaign to Smash EDO, Employee experience design, GCR Class 9D, Rajaram Jaipuria, Toxzon ... find all - 6 - wikt:efund - Fundrise, Hands-On Mobile ... find all
- 6 -
wikt:discoverd - BabyRiki, Erigone atra, Ferrimagnetism, History of Sofia, Iyothee Thass ... find all - 6 -
wikt:diamater - Carabao (mango), Euphorbia flanaganii, PIN diode, Pico (mango), Rubus pectinellus ... find all - 6 -
wikt:developped - Abdul Qayum Sher, Brocade, Catheter lock solution, Chynna Rogers, Institut de pastorale des Dominicains ... find all - 6 -
wikt:desecents - Chainpur, Kaimur, Sherpur, Ghazipur ... find all - 6 -
wikt:constrution - 2020 Central Vietnam floods, Hubli Airport, Pickup truck, Resident Evil: Infinite Darkness, Szwederowo district, Bydgoszcz ... find all - 6 -
wikt:chaffeur - ANZ Bank New Zealand, Jussi Muilu, Namdev Bhau: In Search of Silence, Oft in the Silly Night, Rubí (2004 TV series) ... find all - 6 - wikt:betoel - Oey Tamba Sia, Tan Eng Goan, Thio Tjin Boen, Tjerita Oeij Se ... find all
- 6 - wikt:baands - Sixareen, Yoal ... find all
- 6 - wikt:ayai - Skor daey, Tro (instrument) ... find all
- 6 -
wikt:archieved - Elias Seppänen, Gillian Henrion, Isack Hadjar, Paul Aron, Rafael Villagómez ... find all - 6 -
wikt:apponted - Ely Bannister Soane, Hermann Abbestée, Jeff Essmann, Markus Feldhoff, Queen Jeongseong... find all - 6 -
wikt:agaist - 2021 in Brazil, Hughie Gallacher, Knight, Rio Carnival, Suddala Ashok Teja... find all - 6 -
wikt:accloades - Jyothika, Lech Polcyn, Manasanamaha, Padmapriya Janakiraman, Trisha (actress)... find all
Likely new English compounds by frequency (a-m)[edit]
The best list to work on if you want to add variations of known words to Wiktionary, mostly compound words. The algorithm is not perfect, so some of these might be common mistakes that need to corrected. For each run, only words from half of the alphabet are shown, to avoid duplicate work from when new dumps are being processed.
- 90 - wikt:bellcast - Acadia National Park carriage paths, bridges and gatehouses, Administration Building, Missouri State Fruit Experiment Station, Asbury United Methodist Church (Knoxville, Tennessee), Avondale, Parramatta, Benjamin Franklin Prescott House ... find all
- 83 - wikt:buyrate - Backlash (2003), Badd Blood: In Your House, Beach Brawl, Campbell McLaren, Chuck Liddell vs. Tito Ortiz ... find all
- 63 - wikt:headstroke - Ba (Indic), Bengali language, Bha (Indic), Ca (Indic), Cha (Indic) ... find all
- 55 - wikt:favehotel - Archipelago International ... find all
- 50 - wikt:cornerboards - Abraham Hall, Babson-Alling House, Bangor Elevator, Benjamin Franklin Prescott House, Benoit Apartments ... find all
- 48 - wikt:cellspot - Apamea anceps, Apamea furva, Apamea oblonga, Apamea ophiogramma, Apamea remissa ... find all
- 44 - wikt:lagums - BIP Brewery, Building of the Patriarchate, Belgrade, Gardoš, Gardoš Tower, House at 10 Cara Dušana Street ... find all
- 39 - wikt:flushboarding - Binks Hess House and Barn, Burt Henry Covered Bridge, Call-Bartlett House, Capt. William McGilvery House, Casey House (Mountain Home, Arkansas) ... find all
- 36 - wikt:akarere - Bugesera District, Cyarubare District, Districts of Rwanda, Gasabo District, Gatsibo District ... find all
- 35 - wikt:derpwhales - Harbour porpoise ... find all
- 34 - wikt:digibook - 'N Crugu Bradului, A Bit o' This & That, A Strange Thing to Say, Anoraknophobia, At the Arena ov Aion – Live Apostasy ... find all
- 33 - wikt:katuns - Battle of Kolašin, Bjelasica, Bua (tribe), Kelmendi (tribe), Komovi ... find all
- 33 - wikt:aeroscreen - 2016 Russian Grand Prix, 2020 Genesys 300, 2020 Indianapolis 500, 2020 IndyCar Series, 2020 Iowa IndyCar 250s ... find all
- 32 - wikt:fanmeeting - After School (group), Apeace, B.O.Y, Beyond Live, CLC (group) ... find all
- 31 - wikt:lostbelt - List of Fate/Grand Order characters ... find all
- 31 - wikt:csexp - Canonical S-expressions ... find all
- 30 - wikt:funfactor - All-Star Baseball '97 featuring Frank Thomas, Battle Arena Toshinden 2, Brain Dead 13, Brandish (video game), Chrono Trigger ... find all
- 29 - wikt:dropsondes - Atmospheric sounding, Dropsonde, Economy of Columbus, Ohio, Eyewall replacement cycle, Global Positioning System ... find all
- 27 - wikt:genophage - Ashley Williams (Mass Effect), Biological warfare in popular culture, Dragon Age: Inquisition – Trespasser, Krogan, Mass Effect 3 ... find all
- 27 - wikt:bodykits - "Mad" Mike Whiddett, 2018 WeatherTech SportsCar Championship, Autodelta (UK), Brabus, Bōsōzoku ... find all
- 27 - wikt:alcippus - Danaus (butterfly), Danaus chrysippus, List of butterflies of Benin, List of butterflies of Burkina Faso, List of butterflies of Cameroon ... find all
- 26 - wikt:diplexed - In-band on-channel, KEAR (AM), KEST, KFWB, KQFN ... find all
- 26 - wikt:danceband - After the Ball (album), Andy Nye, Ansco Bruinier, Don Lusher, Donnez ... find all
- 26 - wikt:concepted - 2011 Southeast Asian Games, AI & Society, Akiyuki Shinbo, Android 21, Bastila Shan ... find all
- 26 - wikt:biradaris - Bisati, Churihar, Dharhi, Doodwala, Ethnic groups in Pakistan ... find all
- 25 - wikt:mprs - Allopregnanolone, Membrane progesterone receptor, Membrane steroid receptor, Pharmacodynamics of progesterone, Progesterone ... find all
- 25 - wikt:maavg - List of Mullard–Philips vacuum tubes ... find all
- 25 - wikt:headstrokes - Bha (Indic), Ca (Indic), Cha (Indic), Da (Indic), Dha (Indic) ... find all
- 25 - wikt:gametype - Battlefield 4, Defense of the Ancients, Devastation (video game), Flood (Halo), GoldenEye: Rogue Agent ... find all
- 25 - wikt:bandform - Analogue filter, Composite image filter, Distributed-element filter, Electronic filter topology, Filter (signal processing) ... find all
- 24 - wikt:istudy - Bored of Studies, McMaster Integrated Science, System Technology-i Co, Ltd ... find all
- 24 - wikt:auroglaucin - Aspergillus aerius, Aspergillus appendiculatus, Aspergillus biplanus, Aspergillus brunneus, Aspergillus caperatus ... find all
- 23 - wikt:lipfire - Ethan Allen (armsmaker) ... find all
- 23 - wikt:inequilaterally - Acacia acutata, Acacia adenogonia, Acacia anaticeps, Acacia andrewsii, Acacia auricoma ... find all
- 23 - wikt:groundsnake - Arrhyton, Jamaican dry forests, List of reptiles of Turkey, Stegonotus (snake) ... find all
- 23 - wikt:everset - Kamen Rider Drive, Kamen Rider Drive: Surprise Future, Kamen Rider Fourze, Kamen Rider Gaim, Kamen Rider OOO ... find all
- 23 - wikt:byali - ELEAGUE Major 2017, ESL One Cologne 2016, FACEIT Major: London 2018, MLG Major Championship: Columbus, PGL Major: Kraków 2017 ... find all
- 23 - wikt:bestfriends - Alessandra Usman, Ally Carter, Anna Liza, FlordeLiza, Fred Payawan ... find all
- 23 - wikt:afwc - 25th Bangladesh Infantry Regiment, Abdul Waheed Kakar, Army Golf Club, Bangladesh Coast Guard, Engineer-in-Chief (Bangladesh army) ... find all
- 22 - wikt:monofacial - Antonio Luque, Bifacial solar cells, Hammerstone, Isofoton, Solar cell ... find all
- 22 - wikt:mapeak - List of Mullard–Philips vacuum tubes ... find all
- 22 - wikt:isoleukotoxin - C18H32O3, CYP2C18, CYP2C19, CYP2C8, CYP2C9 ... find all
- 22 - wikt:godspoken - Gloriously Bright, List of Ender's Game characters, Xenocide ... find all
- 22 - wikt:courtcase - Amsterdamsestraatweg Water Tower, Attack on Kennedy Road, Caroline Farner, Carrickfergus (barony), Ciutat Morta ... find all
- 22 - wikt:akaval - Ciṟupāṇāṟṟuppaṭai, Commentaries in Tamil literary tradition, Five Great Epics, Indian epic poetry, Kalittokai ... find all
- 21 - wikt:geoviewer - Alder Brook (West Branch French Creek tributary), Bailey Brook (West Branch French Creek tributary), Baskin Run, Beaver Run (South Branch French Creek tributary), Beaverdam Creek (Crabtree Creek tributary) ... find all
- 21 - wikt:fieldbuses - CODESYS, ControlNet, Fieldbus, HMS Networks, M-Module ... find all
- 21 - wikt:divebombers - 105th Light Anti-Aircraft Regiment, Royal Artillery, 1st Flintshire Rifle Volunteers, 32nd Light Anti-Aircraft Regiment, Royal Artillery, 61st Light Anti-Aircraft Regiment, Royal Artillery, Battle of Fort Eben-Emael ... find all
- 21 - wikt:dispensery - Ballymacilcurr, Ballynahone Beg, Maghera civil parish, Bracaghreilly, Brackaghlislea, Cloughfin ... find all
- 21 - wikt:derpwhale - Harbour porpoise, Spectacled porpoise ... find all
- 21 - wikt:containg - 2015 in paleontology, Acacia acinacea, Acacia gregorii, Atromentin, Babymetal World Tour 2015 ... find all
- 21 - wikt:constested - 1904 Calgary municipal election, 1983 Jamaican general election, 1985 FINA Men's Water Polo World Cup, 1997 Irish general election, 2002 Irish general election ... find all
- 21 - wikt:ciliolated - Melica amethystina, Melica argentata, Melica bonariensis, Melica commersonii, Melica decipiens ... find all
- 21 - wikt:bandname - 59 Times the Pain, Cute (Japanese idol group), Dynamite Boy, Ill Niño, Illusion (Renaissance album) ... find all
- 21 - wikt:advisorship - Albert Muchnik, Allen Knutson, André D'Allemagne, Craig A. Carlson, Donald B. McCormick ... find all
- 20 - wikt:metalogs - Metalog distribution, Pearson distribution ... find all
- 20 - wikt:metalbending - Avatar: The Last Airbender – North and South, Avatar: The Last Airbender – The Promise, Avatar: The Last Airbender – The Rift, Bolin (The Legend of Korra), Human echolocation ... find all
- 20 - wikt:longiconic - Ascoceratidae, Ascocerida, Basslerocerida, Campendoceras, Ellesmeroceratidae ... find all
- 20 - wikt:keytype - Postage stamps and postal history of Malta, Postage stamps and postal history of Nigeria, Revenue stamps of Aden, Revenue stamps of Bermuda, Revenue stamps of Eritrea ... find all
- 20 - wikt:jetpipe - Airbreathing jet engine, Bristol Proteus, CFE CFE738, Components of jet engines, De Havilland Sea Venom ... find all
- 20 - wikt:freeskate - Adult figure skating, Ashley Wagner, Blades of Courage, Caroline Zhang, Chloé Seyrès ... find all
- 20 - wikt:endwall - Capt. Richard Strong House, Cheng Xu, Edward R. Wilson House, Expansion tube, Hersey-Duncan House ... find all
- 20 - wikt:droneless - Bosnia and Herzegovina, Bousine, Great Highland bagpipe, Greek bagpipes, Gudastviri ... find all
- 20 - wikt:depillaring - Kajora Area, Mugma Area, Pootkee Balihari Area, Satgram Area, Sodepur Area ... find all
- 20 - wikt:datsans - Buddhism in Buryatia, Buddhism in Russia, Damba Ayusheev, Datsan, Tsetserleg, Khövsgöl ... find all
- 20 - wikt:commlock - All That Glisters (Space: 1999), Dragon's Domain, Earthbound (Space: 1999), Guardian of Piri, Seed of Destruction (Space: 1999) ... find all
- 20 - wikt:bushdrive - Toyota Land Cruiser, Toyota Land Cruiser (J40) ... find all
- 20 - wikt:antepreparatory - Anna Maria Taigi, August Czartoryski, Domenico Lentini, Elena Guerra, Franz-Josef Rudigier ... find all
- 20 - wikt:aeroengines - Avio, Bristol Filton Airport, CRAIC CR929, Fedden Mission, Gustav Otto ... find all
- 19 - wikt:lockerboxes - Bahrenfeld station, Billwerder-Moorfleet station, Diebsteich station, Eidelstedt station, Elbgaustraße station ... find all
- 19 - wikt:headscarfs - 2008 Uyghur unrest, 2017–2019 Iranian protests against compulsory hijab, Basketball at the 2014 Asian Games – Women, British debate over veils, Christine Delphy ... find all
- 19 - wikt:growthlines - Acamptodaphne biconica, Atrypa, Calliostoma coppingeri, Eulimella angeli, Eulimella boydae ... find all
- 19 - wikt:goldmarks - Aufhausen–Kröhstorf railway, Caroline Islands, Countess Louise von Bose, Economic history of World War I, Emil Jellinek ... find all
- 19 - wikt:flamebacks - Changeable hawk-eagle, Flameback, Great slaty woodpecker, Greater flameback, Himalayan flameback ... find all
- 19 - wikt:farmans - Aina Mahal, Chanto, Fatawa 'Alamgiri, Hathwa Raj, Ibrahim Khan II ... find all
- 19 - wikt:expp - Exponential map (Riemannian geometry), Hadamard manifold, Hopf–Rinow theorem, Normal coordinates, P-adic exponential function ... find all
- 19 - wikt:electrogyration - Electro-gyration ... find all
- 19 - wikt:ejournals - Aryabhatta College, Asia Pacific Institute of Information Technology, Auraria Library, British Library, Cambridge University Library ... find all
- 19 - wikt:eaudiobooks - Alexandria Library (Virginia), Arlington Public Library, Chicopee Public Library, Dublin City Public Libraries and Archive, Eastern Regional Libraries ... find all
- 19 - wikt:drillhead - History of ice drilling, Ice core, Ice drilling ... find all
- 19 - wikt:drawspan - Amtrak Old Saybrook – Old Lyme Bridge, Evergreen Point Floating Bridge, First Avenue South Bridge, History of the Staten Island Railway, Hood Canal Bridge ... find all
- 19 - wikt:compèred - Athol Guy, Bob Monkhouse, Can't Stop Myself from Loving You, Colin Murray, David Vine ... find all
- 19 - wikt:adnervular - Atrophaneura varuna, Byasa latreillei, Pachliopta aristolochiae, Papilio alcmenor, Papilio bootes ... find all
- 18 - wikt:meetingplace - Alexanderplatz, Amager Boulevard, Baron Boltens Gård, Confidentcrowd, Dr. William J. Mayo House ... find all
- 18 - wikt:lilwa - Mbole people ... find all
- 18 - wikt:icinema - Jamil Yamani, Jeffrey Shaw, Kate Moore (composer), Sarah Kenderdine, Scenario (artwork) ... find all
- 18 - wikt:hotelware - Middleport, Staffordshire, Restaurant ware, Royal Doulton, Steelite, Stoke-on-Trent ... find all
- 18 - wikt:headgrowth - Aquarium fish feed, Egg-fish goldfish, Goldfish, Lionchu, Lionhead (goldfish) ... find all
- 18 - wikt:headcrest - 2019 in archosaur paleontology, Cockatoo, List of birds of Asia, List of birds of Australia, List of birds of East Timor ... find all
- 18 - wikt:headcoach - Alemannia Aachen, Benito Montalvo, John Allen (footballer, born 1964), Kenya national rugby league team, List of NorthEast United FC managers ... find all
- 18 - wikt:hammerbeams - All Saints Anglican Church, Brisbane, Bristol Temple Meads railway station, Edmund Blacket, Emmaus United Methodist Church, Emneth ... find all
- 18 - wikt:goreum - Dangui, Durumagi, Hanbok, Jang-ot, Jeogori ... find all
- 18 - wikt:episodics - Alefia Kapadia, Annie Gill, Bentonville Film Festival, Bill Cakmis, Himmanshoo A. Malhotra ... find all
- 18 - wikt:disappeareds - Stolpersteine in Hradec Králové Region, Stolpersteine in Karlovy Vary Region, Stolpersteine in Loštice, Stolpersteine in Milovice nad Labem, Stolpersteine in Mladá Boleslav ... find all
- 18 - wikt:cyclepaths - A2 road (England), Benelux, Bike lane, Biotren, De Marne ... find all
- 18 - wikt:cleansheets - 2020–21 Barrow A.F.C. season, 2020–21 Harrogate Town A.F.C. season, Ashraful Islam Rana, Bilal Khan (footballer), Craig Dootson ... find all
- 18 - wikt:centercab - Arcade and Attica Railroad, Buda Engine Co., Chesapeake Beach Railway, DSB (railway company), Delaware Coast Line Railroad ... find all
- 18 - wikt:buttpad - AK-12, Accuracy International Arctic Warfare, Barrett M82, FN Special Police Rifle, Heckler & Koch MP5 ... find all
- 18 - wikt:boyaress - 1842 Wallachian princely election, Alexandrina Cantacuzino, Alexandru B. Știrbei, Alexandru Bogdan-Pitești, An Unforgettable Summer ... find all
- 18 - wikt:antipalindromic - Cyclotomic polynomial, Line spectral pairs, Reciprocal polynomial ... find all
- 25 (down from 53) - wikt:οτι - Lectionary 12, Lectionary 239, Lectionary 240, Matthew 28:5–6, Minuscule 2427 ... find all
- These all appear to be the Greek word 'οτι', which does not appear in wikt without breath marks. That is, see wikt:ότι, which then mentions forms wikt:τι, wikt:ὅτι, wikt:ό,τι.
- It would appear then that the proper action is to mark all these quoted Greek texts with {{lang}}? ::Also, I think I'll ask over at wikt if it would be reasonable for them to have an entry for wikt:οτι. They do have an entry for wikt:oti, which mentions at least wikt:ότι and wikt:ό,τι, but not wikt:ὅτι. (sigh) Oh what a tangled web we wind, when first we endeavor these defined. Shenme (talk) 04:30, 13 October 2019 (UTC)
- Additionally, many (all?) of these appear to be 'biblical' == classical == ancient Greek, which has ISO 639-2 code 'grc'. Modern Greek is ISO 639-1 code 'el', ISO 639-2 code 'gre'. Shenme (talk) 04:49, 18 October 2019 (UTC)
- Ah, but not all. Some found with search are modern Greek, so lang|el, and some 'oti' found having breath marks. Currently searching using "οτι" -insource:"lang|grc" -insource:"lang|el" -insource:"lang|gre" and working on labelling any form of Greek. Shenme (talk) 02:27, 20 October 2019 (UTC)
- 121 (down from 230) - wikt:æftiʀ - Ardre image stones, Aringsås Runestones, Arkils tingstad, Asferg Runestone, Ballstorp Runestone ... find all
- I don't think Old Norse entries with ʀ are allowed (they are either presented in Runic or normalized to r) on Wiktionary; the solution is to language-tag instances on here (generally as Old Norse although glancing at a few, it seems the articles/infoboxes helpfully specify which language it is in each case). -sche (talk) 20:13, 18 November 2018 (UTC)
Likely new words by frequency, all languages (a-m)[edit]
Good candidates for words to add to the English Wiktionary (which provides English definitions for words in all languages), as it seems English Wikipedia readers will frequently encounter them. For each run, only words from half of the alphabet are shown, to avoid duplicate work from when new dumps are being processed.
Most of the words are not from English. To get them off this list, you can either add an entry to the English Wiktionary (which provides English definitions for words in all languages) or tag all instances of the word on the English Wikipedia with {{lang}}. Wiktionary does not accept Romanizations for some languages, so those cases must be tagged as {{transl}} or {{lang}}.
- 181 - wikt:alangaram - Abirameswarar temple, Adhirangam Ranganathaswamy temple, Adi Jagannatha Perumal Temple, Adi Kumbeswarar Temple, Kumbakonam, Adikesava Perumal temple, Mylapore ... find all
- 178 - wikt:aradanai - Abirameswarar temple, Adhirangam Ranganathaswamy temple, Adi Jagannatha Perumal Temple, Adi Kumbeswarar Temple, Kumbakonam, Adikesava Perumal temple, Mylapore ... find all
- 137 - wikt:gandharam - Abhogi, Ahiri, Amritavarshini, Anandabhairavi, Andolika ... find all
- 129 - wikt:стрелковая - 109th Rifle Division (Soviet Union), 10th Guards Motor Rifle Division, 114th Rifle Division (Soviet Union), 118th Rifle Division, 121st Guards Rifle Division ... find all
- 119 - wikt:farābād - Bagh-e Jafarabad, Jafarabad, Ahar, Jafarabad, Alborz, Jafarabad, Amol, Jafarabad, Andika ... find all
- 89 - wikt:dhaivatham - Abheri, Abhogi, Anandabhairavi, Asampurna Melakarta, Bahudari ... find all
- 88 - wikt:jangha - Aisanyesvara Siva Temple, Akhadachandi Temple, Arjunesvara Siva Temple, Astasambhu Siva Temples, Belesvara Siva Temple ... find all
- 84 - wikt:kiruthigai - Abirameswarar temple, Adi Kumbeswarar Temple, Kumbakonam, Agastheeswar Temple, Agnipureeswarar Temple, Thirupugalur, Aiyarappar temple ... find all
- 75 - wikt:lēah - Abberley, Abbotsley, Acklam, Ryedale, Adel, Leeds, Alderley, Gloucestershire ... find all
- 72 - wikt:chathusruthi - Abhogi, Anandabhairavi, Bahudari, Bhairavi (Carnatic), Chakravakam (raga) ... find all
- 66 - wikt:ispánate - Andrew I Hont-Pázmány, Arnold II Hahót, Atyusz (genus), Atyusz Hahót, Atyusz III Atyusz ... find all
- 61 - wikt:lemosaurus - Ulemosaurus ... find all
- 60 - wikt:гвардейская - 100th Guards Rifle Division, 10th Guards Motor Rifle Division, 10th Guards Uralsko-Lvovskaya Tank Division, 121st Guards Rifle Division, 126th Guards Rifle Division ... find all
- 60 - wikt:īlābād - Boneh-ye Esmail, Khuzestan, Eshqabad, West Azerbaijan, Esmailabad (north), Dorudzan, Esmailabad (north), Gowhar Kuh, Esmailabad (south), Dorudzan ... find all
- 58 - wikt:bafen - Prince Cheng of the Second Rank, Prince Chun (醇), Prince Ding, Prince Dun, Prince Gong (peerage) ... find all
- 56 - wikt:dhaivatam - Ahiri, Amritavarshini, Andolika, Asaveri, Bowli ... find all
- 53 - wikt:kagawads - 2013 Philippine local elections, Achila, Bohol, Ang Probinsyano (season 7), Ayala Alabang, Barangay ... find all
- 53 - wikt:busstop - Brachttal, Chōfu Airport, Edappadi, Hargs bro runic inscriptions, Higashi-Azuma Station ... find all
- 53 - wikt:adhyayas - Abel Bergaigne, Adi Parva, Aitareya Brahmana, Anushasana Parva, Ashramavasika Parva ... find all
- 51 - wikt:književnosti - Anonymous Ravanićanin, August Kovačec, Božidar Petranović, Bratoljub Klaić, Croatian Language Corpus ... find all
- 49 - wikt:σαυρος - Abelisauridae, Abelisaurus, Abrictosaurus, Abrosaurus, Acteosaurus ... find all
- 48 - wikt:molodezhnaja - Foster Daddy, Tora!, Hearts and Flowers for Tora-san, Maid-Droid, Marriage Counselor Tora-san, Stage-Struck Tora-san ... find all
- 47 - wikt:īdābād - Aqeh Kheyl, Gorgabad, Ardabil, Kalateh-ye Seyyed Ali, South Khorasan, Mohammadabad-e Saidabad, Nematabad-e Ghar ... find all
- 45 - wikt:moughataa - Adrar Region, Assaba Region, Brakna Region, Dakhlet Nouadhibou Region, Gorgol Region ... find all
- 45 - wikt:kaisiki - Ahiri, Andolika, Asaveri, Bahudari, Bhavapriya ... find all
- 45 - wikt:efilmcritic - A Christmas Horror Story, Adrift in Tokyo, All American Orgy, All the Boys Love Mandy Lane, Amy (1997 film) ... find all
- 43 - wikt:myzomelas - Aguiguan, Alamagan, Ambae Island, Ambrym, Angophora floribunda ... find all
- 43 - wikt:fänikor - 1st Life Grenadier Regiment (Sweden), 2nd Life Grenadier Regiment (Sweden), Dalarna Regiment, Fähnlein, Halland Regiment ... find all
- 42 - wikt:προσευχη - Codex Augiensis, Codex Claromontanus, Codex Porphyrianus, Minuscule 1739, Minuscule 181 ... find all
- 42 - wikt:maçkolik - 1963–64 Mersin İdmanyurdu season, 1964–65 Mersin İdmanyurdu season, 1965–66 Mersin İdmanyurdu season, 1966–67 Mersin İdmanyurdu season, 1967–68 Mersin İdmanyurdu season ... find all
- 41 - wikt:практическое - List of ASTM International standards (E) ... find all
- 40 - wikt:ŭnbyŏng - Goryeo coinage, Korean currency, Korean mun ... find all
- 39 - wikt:haplolepideous - Bruchia (plant), Bruchia elegans, Bruchiaceae, Calymperaceae, Campylopus ... find all
- 39 - wikt:actinosiphonate - Acleistoceratidae, Actinomorpha, Adelphoceras, Augustoceras, Balashovia ... find all
- 37 - wikt:cheilos - Acheilognathus, Adenochilus, Ancistrochilus, Anoectochilus, Arthrochilus ... find all
- 36 - wikt:ghilmān - Abu'l-Najm Badr, Ahmad ibn Tulun, Al-Aziz Billah, Al-Mu'tadid, Al-Mu'tasim ... find all
- 36 - wikt:elaenias - Booby Pond Nature Reserve, Botanic Park and Salina Reserve Important Bird Area, Cauls Pond, Centre Hills, Dos Pos, Bonaire, Important Bird Area ... find all
- 36 - wikt:audava - Abhogi, Amritavarshini, Bhupalam, Devagandhari, Gambhiranata ... find all
- 36 - wikt:akarere - Bugesera District, Cyarubare District, Districts of Rwanda, Gasabo District, Gatsibo District ... find all
- 35 - wikt:derpwhales - Harbour porpoise ... find all
- 34 - wikt:digibook - 'N Crugu Bradului, A Bit o' This & That, A Strange Thing to Say, Anoraknophobia, At the Arena ov Aion – Live Apostasy ... find all
- 33 - wikt:katuns - Battle of Kolašin, Bjelasica, Bua (tribe), Kelmendi (tribe), Komovi ... find all
- 33 - wikt:guachimonton - Guachimontones, Tequila, Jalisco, Teuchitlán culture ... find all
- 33 - wikt:bimaristans - Bimaristan, Maristan of Granada, Nur ad-Din (died 1174), Psychiatric hospital, Timeline of psychology ... find all
- 33 - wikt:aeroscreen - 2016 Russian Grand Prix, 2020 Genesys 300, 2020 Indianapolis 500, 2020 IndyCar Series, 2020 Iowa IndyCar 250s ... find all
- 32 - wikt:āgamas - Anekantavada, Antakrddaasah, Anuttaraupapātikadaśāh, Aupapatika, Bhairava ... find all
- 32 - wikt:lyrium - Characters of Dragon Age II, Characters of Dragon Age: Inquisition, Dragon Age, Dragon Age 4, Dragon Age II ... find all
- 32 - wikt:apetura - 1981 Primera División de Fútbol Profesional, 1982 Primera División de Fútbol Profesional, 1983 Primera División de Fútbol Profesional, 1985 Primera División de Fútbol Profesional, 1987–88 Primera División de Fútbol Profesional ... find all
- 32 - wikt:aarss - Aminoacyl tRNA synthetase, Cyclodipeptide synthases, Dino Moras, Expanded genetic code, Phage-assisted continuous evolution ... find all
- 31 - wikt:изд - Albena Stambolova, Church of St Demetrius, Boboshevo, Church of St Elijah, Boboshevo, Daniel Kluger, Igor Birman ... find all
- 31 - wikt:fstnt - TNNI2, TNNT1, TNNT2, TNNT3 ... find all
- 31 - wikt:etatsråd - Amaliegade 12, Anker Heegaard, Bolle Luxdorph, Carl Adolph Castenschiold, Carsten Anker ... find all
- 31 - wikt:dābād - Aliabad-e Jowhari, Asgarabad, Fars, Narmeh, Radabad, Sadabad, Anbarabad ... find all
- 31 - wikt:dcdn - Content delivery network interconnection ... find all
- 30 - wikt:đồngs - Bình Phước Province, Bình Định Province, Bắc Giang Province, Bắc Kạn Province, Cao Bằng Province ... find all
- 30 - wikt:mukhamandapa - Architecture of Karnataka, Arjuna Ratha, Baroli Temples, Bugga Ramalingeswara temple, Chakravageswarar Temple, Chakkarappalli ... find all
- 30 -
wikt:alongwith - AMX-13, Al Jazeera Media Network, Badi Dooooor Se Aaye Hai, Bahu Begum, Bodoland University ... find all - 29 - wikt:νηστεια - Codex Augiensis, Codex Claromontanus, Codex Porphyrianus, Minuscule 1739, Minuscule 181 ... find all
- 29 - wikt:θεου - Codex Basilensis A. N. IV. 4, Codex Glazier, Codex Vaticanus 2061, Lectionary 60, Matthew 27:54 ... find all
- 29 - wikt:δij - Archimedes' principle, Borel–de Siebenthal theory, Brownian motion, Buoyancy, Cartesian tensor ... find all
- 29 - wikt:ādatābād - Dashtabad, Narmashir, Saadatabad, Abadeh, Saadatabad, Arsanjan, Saadatabad, Bardsir, Saadatabad, Darab ... find all
- 29 - wikt:mihnah - Ahmad ibn Abi Du'ad, Harthamah ibn al-Nadr al-Jabali, Ishaq ibn Ibrahim al-Mus'abi, Ishaq ibn Yahya ibn Mu'adh, Kaydar Nasr ibn Abdallah ... find all
- 29 - wikt:maechis - Maechi, Siladhara Order ... find all
- 29 - wikt:imirenge - Bugesera District, Burera District, Busengo, Rwanda, Districts of Rwanda, Gatunda ... find all
- 29 - wikt:hangaround - 1996 Copenhagen Airport shooting, Bandidos MC criminal allegations and incidents, Brödraskapet, Hells Angels MC criminal allegations and incidents, Nordic Biker War ... find all
- 29 - wikt:dropsondes - Atmospheric sounding, Dropsonde, Economy of Columbus, Ohio, Eyewall replacement cycle, Global Positioning System ... find all
- 28 - wikt:musumeyaku - Asuka Tono, Ayane Sakurano, Mari Hanafusa, Natsuki Mizu, Risa Junna ... find all
- 28 -
wikt:dissertion - Anders Hultgård, C. Scott Littleton, Craig Benjamin, Dean A. Miller, Dieter Timpe ... find all - 27 - wikt:дивизија - 11th Air Defense Division, 13th Air Defense Division, 15th Air Defense Division, 21st Aviation Division, 29th Aviation Division (Socialist Yugoslavia) ... find all
- 27 - wikt:mōhua - Breaksea Island (New Zealand), Brook Waimārama Sanctuary, Declana egregia, Fiordland, Fiordland National Park ... find all
- 27 - wikt:mössmärke - Admiral (Sweden), Flottiljamiral, Kapten, Kommendör, Kommendör av 1. graden ... find all
- 27 - wikt:kurakas - Aymara kingdoms, Cacique, Calchaquí, Diaguita, Efraín Trelles ... find all
- 27 - wikt:kombonis - Komboni, Shanty town, Squatting ... find all
- 27 - wikt:genophage - Ashley Williams (Mass Effect), Biological warfare in popular culture, Dragon Age: Inquisition – Trespasser, Krogan, Mass Effect 3 ... find all
- 27 - wikt:degredados - 2nd Portuguese India Armada (Cabral, 1500), Barra (neighborhood), Cacheu, Captaincy of Pernambuco, Colonial Brazil ... find all
- 27 - wikt:dacoz - Epimeria ... find all
- 27 - wikt:alcippus - Danaus (butterfly), Danaus chrysippus, List of butterflies of Benin, List of butterflies of Burkina Faso, List of butterflies of Cameroon ... find all
- 26 - wikt:κυριος - Codex Laudianus, Codex Vaticanus 2061, Cotton Genesis, Cyril, Family Kr ... find all
- 26 - wikt:lăutărească - Ciocârlia (Romanian folk tune), Costi Ioniță, Damian Drăghici, George Nicolescu, Lăutari ... find all
- 26 - wikt:komēs - Anna (wife of Artabasdos), Artabasdos, Aëtius of Amida, Byzantine army, Chartoularios ... find all
- 26 - wikt:khoshuu - Aimag, Bayandelger, Töv, Chingünjav, Dashdorjiin Natsagdorj, Dulduityn Danzanravjaa ... find all
- 26 - wikt:honā - Continuous and progressive aspects, Future tense, Grammatical mood, Grammatical tense, Habitual aspect ... find all
- 26 - wikt:fylkesordfører - Administrative divisions of Norway, Arnfinn Nergård, Audun Tron, County council (Norway), County municipality (Norway) ... find all
- 26 - wikt:faʻafafine - Fa'afafine ... find all
- 26 - wikt:diplexed - In-band on-channel, KEAR (AM), KEST, KFWB, KQFN ... find all
- 26 - wikt:cypsellae - Felicia aethiopica, Felicia amelloides, Felicia amoena, Felicia annectens, Felicia bellidioides ... find all
- 26 - wikt:chatushruti - Amritavarshini, Andolika, Atana, Devagandhari, Dheerashankarabharanam ... find all
- 26 - wikt:biradaris - Bisati, Churihar, Dharhi, Doodwala, Ethnic groups in Pakistan ... find all
- 26 - wikt:bhukti - Asharaja, Ashoknagar district, Bamangola (community development block), Budhagupta, Chanchal II ... find all
- 26 - wikt:anuratha - Aisanyesvara Siva Temple, Arjunesvara Siva Temple, Astasambhu Siva Temples, Bata Mahadeva, Bhringesvara Siva Temple ... find all
- 26 - wikt:aluminyl - Aluminium(I) nucleophiles ... find all
- 25 - wikt:ваздухопловна - 1st Air Command, 21st Aviation Division, 29th Aviation Division (Socialist Yugoslavia), 32nd Aviation Division, 37th Aviation Division (Socialist Yugoslavia) ... find all
- 25 - wikt:mukhamantapa - Bankapura, Bhimeshvara Temple, Nilagunda, Chennakeshava Temple, Hullekere, Chennakeshava Temple, Turuvekere, Dah Parvatiya ... find all
- 25 - wikt:külliyye - Külliye ... find all
- 25 - wikt:kruptos - Barred tinamou, Berlepsch's tinamou, Black-capped tinamou, Brazilian tinamou, Brown tinamou ... find all
- 25 - wikt:hrvatskoga - Croatian Vukovians, Croatian language, Eduard Hercigonja, Etymological dictionary, Franjo Marković ... find all
- 25 - wikt:haltijas - Finnish paganism, Haltija, Haltya ... find all
- 25 - wikt:fújì - Debbie Klein, Fuji music ... find all
- 25 - wikt:flyaround - Flight controller, Jerry M. Linenger, Progress M-08M, Progress M-13M, Progress M-17M ... find all
- 25 - wikt:delområde - Bergjeland, Buøy, Eiganes, Gausel, Godeset ... find all
- 25 - wikt:bhikkhunīs - Bhikkhunī, Buddhist Cultural Centre, Buddhist flag, Dhammadharini Vihara, Ānanda ... find all
- 25 - wikt:apkc - Carla V. Rothlin, Cell polarity, Epithelial polarity, Ganglion mother cell, Inner cell mass ... find all
- 25 - wikt:abhangas - DY Patil Stadium, Damaji, Dattatreya, Devaki Pandit, Haripath ... find all
- 24 - wikt:field -
Björn Ambrosiani, Brett Maher (American football), Compact Linear Collider, Deimos and Phobos Interior Explorer, Eleanore Ramsey... find all - 24 - wikt:αυτω - Codex Boernerianus, Codex Ephesinus, Lectionary 239, Matthew 1:24, Matthew 27:55-56 ... find all
- 24 - wikt:kumichō - Saizo Kishimoto, Yamaguchi-gumi, Yoshiaki Fujiwara ... find all
- 24 - wikt:kubbs - Kubb, The Amazing Race 6 ... find all
- 24 - wikt:hvbat - Swedish Army ... find all
- 24 - wikt:heihaku - Fushimi Inari-taisha, Glossary of Shinto, Hirano Shrine, Hirose Shrine, Hirota Shrine ... find all
- 24 - wikt:harmoniai - Aeolian mode, Damon of Athens, Dorian mode, Katolophyromai, Mixolydian mode ... find all
- 24 - wikt:főszolgabíró - Dénes Farkas, Elek Csány, Farkas de Boldogfa, Ferenc Deák, Ferenc Farkas (Jesuit priest) ... find all
- 24 - wikt:fabrikmester - Andreas Gerner, Andreas Schifter, Diderich de Thurah, Frantz Hohlenberg, Frederik Michael Krabbe ... find all
- 24 - wikt:eventhough - Ahmad Sidi Ismail, Amboy Airfield, Cieszyn Silesian dialect, Coccoloba gigantifolia, Coenagrion ornatum ... find all
- 24 - wikt:ekakuta - Akkana Basadi, Amrutesvara Temple, Amruthapura, Architecture of Karnataka, Brahmeshvara Temple, Kikkeri, Chennakeshava Temple, Aralaguppe ... find all
- 24 - wikt:echinulins - Aspergillus aerius, Aspergillus appendiculatus, Aspergillus brunneus, Aspergillus caperatus, Aspergillus cibarius ... find all
- 24 - wikt:devotionalism - Akshay Kumar Datta, Allama Prabhu, Bharatendu Harishchandra, Buddhist influences on Advaita Vedanta, Faith in Buddhism ... find all
- 24 - wikt:bychowskyi - Boris Bychowsky, Cymothoa, Lepeophtheirus, Mexicana (genus), Murraytrema ... find all
- 24 - wikt:auroglaucin - Aspergillus aerius, Aspergillus appendiculatus, Aspergillus biplanus, Aspergillus brunneus, Aspergillus caperatus ... find all
- 24 - wikt:anatomicae - 1552, 1552 in science, 1561 in literature, 1561 in science, 1714 in science ... find all
- 24 - wikt:adelophthalmids - Adelophthalmidae, Bassipterus, Eurypterid, Eysyslopterus, Nanahughmilleria ... find all
Likely new English compounds by frequency (n-z)[edit]
(Waiting for new dump; only cases with manual notes are shown below.)
- 43 - wikt:subscalar - Aforia goniodes, Aforia trilix, Anatoma alta, Benthomangelia macra, Bolma aureola ... find all → need a malacological definition
- 21 - wikt:peaktime - 1977 in British television, 1982 in British television, 1983 in British radio, Blockbusters (British game show), Bus Éireann Route 101 ... find all -it does get used in sources eg, but dictionaries use "peak time". I went through and changed all uses within WP prose to the dictionary spelling, but it maybe should get added to wikt.
- 19 - wikt:pstg - Language center, Language processing in the brain ... find all - okay so this is an acronym (pSTG) for an area in the brain. I would suggest a redirect of both the term (posterior superior temporal gyrus) and the acronym to either Brodmann area 22 or Superior temporal gyrus. --Xurizuri (talk) 04:43, 2 February 2021 (UTC)
- 18 - wikt:soundsystems - Acetate disc, Boiler Room (music broadcaster), Caribbean music in the United Kingdom, David Rodigan, Hopelessly in Love ... find all - apparently this is a reggae term, I found a few instances of it w/o a space like this. The wp article is Sound system (Jamaican). --Xurizuri (talk) 05:35, 4 February 2021 (UTC)
Likely new words by frequency, all languages (n-z)[edit]
(Waiting for new dump; only cases with manual notes are shown below.)
- 26 - wikt:vrijburgers -
Afrikaners, Asafo, Boers, Cape Dutch, Dutch people... find all -> I've started going through these and lang tagging everything in the articles (although this specific word is probably important to have in wikt bc of historical significance), I have mediocre knowledge of Afrikaans (and therefore Dutch) and vrijburgers is the original Dutch word for Afrikaans colonists so I guess I'll be the sacrifice. --Xurizuri (talk) 10:34, 7 February 2021 (UTC) // listing here so I can keep track of which ones I've done: the other pages areMeermin slave mutiny, Cornelis Nagtglas, Johannes Oosthout, South Africa, Great Trek, White Africans of European ancestry, South African Wars (1879–1915) (started but got annoyed at its tone),Cape Colony--Xurizuri (talk) 08:32, 20 February 2021 (UTC) - 24 -
wikt:premiére - Atandwa Kani, Cigano (film), De Lafontaine, Figaro-Polka, Franz Christian Gau ... find all→ Use in English text fixed, but there are a few unconfirmable references with this spelling. Note this does also appear to be a Slovak spelling - not using the è as French does.
For Wiktionary[edit]
This is a special section; putting a Wiktionary link here will cause a word to be ignored by the spell checker everywhere it appears (on the assumption it will soon be added to Wiktionary.)
Rejected[edit]
(These will need {{not a typo}} and maybe an HTML comment.)
- wikt:tragediously: I moved this one out of the list above because it seems to have only ever been used once, by Aston Cockayne, whereas Wiktionary only includes English words that have been used by three different people. -sche (talk) 21:00, 13 August 2020 (UTC)
- moved out from the list for English words first attested in Chaucer, this is apparently a misspelling of prentishood as Chaucer spelled it (wikt:prenticehood) --Xurizuri (talk) 12:55, 7 January 2021 (UTC)
- wikt:scorkle (from English words first attested in Chaucer list) apparently used to be on wikt then got RfD'd
Vocab pages[edit]
- 61 - Boontling - wikt:bahlness, wikt:beelch, wikt:beemsch, wikt:beeljeck, wikt:belhoon, wikt:blooch, wikt:bloocher, wikt:breggo, wikt:borp, wikt:bowgley, wikt:burlapping, wikt:chigrel, wikt:cloddies, wikt:comoshe, wikt:condeal, wikt:crazeek, wikt:deeger, wikt:deejy, wikt:dehigged,wikt:dissies, wikt:donicker, wikt:donagher, wikt:dreek, wikt:dreeked, wikt:dreeking, wikt:dulcey, wikt:eeld, wikt:eesole, wikt:haireem, wikt:heelch, wikt:pockety, wikt:higged, wikt:higgied, wikt:hobneelch, wikt:keishbook, wikt:kimoshe, wikt:kingster, wikt:madging, wikt:modocker, wikt:moldune, wikt:moldunes, wikt:nettied, wikt:nonch, wikt:oshtook, wikt:peeril, wikt:pusseek, wikt:rawncher, wikt:seertail, wikt:sirtle, wikt:sharkin, wikt:shoveltooth, wikt:somersetting, wikt:steedos, wikt:teebow, wikt:tuddies, wikt:tuddish
- 43 - English words first attested in Chaucer - wikt:attourne, wikt:feminie, wikt:gigge, wikt:louke, wikt:emprent, wikt:enbaissing, wikt:ensampler, wikt:entach, wikt:entech, wikt:entalent, wikt:eschaufe, wikt:festivally, wikt:foleye, wikt:forline, wikt:formly, wikt:fortunel, wikt:fortunous, wikt:habitacule, wikt:hustlement, wikt:necess, wikt:overwhelve, wikt:plungy, wikt:portionable, wikt:presentary, wikt:previdence, wikt:purveyable, wikt:rhetorian, wikt:slead, wikt:troublabla, wikt:unbetide, wikt:undoubtous, wikt:unleeful, wikt:unmovablety, wikt:unparegal, wikt:unplite, wikt:unweened, wikt:vengeress, wikt:weeply, wikt:witnessfully
- 9 - Longest word in English - wikt:broughammed, wikt:subdermatoglyphic, wikt:gravedinously, wikt:shakalshas, wikt:galahads, wikt:leucocytozoans, wikt:quiaquia
0-9[edit]
- 1 - 2008 Dublin Senior Football Championship - wikt:sline: Gaelic Football notation ('sideline') \\ this is on wikt but not for this meaning --Xurizuri (talk) 13:06, 7 January 2021 (UTC)
- 1 - 1830–1831 papal conclave - wikt:unvetoed - not a typo
- 3 - 1842 Wallachian princely election - wikt:sortitioned - past tense verb form of sortition
- 1 - 2000 and Whatever - wikt:auspOp: the name of a web site. Ira Leviton (talk) 16:21, 24 September 2019 (UTC)
- 1 - 2018 in Germany - wikt:indiologist - seems to be a real word
- 1 - 42 (dominoes) - wikt:renegger: a term used in the game and defined in the article. Ira Leviton (talk) 20:59, 26 September 2019 (UTC)
- I think this belongs in the dictionary, as a derivation of wikt:reneg -- Beland (talk) 01:48, 24 March 2020 (UTC)
- 1 - 2015 African Youth Athletics Championships - wikt:octathlete - competitor in an octathlon*
- 1 - 1607 - wikt:pallisadoed - a real word
- 1 - 17th Armored Engineer Battalion - wikt:chespaling- "chespaling mat" is a real term for a type of field matting
- 2 - 1980 Quebec referendum - (probably OK: wikt:regroupments) - conscious adaptation of a French word specifically in the context of the politics of Quebec
- 3 - 1854 Broad Street cholera outbreak - wikt:vibriones, wikt:vibriones, wikt:vibriones - a real word
- 1 - 1894 United States House of Representatives elections - wikt:silverist - if this is a real word, it means an American political faction
- 1 - 1938 NSWRFL season - wikt:trygetters - conceivably a real word (Australian)
- 2 - 1957 in jazz - wikt:sazabo - a Turkish musical instrument
- 2 - 1968–69 Mersin İdmanyurdu season - (probably OK: wikt:maçkolik) = maçkolik.com (Turkish sports website)
- 2 - 1st Aeromedical Evacuation Squadron - wikt:aeromedically, wikt:aeromedically - if "aeromedical" is an adjective, no reason why "aeromedically" can't be an adverb
- 1 - 2001 Taiwan legislative election - wikt:reunificationist - must surely be a real word = "supporter of reunification"
- 1 - 2003 Somaliland presidential election - wikt:mistabulation - a real word
- 1 - 2NU - wikt:synclaver - this is a common spelling, so I've left it, but it may nevertheless be a typo for "synclavier"
- 1 - 2014 in Costa Rica - wikt:unjournalistic: this word seems OK. Ira Leviton (talk) 02:05, 30 September 2019 (UTC)
- 2 - 2016 PSOE crisis - wikt:officialists, wikt:officialists: a name given to a faction in this crisis (opposed to the "critics". Ira Leviton (talk) 21:17, 2 October 2019 (UTC)
- 6 - 2008 Murshidabad beheading - wikt:shalishi, wikt:shalishi, wikt:shalishi, wikt:shalishi, wikt:shalishi, wikt:shalishi = "shalishi court", which is a kangaroo court in India
- 1 - 2008 TC3 - wikt:polymict - a real word
- 2 - 2010–11 Reading F.C. season - wikt:backheeler - never seen it as a noun but no reason why not
- 1 - 2012 Ingleside, San Francisco homicide - wikt:undeportable - a real word
- 1 - 2006 Iranian Assembly of Experts election - wikt:provisionist: seems to be a term in Iranian politics. It comes up on Internet searches, but the citation in the article is in Persian. Ira Leviton (talk) 23:53, 29 September 2019 (UTC)
- 1 - 2006 Oregon Ballot Measures 46 and 47 - wikt:unobligated: OK, in an arcane financial way. Ira Leviton (talk) 23:53, 29 September 2019 (UTC)
- 1 - 20th Lancers (British Indian Army) - wikt:risallahs - word for an Indian cavalry unit - please add to Wikt
- 1 - 24/7 service - wikt:rehumanisation - apparently a word in the service sector
- 1 - 251st Cyberspace Engineering Installation Group - wikt:remissioning - cyberspace jargon
- 1 - 2017–18 Taça da Liga - wikt:repechaged - "repechage" is fine as a noun; "repechaged" is occasionally used, but is not necessarily correct, so leaving this here for a second opinion
- 4 - 3D cell culturing by magnetic levitation - wikt:adipospheres - real scientific term
- 1 - 1973 Soviet economic reform - wikt:derationalisation: seems OK in context and with British spelling. Ira Leviton (talk) 15:17, 29 September 2019 (UTC)
- 4 - 009-1 - wikt:cybernetized - intended for "adapted into a cybernetic form" but probably not a real word
- 1 - 2003 in Afghanistan - wikt:telekiosks: a legit term (plural). Ira Leviton (talk) 14:41, 26 May 2020 (UTC)
- 1 - 2018 Selangor state election - wikt:agropolitan: apparently a legitimate word. Ira Leviton (talk) 20:03, 25 May 2020 (UTC)
- 1 - 1962 Burmese coup d'état - wikt:intramilitary: seems OK. Ira Leviton (talk) 15:16, 27 May 2020 (UTC)
- the correct spelling is wikt:intra-military, but this isn't in wikt either so I'm leaving this entry here as a reminder. I have fixed it in the article.
- 1 - 1923 Madras Presidency Legislative Council election - wikt:diarchical - might be okay Strickesel (talk)
- 1 - 1 Corinthians 10 - wikt:eulogesas - a theological term, might be correct Strickesel (talk)
- 1 - 116th Operational Maneuvers Regiment - wikt:brelage: a type of belt or ropework. Ira Leviton (talk) 01:26, 25 May 2020 (UTC)
- 2 - 1946 Romanian general election - wikt:guardists: a term applied to members of the Iron Guard in Romania. Ira Leviton (talk) 00:08, 25 May 2020 (UTC)
- 2 - 35 mm movie film - wikt:anamorphically, wikt:anamorphosing: photography terms. Ira Leviton (talk) 14:41, 26 May 2020 (UTC)
- 2 - 86th Scripps National Spelling Bee - wikt:kneydel, wikt:knadel - different spellings of 'knaidel' Strickesel (talk)
- 1 - 2400-series (CTA) - wikt:unrehabbed - unsure if okay Strickesel (talk)
A[edit]
- 1 - A. carbonaria - wikt:varay: seems like a real word.
- This is part of the common name for the species, so I think that belongs in Wiktionary? If not, we would normally create a redirect from cotton varay to Albizia carbonaria and that would take care of it, but the latter hasn't been created yet. -- Beland (talk) 17:27, 13 October 2018 (UTC)
- 1 - Adana Center for Arts and Culture - wikt:ampire: possible Ottoman architectural style?
- 1 - Adeline's Dream - wikt:soddle - Possibly means 'soddy' a nickname for houses made of sod (earth and grass), potentially also Germanic slang for the same.
- 1 - African-American Vernacular English - wikt:fixina: dialect-specific
- Yes, but possibly too rare to meet Wiktionary Criteria For Inclusion in this specific spelling; we do have wikt:fixing to, wikt:finna and others. -sche (talk) 03:06, 28 November 2018 (UTC)
- 1 - Aina Wifalk - wikt:manuped: name for an invention.
- 1 - Alash Ensemble - wikt:limpi: old musical instrument?
- 1 -
Allelopathy - wikt:allelo-:used as a prefix to explain a word root. (Wiktionary does have prefixes and suffixes.) → lang marked - 1 - Alice Holt Forest - wikt:hangra: an Old English word.
- 1 - Antas de Ulla - wikt:liscos: a Spanish word, possibly regional, for a type of dish of bacon bits.
- 2 - Amsterdamseweg - wikt:banpole, wikt:banpole: a type of monument or marker to indicate how far criminals were allowed to approach city. Western Europe, Netherlands. Not sure if it should be one or two words.
- 1 -
Andrew Glover (composer) - wikt:aleotory: evidently a real word, although I can't define it. See https://aleacounterpoint.wordpress.com/2010/06/08/orpheus/ → was corrected to "aleatory" - 1 - ArmSCII - wikt:yiwn - normal Armenian ech (yech) and yiwn (vyun) small letters pair
- 1 - Arto Tunçboyacıyan - wikt:blul: an Armenian musical instrument, described as the same as or similar to a sring
- 1 - Arts and Science Center for Southeast Arkansas - wikt:seriographs: seems like a legitimate word. -> There's a Wikipedia article; Wiktionary needs the word and its plural.
These were checked for misspellings and determined to be OK. They need to be added to Wiktionary or an exclusion list:
- Apo (drink) - wikt:wiyu - checked, OK
- Apocalypse (Star Wars novel) - wikt:drochs - checked, OK
- Apodemia mormo langei - wikt:psychicola - checked, OK
- Apolinar's wren - wikt:twii (probably OK: wikt:tchorr) - checked, OK \\ w/o having checked, I'll bet all my savings that these are bird sounds --Xurizuri (talk) 13:24, 7 January 2021 (UTC)
- Aporia hippia - wikt:taupingi - checked, OK
Aposthia - wikt:aposthic, wikt:aposthic, wikt:aposthic, wikt:aposthic- checked, OK- Apotomops rhampha - wikt:rhamphos - checked, OK
- Apotropaic mark - wikt:trepein (probably OK: wikt:apotrepein) - checked, OK
- Appendix Probi - wikt:denasalised, wikt:numqua - checked, OK
- Appendix Vergiliana - wikt:keirein - checked, OK
- Appias ada - wikt:thasia (probably OK: wikt:tindalti) - checked, OK
- Apple Blossom Handicap - wikt:distaffers - checked, OK
- Apple of my eye - wikt:iyshown, wikt:iyshown, wikt:iyshown - checked, OK
- Application Enhancer - wikt:haxies, wikt:haxies, wikt:haxies - checked, OK
- April 2009 Moldovan parliamentary election protests - wikt:episodul - checked, OK
- April Daniels - wikt:andrn - checked, OK
- Aprosphylosoma - wikt:julidan - checked, OK
- Aptamer - wikt:aptabodies (probably OK: wikt:postranslational, wikt:trxA) - checked, OK
- Aptenia - wikt:ptenos - checked, OK
- Apulet - wikt:apulettes - checked, OK
- Aqraba, Nablus - wikt:khirbets - checked, OK
- Aqua Virgo - wikt:vinustas - checked, OK
- Aquatic garter snake - wikt:zaxanthus - checked, OK
- Aquilarhinus - wikt:palimentus - checked, OK
- Aquilino Ribeiro - wikt:encoiradas - checked, OK
- Arab Street - wikt:pukadai, wikt:sadkku - checked, OK
- Arabana people - wikt:wadlu, wikt:wagka (probably OK: wikt:woqka) - checked, OK
- Araeosoma - wikt:dactylous (probably OK: wikt:brunnichi) - checked, OK
- Aralez (mythology) - wikt:aralezes, wikt:aralezes, wikt:aralezes, wikt:aralezes - checked, OK
- Arancini - wikt:bburru - checked, OK
- Arapian - wikt:kavourma, wikt:loutza, wikt:caseri, wikt:chasapaki - checked, OK
- Araripedactylus - wikt:daktylos - checked, OK
- Araucaria biramulata - wikt:biramule - checked, OK
- Arbore people - wikt:kyrnat, wikt:qawots (probably OK: wikt:chirnan, wikt:morqo, wikt:qawot) - checked, OK
- Arboretum La Alfaguara - wikt:manleb (probably OK: wikt:euromericana) - checked, OK
- Arbostola - wikt:heuritica - checked, OK
- Arbutus unedo - wikt:kocimare (probably OK: wikt:komaròs) - checked, OK
- Arbuzov - wikt:arbooz - checked, OK
- Arca (bivalve) - wikt:kauaia (probably OK: wikt:koumaci) - checked, OK
- Arcadian League - wikt:myrioi - checked, OK
- Archaefructus - wikt:eoflora - checked, OK
- Archaeocyon - wikt:leptodus (probably OK: wikt:falkenbachi)- checked, OK
- Archaeocyte - (probably OK: wikt:collencytes) - checked, OK
- Archaeognatha - (probably OK: wikt:koryphē) - checked, OK
- Archaeoindris - (probably OK: wikt:collodiaphyseal) - checked, OK
- Archaeology of Qatar - wikt:rawdas - checked, OK
- wikt:archaios (from Archaeornithomimus, Archaeornithoides, Archaeoistiodactylus, Archaeoindris, Archaeognatha, Archaeocyte) used to be in wikt but was deleted for being a frivolous entry.
B[edit]
- 1 -
Balloon light - wikt:tuboid:legitimate word, meaning resembling or like a tube. - 1 - Batog - wikt:batogs: legitimate word.
- 1 - Battered (band) - wikt:burkies: probable legitimate slang use of a word.
- 1 -
Bathford - wikt:drayning: old English spelling of draining.→ it is in a quote so should not reappear. - 1 - Baths of Agrippa - wikt:quadran: a Roman bronze coin worth one quarter of an as.
- 1 - Bathtub boat - wikt:tubbers: somebody who races in bathtubs; a bathtub racer.
- 2 - Baju Kurung - wikt:sampin, wikt:sampin: probably a Malay word.
- 1 - Bruce Kiskaddon - wikt:creakin - many in' word endings: can they be special-cased?
- 1 - Buhler Group - wikt:gristing - needs to be in wikt (it's what grist mills do)
- 1 - Butts Up - wikt:savies - defined in artile. Slang/sports term. Elfabet (talk)
- 1 - Béton brut - wikt:shetting - defined in article with source, should be added, probably. Elfabet (talk)
- 1 - Brisket - wikt:brusket: Middle English.
- 1 - Bourgueticrinida - wikt:cirrals: part of a sea lily (plural).
- 1 - Bible and Orient Museum - wikt:ethnologica: old-fashioned, but OK.
- 2 - Big-bang firing order - wikt:twingles, wikt:twingles: plural for twingle, a type of engine re-engineered to have cylinders fire simultaneously instead of alternately.
- 1 - Birstall, West Yorkshire - wikt:byrh: Old English.
- 1 - Black Shuck - wikt:skuh: Old English.
- 1 - Blagdon - wikt:bloec: Old English, meaning 'black' or 'bleak'.
- 1 - Blasius Merrem - wikt:carinates: an term for flying birds, with a keeled sternum.
- 1 - Bellwether - wikt:bellewether: Middle English spelling, used as an example in the article.
- 1 - Berberis canadensis - wikt:glaucose: used correctly in the article - it's a word.
- 1 - Bergstedt - wikt:stedt: used as a suffix, as in -stedt.
- 1 - Bertram de Criol - wikt:constabularie: Old English spelling.
- 2 - Band government - wikt:treatied, wikt:untreatied: I'm not sure if this is proper use of the word treaty.
- 1 - Barbiturate - wikt:tooties: slang for barbiturate, as mentioned in the article.
- 1 - Bahaba - wikt:chaptis: a common name taken from a species name.
- 1 - Businessman (film) - wikt:flexies - unsure
- 1 - Breakscore - wikt:zouching: zouch is a term for a scoreless roll in the game, maybe in other games too.
- 3 - Book signing - wikt:ereading, wikt:ereading, wikt:ereading: short for "electronic reading".
- 1 - Barney Berlinger - wikt:septathlon: used in the newspaper article used as a reference.
- 2 - Bathtub racing - wikt:tubbers, wikt:tubbers: a bathtub racer.
- 1 - Battle of Byczyna - wikt:elears: a pretty obscure word. A type of cavalry fighter (and plural).
- 2 - Bay of Sielmönken - wikt:warfts, wikt:warfts: a type of Northern European artificial dwelling mound - see Terp
- 2 - Blood and Thunder (comics) - wikt:squigs, wikt:squigs: a type of character in a game or comic. Short for "squiggly beast".
- 1 - Belmond Las Casitas - wikt:colcas: Spanish or a local Indian word for the mud and stone granaries built into the cliffs or caves, and for which the Colca Valley is named. Plural.
- 1 - Battle-Pieces and Aspects of the War - wikt:outly: this appears to be a typo in a source copied online. I can't locate the original source, nor figure out what the word should be. I don't want to mark it with [sic] because I don't know if there's an error in the original source.
- 27 - wikt:adobong - Cabalen, Ipomoea aquatica, Kapamilya, Deal or No Deal, Philippine adobo, Squid as food ... find all
- A Filipino word derived from adobo meaning cooked in a marinade. I redirected, but if someone knows how to add these to Wiktionary, pleaes do it.
C[edit]
- 2 - Capriccio (art) - wikt:quadratture: may be mispelled (one 't'). The same word applied to math and other fields has one 't', but I'm not sure about this. I'll leave it to the art experts.
- 1 - Catherine de' Medici's building projects - wikt:priants: correct, people kneeling in prayer.
- 1 - Chattanooga Choo Choo (film) - wikt:chuggin: in a movie tagline, as "chuggin' "
- 1 - Cheese on toast - wikt:choast: slang for cheese and toast, explained in the article.
- 1 - Cheltenham - wikt:cilta: word root explaining the etymology of the town name.
- 1 - Catopta saldaitisi - wikt:stroky: quoted correctly from a citation.
- 1 - Cusec - wikt:cufm - unit of flow rate. stands for 'cubit feet per minute'. Includable, though uncommon.
- 1 - Cyclist fatality rate in U.S. by year - wikt:trikkes - A company's name for their 3-wheeled, body-powered transport device that is not copyright, so isn't a proper noun, but almost should be.
- 1 - Cremlingen - wikt:deestablishment - "Until its deestablishment in 1974"
- 1 - Cybermind - wikt:asence - unsure. A made-up word by one author about the presence or lack there-of of people online. Not in common usage. Probably just {notatypo} it and call it day? Elfabet (talk)
- 1 - Current (mathematics) - wikt:comass - unsure. Mathematical term? Defined in article? -- seems to be 'co-mass' Jkgree (talk) 15:46, 20 February 2019 (UTC)
- 1 - Crime in Iran - wikt:toumans - "amounts to 10 trillion toumans a year (1 touman equals 10 rials)"
- 1 - Cernach mac Fergusa - wikt:subsept: a subdivision of a tribe. Legitimate use.
- 1 - Chaos (genus) - wikt:uroid: a subcellular portion of an amoeba.
- 1 - Conulariida - wikt:conulate: seems like a legitimate word, although restricted to science.
- 1 -
Coregonus maraena - wikt:whitfish:appears to be spelled correctlybobdog54 (talk) 19:44, 14 December 2018 (UTC) - 4 - Coat of arms of Barcelona - wikt:paletts, wikt:paletts, wikt:fomer, wikt:paletts: diminutive of Pale (heraldry)?
- 1 - Coat of arms of the London Borough of Hammersmith and Fulham - wikt:pomels: a heraldic term.
- used to exist but got deleted with the reasoning "ME not ModE" which I do not understand --Xurizuri (talk) 03:14, 8 January 2021 (UTC)
- The edit summary means "[this word only exists as] Middle English not Modern English", though the entry shouldn't have been deleted, just the language header/codes should have been changed from presenting it as modern English to Middle English. It may, in fact, also exist in modern English in heraldic descriptions, in which case even the modern English entry could be restored, but for now I've at least restored it as Middle English. -sche (talk) 21:53, 12 April 2021 (UTC)
- used to exist but got deleted with the reasoning "ME not ModE" which I do not understand --Xurizuri (talk) 03:14, 8 January 2021 (UTC)
- 1 - Coat of arms of the Prince of Asturias - wikt:bordured: a heraldic term - bordure
- 2 - Coat of arms of the Valencian Community - wikt:paletts, wikt:paletts: diminutive of Pale (heraldry)? - i.e., "pallets" (usual spelling)
- 1 - Ciconiae Nixae - wikt:regionaries: seems legit (plural).
- 1 - Cox Green, Tyne and Wear - wikt:coccs: Old English cocc, crest of a hill.
- 1 - Coagulin - wikt:proxins: a type of protein.
- 1 - Craver Farmstead - wikt:skipples - "In the 1790 lease The Millers agreed to a yearly rent of 24 1/2 skipples of winter wheat"
- Merriam-Webster says this an alternate spelling of wikt:schepel, a Dutch unit -- Beland (talk) 20:56, 14 September 2019 (UTC)
- 1 - Champion (apple) - wikt:shampion: alternative spelling of the Champion type of apple.
- 1 - Chest (furniture) - wikt:wakis short for "wagon-kist".
- 2 - Chester Zoo - wikt:mantellas, wikt:mantellas: plural of mantella, a type of frog.
- 2 - Cheuksin - wikt:jesas, wikt:jesas: a type of Korean ritual.
- 1 - Chhurpi - wikt:durkha: a Nepali type of cheese.
- 1 - Chimantaea - wikt:paramoid: correctly used, according to the page, meaning "like paramo."
- 1 - Chorus line - wikt:twirlies: a term used for girls in a chorus line.
- 2 - Christopher O'Hare - wikt:cremain: short for cremated remains.
- 1 - Chub (disambiguation) - wikt:chubbing: a legislative discussion among several members to waste time and/or block action. Similar to filibustering.
- 2 - Chung Do Kwan - wikt:guep: a level in Tang Soo Do, a Korean martial art.
- 1 - Church of St. James (Brno) - wikt:flanning: architectural term meaning "the internal splay or bevel of a window-jamb."
- 2 - Coat of arms of Nuuk - wikt:siminar, wikt:siminar: the name of a type of building in Nuuk.
- 3 - Cobbler (food) - wikt:cobeler, wikt:sonker, wikt:sonker. Cobeler explains the etymology of cobbler. Sonker is a local North Carolina variation, a cross between a cobbler and a pie.
- 1 - Cipriani Potter - wikt:valzers: in the title of a piece written for piano.
- 1 - Cornish currency - wikt:dynar: an old Cornish currencey.
- 1 - Cornish jack - wikt:labeos: a type of fish (plural).
- 1 - Creedmoor Branch - wikt:demapped - "finally being torn up and demapped in the early 1970s."
- 1 - Chain conveyor - wikt:multiflexing
- 1 - Charles Dallas - wikt:mcrt: it's an abbreviation but I don't know for what. (It even has a period.)
- 1 - Cinema of the United States - wikt:photogenia: nots sure if it's a legit word. It's used to mean "the desire to make everything photogenic for social media impact".
- 2 - Cigu Niru - wikt:nirus: transliteration of a Chinese word (plural) of an army unit.
- 1 - Cleobury Mortimer - wikt:clifu: an Old English word, meaning a steep place.
F[edit]
J[edit]
- 1 -
Joadja, New South Wales - wikt:oilworks: seems legit. - 1 - JotForm - wikt:esigning: short for "electronic signing" like email for "electronic mail".
- 1 -
John Austin Victoreen - wikt:otometry:an old but legitimate word. Not optometry.
P[edit]
Y[edit]
- 1 - Yacambú National Park - wikt:caramerudo - this is the common name used in Venezuela for Odocoileus virginianus deer (see here) - I'm not sure what to do with it. DferDaisy (talk) 15:55, 4 August 2018 (UTC)
By word[edit]
- wikt:degradated - Southern American English
- 9 - wikt:lavwa - Bélé, Chanté mas, Chouval bwa, Music of Dominica, Music of Martinique ... find all →means "voice" in several; Caribbean creoles.
- wikt:vlně - Czeck (wool)
- wikt:vlne - Slovak
- According to User:Palmyrah on Template talk:Which lang#Patna, used in Horton Plains National Park:
- wikt:patna, (Sri Lankan) English, noun: a plain or, more usually, a hillside covered with patna grass
- wikt:patna grass - a particular kind of grass
- Possible etymology: a similar kind of grass grows in Patna, India, and brooms made from it are used all over India
- wikt:patana, Sinhalese, noun: patna
- Possible etymology: English patna
- patna and patana are both in wikt, but not with these meanings
- wikt:subdeletion - Comparative#Comparative subdeletion, [1]
- 28 (down from 45) - wikt:groundcolour - Anaxyrina cyanopa, Asura euprepioides, Carposina maritima, Dioryctria caesirufella, Euchromius aris ... find all
- 32 (down from 39) - wikt:āgamas - Anekantavada, Antakrddaasah, Anuttaraupapātikadaśāh, Aupapatika, Bhairava ... find all - see agama. It is normal to add s to pluralize many Sanscrit words. Johnbod (talk) 12:22, 29 January 2020 (UTC)
- 26 - wikt:zeitlose - Count of St. Germain, Martin Werhand, Martin Werhand Verlag, St. Germain (Theosophy) ... find all
Most common non-English, missing from English Wiktionary[edit]
These words are commonly found in English Wikipedia, are present in a non-English Wiktionary, but are missing from English Wiktionary. Word counts are from English Wikipedia. This is a special report from the 2019-08-20 dump.
- 106 - wikt:стрелковая - 109th Rifle Division (Soviet Union), 10th Guards Motor Rifle Division, 114th Rifle Division (Soviet Union), 121st Guards Rifle Division, 137th Rifle Division (Soviet Union) ... find all
- 101 - wikt:īn - Ab Barik-e Sofla, Kermanshah, Ab Garmak-e Sofla, Khuzestan, Abbasabad-e Moin, Abd al-Mu'in ibn Musa'id, Alhashem-e Sofla ... find all
- 80 - wikt:ει - Ancient Greek verbs, Attic Greek, Axiotta, Celtic deities, Cernunnos ... find all
- 67 - wikt:pasangan - Ba (Javanese), Ca (Javanese), Da (Javanese), Dha (Javanese), Ga (Javanese) ... find all
- 52 - wikt:književnosti - August Kovačec, Bogdan Popović, Borisav Stanković, Božidar Petranović, Bratoljub Klaić ... find all
- 48 - wikt:jazyka - 1818 in literature, Aleš Klégr, Andrey Korolev, Andrey Zaliznyak, Bohemian ... find all
- 46 - wikt:изд - Albena Stambolova, Boris Koyalovich, Church of St Demetrius, Boboshevo, Demyanka River, Fedor Kapelyush ... find all
- 42 - wikt:гвардейская - 100th Guards Rifle Division, 104th Guards Airborne Division, 10th Guards Motor Rifle Division, 10th Guards Uralsko-Lvovskaya Tank Division, 121st Guards Rifle Division ... find all
- 40 - wikt:eskadrila - 122nd Hydroplane Liaison Squadron, 461st Light Combat Aviation Squadron, 462nd Light Combat Aviation Squadron, 463rd Light Combat Aviation Squadron, 464th Light Combat Aviation Squadron ... find all
- 38 - wikt:tunjos - Aguazuque, Bacatá, Eastern Hills, Bogotá, El Dorado, Epítome de la conquista del Nuevo Reino de Granada ... find all
- 38 - wikt:serambi - Al-Wustho Mangkunegaran Mosque, Grand Mosque of Bandung, Great Mosque of Banten, Great Mosque of Malang, Great Mosque of Surakarta ... find all
- 36 - wikt:ουκ - Codex Vaticanus 2061, Lectionary 12, Lectionary 225, Lectionary 240, Matthew 27:6 ... find all
- 31 - wikt:ombiasy - Andrianampoinimerina, Antambahoaka, Antemoro people, Bara people, Culture of Madagascar ... find all
- 31 - wikt:kozane - Dō (armour), Japanese armour, Kimura Shigenari, Lamellar armour, Scale armour ... find all
Mineral words[edit]
Several pages with lists of minerals are showing up as some of the pages with the most detected typos. Below is a list of words from these pages. I'm pretty sure some of them are misspelled, so they all require verification. I don't see anything in wikt:Wiktionary:CFI that would exclude these names; some but not all of them are IUPAC systematic. We could also add Wikipedia stubs or redirects as needed if Wiktionary doesn't want them. -- Beland (talk) 15:36, 30 May 2019 (UTC)
- @Beland: Wiktionary does want them. We just haven't gotten around to them, as there are tens of thousands of terms along these lines. BD2412 T 01:04, 29 March 2021 (UTC)
- Problem is the extremely rare ones, or terms only ever used on Wikipedia. These are unwanted on Wiktionary. There are an infinite number of possible chemical names, so there are some criteria for inclusion. Graeme Bartlett (talk) 05:18, 9 April 2021 (UTC)
- wikt:aluminodecaoxotrisilicate
- wikt:aluminodecaoxytetrasilicate
- wikt:aluminodisilicate
- wikt:aluminohexaoxodisilicate
- wikt:aluminohexaoxosilicate
- wikt:aluminotetraoxosilicate
- wikt:aluminotrisilicate
- wikt:alumotrisilicate
- wikt:berylloalumotrisilicate
- wikt:chloro-potassichastingsite
- wikt:decaoxodihydroxy
- wikt:decaoxotetrasilicate
- wikt:decaoxotriphosphate
- wikt:decaoxotrisilicate
- wikt:decaoxydihydroxy
- wikt:dialuminiosilicate
- wikt:dialuminoctaoxodisilicate
- wikt:dialuminodecaoxodisilicate
- wikt:dialuminodisilicate
- wikt:dialuminohexasilicate
- wikt:dialuminopentaoxosilicate
- wikt:dialuminotrisilicate
- wikt:dialumodisilicate
- wikt:diboro
- wikt:dihydroxoarsenate
- wikt:dihydroxophosphate
- wikt:dihydroxotellurate
- wikt:dioxoarsenate
- wikt:dioxoborate
- wikt:dioxochloride
- wikt:dioxodiarsenate
- wikt:dioxodichloride
- wikt:dioxodifluorine
- wikt:dioxodiphosphate
- wikt:dioxodiselenite
- wikt:dioxofluorine
- wikt:dioxohydroxy
- wikt:dioxophosphate
- wikt:dioxoselenite
- wikt:dioxosilicate
- wikt:dioxosulfate
- wikt:dioxotetrasulfate
- wikt:dioxotriarsenate
- wikt:dioxotriphosphate
- wikt:dioxydecahydroxy
- wikt:dioxydifluorine
- wikt:dioxydihydroxy
- wikt:dioxydodecahydroxy
- wikt:dioxyhydroxy
- wikt:diREE
- wikt:disulfa
- wikt:disulfarsenide
- wikt:ditetraoxosilicate
- wikt:docosahydroxy
- wikt:docosaoxide
- wikt:docosaoxotetrasilicate
- wikt:dodecahydroxy
- wikt:dodecaoxotetrasilicate
- wikt:dodecaoxotrisilicate
- wikt:dodecaoxychloride
- wikt:dodecaoxytetrasilicate
- wikt:fluoro-potassichastingsite
- wikt:fluoro-potassicrichterite
- wikt:henicosahydrate
- wikt:heptaicosahydrate
- wikt:heptaicosaoxodisilicate
- wikt:heptaoxodivanadate
- wikt:heptaoxopentaborate
- wikt:heptaoxosilicate
- wikt:heptasilicon
- wikt:heptasulfadiarsenide
- wikt:heptawater
- wikt:hexaaluminotetraicosaoxohexasilicate
- wikt:hexacontaoxide
- wikt:hexahydrogen
- wikt:hexahydroxide
- wikt:hexaicosahydroxy
- wikt:hexaoxodiborate
- wikt:hexaoxodisilicate
- wikt:hexaoxopentaborate
- wikt:hexaoxtellurate
- wikt:hexaoxydihydroxy
- wikt:hexasulfa
- wikt:hexatricontahydrate
- wikt:hydrodioxoarsenate
- wikt:hydroheptaoxide
- wikt:hydrohexaoxodisilicate
- wikt:hydrophosphate
- wikt:hydrotrioxosilicate
- wikt:hydroxoarsenate
- wikt:hydroxophosphate
- wikt:hydroxyarsenate
- wikt:hydroxyhexaoxodisilicate
- wikt:hydroxypentaoxide
- wikt:hydroxytriborate
- wikt:hydroxytridecaoxodisilicate
- wikt:hydroxytrioxosilicate
- wikt:icosahydrate
- wikt:icosalead
- wikt:icosaoxide
- wikt:icosaoxo
- wikt:icosaoxooctasilicate
- wikt:icosaoxopentasilicate
- wikt:nonadecaoxoctasilicate
- wikt:nonaoxodiarsenate
- wikt:nonaoxodiborate
- wikt:nonaoxodivanadate
- wikt:nonaoxohexaborate
- wikt:nonaoxopentaborate
- wikt:nonaoxosilicate
- wikt:nonaoxotetravanadate
- wikt:nonaoxotrisilicate
- wikt:nonaoxyhydroxytetrasilicate
- wikt:octadecaoxide
- wikt:octadecaoxoheptasilicate
- wikt:octadecaoxohexasilicate
- wikt:octadecaoxopentasilicate
- wikt:octaoxodiborodisilicate
- wikt:octaoxoicosahydroxy
- wikt:octaoxopentaborate
- wikt:octaoxotetraborate
- wikt:octaoxotetrasilicate
- wikt:octaoxotrisilicate
- wikt:octaoxotritellurate
- wikt:octaoxydihydroxy
- wikt:octasulfa
- wikt:octasulfadiantimonide
- wikt:octatelluride
- wikt:octatriacontaoxide
- wikt:octauranyl
- wikt:oxoarsenate
- wikt:oxocarbonate
- wikt:oxochromate
- wikt:oxodecachloride
- wikt:oxodiarsenate
- wikt:oxodiborate
- wikt:oxodiphosphate
- wikt:oxodisulfate
- wikt:oxodisulfide
- wikt:oxohydrophosphate
- wikt:oxosulfate
- wikt:oxotetraoxosilicate
- wikt:oxotrisulfate
- wikt:oxydihydroxy
- wikt:oxydinitride
- wikt:oxyhydroxy
- wikt:oxyphosphate
- wikt:oxytrivanadate
- wikt:pentadecaoxohexasilicate
- wikt:pentaicosahydro
- wikt:pentaicosamanganese
- wikt:pentaoxodiarsenate
- wikt:pentaoxodiborate
- wikt:pentaoxodisilicate
- wikt:pentaoxotellurate
- wikt:pentaoxotetraborate
- wikt:pentaoxotrivanadate
- wikt:pentaoxoundecaborate
- wikt:pentasulfa
- wikt:pentatetracontaoxooctadecasilicate
- wikt:polytypoids
- wikt:potassic-aluminosadanagaite
- wikt:potassic-aluminotaramite
- wikt:potassicarfvedsonite
- wikt:potassic-chloropargasite
- wikt:potassic-ferrisadanagaite
- wikt:Potassic-jeanlouisite
- wikt:potassium-fluorrichterite
- wikt:protoferro-anthophyllite
- wikt:proto-ferro-suenoite
- wikt:stannotrisilicate
- wikt:stewardite
- wikt:sulfantimonide
- wikt:surkhobite
- wikt:tengerite
- wikt:tetrabismuthide
- wikt:tetradecaborate
- wikt:tetradecalead
- wikt:tetradecaoxopentasilicate
- wikt:tetrahydroxoarsenate
- wikt:tetraicosaoxide
- wikt:tetraicosaoxodecasilicate
- wikt:tetraicosaoxotrisilicate
- wikt:tetraluminotetrasilicate
- wikt:tetraoxoarsenate
- wikt:tetraoxoborate
- wikt:tetraoxodichloride
- wikt:tetraoxodiphosphate
- wikt:tetraoxodisulfate
- wikt:tetraoxogermanate
- wikt:tetraoxomolybdate
- wikt:tetraoxoselenate
- wikt:tetraoxosulfate
- wikt:tetraoxotellurate
- wikt:tetraoxotetraphosphate
- wikt:tetraoxovanadate
- wikt:tetraoxozincate
- wikt:tetraoxy
- wikt:tetraoxysilicate
- wikt:tetraoxytetrabismuth
- wikt:tetraoxytriborate
- wikt:tetraselenite
- wikt:tetrastannide
- wikt:tetrasulfa
- wikt:tetrawater
- wikt:triacontaoxide
- wikt:triacontaoxoctasilicate
- wikt:triacontaoxydodecasilicate
- wikt:trialuminotrisilicate
- wikt:triberylohexasilicate
- wikt:triborododecasilicate
- wikt:tridecaoxoditellurate
- wikt:tridecaoxoheptaborate
- wikt:tridecasulfa
- wikt:trihydronium
- wikt:triicosaoxotetrasilicate
- wikt:trilithiododecasilicate
- wikt:trioxoarsenate
- wikt:trioxoborate
- wikt:trioxosilicate
- wikt:trioxotellurate
- wikt:trioxotriborate
- wikt:triREE
- wikt:trisulfa
- wikt:triwater
- wikt:undecaoxoheptasilicate
- wikt:undecaoxohexahydrohexaborate
- wikt:undecaoxotetrasilicate
- wikt:undecaoxotitanotetrasilicate
- wikt:zircono
Incorrect or rare mineral words[edit]
- wikt:aluminoctaoxotrisilicate try aluminum trisilicate octaoxide
- wikt:docosatantalum → valid but too rare for wiktionary
- wikt:hexaoxy → needs changing
- wikt:heptatelluride → too rare for wiktionary but appears valid
- wikt:undecalead → too rare for Wiktionary
Needs Wikipedia article instead?[edit]
- 2 - Anacron - wikt:cronie, wikt:cronie
- 2 - Carpathian Large Carnivore Project - wikt:cntours, wikt:cntours - redlinked company in Romania needs an article.
- 1 - Club Penguin (franchise) - wikt:puffles: plural of a type of character in an online game.
- 33 hexipentisteriruncicantitruncated - a nest of specialized geometrical form names; what to do? → since this is a part of several compound names, it may need a set index or disambig page. If it has use in books, it could go in Wiktionary, but Wikipedia seems to be the source of these geometric terms.
- the articles it's currently (Jan 2021) used in are: Uniform 9-polytope, A8 polytope, Uniform 8-polytope, B8 polytope, Hexicated 7-cubes, Hexicated 7-simplexes. There's also Hexipentisteriruncicantic 7-cube which redirects to a section in (and is a form of) Hexic 7-cubes which is a "convex uniform 7-polytope". Hexipentisteriruncicantic itself has a few pages that it's in: Uniform 8-polytope which is in the other list, D7 polytope, Uniform 7-polytope. And let me just say, what the hell does any of this mean. So hopefully, any of those help with figuring out what to do. --Xurizuri (talk) 12:07, 21 January 2021 (UTC)
- wikt:boxel and wikt:boxels - used and cursorily defined on 2.5D (visual perception). -- Beland (talk) 00:05, 9 April 2021 (UTC)
Possible typos by length[edit]
Longest or shortest in certain categories are shown, sometimes just for fun and sometimes because they form a useful group. Please use strikethrough (or leave a note) for this section rather than removing lines, to avoid repeating work done while the dumps were being processed. Thanks!
Likely chemistry words[edit]
(updated from 2021-03-20 dump)
These need to be checked by a chemist and marked {{not a typo}}.
- 84 - wikt:trans-2-hydroxyisoxypropyl-3-hydroxy-7-isopentene-2,3-dihydrobenzofuran-5-carboxylic - Cāng zhú
- 79 - wikt:d-1,2,3,9,10,10a-hexahydro-6-methoxy-11-methyl-4h-10,4a-iminoethano-phenanthren - Controlled Drugs and Substances Act
- 73 - wikt:d-1,2,3,9,10,10a-hexahydro-11-methyl-4h-10,4a-iminoethanophenanthren-6-ol - Controlled Drugs and Substances Act
- 72 - wikt:l-11-allyl-1,2,3,9,10,10a-hexahydro-4h-10,4a-iminoethanophenanthren-6-ol - Controlled Drugs and Substances Act
- 70 - wikt:n-2'-hydroxyoctadecanoyl-2-amino-9-methyl-4,8-heptade-cadiene-1,3-diol - Ramaria botrytis
- 69 - wikt:n-methyl-l-alanyl-l-leucyl-n-methyl-trans-dehydrophenyl-alanyl-glycyl - Tetrapeptide
- 66 - wikt:dihydrofuran-2,5-dione,3β-hydroxy-5α,8α-epidioxyergosta-6,22-diene - Russula densifolia
- 65 - wikt:dl-1,2-anhydro-4,5-o-cyclohexylidene-1,2,3/4,5-cyclopentanepentol - 1,2,3,4,5-Cyclopentanepentol
- 63 - wikt:uridine-5'-diphospho-n-acetyl-2-amino-2-deoxy-3-o-lactylglucose - UDP-N-acetylmuramate dehydrogenase
- 59 - wikt:octachloro-3a,4,7,7a-tetrahydro-4,7-methanoindene-1,8-dione - Hexachlorocyclopentadiene
- 59 - wikt:dihydroxy-21-oxa-21-chloromethylpregna-1,4-diene-3,20-dione - List of corticosteroids
- 59 - wikt:cis-5,6-dihydroxy-4-isopropylcyclohexa-1,3-dienecarboxylate - 2,3-dihydroxy-2,3-dihydro-p-cumate dehydrogenase
- 58 - wikt:decahydro-10-methoxy-3,6,9-trimethyl-3,12-epoxy-12h-pyrano - Artemether
- 58 - wikt:anti-7β,8α-dihydroxy-9α,10α-epoxy-7,8,9,10-tetrahydrobenzo - Benzo(j)fluoranthene
- 56 - wikt:hydroxy-17α,21-dimethyl-19-norpregna-4,9-dien-3,20-dione - Trimegestone
- 54 - wikt:s-adenosyl-l-methionine:3-hexaprenyl-4,5-dihydroxylate - Hexaprenyldihydroxybenzoate methyltransferase
- 54 - wikt:cis-11,12-dichloro-9,10-dihydro-9,10-ethano-2-anthroic - Field effect (chemistry)
- 52 - wikt:hexahydro-1,3-dimethyl-4-phenylazepine-4-carboxylate - Controlled Drugs and Substances Act
- 52 - wikt:hexahydro-1,2-dimethyl-4-phenylazepine-4-carboxylate - Controlled Drugs and Substances Act
- 52 - wikt:cis-2-methyl-4-trimethylammoniummethyl-1,3-dioxolane - Dioxolane
Probable DNA sequences[edit]
If you're sure this is a DNA or RNA sequence, tag it {{DNA sequence}}.
(All fixed; waiting for 2021-01-20 dump!)
Chemical formulas[edit]
Chemical formulas should be written with subscripts or {{chem2}}.
Chemical formulas that use Unicode subscripts (which is against MOS:SUBSCRIPT) will be detected automatically by moss_entity_check.py.
Chemical formulas that use <sub>...</sub>
are allowed by MOS:CHEM, but may show up in the main typo listings above. They can be converted to use {{chem2}} to be accepted by the spell checker.
Articles with a large number of chemical formulas triggering the spell checker are listed here (updated from 2020-03-20 dump):
- 851 - Classification of non-silicate minerals - wikt:Nb,Ta, wikt:Fe,Ni, wikt:Fe,Ni, wikt:Ni,Fe, wikt:Fe,Ni, wikt:Ni,Fe, wikt:Au,Ag, wikt:Fe,Ni, wikt:Fe,Ni, wikt:Fe,Ni, wikt:Ir,Os, wikt:Ru,Pt, wikt:Rh,Pt, wikt:Pd,Pt, wikt:Os,Ir, wikt:Ru,Ir, wikt:Ir,Os, wikt:Fe,Os, wikt:Ru,Ir, wikt:Mo,Ru, wikt:Fe,Ir, wikt:Ni,Fe, wikt:Pt,Pd, wikt:Fe,Cu, wikt:Pt,Pd, wikt:Pd,Pt, wikt:Bi,Pb, wikt:Fe,Ni...
- 272 - Classification of silicate minerals - wikt:Al,Fe3, wikt:Al,Fe3, wikt:Na,Ce, wikt:Ca,Ce, wikt:OH,Cl, wikt:OH,H2O, wikt:Fe,Mn, wikt:OH,H2O, wikt:Mn,Sr, wikt:Na,Ca, wikt:Na,H3O, wikt:Ca,Mn...
- Several "List of minerals" articles currently posted at Wikipedia:Typo Team/moss/L
- 90 - Nickel compounds - wikt:hexaaquanickel(II), wikt:Ni(H, wikt:Ni(NH, wikt:Nickel(IV), wikt:Ni(N, wikt:Ni(BF, wikt:Ni(AsF, wikt:Ni(SbF, wikt:Ni(BiF, wikt:Ni(AsF, wikt:Ni(SbF, wikt:Ni(ICl, wikt:Ni(N, wikt:Ni(N, wikt:Ni(NH...
- 80 - Silicate mineral - wikt:Hf,Zr, wikt:Mg,Fe, wikt:Fe,Mg, wikt:Ca,Fe, wikt:Al,Fe, wikt:Ce,La, wikt:FeII,FeIII, wikt:Mg,Fe, wikt:Fe,Mn, wikt:Na,Ca, wikt:Al,Li, wikt:Al,Fe, wikt:Fe,Mg, wikt:Al,Fe, wikt:Si,Al, wikt:Mg,Fe...
- 67 - N-heterocyclic silylene - [[wikt:tBuN]Si]], [[wikt:tBuN]Si]], [[wikt:tBuN]Si]]... wikt:Ru(MeCN), wikt:RuCl(MeCN), wikt:NHSi)OTf...
- 50 - Phosphor - wikt:Ca,Sr, wikt:Cu,Mg, wikt:Cu,Al, wikt:Zn,Cd...
- 33 - Silicon - wikt:Mg,Fe, wikt:Mg,Fe, wikt:SiH(OMe), wikt:Si(OMe)...
- 31 - Pyroxene - wikt:Si,Al, wikt:Mg,Fe, wikt:Mg,Fe, wikt:Mg,Fe...
- 28 - Thorium - wikt:OH,Cl, wikt:Ca,Fe, wikt:ThO(OH, wikt:Cl)H...
I'm refining the below report to be more useful; will probably be updated in a few days. -- Beland (talk) 03:17, 1 April 2021 (UTC)
Chemical formulas that don't use subscripts (which is incorrect notation) are listed below. These are found with:
grep -P ' ((H|He|Li|Be|B|C|N|O|F|Ne|Na|Mg|Al|Si|P|S|Cl|Ar|K|Ca|Sc|Ti|V|Cr|Mn|Fe|Co|Ni|Cu|Zn|Ga|Ge|As|Se|Br|Kr|Rb|Sr|Y|Zr|Nb|Mo|Tc|Ru|Rh|Pd|Ag|Cd|In|Sn|Sb|Te|I|Xe|Cs|Ba|La|Ce|Pr|Nd|Pm|Sm|Eu|Gd|Tb|Dy|Ho|Er|Tm|Yb|Lu|Hf|Ta|W|Re|Os|Ir|Pt|Au|Hg|Tl|Pb|Bi|Po|At|Rn|Fr|Ra|Ac|Th|Pa|U|Np|Pu|Am|Cm|Bk|Cf|Es|Fm|Md|No|Lr|Rf|Db|Sg|Bh|Hs|Mt|Ds|Rg|Cn|Nh|Fl|Mc|Lv|Ts|Og|R)([2-9]|[1-9][0-9]))+$' debug-spellcheck-ignored.txt | head -100
These should be converted to use {{chem2}}. If there is a suitable target, it would also be nice to create a redirect that is tagged with {{R from molecular formula}}. (A redirect is also a good way to clear sequences that aren't actually chemical formulas.)
TODO: Search for incorrect instances of chemical formulas listed in Category:Molecular formulas.
(Updated from 2021-01-01 dump.)
- 78/28 - C6F5
- 76/50 - Nb6 → Chess notation
74/35 - H3K4- 71/44 - Nh5 → Chess notation
- 68/38 - Rf8 → Chess notation
- 52/25 - Nb3 → Chess notation
- 46/29 - Nb5 → Chess notation
- 44/15 - Fe4S4
- 36/20 - Rh8
36/13 - Si8O22- 30/11 - Si2O6
- 27/13 - Y32
- 25/15 - Y30
- 22/12 - Y33
- 21/19 - Si3O9
- 21/17 - Rg5 → chess notation
- 19/10 - Cu6
- 18/7 - Y34
17/7 - H4K16- 17/17 - K52
17/17 - F6F6F6- 17/17 - C5H7O2
- 17/16 - V82
- 16/7 - Si6Al2
- 16/1 - Ti3C2
- 16/16 - No8
- 15/9 - Si6O18
15/7 - H3K79- 15/6 - Y36 → bus routes, page number, database reference, mutation
- 15/15 - Si9O27
- 14/9 - V31
- 13/9 - V51
- 13/8 - Y13
- 13/8 - W61
- 13/6 - Y93
13/4 - Ga2S3- 13/13 - K65
13/12 - C6H1113/11 - H4K20- 12/7 - V69
12/4 - Al2Br612/2 - H3K1412/10 - Al2Cl6- 11/9 - K54
- 11/7 - Mn5
11/5 - B6N2→ Japanese bomber type- 11/1 - V3I2
- 11/10 - Y95
- 10/9 - Si4O11
- 10/7 - Na7
10/7 - H4K12- 10/7 - C8H17
- 10/6 - Th9
- 10/6 - Pr6O11
10/5 - Rb9O210/5 - H4K510/5 - Bi2Se310/4 - In2S3- 10/10 - R88
9/8 - W439/8 - V58- 9/8 - V53
9/8 - O2C5H7- 9/8 - C2R2
- 9/8 - Al2Si2O5
- 9/6 - Mg3Al2
9/6 - C2B10H12- 9/4 - Fe7C3
9/3 - Tb4O7- 9/1 - V2I3 → stands for volume 2 issue 3; this is a pattern found in dois
- 9/1 - Si25O73
- 8/8 - V79
- 8/8 - K49
- 8/8 - Cu5
- 8/7 - V76
- 8/6 - H56
8/6 - Dy2Ti2O7- 8/5 - V67
- 8/3 - Si2H2
- 8/3 - C6R6
- 8/2 - Al2Si4O12
- 8/1 - V3R5
- 7/7 - Si4O10 → only parts of formula -- false positive?
- 7/7 - Pb4
- 7/7 - O35
- 7/6 - Na8
7/6 - K2Mg2- 7/6 - In20
- 7/6 - Bh8
- 7/5 - W93
- 7/5 - R94
7/5 - Cu31S167/4 - Cs11O37/3 - Sn6O47/3 - S50B30→ BMW engine7/3 - Li6→mostly isotope of lithium, but LI6 something to do with Buick engines7/3 - H2W12O42- 7/3 - Dy4 → Dy4 Systems Inc. a Canadian company probably notable based on how many times it is mentioned; DY4 also used for designations of minor planets
7/3 - Cu6Sn5
Repeating patterns[edit]
For rhyme schemes, they probably need to be re-styled to follow Wikipedia:WikiProject Poetry#Style for rhyme schemes. If this ends up making them all-caps, they won't show up here on the next run. For mixed-case rhyme scheme notations, use {{not a typo}} after making sure dashes, commas, and spaces follow the recommended style.
(2021-03-20 dump all fixed; waiting for 2020-04-20 dump.)
For Beland todo[edit]
- Rhyme scheme hunting:
- Sync style for articles in Category:Stanzaic form and Category:Rhyme and add to rhyme scheme list if appropriate.
- Sync annotation style for articles that mark up poems line-by-line (use tables, not column divs or parens)
- Manually search for patterns like:
- a-b-a-b-a-b-c-c
- AB,CD,AB (internal rhyme)
- "aa", "ab", "aaa", "aab", "aba", "abb", "abc", "aaaa", "aaba", "aabb", "aabc", "abaa", "abab", "abba", "abca", "abcb", "abcc", "abcd" - probable rhyme sequences where there's an article present so it's not detected as a misspelling
False positives[edit]
Is there a word that is correctly used in an article, but which shouldn't be added to Wiktionary? List it here, and Beland will fix the problem.
Archived solutions: Wikipedia:Typo Team/moss/Archive
- wikt:singer(s), wikt:composer(s), etc. Found in Kanto (music).
False negatives[edit]
Is there a misspelled word in an article mentioned here that was not reported? Feel free to list it below and Beland will try to improve the code if appropriate.
These are currently over-ignored, but could be used to suggest correct spellings:
- Wikipedia articles with {{R from misspelling}}, {{R from incorrect name}}, {{R from miscapitalisation}}, and redirects to these templates
- Wiktionary entries that are known misspellings (e.g. wikt:anticiliary)
- In cases where there are variant spellings of the same word or phrase, Wikipedia should probably pick one and stick to it except to mention the variants. This happens with:
- Compound words - whether to use a space, dash, or nothing, as in "junebug" vs. "june bug" or "email" vs. "e-mail".
- Words with multiple transliterations from another language (often there are multiple systems, no particular system, or a modern system different from historical systems).
- Redirects with {{R from alternate spelling}} and redirects to that template.
- Article Ana Recio Harvey | detected misspelling: appoinment | additional, undetected misspelling: enterpreneur
- Looks like this was because of redirects with "enterpreneur" in the title. I have tagged them all {{R from misspelling}}, but I'll have to change the code to ignore those, as noted above. Thanks for catching that! -- Beland (talk) 23:52, 18 October 2018 (UTC)
- 1 - Jack Beckitt - wikt:monacled -> also had "whow" in place of who --Xurizuri (talk) 05:10, 5 February 2021 (UTC)
- 1 - Jack Jenkins (rugby player) - wikt:scummage -> "forst" instead of "first" --Xurizuri (talk) 05:43, 5 February 2021 (UTC)
- 1 - Johan Christian Drewsen - wikt:cultication -> "Rogether" instead of "Together", at the start of a sentence. "Copenahgen" instead of "Copenhagen". They obviously didn't get picked up because of capitalisation, but thought I'd list them here anyway just in case it helps. -Xurizuri (talk) 11:09, 13 February 2021 (UTC)
Archived notes[edit]
See Wikipedia:Typo Team/moss/Archive.
Mismatched markup and punctuation[edit]
Errors in punctuation (mostly quotation marks) and wiki markup generally cause confusion for readers, and also prevent the spell checker from running on these articles.
Inches and feet should not use " and ', per Wikipedia:Manual of Style/Dates and numbers#Specific units; use letters instead. (See MOS:UNITS for general guidance.) Where conversions are needed, use {{convert}}, for example: 2 feet 3 inches (69 cm)
WORK IN PROGRESS
- Integrating these with main listings
- Filter only unmatched " for now
- Filter articles with non-ASCII quote marks to a separate list for JWB processing
- Filter \d" and \d' to a separate sublist for inch/feet style conversion
- Explain ✂ or skip snippets showing this
- Bracketbot web UI seems to be down
-- Beland (talk) 19:03, 4 September 2019 (UTC)
Gender-neutral language[edit]
Manned[edit]
The word "manned" and related forms like "unmanned" are used in many articles, but is not gender-neutral as required by MOS:S/HE and the NASA style guide. Gender-neutral alternatives include:
- Crewed, uncrewed
- Staffed, unstaffed
- Human spaceflight
- Defended
Not all instances need to be changed.
- Proper nouns should remain the same, like Manned Orbiting Laboratory
- Titles of sources and quotes should remain unchanged.
- If the term itself is being discussed, for example to say that "manned spaceflight" is another way of saying human spaceflight.
- There seems to be consensus on unmanned aerial vehicle that this and related phrases (like unmanned aerial system) should remain intact, since it is much more frequent than "uncrewed aerial vehicle" at the moment. However, when using Wikipedia's voice it is preferred to describe a UAV as "uncrewed" when not using the whole phrase.
- Non-article pages that are retained for historical interest shouldn't be modified if they won't be visible to readers.
- Redirects with this title should be left alone if they are redirecting readers to a gender-neutral title
If the word is found the names of articles and categories (except those with names directly related to UAVs), those should be renamed, and the links changed. Many articles have already been renamed, and the links just need to be updated. (Remember that to rename a category, all the articles in that category must be edited to change their pointers.)
- Coming soon: moss report on "manned" that ignores references, page titles, proper nouns, and consensus-OK phrases.
- Find all instances of "manned" in articles
- Find all instances of "unmanned" in articles
- Find all instances of "manned" in Wikipedia:, File:, Category:, and Portal: (recommended for advanced editors only)
- Find all instances of "unmanned" in Wikipedia:, File:, Category:, and Portal: (recommended for advanced editors only)
Borderline cases[edit]
These may need to be discussed before being potentially renamed.
These are generic terms, like Human mission to Mars, as opposed to proper names like Manned Orbiting Laboratory. -- Beland (talk) 19:41, 21 May 2019 (UTC)
- Manned Venus flyby - Based on the NASA style guide, NASA probably would now refer to this as "human Venus flyby" but historical sources say "manned Venus flyby" so that's what the majority of editors commenting on the talk page currently favor. There is some question as to whether the scope of the article concerns a specific mission or this type of mission in general, which is related to the proper name exception (but then the title would be "Manned Venus Flyby"). Compare Colonization of Venus and Human mission to Mars. -- Beland (talk) 19:41, 21 May 2019 (UTC)
Objections in specific cases:
Marriage[edit]
Wikipedia:Writing about women § Marriage points out:
- "is the wife of" is less neutral than "is married to" - find all "is the wife of"
- "born to X and his wife Y" is less neutral than "born to X and Y" - approximate search
- "man and wife" is less neutral than "husband and wife", and to be fully neutral the order should be varied - find all "man and wife"
Ladies[edit]
Wikipedia:Writing about women § Girls, ladies prefers "women" to "ladies" except where part of set phrases or traditional titles (like first lady). find all lowercase "ladies"
Instructional and presumptuous language[edit]
MOS:NOTE says to avoid the following phrases when they address the reader directly. Not all instances are problematic, such as those in direct quotations.
- remember that - find all "remember that"
- note that - find all "note that"
- of course - find all "of course"
- naturally - find all "naturally" (the meaning "related to nature" is not problematic)
- obviously - find all "obviously"
- clearly - find all "clearly"
- actually - find all "actually"
- rhetorical questions, especially in headings - find all questions in headings (some cases, like the names of works, are not problematic)
Internationally comprehensible spelling and vocabulary[edit]
MOS:COMMONALITY advises the use of vocabulary and spellings that are shared across national varieties of English, where possible. This section collects instances where an unshared term is being used which could be improved. For proper nouns and direct quotes, a translation or re-spelling into another dialect may be helpful.
- "gaol" should be "jail"
- Disputed, discussion underway at Wikipedia talk:Manual of Style#Gaol vs. jail
Currency style[edit]
Per MOS:CURRENCY:
- For the UK, Irish, Australian, New Zealand, and South African pound, ₤ should be changed to £
- ₤ is OK to use with Italian lira. Changing e.g. ₤100,000 to [[Italian lira|₤]]100,000 will prevent legitimate uses from showing up in automated reports, and also help readers understand that this is not British pounds. (Mentions of Italian lira are increasingly rare because it has been replaced by the Euro.)
Caution: Not all problem pages show up reliably; if you do a search, fix all the pages in the results, and then do another search, you will probably get a fresh batch of problem pages. It may also take a minute or two for fixed pages to disappear from the results, due to lag updating the search index.
Work is in progress on detecting and fixing other MOS-related issues with numbers and currencies.
Small caps[edit]
Per MOS:BCE, smallcaps are not to be used for years like "400 BC". Find all instances of known smallcaps issues...
HTML tags[edit]
Updated from 2021-03-20 dump.
You can do one of two things for these articles:
- Remove, repair, or convert the HTML markup to wiki markup yourself.
- Tag the article {{cleanup HTML}} and it will show up under Category:Articles with HTML markup but not on this list. Use the "tags" parameter to indicate which tags are present on the page; many editors find it hard to locate the offending HTML. For example: {{cleanup HTML|tags=table, cite}}
How to clean up[edit]
See Category:Articles with HTML markup for instructions on how to find the offending tags and what to do about them.
Find all articles by tag[edit]
Can't wait for the next database dump? Want to look for or fix all instances of a specific tag? Use the links below!
- <tt> - find all
- <li>, <ol>, and <ul> - find all
- <table>, <tr>, <td>, <th>, <caption> - find all
- <i> or <em> - find all
- <dd>, <dt>, and <dl> - find all
- <cite> - find all
- <p> - find all
- <strong> and <b> - find all
- <name=> - find all
- </br> - find all
- <hr> and <hr/> - find all
- <font> - find all
- <ins> - find all
- <samp> - find all
- <q> - find all
- <wbr> - find all and find ­
- <ruby>, <rt>, and <rp> - find all
- Elements and attributes obsoleted in HTML 5 have prefab searches linked from Wikipedia:HTML 5
Additional HTML problems are listed at Special:LintErrors.
Sometimes editors use angle brackets (< and >) for other purposes. Though these are not HTML markup, they often need to be fixed.
<<...>> find all can indicate:
- French quotation marks rendered as <<quoted text>>. These should be normalized to "quoted text" or 'quoted text', even in quotations, per MOS:CONFORM.
- A broken citation that should be converted to {{cite web}})
Other weirdness:
- <the> - find all - More French quoting style, bad linking, bad citation style, etc.
- <blockquote> sometimes shows up on the reports if it is capitalized or all-caps on the article page. It should be all lowercase.
Known bad HTML tags (HB)[edit]
These are also included in the main listings.
- 1520 - <tt> - Aufs, Chargaff's rules, Chmod, Cisco IOS, Comparison of file archivers ... find all
- 1508 - <li> - 2007 UST Growling Tigers men's basketball team, 2019 European Parliament election in Romania, 2019 U.S. Open Polo Championship, 2K Sports Classic, Administrative divisions of American Samoa ... find all
- 729 - <td> - 2020–21 Metro Atlantic Athletic Conference men's basketball season, Akiyama Station, Attention (machine learning), Baraki-Nakayama Station, Brenda Lindiwe Mabaso ... find all
- 644 - <i> - Adam Wilson, Alexander Larman, Anne Montgomery (artist), Anocracy, Ascoli Satriano ... find all
- 441 - <em> - 880s BC, Agitu Ideo Gudeta, Alacritty, Albert Sidney Johnston, Algebra of sets ... find all
- 391 - <p> - 2020 NASCAR Cup Series, ASTM A193/A193M, Abradable powder coatings, Alan Walsh (physicist), Ambiguities in Chinese character simplification ... find all
- 307 - <b> - 1982–83 United States network television schedule (daytime), Affinity chromatography, African Romance, Ambedkar Makkal Iyakkam, Anapaest ... find all
- 165 - <tr> - 2020–21 Metro Atlantic Athletic Conference men's basketball season, Akiyama Station, Attention (machine learning), Baraki-Nakayama Station, Brenda Lindiwe Mabaso ... find all
- 157 - <ol> - 2019 European Parliament election in Romania, Adjective, Baire space, Battle of Grand Port, Biarc ... find all
- 135 - <cite> - 5th of December Party, Cape Collinson, Conic Island, David Lewis (philosopher), Evidentiality ... find all
- 103 - <strong> - Altınşehir (Istanbul Metro), Elite Ice Hockey League, Inauguration of José P. Laurel, Join Java, Kings of Israel and Judah ... find all
- 88 - </table> - 2020–21 Metro Atlantic Athletic Conference men's basketball season, Adventure Time (season 7), Adventure Time (season 8), Akiyama Station, Ang Probinsyano (season 1) ... find all
- 63 - <font> - Clockwork (disambiguation), Darwendale, Lefty O'Doul, List of German states by area, Madurai ... find all
- 56 - <hr/> - Al-Asr, Al-Humazah, Al-Ma'un, Al-Masad, Linha do Alentejo ... find all
- 48 - <ins> - Bank Hey, Chameleon, City of Adam, Hattie Jacques, Kaiser Permanente Bernard J. Tyson School of Medicine ... find all
- 36 - <hr> - Lady Molly of Scotland Yard, List of Air Service American Expeditionary Force aerodromes in France, MySQLi, Nonconvex great rhombicosidodecahedron, Octahemioctahedron ... find all
- 15 - <q> - Aberedw railway station, Afonso I of Kongo, Alfred Delvau, Aramaic Uruk incantation, Canonical link element ... find all
- 3 - <wbr/> - Agnosia, List of COVID-19 vaccine authorizations ... find all
- 1 - </br> - Nagarakretagama ... find all
Bad link formatting (HL)[edit]
These are also included in the main listings. Angle brackets are not used for external links (per Wikipedia:Manual of Style/Computing § Exposed URLs); "tags" like <https> and <www> are actually just bad link formatting. See Wikipedia:External links#How to link for external link syntax; use {{cite web}} for footnotes.
- 94 - <https> - 342nd Infantry Division (Wehrmacht), Bentinck family, Cold wave, Cumberland Compact, Diana Gamage ... find all
- 83 - <http> - Bangladesh Agricultural Development Corporation, Charles Arthur Bissonette, Dmitry Gabrilovich, Doris Haddock, Enoch H. Pardee ... find all
- 49 - <http/> - Eakly, Oklahoma, Lachlan Bassett, Lego Mindstorms NXT, List of armories and arsenals in New York City and surrounding counties, List of convention and exhibition centers ... find all
- 37 - <https/> - Alfred Scharf, CHF Entertainment, Carl Niehaus, Hancock County, Georgia, Lachlan Bassett ... find all
- 15 - <www> - List of marinas, Matthew Mbu, Monique Frize, Moorsbus, Nuthampstead ... find all
Unsorted (H)[edit]
Many of these can be replaced by {{var}} (for text to be replaced) or {{angbr}} (e.g. for linguistic notation).
- 38 - <blockquote> - Cyfronydd Hall, Doaksville, Choctaw Nation, Douglas H. Johnston, Edward B. Green (judge), Eula Pearl Carter Scott ... find all
- Replace with {{blockquote}}? moss parsing failures? -- Beland (talk) 01:00, 30 January 2021 (UTC)
- 32 - <ce> - Potassium dichromate, Single displacement reaction ... find all
- 26 - <c> - Bills C-1 and S-1, Cello Concerto No. 1 (Saint-Saëns), Energy development, Fatty alcohol, Lipps–Meyer law ... find all
- 25 - <n> - French phonology, Gaj's Latin alphabet, Global cascades model, Ingrian language, Merionethshire ... find all
- 23 - <e> - Cello Concerto No. 1 (Saint-Saëns), Eawy Forest, Epirote Greek, Grunewald, Heteronym (linguistics) ... find all
- 22 - <k> - African Romance, Distributed lag, Dividend policy, Evolution of a random network, Gompertz function ... find all
- 21 - <t> - Ancient Roman bathing, Anglo-Saxon riddles, Binary heap, Breitenau concentration camp, C++20 ... find all
- 21 - <m> - Certificate (complexity), Godfried-Willem Raes, Itô calculus, Kaingang language, Lee You-cheong ... find all
- 20 - <cr> - Carriage return, HTTP message body, Hayes command set, Hypertext Transfer Protocol, NMEA 0183 ... find all
- 19 - <lf> - HTTP message body, Hayes command set, Hypertext Transfer Protocol, NMEA 0183, On-line Debugging Tool ... find all
- 18 - <d> - Cello Concerto No. 1 (Saint-Saëns), DT-Manie, Dataphor, Division algorithm, Experix ... find all
- 18 - <cabela> - Cabela's Big Game Hunter 2005 Adventures, Cabela's Big Game Hunter 2006 Trophy Season, Cabela's Big Game Hunter: 2004 Season ... find all
- 17 - <l> - Anatolian hieroglyphs, Bello FiGo, Blind deconvolution, Colonia Tovar, Geometric design of roads ... find all
- 17 - <gallery> - August Falise, Cəlilabad, Indian locomotive class WAP-5, Kathua, Keluri ... find all
- 17 - </activision> - Cabela's Big Game Hunter 2005 Adventures, Cabela's Big Game Hunter 2006 Trophy Season, Cabela's Big Game Hunter: 2004 Season ... find all
- 16 - <no> - 430 Space Shuttle, 57th NHK Kōhaku Uta Gassen, Confederate privateer, Cricket statistics, Kinnikuman Muscle Grand Prix ... find all
- 15 - <x> - Davenport chained rotations, Epirote Greek, Ethernet over SDH, Head (Unix), Meroitic language ... find all
- 15 - <number> - Btrieve, Fasti Ostienses, GRAU, Geom raid5, Time control ... find all
- 15 - <encore> - 10th Anniversary Tour Lead Upturn 2012: Now or Never, Lead 15th Anniversary Live Box, Lead Live Tour Upturn 2005, Lead Upturn 2009: Summer Day & Night Fever, Lead Upturn 2010: I'll Be Around ... find all
- 14 - <y> - Andoque language, Elementary function arithmetic, History of Proto-Slavic, Languages of Argentina, Meroitic language ... find all
- 14 - <the> - Ahn Bong-soon, American Dairy Association, Bello FiGo, Castillo del Príncipe (Havana), Fan (surname) ... find all
- 13 - <link> - Ars Magica, GIO General, IELTS Life Skills, Listia, MTELP Series ... find all
- 12 - <string> - Abstract Document Pattern, C++ Standard Library, Control flow, Generic programming, Is-a ... find all
- 12 - <statement> - C syntax, DG/L, Statement (computer science), XML for Analysis, Zahn's construct ... find all
- 12 - <pv> - Bamboo Collage, Love Paradox, Softly (song), Sympathy (Hitomi Takahashi album), Vanilla (Leah Dizon song) ... find all
- 12 - <operate> - Keihin Kyuko Bus ... find all
- 11 - <o> - Alias (TV series), Epirote Greek, Heteronym (linguistics), Immortal Beloved, Ingrian language ... find all
- 11 - <j> - County Kilkenny, Fast wavelet transform, Glenmore, County Kilkenny, John (given name), Mo Bangfu ... find all
- 11 - <ch> - Basel German, Cuban Spanish, New Rumi Spelling, Nivaclé language, Old Saxon Baptismal Vow ... find all
- 9 - <interlude> - Hall Tour 2014: Bon Voyage, Live Tour 2007: Black Cherry, Live Tour 2015: Walk of My Life ... find all
- 9 - <filename> - Cross File Transfer, Data source name, Ddoc, Leet (programming language), PowerHouse (programming language) ... find all
- 8 - <tl> - Belizean Spanish, Costa Rican Spanish, Guatemalan Spanish, Nicaraguan Spanish ... find all
- 8 - <mm> - LRC (file format) ... find all
- 8 - <lol> - Ala Boratyn, Before I'll Die..., Blog 27, LOL (Blog 27 album), Who I Am (Blog 27 song) ... find all
- 8 - <ll> - Languages of Argentina, Literary Welsh morphology, Lj (digraph), Paraguayan Spanish, Spanish verbs ... find all
- 8 - <is> - Gellish, Information model, Modeling language, Semantic data model ... find all
- 8 - <in> - Alexander Bogomazov, Ilocano numbers, Karl Spencer Lashley Award, Kathy Flores, Nikita Lobanov ... find all
- 7 - <z> - Dutch-language literature, General Chinese, Heteronym (linguistics), Leiden Willeram, New Mexican Spanish ... find all
- 7 - <yod> - Modern Standard Tibetan grammar ... find all
- 7 - <year> - AMD Radeon Software, Animecon (Netherlands), Constitutional Court of Korea, Date and time notation in Catalonia, Madras High Court ... find all
- 7 - <us> - HyTelnet, Pinnacle Valley, Yūki Kaneko ... find all
- 7 - <type> - Circuit ID, Common Platform Enumeration, SSHFP record, Toi (programming language) ... find all
- 7 - <red> - Modern Standard Tibetan grammar ... find all
- 7 - <random> - C++ Standard Library, Sality, Swen (computer worm), Voyager (computer worm) ... find all
- 7 - <personal> - Andrei Katkov, Doc Cheatham, George Air Force Base, Hangar Theatre, Jimmy Knepper ... find all
- 7 - <mark> - Dynamic time warping, List of Saturday Live (British TV series) episodes, Tumoral calcinosis ... find all
- 7 - <date> - Battle honour, Carus Publishing Company, Charles E. Fraser, Opera Dragonfly, Radiocarbon dating ... find all
Need debugging[edit]
- 19 - <pre> - Arena (web browser) (ASCII art breaking parsing?), Back-to-back user agent (ASCII art breaking parsing), BagIt, Call graph, Code folding ... find all
- (These look legit, probably a moss bug. Beland note to self: Run these on wikitext_util functions in an interactive window to find parse breakage.)
Notification of new dumps[edit]
"Most likely misspellings by articles" should always have work to do (if not, ping Beland to add more from the current dump). Some of the other sections are occasionally waiting for a new dump to get a useful list, either because they are ranked by frequency or a code change has been made to clean up noise in the next run. New runs are generally posted twice a month. The database snapshot from the first day of the month generally takes about 9-13 days to process, and the snapshot from the twentieth day of the month might take 4-6 days until it can be posted.
All that said, if you want to get a ping when results from a new dump are posted, you can add your name to the list below. If you are only interested in a particular section, include a note to that effect.
- (add your username to this list)
- Jake The Great!📞talk! 01:40, 18 December 2019 (UTC)
- Sun Creator(talk) 22:21, 19 November 2019 (UTC)
- Puddleglum2.0 (talk) 20:31, 13 October 2019 (UTC)
- Schazjmd (talk) 18:25, 21 December 2018 (UTC)
- bradleyagin (talk) 04:08, 12 January 2019 (UTC)
- Darylgolden(talk) Ping when replying 00:50, 11 February 2019 (UTC)
- MarkZusab (talk) 03:52, 15 February 2019 (UTC)
- Amiodarone talk 20:52, 2 April 2019 (UTC)
- Zojomars (talk) 17:48, 31 May 2019 (UTC)
- Anarhistička Maca (talk) 06:25, 30 June 2019 (UTC)
- Clovermoss (talk) 00:46, 27 October 2019 (UTC)
- JaAlDo (talk) 14:18, 11 March 2020 (UTC)
- Creativecreatr Creativecreatr (talk) 09:56, 26 May 2020 (UTC)
- Voidify (talk) 06:12, 9 June 2020 (UTC)
- Doghouse09 (talk) 20:52, 8 September 2020 (UTC)
- -- spazure (contribs) 09:24, 2 December 2020 (UTC)
- Idell (talk) 21:26, 23 October 2020 (UTC)
- -- *Fehufangą ♮ ✉ Talk page ♮ 12:16, 28 December 2020 (UTC)
moss source code[edit]
moss is written in Python, and is available on github at: https://github.com/cdbeland/moss
Data is obtained from XML database backup dumps.