Wikipedia:Typo Team/moss
This page has a backlog that requires the attention of willing editors. Please remove this notice when the backlog is cleared.
The moss project seeks to find and remove the furry green typos that have been growing on Wikipedia articles. It uses a Python script named moss, written by User:Beland, to automatically find misspellings, mistakes in English grammar, violations of the Wikipedia:Manual of Style, and confusing or broken wiki markup.
QUICK LINK TO THE BEST PAGE FOR NEW PARTICIPANTS
About misspellings
How the lists are made
The moss spell checker is run against a recent set of database dumps, which are generated on the 1st and 20th of every month (but take a few days to process). All the articles in the English Wikipedia are examined. The following are ignored:
- Text inside references, templates, tables, quotation marks, sections like "External links" and "Works", and some other weird places.
- Capitalized words (which are presumed to be correctly-spelled proper nouns)
- Words that appear in titles in the English Wiktionary (which has definitions of all words in all languages, excluding proper nouns and systematic words like chemical names and large numbers)
- Words that appear in titles in the English Wikipedia (which explains some things that don't appear in the dictionary)
- Words that appear in titles in Wikispecies (which has many technical words that don't appear in the dictionary or encyclopedia)
Many mistakes are not (yet) caught:
- Improper addition of 's (possessives are not added to Wiktionary, so these are excluded systematically)
- Incorrect capitalization
- Incorrect multi-word phrases
- Wrong word used in context
- Non-English language words not tagged with {{lang}} or where an English misspelling happens to be the same as a word in another language. (These are counted as correct spellings if they are in the English Wiktionary, which lists words in all languages – only the definitions are restricted to English.)
- Other situations listed in #False negatives below
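The filtering rules above can be pictured with a minimal Python sketch. This is not the actual moss code: the function name `candidate_typos`, the three title sets, and the tokenizing regular expression are assumptions made for illustration, and the sketch skips the step that strips references, templates, tables, and quotations.

```python
import re

# Hypothetical tokenizer: runs of letters, optionally joined by ' or -
WORD_RE = re.compile(r"[A-Za-z]+(?:['-][A-Za-z]+)*")


def candidate_typos(text, wiktionary_titles, wikipedia_titles, wikispecies_titles):
    """Return words that none of the ignore rules explain.

    Illustration only; the real moss script is more involved.
    """
    known = wiktionary_titles | wikipedia_titles | wikispecies_titles
    flagged = []
    for word in WORD_RE.findall(text):
        if word[0].isupper():
            continue  # capitalized: presumed to be a proper noun
        if word.endswith("'s"):
            continue  # possessives are excluded systematically
        if word in known:
            continue  # appears as a title in one of the three projects
        flagged.append(word)
    return flagged
```

Note that a check like this shares the blind spots listed above: a capitalized misspelling or a wrong-but-real word passes straight through.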
2023 statistics
See also: Older statistics
Dump (moss version) | Parse failures (articles + articles with MOS:STRAIGHT violations) | TOTAL (instances) | A | BC | BW | C | D | H | HB | HL | L | ME | N | P | T/ | T1 | TE | TF | TS | U | Z |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
2023-01-01 (c2370a5) | 161163 + 29891 | 1187870 | 10615 | 83981 | 534264 | 8233 | 0 | 1498 | 4601 | 110 | 1975 | 179206 | 1905 | 5 | 2229 | 41525 | 6115 | 198814 | 97810 | 1428 | 13556 |
2023-01-20 (36ce94e) | 161298 + 29949 | 1182833 | 10598 | 83813 | 534411 | 8235 | 0 | 1525 | 4965 | 116 | 1958 | 178578 | 1889 | 6 | 2196 | 38722 | 6055 | 198441 | 96321 | 1402 | 13602 |
2023-02-01 (90a97fc) | 161048 + 29944 | 1180485 | 10602 | 83842 | 534121 | 8245 | 0 | 1500 | 5011 | 111 | 1936 | 178163 | 1862 | 6 | 2183 | 38247 | 6050 | 197047 | 96542 | 1392 | 13625 |
2023-02-20 (f606b45) | 161111 + 30009 | 1180176 | 10609 | 83664 | 534782 | 8249 | 0 | 1509 | 5224 | 108 | 1930 | 177709 | 1861 | 4 | 2071 | 37810 | 5997 | 196478 | 97105 | 1383 | 13683 |
2023-03-01 (75cbca7) | 161224 + 30095 | 1179378 | 10613 | 83570 | 534792 | 8206 | 0 | 1510 | 5286 | 100 | 1918 | 177568 | 1860 | 5 | 2076 | 37445 | 5970 | 196360 | 97010 | 1382 | 13707 |
2023-03-20 (56a3811) | 161344 + 30169 | 1177045 | 10566 | 83245 | 535523 | 8214 | 0 | 1509 | 5202 | 99 | 1911 | 176955 | 1861 | 5 | 2092 | 36281 | 5811 | 196309 | 96321 | 1361 | 13780 |
2023-04-01 (no run) | |||||||||||||||||||||
2023-04-20 (57a4619) | 161810 + 30162 | 1178156 | 10577 | 83076 | 536215 | 8241 | 0 | 1541 | 5473 | 105 | 1904 | 175853 | 2043 | 5 | 2049 | 36561 | 5740 | 196528 | 96979 | 1370 | 13896 |
2023-05-01 (77de75d) | 162001 + 30150 | 1171871 | 10418 | 82887 | 536140 | 8170 | 0 | 1535 | 4633 | 98 | 1890 | 173066 | 2028 | 5 | 2050 | 36282 | 5781 | 195082 | 96960 | 1361 | 13485 |
2023-05-20 (73bb66d) | 162329 + 30138 | 1171817 | 10379 | 82480 | 536386 | 8161 | 0 | 1470 | 4913 | 88 | 1890 | 171905 | 2037 | 0 | 2064 | 36364 | 5817 | 195132 | 97814 | 1367 | 13550 |
2023-05-20 (d0a8560) | 163084 + 29893 | 1170266 | 10186 | 81955 | 529811 | 8192 | 0 | 1473 | 4902 | 89 | 1879 | 173759 | 2042 | 1 | 2064 | 38044 | 5842 | 194194 | 100920 | 1366 | 13547 |
2023-06-01 (040dd4d) | 163371 + 29818 | 1169150 | 10189 | 81451 | 529652 | 8200 | 0 | 1474 | 5163 | 90 | 1895 | 172815 | 2031 | 1 | 2052 | 37997 | 5827 | 193963 | 101375 | 1365 | 13610 |
2023-06-20 (50a82ce) | 163664 + 29771 | 1169732 | 10189 | 81086 | 529892 | 8232 | 0 | 1519 | 5624 | 86 | 1879 | 171891 | 2050 | 1 | 2059 | 38342 | 5785 | 194184 | 101817 | 1364 | 13732 |
2023-07-01 (8533535) | 163877 + 29747 | 1169420 | 10201 | 80978 | 529664 | 8242 | 0 | 1564 | 5806 | 83 | 1873 | 171484 | 2042 | 3 | 2061 | 38446 | 5814 | 193933 | 102073 | 1373 | 13780 |
2023-07-20 (9812c05) | 164115 + 29742 | 1170482 | 10174 | 80456 | 529875 | 8255 | 0 | 1553 | 5943 | 80 | 1872 | 171720 | 2036 | 3 | 2057 | 38956 | 5806 | 194057 | 102367 | 1361 | 13911 |
2023-08-01 (7468187) | 164308 + 29748 | 1170928 | 10136 | 80230 | 529739 | 8249 | 0 | 1549 | 6036 | 79 | 1873 | 171743 | 2037 | 5 | 2061 | 39182 | 5811 | 194411 | 102497 | 1351 | 13939 |
2023-08-20 (7170d29) | 164473 + 29635 | 1171932 | 10148 | 80137 | 529804 | 8263 | 0 | 1556 | 6132 | 80 | 1874 | 171627 | 2048 | 8 | 2062 | 39280 | 5856 | 194769 | 102930 | 1344 | 14014 |
Dump (moss version) | Parse failures (articles + articles with MOS:STRAIGHT violations) | TOTAL (instances) | A | BC | BW | C | D | H | HB | HL | L | ME | N | P | T+gcld3_broken | T/ | T1 | TS | U | Z |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
2023-09-01 (8c03bd1)* | 164600 + 29593 | 1173119 | 10135 | 80154 | 530301 | 8245 | 0 | 1567 | 5692 | 87 | 1875 | 171823 | 2061 | 9 | 200991 | 2057 | 39595 | 103147 | 1337 | 14043 |
2023-09-20 (8c03bd1)* | 164777 + 29611 | 1173098 | 10183 | 80123 | 530578 | 8240 | 0 | 1583 | 4775 | 85 | 1870 | 171711 | 2064 | 8 | 201138 | 2064 | 39874 | 103376 | 1339 | 14087 |
2023-10-01 (d531b95)* | 164779 + 29586 | 1173193 | 10164 | 80017 | 530906 | 8238 | 0 | 1577 | 4719 | 87 | 1860 | 171300 | 2061 | 9 | 201083 | 2047 | 39886 | 103784 | 1328 | 14127 |
2023-10-20 (9c53721)* | 164889 + 29667 | 1173548 | 10178 | 79977 | 531174 | 8243 | 138 | 1584 | 4762 | 87 | 1860 | 171070 | 2048 | 11 | 201277 | 2042 | 39910 | 103702 | 1323 | 14162 |
2023-11-01 (9c53721)* | 165069 + 29668 | 1174710 | 10164 | 79988 | 531412 | 8252 | 138 | 1577 | 4738 | 90 | 1844 | 171440 | 2033 | 11 | 201449 | 2059 | 40250 | 103724 | 1338 | 14203 |
2023-11-20 (1edb851)* | 165362 + 29748 | 1177078 | 10196 | 79995 | 531684 | 8262 | 138 | 1597 | 4859 | 93 | 1856 | 171957 | 2034 | 10 | 202060 | 2054 | 40847 | 103797 | 1323 | 14316 |
2023-12-01 (1edb851)* | 165429 + 29788 | 1179043 | 10208 | 79941 | 531789 | 8294 | 138 | 1610 | 4950 | 93 | 1867 | 172253 | 2028 | 12 | 202513 | 2056 | 41284 | 104336 | 1310 | 14361 |
2023-12-20 (1edb851)* | 165685 + 29862 | 1180181 | 10205 | 79762 | 531632 | 8362 | 138 | 1603 | 4895 | 103 | 1868 | 172415 | 2022 | 12 | 203189 | 2042 | 41499 | 104750 | 1301 | 14383 |
* Due to software issues, language detection wasn't working for this run.
2024 statistics
Dump (moss version) | Parse failures (articles + articles with MOS:STRAIGHT violations) | TOTAL (instances) | A | BC | BW | C | D | H | HB | HL | L | ME | N | P | T+gcld3_broken | T/ | T1 | TS | U | Z |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
2024-01-01 (1edb851)* | 165792 + 29766 | 1180781 | 10226 | 79927 | 531362 | 8352 | 0 | 1628 | 4917 | 100 | 1865 | 172474 | 2027 | 9 | 203478 | 2043 | 41749 | 104903 | 1301 | 14420 |
2024-01-20 (2caa23a)* | 165661 + 29837 | 1180491 | 10237 | 79493 | 531501 | 8345 | 0 | 1624 | 4127 | 103 | 1858 | 172622 | 2019 | 9 | 203838 | 2044 | 41878 | 105071 | 1298 | 14424 |
2024-02-01 (3242653)* | 165836 + 29834 | 1181230 | 10245 | 79246 | 531803 | 8337 | 0 | 1629 | 4120 | 103 | 1858 | 172799 | 2024 | 8 | 204049 | 2043 | 42002 | 105240 | 1287 | 14437 |
2024-02-20 (10d0c37)* | 165885 + 29901 | 1182750 | 10251 | 78915 | 531861 | 8343 | 1 | 1630 | 4043 | 114 | 1849 | 173461 | 2015 | 10 | 204251 | 2045 | 42357 | 105827 | 1286 | 14491 |
2024-03-01 (9ccfa0d)* | 166045 + 29975 | 1182428 | 10255 | 78805 | 531778 | 8362 | 0 | 1638 | 4041 | 112 | 1854 | 173370 | 2030 | 24 | 203994 | 2037 | 42461 | 105848 | 1299 | 14520 |
2024-03-20 (460959f)* | 166141 + 30055 | 1185611 | 10292 | 78621 | 532345 | 8424 | 0 | 1631 | 4237 | 116 | 1858 | 173672 | 2045 | 25 | 204545 | 2049 | 42870 | 106954 | 1278 | 14649 |
2024-04-01 (ce9f129)* | 166181 + 30054 | 1184405 | 10287 | 76464 | 533031 | 8419 | 0 | 1618 | 4309 | 114 | 1849 | 173577 | 2051 | 40 | 204408 | 2031 | 42961 | 107298 | 1258 | 14690 |
2024-04-20 (1ee7a35)* | 166362 + 30118 | 1177599 | 10275 | 67649 | 533534 | 8425 | 0 | 1617 | 4335 | 112 | 1848 | 173787 | 2063 | 40 | 204403 | 2012 | 43481 | 107996 | 1258 | 14764 |
2024-05-01 (6d3c9c7)* | 166292 + 30184 | 1175980 | 10277 | 66114 | 533831 | 8426 | 0 | 1643 | 4495 | 110 | 1845 | 173629 | 2064 | 1 | 204334 | 2020 | 43407 | 107675 | 1248 | 14861 |
2024-05-20 (489f6f1)*† | 144265 + 25968 | 1003795 | 8924 | 53789 | 453466 | 7619 | 0 | 1381 | 3715 | 90 | 1693 | 150497 | 1795 | 1 | 176951 | 1725 | 37151 | 92577 | 1120 | 11301 |
2024-06-01 (07eaceb)* | 166755 + 30248 | 1173354 | 10304 | 60088 | 534568 | 8460 | 0 | 1648 | 4461 | 105 | 2020 | 174740 | 2074 | 2 | 203514 | 1997 | 44495 | 108560 | 1241 | 15077 |
2024-06-20 (b1c7e7b)* | 166980 + 30276 | 1173538 | 10299 | 59845 | 534381 | 8444 | 0 | 1673 | 4501 | 102 | 1922 | 174948 | 2071 | 3 | 204346 | 2000 | 43905 | 108742 | 1227 | 15129 |
2024-07-01 (6787e3e)* | 167034 + 30300 | 1172833 | 10295 | 59766 | 533956 | 8440 | 0 | 1654 | 4345 | 101 | 1924 | 175086 | 2065 | 3 | 204357 | 1992 | 43915 | 108542 | 1227 | 15165 |
* Due to software issues, language detection wasn't working for this run.
† This run seems to have malfunctioned, possibly run on partial dumps.
Dump (moss version) | Parse failures (articles + articles with MOS:STRAIGHT violations) | TOTAL (instances) | A | BC | BW | C | D | H | HB | HL | L | ME | N | P | T/ | T1 | TE | TF | TS | U | Z |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
2024-07-20 (9c0d979)* | 167018 + 30354 | 1175268 | 10337 | 59894 | 533911 | 8455 | 0 | 1675 | 4304 | 102 | 1942 | 175528 | 1909 | 2 | 2015 | 44274 | 6018 | 199908 | 108530 | 1219 | 15245 |
2024-08-01 (027458a) | 167192 + 30364 | 1172497 | 10336 | 59874 | 533608 | 8473 | 0 | 1657 | 4315 | 100 | 1917 | 175240 | 1904 | 0 | 2011 | 43272 | 5990 | 199733 | 107535 | 1225 | 15307 |
2024-08-20 (a13c743) | 167561 + 30399 | 1170154 | 10336 | 59930 | 533732 | 8498 | 0 | 1661 | 4324 | 97 | 1911 | 174117 | 1902 | 1 | 2015 | 42363 | 5945 | 199740 | 106986 | 1224 | 15372 |
2024-09-01 (313f784) | 167769 + 30088 | 1169770 | 10346 | 60064 | 533615 | 8504 | 0 | 1652 | 4370 | 94 | 1916 | 173479 | 1894 | 0 | 2014 | 42271 | 5946 | 200037 | 106914 | 1223 | 15431 |
2024-09-20 (61a2a69) | 167769 + 30088 | 1170579 | 10346 | 60064 | 533615 | 8504 | 0 | 1652 | 5640 | 94 | 1915 | 173240 | 1894 | 0 | 2004 | 42244 | 5944 | 199857 | 106912 | 1223 | 15431 |
2024-10-01 (6afa51c) | 168227 + 30163 | 1174679 | 10337 | 60291 | 534111 | 8536 | 0 | 1648 | 8004 | 95 | 1942 | 173723 | 1892 | 1 | 2053 | 42304 | 5936 | 199891 | 107127 | 1235 | 15553 |
Typo classification legend
Reporting symbol | Explanation |
---|---|
Parse failure | Mismatched punctuation; spell checker is unsure which words to ignore, so the whole page is skipped |
A | mAth |
BC | Bad Characters (not allowed by Manual of Style) |
BW | Bad Words (not allowed by Manual of Style) |
C | Chemistry words |
D | DNA sequence |
H | HTML/XML/SGML tag |
HB | Known bad HTML tag, like <font> |
HL | Bad HTML-like linking, like <http://...> |
L | Probable Romanization (transLiteration) |
ME | Probable coMpound, English (with and without dash) - need to be added to Wiktionary |
N | A-Z plus numbers and hyphens |
P | Patterns (e.g. rhyme schemes - Beland fixes these) |
T/ | Suspected MOS:SLASH violation |
T1 | Edit distance 1 from common English word |
TE | AI thinks it's trying to be English |
TF | AI thinks it's trying to be a non-English language (Foreign to English Wikipedia), sorted by language (e.g. TF+el) |
TS | Missing or extra whitespace or dash (or new compound). Currently included if there is a period (TS+DOT), comma (TS+COMMA), or extra space (TS+EXTRA). Missing bracket (TS+BRACKET) needs code improvements to be reliable, and the remainder of TS need sorting. |
U | URL |
Z | Decimal fraction missing leading Zero |
I | Definitely not English (International) due to accents or mixed with punctuation (other than hyphen) |
MI | Probable coMpound, non-English (International) in English Wiktionary (both A-Z and non-ASCII characters, with and without dash) |
ML | Probable coMpound, transLiteration |
MW | Probable coMpound, found in non-English Wiktionary |
R | Regular word (A-Z only) not near a common English word |
T2 | Edit distance 2 from common English word |
T3 | Edit distance 3 from common English word |
W | Not in English Wiktionary, in non-English Wiktionary |
- red = Probably need to fix
- yellow = Unsorted - need code improvements to sort into likely vs. unlikely typos or subtypes that can be usefully processed.
- blue = Probably OK (but may need to verify)
- bold = actively working on fixing
- grey = no longer used
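As an illustration of two of the categories in the legend, the sketch below tags a token as Z (decimal fraction missing a leading zero) or T1 (edit distance 1 from a common English word). The function names, the regular expression, and the choice of plain Levenshtein distance (insertions, deletions, and substitutions, with no transpositions) are assumptions for this example; the legend does not say how moss computes these.

```python
import re

# Hypothetical pattern for "Z": a bare decimal fraction such as ".5"
DECIMAL_NO_ZERO = re.compile(r"^\.\d+$")


def within_edit_distance_one(a, b):
    """True if a and b differ by exactly one insertion, deletion, or substitution."""
    if a == b or abs(len(a) - len(b)) > 1:
        return False
    if len(a) > len(b):
        a, b = b, a  # make a the shorter (or equal-length) string
    i = 0
    while i < len(a) and a[i] == b[i]:
        i += 1  # skip the common prefix
    if len(a) == len(b):
        return a[i + 1:] == b[i + 1:]  # one substitution
    return a[i:] == b[i + 1:]  # one extra character in b


def classify(token, common_words):
    """Return "Z", "T1", or None for a token (illustrative subset of the legend)."""
    if DECIMAL_NO_ZERO.match(token):
        return "Z"
    if any(within_edit_distance_one(token, w) for w in common_words):
        return "T1"
    return None
```

Under this definition a swapped pair of letters (such as "teh" for "the") counts as distance 2, so it would not land in T1.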
Instructions for editors
As with a regular spell checker, a highlighted word is sometimes a real misspelling that should be changed, and sometimes a correct spelling that needs to be added to the spell checker's dictionary (which in this case is the English Wiktionary and Wikispecies). For the lists below, here's how you can help:
- For spelling mistakes: Click on the links to the individual Wikipedia articles, and edit them to correct the misspelling. Make sure this is actually a misspelling, and not a technical term that needs to be better explained, or an alternate spelling (possibly from a different regional variety of English).
- For non-English words (including words from Old English and Middle English, since they are pronounced differently): Edit the article and use the {{lang}} or {{transl}} templates to mark all non-English passages. Template contents are ignored, so they will not show up in the next report. If you can define the word, it would still be helpful to add the non-English word to the English Wiktionary or the same-language Wiktionary if you speak that language. As of the March 20, 2019 dump, only words not found in any Wiktionary are reported by moss as misspellings. (The "home" Wiktionary for Old and Middle English words is the modern English one.)
- For Early Modern English spellings, use {{lang|en-emodeng}}.
- For languages that don't have an ISO 639 code (often happens with historical languages), you can use an IETF language tag instead. Failing that, use the miscellaneous code "mis" and add an HTML comment indicating the language. For example: {{lang|mis|sharbe do kin ratz}}<!-- Old Runish -->
- For incorrect spellings in direct quotes:
- These shouldn't be picked up by the spell checker, as text in double quotes ("") is ignored. The article probably has incorrect punctuation.
- Regardless of punctuation problems, you can add {{sic}} around the word or phrase. See Wikipedia:Manual of Style#Quotations for guidance.
- For correct spellings that belong in the dictionary: Click on the word to add it to the English Wiktionary. Remember the word might not be English (though the definition must be) and be sure to check capitalization!
- For correct spellings already in the dictionary: Delete from the list. These have been added by other editors since the database dump. They do not automatically turn red as internal Wikipedia links do.
- For correct spellings not appropriate for Wiktionary:
- For complicated chemical names:
- If there is an article about this chemical, it's best to make a redirect. You may want to tag it {{R from systematic name}} or {{R from technical name}} if appropriate.
- If there is no Wikipedia article, you can use {{chem name}}; for example:
- {{chem name|poly(1-phenylethene)}}
This should not be used for chemical formulas such as H2O, for which {{H2O}} or {{chem2}} may be appropriate. For some common compounds there are specific templates available such as Template:CO2.
- For DNA sequences, add {{DNA sequence}} around it.
- For species, add the whole name to Wikispecies:Wikispecies:Requested articles#From_Wikipedia and it will be suppressed from future runs.
- For proper nouns (including non-English titles) that aren't capitalized, put them inside a {{proper name}} tag.
- Use <code></code> or similar tags for computer programs; see Wikipedia:WikiProject_Computer_science/Manual_of_style#Code_samples.
- For terms that are only relevant to one Wikipedia article (and for which the article makes clear the definition) consider creating a redirect to the article. As long as the "typo" word is in the title (as a whole word), it won't show up as a mistake in future spell checks.
- {{IPA}} or {{respell}} can be used for word pronunciations. See Wikipedia:Manual of Style/Pronunciation for details.
- For bird calls: Treat these as foreign-language words or words-as-words and put them in italics, following MOS:ITALICS. Put the call inside {{not a typo}} so it won't show up on moss spell check reports. (It doesn't matter if the double apostrophes that make the italics go inside or outside the template.)
- Anything else, add {{not a typo}} around it (for example, nonsense series of letters used as examples in puzzles).
- Whether the word turned out to be correct or incorrect, delete its entry from the lists on this page (or subpages) when finished, so work won't be duplicated. (There is no longer any need for strikethrough.)
- If an article or section has generally bad grammar, and you don't have time to fix the whole thing, just add {{copyedit}} at the top of the article or {{copyedit|section}} at the top of the affected section. If it's just a sentence or two, {{copy edit inline}} or {{incomprehensible inline}} can go at the end of the problem passage.
- If you see errors being reported from footnotes or bibliographies, check to make sure the section is titled with a standard name following MOS:APPENDIX conventions. Standard end-matter sections like "References", "Further reading", and "Works" are ignored.
- If it helps to leave a message on the article's talk page asking if the word is correct or incorrect, you can use Template:Typo help like this when editing the bottom of the talk page (leave the section header blank; it will automatically be added):
- {{subst:typo help|PUT WORD HERE}} -- ~~~~
Don't worry if you miss something; it will reappear in a future report if there are still mistakes.
Suggested edit summaries
If you want to help publicize this project, you can copy-and-paste these into your edit summary, if appropriate.
For Wikipedia edits:
- Fix misspelling found by [[Wikipedia:Typo Team/moss]] – you can help!
- Tag non-English text found by [[Wikipedia:Typo Team/moss]] – you can help!
- Tag correct text as {{not a typo}} for automated spell checkers (including [[Wikipedia:Typo Team/moss]])
- Fix mismatched quote marks found by [[Wikipedia:Typo Team/moss]] – you can help!
For Wiktionary edits:
- Add word identified by [[w:Wikipedia:Typo Team/moss]] – you can help!
Wiktionary cheat sheet
Need to add a word to Wiktionary? The Wiktionary cheat sheet has copy-and-paste templates that make it easy for the types of words commonly encountered here, even if you've never done it before.
Misspellings - lists of things to fix
Likely misspellings by article (main listing)
This is the most efficient list to work on if all you want to do is fix misspellings. These listings try to show all the typos from a given article, so they can be fixed all at once, and try to show only typos that legitimately need fixing. They are not perfect, so a few of the words found need to be added to Wiktionary or tagged as not English, not a typo, etc. Only a few letters are updated on each run, to avoid stale listings, as the whole list takes far longer than two weeks to work through. (This also avoids duplicating recent work when listings are refreshed.)
See subpages due to length:
- Wikipedia:Typo Team/moss/before A - Completed 2022-06-20 dump
- Wikipedia:Typo Team/moss/A - 2022-07-20 dump partly completed, 2020-05-20 dump completed except case notes
- Wikipedia:Typo Team/moss/B - Completed 2022-12-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/C - Typos from 2023-02-01 dump partially fixed
- Wikipedia:Typo Team/moss/D - Typos ready for fixing from 2024-07-01 dump
- Wikipedia:Typo Team/moss/E - Completed 2020-08-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/F - Completed 2020-09-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/G - Completed 2020-09-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/H - Completed 2020-10-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/I - Completed 2020-10-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/J - Completed 2021-01-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/K - Completed 2021-02-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/L - Completed 2021-03-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/M - Completed 2021-04-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/N - Completed 2021-05-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/O - Completed 2021-05-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/P - Completed 2021-05-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/Q - Completed 2021-06-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/R - Completed 2021-06-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/S - Completed 2021-06-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/T - Completed 2021-09-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/U - Completed 2021-10-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/V - Completed 2021-11-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/W - Completed 2021-12-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/X - Completed 2022-03-01, currently empty
- Wikipedia:Typo Team/moss/Y - Completed 2022-05-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/Z - Completed 2022-06-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/after Z - Completed 2022-06-01 dump (except for cases that need tagging or investigation)
Notes:
- For more cases that require investigation, see Category:Articles with unidentified words.
- Due to length and an increased number of false positives, typo reports for dumps 2020-05-20 and later don't include T2+, T3+, and TS+BRACKET+.
Possible typos by length
(Updated from 2022-12-20 dump.)
Longest or shortest in certain categories are shown, sometimes just for fun and sometimes because they form a useful group. Feel free to delete articles that are fixed or tagged.
Likely chemistry words
These need to be checked by a chemist and marked as {{chem name}}.
- 221 - wikt:dodecahydro-1h,4h,14h,17h-2,16:3,15-dimethano-5h,6h,7h,8h,9h,10h,11h,12h,13h,18h,19h,20h,21h,22h,23h,24h,25h,26h-2,3,4a,5a,6a,7a,8a,9a,10a,11a,12a,13a,15,16,17a,18a,19a,20a,21a,22a,23a,24a,25a,26a-tetracosaazabispentaleno - Cucurbituril
- 79 - wikt:d-1,2,3,9,10,10a-hexahydro-6-methoxy-11-methyl-4h-10,4a-iminoethano-phenanthren - Controlled Drugs and Substances Act
- 73 - wikt:d-1,2,3,9,10,10a-hexahydro-11-methyl-4h-10,4a-iminoethanophenanthren-6-ol - Controlled Drugs and Substances Act
- 72 - wikt:l-11-allyl-1,2,3,9,10,10a-hexahydro-4h-10,4a-iminoethanophenanthren-6-ol - Controlled Drugs and Substances Act
- 65 - wikt:dl-1,2-anhydro-4,5-o-cyclohexylidene-1,2,3/4,5-cyclopentanepentol - 1,2,3,4,5-Cyclopentanepentol
- 63 - wikt:uridine-5'-diphospho-n-acetyl-2-amino-2-deoxy-3-o-lactylglucose - UDP-N-acetylmuramate dehydrogenase
- 59 - wikt:octachloro-3a,4,7,7a-tetrahydro-4,7-methanoindene-1,8-dione - Hexachlorocyclopentadiene
- 59 - wikt:dihydroxy-21-oxa-21-chloromethylpregna-1,4-diene-3,20-dione - List of corticosteroids
- 59 - wikt:cis-5,6-dihydroxy-4-isopropylcyclohexa-1,3-dienecarboxylate - 2,3-dihydroxy-2,3-dihydro-p-cumate dehydrogenase
- 58 - wikt:decahydro-10-methoxy-3,6,9-trimethyl-3,12-epoxy-12h-pyrano - Artemether
- 56 - wikt:d-glucopyranosyl-3,23-dihydroxycucurbita-5,24-dien-19-al - Momordicine
- 54 - wikt:s-adenosyl-l-methionine:3-hexaprenyl-4,5-dihydroxylate - Hexaprenyldihydroxybenzoate methyltransferase
- 54 - wikt:cis-11,12-dichloro-9,10-dihydro-9,10-ethano-2-anthroic - Field effect (chemistry)
- 52 - wikt:dipentalene-1,4,6,8,10,12,14,17,19,21,23,25-dodecone - Cucurbituril
- 51 - wikt:n-cyclobutylmethyl-4,5-epoxy-morphinan-3,6,14-triol - Controlled Drugs and Substances Act
- 51 - wikt:alpha-amino-3-hydroxy-5-methyl-4-isoxazolpropionate - GRIA4
- 49 - wikt:dodecahydro-6a,10b-dimethyl-4,10-dioxo-2h-naphtho - Controlled Drugs and Substances Act
- 48 - wikt:octahydro-1,3,5,7-tetraza-2,4,6,8-tetraborocines - Iminoborane
- 48 - wikt:octahydro-1,3,5,7-tetranitro-1,3,5,7-tetrazocine - Aluminium(I) oxide, HMX
- 48 - wikt:dimethoxy-4-methyl-3-oxo-3,4-dihydroquinoxalinyl - Yessotoxin
Chemical formulas
(Updated from 2023-05-20 dump.)
Chemical formulas should be written with HTML subscripts or {{chem2}}; these listings identify those that incorrectly just use regular numbers.
Chemical formulas that use Unicode subscripts (which is against MOS:SUBSCRIPT) will be detected automatically by moss_entity_check.py.
Chemical formulas that use <sub>...</sub> are allowed by MOS:CHEM, but may show up in the main typo listings above. They can be converted to use {{chem2}} to be accepted by the spell checker, and {{chem2}} is also the way to fix listings of partial formulas.
Any "possible" listings that aren't chemical formulas can be cleared from this list by adding a redirect to an appropriate target (like Dy4 Systems). Most "known" listings that aren't chemical formulas can be fixed with {{proper name}}.
Redirects added for strings that are chemical formulas should be added to Category:Chemical formulas.
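To make concrete what "incorrectly just use regular numbers" looks like, here is a minimal Python sketch of a pattern check. It is an assumption for illustration, not how moss builds these listings: the name `looks_like_unsubscripted_formula` is hypothetical, and the pattern only checks for capitalized one- or two-letter symbols followed by plain digits, without validating them against the periodic table.

```python
import re

# Hypothetical pattern: element-like symbols (e.g. "Fe", "O"), each with an optional count
FORMULA_LIKE = re.compile(r"(?:[A-Z][a-z]?\d*)+")


def looks_like_unsubscripted_formula(token):
    """True if token resembles a formula written with plain digits, e.g. "V2O7"."""
    has_digit = any(ch.isdigit() for ch in token)
    return has_digit and FORMULA_LIKE.fullmatch(token) is not None
```

Because the symbols are not validated, strings such as "V3R6" also match, which is why listings of this kind contain "possible" entries that turn out not to be chemical formulas.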
Most chemical articles
Articles with a large number of chemical formulas triggering the spell checker are listed here (manual check on 2022-06-20 dump; counts include potential typos other than formulas, mostly compound names):
- 87 - Nickel compounds
- 28 - Nickel double salts
- 28 - Cerium nitrates
Possible chemical formulas that don't use subscripts
Note: These are easier to find by searching with "insource://", for example: insource:/Si6Al2/. -- Beland (talk) 02:32, 27 December 2022 (UTC)
- 11/6 - Ge9
- 10/2 - N62B44
- 7/6 - V2O7
- 7/5 - Ac2S3
- 7/1 - B3R2
- 6/6 - Cu5
- 6/5 - Ti3O5
- 6/5 - S50B32
- 6/5 - Bi2O2
- 6/5 - Al63Cu24Fe13
- 6/3 - Pr2C6H3
- 6/3 - H3R17
- 6/2 - Mn12O12
- 6/2 - Ga2I3
- 6/2 - C6R6
- 5/5 - Si9O27
- 5/5 - Pb9
- 5/5 - No17
- 5/5 - H3K18
- 5/5 - Fe5Si3
- 5/5 - Fe2O4
- 5/5 - B18B4
- 5/4 - Zr4
- 5/4 - S6K2
- 5/4 - Mo6S8
- 5/4 - Fe4S3
- 5/3 - V3R6 - version 3 release 6?
- 5/3 - Pu2O3
- 5/3 - K3V2
- 5/3 - H3R26
- 5/3 - Cf2O3
- 5/2 - Np2O5
- 5/2 - N62B48
- 5/2 - Mn5Si3
- 5/2 - Lv5
- 5/2 - B12C3
- 5/1 - Si4O13
- 5/1 - Np2S3
- 5/1 - B12Cl11
- 4/4 - Ti22
- 4/4 - Si4O10
- 4/4 - Sb3O6
- 4/4 - No16
- 4/4 - Kr2
- 4/4 - I4O9
- 4/4 - H4R3
- 4/4 - Gd3Ga5O12
- 4/4 - Ga2Cl4
- 4/4 - C6H5O7
- 4/4 - C6H3Cl2
- 4/4 - C2B2
- 4/4 - C16H33
- 4/4 - Bi4Ti3O12
- 4/4 - Au75Si25
- 4/4 - Al2Si2
- 4/3 - W18O49
- 4/3 - Tc3Cl9
- 4/3 - R2B2
- 4/3 - Pb10
- 4/3 - No11
- 4/3 - Ni6
- 4/3 - H3R8
- 4/3 - Ca3Al2
- 4/3 - C5H3
- 4/3 - C2B7H13
- 4/3 - B6H10
- 4/3 - B18C4
- 4/2 - R2P2
- 4/2 - Ni31Si12
- 4/2 - H4K8
- 4/2 - Cu4O3
- 4/2 - Cr7C3
- 4/2 - B5O6
- 4/1 - Ti4N3
- 4/1 - Ta5N6
- 4/1 - Ta2Cl6
- 4/1 - Sm2Co17
- 4/1 - O2C6Cl4
- 4/1 - Np3S5
- 4/1 - Mg3Si2O5
- 4/1 - Lv8
- 4/1 - Ho5
- 4/1 - H4H2
- 4/1 - Ga2I4
- 4/1 - Cr2Ge2Te6
- 4/1 - C6S4
- 4/1 - C50H10
- 4/1 - C2P2
- 4/1 - Ag6
- 3/3 - V4R4 - version 4, release 4?
- 3/3 - V4R3 - version 4, release 3?
- 3/3 - Th4
Known chemical formulas that don't use subscripts
H2O
- 295 - H2O - 2022 F1 Powerboat World Championship, 252P/LINEAR, America's Funniest Home Videos, Ammonium carbonate, Andrew Mwangura, BAF agar, Blues Brothers 2000 (video game), Bruno Crastes, C/1979 Y1 (Bradfield), Calcium caseinate, Calerway, Cambodge Soir, Cape Cod Regional Transit Authority, Carnot battery, Cedar Point, Cedar Point Shores, Centennial School District (Minnesota), Chadwick (surname), Chaparral Boats, Chris Cox (DJ), Clarendon Entertainment, Club Filter Melbourne, Col Pearse, Colby Corino, College of Engineering, Trivandrum, Cricoid pressure, Cutting Class, Cyril Hanouna, DJ Symphony, Dadoo, Dalpalan, Danny Havoc, Darius Rucker, Dean Buntrock, Decora (rapper), Delirious, Dendral, Design U, Dicky Barrett, Disodium hydrogen phosphite, DloHaiti, Don't Forget Your Roots, Downtown: A Street Tale, E Ink, EGG, the Arts Show, Easton Corbin, Edwin Decena, Electricity distribution companies by country, Elephant Walk (Texas A&M), Emma Marrone, Estadio Venustiano Carranza, Evergreen Aviation & Space Museum, Every Witch Way, Expo 86, FEFLOW, FTTW, Fabio Golombek, Felipe Ehrenberg, Fibrecity Holdings, Fishdom, Fishdom H2O: Hidden Odyssey, Fly International Luxurious Art, Formula 1 Powerboat World Championship, Fountain, Colorado, Freedom-class cruise ship, Gary White (engineer), Gay bathhouses in the United Kingdom, Genevieve Chappell, Gisela Pulido, Global H2O, Global Inheritance, Glossary of environmental science, Go, Gregory Iron, Guilty or Innocent of Using the N Word, H2O (1929 film), H2O (American band), H2O (Puerto Rican band), H2O (Scottish band), H2O (disambiguation), H2O (web server), H2O Africa Foundation, H2O Audio, H2O Networks, H2O Purépechas F.C., H2onews, HaMerotz LaMillion 8, Hamilton-Oshawa Port Authority, Hanging Lake, Hangzhou Bay Sunac Tourism City, Hare and Loathing in Las Vegas, Harwich, Massachusetts, Hatebreed, Hazen Street, Heavy Water Board, Hematite, Henri Coquand, High Times Medical Cannabis Cup, History of neuroimaging, Homeboyz RFC, Hubert Bonnet, Hyannis Transportation Center, Hydronephrosis, Hypofrontality, Ieperfest, Ignite Film Festival, Indang, Infinity Falls, Isolation forest, Italian hip hop, Jackson Heywood, James Moll, Joey Yung, John Sinkankas, John Vijay, Johnie All Stars, Jonathan Zittrain, Josh Thompson (singer), Jukki Hanada, K38 Water Safety, KDXX, KNIME, KTBZ-FM, Kaput and Zösky, Karlovačko live 2011., Kasba, Kolkata, Katie Spotz, Kavita Krishnamurti, Kelowna, Kevin Lin, Kill Your Idols, Kim McCarty, Kobo Aura HD, Kobo Inc., Kommando Spezialkräfte Marine, Kris McGaha, LGBT culture in Puerto Vallarta, LaToya London, Laird Hamilton, Lam Sheung Yee, Larry Tanz, Laryngeal tube, Leland Wilkinson, Leptomeningeal cancer, Leslie Law, Liga TDP, List of Brad Paisley concert tours, List of Doctor Who universe creatures and aliens, List of EMI labels, List of Marks & Spencer brands, List of United Arab Emirates–related topics, List of Wu-Tang Clan affiliates, List of aircraft (Ms), List of former Def Jam Recordings artists, List of individual cetaceans, List of inorganic compounds, List of minerals named after people, List of spa towns, List of tourist attractions in Greater Orlando, List of water parks in Europe, List of water parks in the Americas, Livid (festival), Liz Katz, Lost Jewlry, Lovefest, Luciano Pagliarini, Luis Gueilburt, Luis Szarán, Lunar Trailblazer, M. Shadows, MMN medium, MSL Aero H80, Manila Ocean Park, Maradu apartments demolition order, Margaritaville Resort Orlando, Market America, Martha Diaz, Martin Schoell, Martí Guixé, Matt Damon, Matt Tremont, McMullan, Miao Xiaochun, Micellar solution, Michael J. Dennis, Mickie Knuckles, Molecularium Project, Music of Benin, Music of Mozambique, My Curse, My Oh My (Aqua song), NEXEN (platform), NGC 2782, Nano-scaffold, Naren Gupta, National Institute of Statistical Sciences, Nations Cup, Natural kind, Nature, Necessity of identity, Neil Voss, Neodymium acetylacetonate, Netmage, New England Metal and Hardcore Festival, Next-Generation Transit Survey, Nikolai Volkoff, Nina Kovacheva, Noah's Ark Water Park, Norwegian Star, O2, Oasis-class cruise ship, Oberkorn, Odd Rods, Odin-OSIRIS, Okanagan Mission, British Columbia, Olga Liashenko, Only Built 4 Cuban Linx... Pt. II, Ontario Place, Opposite of H2O, Otoha, Outline of agriculture, PLA2G6, PLCE1, PLGA, Paranormal Action Squad, Parrot SA, Paul Florian, Paul Freud, Pechet, Pectinase, Peter Rauhofer, Peugeot, Platfora, Polin Waterparks, Premier Parks, LLC, Rainy Lake, Randers FC, Rapsody, Rapzilla, Rasassination, Rebecca Alban Hoffberger, Recycled Orchestra of Cateura, Reincarnation (band), Richard Farleigh, Rickey Shane Page, Royal Caribbean International, Rubicon Riders, Sabine Pigalle, Safeway, Scaled Composites Proteus, Seasonal flows on warm Martian slopes, Shamako Noble, Shane Dollar, Shane Douglas, Shaolin vs. Wu-Tang, Sharlyn Sarac, Sho Lee, Siempre en Domingo, So Get Up, Solemn on Stage, Something About You (Joey Yung album), Steal This Record, Stephen P. Boyd, Stéphane Courbit, Suniel Shetty, Tarsilinha, Tetrisphere, The Anthology of Swiss Legal Culture, The British Horse Society Equestrian Hall of Fame, The Lex Diamond Story, The New Tetris, The Wild (Raekwon album), Tristar and Red Sector Incorporated, Tulsa, Oklahoma, Universal Orlando, University of California Center for Hydrologic Modeling, Urban Punk, Use Your Voice, Valentin Stefanoff, Van Full of Pakistans, Vanadium(II) iodide, Voyages Indigenous Tourism Australia, Watchung Hills Regional High School, Water Country USA, Water miscible oil paint, Water supply and sanitation in Ghana, WaterPartners, Waterproof audio player, Weka (software), Wet 'n Wild Orlando, Wild League (water polo), Won (As Friends Rust album), X Factor Adria (series 1), Y'all So Stupid, Your Hand In Mine
CO2
- 189 - CO2 - 2022 in climate change, 8 Bishopsgate, ABO Wind, Aker Solutions, Alfredo Aceto, Algorithmic skeleton, Alprose, Ammonium carbonate, Angela Busheska, Angela K. Wilson, Atmospheric methane, Atomic vapor laser isotope separation, Baruch Sterman, Becker, Minnesota, Beirut River, Berwyn, Illinois, Bhilai Steel Plant, Biochar, Biosensor, Biotin carboxyl carrier protein, Blue Yonder, Blue carbon, Brunner Island Steam Electric Station, C/1979 Y1 (Bradfield), COGIX, Carbamino, Carbon Connect Delta Program, Carbon capture and utilization, Carbon dioxide in Earth's atmosphere, Carbon dioxide removal, Carbon offsets and credits, Carrying capacity, Central heating, Charles Koch, Chicago Convention on International Civil Aviation, China–Pakistan Economic Corridor, Circular economy, Citroën C3 Picasso, City of Unley, Clean Energy Finance Corporation, Climate change denial, Climate change in Botswana, Climate change in Brunei, Climate change in Europe, Climate change in Greece, Climate change in Japan, Climate change in Poland, Climate change in Spain, Climate change in Vietnam, Climate change mitigation, Cold pad batch, Comet Bennett, Complement component 2, Coral reefs of Jamaica, Cupriavidus necator, Cyperus helferi, Cytochrome c oxidase subunit 2, Cytochrome c oxidase subunit I, Cytochrome c oxidase subunit III, Cádiz Bay tram-train, Daniel Mancinelli, Dark October, Deforestation and climate change, Deforestation of the Amazon rainforest, Delage, Digital ecology, Direct TPMS, District of Columbia v. Exxon Mobil Corp, E-ferry Ellen, EOS SAT-1, Economy of China, Embryo culture, Emissions Reduction Currency System, Emissions trading, Energy Recovery, Energy in Croatia, Environment of Indonesia, Environmental degradation, Environmental effects of aviation, Environmental effects of transport, Environmental impact of nuclear power, Environmental justice, Enys Men, ExxonMobil, ExxonMobil climate change denial, Fatemeh Shayan, Fermi 1, Foliar feeding, Food waste in Barcelona, Freediving, French press, Funso Ojo, Gastric bypass surgery, Giusto Manetti Battiloro, Global dimming, Government Engineering College, Ajmer, Grabovica, Gornji Milanovac, Green vehicle, Greenhouse gas emissions by China, Greenhouse gas emissions by the United States, Greenpeace, Grid energy storage, Hagerty (insurance), Heliciculture, Hemiscyllium, Hempcrete, History of Ford Motor Company, Hornwort, Houston, Houston, Do You Read?, Hydrogen production, Hyperthermal event, IPCC Sixth Assessment Report, IPhone 12 Pro, John Clauser, John Workman, Jorge Lankenau, Julian Schroeder, Jumilla, KRAY-FM, Kakono Hydroelectric Power Station, Kava culture, Ken-ichi Ueda, Knutsen NYK Offshore Tankers, Laka Competition, Laser-heated pedestal growth, Leaf, List of aircraft engines, List of female scientists before the 20th century, Lliuya v RWE AG, MUD Jeans, Maebashi Station, Marist College, Athlone, Martin Schoell, Milieudefensie v Royal Dutch Shell, Ministry of Transport (New Zealand), Mitchel P. Goldman, Mitsui Engineering & Shipbuilding, Mutanda mine, Nadine Laporte, NamX HUV, Neubauer v Germany, Newcastle, KwaZulu-Natal, Nickel tetracarbonyl, Nike, Inc., Nuclear microreactor, Oleksiy Ryabchyn, Orbital O2, Organic farming in New Zealand, PKN Orlen, PLGA, Penman–Monteith equation, Personal flotation device, Peugeot, Photoheterotroph, Plastisphere, Pre-Bötzinger complex, Pyrolobus fumarii, Radiation damage, Rapid transit, Renee Salas, Renewable energy in the European Union, Resuscitative endovascular balloon occlusion of the aorta, Robert L. McGinnis, Robot combat, Rosebank oil and gas field, Royal Enfield Hunter 350, Rybnik Power Station, Sclerophyll, Silicone oil, Sinezona singeri, Sinopec, Skeleton Technologies, Sofidel Group, Sulfur metabolism, Tallgrass Energy Partners, Thacker Pass Lithium Mine, Thales Alenia Space, Thomas Siebel, Timeline of transportation technology, Transition metal carbonate and bicarbonate complexes, Vistara, Volkswagen Up, Vulcan Energy Resources, WIMEX Group, Walter Cunningham, Wells Fargo, William Federspiel, World Future Energy Summit, World of Color
CS2
(Mostly not carbon disulfide.)
- 98 - CS2 - 2016–17 Radivoj Korać Cup, 8250 UART, A11 road (England), Adobe Audition, Adobe Bridge, Adobe Creative Suite, Adobe GoLive, Adobe Illustrator, Adobe ImageReady, Adobe InCopy, Adobe InDesign, Adobe PageMaker, Adobe Photoshop, Adobe Premiere Pro, Adobe Streamline, Bank state branch, Baseball scorekeeping, Blocking effect, Bow Interchange, Bow, London, Bowen Basin, Bowen Basin Coalfields, CB military symbol, CS gas, CS2 (disambiguation), Carbir Race Cars, Carnegie stages, Ciphertext stealing, Classical conditioning, Clean Sky, Code page, Counter-Strike: Global Offensive Major Championships, DCF77, David Revoy, Deep Fear, Denys Ovenden, Direct debit, Distortion (optics), Elliot Koffman, Extensible Metadata Platform, Fiat 124 Sport Spider, Financial system in Australia, Foton Motor, Gay bathhouses in the United Kingdom, Honda Accord (North America eighth generation), Honorverse, Institute and Faculty of Actuaries, Jorge Pescara, K. A. Rahman, KOI character encodings, Lee Kee Group, Linux range of use, List of BMP-1 variants, List of George Franklin Barber works, List of Sonic Team games, List of computer technology code names, List of international submarine communications cables, Lois Marshall, Mile End, Mile End Stadium, Multi-exposure HDR capture, Märklin Digital, Nathan Manufacturing, National Cycle Route 1, Perspective cloning, Perspective control, Plaistow, Newham, R.K. Malhotra, RSK Group, Raw image format, Sakura Wars, Sakura Wars (1996 video game), Sakura Wars (2019 video game), Second-order conditioning, Sega, Sega AM1, Sega Sports R&D, Sensory preconditioning, Sexual and reproductive health, Sigma SD10, Slicing (interface design), Sonic Team, South Australian Aviation Museum, Space monkey, Stratford, London, Ted Alspach, Telecommunications in the Solomon Islands, Telemundo, The Meek, Theme (magazine), Transport in Leeds, USS Detroit (AOE-4), Whitechapel, Whitechapel Road, Whitechapel Road market, Wine (software), XTAR, Zone System
C2H2 zinc finger weirdness
These might be better written as Cys2His2; see Zinc finger#Classes. -- Beland (talk) 01:16, 18 June 2022 (UTC)
- 77 - C2H2 - AN1 zinc finger, ARID2, BCL11A, BCL11B, BEN domain, C1orf94, C2H2, CCDC109B, CCDC180, CGGBP1, EGR2, EGR3, FAM155B, FAM76A, FLYWCH zinc finger, GLI2, GLI3, HIVEP1, HIVEP2, HIVEP3, Holometabolism, KLF1, KLF13, KLF4, KLF9, Kruppel-like factors, Krüppel, Krüppel associated box, Octopus, PLAGL1, PRDM9, Plant–fungus horizontal gene transfer, SCAPER, SNAI2, Sequence motif, Superman (gene), TSHZ1, Vascular endothelial zinc finger 1, WRKY transcription factor, WWC2, ZFP91, ZFX, ZHX1, ZHX2, ZHX3, ZIC1, ZIC2, ZIC3, ZIC5, ZNF10, ZNF16, ZNF184, ZNF238, ZNF274, ZNF300, ZNF337, ZNF384, ZNF423, ZNF43, ZNF548, ZNF821, ZNF837, ZXDC, Zinc Finger Protein 800, Zinc finger, Zinc finger and BTB domain-containing protein 16, Zinc finger protein 107, Zinc finger protein 142, Zinc finger protein 197, Zinc finger protein 2, Zinc finger protein 208, Zinc finger protein 226, Zinc finger protein 334, Zinc finger protein 398, Zinc finger protein 510, Zinc finger protein 516, Zinc finger protein 674
Remainder
- 14 - CH4 - AMX-40, Atmospheric methane, CH4 (disambiguation), Cashis, Climate change in Botswana, Climate change in Greece, Complement component 1q, ETP-1, Flame treatment, Immunoglobulin C2-set domain, Martin Schoell, POKEY, Tanox, The Coca-Cola Company
- 12 - PH3 - 2009 Queen's Birthday Honours (Australia), APEX system, Gatwick Aviation Museum, Honda B20A engine, Idiopathic pulmonary haemosiderosis, Primary hyperoxaluria, The Legend of Heroes: Trails of Cold Steel III, The Legend of Heroes: Trails of Cold Steel IV, The Legend of Nayuta: Boundless Trails, Warner Music Sweden, Way Out West (festival), Ys IX: Monstrum Nox
- 11 - CF3 - 5-TFM-DMT, Alfa Romeo GTV and Spider, Alfa Romeo Twin Spark engine, Animal echolocation, Aryne, Butyrivibrio, Honda F engine, List of Mullard–Philips vacuum tubes, Memory card, Royal Army Chaplains' Department, Soldier Soldier
- 10 - CN4 - Automobile Dacia, Centenari, Centenari Mac3, Dacia 1310, Dacia 1325, ETD Bridge over Green River, ETR Big Island Bridge, Lucchini Engineering, Saint-Gervais–Vallorcine railway, The Red Turtle
- 10 - K2S - Architecture of Finland, Bizarre Ride II the Pharcyde, Dopravný podnik Bratislava, E. Power Biggs, Fazer, Gajin Fujita, Kamppi Chapel, Polysulfide–bromide battery, Prime (graffiti artist), Trams in Bratislava
- 9 - C3H - Bruce Beutler, C2C12, Cytostasis, Epitalon, Europhenome, MBNL2, MKRN3, Sepsis, Stearman C3
- 5 - CaCl2 - BAF agar, Calcium chloride transformation, Canavalin, MMN medium, Transformation efficiency
- 4 - BH5 - HMS Centaur (R06), Pokesdown, Subaru EJ engine, Tommykaira
- 4 - H2N - Beetin, H2N, List of Ambisonic hardware, Thermoset polymer matrix
- 4 - MF6 - List of Honda engines, List of Mullard–Philips vacuum tubes, MODFLOW, Soyuz programme
- 3 - CaCO3 - Freshwater acidification, Marble, Snowpiercer
- 3 - Co2 - Human leg, The Coca-Cola Company
- 3 - H2O2 - Beetin, Dakin oxidation, P-Chlorocresol
- 3 - NO2 - Deforestation and climate change, Haplogroup K2a (Y-DNA), Paulínia
- 3 - TiO2 - David McCarthy (academic), Indian Rare Earths, Photocatalysis
- 2 - C10H15N - Isopropylbenzylamine, N,N-Dimethylphenethylamine
- 2 - FeCl3 - BAF agar, MMN medium
- 2 - MgCl2 - Canavalin, Transformation efficiency
- 2 - MgSO4 - BAF agar, MMN medium
- 1 - BaSO4 - Hokutolite
- 1 - C26H27FN2O2 - Aticaprant → parse failure, as this is in a template for an inchi string
- 1 - C2H2O2 - Synchrony (The X-Files)
- 1 - C2H6 - 252P/LINEAR
- 1 - C2N2 - Nakajima Ki-6
- 1 - C3H8 - Flame treatment
- 1 - C3O - CM chondrite
- 1 - C4H10 - Flame treatment
- 1 - C540 - Kenworth
- 1 - C58H73N7O17 - Anidulafungin → apparent parse failure, only in template
- 1 - C6H15NO2 - Diisopropanolamine
- 1 - Ca(OH)2 - Calcium caseinate
- 1 - H2SO4 - Copper(II) sulfate
- 1 - H₂O - Community respiration
- 1 - K2O - Phonolite
- 1 - LiClO4 - History of the lithium-ion battery
- 1 - Na2O - Phonolite
- 1 - PBr7 - List of inorganic pigments
- 1 - YH3 - Dodge Ram van
Problem cases
Parsing problems (where noted) are probably resulting in words showing up in debug-spellcheck-ignored.txt that shouldn't. -- Beland (talk) 03:09, 27 December 2022 (UTC)
- 12/11 - Al2Si2O5 → Parsing problems? Might be leaking out of Al2Si2O5(OH)4 or <chem>...</chem>?; see Kaolinite find all
- 8/3 - Fe7C3 → Form of Iron carbide - parsing problems?
- 8/5 - Mg3Al2 → From silicate mineral Mg3Al2(SiO4)3[1] and its compositional variations; see Pyrope - possible parsing problems
- 1 - C21H17F4NO3S2 - GW0742
- Seems to be some sort of wikitext parser failure; this should be hidden inside {{Drugbox}}
- 1 - C19H19N7O6 - Folate
- Seems to be some sort of wikitext parser failure; this should be hidden inside {{Drugbox}} perhaps due to nowiki
- 1 - C58H73N7O17 - Anidulafungin
- Seems to be some sort of wikitext parser failure; this should be hidden inside {{Drugbox}}. -- Beland (talk) 00:57, 5 December 2021 (UTC)
- 1 - C22H27N3O4S - Azeloprazole
- Seems to be some sort of wikitext parser failure; this should be hidden inside {{Drugbox}}
- 15/6 - Si6Al2 → From Ca2[(Mg,Fe)3Al2]Si6Al2O22(OH)2 and its many compositional variations; see Double chain inosilicates
- Some of these seem to be parse failures from tables? -- Beland (talk) 03:17, 26 August 2022 (UTC)
- 11/7 - Si6O18 → Compound of SiO3; see Silicate and Cyclosilicates. Related to Beryl.
- Remainder are probably parse failures. -- Beland (talk) 03:20, 26 August 2022 (UTC)
- 9/8 - Si3O9 → As above. Related to Benitoite.
- More parsing problems. -- Beland (talk) 03:25, 26 August 2022 (UTC)
- 8/7 - Si4O11 → See Inosilicates - parsing problems
- 7/1 - Ga2I62 → Related to Gallium halides; see Intermediate halides - no longer found in source, parsing problems?
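The parse failures noted above all involve formula fragments that should have stayed hidden inside a template or a `<chem>` tag. As a minimal sketch of one way to hide such spans before tokenizing (this is not taken from the moss source; the helper name `strip_hidden_markup` and the sample text are invented for illustration):

```python
import re

# Hypothetical helper, not moss's actual parser.
CHEM_TAG = re.compile(r"<chem>.*?</chem>", re.DOTALL | re.IGNORECASE)
INNERMOST_TEMPLATE = re.compile(r"\{\{[^{}]*\}\}")

def strip_hidden_markup(wikitext):
    """Remove <chem> spans and (possibly nested) {{templates}}."""
    text = CHEM_TAG.sub(" ", wikitext)
    # Strip innermost templates repeatedly so nested ones disappear too.
    while True:
        text, count = INNERMOST_TEMPLATE.subn(" ", text)
        if count == 0:
            return text

# Invented sample text for illustration only.
sample = (
    "Kaolinite is <chem>Al2Si2O5(OH)4</chem>. "
    "{{Drugbox|formula={{chem|C|19|H|19}}}} Folate is a vitamin."
)
cleaned = strip_hidden_markup(sample)
```

A regex pass like this does not handle `<nowiki>` sections or tables, which the notes above suggest are also involved, so it would only narrow the problem rather than solve it.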
Repeating patterns
Rhyme schemes probably need to be restyled to follow Wikipedia:WikiProject Poetry#Style for rhyme schemes. If this ends up making them all-caps, they won't show up here on the next run. For mixed-case rhyme scheme notations, use {{not a typo}} after making sure dashes, commas, and spaces follow the recommended style.
(All fixed as of 2022-12-20 dump!)
False positives
Is there a word that is correctly used in an article, but which shouldn't be added to Wiktionary? List it here, and Beland will fix the problem.
Archived solutions: Wikipedia:Typo Team/moss/Archive
- wikt:singer(s), wikt:composer(s), etc. Found in Kanto (music).
False negatives
Is there a misspelled word in an article mentioned here that was not reported? Feel free to list it below and Beland will try to improve the code if appropriate.
These are currently over-ignored, but could be used to suggest correct spellings:
- Wikipedia articles with {{R from misspelling}}, {{R from incorrect name}}, {{R from miscapitalisation}}, and redirects to these templates
- Wiktionary entries that are known misspellings (e.g. wikt:anticiliary)
- In cases where there are variant spellings of the same word or phrase, Wikipedia should probably pick one and stick to it except to mention the variants. This happens with:
- Compound words - whether to use a space, dash, or nothing, as in "junebug" vs. "june bug" or "email" vs. "e-mail".
- Words with multiple transliterations from another language (often there are multiple systems, no particular system, or a modern system different from historical systems).
- Redirects with {{R from alternate spelling}} and redirects to that template.
- Article Ana Recio Harvey | detected misspelling: appoinment | additional, undetected misspelling: enterpreneur
- Looks like this was because of redirects with "enterpreneur" in the title. I have tagged them all {{R from misspelling}}, but I'll have to change the code to ignore those, as noted above. Thanks for catching that! -- Beland (talk) 23:52, 18 October 2018 (UTC)
- 1 - Jack Beckitt - wikt:monacled -> also had "whow" in place of who --Xurizuri (talk) 05:10, 5 February 2021 (UTC)
- 1 - Jack Jenkins (rugby player) - wikt:scummage -> "forst" instead of "first" --Xurizuri (talk) 05:43, 5 February 2021 (UTC)
- 1 - Johan Christian Drewsen - wikt:cultication -> "Rogether" instead of "Together", at the start of a sentence. "Copenahgen" instead of "Copenhagen". They obviously didn't get picked up because of capitalisation, but thought I'd list them here anyway just in case it helps. -Xurizuri (talk) 11:09, 13 February 2021 (UTC)
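The idea above of using misspelling redirects to suggest corrections, instead of treating their titles as valid words, could look roughly like this. It is a sketch under assumptions: the input shape (redirect title, target title, list of redirect templates) and the function name `split_titles` are hypothetical, and the sample rows are invented rather than taken from a dump.

```python
# Redirect templates named in the list above.
MISSPELLING_TAGS = {
    "R from misspelling",
    "R from incorrect name",
    "R from miscapitalisation",
}

def split_titles(redirects):
    """Separate redirect titles into accepted words and known misspellings."""
    known_good = set()
    suggestions = {}
    for title, target, tags in redirects:
        if MISSPELLING_TAGS & set(tags):
            # Known-bad title: report it and offer the target as a correction.
            suggestions[title.lower()] = target
        else:
            known_good.add(title.lower())
    return known_good, suggestions

# Invented sample rows for illustration only.
redirects = [
    ("Enterpreneur", "Entrepreneurship", ["R from misspelling"]),
    ("Junebug", "June bug", []),
]
known_good, suggestions = split_titles(redirects)
```

With this split, "enterpreneur" would no longer be accepted just because a redirect carries it in the title, which is the failure described in the Ana Recio Harvey report.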
Archived notes
See Wikipedia:Typo Team/moss/Archive.
For Wiktionary
Spell-checking Wiktionary itself
A new project has started to do that using moss software, at wikt:Wiktionary:Spell check.
Triaged for Wiktionary
Dictionary writers needed! And speakers of languages other than English!
Many words (English and otherwise) detected as potential typos have been manually triaged as legitimate words that need to be added to Wiktionary, and are listed at Wikipedia:Typo Team/moss/For Wiktionary. (Moved from this page due to length.) Many of the subpages under the misspelling main listing also have long lists of words to add to Wiktionary, which are sometimes bundled up and moved to the "For Wiktionary" subpage.
Wiktionary aims to have definitions for all words in all languages (with some exceptions), and acts as the primary database for the moss spell-checker.
Highest-frequency words missing from dictionary (a-m)
(updated 2022-12-20) These are good candidates for words to add to the English Wiktionary (which provides English definitions for words in all languages, including all compound words), as it seems English Wikipedia readers will frequently encounter them. For each run, only words from half of the alphabet are shown, to avoid duplicate work while new dumps are being processed.
Most of the words are not from English. To get them off this list, you can either add an entry to the English Wiktionary (which provides English definitions for words in all languages) or tag all instances of the word on the English Wikipedia with {{lang}}. Wiktionary does not accept Romanizations for some languages, so those cases must be tagged as {{transl}} or {{lang}}.
Legitimate misspellings are candidates for Wikipedia:Lists of common misspellings. If there is an obvious correction, adding that to Wikipedia:Lists of common misspellings/For machines will help editors who use automated tools to fix cases faster.
- 106 - wikt:farābād - Bagh-e Jafarabad, Jafarabad, Ahar, Jafarabad, Alborz, Jafarabad, Amol, Jafarabad, Andika ... find all
- 90 - wikt:dehs - Badin, Badin District, Dadu District, Ghotki District, Hyderabad District, Sindh ... find all
- 89 - wikt:eː - Acute accent, African Romance, Alsea language, Atong language (Sino-Tibetan), Awadhi language ... find all
- 77 - wikt:ispánate - Andrew I Hont-Pázmány, Andrew, Bishop of Eger, Arnold II Hahót, Atyusz (genus), Atyusz Hahót ... find all
- 66 - wikt:çokgözlü - List of butterflies of Turkey ... find all
- 63 - wikt:jangha - Akhadachandi Temple, Arjunesvara Siva Temple, Belesvara Siva Temple, Bhimesvara Bisrama ghara, Bhringesvara Siva Temple ... find all
- 58 - wikt:īlābād - Boneh-ye Esmail, Khuzestan, Eshqabad, West Azerbaijan, Esmailabad (north), Dorudzan, Esmailabad (north), Gowhar Kuh, Esmailabad (south), Dorudzan ... find all
- 56 - wikt:საკრებულო - Abasha Municipality, Adigeni Municipality, Akhalkalaki Municipality, Akhaltsikhe Municipality, Akhmeta Municipality ... find all
- 48 - wikt:molodezhnaja - Foster Daddy, Tora!, Hearts and Flowers for Tora-san, Maid-Droid, Marriage Counselor Tora-san, Stage-Struck Tora-san ... find all
- 48 - wikt:medjig - Magic square ... find all
- 44 - wikt:liveshow - Anders Kobro, Australian Idol 3: The Final 13 – Australian Made: The Hits, Bảo Thy, Cẩm Ly, Deutschland sucht den Superstar ... find all
- 44 - wikt:afwc - A S M Ridwanur Rahman, AHM Fazlul Haque, ASM Fakhrul Islam, Ashraful Hoq Chowdhury, Bangladesh Coast Guard ... find all
- 43 - wikt:äkım - 2020 Dungan–Kazakh ethnic clashes, 2021 Kazakh municipal elections, 21st Nur Otan Extraordinary Congress, Ashat Oralov, Nurlan Nogaev ... find all
- 42 - wikt:īdābād - Aqeh Kheyl, Gorgabad, Ardabil, Kalateh-ye Seyyed Ali, South Khorasan, Nematabad-e Ghar, Sadabad, Kerman ... find all
- 41 - wikt:äkıms - 2019 Kazakh presidential election, 2021 Kazakh municipal elections, Bakhytjan Sagintayev, Kazakh democracy movement, List of Äkims of Atyrau Region ... find all
- 41 - wikt:hvbat - Swedish Army ... find all
- 40 - wikt:ɛː - Bahrani Arabic, Bora language, Buryat language, Bärenbach, Bad Kreuznach, Cremunés dialect ... find all
- 40 - wikt:ŭnbyŏng - Goryeo coinage, Korean currency, Korean mun ... find all
- 37 - wikt:øltapper - Admiralgade 23, Badstuestræde 7, Bredgade 24, Ernst Burmeister, Gråbrødretorv 4 ... find all
- 36 - wikt:α-hydroxykaurane - Niebla (lichen), Niebla homalea, Niebla infundibula, Niebla juncosa, Vermilacinia ... find all
- 36 - wikt:ŋg - Chimbu–Wahgi languages, Dagan languages, East Strickland languages, Finisterre–Huon languages, Gogodala–Suki languages ... find all
- 35 - wikt:āgamas - Anekantavada, Antakrddaasah, Anuttaraupapātikadaśāh, Aupapatika, Bodhipakkhiyādhammā ... find all
- 34 - wikt:προσευχη - Codex Augiensis, Codex Claromontanus, Codex Porphyrianus, Minuscule 1739, Minuscule 181 ... find all
- 33 - wikt:guachimonton - Guachimontones, Tequila, Jalisco, Teuchitlán culture ... find all
- 32 - wikt:farābād-e - Jafarabad, Ardestan, Jafarabad, Ferdows, Jafarabad, Golestan, Jafarabad, Ilam, Jafarabad, Nahavand ... find all
- 32 - wikt:dcdn - Content delivery network interconnection ... find all
- 31 - wikt:maçkolik - 1964–65 Mersin İdmanyurdu season, 1965–66 Mersin İdmanyurdu season, 1966–67 Mersin İdmanyurdu season, 1967–68 Mersin İdmanyurdu season, 1969–70 Mersin İdmanyurdu season ... find all
- 31 - wikt:fstnt - TNNI2, TNNT1, TNNT2, TNNT3 ... find all
- 30 - wikt:kammerpige - Cathrine Marie Gielstrup, Johanna Elisabeth Dahlén, Louise Phister, Marie Cathrine Preisler ... find all
- 30 - wikt:coaxium - Han Solo, Lego Star Wars: Summer Vacation, List of Star Wars characters, Qi'ra, Solo: A Star Wars Story ... find all
- 30 - wikt:bajraks - Albanian tribes, Bajrak, Bulgëri, Dukagjin Highlands, Ghegs ... find all
- 29 - wikt:ọgbanje - Odinala, Ogbanje, West African mythology ... find all
- 29 - wikt:arwulf - A Magical Approach, Blue Ballads, Blythe Byte, Collaborations (Marilyn Crispell album), Exhale (Arthur Blythe album) ... find all
- 28 - wikt:mäslihats - 1994 Kazakh local elections, 2021 Kazakh local legislative elections, 2021 Kazakh municipal elections, 2022 Kazakh constitutional referendum, Akmola Regional Mäslihat ... find all
- 28 - wikt:itti-marduk-balāṭu - Asharid-apal-Ekur, Ashur-bel-kala, Bārûtu, Itti-Marduk-balatu, Itti-Marduk-balatu (king) ... find all
- 28 - wikt:gsang - Anuyoga, Guhyagarbha tantra, Guhyasamāja Tantra, Mahayoga, Mandāravā ... find all
- 28 - wikt:dābād - Aliabad-e Jowhari, Asgarabad, Fars, Narmeh, Radabad, Sadabad, Anbarabad ... find all
- 28 - wikt:dharas - Alko Hiti, Dhunge dhara, Drinking fountain, History of water supply and sanitation, Kathmandu ... find all
- 28 - wikt:apkc - Barry James Thompson, Carla V. Rothlin, Cell cortex, Cell polarity, Epithelial polarity ... find all
- 27 - wikt:ādatābād - Dashtabad, Narmashir, Saadatabad, Abadeh, Saadatabad, Arsanjan, Saadatabad, Bardsir, Saadatabad, Darab ... find all
- 27 - wikt:eremophilas - Andrew Phillip Brown, Calamphoreus, Eremophila (plant), Eremophila anomala, Eremophila bowmanii ... find all
- 27 - wikt:dacoz - Epimeria ... find all
- 26 - wikt:fänikor - 1st Life Grenadier Regiment (Sweden), 2nd Life Grenadier Regiment (Sweden), Halland Regiment, Jönköping Regiment, Kalmar Regiment ... find all
- 26 - wikt:funfactor - Battle Arena Toshinden 2, Brain Dead 13, Brandish (video game), Chrono Trigger, College Football USA 97 ... find all
- 25 - wikt:mprs - Allopregnanolone, Membrane progesterone receptor, Membrane steroid receptor, Pharmacodynamics of progesterone, Progesterone ... find all
- 25 - wikt:külliyye - Külliye ... find all
- 25 - wikt:bandform - Analogue filter, Composite image filter, Distributed-element filter, Electronic filter topology, Filter (signal processing) ... find all
- 25 - wikt:aɪ - A Pronouncing Dictionary of American English, Bequia English, British English, Burmese phonology, Cantonese ... find all
- 24 - wikt:ṣomä - Beta Israel, Haymanot ... find all
- 24 - wikt:şehnameci - Seyyid Lokman ... find all
- 24 - wikt:mesoleg - Eristalis brousii, Sphegina, Sphegina achaeta, Sphegina albolobata, Sphegina amplistylus ... find all
- 24 - wikt:lyrium - Characters of Dragon Age II, Characters of Dragon Age: Inquisition, Characters of Dragon Age: Origins, Dragon Age, Dragon Age II ... find all
- 24 - wikt:jātakas - Early Buddhist texts, Jataka tales, Vessantara Jātaka ... find all
- 24 - wikt:istudy - Bored of Studies, System Technology-i Co, Ltd ... find all
- 24 - wikt:hradani - Oath of Swords, The War God's Own, Wind Rider's Oath ... find all
- 24 - wikt:echinulins - Aspergillus aerius, Aspergillus appendiculatus, Aspergillus brunneus, Aspergillus caperatus, Aspergillus cibarius ... find all
- 23 - wikt:νηστεια - Codex Augiensis, Codex Claromontanus, Codex Porphyrianus, Minuscule 1739, Minuscule 181 ... find all
- 23 - wikt:župans - Albanian nobility, Bulgarian–Serbian wars of 917–924, Byzantine–Bulgarian war of 913–927, Constantine Bodin, Croatian nobility ... find all
- 23 - wikt:metalbending - Avatar: The Last Airbender (comics), Avatar: The Last Airbender – North and South, Avatar: The Last Airbender – The Promise, Avatar: The Last Airbender – The Rift, Bolin (The Legend of Korra) ... find all
- only used in one comic, not WT-includable
- 23 - wikt:etraby - Ashgabat, Balkan Region, Districts of Turkmenistan, Regions of Turkmenistan, Türkmenbaşy District ... find all
- 23 - wikt:dihydroauroglaucin - Aspergillus aerius, Aspergillus appendiculatus, Aspergillus brunneus, Aspergillus caperatus, Aspergillus cibarius ... find all
- 23 - wikt:cmybp-c - Myosin binding protein C, cardiac ... find all
- 23 - wikt:chronokinetic - Irredeemable, List of Kamen Rider Ex-Aid characters, List of Kamen Rider Saber characters, List of Kamen Rider Zi-O characters, List of returning characters in Kamen Rider Zi-O ... find all
- 23 - wikt:cayolar - Alçay-Alçabéhéty-Sunharette, Aussurucq ... find all
- 23 - wikt:baserris - Baserri, Etxeberria, Gipuzkoa, History of the Basques, Manor house ... find all
- 22 - wikt:metalogs - Metalog distribution, Pearson distribution ... find all
- 22 - wikt:lipfire - Ethan Allen (armsmaker) ... find all
- 22 - wikt:kubbs - Kubb, The Amazing Race 6 ... find all
- 22 - wikt:honā - Continuous and progressive aspects, Future tense, Grammatical mood, Habitual aspect, Imperfective aspect ... find all
- 22 - wikt:haltijas - Finnish paganism, Haltija ... find all
- 22 - wikt:groundplot - Clube Atlético Hermann Aichinger, Douradão, Estádio Anacleto Campanella, Estádio Couto Pereira, Estádio Décio Vitta ... find all
- 22 - wikt:fújì - Debbie Klein, Fuji music ... find all
- 21 - wikt:экз - Ekaterina Georgiewna Czerniakowska, Emmanuel Steinschneider, Gorodetsky Glacier, History of the Jews in Brody, Ilya Shifman ... find all
- 21 - wikt:сост - Dmitry Yakovlevich Popov, Efim Kolbintsev, Gaisa Enikeev, Ibniyamin Akhtyamov, Ivan Aleksandrovich Flerov ... find all
- 21 - wikt:θεου - Codex Basilensis A. N. IV. 4, Codex Glazier, Codex Vaticanus 2061, Lectionary 60, Minuscule 1739 ... find all
- 21 - wikt:ʃa - Campaniacum, Central Atlas Tamazight, Central Atlas Tamazight grammar, Da (Indic), József Eötvös ... find all
- 21 - wikt:ánthos - Anigozanthos, Annona haematantha, Antheraxanthin, Anthodite, Anthology ... find all
- 21 - wikt:maljo - Evil eye ... find all
- 21 - wikt:madōgu - Flame of Recca, List of Flame of Recca characters ... find all
- 21 - wikt:kleismas - Semantic System ... find all
- 21 - wikt:ispánates - Alexander Köcski, Ampud II, Andrew Kőszegi, Denis Tomaj, Denis, son of Ampud ... find all
- 21 - wikt:hoiamide - Hoiamides ... find all
- 21 - wikt:godspoken - Gloriously Bright, List of Ender's Game characters, Xenocide ... find all
- 21 - wikt:geoviewer - Alder Brook (West Branch French Creek tributary), Bailey Brook (West Branch French Creek tributary), Baskin Run, Beaver Run (South Branch French Creek tributary), Beaverdam Creek (Crabtree Creek tributary) ... find all
- 21 - wikt:fmris - Auditory hallucination, Bilingual memory, Brain types, Cross modal plasticity, Dyslexia ... find all
- 21 - wikt:bullettes - Cyrtogomphoceras, Cyrtogomphoceratidae, Discosorus, Hecatoceras, Kiaeroceras ... find all
- 21 - wikt:anuratha - Arjunesvara Siva Temple, Bata Mahadeva, Bhringesvara Siva Temple, Byamokesvara Temple, Devasabha Temple ... find all
- 20 - wikt:ḡalyān - Hookah, Muʽassel ... find all
- 20 - wikt:ārohaṇa-avarohaṇa - Ahiri, Asaveri, Darbari Kanada, Garudadhvani, Gaula (raga) ... find all
- 20 - wikt:myfm - CFMP-FM, CHCD-FM, CHMY-FM, CIMA-FM, CIMY-FM ... find all
- 20 - wikt:masnawī - Mathnawi ... find all
- 20 - wikt:maskalai - Arsha (community development block), Baghmundi (community development block), Balarampur, Purulia (community development block), Barabazar (community development block), Binpur I ... find all
- 20 - wikt:liveshows - Andrea Renzullo, Cẩm Ly, Deutschland sucht den Superstar, Deutschland sucht den Superstar (season 8), Got to Dance – Tylko Taniec ... find all
- 20 - wikt:exrnas - Extracellular RNA, Non-coding RNA, Oncomir ... find all
- 20 - wikt:evtols - Airbus CityAirbus, Avolon, EVTOL, Eve Air Mobility, Personal air vehicle ... find all
- 20 - wikt:commlock - All That Glisters (Space: 1999), Dragon's Domain, Earthbound (Space: 1999), Guardian of Piri, Seed of Destruction (Space: 1999) ... find all
- Only used in the show. Not Wiktionary-worthy P. Sovjunk (talk) 10:26, 1 May 2024 (UTC)
- 20 - wikt:cohopfian - Ax–Grothendieck theorem, Finitely generated module, Hopfian object ... find all
- 20 - wikt:bushdrive - Toyota Land Cruiser, Toyota Land Cruiser (J40) ... find all
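A frequency listing like the one above could be produced along these lines. This is a minimal sketch, not moss's actual code: the function name `top_missing`, the input shape, and the rule for which half of the alphabet to skip are assumptions. In the list above every ASCII-initial word starts with a to m while non-ASCII initials also appear, so the sketch only skips words starting with n to z. The sample data is invented.

```python
from collections import Counter

def top_missing(words_by_article, dictionary, skip_initials="nopqrstuvwxyz"):
    """Count occurrences of words missing from the dictionary, half an alphabet at a time."""
    counts = Counter()
    for words in words_by_article.values():
        for word in words:
            if word and word not in dictionary and word[0] not in skip_initials:
                counts[word] += 1
    # Highest count first; ties broken in reverse code-point order (a guess).
    return sorted(counts.items(), key=lambda kv: (kv[1], kv[0]), reverse=True)

# Invented sample data for illustration only.
articles = {
    "Article A": ["liveshow", "coaxium", "the"],
    "Article B": ["liveshow", "zorkmid"],
}
report = top_missing(articles, dictionary={"the"})
```

Counting occurrences rather than articles matches entries above where a single article accounts for dozens of hits.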
Translation and general cleanup
See Wikipedia:Typo Team/moss/not English.
Mismatched markup and punctuation
Errors in punctuation (mostly quotation marks) and wiki markup generally cause confusion for readers, and also prevent the spell checker from running on these articles.
Inches and feet should not use " and ', per Wikipedia:Manual of Style/Dates and numbers#Specific units; use letters instead. (See MOS:UNITS for general guidance.) Where conversions are needed, use {{convert}}, for example: 2 feet 3 inches (69 cm)
WORK IN PROGRESS
- Integrating these with main listings
- Filter only unmatched " for now
- Filter articles with non-ASCII quote marks to a separate list for JWB processing
- Filter \d" and \d' to a separate sublist for inch/feet style conversion
- Explain ✂ or skip snippets showing this
- Bracketbot web UI seems to be down
-- Beland (talk) 19:03, 4 September 2019 (UTC)
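The filters in the work-in-progress list above (unmatched ", non-ASCII quote marks, and `\d"` / `\d'`) can be sketched as a single classifier. This is an illustration under assumptions, not the planned moss implementation: the function name `classify_line`, the category labels, and the order in which the checks run are all invented here.

```python
import re

# Digit followed by a straight quote, as in the \d" and \d' filter above.
FEET_INCH = re.compile(r"""\d["']""")
# Curly double and single quotation marks.
CURLY_QUOTES = re.compile("[\u201c\u201d\u2018\u2019]")

def classify_line(line):
    """Sort a line of article text into one of the sublists described above."""
    if CURLY_QUOTES.search(line):
        return "non-ascii-quotes"
    if FEET_INCH.search(line):
        return "feet-inches"
    if line.count('"') % 2 == 1:
        return "unmatched-quote"
    return "ok"
```

For example, `classify_line('He said "hello.')` returns `"unmatched-quote"`. The digit-plus-apostrophe pattern would also catch text such as "1990's", so that sublist would still need manual review.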
Gender-neutral language
Manned
The word "manned" and related forms like "unmanned" are used in many articles, but are not gender-neutral as required by MOS:S/HE and the NASA style guide. Gender-neutral alternatives include:
- Crewed, uncrewed
- Staffed, unstaffed
- Human spaceflight
- Defended
Not all instances need to be changed.
- Proper nouns should remain the same, like Manned Orbiting Laboratory
- Titles of sources and quotes should remain unchanged.
- The term should remain where it is itself being discussed, for example to say that "manned spaceflight" is another way of saying human spaceflight.
- There seems to be consensus on unmanned aerial vehicle that this and related phrases (like unmanned aerial system) should remain intact, since it is much more frequent than "uncrewed aerial vehicle" at the moment. However, when using Wikipedia's voice it is preferred to describe a UAV as "uncrewed" when not using the whole phrase.
- Non-article pages that are retained for historical interest shouldn't be modified if they won't be visible to readers.
- Redirects with this title should be left alone if they are redirecting readers to a gender-neutral title
If the word is found in the names of articles and categories (except those with names directly related to UAVs), those should be renamed, and the links changed. Many articles have already been renamed, and the links just need to be updated. (Remember that to rename a category, all the articles in that category must be edited to change their pointers.)
- Coming soon: moss report on "manned" that ignores references, page titles, proper nouns, and consensus-OK phrases.
- Find all instances of "manned" in articles
- Find all instances of "unmanned" in articles
- Find all instances of "manned" in Wikipedia:, File:, Category:, and Portal: (recommended for advanced editors only)
- Find all instances of "unmanned" in Wikipedia:, File:, Category:, and Portal: (recommended for advanced editors only)
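A report like the "coming soon" one could start from something like the sketch below. It is not moss code. The names `ALLOWED_PHRASES`, `MANNED_RE` and `find_manned` are hypothetical, and it covers only two of the exceptions above: the consensus-OK UAV phrases are blanked out first, and capitalized forms are skipped on the assumption that they are proper nouns. References, quotations and page titles are not handled here.

```python
import re

# Consensus-OK phrases that should stay intact (hypothetical, partial list).
ALLOWED_PHRASES = re.compile(r"\bunmanned aerial (?:vehicle|system)s?\b", re.IGNORECASE)

# Lowercase only, so capitalized proper nouns such as
# "Manned Orbiting Laboratory" are not reported.
MANNED_RE = re.compile(r"\b(?:un)?manned\b")


def find_manned(text):
    """Return (offset, word) pairs for lowercase "manned"/"unmanned"."""
    # Blank out allowed phrases with spaces so offsets still line up.
    masked = ALLOWED_PHRASES.sub(lambda m: " " * len(m.group(0)), text)
    return [(m.start(), m.group(0)) for m in MANNED_RE.finditer(masked)]


sample = ("The Manned Orbiting Laboratory was a manned station; "
          "an unmanned aerial vehicle passed an unmanned probe.")
print(find_manned(sample))
```

Because capitalized forms are skipped, a sentence that begins with "Manned" would be missed, so this is a starting point for review rather than a complete filter.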
Borderline cases
These may need to be discussed before being changed.
- Manned Venus flyby - Based on the NASA style guide, NASA probably would now refer to this as "human Venus flyby" but historical sources say "manned Venus flyby" so that's what the majority of editors commenting on the talk page currently favor. There is some question as to whether the scope of the article concerns a specific mission or this type of mission in general, which is related to the proper name exception (but then the title would be "Manned Venus Flyby"). Compare Colonization of Venus and Human mission to Mars. -- Beland (talk) 19:41, 21 May 2019 (UTC)
- Discussion in progress on Talk:Manned Venus flyby. -- Beland (talk) 09:37, 5 January 2022 (UTC)
Objections in specific cases:
Marriage
Wikipedia:Writing about women § Marriage points out:
- "is the wife of" is less neutral than "is married to" - find all "is the wife of"
- "born to X and his wife Y" is less neutral than "born to X and Y" - approximate search
- "man and wife" is less neutral than "husband and wife", and to be fully neutral the order should be varied - find all "man and wife"
Ladies
Wikipedia:Writing about women § Girls, ladies prefers "women" to "ladies" except where part of set phrases or traditional titles (like first lady). find all lowercase "ladies"
Instructional and presumptuous language
MOS:NOTE says to avoid the following phrases when they address the reader directly. Not all instances are problematic, such as those in direct quotations.
- remember that - find all "remember that"
- note that - find all "note that"
- of course - find all "of course"
- naturally - find all "naturally" (the meaning "related to nature" is not problematic)
- obviously - find all "obviously"
- clearly - find all "clearly"
- actually - find all "actually"
- rhetorical questions, especially in headings - find all questions in headings (some cases, like the names of works, are not problematic)
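For editors who prefer to scan a local copy of some wikitext, the phrase list above can be approximated with a pair of regular expressions. This is a minimal sketch, not the search behind the "find all" links. The names `PHRASES`, `find_instructional_phrases` and `find_question_headings` are hypothetical, and the code cannot tell whether a phrase addresses the reader or sits inside a direct quotation.

```python
import re

# Phrases taken from the list above.
PHRASES = ("remember that", "note that", "of course", "naturally",
           "obviously", "clearly", "actually")
PHRASE_RE = re.compile(
    r"\b(" + "|".join(re.escape(p) for p in PHRASES) + r")\b", re.IGNORECASE)

# A wikitext heading line such as "== Why? ==" whose title ends in "?".
HEADING_QUESTION_RE = re.compile(r"^(=+)\s*(.*\?)\s*\1\s*$", re.MULTILINE)


def find_instructional_phrases(text):
    """Return (offset, phrase) pairs, with the phrase lowercased."""
    return [(m.start(), m.group(1).lower()) for m in PHRASE_RE.finditer(text)]


def find_question_headings(wikitext):
    """Return the titles of headings that end in a question mark."""
    return [m.group(2) for m in HEADING_QUESTION_RE.finditer(wikitext)]


print(find_instructional_phrases("Note that the result is, of course, obviously wrong."))
print(find_question_headings("== History ==\n== Why is it so? ==\n"))
```

The word boundaries keep "unnaturally" from matching "naturally", but the "related to nature" sense of "naturally" still has to be ruled out by hand.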
Internationally comprehensible spelling and vocabulary
MOS:COMMONALITY advises the use of vocabulary and spellings that are shared across national varieties of English, where possible. This section collects instances where an unshared term is used and could be replaced with a shared one. For proper nouns and direct quotes, a translation or re-spelling into another dialect may be helpful.
- "gaol" should be "jail"
- Disputed, discussion underway at Wikipedia talk:Manual of Style#Gaol vs. jail
- looks like it's wrapped up, with jail preferred except in proper nouns Xurizuri (talk) 15:36, 21 December 2020 (UTC)
Currency style
Per MOS:CURRENCY:
- For the UK, Irish, Australian, New Zealand, and South African pound, ₤ should be changed to £
- ₤ is OK to use with Italian lira. Changing e.g. ₤100,000 to [[Italian lira|₤]]100,000 will prevent legitimate uses from showing up in automated reports, and also help readers understand that the amount is not in British pounds. (Mentions of Italian lira are increasingly rare because it has been replaced by the euro.)
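The two rules above can be illustrated with a short sketch. It assumes the only "legitimate" form is a ₤ that directly follows the pipe of a wikilink, as in `[[Italian lira|₤]]`. The names `BARE_LIRA_SIGN`, `find_bare_lira_signs` and `to_pound_sign` are hypothetical and not part of moss.

```python
import re

# ₤ (U+20A4, lira sign) that is NOT directly preceded by a wikilink pipe,
# i.e. not already written as [[Italian lira|₤]].
BARE_LIRA_SIGN = re.compile(r"(?<!\|)₤")


def find_bare_lira_signs(wikitext):
    """Return the offsets of ₤ signs that are not inside a piped link."""
    return [m.start() for m in BARE_LIRA_SIGN.finditer(wikitext)]


def to_pound_sign(wikitext):
    """Replace bare ₤ with £ (U+00A3).

    Only appropriate after an editor has confirmed that the amounts are
    pounds and not Italian lira.
    """
    return BARE_LIRA_SIGN.sub("£", wikitext)


print(to_pound_sign("The ticket cost ₤5 in London."))
print(to_pound_sign("It sold for [[Italian lira|₤]]100,000."))
```

The second call leaves the text unchanged, which is why linking the lira sign keeps it out of later reports.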
Caution: Not all problem pages show up reliably; if you do a search, fix all the pages in the results, and then do another search, you will probably get a fresh batch of problem pages. It may also take a minute or two for fixed pages to disappear from the results, due to lag updating the search index.
Work is in progress on detecting and fixing other MOS-related issues with numbers and currencies.
- all fixed as of today Graeme Bartlett (talk) 06:26, 5 October 2022 (UTC)
Small caps
Per MOS:SMALLCAPS, smallcaps are not to be used for years like "400 BC". Find all instances of known smallcaps issues...
HTML tags
Updated from 2024-04-01 dump.
You can do one of two things for these articles:
- Remove, repair, or convert the HTML markup to wiki markup yourself.
- Tag the article {{cleanup HTML}} and it will show up under Category:Articles with HTML markup but not on this list. Use the "tags" parameter to indicate which tags are present on the page; many editors find it hard to locate the offending HTML. For example: {{cleanup HTML|tags=table, cite}}
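If you tag rather than fix, the "tags" parameter can be filled in mechanically. The sketch below scans wikitext for a small set of tag names taken from the lists on this page and builds the template call. `KNOWN_TAGS`, `TAG_RE` and `cleanup_html_template` are hypothetical names, and the tag list is deliberately incomplete.

```python
import re

# Assumed subset of HTML tag names, taken from the lists on this page.
KNOWN_TAGS = ("table", "tr", "td", "th", "cite", "em", "i", "b",
              "strong", "p", "hr", "font")
TAG_RE = re.compile(
    r"</?\s*(" + "|".join(KNOWN_TAGS) + r")\b[^<>]*>", re.IGNORECASE)


def cleanup_html_template(wikitext):
    """Return a {{cleanup HTML}} call listing the tags found, or None."""
    names = []
    for match in TAG_RE.finditer(wikitext):
        name = match.group(1).lower()
        if name not in names:  # keep first-seen order, drop repeats
            names.append(name)
    if not names:
        return None
    return "{{cleanup HTML|tags=" + ", ".join(names) + "}}"


print(cleanup_html_template("<table><tr><td>x</td></tr></table> <cite>y</cite>"))
```

The word boundary after the tag name keeps `<pre>` from being reported as `<p>` and `<br>` as `<b>`.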
How to clean up
See Category:Articles with HTML markup for instructions on how to find the offending tags and what to do about them.
Find all articles by tag
Can't wait for the next database dump? Want to look for or fix all instances of a specific tag? Use the links below!
- <tt> - find all
- <li>, <ol>, and <ul> - find all
- <table>, <tr>, <td>, <th>, <caption> - find all
- <i> or <em> - find all
- <dd>, <dt>, and <dl> - find all
- <cite> - find all
- <p> - find all
- <strong> and <b> - find all
- <name=> - find all
- </br> - find all
- <hr> and <hr/> - find all
- <font> - find all
- <ins> - find all
- <samp> - find all
- <q> - find all
- <wbr> - find all and find &shy; (soft hyphen)
- <ruby>, <rt>, and <rp> - find all
- Elements and attributes obsoleted in HTML 5 have prefab searches linked from Wikipedia:HTML 5
Additional HTML problems are listed at Special:LintErrors.
Sometimes editors use angle brackets (< and >) for other purposes. Though these are not HTML markup, they often need to be fixed.
<<...>> find all can indicate:
- French quotation marks rendered as <<quoted text>>. These should be normalized to "quoted text" or 'quoted text', even in quotations, per MOS:CONFORM.
- A broken citation that should be converted to {{cite web}}
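The first case above (French-style quoting) is simple enough to sketch as a substitution. The names `DOUBLE_ANGLE` and `normalize_double_angle_quotes` are hypothetical. The code cannot distinguish a quotation from a broken citation, so it should only be applied to matches an editor has already checked.

```python
import re

# Text wrapped in doubled angle brackets, e.g. <<quoted text>>.
DOUBLE_ANGLE = re.compile(r"<<\s*(.+?)\s*>>")


def normalize_double_angle_quotes(text):
    """Rewrite <<quoted text>> as "quoted text" (straight double quotes)."""
    return DOUBLE_ANGLE.sub(r'"\1"', text)


print(normalize_double_angle_quotes("He said <<bonjour>> twice."))
```

Where the passage already sits inside double quotation marks, single quotes would be the better target, which this sketch does not attempt.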
Other weirdness:
- <the> - find all - More French quoting style, bad linking, bad citation style, etc.
- <blockquote> sometimes shows up on the reports if it is capitalized or all-caps on the article page. It should be all lowercase.
Known bad HTML tags (HB)
These are also included in the main listings.
- 725 - <em> - 2002–03 Kent Football League, 2003–04 Kent Football League, 2004–05 Kent Football League, 2005–06 Kent Football League ... find all
- 280 - <p> - 'Oh, Whistle, and I'll Come to You, My Lad', 1943: The Battle of Midway, Al Rayyan (city), Andrew and Sharon Turner, Anti Mail-Order Spouse Act ... find all
- 245 - <td> - 2021–22 Denizlispor season, 2022 West Virginia High School Boys' Soccer (AAA), Archery GB, Boxer Protocol, GJ 1214 ... find all
- 201 - <b> - 2023–24 Super League Greece 2, Agartala Sadar I Assembly constituency, Agartala Sadar II Assembly constituency, Agartala Sadar III Assembly constituency, Agartala Town Assembly constituency ... find all
- 189 - <i> - Ada or Ardor: A Family Chronicle, Attorney–client privilege, Audrey Oldfield, Canyon del Oro High School ... find all
- 152 - <hr> - 2015 Handball Super Cup, 2021 SCSA Regional Tournament, 2022 SCSA Regional Tournament, 2023–24 in Ukrainian football, NBA versus EuroLeague games ... find all
- 135 - </table> - 2021–22 Denizlispor season, 2022 West Virginia High School Boys' Soccer (AAA), Ang Probinsyano (season 6), Ang Probinsyano (season 7), Ang Probinsyano (season 8) ... find all
- 133 - <hr/> - 2011 NIRSA National Soccer Championship, 2018 NIRSA National Soccer Championship, 2019 NIRSA National Soccer Championship, 2021 NIRSA National Soccer Championship, 2022 NIRSA National Soccer Championship ... find all
- 123 - <tr> - 2021–22 Denizlispor season, 2022 West Virginia High School Boys' Soccer (AAA), Archery GB, Boxer Protocol, Family of Barack Obama ... find all
- 120 - <ol> - Biarc, Bite It, Bloodstained Oz, Blue Jays (album), Carne De Melocotón ... find all
- 82 - </ins> - Andrea Muzii, Antiphospholipid syndrome, Capitalism, Clara Ponsatí, DE-CIX ... find all
- 80 - <cite> - Antin Vasynchuk, Comparison of relational database management systems, Iris recognition, Jim (Huckleberry Finn), Kaillera ... find all
- 68 - <strong> - Control key, English relative words, Finite verb, Join Java, List of costliest tornadoes in the Americas ... find all
- 58 - <table> - 2021–22 Denizlispor season, 2022 West Virginia High School Boys' Soccer (AAA), Archery GB, Boxer Protocol, Chirped pulse amplification ... find all
- 55 - <q> - Appin (company), Aramaic Uruk incantation, Bach Temperament, Brian Skerry, Craigellachie Bridge ... find all
- 40 - <th> - Archery GB, Etymology of Norway, Geometrical Product Specification and Verification, San Antonio (disambiguation) ... find all
- 20 - </br> - Balkan Mountains, Chamberlain (band), List of Indian National Developmental Inclusive Alliance candidates for the 2024 Indian general election, Piecewise ... find all
- 5 - <dd> - Antigorite, ISO 13567 ... find all
- 3 - <dt> - ISO 13567 ... find all
- 2 - <wbr/> - Disney Renaissance, Race (human categorization) ... find all
- 2 - <dl> - Antigorite, ISO 13567 ... find all
Bad link formatting (HL)
These are also included in the main listings. Angle brackets are not used for external links (per Wikipedia:Manual of Style/Computing § Exposed URLs); "tags" like <https> and <www> are actually just bad link formatting. See Wikipedia:External links#How to link for external link syntax; use {{cite web}} for footnotes.
- 62 - <https> - 37th Annie Awards, Alhassan Tampuli Sulemana, Aparajito (2022 film), Bala Devi Chandrashekar, Bill Hynes ... find all
- 24 - <https/> - Anarchy (video game), Eduardo Costantini, Garaway Local School District, Global Rights, John Felton (canoeist) ... find all
- 17 - <http> - Atari ST User, Bill Loewen, Diana (mythology), Harvard Summit for Young Leaders in China, IT risk ... find all
- 7 - <http/> - Coober Pedy, Rietvlei Wetland Reserve ... find all
- 4 - <www> - Earlimart (band), List of phishing incidents, Neuroplastic surgery, Paul Juon ... find all
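The pseudo-tags in this list usually come from a URL wrapped in angle brackets. A rough way to locate them in wikitext is sketched below. `ANGLE_URL_RE` and `find_angle_bracket_urls` are hypothetical names, and the pattern assumes the URL contains no spaces or further angle brackets.

```python
import re

# A URL (with scheme, or starting with "www.") wrapped in angle brackets.
ANGLE_URL_RE = re.compile(r"<((?:https?://|www\.)[^\s<>]+)>")


def find_angle_bracket_urls(wikitext):
    """Return the URLs that appear as <http...>, <https...> or <www...>."""
    return [m.group(1) for m in ANGLE_URL_RE.finditer(wikitext)]


sample = "See <https://example.org/a> and <www.example.com>."
print(find_angle_bracket_urls(sample))
```

The sketch only reports the URLs. Whether each one becomes an external link or a {{cite web}} footnote is an editorial decision.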
Unsorted (H)
Many of these can be replaced by {{var}} (for text to be replaced) or {{angbr}} (e.g. for linguistic notation). Enclose inline software source code in <code>...</code>.
- 34 - <templatestyles/> - 2018 United States House of Representatives elections in California, 2022 New York State Assembly election, 2023 Rugby World Cup knockout stage, 2024 United States House of Representatives elections in California, Augusto Barrios ... find all
- 15 - <caption> - Dolmen (miniseries), Geometrical Product Specification and Verification, San Antonio (disambiguation) ... find all
- 15 - </caption> - Dolmen (miniseries), Geometrical Product Specification and Verification, San Antonio (disambiguation) ... find all
- 14 - <f> - 21st Street–Queensbridge station, 36th Street station (IND Queens Boulevard Line), 42nd Street–Bryant Park/Fifth Avenue station, 47th–50th Streets–Rockefeller Center station, 57th Street station (IND Sixth Avenue Line) ... find all
- 13 - <the> - Art in Paris, Cho Seung-woo, David Ord, Five Seals, Olympic Hall ... find all
- 11 - <t> - Caipira dialect, Database transaction schedule, Priority queue, Sardica paschal table, Smart Pascal ... find all
- 10 - <edited> - Yaron Avitov ... find all
- 9 - <mark> - Dynamic time warping, Google Analytics, Henry Raikes, Janet Lusk, Juan Correa ... find all
- 9 - <c> - List of Schedule 1 substances (CWC), Middle Mongol, Peetre theorem, Saliba language, Tamil phonology ... find all
- 9 - </mark> - Dynamic time warping, Google Analytics, Henry Raikes, Janet Lusk, Juan Correa ... find all
- 8 - <canadian> - Whiteshell Laboratories ... find all
- 8 - <be> - Chen Hualan, Jo Jung-min, Luigi Giorgi (soldier), TNCO ceilings, Vincenzo Lancia ... find all
- 7 - <x> - Fusion tree, Saliba language, Surface and bulk erosion, Surface second harmonic generation, Template metaprogramming ... find all
- 7 - <o> - Cremunés dialect, Inquiry, Lai Tay script, Language planning, Quebec French phonology ... find all
- 7 - <n> - Attié language, Ban number, Caipira dialect, Ebrié language, Proto-Afroasiatic language ... find all
- 7 - <encore> - Lead 15th Anniversary Live Box, Lead Upturn 2011: Sun x You, Lead Upturn 2013: Leap, Lead Upturn 2016: The Showcase, Lead Upturn 2019: Sync ... find all
- 6 - <sic> - Babson-Alling House, Battle of Fort Bull, Hack Wilson, Keilor, Victoria, Shadow World (role-playing game) ... find all
- 6 - <pig> - Cubanate, En Esch, I Ya Toyah, WWIII Live 2003 ... find all
- 6 - <onlyinclude/> - 2011 ANZ Championship season, 2012 WNBL Finals, 2015 WNBL Finals, Ink Master season 11, List of Los Bastardos episodes ... find all
- 6 - <not> - Delhi–Alwar Regional Rapid Transit System, Fail-safe, Hebron, New York, Stoked for the Holidays ... find all
- 6 - <name/> - Carbon governance in England, Chalena Vásquez, Elvin C. Stakman, Libeaus Desconus, Pioneer Valley ... find all
- 6 - <m> - Jacobi elliptic functions, Kolmogorov complexity, Nasal consonant, Samsung Economic Research Institute, Suyá language ... find all
- 6 - <in> - Robert Ira Lewy, Yaron Avitov ... find all
- 6 - <impact> - Whiteshell Laboratories ... find all
- 6 - <e> - Caipira dialect, Cremunés dialect, Language planning, Projection principle, Traditional Spelling Revised ... find all
- 6 - <d> - Northern Borderlands dialect, Northern, Central and Southern Vietnam, Pakora, Papadam, Southern Borderlands dialect ... find all
- 6 - <ch> - Chan Chan, Gwari language, Romanization, Southern Borderlands dialect, Trellech ... find all
- 5 - <y> - Caipira dialect, Middle Mongol, Ordinal Pareto efficiency, Redundancy principle (biology), Yeísmo ... find all
- 5 - <sh> - Gwari language, Judaeo-Spanish, Middle Mongol, Weltdeutsch, Ṣ ... find all
- 5 - <reference> - Olympus Guardian ... find all
- 5 - <location> - SIGMET ... find all
- 5 - <l> - Caipira dialect, Dia, Mali, Teiresias algorithm ... find all
- 5 - <eom> - Range coding ... find all
- 5 - <annals> - Boyle Abbey ... find all
- 4 - <z> - Dutch-language literature, General Chinese, Leiden Willeram, Weltdeutsch ... find all
- 4 - <who> - Audiometry, Iba N'Diaye, Nicolay family, VOEvent ... find all
- 4 - <what> - Fan rice, J. Lynn Helms, Ractopamine, VOEvent ... find all
- 4 - <usa> - Cape Verdi, Ramruma, Sun Princess (horse) ... find all
- 4 - <target> - BlackNurse, Client-to-client protocol ... find all
- 4 - <step> - Servotron 9000 ... find all
- 4 - <statement> - C syntax, DG/L, XML for Analysis ... find all
- 4 - <sketch> - Fusion tree ... find all
- 4 - <program> - Shift-reduce parser ... find all
- 4 - <liberty> - Ideograph (rhetoric) ... find all
- 4 - <is> - Gellish, Semantic data model ... find all
- 4 - <if> - Executive Council of Hong Kong, Executive Council of Macau ... find all
- 4 - <g> - Crimean Gothic, Intercultural communication principles, Ormulum, Osmanoğlu family ... find all
- 4 - <first> - Luhn mod N algorithm, St Mary's Cathedral, Edinburgh (Episcopal) ... find all
Need debugging
- 19 - <pre> - Arena (web browser), Back-to-back user agent, BagIt, Call graph, Code folding ... find all
- (These look legit, probably a moss bug. Beland note to self: Run these on wikitext_util functions in an interactive window to find parse breakage.)
- 5 - </gallery> - Broch of Gurness, Danapur, Museum Mayer van den Bergh, Osceola, Iowa, Spitakavor Monastery ... find all
Notification of new dumps
"Most likely misspellings by articles" should always have work to do (if not, ping Beland to add more from the current dump). Some of the other sections are occasionally waiting for a new dump to get a useful list, either because they are ranked by frequency or a code change has been made to clean up noise in the next run. New runs are generally posted twice a month. The database snapshot from the first day of the month generally takes about 9-13 days to process, and the snapshot from the twentieth day of the month might take 4-6 days until it can be posted.
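As a rough guide to when a new list might appear, the timing above can be turned into date arithmetic. The sketch reads the processing times as days counted from the snapshot date, which is an interpretation of the paragraph above. `PROCESSING_DAYS` and `expected_posting_window` are hypothetical names, and actual posting dates vary.

```python
from datetime import date, timedelta

# Snapshot day of month -> (fewest, most) days until results are posted,
# read from the paragraph above. Treated here as an approximation.
PROCESSING_DAYS = {1: (9, 13), 20: (4, 6)}


def expected_posting_window(dump_date):
    """Return (earliest, latest) estimated posting dates for a snapshot."""
    low, high = PROCESSING_DAYS[dump_date.day]
    return dump_date + timedelta(days=low), dump_date + timedelta(days=high)


earliest, latest = expected_posting_window(date(2024, 4, 1))
print(earliest.isoformat(), "to", latest.isoformat())
```

A date that is not the 1st or the 20th raises `KeyError`, since no snapshot is described for other days.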
All that said, if you want to get a ping when results from a new dump are posted, you can add your name to the list below. If you are only interested in a particular section, include a note to that effect.
- (add your username to this list)
- Endersslay (talk) 00:51, 24 December 2023 (UTC)
- snoozebug … zzz 15:24, 28 November 2023 (UTC)
- Jake01756 (talk) (contribs) 21:31, 19 February 2023 (UTC)
- Jake The Great!📞talk! 01:40, 18 December 2019 (UTC)
- Puddleglum2.0 (talk) 20:31, 13 October 2019 (UTC)
- Schazjmd (talk) 18:25, 21 December 2018 (UTC)
- bradleyagin (talk) 04:08, 12 January 2019 (UTC)
- Darylgolden(talk) Ping when replying 00:50, 11 February 2019 (UTC)
- MarkZusab (talk) 03:52, 15 February 2019 (UTC)
- Amiodarone talk 20:52, 2 April 2019 (UTC)
- Zojomars (talk) 17:48, 31 May 2019 (UTC)
- Anarhistička Maca (talk) 06:25, 30 June 2019 (UTC)
- Clovermoss (talk) 00:46, 27 October 2019 (UTC)
- JaAlDo (talk) 14:18, 11 March 2020 (UTC)
- Creativecreatr Creativecreatr (talk) 09:56, 26 May 2020 (UTC)
- Voidify (talk) 06:12, 9 June 2020 (UTC)
- Doghouse09 (talk) 20:52, 8 September 2020 (UTC)
- -- spazure (contribs) 09:24, 2 December 2020 (UTC)
- Idell (talk) 21:26, 23 October 2020 (UTC)
- --
- Fehufangą ♮ ✉ Talk page ♮ 12:16, 28 December 2020 (UTC)
- Triethylborane (talk) 03:23, 19 May 2021 (UTC)
- littleb2009 (talk · contribs)
- Normal Name (talk) 20:28, 29 June 2021 (UTC)
- Amazomagisto (talk) 02:36, 6 July 2021 (UTC)
- TreeReader (talk) 09:17, 1 August 2021 (UTC)
- A live mussel (talk or contribs) 09:05, 14 October 2021 (UTC)
- --lettherebedarklight – 晚安 (おやすみなさい)。 04:07, 4 June 2022 (UTC)
- rbstrachan (talk) 21:11, 4 August 2022 (UTC)
- Tymewalk (talk) 08:38, 4 September 2022 (UTC)
- Max263 (talk • contribs) 12:32, 19 September 2022 (UTC)
- SikiWtideI (Speak to the backwards police) 21:44, 11 November 2022 (UTC)
- KING WIKIPEDIAN DCCLXIV (talk | contribs) 21:12, 16 December 2022 (UTC)
- Blue Edits (talk) 16:55, 30 June 2023 (UTC)
- BD2412 T 20:16, 28 November 2023 (UTC)
- Tommi1986 let's talk! 20:41, 16 March 2024 (UTC)
- Bunnypranav (talk) 14:34, 10 July 2024 (UTC)
- A37393 (talk) 20:29, 29 October 2024 (UTC)
moss code and data sources
moss is written in Python, and is available on GitHub at: https://github.com/cdbeland/moss
Data is obtained from XML database backup dumps.