Wikipedia talk:WikiProject Check Wikipedia

From Wikipedia, the free encyclopedia
Jump to: navigation, search
  Check Wikipedia   WMFLabs   List of Errors   Discussion


Adjacent references ?[edit]

Hi, what do you think of adding a detection for adjacent references, like <ref>...</ref><ref>...</ref><ref>...</ref> ? This error probably won't be of any interest for enwiki because reference numbers are put between square brackets [1][2][3]. But on frwiki reference numbers are displayed without any decoration so adjacent references may look like only one reference 123, so we're generally using a template {{,}} between references. --NicoV (Talk on frwiki) 22:14, 27 May 2014 (UTC)

NicoV, could you get me some articles with the problem as test subjects. <maniacal laugh> Test Subjects </maniacal laugh> I take it I need to look for cases of: </ref><ref> and <ref name=ack /><ref ? I also saw your message above about adding to the done pages. Bgwhite (talk) 05:29, 28 May 2014 (UTC)
Ok, will try to find some... The subject was brought on WPCleaner's talk page for this modification, but the page is fixed now. --NicoV (Talk on frwiki) 07:17, 28 May 2014 (UTC)
Bgwhite, I checked a lot of articles but I haven't found an other example yet... --NicoV (Talk on frwiki) 12:16, 28 May 2014 (UTC)
fr:Utilisateur:Zetud/Pb Ref should have a list. --NicoV
Bgwhite, fr:Leetchi, with at least 2 problems in the introduction. --NicoV (Talk on frwiki) 07:34, 2 July 2014 (UTC)

#14 false positives[edit]

Two false positives at plwiki are reported. To remove such cases, you might check only for "<source ", not "<source", and skip code which is in a <source> by itself. ToSter (talk) 19:20, 21 October 2014 (UTC)

The solution is again "<source[^a-z]". -- Magioladitis (talk) 06:13, 22 October 2014 (UTC)

Magioladitis, Error #14 doesn't use a regex. It uses the same subroutine used for checking imbalanced nowiki, pre, comment, syntaxhighlight, code, math, hiero, and score. The regex also doesn't solve the problem with the articles ToSter mentioned. The problem with the articles... there are valid, unbalanced source tags inside source tags.
Following scenario is in ToSter's articles, where the second source is not an html source tag.
<source> [text] <source> [text] </source>
Problem is... how does one differentiate between ToSter's scenario and a scenario where the first <source> tag is actually missing a closing tag, especially when editors don't always put extra parameters inside source tags. Bgwhite (talk) 07:26, 22 October 2014 (UTC)
We also have false positives on frwiki, which doesn't seem to fall into the above category:
  • fr:Apache Ant: a <sourcePath> tag is detected as being a <source> tag
  • fr:Vidéo HTML5: there are 3 self-closing <source /> tags inside a <syntaxhighlight> tag. The third one is reported.
--NicoV (Talk on frwiki) 13:41, 25 October 2014 (UTC)

Bgwhite is this fixed somehow? I haven't seen any false positives for a long time. -- Magioladitis (talk) 08:33, 24 January 2015 (UTC)

Stripping pre tags[edit]


<pre> tags are stripped only if they have no additional attributes. In pl:dmesg there's a pre block:

<pre style="height:20em; overflow-y:scroll">...</pre>

It's not getting stripped by (get_pre() function) so false positives are reported (like #56 in that case). ToSter (talk) 18:51, 12 November 2014 (UTC)

ToSter, personally, I'd remove the entire pre text. I don't see the benefit of a boot screen from a 6-year old version of Linux. Bgwhite (talk) 23:35, 12 November 2014 (UTC)
Bgwhite, that's right :) but still the problem can occur in another place. ToSter (talk) 06:13, 13 November 2014 (UTC)
ToSter, it can, but it is not. Also, this is what the whitelist is for. Bgwhite (talk) 21:54, 14 November 2014 (UTC)

Add field with user edit[edit]

Hola, disculpas por escribir en español, se podría agregar un campo mas en el cual indique el nombre de usuario o ip que realizó la edición del error detectado. gracias buen trabojo.Sergio Andres Segovia (talk) 16:59, 30 November 2014 (UTC)

"Hi, I apologize for writing in Spanish, you could add an additional field which states the user name or ip who made the edition of the detected error. thanks good work."
Seems possible but would require a lot of processing to find the particular edit. Frietjes (talk) 17:58, 30 November 2014 (UTC)
Es una pena que requiera una gran cantidad de procesamiento, porque si se agregara ese campo iríamos directamente a las contribuciones del usuario o ip, y el que tenga el flag de reversor podría revertir las edición desde allí. En Wikipedia en español intentamos detectarlo con un filtro de ediciones pero arrojó muchos falsos positivos [1], saludos. Sergio Andres Segovia (talk) 19:05, 30 November 2014 (UTC)
"It's a shame that requires a lot of processing, because if that field we would be linked directly to user contributions or ip, and an editor with rollback could reverse the issue from there. At Spanish Wikipedia, we tried to detect issues with an edit filter but it resulted in many false positives[2], greetings."
I agree that it would be useful. You might be able to get a bot to do this for you? for example, I know that some bots like 'BracketBot' will warn you when you have introduced unbalanced brackets. of course, there is a difference between warning a user about 'breaking an article' and warning a user about using deprecated syntax. maybe you can ask the operator of BracketBot (A930913)? Frietjes (talk) 17:19, 1 December 2014 (UTC)
There is also Bracketbot's brother, ReferenceBot. Both are done by A930913. The two main differences between BracketBot and CheckWiki is: 1) Bracketbot checks articles in near real-time 2) Bracketbot informs the editor of the problem they created instead of reporting the error to a master database. In theory, CheckWiki can also be run in near real-time on individual articles. I would need help from A930913. His bot code would run normally except call CheckWiki to test an article instead of using the bot's checks. Bgwhite (talk) 19:53, 1 December 2014 (UTC)
@Bgwhite: Make a (web)script that I can ping with a pageid/title/diffid/oldid/user? ##930913 connect? 930913 {{ping}} 07:33, 2 December 2014 (UTC)

New error : empty titles ?[edit]

Hi, I was thinking about a new error for detecting empty titles, like the ones VE is creating on a regular basis (== <nowiki /> ==). --NicoV (Talk on frwiki) 18:10, 13 August 2014 (UTC)

NicoV, I did a scan for enwiki and came up with 83 articles. The VE edits all appear old. I wonder if they have fixed the problem in new VE builds? Bgwhite (talk) 22:31, 22 August 2014 (UTC)
Bgwhite, apparently it's still not fixed, the last VE edit I found with this problem is from last night. --NicoV (Talk on frwiki) 09:10, 23 August 2014 (UTC)
Thanks for the list Bgwhite, I've added error #522 to detect empty titles and fixed all the occurrences. --NicoV (Talk on frwiki) 12:21, 24 August 2014 (UTC)

And also, of the same kind, a new error for empty internal links, like in this edit ([[Boom Fm|<nowiki/>]] and [[Roger Blackburn|<nowiki/>]]). --NicoV (Talk on frwiki) 10:11, 23 August 2014 (UTC)

This was fixed in Visual Editor. No new cases have been found over the past few months. Bgwhite (talk) 23:51, 28 January 2015 (UTC)
Bgwhite, not at all. A few examples just in the last 24h (nowiki tags):
And maybe another problem with things like that: [[XX|YY ]]<nowiki/>ZZ which could be easily replaced by [[XX|YY]] ZZ
--NicoV (Talk on frwiki) 13:02, 29 January 2015 (UTC)

@NicoV and Magioladitis: According to Tech News: 2015-14, the problem of nowiki in titles has been fixed. Of course, what new untold problems have arisen due to their fix has yet to be seen. Bgwhite (talk) 20:28, 30 March 2015 (UTC)

Well, when you read in the same announcement that "VisualEditor is now the main editing tool on 53 more Wikipedias", you can't take it really seriously as even on wikis where it has been enabled by default for almost 2 years, it's still far away from from being the "main editing tool"... Face-wink.svg --NicoV (Talk on frwiki) 20:56, 30 March 2015 (UTC)
NicoV When I read that sentence for the first time, thoughts of dread and pity for those 53 sites went thru my mind. I also wondered what "phase 5" meant. mw:VisualEditor/Rollouts explains what each phase means. They have enwiki as a phase 0, which is, "... wikipedias that have been closed or deprecated". Ahhh, Visual Editor... always good for a laugh and a cry. Bgwhite (talk) 21:25, 30 March 2015 (UTC)
You didn't know ? When enwiki made its push to make VE opt-in, they closed enwiki Face-wink.svg Currently, we're not editing enwiki, it's a decoy... In the rollouts, I also liked very much the sentence that wikis in phases 1 to 4 "are relatively easy for VE to support"... --NicoV (Talk on frwiki) 21:35, 30 March 2015 (UTC)
VE is supporting phases 1 thru 4 very well. It's rather obvious. From day one, VE has supported goofs, foul-ups, mistakes and barfs. Bgwhite (talk) 21:45, 30 March 2015 (UTC)
I don't know if they deployed it, but empty titles are still created like here (without nowiki tag). --NicoV (Talk on frwiki) 16:48, 1 April 2015 (UTC)
still nowiki in titles..., and frwiki is running 1.25wmf23, the version identified as fixing the problem in all bug reports... --NicoV (Talk on frwiki) 15:54, 2 April 2015 (UTC)

False positive in Error #85[edit]

In ca:Brainfuck, there are <code> tags between <center> tags. However, the tool is flagging it as if it were empty.

--Joutbis (talk) 11:20, 22 February 2015 (UTC)

Same kind of false positive for frwiki: fr:Messiah with <score> tags between <center> tags, and fr:Tiret with <code> tags between <center> tags. --NicoV (Talk on frwiki) 22:27, 22 February 2015 (UTC)
Joutbis NicoV See discussion two above this one... Anything between comment, math, nowiki, code, pre, source, hiero and score tags gets removed before checks take place.
All right, will do.--Joutbis (talk) 19:50, 28 February 2015 (UTC)
In ca:Brainfuck's case, <center> tags are not to be used like that in tables. This is a case of doing center properly. I did edit Brainfuck to do tables properly. fr:Tiret has the same problem, well actually, it is full of fail (scope="col" is redundant, <font> and <tt> are obsolete).
In the case of fr:Messiah, if it was on enwiki, I'd use the {{center}} template. That does the proper thing anyway instead of using the obsolete <center> tag. Bgwhite (talk) 06:38, 23 February 2015 (UTC)

Whitelist ( for error #34[edit]

CanI get instances such as {{#expr:{{Stat/Finland/Kommuner/Befolkning|Föglö}}/{{Stat/Finland/Kommuner/Areal land|Föglö}} round 2}} whitelisted on svwp, since this is used to automatically update population numbers in articles such as sv:Föglö. I don't know how to do it, or what to do... (tJosve05a (c) 18:57, 23 February 2015 (UTC)

Josve05a This can be handled two ways and it depends on how many article you are talking about. If it is not "alot", then add the articles to a whitelist. If there are alot, then I can added it to the code.
If you are using a whitelist, look at Wikipedia:WikiProject Check Wikipedia/Translation and see how it is done for enwiki (search for "whitelist"). #34 on enwiki does have a whitelist. Frwiki also has whitelists set up. Bgwhite (talk) 19:26, 23 February 2015 (UTC)
@Bgwhite: THis should be used a lot, at least for all populated places in Sweden with population numbers at Statistiska centralbyrån, since a bot updates those automaticle. Not all articles are using this system yet, but more and more are. (tJosve05a (c) 19:29, 23 February 2015 (UTC)
I've also seen constructions like that on frwiki, but I don't like having calculations in articles. An other solution would be to use a template to do the computation instead of putting the #expr directly in the article. --NicoV (Talk on frwiki) 19:47, 23 February 2015 (UTC)

False positive on error #37[edit]

de:Bělá (Divoká Orlice) is stated as not bearing a sort key, but in fact this key is given with the parameter SORTNAME in template de:Vorlage:Infobox Fluss. Adding a defaultsort parameter to the article itself results in a warning message that the previous sort key has been overwritten. So the sort key seems to be valid. I don’t know if there are other templates affected which are listed in de:Kategorie:Vorlage:mit Kategorisierung. --Hadibe (talk) 17:29, 28 February 2015 (UTC)

@Magioladitis: Hadibe, grrrr.... sort values shouldn't be in Infoboxes. Magioladitis is the one to ask about this. Bgwhite (talk) 05:35, 3 March 2015 (UTC)

List with empty title[edit]

Hi Bgwhite, at least on frwiki list for error #25, there was only one line, and it contains an empty title and time found 0000-00-00 00:00:00. The "Done" button does nothing. The "Set all articles as done" works, and the empty title appears now in the list of done articles. --NicoV (Talk on frwiki) 19:41, 10 March 2015 (UTC)

Same for error 59, but I left it as it is. --NicoV (Talk on frwiki) 19:43, 10 March 2015 (UTC)
Same for error 85. --NicoV (Talk on frwiki) 20:10, 10 March 2015 (UTC)

False positive for #31[edit]

Hi Bgwhite, on frwiki there are several false positives with things like <trl>, <trois>, <trk>, <transformers, <transmission, <traduction, <track>, ... Would it be possible to limit the detection ? For example, detect only <tr when followed by a space, a "/", a ">", ... but not by a letter ? --NicoV (Talk on frwiki) 09:12, 11 March 2015 (UTC)

NicoV, I'll take a look, but it will be a couple of weeks till I can get to it. Bgwhite (talk) 05:10, 16 March 2015 (UTC)

False positives for error #103[edit]

Hi, I think that the script is not doing what user NicoV requested:

it should not detect articles where {{!}} is used in the displayed text of the link.

There are a few examples in cawiki: many train stations, like ca:Estació de Bogatell or ca:Llista de cançons del DJ Hero 2 (this one took several tries to fix error #32, and now it's back!).

I'm not sure if this would cover all the false positives, but I think that, if a | is already there, it should allow several {{!}}'s.

--Joutbis (talk) 09:35, 13 March 2015 (UTC)

Joutbis Yes, those are false positives. The DJ Hero 2 article contains M|A|R|R|S, which is also in several English articles too. I've seen errors that also had | inside a wikilink. I'd say add it to the whitelist for now and I'll take a look at. I've been gone for the best part of 2 weeks, so I need to catch up on things first before diving into the code. Bgwhite (talk) 05:07, 16 March 2015 (UTC)
Bgwhite I fixed M|A|R|S using {{pipe}}. -- Magioladitis (talk) 20:08, 16 March 2015 (UTC)
Good idea, thanks! --Joutbis (talk) 19:54, 17 March 2015 (UTC)

Joutbis I created the template in Catalan Wikipedia! 10 -- Magioladitis (talk) 20:10, 17 March 2015 (UTC)

Magioladitis, thanks! However, it doesn't work 100% of the time. It's OK for the M|A|R|R|S thing, and for train stations, but not in some (brain-damaged, granted) templates, which wrap square brackets around some of the parameters. See ca:Papa Bonifaci II, at the end.--Joutbis (talk) 16:27, 11 April 2015 (UTC)

Joutbis Hm... I can't fight with that.. -- Magioladitis (talk) 16:30, 1 August 2015 (UTC)

Error n°54 false positive[edit]

Yuri (genre) is a false positive. The break is in a reference. Jerodlycett (talk) 13:36, 30 March 2015 (UTC)

The mistake in de:Hilfsfrist. Is there any way else to avoid collecting these articles in WPSK than to separate the ref group entries? --Hadibe (talk) 10:47, 28 October 2015 (UTC)

False positive for #60 ?[edit]

Hi, I don't understand why fr:Liste des commandes et des livraisons de l'Airbus A380 keeps getting reported again and again. It seems that the following part is reported: {{#tag:ref|Singapore Airlines a commandé l'A380 en trois versions différentes, dont deux sont opérées : * 01 = 471 places{{#tag:ref|{{Lien web|url=|titre= Singapore Airlines Seat Maps (V1)|éditeur=}}|name=SIA_A}}, * 02 = 411 places{{#tag:ref|{{Lien web|url=|titre= Singapore Airlines Seat Maps (V2)|éditeur=}}|name=SIA_B}}, * 03 : ''configuration non encore connue'' |group=Note|name=SIAVersions}}

But in the notice, you have #tag:ref, Singapore Airlines a commandé l'A380 en trois versions différentes, dont deux sont opérées: note the comma after tag:ref instead of the actual pipe.

There is a similar construct before (Emirates) but it's not reported. --NicoV (Talk on frwiki) 11:36, 12 April 2015 (UTC)

A few questions, and silly requests[edit]

Hi from; a few random things:

  1. Request: in the web interface the "more" link should be sortable, displaying how many errors are there ("1 more", "2 more" and so on); or at least don't display "more" if there isn't any other error; I'd love this so much :D
  2. Question: I'm testing two whitelists on; I'm supposed to wait the new dump to see those articles removed from the web interface?
  3. Request: it should be possible to whitelist a single ISBN instead of articles; in we have a parameter |ignoraisbn= inside the citation templates (doc here, it's part of the LUA module); article it:Jordan 195 has a wrong ISBN and it's not on our error lists[3]; but I don't know the details about this: is this "ignore" parameter working on every wiki due to the Lua module? Can we always use this instead of whitelists?
  4. Gadget proposal: when logged in Wikipedia, on Special:Watchlist there should be something like "Show errors in my whatchlisted articles", redirecting to our interface with a list of errors found; it should work like clicking on "more" for every article. I can do it already using the url, one article at a time. Similar gadgets can be done for a category, etc.
  5. Error 39, translation page: " Due to a Wikimedia bug</a> ": is there a missing url?
  6. Minor bug: in the interface, after clicking on "more", the "list" link is broken.
  7. Suggestion: in the translation, it'd be better to use &nbsp; for spaces inside the examples proposed, at least where spaces are the problem.

Sorry if I'm wasting your time, and thanks for maintaining this wonderful project! --Vittorioo (talk) 23:08, 23 April 2015 (UTC)

Vittorioo There are no silly requests, but I may give silly answers. :)
  1. Good question. I'll look into it.
  2. The whitelist is updated at 0z everyday. Unfortunately, itwiki is only updated twice a month. From what you've already done, it looks good. I'll check it (my) tomorrow to see if the whitelists work just fine.
  3. It's not possible. enwiki has a similar parameter to ignoraisbn. Checkwiki is not checking ISBNs inside any cite template. The Lua module already checks for bad ISBNs. On enwiki, the errors are located at Category:Pages with ISBN errors. Checkwiki is only checking ISBNs that are not inside a cite type template.
  4. I haven't a clue when it comes to Gadgets. Gadgets are written in Javascript, a language I've never dealt with.
  5. I've removed the <a> tag. There was another Mediawiki bug that prevented newlines from being used in <blockquote> and several quote templates, thus <p> had to be used there. Those were fixed and the <a> tag was related to that.
  6. Will fix.
  7. Could you give me an example?
You are not wasting my time. Any suggestions or questions are always welcome. Bgwhite (talk) 23:49, 23 April 2015 (UTC)
7) For example on error 22, the [[Category : ABC]] and the like; but it's just me splitting windows; I've put no break spaces everywere :D Thanks again, will report on --Vittorioo (talk) 00:05, 24 April 2015 (UTC)
6) Fixed. -- Magioladitis (talk) 07:35, 8 May 2015 (UTC)
7) I've added a few myself [4]; regarding 1) and 4): I've found a way to use WPCleaner to find articles with multiple errors or to scan my whatchlisted articles, it's quite the same of what I've asked, so don't waste time on them. Even that bug in 6), it's really not essential, just archive all of this. Thanks. --Vittorioo (talk) 12:43, 28 May 2015 (UTC)

Error 82 confusion and new error 104[edit]

Hi from again. I've problems with error 82 "Link to other wikiproject"; it's active in too.

  • A) Please exclude the "Wikipedia:" namespace from being detected by the script when it's checking a
  • B) Redirects to articles written like [[w:en:Article]] or [[:w:Article]] etc.:
    • from point of view they belong to error #68 "Link to other language";
    • from point of view they are internal links badly written, together with [[:en:Article]] and the like: I propose to transfer them to a new error #104;
  • C) There are redirects to articles using Meta or Mediawiki mixed syntax like [[m:en:Article]] or [[:en:mw:w:Article]] or [[meta:w:Article]] etc., and the script is not handling them correctly:
    • from point of view they belong to error #68 "Link to other language";
    • from point of view they belong to the error #104 I've proposed in point B above;
    • they belong to error #82 only when the script is checking Commons or other sister projects.

A more simple fix to the script would be renaming error 82 to something like "Links with mixed MediaWiki syntax" and heavily expand its description. But in this case you have to be sure that all the above cases and variants are checked.
Sorry for the headache and thanks again. --Vittorioo (talk) 20:02, 1 May 2015 (UTC) Edit: added "at least" in point C) + some minor fixes --Vittorioo (talk) 10:07, 2 May 2015 (UTC) PS: I've rewritten and simplified my proposal. --Vittorioo (talk) 20:49, 26 May 2015 (UTC)

Vittorioo Sorry for ignoring you. I've been sick this past week. When I do edit, I'm trying just to keep up with fixing enwiki checkwiki errors. I'll get back to answering you next week. I've got an in-law gathering this weekend... so I'll probably be really nauseated for awhile. Bgwhite (talk) 21:44, 1 May 2015 (UTC)
Magioladitis Last month, I did the 2nd fewest edits in over four years and March was the 3rd fewest. Besides being sick the past two months, I wonder what else happened..... Bgwhite (talk) 00:01, 2 May 2015 (UTC)
Real life first of course! We have hundreds of years ahead to fix wiki. :D Take care. --Vittorioo (talk) 10:07, 2 May 2015 (UTC)
Pull request with a partial fix, basically an update for the list of projects: we are missing "species" because it's written "speciesi" in the script; also missing "voy" and many others. This is the list I've proposed when the script is checking a (that is, not Commons or other projects): b: c: d: n: q: s: species: v: voy: wikt: m: mw: meta: metawiki: metawikipedia: mediawikiwiki: commons: wikibooks: wikidata: wikinews: wikiquote: wikisource: wikispecies: wiktionary: wikivoyage: wikiversity: phabricator: wikitech: toollabs: testwiki: test2wiki: testwikidata: wmf: foundation: wikimedia: wmania: incubator: outreach:. There are more, but those are less used: see Help:Interwikimedia links. I've also proposed to add zh and bn language codes instead of fl (which doesn't exist) and gv (too small wiki). --Vittorioo (talk) 20:49, 26 May 2015 (UTC)

──────────────────────────────────────────────────────────────────────────────────────────────────── Vittorioo I've updated the program with your changes. Bgwhite (talk) 18:01, 27 May 2015 (UTC)

Slightly better. I've read your commit and you are still listing "fl" language code: as I said, it doesn't exist; also still listing "meta-wiki:" and labs:: they don't exist; re-read the list above please for the updated interwikimedia links: still missing c: for Commons and d: for Wikidata, etc. Also, is it really impossible to consider "Wikipedia:" a namespace, removing it from the list of projects? As I said in the pull request, error #82 is not active in our sister projects, so it's quite safe to remove "Wikipedia:" for now. I'd leave "w:" for a future fix, since that one is more complex to handle. I really appreciate your work, keep going. --Vittorioo (talk) 19:21, 27 May 2015 (UTC) Edited for grammar. --Vittorioo (talk) 12:43, 28 May 2015 (UTC)
With the last week commit it looks much better now. Thank you! --Vittorioo (talk) 10:03, 2 July 2015 (UTC)

CHECKWIKI #69 (2015-06-28)[edit]

On sv.wp CHECKWIKI detects the following false positives:

(tJosve05a (c) 13:13, 28 June 2015 (UTC)

Josve05a can't you just use {{Not a typo}}? -- Magioladitis (talk) 13:16, 28 June 2015 (UTC)
Will that help against CHECKWIKI errors? Also this is perhaps a common issue on svwp since we do tend to use dashes to merge words together. (tJosve05a (c) 13:30, 28 June 2015 (UTC)

Josve05a was this problem fixed/handled somehow? -- Magioladitis (talk) 11:38, 10 March 2016 (UTC)

Error 4 false positive[edit]

Error 4 (HTML-tag <a>): matches <a throne dais>; it's wrong. Fix: add href= to regexp (talk) 23:17, 24 July 2015 (UTC)

Invalid color tracking[edit]

might not be feasible, but may be interesting to (1) parse an article, (2) grab any css style statements, (3) parse the background/foreground colors and compute the contrast ratio, (5) flag articles with really bad ratios. for the parsing part of the style statement, we have code in module:color contrast . for related discussion see Template talk:Episode list#Invalid color tracking category. of course, it would be pointless if there is no one interested in fixing them, but an idea for helping those of us with (partial) colour blindness. Frietjes (talk) 16:16, 1 August 2015 (UTC)

Template:Editnotices/Page/Wikipedia:WikiProject Check Wikipedia/Translation[edit]


There is a dead link to the Toolserver ([[tools:~sk/checkwiki/enwiki/enwiki_translation.txt|toolserver]]) in Template:Editnotices/Page/Wikipedia:WikiProject Check Wikipedia/Translation. Please update the link or remove it entirely if it is no longer needed. --Meno25 (talk) 14:24, 2 September 2015 (UTC)

At the same time, it should be updated to take into account the new elements that are managed by CW: whitelispage, ... --NicoV (Talk on frwiki) 14:39, 2 September 2015 (UTC)
Meno25 I know nothing about this. What is this for and how is it used? Bgwhite (talk) 17:32, 3 September 2015 (UTC)
Removed. -- Magioladitis (talk) 17:38, 3 September 2015 (UTC)
Update this too: Template:Editnotices/Page/Wikipedia:WikiProject Check Wikipedia. -- (talk) 18:27, 3 September 2015 (UTC)
I think Template:Editnotices/Page/Wikipedia:WikiProject Check Wikipedia can be deleted as it doesn't seem to match the current situation and is probably useless now. @Bgwhite and Magioladitis: What do you think? --NicoV (Talk on frwiki) 10:53, 25 February 2016 (UTC)

NicoV link was updated. Feel free to perform any further action. If Bgwhite agrees we can delete it. -- Magioladitis (talk) 23:23, 19 September 2016 (UTC)

Magioladitis, I think that:
-- NicoV (Talk on frwiki) 05:48, 20 September 2016 (UTC)

NicoV I deleted the latter. -- Magioladitis (talk) 07:29, 20 September 2016 (UTC)

Proposed error detection[edit]

I noticed some file-delinker bots (or even users) removing an image name leaves an incorrect syntax like [[File:|thumb|]]. Also, in image galleries I noticed that image title was removed, but caption remained (after pipe). It might be useful to detect these errors also. --XXN, 00:34, 20 November 2015 (UTC)

The example you provided looks like Double pipe in a link. Matěj Suchánek (talk) 14:34, 25 November 2015 (UTC)
Only in this particular example. But it can also be like: [[File:|thumb]] or [[File:|caption here]] or [[File:|some_size_px]] etc. --XXN, 13:33, 26 November 2015 (UTC)

Wrong quotes[edit]

See this edit. Don't know how wide the problem is, but maybe it's worth including in Checkwiki? --Edgars2007 (talk/contribs) 09:58, 26 November 2015 (UTC)

For example, such wikisearch insource:/(style|class|colspan|class|rowspan|align)\s?\=\s?[”“]/i gives 200+ results at enwiki. Regex of course could be improved, as there may be cases, when opening brackets are correct, but closing ones are not. --Edgars2007 (talk/contribs) 11:20, 12 December 2015 (UTC)
And if I already started... Articles are using also style="text-align:centre;", which, of course, doesn't work. --Edgars2007 (talk/contribs) 11:36, 12 December 2015 (UTC)

Category namespace[edit]

When the migration started, I asked about including some more namespaces. Detecting stuff like this [5] would be awesome. Matěj Suchánek (talk) 11:02, 12 December 2015 (UTC)

No title displayed[edit]

Today I noticed an error report which has no title displayed (it's in the first row, using search I found it should be Bisabolol in Czech Wikipedia). The timestamp there is also strange. Matěj Suchánek (talk) 12:15, 13 December 2015 (UTC)

Matěj Suchánek, I've seen that on enwiki and fixed one issue that caused most of the problems. It was related to dump files. I haven't been able to find the cause for the remaining problem. Bgwhite (talk) 05:36, 14 December 2015 (UTC)
I'm seeing this from time to time on frwiki, reported a few sections above. If you need, I can report when I see it. --NicoV (Talk on frwiki) 06:18, 14 December 2015 (UTC)
That would be good if you and Matěj could report them. It would help if the article was found via the dump or daily scan. Bgwhite (talk) 06:37, 14 December 2015 (UTC)
Existing ones on frwiki: 67, 91. I don't know for how long they are there, those errors have never been completely cleaned for some time, and it 's only possible to remove the empty title when it's the only one left. --NicoV (Talk on frwiki) 05:25, 15 December 2015 (UTC)
Bgwhite On frwiki, the problem is visible for #105 and it's probably very recent. I haven't done anything to remove it if it can help you understand where the problem comes from. --NicoV (Talk on frwiki) 17:13, 15 December 2015 (UTC)
Also on #60 and #43 but I don't know for how long. --NicoV (Talk on frwiki) 17:16, 15 December 2015 (UTC)
Bgwhite Can I remove the ones that can be removed, and then warn you if they appear again ? --NicoV (Talk on frwiki) 17:01, 18 December 2015 (UTC)
NicoV Yes, you can remove them. They are showing up in via dump processing. What's weird is they don't show in the log file. Bgwhite (talk) 22:17, 21 December 2015 (UTC)
Ok, I've removed the ones I can. I will notify you when I see some more. --NicoV (Talk on frwiki) 07:23, 22 December 2015 (UTC)

Bgwhite, I think there was a full scan yesterday on frwiki, I see empty titles for #26 (same notice as fr:Emphase (typographie)), #38 (same as #26), #45, #51 (similar than #45), #53 (similar than #45), #67 (maybe an old one). --NicoV (Talk on frwiki) 11:45, 31 December 2015 (UTC)

File URLs[edit]

For some reason people cite files from their own hard drive. At least 58 articles when searching for file://c:/Users. Can we get these flagged? — Dispenser 21:42, 13 December 2015 (UTC)

Dispenser I can't remember when, maybe ~10-12 months ago, I did a scan for this problem and |image = http:// in infoboxes. If I remember right, there were different combinations of the problem. I look into it. I need to see what the other language Wikipedia's are like. For example, do they use file or another word. Bgwhite (talk) 05:47, 14 December 2015 (UTC)
Its standardized, see file URI scheme. — Dispenser 13:03, 14 December 2015 (UTC)

ISBN with invalid syntax missed by #69[edit]

Bgwhite Apparently, #69 doesn't catch the invalid syntax in Donald Strachey, like (isbn = 1-55583-387-X). Before fixing them, I tried checkarticle.cgi and nothing was detected. --NicoV (Talk on frwiki) 16:25, 18 December 2015 (UTC)

NicoV Correct, Checkwiki doesn't detect these. The main reason is the use of isbn= inside cite and infobox templates. Bgwhite (talk) 22:15, 21 December 2015 (UTC)
Bgwhite Could it be modified so that they are reported when they are not inside a template ? --NicoV (Talk on frwiki) 07:23, 22 December 2015 (UTC)

False positive for #60 ?[edit]

Hi, fr:Aldébaran keeps being reported for #60 with the notice Palette VizieR, V*. The template VizieR does have a "V*" parameter, but it seems to be detected as an error. Same for fr:Wolf 1061. --NicoV (Talk on frwiki) 17:09, 2 January 2016 (UTC)

NicoV You are correct. Atleast on enwiki, one can't have a parameter with * in its name. Probably true for dewiki as this was originally added by Stefan. Bgwhite (talk) 22:39, 5 January 2016 (UTC)

Error 104 unbalanced quotes with special characters and curly quotes[edit]

In regard of the display problem and the ref names rules, I've created a test page for error 104 (NicoV: WPCleaner wants to put the quote close to the slash in line 11). I've also searched for the opening and closing curly quotes, and some are mixed up with the regular ones. I think that Check Wiki should warn the user to search carefully every occurrence of a ref name found by error 104. If a ref name is "LuisBuñuel-59" the user needs to search at least "LuisBu" in order to find all of them. I hope it's clear enough. --CX42 (talk) 07:34, 10 January 2016 (UTC)

I once looked through Anomie's list of fixes and collected something (that I understand) for Latvian Wikipedia scanning. Yes, some are bracket-unrelated, but most of them are (in section "Kļūdainās atsauces (# Other issues)"). --Edgars2007 (talk/contribs) 08:20, 10 January 2016 (UTC)

False positive for #46[edit]

Hi Bgwhite, someone reported on frwiki that lately there has been false positives for #46 when image legend contains a link. Today examples:

  • fr:AN/APG-76, radar Norden [[AN/APG-76#AN/APQ-148|AN/APQ-148]]: seems fine in the article [[File:AN-APQ-148 Radar, Norden, 1972 - National Electronics Museum - DSC00068.JPG|thumb|280px|Un radar Norden [[AN/APG-76#AN/APQ-148|AN/APQ-148]].]]
  • fr:Bétail, of sheep.jpg|thumb|Troupeau de [[mouton]]: seems fine in the article [[Fichier:Flock of sheep.jpg|thumb|Troupeau de [[mouton]]s]]

--NicoV (Talk on frwiki) 08:56, 9 February 2016 (UTC)

NicoV It's not a false positive, but it is giving the wrong location for the error. Both articles had the broken bracket fixed on the 9th. Bgwhite (talk) 22:58, 11 February 2016 (UTC)
Bgwhite Yes, but sometimes it seems to be the opposite error that should be reported : currently, fr:Gandhara is reported in #46 with the notice Gandhara Guimet 181171.jpg|thumb|[[Bodhisattva]] while the actual problem is a #10 for [[shivaïsme]. That was also the case for fr:AN/APG-76. --NicoV (Talk on frwiki) 07:44, 12 February 2016 (UTC)

False positives for #3[edit]

@Bgwhite: On frwiki, there are some false positives for #3 due to:

  • a whitespace in the <references>...</references> tag, like here or here
  • a carriage return in the template Références like here

Could this be prevented from being detected ? --NicoV (Talk on frwiki) 20:35, 9 March 2016 (UTC)

  • Whitespace: The regex is <references[ ]?\/?>. Change it to <references(\s*\/)?>?
  • Carriage return: I only slap {{ onto the front of the regex. You'll need to add a carriage return to your regex.
Bgwhite (talk) 21:33, 9 March 2016 (UTC)
@Bgwhite: For the whitespace, yes maybe. For the carriage return, I didn't know it was also a regex for #3, I thought it was only for #78: are you sure? --NicoV (Talk on frwiki) 08:40, 10 March 2016 (UTC)
NicoV Never mind. I was thinking 78. It's amazing I think at all. Will look more tomorrow. Bgwhite (talk) 08:45, 10 March 2016 (UTC)

Id 85 bug[edit]

Hello. Id 85 returns false positive on empty tags (as in "<center> </center>") if there is a code inside: "<center> <syntaxhighlight ... </syntaxhighlight> </center>" IKhitron (talk) 12:19, 13 April 2016 (UTC)

IKhitron The first thing CheckWiki does is to remove various tags and their content, ie <syntaxhighlight>, <nowiki>, <pre>... These tags often have bad wikicode or wikicode symbols that aren't wikicode. There's nothing that can be done with the false-positive blank center tags. However, as <center> is obsolete HTML, it's best to replace the tag. Bgwhite (talk) 19:47, 18 April 2016 (UTC)
Thank you, Bgwhite. But this is a special case id, it checks empty text. Can't you replace the tags with something neutral, as "qwerty" string, in place of removing, to work property? IKhitron (talk) 19:57, 18 April 2016 (UTC)
Well, Bgwhite, I rephrased the template, and the new run did not catch it. But I still do not know, what was the problem. IKhitron (talk) 18:40, 26 July 2016 (UTC)
IKhitron Wrong discussion. Do you mean #3 down below? Bgwhite (talk) 23:37, 26 July 2016 (UTC)
Sorry. Bgwhite. It's ##60 possible false positive

Self-closing div and span tags to be deprecated[edit]

The latest Tech News (dated today) has this notice:

Future changes
  • Using self-closing tags like <div/> and <span/> to mean <div></div> and <span></span> will not work in the future. Templates and pages that use these tags should be fixed. When Phabricator ticket T134423 is fixed these tags will parse as <div> and <span> instead. This is normal in HTML5. [6]

Should a check for these tags be added to Checkwiki? – Jonesey95 (talk) 21:13, 16 May 2016 (UTC)

Jonesey95 I've already run a list for them. There's a total of 72 in articles. There are <span /> tags in template space and I left a message on Frietjes' talk page about these. I'd rather not touch templates. I'll be adding this to error #2. Bgwhite (talk) 21:22, 16 May 2016 (UTC)
Thanks. I don't mind editing templates, even if it means the occasional run-in with editors who either can't read or refuse to read and then blame me for their shortcomings. I know that you know what that's like. I'll head over to F's talk page for the list. – Jonesey95 (talk) 21:34, 16 May 2016 (UTC)
Sadly, I've turned off the second error, because there's no consensus with <br clear="all" /> -> template replacement in ruwiki. Error #2 becomes more and more sophisticated, maybe it's time to divide it to the several errors? Or could you, please, disable founding br tags with "clear" attribute in ruwiki? If it's not very difficult. Facenapalm (talk) 07:57, 18 May 2016 (UTC)

It appears that the check for error #2 is not catching some cases of errors that cause pages to be placed in the new Category:Pages using invalid self-closed HTML tags. Examples:

Is error #2 supposed to find these? Can it be modified to do so? – Jonesey95 (talk) 01:45, 17 July 2016 (UTC)

Jonesey95 @NicoV: That's a lot of articles in that category. One of the articles I looked at should be caught, but isn't.
  1. I'm currently only catching cases that don't have other attributes, such as id=.
  2. I'm not looking for any cases of some others, such as <p>.
I'll work on adding them. I'm behind on coding things up due to trying to fix articles on the daily CheckWiki scans. Bgwhite (talk) 05:03, 18 July 2016 (UTC)
The category is new, and it is filling slowly as the job queue runs through the whole population of pages. Some gnomes have been busy cleaning out the category, including fixing templates that have zillions of transclusions, but the category population has stayed relatively constant at a few thousand as new pages are null-edited by the job queue. At this writing, it seems likely that there are 5,000 to 10,000 individual pages left with these errors, not including pages transcluding pages that have errors in them.
In addition to the above, I have seen <small/>, <center/>, <p "with text" />, and maybe one or two others, as well as all of those tags with both leading and closing slashes in the same tag. – Jonesey95 (talk) 05:47, 18 July 2016 (UTC)

@Bgwhite and Jonesey95: I've started updating WPCleaner to handle some of the tags that trigger the categorization. It's not finished, but you can help me by listing cases I'm currently missing (not a lot of free time to analyze what's missing). --NicoV (Talk on frwiki) 17:10, 18 July 2016 (UTC)

Is there a list somewhere? In addition to the above tags, I have seen <big/>, <s/>, <del/>, <tr/>, <td/>. – Jonesey95 (talk) 17:15, 18 July 2016 (UTC)
@Jonesey95: List available in the code. --NicoV (Talk on frwiki) 22:14, 18 July 2016 (UTC)
I see a list of tags, but interpreting the code is beyond me. It looks like del, td, and tr are missing. Will it find tags formatted like </blockquote/>, with a leading and trailing slash? There are a surprising number of those. – Jonesey95 (talk) 22:51, 18 July 2016 (UTC)
The link was just for the list of tags, not to analyze the code ;-) The code will find both regular self-closing tags and also incorrect tags with a leading and trailing slash. I've added del, td and tr. If you see other cases, tell me. --NicoV (Talk on frwiki) 06:14, 19 July 2016 (UTC)
I just found and fixed <code/> on one page. There may be more pages with this tag. – Jonesey95 (talk) 17:13, 19 July 2016 (UTC)

@Bgwhite and Jonesey95: If you're interested, I ran a dump analysis yesterday, the result for #2 is at Wikipedia:CHECKWIKI/WPC 002 dump. --NicoV (Talk on frwiki) 08:29, 21 July 2016 (UTC)

Excellent. It looks like there might be a couple of false positives on that list, but they are not worth worrying about until the hundreds of real errors are fixed. Good work. – Jonesey95 (talk) 12:58, 21 July 2016 (UTC)
This one doesn't look like a tag syntax error to me. As far as I know, any amount of white space is valid within a tag:
Does WP have its own rules about tags like this? – Jonesey95 (talk) 14:33, 21 July 2016 (UTC)
I don't know if I should keep detecting this or not : for the moment, carriage return are considered as invalid characters in a tag in WPC. --NicoV (Talk on frwiki) 16:29, 21 July 2016 (UTC)

Here are a few more tags to add to the check: <sup/>, <em/>, <i/>, <th/>, and <rb/> (typo for "br")Jonesey95 (talk) 21:28, 25 July 2016 (UTC)

@Magioladitis, Jonesey95, and NicoV: In theory, tomorrow CheckWiki will start to catch the br tags in NicoV's report and all the self-closing tags. It's also catching br tags with carriage returns. Bgwhite (talk) 00:32, 28 July 2016 (UTC)

Do I look at Wikipedia:CHECKWIKI/WPC 002 dump or somewhere else for the updated list? I fixed a few hundred errors on that page and am looking forward to a refresh of it. I was unable to persuade my computer to run the Java command at the top of the page, so I was unable to refresh it myself. – Jonesey95 (talk) 03:38, 28 July 2016 (UTC)
Jonesey95 In theory, August's dump will come out in a week or so. Might want to wait till then to see all the new and wonderful errors. I reran Nico's list via Checkwiki. The only errors listed were ones with the <br> tag... assuming I coded it right. Not sure if you or Nico have access to WMFLabs. Java and the dump files are available there. Bgwhite (talk) 04:55, 28 July 2016 (UTC)
Jonesey95 I have been trying to rerun the dump analysis for the last 2 days, but I'm only spending an hour or so home once a day (it failed the first time due to an out of memory error, and I don't know what's the status of the second run...). If you want to try it by yourself, the command on fr:Projet:Correction syntaxique/Analyse 002 is probably more explicit than the one displayed on enwiki... I won't be able to handle the August dump analysis, at least not until the 15th.
Bgwhite I think WMLabs severely limits the amount of memory a process can have, so it's probably a no go for WPC for the dump analysis. --NicoV (Talk on frwiki) 08:47, 28 July 2016 (UTC)
NicoV, No, they don't severely limit the amount of memory. One does have to specify the max amount of memory one needs. The default is 256MB. I've gone upto 3GB. Bgwhite (talk) 20:53, 28 July 2016 (UTC)
Jonesey95 I updated the description of the command line to run the dump analysis for enwiki. --NicoV (Talk on frwiki) 12:30, 28 July 2016 (UTC)


@Bgwhite: Could you help set up CHECKWIKI for Commons, so that it will list errors there, which doesn't seem to be working? (tJosve05a (c) 07:42, 24 May 2016 (UTC)

Error #2 and AfD syntax[edit]

The fix for this error seems to break AfD tag syntax - it moves the {{Article for deletion}} template outside of the comments it comes with when it replaces/alphabetises the tags at the top of the page. This in turn breaks the commonly used AfD closing script User talk:Mr.Z-man/closeAFD.

For an example see: [7].

Pinging Bgwhite and Magioladitis, who run BG19bot and Yobot, which run these fixes. ansh666 05:08, 26 May 2016 (UTC)

Ansh666 This is actually an AWB question. That is the tool causing all the fun. CheckWiki only finds problems, not fixes them. I swear this was brought up recently but I can't find it at any of Magioladitis' talk pages. He'll have an answer. Bgwhite (talk) 05:29, 26 May 2016 (UTC)
Ansh666 Magioladitis is on his phone and not at his laptop. He asked me to type up the issue in phabricator. Bgwhite (talk) 06:43, 26 May 2016 (UTC)

I 'll be back at my laptop in 4 days. Thanks for reporting it to Phabricator. -- Magioladitis (talk) 16:24, 26 May 2016 (UTC)

Alright, thanks guys. ansh666 17:19, 26 May 2016 (UTC)

#60 possible false positive[edit]

Hello, Bgwhite. Could you, please, explain me what is the problem in the first article in [8]? It's there a couple of months. I can't understand the second one two, but it's less important because it's new. Thank you in advance, IKhitron (talk) 18:46, 9 June 2016 (UTC)

IKhitron Most of the time, #60 will give an error if there are unbalanced brackets. It will also give an error if there is a bad parameter in a template. |date*= or |date:= would be errors as * or : can't be part of a parameter's name. I can't tell if that is an issue in the Eviatar Nevo article because of your wacky right-left, funny character language. :) Bgwhite (talk) 22:02, 9 June 2016 (UTC)
OK, Bgwhite, here is a kind of translation.
The text in the article:
The text in Notice column:
And this is the same every month. I have no idea where checkwiki finds this comma, and why does it loose the pipe. Thank you, IKhitron (talk) 22:30, 9 June 2016 (UTC)
IKhitron I only looked at the first article. I'll look at the second one too. Bgwhite (talk) 04:55, 10 June 2016 (UTC)
But this is in the first one, Bgwhite. IKhitron (talk) 08:51, 10 June 2016 (UTC)
@Bgwhite: IKhitron (talk) 12:09, 14 June 2016 (UTC)

"Tags without content" screws up a format hack[edit]

I wanted to link a name that I inserted in brackets because it was simply "she" in the original quote, i.e. [[[Tammy Baldwin]]]. That displays without any formatting, so I put a blank span in the middle. Your bot just took it out. [9]

What worries me is that I've done this a LOT over time - not always for this reason, but it's amazing how often Wiki syntax fouls up some text with a single quote mark or some other feature for which this has been a workaround.

PLEASE stop removing empty tags and review the bot's edits. Wnt (talk) 10:44, 26 June 2016 (UTC)

It's logical to use nowiki instead of span. It's really non-obvious to understand what your span means. I think you should form your code as something like that: <nowiki>[</nowiki>[[Tammy Baldwin]]<nowiki>]</nowiki>. In ruwiki, we usually use self-closing nowiki tags for such purposes as this one, but seems like they're going to be deprecated. :( Facenapalm (talk) 11:30, 26 June 2016 (UTC)
UPD: "but seems like they're going to be deprecated" - hm, seems like not. Then I would write this: [<nowiki />[[Tammy Baldwin]]<nowiki />]. But template is even better, yes. Facenapalm (talk) 11:42, 26 June 2016 (UTC)

Wnt I used Bracket and fixed it for you. -- Magioladitis (talk) 11:37, 26 June 2016 (UTC)

@Facenapalm and Magioladitis: Sometimes I've used nowiki tags, but I didn't care that much one way or the other and I wasn't sure the bot wouldn't come after those. The Bracket template adds &#91; to the text (NOTE: I just tried that with nowiki and it didn't work! It just displays [! And [ html comments also do not work for this sequence!) - I'd actually prefer to do that than to add the confusion of a template which you don't know what it is. I think an HTML comment would work also.
But none of this really matters. My concern isn't trying to write this one sentence - my concern is that the bot is out there churning away, screwing up format kludges (good or bad) that will be very confusing for editors who don't know Wiki/HTML to figure out. It's the changes you don't know about that you need to be concerned about. Some of this stuff could be buried deep in tables and other arcane syntax. If the bot is going to take out empty spans, it should replace them with whatever you would tolerate like nowiki or HTML comments or whatever so that the text displays the same way. Wnt (talk) 11:48, 26 June 2016 (UTC)
IMHO, using empty span is dirty hack to trick the parser. I'm not sure I'll understand what it means even if I'll edit code manually. So it's ok that bot broke this rare case. Usually empty spans are just empty spans, and they shouldn't be replaced by something like <nowiki />. Facenapalm (talk) 11:57, 26 June 2016 (UTC)

This is the reason that the templates were created. It makes wikicode cleaerer and no hacks are needed. -- Magioladitis (talk) 12:14, 26 June 2016 (UTC)

So what is the template for writing &#91; without it coming out as a bracket? How do I look it up? (Or them up ... I have a feeling there are probably dozens, each used by one or two editors and unknown to the rest of us) Wnt (talk) 14:16, 26 June 2016 (UTC)
A few things...
Facenapalm and others... Self-closing HTML tags are being depreciated because they aren't in the HTML5 spec and they are removing them from the Mediawiki parser. <nowiki /> is not HTML, so it is not being depreciated. <br /> is still in HTML5, but is not mentioned in 5.1 that I could find. They are so common, who knows when it will die.
Wnt <span></span> is bad HTML and should never be used, period.
That leaves three options:
  1. <nowiki /> option that Facenapalm mentioned.
  2. {{bracket}}/{{brackets}} templates
  3. &#91; route.
Of the three options, #3 is probably the worst for editors. Not many people know what that means, but it is in common use. Templates are nice because people can look up the doc page for them. I personally use nowiki tags and it is the most common in use. Use whatever option you want. Bgwhite (talk) 05:16, 27 June 2016 (UTC)
The span element is still valid in HTML5.1, but "doesn't mean anything on its own" (cit.), and it's generally "used to color a part of a text" (cit.); so I agree that using the nowiki tag or the brackets templates in the above problem is preferable. Regarding the br element, nothing changed between HTML5 and 5.1, except that "Content model" has been renamed "Nothing" instead of "Empty". The only correct way to write it is <br>. The fact that the old XHTML <br /> is still in use is because Tidy is outdated; fortunately they are working on it (they mention the Sanitizer in the comments). -- (talk) 14:31, 27 June 2016 (UTC) PS: I've run a little test and the W3C Validator doesn't see an empty span as an error; also, it has been used as a hack for some other reason (Fahrner Image Replacement#Implementations); so, it's just ugly, but harmless. -- (talk) 15:43, 27 June 2016 (UTC)

False positive on list number 91 and 90[edit]

At list 91 it shows some articles which doesn't have any Interwiki link written as an external link. in my pinion it should check ifعنوان is inside {{}} so it isn't internal linkYamaha5 (talk) 20:22, 28 June 2016 (UTC)

Also list 90 has the same False positive bugYamaha5 (talk) 20:24, 28 June 2016 (UTC)

Yamaha5 I'm confused. You said "Interwiki link" and #90 and #91 doesn't check for interwiki links. #90 and #91 should be checking for external links and references. I haven't seen a problem with enwiki on the ~10,000 articles I've fixed except for when a reference has a Wikipedia link and the article is about something on Wikipedia. Could you give a better example. Bgwhite (talk) 20:36, 28 June 2016 (UTC)
please check list 91 it's title is Interwiki link written as an external link.
for example: at the first row page has {{یادکرد-ویکی|پیوند =|عنوان = .bm|زبان = انگلیسی|بازیابی =30 آپریل ۲۰12}} which is in english equal to {{citation|link=|title= .bm|lang= en|date =30 April 2012}}.
Now checkwiki assumes|عنوان as Interwiki link written as an external link which is not correct it is inside a template and it doesn't have any [[]]. please check articles that are listed on list 91 and list 90 you see the False positive bugYamaha5 (talk) 00:04, 29 June 2016 (UTC)
Yamaha5 If I'm understanding correctly.... This is still an error. One can't use Wikipedia as a reference. 90/91 checks references that used Wikipedia as a source. The 90/91 definition didn't reflect that and that is my fault. I've updated the English translation file. Bgwhite (talk) 00:30, 29 June 2016 (UTC)
Thanks. would you please update fawiki's data? I cleaned more than 30k pages and I wait for new updateYamaha5 (talk) 00:37, 29 June 2016 (UTC)
Yamaha5 The new dumps start up on the 1st. If all goes well, checkwiki will update fawiki around the 4th. Bgwhite (talk) 18:26, 29 June 2016 (UTC)

feature list request[edit]

On fa.wikipedia we have a page and cleaning bot which lists and do some cleaning task, I will list some of useful Items for your tool:

  1. Category pages which have {{Category redirect}} and interwiki (local or wikidata)
  2. Categories which are like article (huge size) for example page_len>1000. some newbies add article text to category page.
  3. Redirect pages which have interwiki
  4. Pages which have old_interwiki (not wikidata)
  5. Pages which have duplicated coordination
  6. Redirect pages which their talk page is redirected to other page query
  7. Redirect pages which their talk page is not redirect query
  8. Redirect talk pages which the main page is not redirect query
  9. Redirect pages with (disambiguation) and linked to not disambiguation pages query
  10. Similar pages with different hidden characters query
Cleaning content
  1. Pages which have : after == (for example == foo ==\n:the text)
  2. Pages which have more <br/> after each other (for example foo<br/><br/><br/><br/><br/>bar)
  3. Page which have [•●⚫⬤] instead of * (for example • foo \n• bar)
  4. Pages which their lines started with numbers instead of #
  5. Page which have non-standard title for source or external links subsection (for example == our sources == or == the sources == ,...)
  6. Pages linked to (wiki(pedia|media|data|source|news|oyage|quote)|wiktionary)\.org without using their template
  7. Pages/articles which have more ['math', 'code', 'nowiki', 'pre', 'source', 's', 'su[bp]', 'noinclude', 'includeonly', 'big', 'small','gallery'] after each other
  8. Pages which have [\u0085\u00A0\u1680\u180E\u2000-\u200A\u2028\u2029\u202F\u205F\u3000] characters instead of normal space
  9. Pages which have LRM، RLM characters like (\u202A|\u202B|\u202C|\u202D|\u202E|\u200F)
  10. Pages which have ... instead of …
  11. Pages which have ---- for horizental line
  12. Pages which have space between == (for example = =)
  13. Pages which have more than 5 = in their subsection (for example ========= foo ===========)
  14. Pages which have more empty lines in their content (for example \n\n\n\n\n\n\n or \n\n \n\n)
  15. Pages which have tab \t at their first lines (for example \n\t)
Yamaha5 (talk) 01:25, 29 June 2016 (UTC)
I believe that many of these features can be handled by queries or some PetScan lists. IMO CW should be aimed on things which are not accessible from database, such as wikitext or HTML markup errors. Matěj Suchánek (talk) 18:41, 29 June 2016 (UTC)
The Cleaning content part shouldn't be possible by query. the database text's table is closed so it is not possible to get them by queryYamaha5 (talk) 20:25, 29 June 2016 (UTC)
Yamaha5 Egads. I hate to have been your mom. Yamaha, what do you want for dinner. Mom, I'll have chicken, steak, carrots, peas, mashed potatoes, cauliflower, spaghetti ...
  • A quick look... some can't be implemented, for example interwikilinks and ---- are valid.
  • For #8 and #9 on the cleanup list, on enwiki the following are being checked: \x{007F}, \x{200B}, \x{2028}, \x{202A}, \x{202C}, \x{202D}, \x{202E}, \x{00A0}, \x{00AD}, \x{202B}, \x{200F}, \x{2004}, \x{2005}, \x{2006}, \x{2007}, \x{2008}
  • To implement this is easy. Are there any on the enwiki list you don't want? Can you and Magioladitis (he is the expert, not me) look at the rest and see if they are ok to be added. I can't remember exactly but I think \x{202B} and \x{202F} caused problems if they were removed on enwiki.
Bgwhite (talk) 22:18, 29 June 2016 (UTC)

4 will be a disaster ad I good proved why. -- Magioladitis (talk) 22:23, 29 June 2016 (UTC)

User:Bgwhite :))) for characters we can omite LRM، RLM and ZWNJ they uses in foreign languages
User:Magioladitis: 4 you mean #4 ? Yamaha5 (talk) 22:39, 29 June 2016 (UTC)
Yamaha5 yes, I mean #4. -- Magioladitis (talk) 22:42, 29 June 2016 (UTC)
Yamaha5 I've gotten these mixed up in the past. Do you want me to add enwiki's list for fawiki? Bgwhite (talk) 00:20, 30 June 2016 (UTC)
Is it different list for projects? I thought lists for all projects are the same.In fawiki the query part we have active bot for them but the content part which is related to checkwiki we don't have active bot. is it possible to add them to whole project for all languages? if you want I can help you for adding them.
If we have these lists at checkwiki we can clean them regularly. Yamaha5 (talk) 05:44, 30 June 2016 (UTC)
Yamaha5 We are concerned that some of Unicode characters were needed in other languages, especially in right-to-left ones. I'd rather take this one slow and push any new Unicode characters to those projects that what them. For example, two of the LRM، RLM characters on your wanted list does cause problems on enwiki if they were removed. I get confused on what acronyms belong to which Unicode character... I've got dyslexia. I can read ok, it's processing in the head and also writing that causes me problems, LRM and RLM gets jumbled for example. So, what Unicode characters I listed above do you want or not want? These can be easily added for the next run, then we can test the others you mentioned in fawiki and enwiki for August's run. Bgwhite (talk) 05:14, 1 July 2016 (UTC)
Bgwhite what Unicode characters I listed above do you want or not want? if you mean for Now we have cleaning tool which convert #8 to space and #9 to \u200c and do conversion for #10 we tested and It was fine. If you mean which characters may cause problem for other languages like English in my opinion we should get list and check one by one by the local users and they can tell us which one should remove for them. so for fawiki we need #8, #9, #10 as I mentioned above for other languages we can remove as they want.Yamaha5 (talk) 07:48, 1 July 2016 (UTC)
#8:I removed the duplicated characters in mine and your list so there is characters should add to the checkwiki for all languages.  : U+0020, U+2000, U+2001, U+2002, U+2003, U+2009, U+200A, U+007F, U+200B, U+2028, U+202A, U+202C, U+202D, U+202E, U+00A0, U+00AD, U+202B, U+200F, --convert to--> space
#9: for fa.wikipedia we need to list all mentioned in #9 for other languages I don't know.
#10:for we need it.
At end please take a look on this. we can add them to checkwiki (new request :) ).Yamaha5 (talk) 08:34, 1 July 2016 (UTC)
Yamaha5 I've added fawiki to the same ones enwiki currently find. AWB can convert or remove these via the find and replace. For example, add "\u200E|\u200F|\uFEFF|\u200B|\u2028|\u202A|\u202C|\u202D|\u202E|\u00AD" to the find column and a space in the replace column. Bgwhite (talk) 01:04, 3 July 2016 (UTC)

#90 and #91 for[edit]

Is it possible to deactivate #90 and #91 for (the part which shows error for using other wiki as reference) because of lack of reliable online farsi sources At fa.wikipedia we have a consensus to use en.wikipedia and other big wikis as source for minor articles so most of #90 and #91's reprort for shouldn't solve.Yamaha5 (talk) 19:45, 3 July 2016 (UTC)

Yamaha5 Unfortunately, no. Keeping #90 on should be fine, but you will have to turn #91 off. Bgwhite (talk) 06:45, 4 July 2016 (UTC)
How can i turn of #91. can we control the lists? or you mean we should solve the articles on[ [User:Yamaha5|Yamaha5]] (talk) 08:02, 4 July 2016 (UTC)
Yamaha5 You can turn off #91. You've edited the list before. I generally leave the lists to be maintained by whoever wants to. You know Farsi, I don't, so edit it to your heart's content.— Preceding unsigned comment added by Bgwhite (talkcontribs)
I believe the stat page dosen't use that page becuase as you see we translated many of the labels but at the here we can't see them. for example top_priority_script was translated at fa:ویکی‌پدیا:ویکی‌پروژه_تصحیح_ویکی‌پدیا/ترجمه but still the fawiki_checkwiki page shows high priority also how can I disable #91? show me on english page (the line which should i remove)(I found it) Yamaha5 (talk) 08:43, 4 July 2016 (UTC)
Yamaha5 I think no _script variables are taken into account, you should use _fawiki variables. --NicoV (Talk on frwiki) 16:59, 8 July 2016 (UTC)
NicoV thanks.Yamaha5 (talk) 19:34, 8 July 2016 (UTC)
I added two patchs here please merge them to use fawiki's translation and have better supportYamaha5 (talk) 12:43, 4 July 2016 (UTC)


Would you please add {{Break}} and these redirects to list #54?Yamaha5 (talk) 08:48, 7 July 2016 (UTC)

False positives for #105[edit]

Hi Bgwhite, CW reports 2 false positives for #105 on frwiki, fr:Tournoi des candidats de Zurich 1953 and fr:Championnat du monde d'échecs 1963, both for the same reason, a table cell filled with several equal signs. Could you ignore those cases as I did with WPC : if the line starts with a pipe, then do not report it as an error as it is most probably a table cell. --NicoV (Talk on frwiki) 15:31, 7 July 2016 (UTC)


Bgwhite, maybe you would consider changing regex a little bit for #3? At lvwiki, equivalent for {{reflist}} is {{atsauces}}, for {{unreferenced}} - {{atsauces+}}. So if article has maintaince tag {{atsauces+}} and <ref>s, the article will be skipped in Checkwiki (as script checks only for {{atsauces). --Edgars2007 (talk/contribs) 15:29, 8 July 2016 (UTC)

Edgars2007 What's happening is in the translation file, under #03, you have atsauces listed. In the code, I take that and look for cases of "{{atsauces". There's nothing at the end of {{atsauces because there maybe parameters in the template. The code looking for "{{atsauces" is a regex. So, in theory, adding atsauces[^\+] to the template file should solve the problem. Could you add it to the template file and give me some articles/sandbox to check for a valid #3 error and some with atsauces+. I'll test and make sure it works. Bgwhite (talk) 21:51, 22 July 2016 (UTC)
I assume, they aren't case sensitive, right? Do you want me to give examples, which fits to your requirements now or 2016-07-01 (dump day)? --Edgars2007 (talk/contribs) 21:59, 22 July 2016 (UTC)
Edgars2007 Yes, they are not case sensitive. If you have any examples it would be good. I'd like to make sure everything is ok before the dump. I looked at today's (25th) results. How in the world did lv:Count Your Blessings get tagged as a #3? Ran the code manually and it didn't see anything wrong. Bgwhite (talk) 22:20, 26 July 2016 (UTC)
Bgwhite for example lv:Dalībnieks:Edgars2007/Reflist/1 should get tagged as #3, and this shouldn't. If you need some other examples to work with, you can simply create another sandbox page at lvwiki at my userspace. 2016-07-20 version of Count Your Blessings didn't have references template. --Edgars2007 (talk/contribs) 06:09, 27 July 2016 (UTC)
Edgars2007 Things look good on the test articles. Should be good to go. Bgwhite (talk) 19:48, 27 July 2016 (UTC)

New false positives for #22[edit]

Hi Bgwhite, new false positives are appearing on frwiki when the category name itself contains a colon with whitespace characters around it, like [[Catégorie:Acteur de Lost : Les Disparus]] in fr:Terry O'Quinn. --NicoV (Talk on frwiki) 19:21, 28 July 2016 (UTC)

NicoV Should be fixed for the run that starts in an hour. enwiki doesn't have two colons in a cat. No good #*$(@ nothing &(*! French. Problem was caused by the update that catches the #22s WPC found. Bgwhite (talk) 23:11, 28 July 2016 (UTC)
Bgwhite Most of them are fixed, except fr:Lost : Les Disparus where [[Catégorie:Lost : Les Disparus|Lost : Les Disparus]] is still detected by CW. --NicoV (Talk on frwiki) 19:57, 17 August 2016 (UTC)

#6 and #37 mostly obsolete.[edit]

@NicoV, Magioladitis, Yamaha5, Josve05a, Edgars2007, and Facenapalm: MediaWiki is moving to a new collation scheme called Unicode collation algorithm (UCA). Letters with diacritics will be sorted the same as with the non-diacritic version. I still don't know the timetable, but I did find the phab ticket (T136150) on moving enwiki to UCA. They have already moved several other wikis to UCA, including Russian, French, Latvian, Farsi and Swedish wikis. The listing of wikis can be found here; I'm thinking, #6 and #37 will only check for punctuation at some point for all wikis. I'll work on getting the wikis already on UCA to only check punctuation. Bgwhite (talk) 02:14, 29 July 2016 (UTC)

@Bgwhite: keep in your mined we have T139110 bug. is it makes problom for #6 and #37? Yamaha5 (talk) 03:49, 29 July 2016 (UTC)
lvwiki has disabled those ones, so I'm fine. --Edgars2007 (talk/contribs) 06:44, 29 July 2016 (UTC)
Same on ruwiki. In ruwiki, the only allowed letter with diacritic in titles is ё, but it's sorted correctly. Facenapalm (talk) 10:29, 29 July 2016 (UTC)

Reference localization[edit]

Hello. Is there a possibility to recognize a template as footnote? Thank you. IKhitron (talk) 15:35, 29 July 2016 (UTC)

  • You're talking about this?
   Список примечаний
   Reflist+ END
# ...
   (Примечания|Список примечаний|Reflist\+?)(?![^}]*group) END
Facenapalm (talk) 16:16, 29 July 2016 (UTC)
Not at all, Facenapalm, thank you, I'm talking about a footnote (ref), bot references. IKhitron (talk) 16:28, 29 July 2016 (UTC)
Facenapalm I'm also unclear what you are asking. Remember, I'm slow. Could you put what your asking in different words?
Is there any possibility that you wanted to ask me this question, Bgwhite? IKhitron (talk) 23:59, 29 July 2016 (UTC)
IKhitron Yes. Like I said, I'm slow. Bgwhite (talk) 00:39, 30 July 2016 (UTC)
Well, Bgwhite, when you want to add a footnote you use <ref name=somename...>some text</ref>. I can't do this in rtl, so I use {{reftemplate|name=somename|...|some text}}, which is transcluded to the previous form. I asked if there is a possibility to add local name of footnote template, that will be recognized as ref tag. IKhitron (talk) 00:47, 30 July 2016 (UTC)
@Bgwhite: IKhitron (talk) 10:33, 9 August 2016 (UTC)
@Bgwhite:? IKhitron (talk) 20:34, 30 August 2016 (UTC)
IKhitron Ok, I've got some time this week. I'm still not understanding. Which error is this for? What would be an error case? Bgwhite (talk) 07:19, 31 August 2016 (UTC)
Thank you, Bgwhite. There are some, especially 78 and 81, but also 61 and 67. For 78 if the article doesn't have no ref and no references, but have ref template, it's not recognized. For 81 it will be splendid if the case when ref text and template text are the same, for example, will be recognizable. IKhitron (talk) 11:34, 31 August 2016 (UTC)
IKhitron Ok, I'm understanding. For #61 and #78, you can add reftemplate to your translation file. For #61, add at the end of its config:
  reftemplate END
Then do the same for 78. For #67... either #61 is on, or #67 is on, but not both. #81 is a different story and its a bugger. Not sure on how to do that one. Do you have some examples so I can do some testing? Bgwhite (talk) 18:22, 31 August 2016 (UTC)
Thank you very much, Bgwhite. It's already a lot for me. About example: You have he:Template:הערה and all transcluded pages. The base is: template named "הערה", which has some parameters, when the reference text is the first unnamed parameter. As in {{הערה|שם=refname1|reftext|קבוצה=refgroup5}}. If you'll decide it's possible I'll thank you even more. IKhitron (talk) 19:36, 31 August 2016 (UTC)
By the way, Bgwhite, is there a possibility to do this for #3? I mean more ref templates, as in #61, not nore references template as in #78? Thank you, IKhitron (talk) 15:35, 4 September 2016 (UTC)
and one more btw, #78 references templates does not recognize different groups. There is some parameter for this? Thank you. IKhitron (talk) 15:48, 4 September 2016 (UTC)

More errors / more bots[edit]

If we manage to have more bots running daily we can reduce the time required to fix errors drastically. This means we have more free time to detect more errors and and add to our list. What could these errors be? In an ideal world, we could check all of WP:GENFIXES and see what is worth to be done even as a sole task. -- Magioladitis (talk) 09:25, 30 July 2016 (UTC)

Help with translation page[edit]

Hello. I hope somebody who read this can find 5 minutes to help me. I'll be very glad if it's possible, so if I know it's not your "duty". I made a lot of changes in our translation page, because most of it was there from the time when checkwiki was a beta on dewiki. But it doesn't work any more! I tryed to find some variable without END or some another syntax error, but could not. What could be the problem? Thank you very very much in advance, IKhitron (talk) 11:55, 31 July 2016 (UTC)

Isn't "description_text_hewiki" the one, that screws up everything? --Edgars2007 (talk/contribs) 13:54, 31 July 2016 (UTC)
Everything is possible. Why do you think it's there, there is some problem in the description? Thank you very much, IKhitron (talk) 15:09, 31 July 2016 (UTC)
As I don't know, how those translation files are getting parsed to Checkwiki system, I'm just guessing. </syntaxhighlight> looked suspicous (and other non-HTML stuff), but I may be wrong. --Edgars2007 (talk/contribs) 16:05, 31 July 2016 (UTC)
I see. I created this part as in frwiki, and it works there. IKhitron (talk) 21:04, 31 July 2016 (UTC)

Article that doesn't exist appears in the database and in maintenance categories[edit]

The page USA:S inrikessäkerhetsdepartement has appeared on sv.wp's list of #2-errors for ~1 year now (or longer), at least when processing with WPCleaner. That page does not exist (the page USA:s inrikessäkerhetsdepartement however does exists). Yet this page appears on the CHECKWIKI list, and in the automated maintenece category Pages using invalid self-closed HTML tags on sv.wp. Why is this? (tJosve05a (c) 10:02, 1 August 2016 (UTC)

It looks like parsers think USA is a namespace and automaticaly uppercase the first letter of the rest. IKhitron (talk) 15:07, 1 August 2016 (UTC)

Another unexpected run[edit]

Hi. It ran today, a long time before the new dumps were completed. It's a bug? IKhitron (talk) 00:29, 6 August 2016 (UTC)

IKhitron Before, WMFLabs wouldn't transfer all the dump files over until all the files were completed. Now, when a file is completed, it is transferred over. The file CheckWiki uses is towards the beginning of the dump cycle. Here's the latest dump progress of hewiki. The file CheckWiki uses is "pages-articles". Bgwhite (talk) 07:40, 6 August 2016 (UTC)
I see: you have protection.:-) Thank you, IKhitron (talk) 14:39, 6 August 2016 (UTC)

#88 has false positive[edit]

At here most of the reported items are false positive. the {{DEFAULTSORT:}} on fa.wikipedia is {{ترتیب‌پیش‌فرض:}}. checkwiki shows any texts which is started with ترتیب: it doesn't care that it should have {{ at the first. for example fa:آرایه‌های ادبی doesn't have blank at first position.Yamaha5 (talk) 11:48, 9 August 2016 (UTC)

In other word: the report should only check cases which have {{ with the first word of mediawiki magice word () for example for english if we have this text it will report it incorrectly
* some text DEFAULTSORT: foo some text...

for Persian

* some text ترتیب: foo some text...

it is wrong and it should check if DEFAULTSORT: had {{ in advance then report it! like text in below

* some text {{DEFAULTSORT: foo some text...

for Persian

* persian text {{ترتیب: foo some text...

Yamaha5 (talk) 07:44, 12 August 2016 (UTC)

at #88 the code should be like below
                my $sortkey = $test_text;
                $sortkey =~ s/^([ ]+)?$current_magicword//;
                $sortkey =~ s/^([ ]+)?://;


                my $sortkey = $test_text;
                $sortkey =~ s/^{{([ ]+)?$current_magicword//;
                $sortkey =~ s/^{{([ ]+)?://;

Yamaha5 (talk) 07:48, 12 August 2016 (UTC)

Request: Report for wrong dictation[edit]

There are some pages on wikipedia's like below which shows common wrong dictation. please add this to the reports to show which pages have these words.

The first word before || is the wrong oneYamaha5 (talk) 09:30, 11 August 2016 (UTC)
Yamaha5 On the enwiki side, you are talking about Wikipedia:Lists of common misspellings and Wikipedia:Lists of common_misspellings/For machines? If so, then this would be outside of CheckWiki's scope. In theory, CheckWiki find syntax errors and other errors in the source code. Spelling and other kinds of word errors wouldn't be in CheckWiki's scope. One can do a Google or a Wikipedia search to find these. Bgwhite (talk) 22:14, 11 August 2016 (UTC)
Bgwhite I know we can search at google. I wanted monthly lists which can be solved by bots or AWB by users Yamaha5 (talk) 07:53, 12 August 2016 (UTC)

AWB provides Typo fixing but this is the outside the scope of this project. - Magioladitis (talk) 07:56, 12 August 2016 (UTC)

Yamaha5, you can generate such lists using WPC, with error #501 (spelling) and the dump analysis feature, but it may require a few modification of configuration and tweaks. --NicoV (Talk on frwiki) 07:46, 25 September 2016 (UTC)

#28 possible false positives[edit]

Hi. I started to fix #28, and found he:(Miss)understood and he:Anastacia at start of the list. It doesn't look like there are problems there. Maybe there are some more, didn't check yet. Thank you, IKhitron (talk) 18:06, 11 August 2016 (UTC)

IKhitron It was fixed a few days ago. The problem happens when a table is the very last thing in an article... no categories, defaultsort or other templates. I made a change to catch more cases of #28. It was thinking |}} was a table ending when it's most likely a template ending. As a result of the change, #28 will pick up cases of {{|, such as {{|url=http... , where "cite web" is missing. This is an error, but not related to tables. Bgwhite (talk) 21:47, 11 August 2016 (UTC)
Thank you, Bgwhite. It means, these articles will not be in the list in the next run? IKhitron (talk) 22:10, 11 August 2016 (UTC)
IKhitron Correct. These should not be in next month's run. Bgwhite (talk) 22:16, 11 August 2016 (UTC)
Thank you very much for your help. IKhitron (talk) 22:58, 11 August 2016 (UTC)

Suggestion for the future[edit]

Hi. There is something that can take you a couple of days to implement, but it will be very useful for many people and wikis. Could you please:

  1. Create a new function, nulledit.
  2. It will run on the list, open each page and force saving without changing anything.
  3. It will not wait for each edit, as it was a bot run.
  4. It will work for any registered user, not just for those with AWB rights.
  5. It will skip automatically pages that this user has no rights to edit.

Please take a year, if you need, to think about this in positive direction. Thank you very much in advance. IKhitron (talk) 14:21, 22 August 2016 (UTC)

It shouldn't be included into Check Wikipedia project: Check Wikipedia finds the errors, that's all. If you want to do nulledits, you can download python, install pywikibot framework and write something like that:
import pywikibot

def main():
    site = pywikibot.Site()
    for line in open("pages.txt", encoding="utf-8").readlines():
        page = pywikibot.Page(site, line)"Null edit.")

if __name__ == "__main__":
This code will make nulledits in every page from pages.txt file. Facenapalm (talk) 14:59, 22 August 2016 (UTC)
There is page.touch(), btw. And shouldn't open("pages.txt", encoding="utf-8").readlines() be outside of the loop? --Edgars2007 (talk/contribs) 15:19, 22 August 2016 (UTC)
Never heard about touch() before, thanks. No sense to open file outside the loop, it will be processed only once, I've checked that. Anyway, it's just an example (despite the fact that I used this code before). It's good idea to catch exceptions too, for example. Facenapalm (talk) 15:29, 22 August 2016 (UTC)
I have this bot, of course. I ask for AWB to use it's make list mechanism. And you can't say it does not fix errors, it fixes a lot of errors during this run, especially of "forgotten to insert to category". IKhitron (talk) 15:33, 22 August 2016 (UTC)
It's a good idea to at least once go trough file and maybe others, too (generators and :) --Edgars2007 (talk/contribs) 18:58, 22 August 2016 (UTC)

New id suggestion[edit]

Hi. What do you think about such an id:

  1. Read the article.
  2. Find all strings [^']''[^'] and count them as I.
  3. Find all strings [^']'''[^'] and count them as B.
  4. Find all strings [^']'''''[^'] and count them as IB.
  5. At the end, mark the article as new id if I count or B count (or both) are odd.

Thank you. IKhitron (talk) 14:26, 23 August 2016 (UTC)

There will be false positives from constructions such as "Billboard's", which usually renders correctly wherever I have seen it. I suppose someone might think it valuable to replace that construction with Billboard's, but it's not really an error, since it renders correctly. – Jonesey95 (talk) 14:34, 23 August 2016 (UTC)
Yes, you are right. But it's better from ignoring this problem. One can whitelist this article. IKhitron (talk) 14:37, 23 August 2016 (UTC)
I find 1,213 articles with this search for ]]'''s. It might be a fun little AWB project for someone to clean them all. – Jonesey95 (talk) 15:36, 23 August 2016 (UTC)
I afraid this whitelist will be really big. The other problem is that the article can be wrong even if counts are correct, for example, here: a<ref>'''b</ref> '''c d. Facenapalm (talk) 15:39, 23 August 2016 (UTC)

A nice template is {{'}}. -- Magioladitis (talk) 15:44, 23 August 2016 (UTC)

Yes, 1,213 is a lot indeed. It's in enwiki (and other en* wikis) only, but you can't write one id for enwiki and other for rest wikis. So, what about the smaller project - mark if I+B is odd? IKhitron (talk) 16:29, 23 August 2016 (UTC)
Well, Bgwhite, what's the decision? IKhitron (talk) 20:35, 30 August 2016 (UTC)
IKhitron I think there are too many false positives. Looks like more false positives than actual errors. So, I don't think it would be a good idea. Bgwhite (talk) 21:16, 30 August 2016 (UTC)
Thank you. IKhitron (talk) 21:19, 30 August 2016 (UTC)

Update September 2016[edit]

After the update from 2016-09-01 there are 0 errors counted on alswiki. The update before had found hundreds of errors. Does anybody know why many projects are listed with zero errors now? --Holder (talk) 19:12, 7 September 2016 (UTC)

Holder Some of the dump files were messed up. Another dump run should be available around the 16th. Bgwhite (talk) 05:13, 8 September 2016 (UTC)
Thanks! --Holder (talk) 06:04, 8 September 2016 (UTC)

Last ruwiki scan has failed[edit]

The problem is in dump or in our translation file? I decided to write the vast majority of magic words in error_034_templates_ruwiki and then disable some of them step-by-step if some of them will lead to false positives - I hope that's not the reason of failed scan. Can I be sure that the next dump will be scanned successfully? Facenapalm (talk) 12:52, 16 September 2016 (UTC)

Did you see the #Update September 2016? It can be the same problem. IKhitron (talk) 14:53, 16 September 2016 (UTC)
Yes I did. I just want to be sure that the problem doesn't depend on me. If so, I can wait, that's not a problem. Facenapalm (talk) 15:09, 16 September 2016 (UTC)
Facenapalm Looking at the dumps, it appears they are scheduled to start up again on the 20th. In theory, messing up the translation file for one error won't stop the program from running. At worst, it should only cause false positives or less errors for that particular error #.
You do have #34 wrong for the translation file. These are already in the program and no need to list again:
Then add the rest one item per line. Program already adds | between each item. Bgwhite (talk) 23:35, 16 September 2016 (UTC)
I do not understand how exactly the subst can be a part of wiki code. It should be, you know, subst, shouldn't it? IKhitron (talk) 00:01, 17 September 2016 (UTC)
Like phab:T4700? Matěj Suchánek (talk) 08:47, 17 September 2016 (UTC)
First of all, subst: doesn't work with unexisting templates. See: {{subst:verylongtemplatenamethereisnosuchtemplate}}. Once we decided to delete template called "ab", so we should replace "{{ab|text}}" by "text". Botmaster X created template "User:X/0" with such code: {{{1|}}} and just replaced "{{ab" by "{{subst:User:X/0". He doesn't noticed that he broke template "{{abbr" as well, and we got a lot of "{{subst:User:X/0br" in articles. Checkwiki helps me to detect it. The other example is MediaWiki bugs: "subst:" not always works at least in refs. Facenapalm (talk) 12:07, 17 September 2016 (UTC)
I see. thank you. IKhitron (talk) 12:51, 17 September 2016 (UTC)
Understood, thanks. Well, one item "a|b" should works just as two items "a" and "b", so it's not an error, is it? Anyway, list of items are easier to understand. Facenapalm (talk) 12:10, 17 September 2016 (UTC)

#24 False positive[edit]

There is a false positve on Telephone numbers in Indonesia due to their use of <prefix>. Not saying it's a good thing, but it's not the pre tag. Jerod Lycett (talk) 22:01, 18 September 2016 (UTC)

Jerod Lycett It's because CheckWiki only scans for <pre. There can be parameters in <pre> and people do forget the space. Two options:
  1. Fix it somehow... remove the < > or some other way.
  2. Add it to the whitelist. The list for #24 is located here.
Bgwhite (talk) 04:30, 19 September 2016 (UTC)

False positive for #94[edit]

Hi, fr:Nicotinamide adénine dinucléotide is detected as having an isolated ref tag, with the notice </ref>| cl50 = | logp = | dja = | od but I don't understand what's wrong because the reported closing ref tag has an opening tag <ref name="ChemIDplus">{{ChemID|53-84-9|Nadide}}, consulté le 16 août 2009</ref> | CL50 = | LogP = | DJA = . --NicoV (Talk on frwiki) 22:49, 18 September 2016 (UTC)

NicoV It's not giving me an error. I haven't changed that part of the code this month. Article hasn't been changed this month. I don't know. Bgwhite (talk) 04:57, 19 September 2016 (UTC)
Bgwhite checkarticle.cgi gives the following answer:
  • - 94 3695 </ref>| cl50 = | logp = | dja = | od
so it's still reported as an error on wmflabs... --NicoV (Talk on frwiki) 05:21, 19 September 2016 (UTC)

False Positive #15[edit]

Consistent Overhead Byte Stuffing has a false positive for #15. Jerod Lycett (talk) 01:29, 19 September 2016 (UTC)

Jerodlycett I messed up the ping from your other message, so also look above.
This is a no win situation. In Checkwiki's code, I have it looking for <code> and then looking for <source>. Program runs, finds a broken code tag, then goes into the <source> section and blanks everything between <source>...</source>. If I had it the other way around, program wouldn't find a borked <code> because it was deleted.
Somethings have to run in a certain order... blanking what's between comment tags goes first. Blanking what goes between <nowiki> is second. I'll put the code section after source and see what happens... No more false positives, keep it where it is at. If more false positives show up, move the tag section back and white list Overhead Byte Stuffing. Bgwhite (talk) 04:48, 19 September 2016 (UTC)
But @Bgwhite:, you promissed to consider to replace such tags with some neutral string as "qwerty" instead of blanking. IKhitron (talk) 06:51, 19 September 2016 (UTC)
IKhitron You are correct. I have horrible memory, so always feel free to remind me about something. I added "CheckWiki" between the tags to make it obvious that CheckWiki added it. I've also added <pre> and <code> tags to #85. If all goes well will add more to #85. Bgwhite (talk) 22:48, 19 September 2016 (UTC)
Thanks! IKhitron (talk) 23:06, 19 September 2016 (UTC)
IKhitron I backed out adding "CheckWiki" between comment tags. It was causing false positives elsewhere. All others look good. Bgwhite (talk) 04:28, 20 September 2016 (UTC)

Invalid link to article[edit]

In dewiki the link to 1% of one shows only 400 Bad Request. The problem is the missing URL-encoding for the "%".

This is the correct link with encoding:

This encoding should be done automatically.--GünniX (talk) 03:39, 25 September 2016 (UTC)

Hi. We have the same problem. The name shown as "Ss", and I still do not know what is the right one. IKhitron (talk) 09:13, 25 September 2016 (UTC)