Wikipedia:Edit filter/Requested
Requested edit filters |
---|
This page can be used to request edit filters, or changes to existing filters. Edit filters are primarily used to address common patterns of harmful editing. Private filters should not be discussed in detail. If you wish to discuss creating an LTA filter, or changing an existing one, please instead email details to wikipedia-en-editfilterslists.wikimedia.org. Otherwise, please add a new section at the bottom using the following format: == Brief description of filter == *'''Task''': What is the filter supposed to do? To what pages and editors does it apply? *'''Reason''': Why is the filter needed? *'''Diffs''': Diffs of sample edits/cases. If the diffs are revdelled, consider emailing their contents to the mailing list. ~~~~ Please note the following:
|
Index |
This page has archives. Sections older than 30 days may be automatically archived by ClueBot III when more than 4 sections are present. |
Racist labeling of political leaders and historical figures
Here are the two most recent examples:
This has been happening for several months, perhaps back to last year. I've seen various combinations of the wording "white supremacist" and "racist " edited into political articles, both currently serving individuals and historical figures. These would be edits made after the article was already created. Not limited by geographical area, time period, living or deceased office holders. Can we create a bot that blocks these? And once created, can we update it if a new similar term begins happening? — Maile (talk) 22:40, 30 September 2021 (UTC)
- I just blocked Special:Contributions/2600:1700:12E1:A090:0:0:0:0/64 for a year as it appears every edit from there has been junk for a long time. That covers several examples of what you describe although I don't know if there are more from other IPs. Johnuniq (talk) 23:21, 30 September 2021 (UTC)
- Well, that's helpful. Thanks. I don't know if it's been this one IP or not. But I can date this phenomenon to beginning after the BLM events of the last year or two. For whatever reason, one or more editors have been motivated to label BLP and deceased individuals, or geographical areas as, racist, by one term or another. — Maile (talk) 23:59, 30 September 2021 (UTC)
- @Maile66: Started testing at 1014 (hist · log). Just checking for "racist" or "supremacist" for now. I'll add a check for biographies later. Any other words? FYI, I doubt this could ever be refined to the point where it's possible to disallow. Yes, all filters have false positives, but I'm worried about what message we'll be perceived as sending if we stop "X was the target of racist taunts on the field", etc. Suffusion of Yellow (talk) 22:38, 11 October 2021 (UTC)
- @Suffusion of Yellow: no other words come to mind. I keep hoping this type of editing will fade on its own, but I doubt so in my lifetime, because a lot of it is fed by national-international media reports. Not necessarily limited to the United States, or any other country. — Maile (talk) 22:44, 11 October 2021 (UTC)
- @Maile66: I added both words to 189 (hist · log) (tag-only). This is actually really common; see the log of 1014 (hist · log). I still might try to work out a disallowing filter if possible. I just don't feel comfortable with stopping
Senator McSenatorface resigned after admitting to sending hundreds of racist texts...<ref><ref><ref>
. It looks like we're whitewashing. So maybe I'll just disallow very small edits without refs, e.g. Special:Diff/1053952115 and leave 189 to tag the rest. Suffusion of Yellow (talk) 23:43, 7 November 2021 (UTC)- @Suffusion of Yellow: Understood. My request here is more about the concern the past year or two of a pattern of adding blatant labeling to existing articles, usually in the opening sentences of a lead, and begin more or less, " ...(name) is a racist and white supremacist ... " without any sourcing indicating it as factual. I've never seen, "" ...(name) is a racist and black supremacist ... " Or pick any color inbetween. — Maile (talk) 23:55, 7 November 2021 (UTC)
- Well, here's one. But of course it's not as common. Suffusion of Yellow (talk) 18:55, 11 November 2021 (UTC)
- @Suffusion of Yellow: Understood. My request here is more about the concern the past year or two of a pattern of adding blatant labeling to existing articles, usually in the opening sentences of a lead, and begin more or less, " ...(name) is a racist and white supremacist ... " without any sourcing indicating it as factual. I've never seen, "" ...(name) is a racist and black supremacist ... " Or pick any color inbetween. — Maile (talk) 23:55, 7 November 2021 (UTC)
- @Maile66: I added both words to 189 (hist · log) (tag-only). This is actually really common; see the log of 1014 (hist · log). I still might try to work out a disallowing filter if possible. I just don't feel comfortable with stopping
- @Suffusion of Yellow: no other words come to mind. I keep hoping this type of editing will fade on its own, but I doubt so in my lifetime, because a lot of it is fed by national-international media reports. Not necessarily limited to the United States, or any other country. — Maile (talk) 22:44, 11 October 2021 (UTC)
Should this section still be pinned? It has been months since the last comment here67.21.154.193 (talk) 12:13, 14 June 2022 (UTC)
- Unpinned. Sorry, Maile66, I could never work out a filter targeted enough to set to disallow. I moved the contents of my test filter 1208 (hist · log), at least. Suffusion of Yellow (talk) 23:14, 14 June 2022 (UTC)
pronoun changes
- Task: changes or pronouns in articles "he" to "she", "they" to "he", etc
- Reason: often a MoS violation when transgender BLP subjects are involved. Saw a request in the archives of the pages but with no response
- Diffs: one example
67.21.154.193 (talk) 15:29, 20 April 2022 (UTC)
- Previous comment: Wikipedia:Edit_filter/Requested/Archive_18#Changes_of_pronoun started by User:Valereee. No one responded 67.21.154.193 (talk) 13:52, 27 April 2022 (UTC).
- Probably something with similar logic to Special:AbuseFilter/1154 would work. Will work on this. Galobtter (pingó mió) 20:32, 2 May 2022 (UTC)
- Tamzin and Firefly already started on this, see private filter 1200 (hist · log). Suffusion of Yellow (talk) 20:35, 2 May 2022 (UTC)
- @Suffusion of Yellow Thanks for letting me know. Galobtter (pingó mió) 21:40, 2 May 2022 (UTC)
- seems to be public now and getting a good amount of hits. thx! 67.21.154.193 (talk) 13:44, 30 May 2022 (UTC)
- @Tamzin: I think there are edits in India Willoughby that have not been hit by the filter but weren't. Maybe someone should fix that? 67.21.154.193 (talk) 13:27, 6 June 2022 (UTC)
- seems to be public now and getting a good amount of hits. thx! 67.21.154.193 (talk) 13:44, 30 May 2022 (UTC)
- @Suffusion of Yellow Thanks for letting me know. Galobtter (pingó mió) 21:40, 2 May 2022 (UTC)
- Tamzin and Firefly already started on this, see private filter 1200 (hist · log). Suffusion of Yellow (talk) 20:35, 2 May 2022 (UTC)
Claiming the death of an article subject
- Task: section titled "death". claims in general that the subject is dead
- Reason: This filter is needed for the same reason Special:AbuseFilter/712 in needed
67.21.154.193 (talk) 15:40, 20 April 2022 (UTC)
- Might I add that it should trip to tag if there is a ref provided, but trips to warn or something if no reference is provided. Generally this would be a filter, active on BLP articles, that trips at the addition of "foo died" or "foo [wildcard, to allow for adjectives] passed away" or similar strings. I've seen a bit of this on RC patrol. Mako001 (C) (T) 🇺🇦 14:34, 26 April 2022 (UTC)
- The main reason I want this is to help prevent death hoaxes from appearing. If it gets overlooked early, it could end up being a while before someone notices. 67.21.154.193 (talk) 13:49, 27 April 2022 (UTC)
- so.... is there gonna be any discussion? any action? 67.21.154.193 (talk) 15:22, 2 May 2022 (UTC)
- Should I ping someone to this discussion? 67.21.154.193 (talk) 12:06, 31 May 2022 (UTC)
- I decided that I am going to ping @Ohnoitsjamie: to this page because he's quite active, and numerous sections have not had any response from EF mamagers. 67.21.154.193 (talk) 14:37, 1 June 2022 (UTC)
- I don't have time at the moment to work on that; I have written that kind of filter before; I'd want to do a lot of testing on it first as it's more complex than average. OhNoitsJamie Talk 13:57, 2 June 2022 (UTC)
- I decided that I am going to ping @Ohnoitsjamie: to this page because he's quite active, and numerous sections have not had any response from EF mamagers. 67.21.154.193 (talk) 14:37, 1 June 2022 (UTC)
- Should I ping someone to this discussion? 67.21.154.193 (talk) 12:06, 31 May 2022 (UTC)
- so.... is there gonna be any discussion? any action? 67.21.154.193 (talk) 15:22, 2 May 2022 (UTC)
- The main reason I want this is to help prevent death hoaxes from appearing. If it gets overlooked early, it could end up being a while before someone notices. 67.21.154.193 (talk) 13:49, 27 April 2022 (UTC)
Came across deleted filter 40, but it seems like it would require significant rework if we were to revive it. 67.21.154.193 (talk) 12:44, 8 June 2022 (UTC)
should we revive Special:AbuseFilter/402 with a warning message similar to MediaWiki:abusefilter-warning-AfC-unsourced-submissions?
The filter here (for unreferenced articles) was deleted back in 2013 [1] because it apparently had no purpose. Now, a warn+tag filter exists are submitting completely unsourced afc submissions see here. I think reintrodicing filter 402 (with warn) would help against non-notable or spam creations, as well as make new users add more reliable sources.
Also, are the above "pronoun change" and "adding death" filters going to be implemented?
67.21.154.193 (talk) 15:34, 2 May 2022 (UTC)
- Perhaps #964 should be extended to mainspace? casualdejekyll 01:11, 6 May 2022 (UTC)
Pinging @Tamzin: since she's an edit filter manager, and no EF manager has responded to any section below this one yet. 67.21.154.193 (talk) 13:41, 30 May 2022 (UTC)
Expand the "poop" filter to include "poo poo"
- Task: Stop edits adding this string, by adding it to a "disallow" filter (such as the existing "poop" filter)
- Reason: Because the poop vandalism filter doesn't catch it, and this is quite common.
- Diffs: [2]
Can this string ("poo poo") be added to the ones prevented by the "poop" filter? I see virtually no legitimate use for this string in mainspace. 💩 Mako001 (C) (T) 🇺🇦 12:20, 12 May 2022 (UTC)
should characters in the 33xx range and circled letters and numbers be included in the filter?
67.21.154.193 (talk) 15:25, 24 May 2022 (UTC)
- see Enclosed Alphanumerics, Enclosed Alphanumeric Supplement and Enclosed CJK Letters and Months unicode blocks. Edit: also CJK Compatibility 67.21.154.193 (talk) 17:01, 24 May 2022 (UTC)
- Also some at Dingbats. 67.21.154.193 (talk) 13:18, 2 June 2022 (UTC)
Vaguely related: are "funny" Greek letters being normalised to, er, normal Greek letters? This looks like an FP: Special:AbuseLog/32463333 (06:00, 27 April 2022: Ωχγ triggered filter 1,168). Certes (talk) 16:03, 24 May 2022 (UTC)
- It seems like the ohm symbol is now removed bc of this. This is the same filter 67.21.154.193 (talk) 17:00, 24 May 2022 (UTC)
- Yes. The Angstrom symbol, Kelvin symbol and ohm symbol all normalised to regular characters, so I had to remove them to avoid false positives. — The Anome (talk) 15:15, 8 June 2022 (UTC)
Also:
Change:ℬ and ℭ should have a pipe |
between them. This is on the line that starts with “(accountname rlike "℀|℁|ℂ|℃|℄|℅|℆|ℇ|℈|℉…”
Add (test for false positives by unicode normalization first):
ªº
(ordinal indicators),
ₐₑₒₓₔₕₖₗₘₙ₊₋₌₍₎ₚₛₜⱼ
(subscript),
ꬲꬽꬾ
(blackletter),
ⁱⁿ⁺⁻⁼⁽⁾
(superscript),
ⱻꜰɢʛʜɪʟɴɶʀꝶꜱʏꭥꞮꟸᶦᶧᶫᶰʶᶸ
(small caps/modifier)
ᶛᶜᶝᶞᶟᶠᶡᶢᶣᶤᶥᶨˡᶩᶪᵚᶬᶭᶮᶯᶱᶲᶳᶴᶵᶶᶷᶹᶺᶻᶼᶽᶾᶿꟹꭟʰʱʲʳʴʵʷˣʸꭜꭝꭞ
(modifiers)
ⅠⅡⅢⅣⅤⅥⅦⅧⅨⅩⅪⅫⅬⅭⅮⅯⅰⅱⅲⅳⅴⅵⅶⅷⅸⅹⅺⅻⅼⅽⅾⅿↀↁↂↃↅↆↇↈ
(roman numerals)
(more small caps and modifiers --->)
ꟲ (unicode A7F2)
ꟳ (unicode A7F3)
ꟴ (unicode A7F4)
𝼂 (unicode 1DF02)
𝼄 (unicode 1DF04)
𝼐 (unicode 1DF10)
Latin Extended-F unicode block (10780-107BF; bunch of modifier letters) 67.21.154.193 (talk) 12:14, 2 June 2022 (UTC)
- Thank you. These strings are a nightmare to edit, because they break text renderering in the online editor. I'll take a look through your list and see what I can incorporate. I suspect some of these might already be caught by higher-level filters at the Mediawiki or global config level, see MediaWiki:Titleblacklist and [3], which are useful, but not comprehensive enough, as hits on Filter 1168 keeps on demonstrating. — The Anome (talk) 15:19, 8 June 2022 (UTC)
- Yeah, I think this would be better in the global blacklist, but the problem is if there are false positives, by unicode normalization, how would we know. It would also be more difficult to pinpoint which charactor exactly is causing the false positives. BTW, there are a bunch of likely problematic characters in the 2000-2bff and 1f000-1ffff range. 67.21.154.193 (talk) 14:20, 9 June 2022 (UTC)
- Also, this would probably need a custom message if it were to be added to the title blacklist against usernames. 67.21.154.193 (talk) 15:08, 9 June 2022 (UTC)
- @The Anome: Haven't looked into the changes suggested here, but I swapped out those literal characters with \x{...} escapes, which should make the filter easier to edit. Suffusion of Yellow (talk) 21:12, 13 June 2022 (UTC)
- @Suffusion of Yellow: I'm never quite sure what regex format anything supports, so I'm glad to hear that \x{...} escapes work. Regarding normalization false positives, I can easily check for that with a bit of Python code: the ohm, Angstrom and Kelvin signs came as a bit of a surprise to me. Ultimately, it would be great if we could get these characters pushed into the top level Mediawiki filter, so we don't need to have this filter at all. UAX #31 might be our friend here. But we are already doing pretty will by just blocking the mathematical and IPA characters, as they are so popular with text obfuscators/prettifiers. — The Anome (talk) 22:00, 13 June 2022 (UTC)
- Yeah, I think this would be better in the global blacklist, but the problem is if there are false positives, by unicode normalization, how would we know. It would also be more difficult to pinpoint which charactor exactly is causing the false positives. BTW, there are a bunch of likely problematic characters in the 2000-2bff and 1f000-1ffff range. 67.21.154.193 (talk) 14:20, 9 June 2022 (UTC)
- Should we include ligatures (except W) in the set of unwanted characters? Certes (talk) 18:26, 14 June 2022 (UTC)
- Maybe, but that would need testing first. Otherwise, we might risk blocking valid usernames by unicode normalization. Also, is it time to get the characters in filter 1168(as well as maybe circled letters) in the global blacklist (with a custom message)? 67.21.154.193 (talk) 13:22, 16 June 2022 (UTC)
False GA/FA tags
- Task: Warn when articles are created with Good Article or Featured Article tags already attached.
- Reason: To prevent editors who don't understand GA/FA procedure from misusing the tags, and to stop malicious use of the same.
- Diffs: Most recent one I could find (I removed the tag later): Special:Diff/1090058919
Sumanuil. 05:48, 27 May 2022 (UTC)
- Is this a common issue? I can't imagine it happens particularly often and these kinds of non-urgent problems are likely to be picked up as part of new page patrol. Sam Walton (talk) 21:25, 30 May 2022 (UTC)
- Not sure how common, but it shouldn't be happening at all. Sumanuil. 03:10, 31 May 2022 (UTC)
- There is Special:AbuseFilter/716, but it only catches non-autoconfirmed accounts. 67.21.154.193 (talk) 12:16, 2 June 2022 (UTC)
Helping newer users deal with dead links appropriately
- Task: Use a custom warning message to notify a newer user or IP when they attempt to remove a url or ref with an edit summary including "dead url/ref/link" "404 error/message" "doesn't work" or any other string which would indicate that they are removing it because it is a wp:dead link. The message would be more friendly in tone, and would include links to guidelines about dealing with dead links, and a brief summary of those guidelines, something along the lines of "Instead of removing dead links, tag them with
{{dead link}}
..., try to find an archive yourself at one of these sites (add links to useful archive sites here) or, if you have an account, try using (IABot console link here)." - Reason: Many inexperienced users will (in good faith) remove broken links, thinking that they are of no use anymore, and not actually realise that it is a problem. Whilst "references removed" is helpful, this would be a narrower filter than that one, and is designed to offer advice and guidance to users who may not realise that their edits are possibly problematic. If alerted to the appropriate way of dealing with such links, I believe that most of these users would do so, as the responses I have got to uw-dead1 warnings whilst on RC patrol seem to be quite positive along the lines of "oh, thanks, I didn't realise I could fix that, I'll do that now".
Mako001 (C) (T) 🇺🇦 05:50, 28 May 2022 (UTC)
- @Mako001: That could also catch a certain spammer who replaces dead links by links to irrelevant content on a website they promote. Certes (talk) 10:55, 28 May 2022 (UTC)
- I think I've seen that one before too, though it would probably need to be more complex than the idea that I had of just "lines removed contains <ref> or </ref>" and "edit summary contains dead link/ref/page/site or 404 or link broken/doesn't work/dead". Mako001 (C) (T) 🇺🇦 11:05, 28 May 2022 (UTC)
- @Mako001: No, the spammer uses an edit summary along the lines of "dead link". They believe (or want us to believe) that a link to their website is an improvement on a dead link. Certes (talk) 11:28, 28 May 2022 (UTC)
- @Certes: Do they ever remove the ref tags? If not then it would probably need to be a more complex filter, but the issue you are referring to would be better handled by the spam blacklist anyway (as I understand). Did you want to propose an addition there? Mako001 (C) (T) 🇺🇦 11:51, 28 May 2022 (UTC)
- @Mako001: No, they leave everything unchanged except the URL. It's low volume and we have checks for the text they add, but if it grows then they can go on the blacklist. Certes (talk) 11:53, 28 May 2022 (UTC)
- Hmm, if that was a filter, it would have to be a separate filter then, and an LTA one too, so it'd be best to not discuss it here. My idea is to just check if the lines removed contains ref tags or http(s):\\ and has an edit summary suggesting a dead link was removed, so it wouldn't catch their sort of edits. Mako001 (C) (T) 🇺🇦 12:03, 28 May 2022 (UTC)
- @Mako001: No, they leave everything unchanged except the URL. It's low volume and we have checks for the text they add, but if it grows then they can go on the blacklist. Certes (talk) 11:53, 28 May 2022 (UTC)
- @Certes: Do they ever remove the ref tags? If not then it would probably need to be a more complex filter, but the issue you are referring to would be better handled by the spam blacklist anyway (as I understand). Did you want to propose an addition there? Mako001 (C) (T) 🇺🇦 11:51, 28 May 2022 (UTC)
- @Mako001: No, the spammer uses an edit summary along the lines of "dead link". They believe (or want us to believe) that a link to their website is an improvement on a dead link. Certes (talk) 11:28, 28 May 2022 (UTC)
- I think I've seen that one before too, though it would probably need to be more complex than the idea that I had of just "lines removed contains <ref> or </ref>" and "edit summary contains dead link/ref/page/site or 404 or link broken/doesn't work/dead". Mako001 (C) (T) 🇺🇦 11:05, 28 May 2022 (UTC)
Turkey / Türkiye
- Task: Log only: when a user changes "Turkey" to "T[u|ü]rkiye". Article namespace only.
- Reason: Turkey has (officially) changed its name to Türkiye, however per Talk:Turkey#Requested_move_3_June_2022, the overwhelming consensus is that COMMONNAME should apply and our article remain at Turkey. There have already been a number of attempts to change the name of the country (and even to move a page) based on the new name, so it would be useful to log such changes when the RFC closes. Log only, as a minority may be valid (i.e. when the actual name of an organization includes the Turkish name). Black Kite (talk) 11:46, 4 June 2022 (UTC)
- @Black Kite: Simple initial attempt at this logging at Special:AbuseFilter/1207. Sam Walton (talk) 08:52, 6 June 2022 (UTC)
Fix <source>
tag detection in Filter 432
Currently, filter 432 does a check to ensure that <source lang=
is not in the new wikitext. However, this has 2 failures to it that make the detection practically useless.
First of all, <source>
is deprecated, and has been superceded by <syntaxhighlight>
, which is used instead, so the detection should be at least swapped from <source lang=
to <syntaxhighlight lang=
(Or both, though judging from the changes in the deprecation tracking category, I don't see source getting used ever).
Second of all, the filter immediately follows the check with a look for lang=
, which disregards the possibility of the inline
attribute which could come before it (E.g. <syntaxhighlight inline lang=text>
).
Side note: I have no idea if this is the correct place to suggest an edit to a filter rather than a new filter, but I don't see any pages anywhere for filter requests other than this, so I'm putting it here. Aidan9382 (talk) 08:16, 16 June 2022 (UTC)