
Wikipedia:Edit filter/Requested

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by 67.21.154.193 (talk) at 12:13, 14 June 2022 (→‎Racist labeling of political leaders and historical figures). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

    Requested edit filters

    This page can be used to request edit filters, or changes to existing filters. Edit filters are primarily used to address common patterns of harmful editing.

    Private filters should not be discussed in detail. If you wish to discuss creating an LTA filter, or changing an existing one, please instead email details to wikipedia-en-editfilters@lists.wikimedia.org.

    Otherwise, please add a new section at the bottom using the following format:

    == Brief description of filter ==
    *'''Task''': What is the filter supposed to do? To what pages and editors does it apply?
    *'''Reason''': Why is the filter needed?
    *'''Diffs''': Diffs of sample edits/cases. If the diffs are revdelled, consider emailing their contents to the mailing list.
    ~~~~
    

    Please note the following:

    • Edit filters are used primarily to prevent abuse. Contributors are not expected to have read all 200+ policies, guidelines and style pages before editing. Trivial formatting mistakes and edits that at first glance look fine but go against some obscure style guideline or arbitration ruling are not suitable candidates for an edit filter.
    • Filters are applied to all edits. Problematic changes that apply to a single page are likely not suitable for an edit filter. Page protection may be more appropriate in such cases.
    • Non-essential tasks or those that require access to complex criteria, especially information that the filter does not have access to, may be more appropriate for a bot task or external software.
    • To prevent the creation of pages with certain names, the title blacklist is usually a better way to handle the problem - see MediaWiki talk:Titleblacklist for details.
    • To prevent the addition of problematic external links, please make your request at the spam blacklist.
    • To prevent the registration of accounts with certain names, please make your request at the global title blacklist.
    • To prevent the registration of accounts with certain email addresses, please make your request at the email blacklist.



    Racist labeling of political leaders and historical figures

    Here are the two most recent examples:

    This has been happening for several months, perhaps back to last year. I've seen various combinations of the wording "white supremacist" and "racist " edited into political articles, both currently serving individuals and historical figures. These would be edits made after the article was already created. Not limited by geographical area, time period, living or deceased office holders. Can we create a bot that blocks these? And once created, can we update it if a new similar term begins happening? — Maile (talk) 22:40, 30 September 2021 (UTC)[reply]

    I just blocked Special:Contributions/2600:1700:12E1:A090:0:0:0:0/64 for a year as it appears every edit from there has been junk for a long time. That covers several examples of what you describe although I don't know if there are more from other IPs. Johnuniq (talk) 23:21, 30 September 2021 (UTC)[reply]
    Well, that's helpful. Thanks. I don't know if it's been this one IP or not. But I can date this phenomenon to beginning after the BLM events of the last year or two. For whatever reason, one or more editors have been motivated to label BLP and deceased individuals, or geographical areas as, racist, by one term or another. — Maile (talk) 23:59, 30 September 2021 (UTC)[reply]
    @Maile66: Started testing at 1014 (hist · log). Just checking for "racist" or "supremacist" for now. I'll add a check for biographies later. Any other words?
    FYI, I doubt this could ever be refined to the point where it's possible to disallow. Yes, all filters have false positives, but I'm worried about what message we'll be perceived as sending if we stop "X was the target of racist taunts on the field", etc. Suffusion of Yellow (talk) 22:38, 11 October 2021 (UTC)[reply]
    @Suffusion of Yellow: no other words come to mind. I keep hoping this type of editing will fade on its own, but I doubt so in my lifetime, because a lot of it is fed by national-international media reports. Not necessarily limited to the United States, or any other country. — Maile (talk) 22:44, 11 October 2021 (UTC)[reply]
    @Maile66: I added both words to 189 (hist · log) (tag-only). This is actually really common; see the log of 1014 (hist · log). I still might try to work out a disallowing filter if possible. I just don't feel comfortable with stopping Senator McSenatorface resigned after admitting to sending hundreds of racist texts...<ref><ref><ref>. It looks like we're whitewashing. So maybe I'll just disallow very small edits without refs, e.g. Special:Diff/1053952115 and leave 189 to tag the rest. Suffusion of Yellow (talk) 23:43, 7 November 2021 (UTC)[reply]
    @Suffusion of Yellow: Understood. My request here is more about a pattern, seen over the past year or two, of adding blatant labeling to existing articles, usually in the opening sentences of a lead, which then begin more or less "...(name) is a racist and white supremacist..." without any sourcing indicating it as factual. I've never seen "...(name) is a racist and black supremacist...", or any color in between. — Maile (talk) 23:55, 7 November 2021 (UTC)[reply]
    Well, here's one. But of course it's not as common. Suffusion of Yellow (talk) 18:55, 11 November 2021 (UTC)[reply]

    Should this section still be pinned? It has been months since the last comment here. 67.21.154.193 (talk) 12:13, 14 June 2022 (UTC)[reply]
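The "disallow very small edits without refs, tag the rest" idea discussed above could be sketched roughly as follows. This is only an illustration in Python, not AbuseFilter syntax and not the actual logic of filters 1014 or 189; the function name, the return labels, and the 100-byte threshold are all assumptions made up for the example.

```python
import re

# The two words mentioned in the thread.
LABEL = re.compile(r"\b(?:racist|supremacist)\b", re.IGNORECASE)
REF_TAG = re.compile(r"<ref\b", re.IGNORECASE)

# Hypothetical cut-off for what counts as a "very small" edit.
MAX_SMALL_EDIT_BYTES = 100


def action_for_label_edit(added_text: str, size_delta: int) -> str:
    """Return 'none', 'tag', or 'disallow' for the text an edit adds."""
    if not LABEL.search(added_text):
        return "none"
    is_small = abs(size_delta) <= MAX_SMALL_EDIT_BYTES
    has_ref = REF_TAG.search(added_text) is not None
    # Only small, unreferenced additions are stopped; everything else is tagged
    # so that sourced statements about racism are never blocked.
    if is_small and not has_ref:
        return "disallow"
    return "tag"
```

A sourced sentence such as the "Senator McSenatorface" example would come out as "tag" under this sketch, which matches the whitewashing concern raised in the thread.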

    pronoun changes

    • Task: changes of pronouns in articles: "he" to "she", "they" to "he", etc.
    • Reason: often a MoS violation when transgender BLP subjects are involved. I saw a request in this page's archives, but it got no response.
    • Diffs: one example

    67.21.154.193 (talk) 15:29, 20 April 2022 (UTC)[reply]

    Previous comment: Wikipedia:Edit_filter/Requested/Archive_18#Changes_of_pronoun started by User:Valereee. No one responded 67.21.154.193 (talk) 13:52, 27 April 2022 (UTC).[reply]
    Probably something with similar logic to Special:AbuseFilter/1154 would work. Will work on this. Galobtter (pingó mió) 20:32, 2 May 2022 (UTC)[reply]
    Tamzin and Firefly already started on this, see private filter 1200 (hist · log). Suffusion of Yellow (talk) 20:35, 2 May 2022 (UTC)[reply]
    @Suffusion of Yellow Thanks for letting me know. Galobtter (pingó mió) 21:40, 2 May 2022 (UTC)[reply]
    seems to be public now and getting a good amount of hits. thx! 67.21.154.193 (talk) 13:44, 30 May 2022 (UTC)[reply]
    @Tamzin: I think there are edits in India Willoughby that should have been hit by the filter but weren't. Maybe someone should fix that? 67.21.154.193 (talk) 13:27, 6 June 2022 (UTC)[reply]

    Claiming the death of an article subject

    • Task: Catch additions of a section titled "Death" and, more generally, claims that the subject is dead.
    • Reason: This filter is needed for the same reason Special:AbuseFilter/712 is needed.

    67.21.154.193 (talk) 15:40, 20 April 2022 (UTC)[reply]

    Might I add that it should trip to tag if there is a ref provided, but trips to warn or something if no reference is provided. Generally this would be a filter, active on BLP articles, that trips at the addition of "foo died" or "foo [wildcard, to allow for adjectives] passed away" or similar strings. I've seen a bit of this on RC patrol. Mako001 (C)  (T)  🇺🇦 14:34, 26 April 2022 (UTC)[reply]
    The main reason I want this is to help prevent death hoaxes from appearing. If it gets overlooked early, it could end up being a while before someone notices. 67.21.154.193 (talk) 13:49, 27 April 2022 (UTC)[reply]
    so.... is there gonna be any discussion? any action? 67.21.154.193 (talk) 15:22, 2 May 2022 (UTC)[reply]
    Should I ping someone to this discussion? 67.21.154.193 (talk) 12:06, 31 May 2022 (UTC)[reply]
    I decided that I am going to ping @Ohnoitsjamie: to this page because he's quite active, and numerous sections have not had any response from EF managers. 67.21.154.193 (talk) 14:37, 1 June 2022 (UTC)[reply]
    I don't have time at the moment to work on that; I have written that kind of filter before; I'd want to do a lot of testing on it first as it's more complex than average. OhNoitsJamie Talk 13:57, 2 June 2022 (UTC)[reply]

    Came across deleted filter 40, but it seems like it would require significant rework if we were to revive it. 67.21.154.193 (talk) 12:44, 8 June 2022 (UTC)[reply]
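The tag-versus-warn behaviour suggested in this thread (tag if a reference accompanies the death claim, warn if not) might look something like the Python sketch below. It is not AbuseFilter code and is not based on filter 712 or deleted filter 40; the patterns cover only the two phrasings quoted above, and the function name and return labels are hypothetical.

```python
import re

# Only the strings quoted in the thread; "similar strings" would need adding.
DEATH_CLAIM = re.compile(r"\b(?:died|passed\s+away)\b", re.IGNORECASE)
# Matches <ref> and <ref name=...>, but not <references />.
REF_TAG = re.compile(r"<ref[\s>]", re.IGNORECASE)


def classify_death_edit(added_text: str) -> str:
    """Return 'none', 'tag', or 'warn' for the text an edit adds."""
    if not DEATH_CLAIM.search(added_text):
        return "none"
    # A claim with a reference is only tagged; an unreferenced one warns.
    return "tag" if REF_TAG.search(added_text) else "warn"
```

Because the pattern matches "passed away" anywhere in the added text, adjectives or adverbs before it ("sadly passed away") need no explicit wildcard. A real filter would also have to restrict itself to BLP articles, which this sketch does not attempt.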

    How'd the repeat filter not catch this?

    this and this seem rather obvious instances... RandomCanadian (talk / contribs) 22:17, 26 April 2022 (UTC)[reply]

    Does the filter you had in mind (1163?) only check the article namespace? Certes (talk) 22:36, 26 April 2022 (UTC)[reply]
    @Certes: Looks like it does. Is there any reason not to extend it to talk pages, beyond the obvious "well, they get less traffic". That kind of edit is still the kind of stuff that's so universally useless that there's no point to allow it or pollute even talk page histories with it. RandomCanadian (talk / contribs) 02:33, 27 April 2022 (UTC)[reply]
    I'd say extend to everything but sandboxes, also see this for one too. If active in talk pages (at least) it would stop a good deal of nonsense. Mako001 (C)  (T)  🇺🇦 03:48, 27 April 2022 (UTC)[reply]
    I set 1163 to mainspace only because the filter catches a lot of hits, and because I haven't managed to narrow down FPs to make disallow or DatBot appropriate, it needs to be manually checked to be useful. If it gets tons of hits (due to other namespaces) I figure it will just turn into those log-only filters that never get checked. ProcrastinatingReader (talk) 11:37, 30 April 2022 (UTC)[reply]
    @ProcrastinatingReader: This probably has a really obvious answer, but would it reduce the FPs enough to disallow and/or Auto-report if another filter existed that only triggered when it repeated more than, say, six times? Mako001 (C)  (T)  🇺🇦 14:12, 14 May 2022 (UTC)[reply]
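For illustration, the "repeated more than, say, six times" threshold can be expressed with a backreference. The Python sketch below is an assumption about how such a check might be written, not the logic of filter 1163; the 20-character limit on the repeated unit and the function name are invented for the example.

```python
import re

MIN_REPEATS = 7  # "more than six" consecutive copies

# A short unit (1-20 characters) followed by at least MIN_REPEATS - 1 further
# copies of itself. The lazy quantifier prefers the shortest repeating unit.
REPEATED_UNIT = re.compile(r"(.{1,20}?)\1{%d,}" % (MIN_REPEATS - 1), re.DOTALL)


def has_long_repeat(added_text: str) -> bool:
    """True if some short string occurs MIN_REPEATS or more times in a row."""
    return REPEATED_UNIT.search(added_text) is not None
```

Since Python strings are sequences of code points, a run of a single emoji is handled the same way as a run of letters. Runs of spaces or table markup would also match, so some false positives would remain even with the higher threshold.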

    Repeated emojis

    I'm surprised ClueBot didn't catch this edit, but seems it didn't. Could we make an edit filter for additions of repeated strings like it? {{u|Sdkb}}talk 02:51, 28 April 2022 (UTC)[reply]

    Ha, related to my report above... Are emojis also not covered by Special:AbuseFilter/1163? Two improvements for the price of one, I say... RandomCanadian (talk / contribs) 03:24, 28 April 2022 (UTC)[reply]
    It does, but that's template namespace. It would've caught that edit in mainspace. ProcrastinatingReader (talk) 11:38, 30 April 2022 (UTC)[reply]
    If it's not too expensive, most mainspace-only filters might benefit from covering templates too. Apart from oddities like Template/Did you know nominations/..., they can do more damage. Certes (talk) 12:01, 30 April 2022 (UTC)[reply]

    should we revive Special:AbuseFilter/402 with a warning message similar to MediaWiki:abusefilter-warning-AfC-unsourced-submissions?

    The filter here (for unreferenced articles) was deleted back in 2013 [1] because it apparently had no purpose. Now, a warn+tag filter exists for completely unsourced AfC submissions; see here. I think reintroducing filter 402 (with warn) would help against non-notable or spam creations, as well as make new users add more reliable sources.

    Also, are the above "pronoun change" and "adding death" filters going to be implemented?

    67.21.154.193 (talk) 15:34, 2 May 2022 (UTC)[reply]

    Perhaps #964 should be extended to mainspace? casualdejekyll 01:11, 6 May 2022 (UTC)[reply]

    Pinging @Tamzin: since she's an edit filter manager, and no EF manager has responded to any section below this one yet. 67.21.154.193 (talk) 13:41, 30 May 2022 (UTC)[reply]

    Expand the "poop" filter to include "poo poo"

    • Task: Stop edits adding this string, by adding it to a "disallow" filter (such as the existing "poop" filter)
    • Reason: Because the poop vandalism filter doesn't catch it, and this is quite common.
    • Diffs: [2]

    Can this string ("poo poo") be added to the ones prevented by the "poop" filter? I see virtually no legitimate use for this string in mainspace. 💩 Mako001 (C)  (T)  🇺🇦 12:20, 12 May 2022 (UTC)[reply]

    just another example. Mako001 (C)  (T)  🇺🇦 10:44, 18 May 2022 (UTC)[reply]

    should characters in the 33xx range and circled letters and numbers be included in the filter?

    67.21.154.193 (talk) 15:25, 24 May 2022 (UTC)[reply]

    see Enclosed Alphanumerics, Enclosed Alphanumeric Supplement and Enclosed CJK Letters and Months unicode blocks. Edit: also CJK Compatibility 67.21.154.193 (talk) 17:01, 24 May 2022 (UTC)[reply]
    Also some at Dingbats. 67.21.154.193 (talk) 13:18, 2 June 2022 (UTC)[reply]


    Vaguely related: are "funny" Greek letters being normalised to, er, normal Greek letters? This looks like an FP: Special:AbuseLog/32463333 (06:00, 27 April 2022: Ωχγ triggered filter 1,168). Certes (talk) 16:03, 24 May 2022 (UTC)[reply]

    It seems like the ohm symbol is now removed bc of this. This is the same filter 67.21.154.193 (talk) 17:00, 24 May 2022 (UTC)[reply]
    Yes. The Angstrom symbol, Kelvin symbol and ohm symbol all normalised to regular characters, so I had to remove them to avoid false positives. — The Anome (talk) 15:15, 8 June 2022 (UTC)[reply]

    Also:
    Change: ℬ and ℭ should have a pipe | between them. This is on the line that starts with “(accountname rlike "℀|℁|ℂ|℃|℄|℅|℆|ℇ|℈|℉…”
    Add (test for false positives by unicode normalization first): ªº (ordinal indicators), ₐₑₒₓₔₕₖₗₘₙ₊₋₌₍₎ₚₛₜⱼ (subscript), ꬲꬽꬾ (blackletter), ⁱⁿ⁺⁻⁼⁽⁾ (superscript), ⱻꜰɢʛʜɪʟɴɶʀꝶꜱʏꭥꞮꟸᶦᶧᶫᶰʶᶸ(small caps/modifier) ᶛᶜᶝᶞᶟᶠᶡᶢᶣᶤᶥᶨˡᶩᶪᵚᶬᶭᶮᶯᶱᶲᶳᶴᶵᶶᶷᶹᶺᶻᶼᶽᶾᶿꟹꭟʰʱʲʳʴʵʷˣʸꭜꭝꭞ (modifiers) ⅠⅡⅢⅣⅤⅥⅦⅧⅨⅩⅪⅫⅬⅭⅮⅯⅰⅱⅲⅳⅴⅵⅶⅷⅸⅹⅺⅻⅼⅽⅾⅿↀↁↂↃↅↆↇↈ (roman numerals)
    (more small caps and modifiers --->) ꟲ (unicode A7F2) ꟳ (unicode A7F3) ꟴ (unicode A7F4) 𝼂 (unicode 1DF02) 𝼄 (unicode 1DF04) 𝼐 (unicode 1DF10)
    Latin Extended-F unicode block (10780-107BF; bunch of modifier letters) 67.21.154.193 (talk) 12:14, 2 June 2022 (UTC)[reply]

    Thank you. These strings are a nightmare to edit, because they break text rendering in the online editor. I'll take a look through your list and see what I can incorporate. I suspect some of these might already be caught by higher-level filters at the Mediawiki or global config level, see MediaWiki:Titleblacklist and [3], which are useful, but not comprehensive enough, as hits on Filter 1168 keep on demonstrating. — The Anome (talk) 15:19, 8 June 2022 (UTC)[reply]
    Yeah, I think this would be better in the global blacklist, but the problem is that if there are false positives caused by Unicode normalization, how would we know? It would also be more difficult to pinpoint exactly which character is causing the false positives. BTW, there are a bunch of likely problematic characters in the 2000-2bff and 1f000-1ffff range. 67.21.154.193 (talk) 14:20, 9 June 2022 (UTC)[reply]
    Also, this would probably need a custom message if it were to be added to the title blacklist against usernames. 67.21.154.193 (talk) 15:08, 9 June 2022 (UTC)[reply]
    @The Anome: Haven't looked into the changes suggested here, but I swapped out those literal characters with \x{...} escapes, which should make the filter easier to edit. Suffusion of Yellow (talk) 21:12, 13 June 2022 (UTC)[reply]
    @Suffusion of Yellow: I'm never quite sure what regex format anything supports, so I'm glad to hear that \x{...} escapes work. Regarding normalization false positives, I can easily check for that with a bit of Python code: the ohm, Angstrom and Kelvin signs came as a bit of a surprise to me. Ultimately, it would be great if we could get these characters pushed into the top level Mediawiki filter, so we don't need to have this filter at all. UAX #31 might be our friend here. But we are already doing pretty well by just blocking the mathematical and IPA characters, as they are so popular with text obfuscators/prettifiers. — The Anome (talk) 22:00, 13 June 2022 (UTC)[reply]
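A normalization check of the kind mentioned above can be done with Python's standard `unicodedata` module. The sketch below is not the code anyone in this thread actually ran; it assumes NFC is the relevant normalization form, and the helper name `normalization_changes` is made up for the example.

```python
import unicodedata


def normalization_changes(chars: str, form: str = "NFC") -> dict:
    """Map each character that changes under `form` to its normalized value."""
    changed = {}
    for ch in chars:
        normalized = unicodedata.normalize(form, ch)
        if normalized != ch:
            changed[ch] = normalized
    return changed


# Ohm sign, Kelvin sign, Angstrom sign, and script capital B.
candidates = "\u2126\u212A\u212B\u212C"
for ch, norm in normalization_changes(candidates).items():
    print(f"U+{ord(ch):04X} {unicodedata.name(ch)} -> {norm!r}")
```

Any character reported here would reach the filter already converted to an ordinary letter, so keeping it in the pattern risks false positives on the ordinary letter. Script capital B is left alone by NFC but does change under NFKC, so the choice of form matters when testing a candidate list.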

    False GA/FA tags

    • Task: Warn when articles are created with Good Article or Featured Article tags already attached.
    • Reason: To prevent editors who don't understand GA/FA procedure from misusing the tags, and to stop malicious use of the same.
    • Diffs: Most recent one I could find (I removed the tag later): Special:Diff/1090058919

    Sumanuil. 05:48, 27 May 2022 (UTC)[reply]

    Is this a common issue? I can't imagine it happens particularly often and these kinds of non-urgent problems are likely to be picked up as part of new page patrol. Sam Walton (talk) 21:25, 30 May 2022 (UTC)[reply]
    Not sure how common, but it shouldn't be happening at all. Sumanuil. 03:10, 31 May 2022 (UTC)[reply]
    There is Special:AbuseFilter/716, but it only catches non-autoconfirmed accounts. 67.21.154.193 (talk) 12:16, 2 June 2022 (UTC)[reply]

    Helping newer users deal with dead links appropriately

    • Task: Use a custom warning message to notify a newer user or IP when they attempt to remove a url or ref with an edit summary including "dead url/ref/link" "404 error/message" "doesn't work" or any other string which would indicate that they are removing it because it is a wp:dead link. The message would be more friendly in tone, and would include links to guidelines about dealing with dead links, and a brief summary of those guidelines, something along the lines of "Instead of removing dead links, tag them with {{dead link}}..., try to find an archive yourself at one of these sites (add links to useful archive sites here) or, if you have an account, try using (IABot console link here)."
    • Reason: Many inexperienced users will (in good faith) remove broken links, thinking that they are of no use anymore, and not actually realise that it is a problem. Whilst "references removed" is helpful, this would be a narrower filter than that one, and is designed to offer advice and guidance to users who may not realise that their edits are possibly problematic. If alerted to the appropriate way of dealing with such links, I believe that most of these users would do so, as the responses I have got to uw-dead1 warnings whilst on RC patrol seem to be quite positive along the lines of "oh, thanks, I didn't realise I could fix that, I'll do that now".

    Mako001 (C)  (T)  🇺🇦 05:50, 28 May 2022 (UTC)[reply]

    @Mako001: That could also catch a certain spammer who replaces dead links by links to irrelevant content on a website they promote. Certes (talk) 10:55, 28 May 2022 (UTC)[reply]
    I think I've seen that one before too, though it would probably need to be more complex than the idea that I had of just "lines removed contains <ref> or </ref>" and "edit summary contains dead link/ref/page/site or 404 or link broken/doesn't work/dead". Mako001 (C)  (T)  🇺🇦 11:05, 28 May 2022 (UTC)[reply]
    @Mako001: No, the spammer uses an edit summary along the lines of "dead link". They believe (or want us to believe) that a link to their website is an improvement on a dead link. Certes (talk) 11:28, 28 May 2022 (UTC)[reply]
    @Certes: Do they ever remove the ref tags? If not then it would probably need to be a more complex filter, but the issue you are referring to would be better handled by the spam blacklist anyway (as I understand). Did you want to propose an addition there? Mako001 (C)  (T)  🇺🇦 11:51, 28 May 2022 (UTC)[reply]
    @Mako001: No, they leave everything unchanged except the URL. It's low volume and we have checks for the text they add, but if it grows then they can go on the blacklist. Certes (talk) 11:53, 28 May 2022 (UTC)[reply]
    Hmm, if that was a filter, it would have to be a separate filter then, and an LTA one too, so it'd be best to not discuss it here. My idea is to just check if the lines removed contain ref tags or http(s):// and the edit has a summary suggesting a dead link was removed, so it wouldn't catch their sort of edits. Mako001 (C)  (T)  🇺🇦 12:03, 28 May 2022 (UTC)[reply]
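The two-part condition described above (removed lines contain a ref tag or URL, and the edit summary suggests a dead link) can be sketched in Python as follows. This is an illustration only, not AbuseFilter syntax; the function name and the exact list of summary phrases are assumptions drawn from the examples in this thread.

```python
import re

# A <ref> or </ref> tag, or the start of an http/https URL.
LINK_OR_REF = re.compile(r"</?ref\b|https?://", re.IGNORECASE)

# Summary phrases suggested in the thread; the list is illustrative.
DEAD_LINK_SUMMARY = re.compile(
    r"\bdead\s+(?:url|ref|link|page|site)\b"
    r"|\b404\b"
    r"|doesn'?t\s+work"
    r"|link\s+broken|broken\s+link",
    re.IGNORECASE,
)


def should_warn(removed_lines: list, summary: str) -> bool:
    """True if a link or ref was removed and the summary cites a dead link."""
    removed_link = any(LINK_OR_REF.search(line) for line in removed_lines)
    return removed_link and DEAD_LINK_SUMMARY.search(summary) is not None
```

Restricting the check to newer users and IPs, as the request proposes, would be an additional condition on top of this.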

    Turkey / Türkiye

    • Task: Log only: when a user changes "Turkey" to "T[uü]rkiye". Article namespace only.
    • Reason: Turkey has (officially) changed its name to Türkiye, however per Talk:Turkey#Requested_move_3_June_2022, the overwhelming consensus is that COMMONNAME should apply and our article remain at Turkey. There have already been a number of attempts to change the name of the country (and even to move a page) based on the new name, so it would be useful to log such changes when the RFC closes. Log only, as a minority may be valid (i.e. when the actual name of an organization includes the Turkish name). Black Kite (talk) 11:46, 4 June 2022 (UTC)[reply]
    @Black Kite: Simple initial attempt at this logging at Special:AbuseFilter/1207. Sam Walton (talk) 08:52, 6 June 2022 (UTC)[reply]
    Thanks! Black Kite (talk) 13:12, 6 June 2022 (UTC)[reply]
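As a rough illustration of the requested check, the Python sketch below flags an edit when "Turkey" becomes less frequent and "Türkiye" or "Turkiye" becomes more frequent. It is not the actual condition used in Special:AbuseFilter/1207, and the function name is hypothetical.

```python
import re

OLD_NAME = re.compile(r"\bTurkey\b")
# Character class with both spellings; no "|" is needed inside the brackets.
NEW_NAME = re.compile(r"\bT[uü]rkiye\b")


def renames_turkey(removed_lines: list, added_lines: list) -> bool:
    """True if the edit drops 'Turkey' and gains 'Türkiye'/'Turkiye'."""
    removed = "\n".join(removed_lines)
    added = "\n".join(added_lines)
    old_dropped = len(OLD_NAME.findall(removed)) > len(OLD_NAME.findall(added))
    new_gained = len(NEW_NAME.findall(added)) > len(NEW_NAME.findall(removed))
    return old_dropped and new_gained
```

Comparing counts rather than mere presence means that adding an organization whose real name includes "Türkiye" is not flagged unless "Turkey" is also removed, which fits the log-only intent.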