Wikipedia:Edit filter/Requested
This page can be used to request edit filters, or changes to existing filters. Edit filters are primarily used to address common patterns of harmful editing.
Private filters should not be discussed in detail. If you wish to discuss creating an LTA filter, or changing an existing one, please instead email details to wikipedia-en-editfilterslists.wikimedia.org.
Otherwise, please add a new section at the bottom using the following format:
== Brief description of filter == *'''Task''': What is the filter supposed to do? To what pages and editors does it apply? *'''Reason''': Why is the filter needed? *'''Diffs''': Diffs of sample edits/cases. If the diffs are revdelled, consider emailing their contents to the mailing list. ~~~~
Please note the following:
- Edit filters are used primarily to prevent abuse. Contributors are not expected to have read all 200+ policies, guidelines and style pages before editing. Trivial formatting mistakes and edits that at first glance look fine but go against some obscure style guideline or arbitration ruling are not suitable candidates for an edit filter.
- Filters are applied to all edits. Problematic changes that apply to a single page are likely not suitable for an edit filter. Page protection may be more appropriate in such cases.
- Non-essential tasks or those that require access to complex criteria, especially information that the filter does not have access to, may be more appropriate for a bot task or external software.
- To prevent the creation of pages with certain names, the title blacklist is usually a better way to handle the problem - see MediaWiki talk:Titleblacklist for details.
- To prevent the addition of problematic external links, please make your request at the spam blacklist.
- To prevent the registration of accounts with certain names, please make your request at the global title blacklist.
- To prevent the registration of accounts with certain email addresses, please make your request at the email blacklist.
This page has a backlog that requires the attention of willing editors. Please remove this notice when the backlog is cleared. |
Index |
This page has archives. Sections older than 30 days may be automatically archived by ClueBot III when more than 1 section is present. |
Brainrot account creation
[edit]I've seen a lot of accounts like this one that use brainrot terms and usually are bad faith accounts that just vandalize wikipedia. As a result, I think we should create a filter similar to 54 (hist · log) with the regex of 614 (hist · log). It should look something like this:
action contains "createaccount" & !contains_any(user_rights, "override-antispoof", "tboverride", "tboverride-account") & ( abuseStr := "f\s*r\s*e\s*e\s*d\s*i\s*d\s*d\s*y|y\s*o\s*[lo\s]+s\s*w\s*[4ae]+\s*g+ // etc, the rest of the 614 regex; (accountname irlike abuseStr) )
– PharyngealImplosive7 (talk) 17:14, 14 December 2024 (UTC)
- If this request is implemented, it should also exclude users with
tboveride
andtboverride-account
, as this is essentially equivalent to an addition to the title blacklist. JJPMaster (she/they) 03:43, 15 December 2024 (UTC)- Added your suggestion to the proposed code. – PharyngealImplosive7 (talk) 21:55, 15 December 2024 (UTC)
- Sorry, I missed an "r" in
tboverride
, so could you add that? JJPMaster (she/they) 22:03, 15 December 2024 (UTC)- PharyngealImplosive7,
ccnorm(accountname) rlike abuseStr
will not work for this lowercased regex, so useaccountname irlike abuseStr
instead if we plan to implement this new filter. But for now, I'm not seeing that many vandalism-only accounts with brainrot usernames on the recent changes list. Codename Noreste 🤔 Talk 03:34, 16 December 2024 (UTC)- I see them all the time. Not sure there's much point, though, because people can just choose a different username. It won't actually prevent any vandalism. If anything, usernames like this make it very easy to spot vandalism-only accounts. C F A 05:17, 25 December 2024 (UTC)
- I mean I would intend this filter to be log-only like filter 54, so it's an easy way to see these accounts and block them quickly, not a disallow filter. – PharyngealImplosive7 (talk) 23:55, 25 December 2024 (UTC)
- I don't see a problem with that. C F A 00:38, 26 December 2024 (UTC)
- Alternatively, there's User:AmandaNP/UAA/Blacklist in which someone can place a request to add some brainrot regex to that blacklist (usernames like these will eventually be reported to UAA). Codename Noreste (talk) 03:13, 10 January 2025 (UTC)
- I don't see a problem with that. C F A 00:38, 26 December 2024 (UTC)
- I mean I would intend this filter to be log-only like filter 54, so it's an easy way to see these accounts and block them quickly, not a disallow filter. – PharyngealImplosive7 (talk) 23:55, 25 December 2024 (UTC)
- I see them all the time. Not sure there's much point, though, because people can just choose a different username. It won't actually prevent any vandalism. If anything, usernames like this make it very easy to spot vandalism-only accounts. C F A 05:17, 25 December 2024 (UTC)
- PharyngealImplosive7,
- Sorry, I missed an "r" in
- Added your suggestion to the proposed code. – PharyngealImplosive7 (talk) 21:55, 15 December 2024 (UTC)
- I have two issues here. The first is, is an edit filter the right path for implementation here, or would the title blacklist be more appropriate? The second is, if implemented through an edit filter, I would almost certainly only exclude override-antispoof, keeping with what was used for 54 (hist · log). This is given that
tboverride
is a far wider amount of people than would generally be creating accounts with unusual patterns. Unusual and otherwise generally disruptive username patterns are generally held for those with the account creator flag, which are those identified to the Foundation and working with account creation requests, as well as administrators. I'm not sure it's the best idea to toss in every page mover and template editor, given there would be a near-zero chance of them actually tripping this at all (not all PMR/TPE are account-creation savvy, either, such as a current TPE who isn't even extended confirmed...). EggRoll97 (talk) 02:46, 28 December 2024 (UTC)
New user possibly adding Copyright violation or unreliable source
[edit]- Task: Highlighting edits by new users that add urls to wikis, that aren't licensed with a compatible license.
- Reason: Those edits are likely either a copyright violation or an use of a self-published source. This filter would partly be an extension of filter 894 (hist · log) (Self-Published Sources).
- Diffs: I've seen this a few times over at CopyPatrol, those diffs were all revdelled as RD1.
Nobody (talk) 12:47, 16 December 2024 (UTC)
- What are the urls of these incompatible wikis? – 2804:F1...69:1A4C (::/32) (talk) 15:09, 16 December 2024 (UTC)
- Mirrors and forks lists some of them, I don't think its even possible to make a complete list. There's also Fandom, which has both, compatible and non-compatible licenses for their wikis.[1] Nobody (talk) 15:36, 16 December 2024 (UTC)
Here's the basic code for it. (With a few example urls of mirrors that aren't compatible.)
Code
|
---|
equals_to_any(page_namespace, 0, 2, 118) &
!contains_any(user_groups, "extendedconfirmed", "sysop", "bot") &
(
url := "\d{5}\.us|99colors\.net|alchetron\.com|celebsagewiki\.com|en-us\.nina\.az|knowpia\.com|profilpelajar\.com|wikizero\.org";
added_lines irlike url &
!(removed_lines irlike url) &
!(summary irlike "^(?:revert|rv|undid)")
)
|
Nobody (talk) 17:44, 16 December 2024 (UTC)
- 1AmNobody24, I've modified the code to also exclude removed_lines. Without it, the user would get flagged regardless if they edit a part of a section containing the website or not. Codename Noreste 🤔 Talk 23:17, 16 December 2024 (UTC)
Filter for drive-by, unconstructive talk page junk related to student assignments
[edit]- Task: This is related to the persistent issue with talk page junk, some of which is addressed by Special:AbuseFilter/1245. I am proposing a filter to catch a further subset of them, most likely generated by students, that follow a specific but extremely common pattern:
- The page is not a user talk page, a sandbox page, or any subpage of Wikipedia:Reference desk
- The editor is an IP
- The subject line should be a school subject from a predetermined list. Some subjects that are common here: "English", "Math", "Mathematics", "Maths", "Geography", "History", "Social studies", "Chemistry", "Civics", "Physics", "Biology", "Life science", "Earth science".
- One or more of the following should apply to the comment body:
- Comment filter 1: Edits that are really short (fewer than 5 words or thereabouts)
- Comment filter 2: Edits that start with certain phrases: "Definition of", "Write", "Information about", etc.
- Comment filter 3: Edits that start with the phrases "what is" or "what are" (possibly others) and are somewhat short (fewer than 10-20 words? idk)
- Reason: This is a very common pattern of the talk page junk that has ratcheted up since 2021. See this village pump entry and this requested edit filter discussion for past discussions on the topic.
- This specific subset is clearly related to student assignments -- WikiEd doesn't think it's related to their assignments specifically -- there is a correlation but it's probably just school, in general. For instance this diff seems to be associated with this assignment or a very similar one.
- I suspect some of these are produced by LLMs, text-to-speech, search integrations, or other automated tools because of the time frame (the date they really started pouring in lines up almost exactly with the date GPT-3, ChatGPT, etc. came out); because of the formulaic predictability of the pattern; and because of certain tells in some of these suggesting they're overheard conversations, ChatGPT prompts, etc. (Here is a smoking gun for this.) These edits have almost no utility and usually go unanswered; if they are answered, it's usually to scold the user, who almost never responds.
- There are literally thousands of these, cleaning them up is a huge task, and that task also has a deadline. If nobody cleans them up before the page is archived (which is likely to happen because school-curriculum talk pages are often long, and because archiving is often done by bots who don't check what they're doing) then they will be stuck there forever. (I cannot emphasize enough how arbitrary and asinine that is, but whatever.). While I'm willing to clean up as much of the existing stuff as I catch in time, it would be nice to stop the floods.
- I'm happy to add to or refine this filter to reduce false positives and catch more false negatives, this is off the top of my head. The real solution is to either find a technological or UI-design cause, but this subset of edits is just so predictable that a filter might make sense.
- Diffs: 1094685874 (comment filter 1), 1183615020 (comment filter 1), 1085568369 (comment filter 2), 1108078327 (comment filter 2), 1064959579 (comment filter 2), 1185080593 (comment filter 3), 1110355731 (comment filter 3). Again there are thousands more examples, these are the ones I happen to have convenient.
- If you want to find more -- or to help clean them up -- the relevant search pattern is insource:"UTC [subject]". A search pattern more prone to false positives is insource:"[subject or common one-word edit] Special".
Gnomingstuff (talk) 19:04, 19 December 2024 (UTC)
- Bumping this. I can provide more acronyms that are even less likely to be false positives. Gnomingstuff (talk) 20:25, 5 January 2025 (UTC)
- Here is some regex I've made quickly so it might not be accurate completely:
!("confirmed" in user_groups) & !( (page_namespace == 3) || (page_namespace == 4 & contains_any(page_title, "sandbox", "reference desk")) ) & ( junkStr := "={1,6}\s*(?:(?:math(?:ematics)?)|(?:english))\s*={1,6}"; /* add other subjects */ added_lines irlike junkStr & !(removed_lines irlike junkStr) ) & (edit_delta < 50 || added_lines irlike "(?:(?:definition\s*of) || (?:write) || (?:info(?:rmation)?\s*about)) || (?:what\s*(?:(?:is)) )
- This is fairly rudimentary and probably has a few errors but I hope it helps in creating a sketch of what the filter could look like. Thanks, – PharyngealImplosive7 (talk) 14:48, 6 January 2025 (UTC).
- PharyngealImplosive7, your suggestion unfortunately does not work because the regex did not match some edits from those diffs, and because of the regex in the last line which was broken. Gnomingstuff, are there recent cases of these specific talk page junk edits? These diffs that you have provided are from 2022 and 2023, and because of that I believe that it's not worth creating a new filter just to check for these edits. Codename Noreste (talk) 03:10, 10 January 2025 (UTC)
- Yeah the edit filter is not going to catch anywhere near 100% of this -- I'm mostly hoping to hit the major categories while avoiding false positives as much as possible.
- That said, this is absolutely still ongoing. The list of diffs I linked is heavily skewed toward 2022/2023 because it only includes talk pages that have been archived. If the talk page wasn't archived, then I just reverted the edit and it isn't on that list.
- I can put together a list of December 2024/January 2025 diffs but it'll take a while. Gnomingstuff (talk) 04:27, 10 January 2025 (UTC)
- PharyngealImplosive7, your suggestion unfortunately does not work because the regex did not match some edits from those diffs, and because of the regex in the last line which was broken. Gnomingstuff, are there recent cases of these specific talk page junk edits? These diffs that you have provided are from 2022 and 2023, and because of that I believe that it's not worth creating a new filter just to check for these edits. Codename Noreste (talk) 03:10, 10 January 2025 (UTC)
- This is fairly rudimentary and probably has a few errors but I hope it helps in creating a sketch of what the filter could look like. Thanks, – PharyngealImplosive7 (talk) 14:48, 6 January 2025 (UTC).
@Codename Noreste OK, timeboxed to about ~1 hour or so of searching, here are some edits from the past 30 days that fall into this category. This is not a complete list -- a lot of what was out there has been reverted/caught, which is why the list is skewed toward the past few days -- nor a full list of subjects, nor representative of how much each subject gets relative to the others. It's just what I found in an hour.
Sample drive-by edits, 9 December 2024 - 10 January 2025
|
---|
|
Let me know if you have any other questions. Gnomingstuff (talk) 05:49, 10 January 2025 (UTC)
Some more recent edits, this time with some of the more common abbreviations:
Sample drive-by edits, 11 December 2024 - 14 January 2025
|
---|
|
I think this should demonstrate how ongoing an issue this is. Gnomingstuff (talk) 12:23, 14 January 2025 (UTC)
Given that the average report has a edit size between 600 and 1100, I think that edits by non-confirmed users that have an edit delta much bigger than that (2500 or 5000?) be disallowed, since they're likely non-constructive edits. Nobody (talk) 15:07, 13 January 2025 (UTC)
- Do you have any diffs related to this? I can modify the old and new size OR the edit delta conditions together, perhaps. Codename Noreste (talk) 03:00, 14 January 2025 (UTC)
- Diff 1 (+14,121), Diff 2 (+2,981), Diff 3 (+402,411), Diff 4 (+19,845), all in the last two weeks. Edit: Another one (17,279) Nobody (talk) 06:08, 14 January 2025 (UTC)
- See Special:Diff/1269760874 (2,433 bytes added) in which an anonymous user added an article lead of a district in Thailand (probably disruptive), so I think we can lower the edit_delta limit to more than 1800 bytes. Codename Noreste (talk) 08:20, 16 January 2025 (UTC)
- The biggest constructive edits I've seen this year by new or unregistered users were between +2,000 and +2,100 bytes, so I wouldn't lower it below that. Nobody (talk) 08:25, 16 January 2025 (UTC)
- Never mind. Codename Noreste (talk) 08:35, 16 January 2025 (UTC)
- The biggest constructive edits I've seen this year by new or unregistered users were between +2,000 and +2,100 bytes, so I wouldn't lower it below that. Nobody (talk) 08:25, 16 January 2025 (UTC)
- See Special:Diff/1269760874 (2,433 bytes added) in which an anonymous user added an article lead of a district in Thailand (probably disruptive), so I think we can lower the edit_delta limit to more than 1800 bytes. Codename Noreste (talk) 08:20, 16 January 2025 (UTC)
- Diff 1 (+14,121), Diff 2 (+2,981), Diff 3 (+402,411), Diff 4 (+19,845), all in the last two weeks. Edit: Another one (17,279) Nobody (talk) 06:08, 14 January 2025 (UTC)
I believe my suggested change to filter 707 below could work, along with your suggestion about blocking reports with more than 2500 bytes. Note that I removed the old and new_size condition logic because only the edit_delta works for some reason.
page_id == 26204397 & /* False positives reports page */
!("confirmed" in user_groups) &
(
(
/* Removal or modification of the page's headers */
contains_any(
removed_lines,
"__NONEWSECTIONLINK__",
"__NOINDEX__",
"<noinclude>",
"{{Wikipedia:Edit filter/False positives/Header}}",
"{{shortcut|WP:EF/FP/R|WP:EFFPR}}",
"</noinclude>"
)
) | (
/* New or anonymous users blanking or modifying reports */
edit_delta <= -250 |
/* False positive reports containing more than 2500 bytes */
edit_delta >= 2500
)
)
Codename Noreste (talk) 15:06, 16 January 2025 (UTC)
- Surely this will also disallow good faith edits where the person pasted their edit into the description? – 2804:F1...70:9D36 (::/32) (talk) 15:46, 16 January 2025 (UTC)
- I probably assume that you meant when people remove duplicate reports, or withdraw their own reports? On the other hand, see here. Codename Noreste (talk) 15:48, 16 January 2025 (UTC)
- No, I mean the thing the page notice warns users against doing but people do anyways:
Please also note that there is no reason for you to paste the content of your edit here. The edit you tried to do is visible to others, and sometimes the same filter which stopped you earlier may stop you again. This is especially the case when including external links.
- If this filter will also disallow that, as those cases are usually large edits, and there's consensus to do so, a different disallow message than the current MediaWiki:Abusefilter-disallowed-EFFPR might be more appropriate. – 2804:F1...70:9D36 (::/32) (talk) 15:53, 16 January 2025 (UTC)
- All users (and those who can see private filter log entries) can see the attempted edit, so I'm not sure about the case regarding on disallowing good edits to EFFPR. I haven't seen any recent good edits to that page that have more than 2500 bytes, yet. Codename Noreste (talk) 16:02, 16 January 2025 (UTC)
- and also, it seems that some people are removing or disrupting headers from edit filter related pages (not just EFFPR), see Special:Diff/1270484744 and Special:Diff/1270358969 (both EFN), and Special:Diff/1270356788 (EFR). For these three, I believe we can create a new filter by sending a request to the mailing list, or somewhere?Cc to 1AmNobody24 who started the thread. Codename Noreste (talk) 12:58, 20 January 2025 (UTC)
- That gets picked up by filter 1151 sometimes, but it could be added to 707. Though I wonder if filter 809 (private) isn't the better place for it. Nobody (talk) 13:32, 20 January 2025 (UTC)
- I'm not sure of adding to 809 without commenting on other specifics, but perhaps a new filter could do because 707 uses a custom message, while for the new filter, we could use the default disallow message. Codename Noreste (talk) 13:59, 20 January 2025 (UTC)
- Honestly, @Codename Noreste: I'm not sure why any filter preventing the disruption of edit filter-related page headings would need to be private (707 after all is public) since this is just regular vandalism. So I think a request to EFR or EFN would be fine. – PharyngealImplosive7 (talk) 20:47, 20 January 2025 (UTC)
- I'm not sure of adding to 809 without commenting on other specifics, but perhaps a new filter could do because 707 uses a custom message, while for the new filter, we could use the default disallow message. Codename Noreste (talk) 13:59, 20 January 2025 (UTC)
- This and this are also examples of edits that are clearly not constructive and should be stopped. Nobody (talk) 10:49, 24 January 2025 (UTC)
- That gets picked up by filter 1151 sometimes, but it could be added to 707. Though I wonder if filter 809 (private) isn't the better place for it. Nobody (talk) 13:32, 20 January 2025 (UTC)
- and also, it seems that some people are removing or disrupting headers from edit filter related pages (not just EFFPR), see Special:Diff/1270484744 and Special:Diff/1270358969 (both EFN), and Special:Diff/1270356788 (EFR). For these three, I believe we can create a new filter by sending a request to the mailing list, or somewhere?Cc to 1AmNobody24 who started the thread. Codename Noreste (talk) 12:58, 20 January 2025 (UTC)
- All users (and those who can see private filter log entries) can see the attempted edit, so I'm not sure about the case regarding on disallowing good edits to EFFPR. I haven't seen any recent good edits to that page that have more than 2500 bytes, yet. Codename Noreste (talk) 16:02, 16 January 2025 (UTC)
- I probably assume that you meant when people remove duplicate reports, or withdraw their own reports? On the other hand, see here. Codename Noreste (talk) 15:48, 16 January 2025 (UTC)
Keyboard mashing filter?
[edit]- Task: What is the filter supposed to do? To what pages and editors does it apply?
The filter is intended to catch "keyboard spam" edits (things along the line of "ajksljhgfhlasjaewzxcvo"). The way I believe this could be implemented is with a filter that catches strings of length 5 that contain only lowercase consonants (y is a vowel in this case). For example, in the example given above, the substring "jklsj" would be caught and flagged. Should only apply for main space edits and only for IPs to avoid usernames triggering the filter. Exception needed for links. I don't know what regex has in its capabilities so I don't know if this is possible. I'm worried about edits on other language scripts messing it up.
- Reason: Why is the filter needed?
This is a relatively common pattern of vandalism; the diffs below were collected over a span of a single, non cherry-picked hour.
- Diffs: Diffs of sample edits/cases. If the diffs are revdelled, consider emailing their contents to the mailing list
Wildfireupdateman :) (talk) 17:50, 13 January 2025 (UTC)
- If this is done, I would suggest a longer string length than 5. For example, place names in Wales where "w" is effectively a vowel, such as Cwmbran, Amlwch or Pwllheli, may regularly have five consonants in a row. Not to mention occasional normal English plurals such as "strengths". Black Kite (talk) 08:49, 14 January 2025 (UTC)
- Have you given some thought to compounds such as Knightsbridge and Catchphrase, names like Goldschmidt and Norbert Pfretzschner, technical articles like HTML color names (white is #FFFFFF; see also hex for color names Blanched almond, Gainsboro, Lemon chiffon, Navajo white, Pale turquoise, and Snow); the parenthetical phrase in the first line of The Adventures of Mr. Nicholas Wisdom, and non-English content (notably German compounds) such as Handschriftencensus (6), Selbstschutz (7), and Rechtschreibreform (7). But I believe these examples are rare, and that there are no 8-letter examples, so you can probably whitelist all of these. There might be a portion of an article that covers keyboard spam with examples, and you might have to whitelist that, too. Mathglot (talk) 10:31, 14 January 2025 (UTC)
- I didn't think of those. It appears that in addition to the filter below, there are way too many exceptions to work properly. I'm going to retract this request but I don't know how; can someone help out? Wildfireupdateman :) (talk) 20:16, 14 January 2025 (UTC)
- There IS a filter for this:
- It works almost exactly as suggested as well, even the exception for links, with the difference being it looks for 9 characters, not 5.
- At any rate, perhaps the filter could be improved - for example, it didn't catch the second example because the edit edited a line starting with a pipe (
|
), why do we exclude edits that do that? - That change was done here in 2012, which changed it from excluding edits that left a line like
|-
or|.
in the article to ones that edit any line starting with a pipe or an exclamation mark. - The filter did not catch examples 1 and 3 because of the aforementioned vowels before it reached 9 'repeating' characters. – 2804:F1...87:8192 (::/32) (talk) 15:32, 14 January 2025 (UTC)
possibly misleading newcomer task
[edit]- Task: Tag possibly misleading newcomer tasks with "possibly misleading newcomer task".
- Reason: Most of the newcomer task edits I have saw are straight up vandalism.
- Diffs: Special:Diff/1265081093
M.C. (talk) 08:41, 21 January 2025 (UTC)
- You can already filter for edits tagged with newcomer task at RecentChanges, I don't see how we could improve on that with an edit filter. Nobody (talk) 08:56, 21 January 2025 (UTC)
- I agree, it's very hard to make a filter for these edits. Codename Noreste (talk) 11:00, 21 January 2025 (UTC)
Implementing the balanced editing restriction
[edit]WP:ARBPIA5 has been closed, and one of the successful remedies was to authorize the imposition of the "balanced editing restriction" on an editor if it is found that it would be a net positive for the project were the user to lower their activity in the topic area, particularly where an editor has repeatedly engaged in conflict but is not being intentionally or egregiously disruptive.
Doing so would require an edit filter that would ensure that [i]n a given 30-day period, a user under this restriction is limited to making no more than one-third of their edits in the Article, Talk, Draft, and Draft talk namespaces to pages that are subject to the extended-confirmed restriction under Arab–Israeli conflict contentious topic procedures.
See also: Wikipedia:Arbitration/Requests/Case/Palestine-Israel articles 5 § Balanced editing restriction. JJPMaster (she/they) 00:11, 24 January 2025 (UTC)
- It seem 1339 has already been written by SilverLocust. It looks about right to me, at least as a starting point. I can only see this as a log that interested editors are going to have to go through, rather than a filter which actually limit edits through the filter. -- zzuuzz (talk) 00:20, 24 January 2025 (UTC)
- Later today I'm expecting to edit the filter a bit. SilverLocust 💬 00:23, 24 January 2025 (UTC)
- I have edited it to exclude bots, non-extended-confirmed users, and namespaces other than Article, Talk, Draft, and Draft talk. I'd welcome suggestions for improving it, preferably at the arbitration clerks' noticeboard. SilverLocust 💬 05:07, 24 January 2025 (UTC)
- Later today I'm expecting to edit the filter a bit. SilverLocust 💬 00:23, 24 January 2025 (UTC)