Jump to content

Wikipedia:Bot requests

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by 93.173.38.154 (talk) at 11:28, 5 February 2024 (→‎Bot to automatically revert date change vandalism: new section). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

This is a page for requesting tasks to be done by bots per the bot policy. This is an appropriate place to put ideas for uncontroversial bot tasks, to get early feedback on ideas for bot tasks (controversial or not), and to seek bot operators for bot tasks. Consensus-building discussions requiring large community input (such as request for comments) should normally be held at WP:VPPROP or other relevant pages (such as a WikiProject's talk page).

You can check the "Commonly Requested Bots" box above to see if a suitable bot already exists for the task you have in mind. If you have a question about a particular bot, contact the bot operator directly via their talk page or the bot's talk page. If a bot is acting improperly, follow the guidance outlined in WP:BOTISSUE. For broader issues and general discussion about bots, see the bot noticeboard.

Before making a request, please see the list of frequently denied bots, either because they are too complicated to program, or do not have consensus from the Wikipedia community. If you are requesting that a template (such as a WikiProject banner) is added to all pages in a particular category, please be careful to check the category tree for any unwanted subcategories. It is best to give a complete list of categories that should be worked through individually, rather than one category to be analyzed recursively (see example difference).

Alternatives to bot requests

Note to bot operators: The {{BOTREQ}} template can be used to give common responses, and make it easier to keep track of the task's current status. If you complete a request, note that you did with {{BOTREQ|done}}, and archive the request after a few days (WP:1CA is useful here).


Please add your bot requests to the bottom of this page.
Make a new request
# Bot request Status 💬 👥 🙋 Last editor 🕒 (UTC) 🤖 Last botop editor 🕒 (UTC)
1 Automatic NOGALLERY keyword for categories containing non-free files (again) 25 10 Thryduulf 2024-08-04 01:54 Legoktm 2024-06-24 01:34
2 Can we have an AIV feed a bot posts on IRC? 8 3 Legoktm 2024-06-21 18:24 Legoktm 2024-06-21 18:24
3 Bot to update match reports to cite template BRFA filed 14 5 Yoblyblob 2024-06-20 21:21 Mdann52 2024-06-20 21:11
4 Bot to mass tag California State University sports seasons Doing... 5 4 Frostly 2024-06-10 17:05 Headbomb 2024-06-09 17:28
5 Clear Category:Unlinked Wikidata redirects 9 6 Wikiwerner 2024-07-13 14:04 DreamRimmer 2024-04-21 03:28
6 Fixing stub tag placement on new articles Declined Not a good task for a bot. 5 4 Tom.Reding 2024-07-16 08:10 Tom.Reding 2024-07-16 08:10
7 Bot to change citations to list defined references Declined Not a good task for a bot. 3 2 Apoptheosis 2024-06-09 17:44 Headbomb 2024-06-09 16:56
8 Adding Facility IDs to AM/FM/LPFM station data Y Done 13 3 HouseBlaster 2024-07-25 12:42 Mdann52 2024-07-25 05:23
9 Tagging women's basketball article talk pages with project tags BRFA filed 15 4 Hmlarson 2024-07-18 17:13 Usernamekiran 2024-07-18 17:10
10 Adding links to previous TFDs 7 4 Qwerfjkl 2024-06-20 18:02 Qwerfjkl 2024-06-20 18:02
11 Bot that condenses identical references Coding... 12 6 ActivelyDisinterested 2024-08-03 20:48 Headbomb 2024-06-18 00:34
12 Convert external links within {{Music ratings}} to refs 2 2 Mdann52 2024-06-23 10:11 Mdann52 2024-06-23 10:11
13 Stat.kg ---> Stat.gov.kg 2 2 DreamRimmer 2024-06-23 09:21 DreamRimmer 2024-06-23 09:21
14 Add constituency numbers to Indian assembly constituency boxes 3 2 C1MM 2024-06-25 03:59 Primefac 2024-06-25 00:27
15 Bot to remove template from articles it doesn't belong on? 3 3 Thryduulf 2024-08-03 10:22 Primefac 2024-07-24 20:15
16 One-off: Adding all module doc pages to Category:Module documentation pages 6 2 Nickps 2024-07-25 16:02 Primefac 2024-07-25 12:22
17 Draft Categories 7 4 DannyS712 2024-07-27 07:30 DannyS712 2024-07-27 07:30
18 Remove new article comments 3 2 142.113.140.146 2024-07-28 22:33 Usernamekiran 2024-07-27 07:50
19 Removing Template:midsize from infobox parameters (violation of MOS:SMALLFONT)
Resolved
14 2 Qwerfjkl 2024-07-29 08:15 Qwerfjkl 2024-07-29 08:15
20 Change stadium to somerhing else in the template:Infobox Olympic games Needs wider discussion. 8 5 Jonesey95 2024-07-29 14:57 Primefac 2024-07-29 13:48
21 Change hyphens to en-dashes 16 7 1ctinus 2024-08-03 15:05 Qwerfjkl 2024-07-31 09:09
22 Consensus: Aldo, Giovanni e Giacomo 15 4 Bsoyka 2024-08-02 20:48 Qwerfjkl 2024-08-02 20:23
23 Cyclones 1 1 OhHaiMark 2024-08-04 01:47
Legend
  • In the last hour
  • In the last day
  • In the last week
  • In the last month
  • More than one month
Manual settings
When exceptions occur,
please check the setting first.



SVG

Good day, can someone make a bot to run through this and append {{SVG-logo}} below the Non-free xxx template and add ==Summary== above the FUR template to files that don't have it? --Minorax«¦talk¦» 11:22, 7 November 2023 (UTC)[reply]

Something like Special:Diff/1183936918 --Minorax«¦talk¦» 11:23, 7 November 2023 (UTC)[reply]
Hey, @Minorax, sure I'll give it a shot. I'll let you know if I'm able to get something working and sent off to BRFA. Dr vulpes (💬📝) 06:19, 20 December 2023 (UTC)[reply]

Disabling categories on drafts

Ever since the idea of immediately moving inadequate articles to draftspace emerged as a common alternative to deletion, the amount of time that has had to be invested in cleaning up polluted categories that have draftspace pages in them has gone way up, because the people who do the sandboxing frequently forget to remove or disable the categories in the process — so I wanted to ask if there's any way that a bot can be made to clean up any overlooked stuff.

Since there's already a bot, JJMC89bot, that detects main-to-draft page moves and tags them as {{Drafts moved from mainspace}}, the easiest thing would probably be to just have that bot automatically disable any categories on the page at the same time as it's tagging it — but when I directly approached that bot's maintainer earlier this year to ask if this could be implemented, they declined on the basis that the bot hadn't already been approved to perform that task, while failing to give me any explanation of why taking the steps necessary to get the bot approved to perform that task was somehow not an option. As an alternative, I then approached the maintainer of DannyS712bot, which catches and disables categories on drafts that are in the active AFC submission queue (which newly sandboxed former articles generally aren't, and thus don't get caught by it), but was basically told to buzz off and talk to JJMC89bot.

So, since I've already been rebuffed by the maintainers of both of the obvious candidate bots, I wanted to ask if there's any other way to either get one of those two bots on the task or make a new bot to go through Category:All content moved from mainspace to draftspace disabling any active categories, so that editors can cut down on the amount of time we have to spend on DRAFTNOCAT cleanup. If possible, such a bot would ideally also do an ifexist check, and outright remove any redlinked categories that don't even exist at all, though just disabling redlinks too would still be preferable to editors having to manually clean up hundreds of categorized drafts at a time — it's just that merely disabling the redlinks creates another load of cleanup work later on when the draft gets approved or moved by its own creator without AFC review or whatever, so killing redlinks right away is preferable to simply deferring them for a second round of future cleanup. Bearcat (talk) 16:18, 8 November 2023 (UTC)[reply]

Technically, this is doable — without interfering DannyS712bot's task. But I also would like to know why this was rejected by these two bot operators, and at the BRfA as well. —usernamekiran (talk) 18:22, 8 November 2023 (UTC)[reply]
This seems like something we can also tackle at the source. Hey MPGuy2824. Does the WP:MOVETODRAFT script disable categories when draftifying? If not we should consider adding this feature. If so we may need to look at diffs to see where these undisabled categories are coming from (manual moves? old script?) –Novem Linguae (talk) 19:02, 8 November 2023 (UTC)[reply]
The MTD script wraps categories within {{Draft categories}} which disables them. (e.g. [1]) The older script disables categories by adding a ":" before the word "Category:". That leaves two possible culprits. 1. manual moves and 2. the regex in my script isn't catching all categories. Let me see if I can narrow it down by running a quarry or two. -MPGuy2824 (talk) 04:07, 9 November 2023 (UTC)[reply]
Maybe Bearcat can provide some diffs for us to examine. Would be interesting to see if any of these are being created by MoveToDraft, or something else. –Novem Linguae (talk) 05:24, 9 November 2023 (UTC)[reply]
I've only ever seen manual moves; in fact, I wasn't aware that MTD even existed. Bearcat (talk) 12:41, 9 November 2023 (UTC)[reply]
@Bearcat: I already have a bot task approved for adding {{Draft categories}}. The challenge for me is identifying which drafts have article categories. There's Wikipedia:Database reports/Polluted categories to find user pages with article categories, but I'm not aware of anything comparable for drafts. Going through Category:Pending AfC submissions looked like an easy start, especially since it's so small at the moment, but I didn't find any drafts with article categories. I'm open to further discussion. GoingBatty (talk) 21:36, 8 November 2023 (UTC)[reply]
Wikipedia:Database reports/Polluted categories (2) catches categories with drafts in them; the minor flaw is that it's currently not correctly recognizing the {{Polluted category}} template that used to flag maintenance categories as "don't bother with this because we don't care about it", so it's picking up things it doesn't need to pick up like Category:Miscellany to be merged and Category:Wikipedia Student Program. But even if that report is failing to react to that template properly, a bot could potentially be programmed to react to that template around whatever's stopping that report from reacting to it.
In the case of the particular issue I was asking about, just working directly with Category:All content moved from mainspace to draftspace itself is also an option: have a bot go through that, and disable categories that are on the pages in that category. That won't catch all categorized drafts by itself, but it will certainly catch the ones that are categorized because they're former articles that got moved into draftspace without the mover disabling the categories in the process — and at this point, that accounts for the majority of categorized drafts, so it would become easier for human editors to catch whatever's still left if we only have to deal with 25 or 30 per cent as many pages as we do now.
I genuinely doubt that there's any way to make a bot perfect at catching all improperly categorized drafts without ever missing any — but if we can get bots to deal with as many as possible, that still reduces the amount of time that human editors have to invest in worrying about it. So I don't think we need to shoot for "the magic bullet that will make a bot infallible at instantly catching every draft that ever gets categorized at all" — let's just aim for "where can a bot make as many dents in the problem as feasibly possible by working on defined targets". Bearcat (talk) 22:48, 8 November 2023 (UTC)[reply]
@Bearcat: Aha - thanks for telling me about that report! I ran the bot over the report, and manually cleaned up some drafts that had incorrect categories. I've added it to my favorites, so I can run the bot when it gets republished. Thanks! GoingBatty (talk) 04:16, 9 November 2023 (UTC)[reply]
Ah, ok. I think the reason you didn't know about it might be that the report you already knew about used to catch both user-polluted and draft-polluted categories in the same place — but then they were split up into two separate reports later on for whatever reason, so you might simply never have found out about the newer draft report. Bearcat (talk) 12:41, 9 November 2023 (UTC)[reply]
If you don't want to have your bot churn through 8000 pages most of which don't need action, then Category:Content moved from mainspace to draftspace from November 2023 (using the current month instead) would also work fairly well. * Pppery * it has begun... 23:32, 8 November 2023 (UTC)[reply]
Actually, yeah, that's a good alternative too. In reality, categorized drafts will virtually always be new pages that became categorized drafts within the past couple of days (and humans can catch the less common exceptions where a much older draft gets recategorized more than a month later), so the dated categories are likely more manageable chunks for a bot to grind through. Bearcat (talk) 23:48, 8 November 2023 (UTC)[reply]
Wouldn't it be possible to change the Draft namespace properties so that it's contents is never shown on category pages? Wikiwerner (talk) 15:32, 17 January 2024 (UTC)[reply]
No. — Qwerfjkltalk 15:33, 17 January 2024 (UTC)[reply]

Fixing duplicate banners

I asked at the help desk and I was told to ask here. The Organized crime task force and the Serial killer task force banners recently got added to the banner of Template:WikiProject Crime and Criminal Biography using parameters. The previous banners (as wrappers of the new one with the parameters) were mass substituted. This has left ~6700 (see Category:Unknown-importance Crime-related articles, not counting ones that didn't have an initial basic crime importance) duplicates, that have both the original crime importance and task force importance but split between two duplicate banners.

Is there any bot that can merge the importance values on the pages that have both templates so there aren't so many duplicates (for example if there's two duplicate banners, one of which has the importance for wp crime and one which has the task force importance, add them together)? Of course the ones that were not initially tagged with the original crime ones will have to be manually tagged as they don't have the basic importance parameter, but that's less than 500 which isn't as bad (compared to 6700 that already HAVE all the required importance parameters) PARAKANYAA (talk) 17:56, 11 November 2023 (UTC)[reply]

@PARAKANYAA: I just ran my bot against Category:Pages using WikiProject banner shell with duplicate banner templates, which edited about 550 talk pages (many WP:CRIME related) to remove a WikiProject template only if every parameter is included in a duplicate WikiProject template on the same page. You'll need someone else to clean up those where the two templates have different parameters that need merging. GoingBatty (talk) 06:16, 12 November 2023 (UTC)[reply]
@GoingBatty Someone else meaning it must be done manually or someone else's bot? PARAKANYAA (talk) 04:03, 16 November 2023 (UTC)[reply]
@PARAKANYAA: Other bot operators may be able to fulfill your requests. Doing things manually is also an option. GoingBatty (talk) 04:17, 16 November 2023 (UTC)[reply]
Well, I sure hope that someone's bot can... that would take a while. PARAKANYAA (talk) 04:21, 16 November 2023 (UTC)[reply]
to elaborate further on the problem and what I imagine the solution to be
a hypothetical bot should merge
{{WikiProject Crime and Criminal Biography|serialkiller=yes|serialkiller-imp=low|organizedcrime=yes|organizedcrime-imp=low}} (or having 1 or the other task forc parameters, just showing both for sake of example)
on the same page as either
{{WikiProject Crime|importance=low}} OR {{WikiProject Criminal Biography|importance=low}} (also called WikiProject Criminal which iirc has quite a few transclusions)
would combine to be
{{WikiProject Crime and Criminal Biography|importance=low|serialkiller=yes|serialkiller-imp=/nowiki>low|organizedcrime=yes|organizedcrime-imp=low}}
95% of the articles I've seen that have a crime duplicate have either the basic crimebio banner or normal crime (which doesn't matter, Crime/Crimebio/Crime and Crimebio are all the same now) with a crime importance parameter, and a separate banner with either the serial killer importance or the organized crime importance that lacks the crime project importance, so wiki categorizes it as unknown importance (though strangely WP 1.0's bot does not) PARAKANYAA (talk) 04:43, 21 November 2023 (UTC)[reply]
Y Done. Primefac (talk) 12:58, 24 December 2023 (UTC)[reply]

Date ranges for noticeboard archives

This seems uncontroversial.

It would be pretty cool if some kind bot could go through the pre-current (should never change) archives of the boards listed in the dramaboard archivebox series, extract the earliest and latest timestamps, truncate them to dates, and use those dates to annotate the links somehow. Inactive archives at time of writing are:

User story: I was recently trying to find an archived conversation from a few months ago, and the best tools I had available were a scattershot "tap an archive number, wait for the entire page to load, check top and bottom timestamps" and "search archives for exact string matched date". Improved navigability gained from annotating the archive links with date ranges should save people time.

Implementation ideas: The quickest implementation would just be a plaintext date range edited onto the archive list pages linked above. A further step could be to add a |date-span= (or similar) to {{Administrators' noticeboard navbox all}} which, if present, would display the date range of comments posted at the top of the page itself, so the information is available both on the archive page and the index of archives. The most elegant, stupid, and expensive implementation would be to add {{shortdesc}} to all the archives, set the |1= to the date range, and convert the indices to use {{annotated link}}.

Anyway though: Anyway though the first step is getting the date ranges. Maybe this is already in a report somewhere? Folly Mox (talk) 18:30, 11 November 2023 (UTC)[reply]

@Folly Mox, I suspect you'd need consensus to go through with this. Perhaps trying asking at those noticeboards first? — Qwerfjkltalk 19:06, 11 November 2023 (UTC)[reply]
Oh right I tacked on all those expanded scope ideas in the process of making the edit. Folly Mox (talk) 20:22, 11 November 2023 (UTC)[reply]
Just noting, an Index of some variety would probably be easier than going through the thousands of archives and amending them. Primefac (talk) 08:10, 12 November 2023 (UTC)[reply]
Yep. A user script could be useful for displaying date ranges without the need for edits. — Frostly (talk) 10:04, 7 January 2024 (UTC)[reply]

Would this need a bot?

Does the task at Wikipedia:Village pump (technical)#Implementation of Template:Refideas editnotice require a bot, or is there another way to accomplish that? You can respond there if you like. BOZ (talk) 05:42, 14 November 2023 (UTC)[reply]

N Not done. A different solution was figured out. QueenofHearts 01:45, 8 January 2024 (UTC)[reply]

Automatic NOGALLERY keyword for categories containing non-free files

I have noticed that many categories, especially content categories, include non-free files without the __NOGALLERY__ magic word, which is against WP:NFCC#9. I'd suggest using a bot to auto-tag such categories, skipping a whitelist for those categories covered by WP:NFEXMP (generally those categories concerning reviews of questionable files, such as CAT:FFD, and some maintenance categories that should contain no non-free files). –LaundryPizza03 (d) 12:01, 14 November 2023 (UTC)[reply]

I'm pretty sure that most categories on en.wikipedia should have __NOGALLERY__ and it would be actually smarter and less work to disable image showing on all categories by default (without requiring any code per page or bot work) and have a __YESGALLERY__ magic word for the much less instances of categories that actually could show images. Gonnym (talk) 12:38, 14 November 2023 (UTC)[reply]
I doubt that would be possible, since as a MediaWiki tweak that would have to apply to all Wikimedia wikis, many of which don't allow non-free files. –LaundryPizza03 (d) 02:48, 15 November 2023 (UTC)[reply]
From a coding perspective it could just be a setting that can be activated per wiki. I'd be opposed to any bot before other solutions are researched. --Gonnym (talk) 06:23, 15 November 2023 (UTC)[reply]
@Gonnym: Might want to report this request at https://phabricator.wikimedia.org/? I'd still like to hear from other people on whether a bot would be a good solution in the short term. –LaundryPizza03 (d) 01:22, 15 December 2023 (UTC)[reply]
I think galleries are useful to visually navigate the category, and that they should be kept enabled for free files. — Frostly (talk) 10:02, 7 January 2024 (UTC)[reply]

We have a lot of citations that could be improved using |author-link= eg. Special:Diff/1186331321/1186342802. The problem is it's difficult to match the correct author, it requires a human. Thus wondering if/how this might be automated in certain cases. It doesn't require every case only those it can match with greater certainty. For example we know, per the above diff, there is only one Steven Poole there is no dab page. And we know Steven Poole writes for a publication called Quercus. Thus any other cites that match those criteria, is a good bet that is the same person, and where an |author-link= could be added.

Is this method 100% foolproof? Probably not, but is it at least 99% accurate in matching names? Probably. I think a test run would show how reliable it is. I don't have the time right now but wanted to mention in case anyone wants to run an experiment. Or had other ideas. A dump of CS1|2 citations on enwiki - not including cite web - can be found here. I currently have updates disabled, but can restart if anyone wants. -- GreenC 20:27, 22 November 2023 (UTC)[reply]

One could involve Wikidata: if the corresponding item is a writer, then the author link can be added. Wikiwerner (talk) 20:58, 22 November 2023 (UTC)[reply]
I don't think that will help that much. Many authors cited on Wikipedia aren't "writers": probably as many or even more are scientists of some vein.
I'm not sure I have a better idea how to automate this though. Apparently Growth Team have a pretty good algorithm for suggesting appropriate wikilinks given topic and text, and I doubt their application will interface with template parameters, but solving the "what should link here" problem – of which this is a subtype – is not easy. Folly Mox (talk) 14:57, 1 December 2023 (UTC)[reply]
For a first pass, you might limit the suggestions to publications mentioned in the author's article. Perhaps exclude citations, so we don't link to a footballer of the same name whose transfer was reported in another edition of the same newspaper. Certes (talk) 15:21, 1 December 2023 (UTC)[reply]
I think look at existing uses of |author-link= and build a 2-column database: "Steven Pool = Quercus". Then find all other citations that cite Steven Pool and Quercus, and add the |author-link= if missing. It works backwards from what is know to be true. -- GreenC 19:42, 5 December 2023 (UTC)[reply]
Good plan. For extra safety, any names with two author-links that don't redirect to the same article can be filtered out into a log file without attempting to fix them. There probably won't be many. Certes (talk) 23:00, 5 December 2023 (UTC)[reply]
I really like this idea. Folly Mox (talk) 14:29, 6 December 2023 (UTC)[reply]
Incidentally, I did ask whether the future Suggested Links task will interface with citation template parameters, and although I got a response I'm still not sure. The conversation is at :mw:Talk:Growth/Feature summary § Suggested Links and interaction with non-prose text, in case anyone is interested. Folly Mox (talk) 13:20, 14 December 2023 (UTC)[reply]

Early idea

"The three most dangerous things in the world are a programmer with a soldering iron, a hardware type with a program patch, and a user with an idea."

— Rick Cook, The Wizardry Consulted

So I have an idea, and...

Is it possible for a bot to find articles that:

  • contain a number of words of readable prose (e.g., as calculated by Wikipedia:Prosesize) between x and y (e.g., 150 and 2,000) and also
  • contain fewer internal links per word than some simple mathematical formula (e.g., "less than three total" or "less than one link per 100 words")?

If a bot could automatically detect such articles, then I'd like to have it add the {{underlinked}} template, on a schedule of perhaps a few articles being tagged per hour, to feed the seemingly popular Category:Underlinked articles for the Wikipedia:Growth Team features, without giving a large number of articles to the first editor and then leaving none for anyone else.

I realize that this would require a demonstration of consensus, but I don't want to make the suggestion, get people's hopes up, and then find out that bots can't count the number of words or links in an article. WhatamIdoing (talk) 22:21, 30 November 2023 (UTC)[reply]

The tagging part seems like a CONTEXTBOT problem, but I can imagine a bot-generated report that listed the 1,000 articles with the fewest links per 100 prose words. Humans could then look through the report and refine the bot's criteria. If somehow the bot can be made to correctly identify underlinked articles without false positives, tagging might be possible. – Jonesey95 (talk) 01:22, 1 December 2023 (UTC)[reply]
Related question, for WhatamIdoing: What can we do to provide feedback about this newcomer linking activity? I see that at least some edits are adding undesirable disambiguation links. Is the tool suggesting these links? – Jonesey95 (talk) 01:25, 1 December 2023 (UTC)[reply]
The tool doesn't suggest any specific link. It pops up a box that says things like "All you need to do is add one or two links to make a difference." They're using the visual editor, so its link tool will de-prioritize (in the search results) and label dab links (so you can see that it's not the kind of page you were expecting). However, it doesn't tell you what a redirect points to, so if you have a redirect to a dab page, then you'll see 'redirect' and not know that it's a redirect to a dab page. WhatamIdoing (talk) 02:25, 1 December 2023 (UTC)[reply]
It might be worth mentioning that the Suggested Links task, scheduled to be enabled on en.wp next year, does not use maintenance templates as an inclusion criterion. So if timetables are not further extended, any success in this effort will apply only to a few months of newcomer activity. Folly Mox (talk) 15:01, 1 December 2023 (UTC)[reply]
Is word-counting and link-counting a realistic task, then? WhatamIdoing (talk) 02:28, 1 December 2023 (UTC)[reply]
WhatamIdoing, I don't know of any way to do it using queries, but running a bot on a database dump probably wouldn't be that hard. — Qwerfjkltalk 14:23, 1 December 2023 (UTC)[reply]
From a technical perspective, word counting and link counting are pretty straightforward to do. I explained how to implement prosesize word counts on my blog a while back, and that technique is used to power, among other things, Wikipedia:Database reports/Featured articles by size. Link counts are a simple database query or extraction from page HTML/wikitext. Unfortunately much of this work is blocked on the fact that the HTML dumps are currently created using proprietary source code. Legoktm (talk) 07:20, 5 December 2023 (UTC)[reply]
Thanks for that, @Legoktm. It sounds like word-counting could be done "today" (i.e., by adapting existing code). I'm not sure how to summarize what you said about link-counting. On the one hand, you say it's a "simple" query, but on the other hand, that it's blocked.
Is the database report for FAs the size at time of promotion, or the size today? Tpbradbury had been looking into that recently. (He's been hoping to find out whether there was a trend in FA size over time.) WhatamIdoing (talk) 17:58, 5 December 2023 (UTC)[reply]
The report I threw together shows the size at time of promotion, as requested. There may be other reports based on current size. Certes (talk) 18:56, 5 December 2023 (UTC)[reply]
Sorry, I wasn't very clear @WhatamIdoing. Individually, getting an article's prose size and link count is simple. Finding articles out of the entire wiki that meet those criteria isn't really feasible right now because of the lack of HTML database dumps. So if there's some other way to limit the number of articles to check, e.g. just looking at a few categories, that's probably doable. Legoktm (talk) 07:43, 7 December 2023 (UTC)[reply]
If I wanted to get this for, say, all the articles in Category:WikiProject Medicine articles, excluding articles in Category:Society and medicine task force articles, then it sounds like we (i.e., you/someone/not me) could make a one-time report that lists each article and the number of words and links in it, but an ongoing "monitoring" process would be less feasible. Am I closer to understanding this now? WhatamIdoing (talk) 18:54, 8 December 2023 (UTC)[reply]
@WhatamIdoing: I just tried writing some code and seeing how it went and here you go: User:WhatamIdoing/Possibly underlinked medicine articles (feel free to move elsewhere, etc.). It took about 15 minutes to generate the listing; definitely surprised me how fast it was. So I think it's fine to run as a regular thing on categories of roughly that magnitude, probably weekly?
I used the criteria you suggested at the beginning (between 150-2000 words and less than 3 links total), but we can change that based on what you find useful without really affecting the runtime.
So let me know if that list of articles is useful and if/how you'd like to move forward :) Legoktm (talk) 04:47, 16 December 2023 (UTC)[reply]
@Legoktm, thank you for this. Could you take a look at Motoric Cognitive Risk and Igor Smirnov (engineer), which are reported as having two links, but appear to have none? WhatamIdoing (talk) 02:56, 17 December 2023 (UTC)[reply]
I tagged all nine of the identified articles as being underlinked. Less than 48 hours later, all of them were improved, and all of the tags had been removed by the (apparently many) watchers of the category. WhatamIdoing (talk) 22:12, 18 December 2023 (UTC)[reply]
Thanks for pointing out those two articles - I found a bug in my code, so I re-ran the report and surfaced ~280 more articles that meet the criterion and have updated the listing. Should I turn this into a regular database report that updates every week? Legoktm (talk) 06:18, 29 December 2023 (UTC)[reply]
Probably not yet. Why is Academy of Medical Royal Colleges and Faculties in Scotland in the list? WhatamIdoing (talk) 06:35, 29 December 2023 (UTC)[reply]
List items don't count for prose size (see this), so those links aren't being counted. Legoktm (talk) 06:54, 29 December 2023 (UTC)[reply]
I'm not sure that using Prosesize's limitations makes sense for counting links. It's a bit tricky, because you don't want a "List of 100 most popular songs", with links to 100 articles about songs, in this list (and Prosesize will do that), but you also don't want a medium-sized ==See also== section to exclude an article with completely unlinked text (and counting all links in list-formatted text will do that).
Unless you have ideas about how to get around this, it's possible that this is not a suitable task for full bot automation. WhatamIdoing (talk) 18:03, 29 December 2023 (UTC)[reply]
Right, I think there are pros and cons both ways. The initial list missed a number of articles that were underlinked because the link was in a reference or something.
I'm also not interested in pursuing full automation, I think, if you find the list useful (but not necessarily perfect), to just regularly generate the list and let humans review it and decide whether to tag it or not. Legoktm (talk) 23:53, 29 December 2023 (UTC)[reply]

{od}

An update and some observations, in case anyone else is interested:

  • Status: My burn rate is about 50 articles per week. Most of these (shorter) articles get thoroughly linked within about 36 hours. I expect therefore to reach the end of this initial list about a week from now.
    • Links might be added faster for a subject area of broader interest (e.g., food or sports).
  • Edits: Most new contributors follow the suggestion/instruction to only add a couple of links. Consequently, an article might have ten or more editors make one, two, or three links before it gets removed from the list.
    • A significant fraction of the new editors who are editing these articles because they are listed in Special:Homepage for needing links have surprised me by trying to improve the article through other means, e.g., copyediting, adding citations, or (unfortunately/incorrectly) adding external links to other websites (like Red Cross). (Pinging Trizek (WMF))
      • I wonder whether this desirable "side effect" will evaporate under the new link-suggesting system.
  • Primary pain point: Un-tagging articles requires manual effort. I remove the tags myself sometimes, usually thanking anyone who has made a plausible contribution. Most days, other experienced editors get to it before I do.
    • Trying not to overwhelm the cat's dedicated patrollers is kind of tedious. It would be easy to review hundreds of articles in one go, and Special:Homepage would be happy to have a mass-dumping of all of them at once in Category:Articles with too few wikilinks, but it would make manually reviewing the articles to see whether it's time to remove the tag very difficult.
    • It might be nice to have a bot auto-remove the tag after edits from 10 new editors.
  • Future possibilities: After I've run through the previously unidentified 'backlog', I expect future runs to be much shorter, unless we change the criteria to be a little more expansive. It might be better to pick a different topic (maybe even deliberately aiming at a theme, like Wikipedia:WikiProject Africa during Black History Month in February and Wikipedia:WikiProject Women during Women's History Month in March).

WhatamIdoing (talk) 04:32, 26 January 2024 (UTC)[reply]

Thank you for the ping @WhatamIdoing!
@Folly Mox detailed a possibility to solve your problem: the Suggested Links task. It detects links and provide them to newcomers so that they can add the missing links. We will deploy it in the coming weeks, but the activation will be your community's responsibility.
It would be better to let newcomers adding these links. Easy tasks are easily solvable with a bot, but they are the best way to let newcomers discover that they can edit.
The number of links added through Suggested Links bu one use is default at 3. But this number can be changed by your community using special:EditGrowthConfig. These links are suggested on all articles where too few links are present, no matter if they are tagged or not. There is no banner to remove when enough links have been added, except on the already tagged articles.
The side effect you observed will be partially lost, as users won't be able to edit the entire article. But they have the possibility to continue editing the article in the standard way when they have published their added links. It also decreases the feeling some patrollers had at other wikis, that not all links are relevant: the tool won't suggest links into infoboxes or citations, and it won't suggest disambiguation pages.
As I write this message the Growth and Campaigns teams are working together on suggested tasks related to events. We can imagine creating focuses on topics, where newcomers would be invited to themes like you suggest.
Trizek_(WMF) (talk) 09:53, 26 January 2024 (UTC)[reply]

Protection fallback adminbot

Simple idea: monitor the protection log, and any time the protection level is increased, but the expiration time is decreased, wait until a few minutes before the expiration, and restore the status quo. If it really is the intention of the protecting admin to leave the page unprotected at expiry, they can leave a keyword like NOFALLBACK or something in the protection summary. An obvious complication would arise if the bot is lagging, and some edits slip in before protection can be restored, but that's a minor detail. Yes, I know about the PC trick, but people sometimes forget, and sometimes PC is isn't enough. Suffusion of Yellow (talk) 03:58, 2 December 2023 (UTC)[reply]

From WP:ADMINBOT this needs a wider discussion on WP:AN or WP:VP, though I think this is a good idea. 0xDeadbeef→∞ (talk to me) 06:08, 2 December 2023 (UTC)[reply]
Wouldn't it be best to find someone willing to operate the bot before proposing it to the wider community? Suffusion of Yellow (talk) 19:39, 2 December 2023 (UTC)[reply]
It's a chicken-and-egg problem, right? Operators don't want to invest time until there's consensus, and it's a waste of time to determine consensus if there's no operator...
It would be nice if we had a collective group of admin bot operators, so we don't need to rely on one single person volunteering, i.e. I'd be happy to contribute and work on such a protection bot but it would not be a good idea for me to do it alone.
P.S. Might be be easier to find an operator if this was a blue link. Legoktm (talk) 05:26, 16 December 2023 (UTC)[reply]
+1— Qwerfjkltalk 22:00, 16 December 2023 (UTC)[reply]
Subtle hint there, Legoktm. I like it. @Suffusion of Yellow: feel free to take them up on that advice . More admin bot ops couldn't hurt. As for this idea, that does definitely seem like a doable one (technically speaking), though I am not entirely sure at just past 1am local time (my timezone) how I would implement that...but TheSandBot is an adminbot and I would be happy to entertain the idea.
I have played around with monitoring event logs on Commons, so I am sure I could find the log. The main question would honestly be keeping track of the changes and how to jump/kick-start the system. Though, as I think this through writing this, I guess it wouldn't necessarily need to have data pre-populated as it could just create entries etc (i.e. in a database) based on protection level changes that come in. Before this went to a community discussion, I would definitely like to hammer out some of the details/concepts before attaching my name to it as a bot-op. If you have any thoughts on either implementation or conceptually how this would work further, I am definitely all ears (seriously/no sarcasm). TheSandDoctor Talk 09:11, 24 December 2023 (UTC)[reply]
Oh and, just re-reading your message @Legoktm:, I would be happy to tag-team something as well, potentially. I am thinking that the bot itself might not actually have to be hyper complicated. Could potentially just, at least for part of it, watch/listen for events, shove them in a database table with some sort of action date field, and another component makes some sort of a change at that date/time. Hmm... TheSandDoctor Talk 09:15, 24 December 2023 (UTC)[reply]
Yeah, that's roughly what I was thinking as well. One suggestion I'd make is to store the "database" on a wiki page. This would provide 2 features: 1) it keeps all information on-wiki, which makes transferring the bot much easier since there's no separate database and 2) it provides an easily understandable opt-out feature for admins, since they could just remove it from the wiki page. So my idea of the workflow is:
  • Bot watches event stream (or polls the protection log), to identify instances of protection level being increased and expiration time being decreased.
    • If found, the bot will add an entry to a fully protected user page (aka the "database page"), and ping the protecting admin in the edit summary
    • Admins can undo the bot right away if they don't want it to apply. And if they don't want notifications, they can mute the bot.
  • The second part of the bot just watches the database page, identifies when the next protection change is needed, and sleeps until then.
    • When it's time, it restores the original protection level+expiry, and then removes it from the database page. And then sleeps until the next instance.
Seem reasonable and hopefully not too complicated? Legoktm (talk) 06:27, 29 December 2023 (UTC)[reply]
Thanks; that's much better than anything I could have come up with! And the first part (maintaining the page) can be done without needing community approval. Suffusion of Yellow (talk) 22:00, 31 December 2023 (UTC)[reply]

Automatic tagging of some easily-identified non-English language text

This probably should not be implemented trivially for languages written in the Latin script, but with a few caveats, it seems pretty doable to write a bot that scours articles, and while staying out of appropriate templates, tags text using existing templates like {{lang}} as either being in a specific language, or at least being in some language written in a particular script, e.g. und-Hani or und-Cyrl as per the obligatory HTML |lang= parameter and ISO 639. If there is und text already tagged, it makes it much easier to see whether 漢字 is lang=ja-Hani or lang=zh-Hant, and also to quickly retag everything en masse.
If we are getting dangerous, I can think of multiple ways to further discriminate between, say, Japanese and Chinese-language text beyond simple checking for strings of CJK ideographs. Remsense 21:16, 6 December 2023 (UTC)[reply]

I'm wondering if this may fall afoul of WP:COSMETICBOT, since it doesn't alter the rendering of the page, but I feel like COSMETBOT exceptions have been carved out before for changes that alter the presentation of a page via speech synthesis, which this would do (and, AFAIA is the primary reason we tag non-English terms like this). As a minor note, I don't think it's necessary to specify in these templates the distinction between zh-hant and zh-hans or whatever they're called in the appropriate standards. Folly Mox (talk) 13:29, 7 December 2023 (UTC)[reply]
Oh no, I think a cosmetic argument is a non-starter, since it's a fundamentally semantic change core to the HTML standard itself. Not to be overly dramatic, but every HTML page that doesn't tag foreign language content is meaningfully running afoul of the standard, because it has likely explicitly declared at the top that the whole thing is in English. And no—I don't think Hans versus Hant is useful for most end users, but it's a further specification one could make that I decided to spell out for some reason. Remsense 14:24, 7 December 2023 (UTC)[reply]

template change

Hi, I would like to know whether a bot would be able to do this particular task or not. The task is to replace the existing format with the template like I did here on my sandbox to explain it better: [2]

The following articles: 2004 Andhra Pradesh Legislative Assembly election and 2009 Andhra Pradesh Legislative Assembly election require these template changes. Since I am finding this monotonous task quite difficult to do it myself, I am looking for help probably a bot might help I believe? Any info or help is appreciated. Thank you 456legend (talk) 05:52, 7 December 2023 (UTC)[reply]

Depending how many pages need editing, this could be a good job for AWB. Certes (talk) 09:39, 7 December 2023 (UTC)[reply]
@Certes All the following articles will need this template changes:
1. 2004 Andhra Pradesh Legislative Assembly election
2. 2009 Andhra Pradesh Legislative Assembly election
3. 2014 Andhra Pradesh Legislative Assembly election
4. 2004 Karnataka Legislative Assembly election
5. 2008 Karnataka Legislative Assembly election
6. 2013 Karnataka Legislative Assembly election
7. 2021 Tamil Nadu Legislative Assembly election
8. 2016 Tamil Nadu Legislative Assembly election
And there are few more in addition to these articles.. 456legend (talk) 13:41, 7 December 2023 (UTC)[reply]
Thanks. With a list that size, using AWB will be much quicker than writing a bot. Once you have a final list of articles, WP:AWB/Tasks should be able to help. Alternatively, a good programmers' text editor with regexp features should be able to do the job. Certes (talk) 13:57, 7 December 2023 (UTC)[reply]
Declined Not a good task for a bot. too small in scale/scope, as discussed above. Feel free to bring back here if scale/scope (in terms of number of pages impacted) massively increases. --TheSandDoctor Talk 09:03, 24 December 2023 (UTC)[reply]

A bot to clear Template Sandboxes X21 to X71

Hello, it has come to my attention that template sandboxes X21 to X71 are not automatically cleared by Cyberbot I, which clears template sandboxes X1 to X20, and the main template sandbox. So I think there should be a bot that clears the rest of the template sandboxes. This bot would be called "SandBot", and it would clear the template sandboxes at 00:00 UTC and 12:00 UTC every day. It would do additional help for Cyberbot I for clearing template sandboxes X21 to X71. This is only a proposed bot I had the idea to create. RandomWikiPerson_277talk page or something 19:58, 12 December 2023 (UTC)[reply]

RandomWikiPerson277, have you asked the operator of Cyberbot if they'd be willing to do this? — Qwerfjkltalk 20:40, 12 December 2023 (UTC)[reply]
Well, I did contact the operator of Cyberbot, I haven't got a message back yet/ RandomWikiPerson_277talk page or something 21:22, 12 December 2023 (UTC)[reply]
Just delete them. There's absolutely no good reason to have so many sandboxes, and in particular your creation of 53-71 seems to serve no purpose other than wasting others' time. * Pppery * it has begun... 21:24, 12 December 2023 (UTC)[reply]
I did not mean to waste others time. I just decided to add some more sandboxes for some reason. RandomWikiPerson_277talk page or something 15:01, 13 December 2023 (UTC)[reply]
Mass WP:TFD maybe? –Novem Linguae (talk) 20:35, 13 December 2023 (UTC)[reply]
21-52 survived Wikipedia:Miscellany for deletion/Template sandboxes X21-X52 in the past, although I think the outcome of that discussion would have been different if it had been at TfD since MfD is more inclusionist. * Pppery * it has begun... 20:47, 13 December 2023 (UTC)[reply]
Wikipedia:Templates for discussion/Log/2023 December 14#Template:X21Novem Linguae (talk) 09:14, 14 December 2023 (UTC)[reply]

Question

Can a bot make this kind of changes to multiple pages?

  • Before: {{abcd|ᚠ|ᚡ|ᚢ |ᚣ|ᚤ|ᚥ|ᚦ| ᚧ|ᚨ|ᚩ}}, {{abcd|Ꭰ|Ꭱ|Ꭲ|Ꭳ|Ꭴ}}
  • After: ᚠᚡᚢ ᚣᚤᚥᚦ ᚧᚨᚩ, {{abcd|Ꭰ|Ꭱ|Ꭲ|Ꭳ|Ꭴ}}

That is,

  1. Check if Template:abcd only contains [ ]?[ᚠ-ᛸ][ ]? in each parameter.
  2. If so, remove Template:abcd and |, but retain the text entered as parameters (including spaces).
  3. If not (= if Template:abcd contains characters other than [ ]?[ᚠ-ᛸ][ ]?), leave it as-is.

172.58.208.108 (talk) 19:19, 16 December 2023 (UTC)[reply]

Of course an unsupervised script can make a regex substitution across multiple pages, but the problem statement has been generalised beyond the point where a meaningful discussion can take place. (There is no Template:Abcd, and so it's also unclear whether you're actually talking about Futhark or using it as a placeholder.)
Can you give us an actual example of an article you'd like to make this sort of mass edit at, and describe the effect you'd like to achieve by doing this? Folly Mox (talk) 20:26, 16 December 2023 (UTC)[reply]
The template name "abcd" and the character range [ᚠ-ᛸ] are just placeholders. The actual request will use a different template name and a different character range. Anyway, I would like to know for sure if it is possible for a bot to make changes like what I wrote above. (The discussion has not begun, so I cannot give you more details at this moment.) 172.58.208.125 (talk) 23:02, 16 December 2023 (UTC)[reply]
Yes, there's no technical barrier to that sort of text processing. Folly Mox (talk) 23:21, 16 December 2023 (UTC)[reply]

My website runeberg.org just recently moved from http: to https: so it would be nice if someone could update the 11,000 links accordingly. This is not urgent, as everything works fine with automatic redirects, but it would be nice. Thank you. -- LA2 (talk) 22:33, 17 December 2023 (UTC)[reply]

@LA2: Is this a job for WP:URLREQ? Certes (talk) 22:53, 17 December 2023 (UTC)[reply]
@Certes: Indeed, it might well be. I'll post it there. --LA2 (talk) 22:55, 17 December 2023 (UTC)[reply]
Deferred. QueenofHearts 01:40, 8 January 2024 (UTC)[reply]

 Done -- GreenC 16:33, 9 January 2024 (UTC)[reply]

Small Question

This is my first time posting here, so no idea if this should be done by a bot. So, the "IPAlink" template has another variation "IPA link" (notice the space). The official representation is "IPA link" but I find the "IPAlink" variation also is quite predominant. This isn't urgent ("IPAlink" redirects to "IPA link"), but would a bot fix this sort of thing? PharyngealImplosive7 (talk) 18:08, 21 December 2023 (UTC)[reply]

The links are not broken, and changing them would be purely cosmetic. The small improvement might not justify the human or machine effort and the resulting pollution to the page histories. Certes (talk) 18:27, 21 December 2023 (UTC)[reply]
Yeah, that's what I thought. If my AWB request at PERM gets approved, I might do it myself, but it's fine. PharyngealImplosive7 (talk) 21:44, 21 December 2023 (UTC)[reply]
@PharyngealImplosive7: This is Declined Not a good task for a bot. for the reason above, and doing so with AWB manually would go against the WP:AWBRULES because it would have the same pollution to the page histories. You could update Wikipedia:AutoWikiBrowser/Template redirects to add a rule to change {{IPAlink}} to {{IPA link}}, and then AWB editors and bots will make the update when doing something else that improves the rendered page. GoingBatty (talk) 22:21, 21 December 2023 (UTC)[reply]
Ok. I see, and I won't update it manually. I'll just add it to the Template redirect AWB links you mentioned before. PharyngealImplosive7 (talk) 22:32, 21 December 2023 (UTC)[reply]

There are quite a few external links templates created in recent years (See Category:Social media external link templates) and when used they offer a consistent style and allow for error tracking among other things. However there are still quite a lot of external links that don't use these. Sometimes they are bare links, while others have some kind of text with them. Would it possible for a bot to convert external links in the external links section (links in the body should be ignored as I'm not sure if these templates work in the body correctly or not) to use one of the listed templates at the bottom? Here is an example of an edit with IMDb title.

Templates:

If this is controversial and needs discussion, please point me to where it should be held. Gonnym (talk) 15:45, 28 December 2023 (UTC)[reply]

@Gonnym: Template:IMDb name and Template:Official website are two I manually add frequently. Template:Rotten Tomatoes and Template:Metacritic are two more to consider. There are almost 600 entries in Category:External link templates. GoingBatty (talk) 18:58, 28 December 2023 (UTC)[reply]
@Gonnym ...plus more in its subcategories. GoingBatty (talk) 19:08, 28 December 2023 (UTC)[reply]
Maybe pick one template and start with that, to decrease the massiveness of this task. Get a consensus on a talk page somewhere, then someone can start working on a (now much smaller) bot task, then WP:BRFA it. Template:Google Scholar ID and some of the IMDB templates are the ones I find myself converting the most. –Novem Linguae (talk) 22:21, 28 December 2023 (UTC)[reply]
While the task is big I think having a bot do only one at a time runs the risk of it becomming a spam bot (and having editors complaining) as a lot of times there are more than one of these on a page (Coco Lee as an example of one). Gonnym (talk) 22:41, 28 December 2023 (UTC)[reply]
My thinking is to do two two BRFAs: one that just does one template and gets the process started, then a big one for everything else later. This avoids WP:TRAINWRECK issues with deciding which templates to cleanup, minimizes the amount of bugs that are likely to crop up during the first bot run, etc. Up to y'all though. –Novem Linguae (talk) 23:05, 28 December 2023 (UTC)[reply]
Two cents, take it or leave it: In my workplace for something like this we'd start with a basic test of one, then do a broader test of, say, 3-5, then engage in more widescale implementation. DonIago (talk) 14:56, 29 December 2023 (UTC)[reply]
Sure, sounds like a good plan. Gonnym (talk) 12:17, 9 January 2024 (UTC)[reply]
These templates have a serious drawback: standardized maintenance tools don't support them. IABot, Citation bot, WaybackMedic, reFill, etc.. pass them by like they don't exist. As a result there is a lot of link rot contained within these templates. IMO we are better off not having them at all, there are thousands, they don't reduce complexity and error, they add to it. They are islands of code no one is supporting due to the sheer number and variety. -- GreenC 04:41, 4 January 2024 (UTC)[reply]
Perhaps the template parameters can be somewhat standardized, so that bots can be modified to recognize templates in the category. — Frostly (talk) 05:43, 9 January 2024 (UTC)[reply]
@GreenC I'm sure you have much more experience than I do with link rot, but I don't see how this isn't an issue that can't be fixed. If something needs to be changed in the web address, this would ideally require only one edit instead of tens of thousands of edits. How is that not a better method to handle these simple links? Also, these aren't citation templates and should never be used as such (regarding the list of bots you mentioned). Gonnym (talk) 12:20, 9 January 2024 (UTC)[reply]
I think GreenC's concern is more what happens if the URL changes in a fashion that isn't easily dealt with. I can't think of any specific examples offhand (though I do know they exist) but to arbitrarily make up an example, if the URL changes from https://example.com/person/<number assigned to person> to https://example.com/profile/<persons_full_name>, the template cannot be updated in a fashion that will result in a meaningful change, since all we have on the template calls is {{example|<number>}}. On the other hand, I think at least one of the URL bots has the ability to match old to new, so if it sees https://example.com/person/<number> in the text directly it can update to the new code. Either way a bot will need to update everything, but with the latter case (again, assuming it's possible) there is already a bot that can do that functionality.
In other words, an elink bot will notice a change in URL if the URL is in the article, but a user has to notice a dead link if it's in a template. Primefac (talk) 12:27, 9 January 2024 (UTC)[reply]
Ah ok. So worst case, both template and non-template links need to be all edited. In best case, a single edit to the template and all non-template links need to be edited. Hopefully a bot can be coded to handle the templated data and then there isn't even that problem. Gonnym (talk) 12:31, 9 January 2024 (UTC)[reply]
"Hopefully a bot can be coded" .. That's the problem. There are thousands of these external link templates. We have limited numbers of programmers interested and able to do this kind of specialized work. Incorporating these templates into complex bots is difficult and time consuming, and for any one template limited impact, so we triage it and ignore them. A couple large templates like {{official}} might get supported. Don't forget, these templates change so if a bot supports the template, and someone changes it, the bot has to be updated to avoid making errors. And this is just Enwiki there are over 300 Wikipedias, plus hundreds more in other projects. And the underlying code of saving dead links is quite complex to do correctly, only a few programmers have this down, these are large complex tools that have taken many years of development. A BOTREQ to make a dead link fixer, for one template, doesn't make sense. At best, a bot that converts templates to CS1|2 or square-link, then run the archive bots. -- GreenC 16:30, 9 January 2024 (UTC)[reply]
GreenC, I wonder how hard it would be to parse the templates. They seem fairly formulaic. — Qwerfjkltalk 16:59, 9 January 2024 (UTC)[reply]
Whatever the case, before a bot can do anything, the very first thing is all these templates need parameters for inputting an archive URL plus rendering output when the link is dead. I suppose that is easy. Try 50 template at random and see how long it takes, don't forget to update the docs. If the templates are formulaic it shouldn't be hard. -- GreenC 22:09, 9 January 2024 (UTC)[reply]
@GreenC what are the parameters you want added? |url-status=, |archive-url= and |archive-date=? Gonnym (talk) 12:50, 10 January 2024 (UTC)[reply]
This is what most bots and tools understand by default. It's the right choice even if it gives the false impression of being a CS1|2 template which can cause some editors to reflexively add unsupported arguments like access-date. Bots and tools need some way to know if a template supports the parameters, they can't assume support exists. There are a couple ways to do this: In the template documentation include a TemplateData section eg. Template:Webarchive#TemplateData where the parameters are listed. There is also Category:External link templates with archive parameter which is probably easiest/best method. Or both because TemplateData still is useful for other things. Update: I just noticed in Category:External link templates with archive parameter the templates use |archive= like {{2006 Commonwealth Games profile}} - this looks like an alternative method in use, most of those templates are sports related so it was probably conceived by a few editors at some time.
According to Category:External link templates there are only about 700 in this category and sub-category excluding Wikidata. There is also Category:Citation templates and sub-cats which is well over 1,000. -- GreenC 16:47, 10 January 2024 (UTC)[reply]
Another idea is citation templates use the |archive-url= trio, and external link template use simply |archive= which if it exists the template renders this URL as a replacement for what it would have rendered. It's going to be template-specific how to best approach this. Anyway, if it's true Category:External link templates with archive parameter is only for templates that use |archive=, it will be important to have a new category for Category:External link templates with archive-url parameter, so bots and tools can differentiate which parameters to use. -- GreenC 16:58, 10 January 2024 (UTC)[reply]
┌───────────────────────────┘
Maybe some sort of meta tempate would be helpful here, to standardise the templates. — Qwerfjkltalk 17:14, 10 January 2024 (UTC)[reply]

Hey all! For some background here, for TWL users to access Newspapers.com, the library sends them through a proxied domain at https://www-newspapers-com.wikipedialibrary.idm.oclc.org/. This often results in this domain name making its way into the mainspace, which is problematic because it can only be accessed by those with access to TWL.

JPxG has set up a way to replace these links with the unproxied domain using JWB (see more info and an example edit), but I feel like this is an area where a bot could step in.

Citation bot is able to clean these links up automatically (see an example edit), but it has to be triggered manually. These proxy URLs are not automatically placed in a category, which means a human editor would need to assemble a list of pages to be fixed for Citation bot to even look at them. Citation bot also wouldn't deal with these links outside of citations, such as with external links.

It's worth noting that I've previously filed a tangentially similar BRFA, which was denied as Citation bot would be easier to use and give better results. With these links, however, I don't think that's the case, mainly because Citation bot is tedious to trigger on these pages, but also because Citation bot doesn't even touch other proxied URLs, only Newspapers.com.

I'd love to make this happen using Pywikibot, but based on my previous BRFA I wanted to see some thoughts on this being fully automated. This task is already being done semi-automatically way through JWB, so I think it might as well be fully automated, potentially expanding additionally to other TWL-proxied sites. (Citation bot doesn't even touch other proxied URLs, only Newspapers.com.)

(CCing Headbomb for your thorough comments on the previous BRFA—would love to hear your opinion especially.) Bsoyka (talk) 18:32, 28 December 2023 (UTC)[reply]

Yeah, there should be a bot for this. The JWB regex is dead-on-the-marmot simple, and never pops false positives (there's simply never any legitimate reason to link to www-newspapers-com.wikipedialibrary.idm.oclc.org in mainspace so you can just replace it in ns0 indiscrim[...]ly). I run through every couple weeks and people always give me like 14 thankses for it so it seems like a particularly loved task. jp×g🗯️ 18:36, 28 December 2023 (UTC)[reply]
I support this bot and would further support a task like Qwerfjkl bot Task 17, to post on the usertalk of editors who leave links like this and tell them to stop doing it. Folly Mox (talk) 18:54, 28 December 2023 (UTC)[reply]
Folly Mox, my bot only tracks categories. Is there an error category for this? — Qwerfjkltalk 19:04, 28 December 2023 (UTC)[reply]
@Qwerfjkl: Not as far as I know. The replacement process itself would be built on a simple regex find/replace. Bsoyka (talk) 19:05, 28 December 2023 (UTC)[reply]
Bsoyka, I know, I'm talking about notifying users. But thinking about it now, there's not much point if they can just be fixed. It would be trivial to setup a daily run on this with a bot. — Qwerfjkltalk 19:29, 28 December 2023 (UTC)[reply]
hrrmery mayhapsicles it could mention https://github.com/jp-x-g/PressPass which autoformats newspapers.com citations on firefox and chrome and autofixes this exact issue B^) jp×g🗯️ 19:33, 28 December 2023 (UTC)[reply]
JPxG, didn't recognise you for a moment there! I'm used to a green username, not blue. Congratulations on your adminship! — Qwerfjkltalk 19:33, 28 December 2023 (UTC)[reply]
Seems like this has some general support to become fully automated. I'll give this a bit more time to sit for anyone else who wants to comment then work up some quick code and get a BRFA going. (Just trying to avoid what happened with my last one—thanks for all the feedback so far, everyone!) Bsoyka (talk) 19:33, 28 December 2023 (UTC)[reply]
As far as categories go, tracking categories could be built-in CS1/CS2 templates like Category:CS1 maint: unflagged free DOI and then Citation bot can run over them. The place to raise that is at Help talk:CS1. This is my preferred option, personally, as far as my own individual opinion is concerned.
As for bots doing this, two options. You build a list of articles and feed it to Citation bot (either via a page of links, or separated by pipes). Or you have a dedicated bot fixing that stuff.
Or you can do both, but the tracking category is what needs the least coding. Headbomb {t · c · p · b} 22:28, 28 December 2023 (UTC)[reply]
@Headbomb: While I like the idea of a CS1 maintenance category paired with Citation bot to fix these, that wouldn't work for any TWL links other than Newspapers.com as far as I know.
Take, for example, this version of an article and its references 11 and 14. They link to doi.org through the TWL proxy, but Citation bot doesn't even touch them and they have to be replaced manually (or semi-automatically similar to JPxG's JWB settings mentioned above). (Edit: Probably not the best example since the DOIs are invalid to begin with, but I think the idea is still there.)
This is why I think a separate bot task would be useful—Citation bot only deals with proxied Newspapers.com links, but there are tens of other sources going through TWL proxies that it won't handle. Bsoyka (tcg) 22:45, 28 December 2023 (UTC)[reply]
It deals with many TWL proxied links, not just newspaper.com ones. It might not deal with all of them, which is a great argument to improve Citation bot by providing it with the full list of proxies used by TWL.
But again, a separate bot specifically on this is also not a bad idea. We'd just lose many of CB's other fixes, but proxied links are bad enough to be fixed on their own. Headbomb {t · c · p · b} 22:48, 28 December 2023 (UTC)[reply]
Oh, it does? Apologies, I had tested it a few times with other proxied sites and didn't get any more hits.
Appreciate the feedback though—I'll get started on a script and BRFA for now and we'll see where it goes. Bsoyka (tcg) 22:53, 28 December 2023 (UTC)[reply]
BRFA filed: Thanks for the feedback everyone! I'll point you in the direction of this new BRFA for further discussion. Bsoyka (tcg) 01:32, 29 December 2023 (UTC)[reply]
I will go vouch for it if necessary. It's always fun to push the big button on a JWB run but it's imperative that this problem have a solution beyond "JPxG makes an embarrassing mistake every couple weeks and wants to hide it behind a couple dozen JWB edits" jp×g🗯️ 10:55, 29 December 2023 (UTC)[reply]
It appears quite a lot in other namespaces (see Special:LinkSearch/www-newspapers-com.wikipedialibrary.idm.oclc.org), I assume it should also be fixed there? — Qwerfjkltalk 11:22, 29 December 2023 (UTC)[reply]
No, TWL links in other namespaces are likely intentional and not bad practice. I'll often drop a TWL link on a talk page if I'm discussing a source, with the knowledge that the intended audience (other Wikipedia editors) has proxy access. Folly Mox (talk) 11:32, 29 December 2023 (UTC)[reply]
Just going to update this with a quick Y Done tag! Bsoyka (tcg) 03:08, 8 January 2024 (UTC)[reply]
@Bsoyka Thanks for working on this! Samwalton9 (WMF) (talk) 12:24, 8 January 2024 (UTC)[reply]
Tangentially, @Trizek (WMF), could you talk to the Editing team about whether the citoid service could automagically not add these URLs in the first place? WhatamIdoing (talk) 18:48, 28 January 2024 (UTC)[reply]
T356056 documents the need. Trizek_(WMF) (talk) 14:38, 29 January 2024 (UTC)[reply]
@Bsoyka:, going by a search with insource:wikipedialibrary.idm.oclc.org which throws about 643 hits in mainspace, I wonder if expanding the scope to other wikipedialibrary domains would be warranted. It seems like there are a lot of links to that proxy. Jo-Jo Eumerus (talk) 09:55, 29 January 2024 (UTC)[reply]
@Jo-Jo Eumerus: Thanks for the note! This is definitely on my radar and something I plan to gradually implement. I ensured my BRFA was phrased to allow expanding the scope like this, and I'm tracking progress on GitHub. Bsoyka (tcg) 22:07, 29 January 2024 (UTC)[reply]

Is there a bot

Category:Pages using WikiProject Film with unknown parameters, a maintenance category which exists to flag problems where a use of {{WikiProject Film}} on a talk page is calling parameters that don't exist to be called, currently has 4,808 articles in it — and after looking at it and cleaning up the tiny single-digits handful of exceptions that existed anywhere after the letter B, I was able to determine that the remaining contents all relate entirely to an old, long-deprecated practice whereby B-Class articles in that queue were each also tagged as b1=[y/n], b2=[y/n], b3=[y/n], b4=[y/n] and b5=[y/n] for their individual success or failure in meeting each of the five B-Class criteria listed at Wikipedia:WikiProject Film/Assessment. That's long since been deprecated and isn't done anymore, which is why those are landing as unknown parameters now — but with 4,808 articles to deal with, actually cleaning them up is more work than any human editor would ever actually be inclined to undertake.

Accordingly, I wanted to ask if there's any bot that can be set loose on the task of stripping b#= parameters from the contents of that category. Bearcat (talk) 17:20, 1 January 2024 (UTC)[reply]

Yes. I'll try to get to this in the next few days. Primefac (talk) 17:42, 1 January 2024 (UTC)[reply]
@Bearcat: BattyBot 79 is going through all the subcategories of Category:WikiProject templates with unknown parameters, and will get to this if Primefac doesn't get to it first. GoingBatty (talk) 17:51, 1 January 2024 (UTC)[reply]
Okay, no worries. It isn't urgent or anything, I was just wondering if it was possible — so if it is, it's all good. Bearcat (talk) 17:53, 1 January 2024 (UTC)[reply]
Heh, thanks for the reminder GoingBatty, somehow managed to forget I approved that task... yesterday... Primefac (talk) 17:54, 1 January 2024 (UTC)[reply]
@Primefac: No worries - it's hard to remember what we did last year.  ;-) GoingBatty (talk) 00:15, 2 January 2024 (UTC)[reply]
@Bearcat & @Primefac: Doing... it will take about 13 hours, plus any manual work afterwards to manually clean up edge cases. GoingBatty (talk) 04:28, 4 January 2024 (UTC)[reply]
@Bearcat & @Primefac: Y Done! GoingBatty (talk) 02:37, 5 January 2024 (UTC)[reply]
Awesome! Much thanks! Bearcat (talk) 04:32, 5 January 2024 (UTC)[reply]

Add parameter for WikiProject Africa/The 10,000 Challenge

For the 5832 articles listed at Wikipedia:WikiProject Africa/The 10,000 Challenge please add |AFR10k=yes to the project banner {{WikiProject Africa}} on the talk page. This adds a note to the banner and also populates Category:Articles created or improved during the WikiProject Africa 10,000 Challenge. Thanks — Martin (MSGJ · talk) 19:19, 3 January 2024 (UTC)[reply]

BRFA filed; seems simple enough. Primefac (talk) 20:35, 3 January 2024 (UTC)[reply]
Many thanks! — Martin (MSGJ · talk) 20:43, 3 January 2024 (UTC)[reply]
BRFA filed by @Primefac. GoingBatty (talk) 16:03, 5 January 2024 (UTC)[reply]

GNU/Linux or GNU-Linux to replace Linux on Wikipedia

Hello,

GNU is the Operating System and Linux is one of its Kernels. Linux is not an Operating System. Hence, why I believe a bot should locate and correct these errors. Where Linux is mentioned, it should be changed to GNU/Linux or GNU-Linux. This request is being made for Richard Stallman, who has cancer. Twillisjr (talk) 16:07, 6 January 2024 (UTC)[reply]

This is a naming controversy well-known enough to have its own article, and I don't think the Wikipedia community would have consensus for having a bot mass-change every occurence of the word across over 18,000 pages. (Not to mention, what if the word is being used to refer to Linux as a kernel, not as an operating system?) Bsoyka (tcg) 16:19, 6 January 2024 (UTC)[reply]
Indeed, and even if this was uncontroversial the WP:CONTEXTBOT problem makes this Declined Not a good task for a bot. Taavi (talk!) 16:47, 6 January 2024 (UTC)[reply]
I'd like to interject for a moment, what you are referring to as Linux, is in fact GNU/Linux, or as I've recently taken to calling it, GNU+Linux. Linux is not an operating system unto itself, ..
Memes aside, I don't think this is a good task for a bot per above, even if Linux as an OS should always be referred to as GNU/Linux, this would have context bot issues (e.g. musl and Android (Operating System), and also Linux Foundation) 0xDeadbeef→∞ (talk to me) 17:07, 6 January 2024 (UTC)[reply]

Fix redirect link: Saint Francis University

I just moved Saint Francis University to Saint Francis University (Pennsylvania), because there will be a university also named Saint Francis University in Hong Kong (Caritas Institute of Higher Education acquires university title, Government of Hong Kong Press Release). The page "Saint Francis University" will be a redirect to University of Saint Francis. Before doing so I need to fix all pages with link to [[Saint Francis University]] and replace it with [[Saint Francis University (Pennsylvania)|Saint Francis University]] (or, if the link is [[Saint Francis University|something else]], just replace the link itself, not description), which I found hundreds. Is there a bot that can do this task for me? --Leeyc0 (Talk) 12:04, 9 January 2024 (UTC)[reply]

With only 345 links to the old target, I think WP:AWB/TASKS would be the better ask here. Primefac (talk) 12:15, 9 January 2024 (UTC)[reply]
Thanks for point out this too. I will have a look. --Leeyc0 (Talk) 12:22, 9 January 2024 (UTC)[reply]
Also, the WP:CONTEXTBOT problem makes this Declined Not a good task for a bot. GoingBatty (talk) 17:36, 9 January 2024 (UTC)[reply]

Bad citation fix for several hundred articles

Hi, I asked the following at the Help Desk, and they suggested asking here:

I noticed that there are a ton of pages tagged for needing verification from August 2022. All of the location ones really just need the first of the two notes citations (the one just going to census.gov) removed. Is there a way for someone to mass-fix this?

The note, as it is, is always in the Demographics section as:

"Note: the US Census treats Hispanic/Latino as an ethnic category. This table excludes Latinos from the racial categories and assigns them to a separate category. Hispanics/Latinos can be of any race.<ref>http://www.census.gov {{nonspecific|date=August 2022}}</ref><ref>{{cite web |title=About the Hispanic Population and its Origin |url=https://www.census.gov/topics/population/hispanic-origin/about.html |website=www.census.gov |access-date=18 May 2022}}</ref>"

It is the first of the two that needs to go, because the second has it covered.

To add: on all the pages I have fixed thus far with this error (see: recent Texas edits), it is the only note on the page, and always attached to a table with racial demographic data.

Thanks in advance! Edenaviv5 (talk) 16:18, 11 January 2024 (UTC)[reply]

@Edenaviv5: Doing... This is only 278 articles, so I'll do this manually via AWB (e.g. Special:Diff/1195891794). Future small requests can be made at WP:AWBREQ. GoingBatty (talk) 19:12, 15 January 2024 (UTC)[reply]
@Edenaviv5: Y Done! GoingBatty (talk) 19:36, 15 January 2024 (UTC)[reply]
Thank you, @GoingBatty! Edenaviv5 (talk) 22:01, 15 January 2024 (UTC)[reply]

DRV template bot

See this discussion: is there a bot that can assist us with the deletion review process? Jarble (talk) 19:46, 12 January 2024 (UTC)[reply]

Declined Not a good task for a bot. per the discssion there. — Qwerfjkltalk 18:13, 15 January 2024 (UTC)[reply]

Fixing uninvoked refrences

Deleting or otherwise removing errors from uncalled references. Geardona (talk to me?) 20:24, 17 January 2024 (UTC)[reply]

@Geardona: Could you please give some examples of these errors? GoingBatty (talk) 03:04, 18 January 2024 (UTC)[reply]
Yep, I will get them in a bit. Geardona (talk to me?) 13:31, 18 January 2024 (UTC)[reply]
Here is one (https://en.wikipedia.org/w/index.php?title=Malaysia_Airlines_Flight_370&diff=prev&oldid=1196258156) The refs were in the list but never appeared in the text, generating a cite error.(Cite error: A list-defined reference named "AutoVQ-40" is not used in the content.) Geardona (talk to me?) 14:08, 18 January 2024 (UTC)[reply]
I feel like there is a context issue here. Why are the refs there? Where they removed in the text by someone and they didn't realise the refs were below? Is there a typo? Is it a duplicate ref? Was it an accidental removal? Only some of these questions would have answers where "let's remove the reference entirely" would be the correct solution. Primefac (talk) 14:28, 18 January 2024 (UTC)[reply]
@Geardona: Agree with Primefac that this seems to require human review for context and therefore is Declined Not a good task for a bot. GoingBatty (talk) 16:12, 18 January 2024 (UTC)[reply]
Alright,  Thanks Geardona (talk to me?) 17:08, 18 January 2024 (UTC)[reply]
Honestly, I wish the software wouldn't error over unused list-defined references. It's a pain if you're just adding them in advance (which is necessary before using them), and if someone has bothered to add an LDR in the first place it should probably at least display under "Further reading" even if not cited. Also, I've never known another software that yells at me for defining a variable without using it anywhere. I'm really not sure what the point of the error message is, especially so big and shouty. Folly Mox (talk) 18:14, 18 January 2024 (UTC)[reply]
Not to re-open the bot discussion, but could there at least be a user-script that finds them easily, it is really annoying to have to use CMD+F to find the uncalled refs and then delete them. Geardona (talk to me?) 18:16, 18 January 2024 (UTC)[reply]
Geardona, if you're going to be working on this task, I might suggest moving the uninvoked list defined references into "Further reading" unless the source is unreliable, rather than outright deletion. I haven't looked into this sort of thing in depth so I have no idea what the distribution of cases is like, but citing the source in the article might also be an option. Folly Mox (talk) 18:23, 18 January 2024 (UTC)[reply]
Its mostly a visual editor issue (I think) where the user generates a second citation instead of re-using, leading to it just being there (speculation). Geardona (talk to me?) 18:32, 18 January 2024 (UTC)[reply]

List of Sejm members (2023–2027)

Hi, I would like to ask if it would be possible to align text to the right in the # of votes and % of votes columns in the table listing over 460 MPs located in the List of Sejm members (2023–2027)#List of members section. The use of {{Table alignment}} is imposible due to merged cells which help with wisual representation. There fore befoure every cell in mentioned columns which all contain numerical data, "align-text: right|" sholud be placed. Chears! — Antoni12345 (talk) 23:49, 20 January 2024 (UTC)[reply]

Antoni12345, you should ask this at WP:VPT. — Qwerfjkltalk 23:56, 20 January 2024 (UTC)[reply]
Oh now I see i wasn’t clear. My question was if the task could be done by a bot so I wouldn’t have to manually place “text-align=right|” before 920 cells. — Antoni12345 (talk) 06:20, 21 January 2024 (UTC)[reply]
Instead you may use Template:0. If you would like to apply this just at the page mentioned, then one could apply a one-time regex find-and-replace, for aligning right as well as using the template. Wikiwerner (talk) 10:40, 21 January 2024 (UTC)[reply]
If it's a single page, then copy the text to a word processor of some variety and use a find/replace to do it all at once. Primefac (talk) 14:41, 21 January 2024 (UTC)[reply]
@Primefac: if it would be that siple then I would do it. Unfortunately there is no unique frase befoure mentioned cells to use the find/replace tool. That's why i'm trying to request a edit by a bot. — Antoni12345 (talk) 19:26, 21 January 2024 (UTC)[reply]
You will not get a bot approved for a single page. Someone at WP:AWB/TASKS might be able to help.
Declined Not a good task for a bot. Primefac (talk) 07:15, 22 January 2024 (UTC)[reply]
@Antoni12345:  Done manually in this edit. I first played around trying to fix the column header to align the whole column right, but that didn't work, so I figured out how to do it for each row per Help:Table. As Primefac suggested, I copied the section code and pasted it into Notepad. I then used find/replace 10 times:
  • Find: || 1 Replace: || style="text-align: right;" | 1
  • Find: || 2 Replace: || style="text-align: right;" | 2
...
  • Find: || 0 Replace: || style="text-align: right;" | 0
I then copied the new code from Notepad back into WP, previewed the change, manually fixed less than 10 rows with different formatting, and saved. Happy editing! GoingBatty (talk) 15:10, 22 January 2024 (UTC)[reply]
@GoingBatty: Thank you so much! I am aware of the using find/replace tool ability in text editors, but i was stuck and couldn't think of the frase to find/replace. It would be even easier if I would think of aligning text on the stage of editing in excel but I haven't. And after you excel2wiki there's no coming back :--P Thanks again! — Antoni12345 (talk) 17:30, 22 January 2024 (UTC)[reply]
@Antoni12345: I also tried the VisualEditor. While it allows you to manipulate tables (e.g. adding/merging/removing rows/columns) but doesn't allow you to apply a format such as aligning right. :-( GoingBatty (talk) 17:34, 22 January 2024 (UTC)[reply]

Categorizing ACM Fellows by Year

Right now, there is a category "Category:Fellows of the Association for Computing Machinery" - which is added to all ACM Fellows. I created a bunch of categories "Category:202x Fellows of the Association for Computing Machinery" as subcategories of the main category to organize these by year. The recepients are already organised by year in this page.

  1. Go through each section of the page (which corresponds to a certain year).
  2. Foreach blue linked article in that section add the correct year-specific category "Category:202x Fellows of the Association for Computing Machinery" to the article.
  3. After all sections are done, remove the generic "Category:Fellows of the Association for Computing Machinery" from all the linked articles since they are now living under a subcategory.

KNivedat (talk) 18:37, 26 January 2024 (UTC)[reply]

KNivedat, Coding...— Frostly (talk) 18:37, 27 January 2024 (UTC)[reply]
@KNivedat, BRFA filed— Frostly (talk) 03:45, 4 February 2024 (UTC)[reply]

Add articles under the the following categories and subcategories to Wikiproject

I want to add the articles under the following categories+subcategories to wikiproject: Indian caste system. Is this a good request for a bot?

Category:Dalit (42)
Category:Dalit literature (19)
Category:Dalit politics (61)
Category:Navayana Buddhists (3)
Category:Ambedkarite political parties (22)
Category:Satnami (5)
Category:Paraiyar leaders (3)
Category:Balmiki (4)
Category:Adivasi (40)
Category:Caste system in India (74)
Category:Anti-caste movements (43)
Category:Scheduled Tribes of India (181)

Miximon (talk) 19:48, 30 January 2024 (UTC)[reply]

Probably not; humans will need to add the WikiProject templates based on context. I picked a couple of articles at random from the categories, and Giraudpuri and Guru Balakdas, in Category:Satnami, do not appear to have anything to do with that WikiProject. (edited to add: I believe that the OP is referring to {{WikiProject Indian caste system}} and the associated WikiProject.) – Jonesey95 (talk) 20:10, 30 January 2024 (UTC)[reply]
I would also make the argument that other than the Scheduled Tribes cat, none of these are heavily populated - a quick AWB run would probably be both faster and more accurate (since there would be human oversight). Primefac (talk) 20:15, 30 January 2024 (UTC)[reply]
Even the big category will need human oversight; I don't see a reference to caste in Tani people, which is in that scheduled tribes category. – Jonesey95 (talk) 20:21, 30 January 2024 (UTC)[reply]
Just adding context to clarify - Scheduled castes and tribes are Govt of India's list of castes and tribes that are recognized for affirmative action because of historic discrimination - Caste system in India#Recognition Miximon (talk) 20:29, 30 January 2024 (UTC)[reply]
@Miximon: If you want to post a list of pages (not categories) on the WikiProject's talk page, and there's consensus to tag each of them, I will be happy to have my bot tag them. GoingBatty (talk) 21:06, 30 January 2024 (UTC)[reply]

Backlog drive leaderboard updates

Hi, is there a bot that can automatically update the leaderboard for the unreferenced articles backlog drive? Ideally, it would count the number of edit summaries made by each participant with "feb24" (not case-sensistive) to unique articles. Then, it would update the "Points from references" column in the leaderboard with that number. The rest of the leaderboard doesn't need to be updated by the bot, as the points from reviews is simpler to update, and the total points is automatically provided by a template. There's a bit of prior discussion at the talk page. Thanks! ARandomName123 (talk)Ping me! 23:14, 1 February 2024 (UTC)[reply]

I created a simple scraper which I plan to use daily to update the leaderboard. I don't think it's a very good solution, so if bot experts want to help, it would still be greatly appreciated ;) Broc (talk) 21:32, 2 February 2024 (UTC)[reply]
Y Done. Bot created based on script above. See User:BaranBOT/FEB24DriveLeaderboard. ARandomName123 (talk)Ping me! 18:12, 3 February 2024 (UTC)[reply]

Bot request for Korean hangul text

Per the discussion at Wikipedia talk:Manual of Style/Korea-related articles#About adding a link to each hangul syllable using Template:Linktext, I am submitting this bot request. Please perform the following.

  1. Check if Template:Linktext only contains [ ]?[0-9가-힣][ ]? in each parameter.
  2. If so, remove Template:Linktext and |, but retain the text entered as parameters (including space characters before and/or after [0-9가-힣]).
  3. If not (that is, if Template:Linktext contains (1) any character other than [0-9가-힣], or (2) two or more adjacent [0-9가-힣] in at least one parameter), leave it as-is.

The following examples would help you understand this request.

  • Cases that should be changed
    • {{linktext|국|립|중|앙|도|서|관}}국립중앙도서관 (currently found in National Library of Korea)
    • {{Linktext|수|도|권|제|1|순|환|고|속|도|로}}수도권제1순환고속도로 (currently found in Capital Region First Ring Expressway)
    • {{linktext|새|터|데|이| 나|이|트| 라|이|브| 코|리|아|}}새터데이 나이트 라이브 코리아 (space characters have to be retained; currently found in Saturday Night Live Korea)
    • {{linktext|구|름|은}} {{linktext|흘|러|가|도}}구름은 흘러가도 ((added this example just in case) a space character between two instances of Template:Linktext has to be retained; currently found in Even the Clouds Are Drifting)
  • Cases that should NOT be changed
    • {{linktext|中|文|維|基|百|科}} (contains any character other than [0-9가-힣]; currently found in Chinese Wikipedia)
    • {{linktext|새|마을|호}} (contains two or more adjacent [0-9가-힣] in at least one parameter; currently found in Saemaeul-ho)

172.56.232.167 (talk) 00:24, 2 February 2024 (UTC)[reply]

The regex will need to be a bit more complex than what is described above, but this should be doable. Let me do some small-scale testing and get back to you. Primefac (talk) 12:30, 2 February 2024 (UTC)[reply]
BRFA filed. Primefac (talk) 21:34, 2 February 2024 (UTC)[reply]
Thank you for running your bot. I greatly appreciate it.
But can you please also remove them in the Draft namespace? There are currently 86 pages containing them (see this). They can be moved to the main (article) namespace at any time. 172.56.232.239 (talk) 05:46, 5 February 2024 (UTC)[reply]

Bot to automatically revert date change vandalism

Hi, I want to create a bot that automatically reverts obvious date change vandalism. For example: "James H. (born 26 December 2002)" And then a vandal comes and changes it to: "James H. (born 29 December 2002)" And the goal of the bot is to revert these changes as accurately as possible. And here's how it's gonna work: A bot sees that someone changed the birth date. The bot looks up the name of the person on wikidata. If the person appears on wikidata, The bot searches for his birth date on his wikidata page. And if the birth date written on wikidata is different than the date the vandal changed it to, the bot automatically revert these changes. I hope this bot can be coded for me. It seems like a great idea for a bot. 93.173.38.154 (talk) 11:28, 5 February 2024 (UTC)[reply]