Jump to content

Wikipedia:Bot requests: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
Undid revision 830141025 by CrowzRSA (talk) after researching more I think there's a bot for that .. !!!
Line 312: Line 312:


Can I request a bot to tag the articles that are in [[:Category:Reality television series]] with "|reality-tv=yes|reality-tv-importance="(importance can be assessed manually afterwards). I have created a list of each individual category [[Wikipedia:WikiProject Television/Reality television task force/Categories|here]] after removing certain subcategories, mostly participant and container categories. Please let me know if you think that needs more refining. A lot of the articles already have WPTV but are just missing the Reality TV task force label, and some older articles don't have the project at all. So would need the bot to add the full WPTV+realitytv tag to any that are missing the project, and only add the task force parameter to those that are already under WPTV. [[User:WikiVirusC|<b style="color:#000080; font-family:Tahoma">WikiVirus</b>]]'''[[Special:contributions/WikiVirusC|<u style="font-family: Tahoma">C</u>]]'''[[User talk:WikiVirusC|<b style="color:#008000">''<sup>(talk)</sup>''</b>]] 16:30, 9 March 2018 (UTC)
Can I request a bot to tag the articles that are in [[:Category:Reality television series]] with "|reality-tv=yes|reality-tv-importance="(importance can be assessed manually afterwards). I have created a list of each individual category [[Wikipedia:WikiProject Television/Reality television task force/Categories|here]] after removing certain subcategories, mostly participant and container categories. Please let me know if you think that needs more refining. A lot of the articles already have WPTV but are just missing the Reality TV task force label, and some older articles don't have the project at all. So would need the bot to add the full WPTV+realitytv tag to any that are missing the project, and only add the task force parameter to those that are already under WPTV. [[User:WikiVirusC|<b style="color:#000080; font-family:Tahoma">WikiVirus</b>]]'''[[Special:contributions/WikiVirusC|<u style="font-family: Tahoma">C</u>]]'''[[User talk:WikiVirusC|<b style="color:#008000">''<sup>(talk)</sup>''</b>]] 16:30, 9 March 2018 (UTC)

== Archive all of an article's online citations and update text to include archive URL/date ==

As the title states, I am proposing that a bot be made which can go through an article with references to online sources and archive the linked web-pages. In addition, if the bot could update the page's references with the archive URL and date, assuming a template has been used in the citation, that would be quite helpful. I'm not sure how easily this can be done since I have no bot-making experience, but I believe this could be extremely helpful in ultimately eliminating deadlinks on most Wikipedia articles. It would make archiving to avoid dead links a whole lot easier! I hope it would be acceptable to use an outside site for the archival process, perhaps [[WebCite]] would be good. Sorry if this gets asked frequently!

Revision as of 00:26, 13 March 2018

This is a page for requesting tasks to be done by bots per the bot policy. This is an appropriate place to put ideas for uncontroversial bot tasks, to get early feedback on ideas for bot tasks (controversial or not), and to seek bot operators for bot tasks. Consensus-building discussions requiring large community input (such as request for comments) should normally be held at WP:VPPROP or other relevant pages (such as a WikiProject's talk page).

You can check the "Commonly Requested Bots" box above to see if a suitable bot already exists for the task you have in mind. If you have a question about a particular bot, contact the bot operator directly via their talk page or the bot's talk page. If a bot is acting improperly, follow the guidance outlined in WP:BOTISSUE. For broader issues and general discussion about bots, see the bot noticeboard.

Before making a request, please see the list of frequently denied bots, either because they are too complicated to program, or do not have consensus from the Wikipedia community. If you are requesting that a template (such as a WikiProject banner) is added to all pages in a particular category, please be careful to check the category tree for any unwanted subcategories. It is best to give a complete list of categories that should be worked through individually, rather than one category to be analyzed recursively (see example difference).

Alternatives to bot requests

Note to bot operators: The {{BOTREQ}} template can be used to give common responses, and make it easier to keep track of the task's current status. If you complete a request, note that you did with {{BOTREQ|done}}, and archive the request after a few days (WP:1CA is useful here).


Please add your bot requests to the bottom of this page.
Make a new request
# Bot request Status 💬 👥 🙋 Last editor 🕒 (UTC) 🤖 Last botop editor 🕒 (UTC)
1 Implementing the outcome of Wikipedia talk:Naming conventions (television)#Follow-up RfC on TV season article titles BRFA filed 18 9 Wikiwerner 2024-04-20 17:49 Primefac 2024-03-27 12:55
2 Auto-WP:NAVNOREDIRECT Declined Not a good task for a bot. 10 5 Wikiwerner 2024-04-28 12:22 Primefac 2024-03-13 18:37
3 Automatic NOGALLERY keyword for categories containing non-free files (again) 9 5 Wikiwerner 2024-05-15 18:43 Usernamekiran 2024-04-13 02:17
4 Can we have an AIV feed a bot posts on IRC? 7 2 Lofty abyss 2024-06-15 18:17 Usernamekiran 2024-04-15 11:27
5 Bot to update match reports to cite template BRFA filed 7 2 Yoblyblob 2024-06-10 18:47 Mdann52 2024-06-10 18:06
6 Bot to mass tag California State University sports seasons Doing... 5 4 Frostly 2024-06-10 17:05 Headbomb 2024-06-09 17:28
7 Football league infoboxes 7 4 Bagumba 2024-04-25 13:43 Primefac 2024-04-25 12:01
8 Clear Category:Unlinked Wikidata redirects 6 3 A smart kitten 2024-04-23 10:56 DreamRimmer 2024-04-21 03:28
9 Find linkrot with a specific pattern 7 3 GreenC 2024-05-01 16:20
10 Fixing stub tag placement on new articles Declined Not a good task for a bot. 4 3 Headbomb 2024-05-19 20:17 Headbomb 2024-05-19 20:17
11 User:RetractionBot, v2 Y Done 8 5 Mdann52 2024-05-25 16:06 Mdann52 2024-05-25 16:06
12 Bot to change citations to list defined references Declined Not a good task for a bot. 3 2 Apoptheosis 2024-06-09 17:44 Headbomb 2024-06-09 16:56
13 Adding Facility IDs to AM/FM/LPFM station data BRFA filed 10 3 Qwerfjkl 2024-06-12 14:29 Qwerfjkl 2024-06-12 14:29
14 Tagging women's basketball article talk pages with project tags 7 2 Hmlarson 2024-06-10 19:17 Mdann52 2024-06-10 19:15
15 Friendly support for Draft categories – feedback request 1 1 Mathglot 2024-06-10 19:40
16 'Literature of Kashmir' Declined Not a good task for a bot. 2 2 Usernamekiran 2024-06-11 07:37 Usernamekiran 2024-06-11 07:37
17 Adding links to previous TFDs 5 3 Rusty4321 2024-06-18 20:59 Mdann52 2024-06-16 18:57
18 Bot that condenses identical references Coding... 8 4 GreenC 2024-06-18 15:21 Headbomb 2024-06-18 00:34
Legend
  • In the last hour
  • In the last day
  • In the last week
  • In the last month
  • More than one month
Manual settings
When exceptions occur,
please check the setting first.


Convert protocol relative URLs to http/https

All protocol relative links on Wikipedia should be converted to either http or https. As of June 2015, Wikipedia is 100% HTTPS only and because protocol relative links are relative to where they are hosted it will always render as HTTPS. This means any underlying website that doesn't support HTTPS will break. For example:

[1] (//americanbilliardclub.com/about/history/)

..the http version of this link works. The article American rotation shows it in action, the first three footnotes are broken because they use a protocol relative link to a HTTP only website. But Wikipedia is rendering the link as HTTPS.

More info at WP:PRURL and Wikipedia:Village_pump_(technical)#Protocol_relative_URLs. It's probably 10s of thousands of links broken. -- GreenC 21:06, 8 June 2017 (UTC)[reply]

This should only be done if the existing link is proven to be broken, and where forcing it to http: conclusively fixes it. Otherwise, if the link is not dead under either protocol, it is WP:COSMETICBOT. --Redrose64 🌹 (talk) 21:45, 8 June 2017 (UTC)[reply]
Well let's ask, what happens if you keep them? It creates a point of failure. If the remote site stops supporting HTTPS then the link immediately breaks. There is no guarantee a bot will return years later and recheck. WP:COSMETICBOT is fine but it shouldn't prevent from removing a protocol that causes indefinite maintenance problems and MediaWiki no longer really supports. By removing it also discourages editors from further usage, which is good. -- GreenC 22:07, 8 June 2017 (UTC)[reply]
That reasoning makes no sense. If a bot converts the link to https and the remote site stops supporting HTTPS, then the link immediately breaks then too. Anomie 00:22, 9 June 2017 (UTC)[reply]
Different reasoning. IABot forces HTTPS on all PR URLs since Wikipedia does too, when it analyzes the URL. It's erroneously seeing some URLs as dead as a consequence since they don't support SSL. The proposal is to convert all non-functioning PR URLs to HTTP when HTTPS doesn't work.—CYBERPOWER (Message) 02:22, 9 June 2017 (UTC)[reply]
@Cyberpower678: The proposal, as specified above by Green Cardamom (talk · contribs) is not to convert all non-functioning PR URLs to HTTP when HTTPS doesn't work, but to convert all PR URLs to either http or https. No exceptions were given, not even those that are presently functioning. This seems to be on the grounds that some are broken. --Redrose64 🌹 (talk) 09:06, 9 June 2017 (UTC)[reply]
Do I want to get rid of PR URLs? I personally think we should because they confuse editors, confuse other bots, ugly and non-standard etc they're an unnecessary complication. If we don't want to get rid of them (all), we still need to the fix broken HTTP links either way. -- GreenC 14:35, 9 June 2017 (UTC)[reply]
  • As someone who's been strongly involved with URL maintenance over the last 2 years, I think this bot should be run on Wikipedia, and should enforce protocols. It's pushing WP:COSMETICBOT but if the link ends up being broken because only HTTP works, then that will create other issues. The task can be restricted to only converting those not functional with HTTPS, but my first choice is to convert all. — Preceding unsigned comment added by Cyberpower678 (talkcontribs) 01:38, 13 June 2017 (UTC)[reply]
Opining as a bot op: I personally don't think this can be read as having community consensus because it's going to create a lot of revisions for which there is no appreciable difference. Yes it would be nice if wikipedia was smart enough to figure out if the relative URL is accessable only via HTTP or can be accessed via https, but the link is clicked in the user's browser and therefore the user doesn't know that the content may be accessable via HTTPS or HTTP. Ideally, users entering relative URLS could be reminded via a bot that it's better to be explicit with what protocol needs to be used to get to the content. The counter is we could set a bot to hunt down all the relative URLS and put a maintanance tag/category in the reference block so that a human set of eyes can evaluate if the content is exclusively available via one route or if the content is the same on both paths.

TLDR: This request explicitly bumps against COSMETICBOT, needs further consensus, and there might be a way to have "maintenance" resolve the issue. Hasteur (talk) 12:38, 13 June 2017 (UTC)[reply]

Those are all good ideas but too much for me to take on right now. Agree there is no community consensus about changing relative HTTPS links; However existing relative HTTP cases broken in June 2015 should be fixed asap. A bot should be able to do it as any broken-link job without specific community consensus (beyond a BRFA). Broken links should be fixed. That's something I can probably do, unless someone else wants to (I have lots of other work..). Note this fix would not interfere with any larger plans to deal with relative links. -- GreenC 15:26, 13 June 2017 (UTC)[reply]
Bump. -- GreenC 17:13, 9 August 2017 (UTC)[reply]
  • Strong support: This is definitely not COSMETICBOT; these URL errors directly interfere with exercise of WP:Verifiability. They also cause editwarring and article damage; various times I've had to revert people – including some long-experienced editors – removing "dead links" and inserting {{citation needed}} tags, when all that was required was adding the characters "http:".  — SMcCandlish ¢ >ʌⱷ҅ʌ<  21:02, 3 October 2017 (UTC)[reply]
  • Bump thread expire -- GreenC 04:16, 22 October 2017 (UTC)[reply]
  • Support per GreenC and SMcCandlish Jon Kolbert (talk) 23:59, 28 October 2017 (UTC)[reply]
  • Needs wider discussion. I happen to agree with this strongly, but this would need a very wide discussion at a village pump before being considered. This is far too many edits to do without very clear consensus, some of which fail WP:COSMETICBOT if no consensus is obtained to override it. ~ Rob13Talk 14:50, 8 December 2017 (UTC)[reply]
  • Support per GreenC. Also this is seriously needed and would benefit the project.BabbaQ (talk) 13:24, 30 December 2017 (UTC)[reply]
  • I'll chime in as a WP:BAG member here, that the scope of the task means we need wider discussion, if only to identify possible pitfalls and cornercases. WP:VPT/WP:VPR would be the natural places to hold it. I personally support the task FWIW. Headbomb {t · c · p · b} 02:44, 26 January 2018 (UTC)[reply]

Special character de-corrupter

Very often, because of encoding issues, you have situations like é → é.

This is often due to copy-pasting, or bot insertions. It would be nice if a bot could find all corrupted equivalent of all special Latin characters (possibly others too), and then do a de-corruption pass e.g. [2]/[3].

This might be best as a manual AWB run though.. Headbomb {t · c · p · b} 13:41, 1 December 2017 (UTC)[reply]

Related problems with file names on Commons: Rename files with wonky Unicode encoding. — Dispenser 06:16, 2 December 2017 (UTC)[reply]
@Dispenser: could you adapt your script for enwiki? Headbomb {t · c · p · b} 02:46, 26 January 2018 (UTC)[reply]
@Headbomb: Well I had to write a dump parser. Wasted a few hours in writing a word frequency collector. Ultimately regex on the dump was the fastest (4 hour runtime). It only does UTF-8 → mojibake and we need to figure out which of the 4,018 matches across 2,166 articles actually need to be fixed. I've done some already: [4] [5]Dispenser 03:21, 29 January 2018 (UTC)[reply]
Wikiget can return a regex dump search in about 30 seconds. Only limit it maxes out at 10,000 hits (limited by Elasticsearch).
./wikiget -a "insource:/<regexcommand>/"-- GreenC 05:21, 29 January 2018 (UTC)[reply]
Supposedly our Elasticsearch times out easily such that [0-9] needs to be split to properly work: [0-4], [5-9]. — Dispenser 11:45, 29 January 2018 (UTC)[reply]
See T106685, which has been marked as "Resolved", to the dismay of those of us who want to search using regexes. – Jonesey95 (talk) 14:10, 29 January 2018 (UTC)[reply]
Someone should setup a dedicated instance just for searching with no limitations. Cirrus dumps + setup info. -- GreenC 16:10, 29 January 2018 (UTC)[reply]
I'm tempted to create a web version of AWB's Database Scanner since I find it a pain to download a new dump, find a way to update AWB, take 15 minutes to decompress the dump, then try and fail to get my regexp working. Is there interest in building something better? — Dispenser 01:13, 31 January 2018 (UTC)[reply]
There would be interest but the disk space.. I wrote a fast and simple program for regex'ing XML dumps. -- GreenC 02:36, 31 January 2018 (UTC)[reply]
I have 1.2TB of compressed monthly dumps for the top ten wikis going back to September 2015. For enwiki, I have early 2008, 2010, 2012, and 2014 dumps. I also have a spare 120 GB end-of-write-life SSD which could be useful in a high throughput read only mode. But this would be running on my home server/work machine, so I'd be worried about CPU usage and would have to figure out a way of limiting abuse. — Dispenser 04:52, 31 January 2018 (UTC)[reply]

Newspaper

I would like to request for a bot that could fill in the Publisher after use of the Refill tool such as |publisher=Aftonbladet. As many Swedish subject articles uses one or two of the few main newspaper sources that are available in Sweden I would like for the bot to fill in for the sources aftonbladet.se as Aftonbladet, expressen.se as Expressen, svd.se as Svenska Dagbladet, kvp.se as Kvällsposten and dn.se As Dagens Nyheter. If those could be filled in at Publisher it would help seversl thousands of articles.BabbaQ (talk) 13:32, 30 December 2017 (UTC)[reply]

Anything that is known to be a newspaper should use |newspaper= and definitely not |publisher=, which should instead be removed. --NSH001 (talk) 16:53, 21 February 2018 (UTC)[reply]
@BabbaQ: Idea is not well explained. How would these pages be found/determined? --TheSandDoctor (talk) 22:30, 11 March 2018 (UTC)[reply]

Replace "IMDb award" by "IMDb event"?

There is a template which takes an IMDb page, I think, and an event name as parameters - e.g. {{IMDb award|Venice_Film_Festival|Venice Film Festival}} - but it creates broken links. Maybe it relied on some redirect on IMDb's side and they changed their format, I do not know. There is another template which uses a IMDb event code instead of a page name - e.g. {{IMDb event|0000681|Venice Film Festival}}, which creates a correct link. See both at work:

Is there any chance a bot could fix those? I guess it would need to search the IMDb to get the event codes, which I do not know if it is allowed... (both by us and them). Thanks. - Nabla (talk) 17:23, 30 December 2017 (UTC)[reply]

I've done a couple of hundred mannually. There's only about 50 left now. -- WOSlinker (talk) 01:03, 31 December 2017 (UTC)[reply]
And I have done the remaining ten or so. Thank you. - Nabla (talk) 23:08, 24 February 2018 (UTC)[reply]
In that case, N Not done as was done manually. ---TheSandDoctor (talk) 21:11, 11 March 2018 (UTC)[reply]

Tag talk pages of articles about English with Template:WikiProject English language

WP:Article alerts recommends having a bot tag the talk pages of articles with relevant topical wikiproject banners so that the AA bot produces more meaningful results. This would also be useful for getting this barely active project rolling better; I'd been looking into manually going article to article doing this, but it looked to be a rather daunting task even with AWB, and I'm on a Mac, so I'd have to run AWB in a VM or something anyway.

Would start with Category:English languages and its subcats.

Various subcats of Category:Words are going to qualify but will probably have to be done manually (e.g. about 99% of the content of Category:Neologisms, Category:Slang, etc., are English, but a handful of articles in such categories are not and so should not be tagged as within the scope of this project. Similarly, the majority of articles under Category:Punctuation have a section on English and would get tagged, but in a few cases the English coverage has been split out into separate spinoff articles like Quotation marks in English which should get tagged while the main article on the mark would not. We'll probably want to exclude most literature-related categories, but would include Shakespeare (for having had a profound effect on English, in contributing more stock phrases than any other body of work besides the King James Bible). Category:Lexicographers and other such bios will also need manual tagging.  — SMcCandlish ¢ >ʌⱷ҅ʌ<  19:09, 17 January 2018 (UTC)[reply]

Who Was Who link formatting error

We seem to have a large (three-figures, at least) number of external links to Who Was Who, formatted with an extraneous comma at the end of the URL, like the one I fixed in this edit. Can someone fix them all, please?

Better still would be to apply the {{Who's Who}} template, like this, but I appreciate that may not be so straightforward. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 16:52, 21 January 2018 (UTC)[reply]

@Pigsonthewing: This search drags up about 1500. --Izno (talk) 18:15, 23 February 2018 (UTC)[reply]

Bot to search and calculate coordinates

Please look at this table: Lands_administrative_divisions_of_New_South_Wales#Table_of_counties

My goal is to add a column to this table that shows the approximate geographical coordinates of each county. Those county coordinates can be derived form the parish coordinates that are found in each county article, by taking the middle of each northernmost and southernmost / easternmost and westernmost parish coordinates. Is it possible to write a script or a bot to achieve this? --Ratzer (talk) 21:27, 25 January 2018 (UTC)[reply]

For illustration, I did the work for the first county in the list, Argyle County, manually. The table of parishes in this article shows that they range from 34°27'54" and 35°10'54" latitude south and 149°25'04" and 150°03'04" longitude east. The respective middle is 34°49'24" and 149°44'04", which I put in the first table entry of Lands administrative divisions of New South Wales and the info-box of Argyle County.--Ratzer (talk) 07:39, 26 January 2018 (UTC)[reply]

Bot to convert New York Times abstract URLs to archive PDF URLs

There are a lot of New York Times URLs that begin with http[s]://query.nytimes.com/gst/abstract.html?res= or http[s]://select.nytimes.com/gst/abstract.html?res=. However, all this does is take people to the abstract page. If these Wikipedia readers aren't NYT members, they encounter a paywall, and if they are members, they are allowed to select a PDF/TimesMachine version to continue reading the article. Either way, they have to click at least one more time once they reach the abstract page.

Would it be practical to convert these to http://query.nytimes.com/mem/archive/pdf?res= URLs? These PDF versions can be seen by everyone, even non-members, and is much easier to verify. The hexadecimal string after the equals sign will remain the same before and after, but it does have to be an HTTP URL for these NY Times PDF links to work. epicgenius (talk) 22:01, 4 February 2018 (UTC)[reply]

So, just to double-check this, you're doing a find/replace of gst/abstract and converting to mem/archive, as well as changing any select. into query.
And this will work for all the articles? Primefac (talk) 12:50, 5 February 2018 (UTC)[reply]
Actually, it's gst/abstract.html to mem/archive/pdf, select. into query., and all HTTPS to HTTP. Yes, this will work for all articles. However, KolbertBot is converting http://nytimes.com URLs to https://nytimes.com, so I will ping Jon Kolbert for feedback.
I am requesting that HTTPS be converted specifically to HTTP, because http://query.nytimes.com/mem/archive/pdf?res=9801E7DF1330E333A25755C0A96E9C94669ED7CF&legacy=true (for instance) will work, but not https://query.nytimes.com/mem/archive/pdf?res=9801E7DF1330E333A25755C0A96E9C94669ED7CF&legacy=true, which displays an empty frame. epicgenius (talk) 18:42, 5 February 2018 (UTC)[reply]
@Epicgenius: A few weeks ago there were reported issues with query.nytimes.com links, I had fixed issues with links to query.nytimes.com/mem/archive/pdf?res= in response. KolbertBot doesn't act on select.nytimes.com links. Is the desired outcome to have select.nytimes.com/gst/abstract.html?res= and query.nytimes.com/gst/abstract.html?res= changed to query.nytimes.com/mem/archive/pdf?res=? That shouldn't be too hard to do with KolbertBot, I can create a new bot task to do this job. Jon Kolbert (talk) 00:46, 6 February 2018 (UTC)[reply]
@Jon Kolbert: Yes, that is what I am trying to do. epicgenius (talk) 00:48, 6 February 2018 (UTC)[reply]
This strikes me as a "needs consensus" task given it goes from https -> http. Additionally, this strikes me as something which may be a temporary workaround. Whether TNYT allows this deliberately or through some failure of configuration is obviously unknown--but I would guess that if they notice persons jumping straight to their PDF versions from external to their website, they'll cut off the access (which leaves us in a definitely worse spot than current). --Izno (talk) 18:22, 23 February 2018 (UTC)[reply]

Take over GAN functions from Legobot

Legobot is an enormously useful bot that performs some critical functions for GAN (among other things). Legoktm, the operator, is no longer able to respond to feature requests, and is not very active; they've asked in the past if someone would be willing to take over the code. I gather from that link that the code is PHP; see here. There would be a lot of grateful people at GAN if we could start addressing a couple of the feature requests, and if we had an operator who was able to spend more time on the bot. This is not to criticize Legoktm at all -- without their work, GAN could not function; Legobot is a core part of GAN functionality.

I left a note on Legotkm's talk page asking if they would mind a request here for a new operator, and Redrose64 responded there with a link to the note I posted above, so I think it's clear they'd be glad for someone else to pick this up. Any takers? Mike Christie (talk - contribs - library) 23:10, 6 February 2018 (UTC)[reply]

I've heard from Legoktm and they would indeed be glad to have someone else take this over. If you're capable in PHP, this is your chance to operate a bot that's critical to a very active community. Mike Christie (talk - contribs - library) 00:21, 8 February 2018 (UTC)[reply]
I would like to comment that it would be good to expand the functionalities of the bot for increased automation, like automatically adding to the GA lists. Perhaps it would be better to rewrite the bot in a different language? I think Legoktm has tried to get people to take over the php for awhile with no success. Kees08 (Talk) 04:44, 8 February 2018 (UTC)[reply]
The problem with adding to the GA lists is knowing which one. There is no indication on the GAN as to where. All we have is the topic. Even the humans have trouble with this. Hawkeye7 (discuss) 20:20, 16 February 2018 (UTC)[reply]
To correct for the past, we could add a parameter to the GA template for the 'subtopic' or whatever we want to call that grouping. A bot could go through the current listing and then add that parameter to the GA template. Then, when nominating, that could be in the template, and the bot could carry that through all the way to automatically adding it to the GA page at the end. Kees08 (Talk) 20:23, 16 February 2018 (UTC)[reply]
Nominators would need to know those tiny divisions within the subtopics; as it's not something we have on the WT:GAN page, I doubt most are even aware of the sub-subtopics. Even regular subtopics are sometimes too much for nominators, who end up leaving that field blank when creating their nominations. BlueMoonset (talk) 22:15, 26 February 2018 (UTC)[reply]

@Hawkeye7: For what it is worth, due to your bot's interactions with FAC, I think it would be best if you took over the GA bot as well, for what it is worth. I think at this point it is better to just write a new bot than salvage the old bot; no one seems to want to work on salvaging. Kees08 (Talk) 21:59, 26 February 2018 (UTC)[reply]

We'd need to come up with a full list of functionality for whoever takes this on, not only what we have now but what we're looking for and where the border conditions are. BlueMoonset (talk) 22:15, 26 February 2018 (UTC)[reply]

I might interested in lending a hand. A features list and functionality details (as mentioned by BlueMoonset) would be nice to affirm that decision though. I shall actively watch this thread. --TheSandDoctor (talk) 21:30, 11 March 2018 (UTC)[reply]

Okay, I will attempt to list the features, please modify as needed:

  • Place notifications on nominators talk page when their nomination is onreview diff, onhold diff, passed diff, failed
  • Update GAN page when status of a review changes (new, on hold, on review, passed, failed, also number of reviews editors have performed) diff
  • Update the stats page (related to the last bullet point, this is where the stats are stored) diff
  • Transcludes GA review on article talk page diff
  • Adds GA icon to articles that pass diff
  • Adds the oldid parameter to the GA tempate diff

@BlueMoonset: Are you aware of other functions? Looking through the bots edit history and going off of what I know of the bot, this is what I came up with. Kees08 (Talk) 22:10, 11 March 2018 (UTC)[reply]

Thanks Kees08. Does anyone know if it would be possible to take a look at the database structure? --TheSandDoctor (talk) 22:28, 11 March 2018 (UTC)[reply]
@Legoktm: Are you able to answer their question? Thanks! Kees08 (Talk) 23:43, 11 March 2018 (UTC)[reply]
TheSandDoctor, it's great that you're interested in this. Kees08, the second item (about updating the GAN page) is much broader. I don't know whether the bot simply updates the GAN page or generates/recreates the contents of all sections on it. It's basically dealing with all the GA nominee templates out there—which indicate what is currently a nominated article that belongs on the GAN page, and the changes to that page. If an entry wasn't on the page last time but is this time, then it's considered new; if it was on last time but a review page has appeared for it, then it's considered under review and the review page is parsed for reviewer information (but if the GA nominee template instead says "status=onhold", then it's noted as being on hold)... there's a lot involved, including cross-checking, and any field in the GA nominee template, including page number, subtopic, status, and note, can change at any time. If the GA nominee template has disappeared and a GA template is there for that same "page" number, then it has passed; if a FailedGA is there for that same "page" number, then it has failed (but the current bot's code doesn't check this properly, so any FailedGA template on the talk page results in the "failed" message being sent to the nominator even if the nomination was just passed with a new GA template showing). Sometimes review pages disappear when they were created illegally or by mistake and are speedy deleted, and the bot realizes their absence and updates the GAN page accordingly, so it's a comprehensive check each time the bot runs (currently every 20 minutes). If the bot doesn't know how to characterize the change it has found, it appears under an edit summary of "Maintenance": status changes to 2ndopinion go here, as do passes and failures where there was something wrong with the article talk page according to its lights. For example, it views with suspicion any talk page of a nomination under review that doesn't have a transcluded review on it, so it doesn't send out pass or fail messages for them (and maybe not even hold messages; I've never checked that).
There's a difference here between features and functionality. I think the features (with the exception of the 2ndopinion status and the display of anything in the "notes" field of GA nominee) have been listed here. The actual functions—how it needs to work and what it needs to check—are harder to break down. One thing that was mentioned above is the use subtopics: we have been unable to add new subtopics for several years now, so new subtopics on the GA page are not yet available on the GAN page. I'm not sure how the bot gets its list of subtopics—I've found more than one possible page where they could be read from, but there may be a database for subtopics and the topics they come under that actually controls them, with the pages I've found being a place for some templates, like GA, FailedGA, and Article history, to figure out what subtopics translate to which topics, and which subtopics are legitimate. GA nominee templates that have invalid subtopics or missing status or note fields (or other glitches) can cause the bot to try every 20 minutes to enter or update a nomination/review and fail to do so; there are times when a transaction is listed dozens of times, one bot run after another, as the GAN edit summary because it needs to happen, but it ultimately doesn't (until someone sees the problem and fixes the problematic GA nominee template or GA review page). I'm hoping any new bot will be smarter about how to handle these (and many other) situations, and maybe there will be an accessible error log to aid us in determining what's wrong. BlueMoonset (talk) 00:55, 12 March 2018 (UTC)[reply]
Yeah there is a lot in the second bullet point I did not include diffs for, on account of me being lazy. I will try to do that tonight maybe. I tried to limit what I said to the current functionality of the bot and not include my wishlist of new things, including revamping how subtopics are done. There was an error log at some point in time (located here), not sure when we stopped using that, and if it was on purpose or not. Kees08 (Talk) 01:18, 12 March 2018 (UTC)[reply]

Fixing sort keys for biographies of Thai people

According to WP:NAMESORT (and expanded upon at WP:MOSTHAI), biographical articles about Thai people should be sorted like this:

{{DEFAULTSORT:Surname, Forename}}
[[Category:International people]]
[[Category:Thai people|Forename Surname]]

However, this has very inconsistently been adhered to, with some articles specifying the Thai order in the DEFAULTSORT and some not following the Thai order at all.

Would it be plausible for a bot to help fix this? A possible process I have in mind is something along the lines of:

  1. Manually compile a list of all Thai people categories.
  2. Manually compile a list of all biographies of Thai people.
  3. Manually list preferred DEFAULTSORT and Thai sort keys for all of them.
  4. Have a bot go through all the articles, adding and/or replacing the DEFAULTSORT and sort keys according to the listed values.

And, for the long term:

  1. The bot, during the aforementioend run, also adds a {{Thai name sort}} template, which does nothing but notes the correct Thai sort key for future reference.
  2. During periodical runs, a bot looks up the sort key in the {{Thai name sort}} template and adds it to any Thai people categories (from the previous list, which would have to be manually updated) which have been later added and are missing the sort key.

I realise this is pretty labour-intensive, but a more automated process would likely not be able to identify names which don't follow the Forename Surname format. I'd like to know that a bot was available for the task before attempting to review all the names. --Paul_012 (talk) 10:09, 9 February 2018 (UTC)[reply]

Paul 012, Category:Thai people states Note on sorting: Thailand people are usually called by the first name, even telephone books are sorted by the first name. This of course also applies to the subcategories.. Could you point to the relevant passages in the guides you mentioned?   ~ Tom.Reding (talkdgaf)  17:44, 11 February 2018 (UTC)[reply]
Tom.Reding, sorry but I'm not sure you're reading the request correctly. It's asking to add sort keys so that Thai people categories will be sorted by first name. Anyway, the quotes are:

Thai names have only contained a family name since 1915 and the name follows the western pattern of "Forename Surname". However, people in Thailand are known and addressed by their forename. In categories mostly containing articles about Thai people, Thai names should be sorted as they are written with the forename first. Thaksin Shinawatra is sorted [[Category:Thai people|Thaksin Shinawatra]].

and

When categorizing biography articles, do not specify sort keys to sort by surname in Thai people categories. However, sorting by surname is still desirable for non-country-specific people categories, and this is done with the DEFAULTSORT magic word. A biography article for Given-name Surname should therefore be categorized like this:

{{DEFAULTSORT:Surname, Given-name}}
[[Category:International people]]
[[Category:Thai people|Given-name Surname]]

--Paul_012 (talk) 19:21, 11 February 2018 (UTC)[reply]
Paul 012, this should be doable, as long as all of the special given-name-first-sortkey cats are identified and appropriately not affected by {{DEFAULTSORT}}. I'm not available to do this, unfortunately, but no reason someone else can't pick it up. In the meantime, you could compile the list of all such special cats, to do some of the legwork and to entice a potential bot op.   ~ Tom.Reding (talkdgaf)  20:49, 16 February 2018 (UTC)[reply]
Hi there Paul_012, do you have an idea of what {{Thai name sort}} would include/what it would look like? Would it be substituted? Would it have parameters? (How would it note the preferred format?)
As for compiling a list of all Thai people categories and biographies, I see two viable approaches to that part of the problem:
  1. A bot runs through Category:Thai people and just works off of that category
  2. A bot runs through Category:Thai people and compiles a list (easily writeable to a local text file; one article per line) and uses that to work off of, updating periodically (in this case, that part wouldn't even have to be part of a bot's regular function, I could theoretically make generating said list its own program and run periodically for simplicity's sake)
Once I have some more details (above), I would be happy to consider working on this program and already have a rough idea of how I would do it (shouldn't take that long once things are clarified). --All the best, TheSandDoctor (talk) 16:40, 9 March 2018 (UTC)[reply]
I just saw the discussion here, I should clarify that I am happy to considering moving ahead with this one clarifications above are made and consensus is reached. I would consider this a fun little project, but will not move unless adequate consensus and discussion has taken place. --TheSandDoctor (talk) 16:42, 9 March 2018 (UTC)[reply]
Thanks a lot for offering to help, TheSandDoctor. If a bot becomes available then there should be no need to modify the guideline; I hope it can be settled soon.
I'm imagining the template as taking only one parameter, which is the desired sort key, with no visible output. (In most cases it would be identical to the article title, but there may be some exceptions.) So for the Abhisit Vejjajiva article, the desired outcome would be:
{{DEFAULTSORT:Vejjajiva, Abhisit}}
{{Thai name sort|Abhisit Vejjajiva}}
[[Category:Prime Ministers of Thailand|Abhisit Vejjajiva]]
etc.
[[Category:People educated at Eton College]]
etc.
The list of articles would be only needed for the initial run. I've already begun compiling it at Special:Permalink/829616472—It's still a work in progress, and will need to be double-checked. I'm expecting that subsequent periodical runs will identify the articles by looking up inclusions of {{Thai name sort}}. This way, new articles can easily be picked up.
I was thinking that the list of categories would also need to be manually compiled in order to avoid false positives such as expatriates, whose names wouldn't be relevant to this task. But then again, expatriates are already excluded from the article list, so it shouldn't make any difference. Having the bot periodically go through Category:Thai people would require less maintenance, and would be preferable. --Paul_012 (talk) 19:44, 9 March 2018 (UTC)[reply]
Oh, wait. Automatically going down the category would include categories like Category:American people of Thai descent, so this approach wouldn't work. I'll see if I can make a list of the specific categories that should be browsed instead. --Paul_012 (talk) 20:04, 9 March 2018 (UTC)[reply]
@Paul 012: I went and created a template in my userspace to have it ready to go (feel free to edit it, just please leave moving to me or others with the page mover user right as I don't want a redirect there).
Do you want {{DEFAULTSORT}} modified (on pages) to also match "Given Surname"? Adding of the Thai name sort template should be easy as the bot could (theoretically) just take the page name and "plop" it in as the parameter. It won't always work right (ie House of Abhaiwongse), but should (most likely will) work the majority of the time and might see about creating a "blacklist" of titles, where if the page title is equal to X, then it will skip it). I assume that the sub-categories of categories within the list your sandbox are also meant to be included (recursive)? --TheSandDoctor (talk) 00:39, 10 March 2018 (UTC)[reply]

Okay, TheSandDoctor, here's a newer summary of the task:

  1. A one-time task performed on the articles listed at Special:Permalink/829616472* Special:Permalink/829756891 which, entails, for each article:
    1. Modify {{DEFAULTSORT}} to match that given in the table.
    2. Add {{Thai name sort}}, with the desired sort key** as the parameter.
    3. For each [[Category:...]] in the article, see if it is included in the list***. If it is, add the same sort key to the category, but don't replace existing values.
  2. A recurring task performed on articles which are tagged with {{Thai name sort}}:****
    1. For each [[Category:...]] in the article, see if it is included in the list***. If it is, copy the sort key listed in {{Thai name sort}}, and add it to the category, but don't replace existing values.
  • *As previously mentioned, please wait for a finalised version. House of Abhaiwongse and similar non-person-name articles won't be affected as they're already excluded from the table.
  • **I'll add it to the table—there may be some exceptions that don't exactly follow the article title.
  • ***This list should be automatically compiled by scanning through Category:Thai people, with the exception of Category:Thai diaspora‎ and its subcategories, plus Category:Orders, decorations, and medals of Thailand. Please disregard the list currently in my sandbox.
  • ****I not sure how often this should be run, but once a month would probably be plenty often enough.

--Paul_012 (talk) 10:44, 10 March 2018 (UTC)[reply]

I just realised that there should also be a fallback in case {{Thai name sort}} is called with missing/empty parameters. In such cases, the article title should be used as the sort key. --Paul_012 (talk) 11:29, 10 March 2018 (UTC)[reply]
@Paul 012: Alright, let me know when you're ready. One last clarification: "with the exception of Category:Thai diaspora‎ and its subcategories, plus Category:Orders, decorations, and medals of Thailand" means to exclude Category:Orders, decorations, and medals of Thailand as well, correct? --TheSandDoctor (talk) 15:40, 10 March 2018 (UTC)[reply]
Oops, sorry. What I meant was to (1) Include Category:Thai people. (2) For all subcategories of Category:Thai people except Category:Thai diaspora, also include them and all their subcategories (recursive). (3) Include Category:Orders, decorations, and medals of Thailand and all its subcategories (recursive). --Paul_012 (talk) 16:12, 10 March 2018 (UTC)[reply]
@Paul 012: Okay, thanks for the clarification. Sorry to be asking so many questions and to be somewhat anal retentive about this, just trying to make sure we are on the same page and that I know what you want etc (need detailed details to be able to make bot and to ensure it does what you want).
Let me know when you are ready. I have to head out for a bit, but when I get back I will see about continuing to work on the bot (& then file a BFRA if things are looking good). --TheSandDoctor (talk) 16:30, 10 March 2018 (UTC)[reply]
Final list is at Special:Permalink/829756891. Still pending further discussion to address Francis Schonken's concerns below. --Paul_012 (talk) 18:44, 10 March 2018 (UTC)[reply]

TheSandDoctor, Francis Schonken has requested that manual placement of the template be manually trialled on article pages first. Could you go ahead and move your sandbox version into the template space? Thanks. --Paul_012 (talk) 09:07, 11 March 2018 (UTC)[reply]

Well, err, no, that's not what I suggested (and I certainly didn't "request" anything). In the approach I suggested {{Thai name sort}} (or a template with a different name) would be applied to *category* pages (i.e. Categories of Thai people where the collation should be according to actual article titles), not a template that would be inserted in mainspace. Anyhow, such templates—whether according to the original idea or according to my suggested scheme—should be experimented with, would have needed to have found consensus, and would have needed to be explained in the WP:SUR guidance (etc) before any sort of bot operation. This is not a page where to request manual operations, nor a page to find consensus about things that go beyond the mandate of current guidance and particular consensuses. --Francis Schonken (talk) 10:43, 11 March 2018 (UTC)[reply]
Sorry, I misread. Thanks for the clarification. The point about moving the sandbox template into template space is still valid anyway. I'll continue at Wikipedia talk:Categorization of people‎. --Paul_012 (talk) 10:57, 11 March 2018 (UTC)[reply]
@Paul 012: Template moved to Template:Thai name sort.

Please.     — The Transhumanist    10:20, 5 March 2018 (UTC)[reply]

Further anatomy infobox series help

I have another request for a bot to help tighten up our {{Infobox anatomy}} series. Ping to Nihlus who helped out last time.

Request is to:

  1. In all articles that use {{Infobox anatomy}} and all subtemplates* remove the empty |MapCaption=, and |ImageMap= (which have been integrated into the "image" parameter)
  2. In all articles that use {{Infobox muscle}} remove deprecated parameters |NerveRoot=, |PhysicalExam=
  3. In all articles that use {{Infobox anatomy}}, {{Infobox brain}}, {{Infobox neuron}} and {{Infobox nerve}} remove the parameters: |BrainInfoType=, |BrainInfoNumber=, |NeuroLexID=, |NeuroLex= (now moved to Wikidata)
  4. In all articles that use {{Infobox anatomy}} and all subtemplates* remove from pages the field |Code= which I have gone through and checked, and duplicated other fields.

I would be very grateful for this, it will significantly help tidy up both our articles and the infoboxes.--Tom (LT) (talk) 00:32, 3 March 2018 (UTC)[reply]

@Tom (LT): I'll look into this. When it comes to point #1, do you just want |MapCaption= and |ImageMap= removed or their values integrated elsewhere? If so, where? (Would MapCaption just have its value put in |Caption=?) --TheSandDoctor (talk) 17:07, 9 March 2018 (UTC)[reply]
This is an extension of something my bot already did. I just need to alter the settings and run it for this. I can run it some time this weekend. Nihlus 21:11, 9 March 2018 (UTC)[reply]
@TheSandDoctor I have already replaced ImageMap with Image in all articles that used the parameter. Now there's just stacks of empty parameters sitting around (which will display an error message when I finally remove it from the infobox in totalis). --Tom (LT) (talk) 21:57, 9 March 2018 (UTC)[reply]
Thanks very much Nihlus. To let you know, I notice I have made a spelling mistake above and have now corrected it (|NerveRoom= -> |NerveRoot=)). Like last time, once the bot runs I'll be able to remove the parameters, then I will manually go through all articles that have parameter problems and fix them. --Tom (LT) (talk) 21:57, 9 March 2018 (UTC)[reply]
@Tom (LT): @Nihlus: Roger. Had started writing it, but hadn't finished & was good practice anyways . Surprised that this page was not on my watchlist, have solved that problem now. --All the best, TheSandDoctor (talk) 00:11, 10 March 2018 (UTC)[reply]
@Tom (LT): Can you clarify point 4? Should |Code= be removed from all of those templates or do you have a separate list of affected pages somewhere else? Nihlus 11:25, 11 March 2018 (UTC)[reply]
@Nihlus the bot will need to run through all pages that use those templates and remove the blank parameters - See eg [6] - I removed the "ImageMap" and "MapCaption" parameters which are blank. Point 4 is that the bot will also need to run through and remove any blank "Code" parameters, too (eg as I have done here [7]). Once that's done I'll remove it from the templates --Tom (LT) (talk) 19:17, 11 March 2018 (UTC)[reply]

Adding category to articles

Pleas add this category to the articles related to the Children's literature portal, because I need it in arabic wikipedia. Thank you. أبو هشام السوعان (talk) 12:10, 4 March 2018 (UTC)[reply]

@أبو هشام السوعان: Why do you need it in the Arabic Wikipedia? If you need it there, why ask on the English Wikipedia? Also, doesn't Category:Children's literature already contain the related articles? If I am misunderstanding, I apologize (also why I am asking for clarification) --TheSandDoctor (talk) 16:21, 9 March 2018 (UTC)[reply]
The problem solved, thanks. أبو هشام السوعان (talk) 00:27, 10 March 2018 (UTC)[reply]

See Wikipedia talk:List of Wikipedians by article count#Updating?.     — The Transhumanist    10:18, 5 March 2018 (UTC)[reply]

Olympic competitors: Project tagging

Can a bot be created to add the {{WikiProject Olympics}} to the talkpage of all the articles in the sub-cats of Category:Olympic competitors by country that don't already have their TP tagged? If the tag already exists, ignore it, and if it's not there already add it with stub class and low importance, unless the article is already tagged at a higher class than stub by another project. Now the 2018 Winter Olympics are over, it would be good to catch all those athletes who are missing the tag, along with countless others that have been created/updated too. Many thanks. Lugnuts Fire Walk with Me 13:02, 8 March 2018 (UTC)[reply]

For interest, Petscan results show 104k pages in this cat and its subpages. It's likely the majority are already tagged, but that's still a hell of a lot of pages to parse. Primefac (talk) 13:18, 8 March 2018 (UTC)[reply]
Excluding those already tagged gives 13,872 results. Are Ancient Olympians within the scope of the project though? They also fall within the category. --Paul_012 (talk) 14:53, 8 March 2018 (UTC)[reply]
Duh, should have thought of that. And I suppose Ancient Olympians would be in the scope of WikiProject Olympics. Primefac (talk) 15:04, 8 March 2018 (UTC)[reply]
When requests like this are made, we normally ask for an explicit list of categories and not a blanket "plus all subcategories" approach - that way lies mistagging. --Redrose64 🌹 (talk) 16:12, 8 March 2018 (UTC)[reply]
Eyeballing the Petscan for all subcats looks okay, though I didn't check the entire list. --Izno (talk) 16:23, 8 March 2018 (UTC)[reply]
Going deeper:
  1. Checking for sports inclusion: with 1 cat/row, removing matches to the regex [\r\n][^\r\n]*?\b(archers|artists|athletes|(bi|tri|pent)athletes|bobsledders|boxers|canoeists|competitors|cricketers|curlers|cyclists|divers|equestrians|fencers|footballers|golfers|gymnasts|jumpers|lugers|managers|medall?ist stubs|medall?ists|Members of|Olympians|pilots|players|practitioners|racers|rowers|sailors|shooters|skaters|skiers|snowboarders|swimmers|weightlifters|wrestlers)\b[^\r\n]* leaves only 173 Category:Olympic judoka of Japan-type cats, Category:Olympic pelotaris by country, Category:Olympic pelotaris of France, Category:Olympic pelotaris of Spain, and Category:1980 US Olympic ice hockey team. Since, as I just found out, Judoka is one who practices Judo, and pelotaris refers to players of various court-sports (the pelotaris cats only contain people too), everything looks legit here.
  2. Checking for Olympics inclusion: with 1 cat/row, removing matches to the regex [\r\n][^\r\n]*?\b(olympics?|olympians)\b[^\r\n]* leaves only Category:Canoeists of the Republic of Macedonia, which only contains Olympic athletes.
All 5048 cats look good to me. The last canoeists cat deserves a name change though.   ~ Tom.Reding (talkdgaf)  19:45, 8 March 2018 (UTC)[reply]
I've done project tagging before. Looks like it'd be best to take class/importance from {{Wikiproject Biography}}.   ~ Tom.Reding (talkdgaf)  15:34, 8 March 2018 (UTC)[reply]
There are two problems with taking importance from {{WikiProject Biography}}: one is that the importance rating varies between WikiProjects - a topic that is high-importance to one might be low importance to another; the other is that {{WikiProject Biography}} doesn't have importance ratings. Taking the class rating from {{WikiProject Biography}} should be fine though. --Redrose64 🌹 (talk) 16:32, 8 March 2018 (UTC)[reply]
Good point. By 'best' I only meant that it seems to be the most prevalent WP banner in the lot.   ~ Tom.Reding (talkdgaf)  16:39, 8 March 2018 (UTC)[reply]
I should qualify that. {{WikiProject Biography}} doesn't have general importance ratings, but it does have workgroup-specific importance ratings, and these are described as priority ratings. For example, when |sports-work-group=yes is set, then |sports-priority= is recognised; but somebody who is |sports-work-group=yes|sports-priority=low for {{WikiProject Biography}} might rate |importance=mid for {{WikiProject Olympics}}, see for example Talk:Christopher Dean (don't forget to [show] the "WikiProject Biography / Sports and Games" row). So I still think that the importance shouldn't be copied. --Redrose64 🌹 (talk) 10:39, 9 March 2018 (UTC)[reply]
Thanks for everyone's input - is this likely to happen? Thanks again. Lugnuts Fire Walk with Me 09:21, 9 March 2018 (UTC)[reply]
BRFA submitted.   ~ Tom.Reding (talkdgaf)  15:45, 9 March 2018 (UTC)[reply]
Thanks Tom! Lugnuts Fire Walk with Me 14:22, 11 March 2018 (UTC)[reply]

rsssf

hi rsssf.com have many articles about soccer please make bot to added articles from rsssf.com — Preceding unsigned comment added by Amirh123 (talkcontribs) 15:48, 8 March 2018 (UTC)[reply]

Hi there Amirh123, I think that WP:MASSCREATION might apply in this case. Also, due to copyright restrictions, Wikipedia could not just take content from other sites. If you have any questions, please feel free to let me know. --TheSandDoctor (talk) 16:24, 9 March 2018 (UTC)[reply]
hi rsssf.com is free content please make bot to adding articles for this site thanks — Preceding unsigned comment added by Amirh123 (talkcontribs) 08:19, 11 March 2018 (UTC)[reply]
@Amirh123:, please define "free" in this context ("free" to view, or open license/public domain?). Please also see Wikipedia:Copying text from other sources. It is seldom ever appropriate to directly copy content from sources as doing so (in most cases) would be a copyright violation and would be speedily deleted. --TheSandDoctor (talk) 13:29, 11 March 2018 (UTC)[reply]

Mass editing {{DEFAULTSORT}} values

Related to but more generic than #Fixing sort keys for biographies of Thai people above, I'm looking for a bot to make mass edits to {{DEFAULTSORT}} keys (or add them if they don't exist) for a pre-defined list of articles, i.e. Special:PermaLink/829542720. These are articles that may have previously been tagged with incorrect defaultsort keys. Optimally, the bot should also skip the edits if changes are made only in capitalisation. Edits which result in no changes would of course be skipped. --Paul_012 (talk) 08:23, 9 March 2018 (UTC)[reply]

@Paul 012: Is it just for those articles in that version of the sandbox? Is it just for Thai people? How would these be found exactly? --TheSandDoctor (talk) 00:46, 10 March 2018 (UTC)[reply]
@Paul 012: I have a functional proof of concept now (for changing the defaultsort value), just need the confirmation on details above & will file BFRA. --TheSandDoctor (talk) 03:50, 10 March 2018 (UTC)[reply]
This would be a one-time edit for just the 215 listed articles (which are not part of the Thai name sort task above.) --Paul_012 (talk) 09:59, 10 March 2018 (UTC)[reply]

As said, now explicitly, at Wikipedia talk:Categorization of people#Thai names, I'm opposing this bot operation. Since only two people commented there thus far (the OP and me), with a 50%/50% division of opinions, this needs more time for discussion, with let's hope a bit more input from other editors, before firing up a bot. The same goes for the #Fixing sort keys for biographies of Thai people BotReq proposal above, although that one might be more in line with current guidance (can't really get my head around it yet). --Francis Schonken (talk) 17:00, 10 March 2018 (UTC)[reply]

Still something else, bot-assisted insertion of a {{DEFAULTSORT}} value that is exactly equal to the article title of the page where it is inserted would be a WP:COSMETICBOT infringement, as far as I understand the applicable policy. --Francis Schonken (talk) 17:13, 10 March 2018 (UTC)[reply]

Thanks for the comments, Francis Schonken. This request (for the 215 articles) is in accordance with the current guidelines. All the listed articles here are multi-word names which do not contain surnames, which is why comma-separated sort keys would be incorrect. This bot task is to rectify those that have been mistakenly added. Regarding your concerns of the difference between Luang Pu Sodh Candasaro and Luang Pu Thuat, this is because all the other Luang Pus are titles preceding the person's name, but Luang Pu Thuat is a specific name in and of itself—the subject's name wasn't Thuat. (I think this is quite like how Lady Gaga isn't sorted Gaga, Lady because she isn't a lady named Gaga.) --Paul_012 (talk) 17:29, 10 March 2018 (UTC)[reply]
Sorry, no, the request goes beyond what is mandated by the applicable guidance afaics, so would need to find consensus elsewhere first. --Francis Schonken (talk) 17:33, 10 March 2018 (UTC)[reply]
Specifically, the guidance only mentions per-category sort keys for Thai categories (clearly assuming that the {{DEFAULTSORT}} is defined as "surname, given name(s)"), and does not mandate to set the {{DEFAULTSORT}} to "given name(s) surname", which should not normally be done for any article with an actual title in that format, and is thus not mandated by any policy, and is an infringement on WP:COSMETICBOT if done by bot. --Francis Schonken (talk) 17:40, 10 March 2018 (UTC)[reply]
You might want to re-read my above comment. None of the articles titles here contain a surname. Most of them are royalty and nobility, and are covered by WP:PEERS. As for the WP:COSMETICBOT concerns, one could also argue that manually inputting any DEFAULTSORT would be unnecessary, as it results in no changes in the sorting. But the point here is to prevent unknowing editors from inserting incorrect values. I don't think this violates the spirit of WP:COSMETICBOT. --Paul_012 (talk) 17:45, 10 March 2018 (UTC)[reply]

I originally thought manually going through all those articles would be an unnecessary waste of time and effort. Seeing the difficulty I'm having in explaining the task, however, it has become clear that further discussions would actually waste more time and effort on everybody's part than just manually performing the edits. I have gone ahead and done so. Thanks to TheSandDoctor for the assistance, but this is now moot. Marking as N Not done. --Paul_012 (talk) 10:26, 11 March 2018 (UTC)[reply]

@Paul 012: So is it N Not done for both this and the above? Should I move the template back to my userspace & tag U1 then? (Sorry for the delay in my response, this was sent at around 3:30am & the previous one at around 1:10am.) --TheSandDoctor (talk) 13:22, 11 March 2018 (UTC)[reply]
@TheSandDoctor Just this one. The above is still pending further discussion, so please keep the template for now. --Paul_012 (talk) 13:26, 11 March 2018 (UTC)[reply]

Wikiproject task force tagging (Reality TV)

Can I request a bot to tag the articles that are in Category:Reality television series with "|reality-tv=yes|reality-tv-importance="(importance can be assessed manually afterwards). I have created a list of each individual category here after removing certain subcategories, mostly participant and container categories. Please let me know if you think that needs more refining. A lot of the articles already have WPTV but are just missing the Reality TV task force label, and some older articles don't have the project at all. So would need the bot to add the full WPTV+realitytv tag to any that are missing the project, and only add the task force parameter to those that are already under WPTV. WikiVirusC(talk) 16:30, 9 March 2018 (UTC)[reply]