Jump to content

Wikipedia:Bot requests: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
Line 255: Line 255:
::Thank you. Will you let me know here, at my talk or at the project page? If there's a choice, the project page would be preferable. Thanks again, <small>--<span style="font-family: Trebuchet MS, sans-serif;border:2px solid #A9A9A9;padding:1px;">[[User:Jza84|<b>Jza84</b>]] | [[User_talk: Jza84|<font style="color:#000000;background:#D3D3D3;">&nbsp;Talk&nbsp;</font>]] </span></small> 12:23, 12 September 2008 (UTC)
::Thank you. Will you let me know here, at my talk or at the project page? If there's a choice, the project page would be preferable. Thanks again, <small>--<span style="font-family: Trebuchet MS, sans-serif;border:2px solid #A9A9A9;padding:1px;">[[User:Jza84|<b>Jza84</b>]] | [[User_talk: Jza84|<font style="color:#000000;background:#D3D3D3;">&nbsp;Talk&nbsp;</font>]] </span></small> 12:23, 12 September 2008 (UTC)
::: Apologies for the delay in the response. I was away over the weekend. btw you can see the entire subcats under [[:Category:Merseyside]] at [[Wikipedia:WikiProject Merseyside/Cats]]. Please double check each category and remove all the possibly wrong categories from the page. I will also leave a note on the project page. Once it is throughly checked and all the false positive categories are hopefully removed, kindly let us know. -- [[User:Tinucherian|'''<em style="font-family:Kristen ITC;color:#ff0000"> Tinu</em>''']] [[User talk:Tinucherian|'''<em style="font-family:Kristen ITC;color:#ff0000">Cherian </em>''']] - 05:47, 15 September 2008 (UTC)
::: Apologies for the delay in the response. I was away over the weekend. btw you can see the entire subcats under [[:Category:Merseyside]] at [[Wikipedia:WikiProject Merseyside/Cats]]. Please double check each category and remove all the possibly wrong categories from the page. I will also leave a note on the project page. Once it is throughly checked and all the false positive categories are hopefully removed, kindly let us know. -- [[User:Tinucherian|'''<em style="font-family:Kristen ITC;color:#ff0000"> Tinu</em>''']] [[User talk:Tinucherian|'''<em style="font-family:Kristen ITC;color:#ff0000">Cherian </em>''']] - 05:47, 15 September 2008 (UTC)

::::In case you didn't see [http://en.wikipedia.org/w/index.php?title=Wikipedia_talk%3AWikiProject_Merseyside&diff=238565214&oldid=238524208 Jza84's reply] on the project talk page, we think we're ready to go. [[User:Nev1|Nev1]] ([[User talk:Nev1|talk]]) 21:26, 15 September 2008 (UTC)
::::In case you didn't see [http://en.wikipedia.org/w/index.php?title=Wikipedia_talk%3AWikiProject_Merseyside&diff=238565214&oldid=238524208 Jza84's reply] on the project talk page, we think we're ready to go. [[User:Nev1|Nev1]] ([[User talk:Nev1|talk]]) 21:26, 15 September 2008 (UTC)
::::: {{BOTREQ|doing}} : Bot working on the tagging. Thanks -- [[User:Tinucherian|'''<em style="font-family:Kristen ITC;color:#ff0000"> Tinu</em>''']] [[User talk:Tinucherian|'''<em style="font-family:Kristen ITC;color:#ff0000">Cherian </em>''']] - 05:52, 16 September 2008 (UTC)


== Inactive users ==
== Inactive users ==

Revision as of 05:52, 16 September 2008

This is a page for requesting tasks to be done by bots per the bot policy. This is an appropriate place to put ideas for uncontroversial bot tasks, to get early feedback on ideas for bot tasks (controversial or not), and to seek bot operators for bot tasks. Consensus-building discussions requiring large community input (such as request for comments) should normally be held at WP:VPPROP or other relevant pages (such as a WikiProject's talk page).

You can check the "Commonly Requested Bots" box above to see if a suitable bot already exists for the task you have in mind. If you have a question about a particular bot, contact the bot operator directly via their talk page or the bot's talk page. If a bot is acting improperly, follow the guidance outlined in WP:BOTISSUE. For broader issues and general discussion about bots, see the bot noticeboard.

Before making a request, please see the list of frequently denied bots, either because they are too complicated to program, or do not have consensus from the Wikipedia community. If you are requesting that a template (such as a WikiProject banner) is added to all pages in a particular category, please be careful to check the category tree for any unwanted subcategories. It is best to give a complete list of categories that should be worked through individually, rather than one category to be analyzed recursively (see example difference).

Alternatives to bot requests

Note to bot operators: The {{BOTREQ}} template can be used to give common responses, and make it easier to keep track of the task's current status. If you complete a request, note that you did with {{BOTREQ|done}}, and archive the request after a few days (WP:1CA is useful here).


Please add your bot requests to the bottom of this page.
Make a new request
# Bot request Status 💬 👥 🙋 Last editor 🕒 (UTC) 🤖 Last botop editor 🕒 (UTC)
1 Automatic NOGALLERY keyword for categories containing non-free files (again) 16 8 WOSlinker 2024-06-25 12:44 Legoktm 2024-06-24 01:34
2 Can we have an AIV feed a bot posts on IRC? 8 3 Legoktm 2024-06-21 18:24 Legoktm 2024-06-21 18:24
3 Bot to update match reports to cite template BRFA filed 14 5 Yoblyblob 2024-06-20 21:21 Mdann52 2024-06-20 21:11
4 Bot to mass tag California State University sports seasons Doing... 5 4 Frostly 2024-06-10 17:05 Headbomb 2024-06-09 17:28
5 Clear Category:Unlinked Wikidata redirects 8 5 Vandervalp 2024-07-01 11:16 DreamRimmer 2024-04-21 03:28
6 Fixing stub tag placement on new articles Declined Not a good task for a bot. 4 3 Headbomb 2024-05-19 20:17 Headbomb 2024-05-19 20:17
7 User:RetractionBot, v2 Y Done 8 5 Mdann52 2024-05-25 16:06 Mdann52 2024-05-25 16:06
8 Bot to change citations to list defined references Declined Not a good task for a bot. 3 2 Apoptheosis 2024-06-09 17:44 Headbomb 2024-06-09 16:56
9 Adding Facility IDs to AM/FM/LPFM station data BRFA filed 11 3 Mdann52 2024-07-06 12:36 Mdann52 2024-07-06 12:36
10 Tagging women's basketball article talk pages with project tags 9 4 Qwerfjkl 2024-06-29 21:03 Qwerfjkl 2024-06-29 21:03
11 Friendly support for Draft categories – feedback request 1 1 Mathglot 2024-06-10 19:40
12 'Literature of Kashmir' Declined Not a good task for a bot. 2 2 Usernamekiran 2024-06-11 07:37 Usernamekiran 2024-06-11 07:37
13 Adding links to previous TFDs 7 4 Qwerfjkl 2024-06-20 18:02 Qwerfjkl 2024-06-20 18:02
14 Bot that condenses identical references Coding... 9 4 Acebulf 2024-06-19 03:07 Headbomb 2024-06-18 00:34
15 Convert external links within {{Music ratings}} to refs 2 2 Mdann52 2024-06-23 10:11 Mdann52 2024-06-23 10:11
16 Stat.kg ---> Stat.gov.kg 2 2 DreamRimmer 2024-06-23 09:21 DreamRimmer 2024-06-23 09:21
17 Add constituency numbers to Indian assembly constituency boxes 3 2 C1MM 2024-06-25 03:59 Primefac 2024-06-25 00:27
18 Replace banners of merged history WikiProjects
Resolved
8 2 Primefac 2024-07-05 17:51 Primefac 2024-07-05 17:51
Legend
  • In the last hour
  • In the last day
  • In the last week
  • In the last month
  • More than one month
Manual settings
When exceptions occur,
please check the setting first.



Old Protection Notification

Betacommand said he would try to work on this, but the community seems to prefer he not get within 100 feet of any not fully manual option (this is why I avoid AN/I), so I'm going to re-request this:

There are many, many old semiprotected pages that have simply been forgotten about and have no expiry date (there are a few that were before expiries were implemented in April 2007 too). We've been discussing ideas about what to do, and I came up with this idea.

My idea is to have a bot notify the protecting admins that it's been over (say) 2 months since a page was protected without an expiry date. I manually tested the idea out yesterday, notifying about 30 admins about old protections (see an example here) with pretty good responses. In some cases, the admin did unprotect the page, some leaving a comment just saying they simply forgot about the page (which is understandable). I'm thinking someone should be able to get a bot to do this. Just start here and work forward (or a database dump would probably work too since many protections date back over a year), find the last admin to protect the page, and notify them (I used a boilerplate here).

So, is this doable? -Royalguard11(T) 03:40, 23 August 2008 (UTC)[reply]

Or is this too complex to do or something? -Royalguard11(T) 16:21, 28 August 2008 (UTC)[reply]
The easiest way to do this from my POV would be to analyse the stub-history DB dump. Therefore it would want to be run just after a DB dump. It's interesting, just after protection was introduced we had the feeling that perhaps 5 or 6 pages would be protected. Now we have hundreds of semi-protected and probably of protected, especially in template space. Rich Farmbrough, 12:00 2 September 2008 (GMT).
We'd be more interested in article protection. Admins have a tendency to protect and forget, leaving semiprotection on some page for over a year. I'm not an expert on DB dumps, but I'm sure even a month old dump would work since we're more concerned about the really old protections, maybe not 2 months but definitely anything over 6. -Royalguard11(T) 22:49, 5 September 2008 (UTC)[reply]

So I tried building an SQL query to do this and got a query to get pages which can tell what type of restriction a page currently has, but it does not provide any of the logging information which is a separate table. Some notes of curiosity: protected redirects and protected pages with only one editor should be examed more carefully.

SELECT page.page_namespace, page.page_title, page.page_is_redirect, page_restrictions.*
FROM page INNER JOIN page_restrictions ON page.page_id = page_restrictions.pr_page
WHERE page_restrictions.pr_type="edit" AND page_restrictions.pr_expiry="infinity"
LIMIT 50;

It should be possible to the newest entry from the logging table to find the oldest still protected pages.

SELECT *
FROM logging
WHERE log_type= "protect"
LIMIT 50;

You might want to open a ticket with tswiki:Query service and ask them to do this for you. — Dispenser 17:29, 9 September 2008 (UTC)[reply]

You could use tools:~erwin85/protectedpages.php which is a beta version of a new tool. It uses a separate table with logging and protection information. As there are quite a lot of protected pages on this Wikipedia you can't run more complex queries yourself. Leave a message on my talk page and I'll run them. --Erwin(85) 19:33, 9 September 2008 (UTC)[reply]
I got the first 1000 pages at tools:~legoktm/protect.html LegoKontribsTalkM 23:44, 15 September 2008 (UTC)[reply]

RescueBot?

Hi, I would like some help at Wikipedia:Article Rescue Squadron (ARS). I'm new to the bot world so if this is the wrong place please feel free to direct me to the correct place. When any user adds the {{rescue}} tag it adds the article to Category:Articles that have been proposed for deletion but that may concern encyclopedic topics. Someone, usually myself, then lists the article at Wikipedia:Article Rescue Squadron/Current articles as follows:

Article X

(we add a signature to give us a rough date of entry)

Ideally a bot would do this for us several times a day, the volume has been low so far 10-20 articles at any one time. If an article is listed already then no need. If it gets listed twice we can cope. The ARS deals almost exclusively with articles at AfD so when an article has the rescue template added we have anywhere from hours to less than 5 days to do any article rescue work. It would also be nice if the bot could add a note to the article's section when an AfD closes.

So that's my basic request for bot help - any ideas? Banjeboi 09:01, 3 September 2008 (UTC)[reply]

I can't do it until this afternoon, at least, but this is similar to other bots I have written, so it should be easily adaptable. --uǝʌǝsʎʇɹoɟʇs(st47) 10:39, 3 September 2008 (UTC)[reply]
Any update? Banjeboi 23:29, 7 September 2008 (UTC)[reply]
I've got the bot finished, but it relies on a resolution to bugzilla:15420 - it can't edit until that bug is resolved. It's been fixed, and now I'm just waiting for one of the sysadmins on Wikimedia to update Wikipedia so I can do some testing. --uǝʌǝsʎʇɹoɟʇs(st47) 10:39, 8 September 2008 (UTC)[reply]
Wikipedia:Bots/Requests for approval/STBot 15. --uǝʌǝsʎʇɹoɟʇs(st47) 19:43, 8 September 2008 (UTC)[reply]
Excellent! I'll await further developments. -- Banjeboi 21:30, 8 September 2008 (UTC)[reply]

Change card suite character template invocations

Please replace

I would like to repurpose {{Hs}} to relieve {{H}} of its double duty. --Yecril (talk) 08:19, 4 September 2008 (UTC)[reply]

Some invocation of {{cards}} would be better. (also)Happymelon 08:31, 4 September 2008 (UTC)[reply]

I have fixed {{cards}} to support stand-alone suit symbols. {{Hearts}} can me modified to invoke {{cards|hearts}} and so on and both can be supported. I think the political question whether to use {{Hearts}} or {{cards|h}} does not need solving in order for the requested bot to do the right thing as originally specified. --Yecril (talk) 09:49, 4 September 2008 (UTC)[reply]

I can do this. However do we want {{Ss}} to go to {{Spades}} or {{cards|s}}? Both are equally as easy for my bot to do. --T-rex 13:24, 4 September 2008 (UTC)[reply]

I never use these templates anyway so it is hard to tell. I think the bot should edit in the same way an editor would. I think {{Spades}} would be more intuitive because the editor already knows she is writing about cards; she could be annoyed about having to this information. {{cards}} are more friendly to the reader in a mixed environment but the application of these templates outside card games context is minimal. I would go with the original request but I am not authoritative here. User:Happy-melon, anyone? --Yecril (talk) 15:20, 4 September 2008 (UTC)[reply]

The principle is, IMO, that we should be condoning the "editor-friendly" approach and encouraging the "meta-friendly" method. By that I mean, using {{cards}} is more technically elegant, reduces the number of templates that need to be maintained, and is generally cleaner. We shouldn't be preventing people from 'taking the easy option' by using {{hearts}} instead of {{cards|h}}, but we should be using the more 'high-tech' implementation. Really the {{hearts}}, {{clubs}} etc, templates should be soft redirects to {{cards}}. Oh, and someone really needs to convert {{cards}} to use a switch statement: the current subpage implementation is horrible!! (also)Happymelon 21:13, 4 September 2008 (UTC)[reply]
Mission impossible because {{cards}} is expected to process running text converting characters to symbols on the fly. A special parser function would be necessary. My failed attempt is at {{pcards}} for now (p for parse).
And the number of subtemplates required by {{cards}} is much bigger that 4. --Yecril (talk) 14:13, 5 September 2008 (UTC)[reply]
Given the lack of any clear preference for one template over the other, I have decided to go with the {{Hearts}} template. Mainly because the use for this is slightly different then that for {{cards}} and that it is more or less agreed that the cards implementation is poor. If it ever becomes a good idea to switch everything over to the cards template completely, my bot is capable of doing that as well. --T-rex 13:40, 8 September 2008 (UTC)[reply]
These templates should now be completely disused. --T-rex 18:32, 13 September 2008 (UTC)[reply]

A bot for finding images that aren't at Commons

I'm trying to go through certain articles, currently those on gulls, that have images on them that aren't at Commons. To do this manually can be tiresome, especially when you have to look through a lot of images to find one to transfer. Is it possible for a bot script to do this? You could give it a article, a list of articles, or maybe even a category (it would search all the pages within, perhaps also including all pages in subcategories).

Others that move images to the Commons might also find such a script useful. Richard001 (talk) 05:13, 8 September 2008 (UTC)[reply]

It would be possibly better if someone with toolserver access did this, but if noone else offers, I'll set up a wiki-based interface to do this. --uǝʌǝsʎʇɹoɟʇs(st47) 19:32, 8 September 2008 (UTC)[reply]
Wikipedia:Bots/Requests for approval/STBot 14. --uǝʌǝsʎʇɹoɟʇs(st47) 19:43, 8 September 2008 (UTC)[reply]
I might try having a go at as part of an image checker/copyright tool. But don't hold your breath as I'm still new to SQL/DB thing. — Dispenser 03:43, 12 September 2008 (UTC)[reply]

Incompatible image tags

It would be very helpful if a bot could remove the PD tags from images that have incompatible tags, for example Image:'Wheel', Indian red granite sculpture by --Satoru Abe--, 1991, --Hawaii State Art Museum--.JPG. An image of a 3D piece of copyrighted art cannot also be PD. I've seen a few of these get moved to Commons, and I think its confusing people. These obviously non-free images are clogging up free images categories, so it takes more time to sort/move images. If any of these images are actually free (which I don't think would be very many), they can be re-tagged with a single free tag by an editor (non-free images get more attention than free ones). ~ JohnnyMrNinja 07:39, 8 September 2008 (UTC)[reply]

Just to clarify: you want a bot to remove the Public Domain notice from images that also contain a non-free copyright notice? If this is the case, do you have any idea how many images are like this? ~ AmeIiorate U T C @ 08:00, 8 September 2008 (UTC)[reply]
I disagree, images with problems should be hand inspected, rather than be "fixed" by a bot. If there is one issue there is very likely more. BJTalk 08:07, 8 September 2008 (UTC)[reply]
I have no idea how many images there are, though the top of Category:User-created public domain images is filled with them. I cannot see many problems with this, but would it be more agreeable to have the tag commented out? That way it would be clear to any future editors what happened. There is no way that a copyright tag and a PD tag can ever be compatible, and it's best to err on the side of non-free, as I find that 99.9% of the time that a PD tag is used incorrectly it is because the image is not PD. ~ JohnnyMrNinja 08:38, 8 September 2008 (UTC)[reply]
When I get back I'll run a query and get a list and see how many we are talking about. BJTalk 19:34, 8 September 2008 (UTC)[reply]
Actually, I asked a question about this at Media_copyright_questions and the reply I got was:

Pretty much, there are two levels of copyright involved in a picture of a 3-dimensional work of art. The original copyright of the work of art, and the copyright of photograph itself, which though a derivative work, gets its own copyright because of the importance of light/shadow/framing involved with photographing 3D objects. To use a photo of 3D art, you need both to be okay. The photograph itself is being released to the public domain, that's what that tag means. The art works themselves (here by George Rickey and Isami Noguchi), are not in the public domain, but because the images significantly add to reader understanding of the articles and meet the rest of NFCC, we're using that copyright under fair use.

--balloonguy (talk) 19:41, 11 September 2008 (UTC)[reply]

Well I guess that clears that up. Thanks! ~ JohnnyMrNinja 06:19, 15 September 2008 (UTC)[reply]

article list -> catagory

I have a (10) longish lists of of articles that need to be put in ten related catagories.

6th century Christian saints
7th century Christian saints
etc.

Is there a Bot that can do this please? --Carlaude (talk) 19:16, 8 September 2008 (UTC)[reply]

Bot approval request will be filed momentarily. --uǝʌǝsʎʇɹoɟʇs(st47) 19:29, 8 September 2008 (UTC)[reply]
Wikipedia:Bots/Requests for approval/STBot 13. --uǝʌǝsʎʇɹoɟʇs(st47) 19:43, 8 September 2008 (UTC)[reply]
Thank you.--Carlaude (talk) 00:59, 13 September 2008 (UTC)[reply]

Coordinates for Communes of the Nord department

Can a bot populate the coordinates column in Communes of the Nord department, A-K & Communes of the Nord department, L-Z, using {{coord}}, by fetching the coordinates from each of the individual articles listed on those pages? If so, I'll add a similar column to the articles listing communes in other departments, also. then {{kml}} can be added to the pages, so that all the places can be plotted on Google Maps, downloaded to GPS devices, etc. Thank you. Andy Mabbett (User:Pigsonthewing); Andy's talk; Andy's edits 10:06, 9 September 2008 (UTC)[reply]

Replacing a film image icon

Consensus from WikiProject Films after discussion was to replace instances of Image:Film reel.svg on Wikipedia with Image:Video-x-generic.svg. Both images are svg and both are free-use from Wikimedia Commons. It could be done using AWB but 688 pages link to the former image. Thank you, Cirt (talk) 10:34, 9 September 2008 (UTC)[reply]

I can do this (with TINA). I'll set it going in a bit. ~ AmeIiorate U T C @ 11:01, 9 September 2008 (UTC)[reply]
Thank you! Cirt (talk) 11:06, 9 September 2008 (UTC)[reply]
Doing... sorry, took a bit longer than expected to set up. ~ AmeIiorate U T C @ 12:10, 9 September 2008 (UTC)[reply]
Done, sort of. There are no mainspace pages with the image, but there are still a few talk pages which include it through {{WPCHINA}} and {{WP Australia}} (you have to view the source of the templates to see that they do use the image if the right parameters are set). I think I got the bulk of the rest of them, but you might want to have a look. Cheers, ~ Ameliorate! U T C @ 14:24, 9 September 2008 (UTC)[reply]
Thank you very very much for your work on this! I will get to checking out {{WPCHINA}} and {{WP Australia}} within the next few days. Cirt (talk) 04:37, 10 September 2008 (UTC)[reply]

Deprecated coordinates templates

Please can we have a bot to change the nine coor * coordinates templates, which are deprecated per discussion at WP:GEO, to {{coord}}?

The changes are in three stages, acting on three templates each:

  1. Change all instances of {{coor d}}, {{coor dm}} and {{coor dms}} by simply changing the letters in the first part (coor, coor dm or coor dms), to coord.
  2. Change all instances of {{coor title d}}, {{coor title dm}} and {{coor title dms}} by changing the letters in the first part, as above, and appending a |display=title parameter.
  3. Change all instances of {{coor at d}}, {{coor at dm}} and {{coor at dms}} by changing the letters in the first part, as above, and appending a |display=inline,title parameter.

It may be possible to redirect the first three set of templates, to {{coord}}, in the interim. If so. I'll let you know.

Thank you.Andy Mabbett (User:Pigsonthewing); Andy's talk; Andy's edits 19:52, 9 September 2008 (UTC)[reply]

My bot is capable of replacing the templates in group 1, but I would suspect that you would actually be better of just redirecting them. --T-rex 22:32, 10 September 2008 (UTC)[reply]
Thanks, I've requested that. The other six will still need to be replaced by a bot. Andy Mabbett (User:Pigsonthewing); Andy's talk; Andy's edits 22:40, 10 September 2008 (UTC)[reply]
The request to redirect the three coor d/dm/dms templates was declined, so we still need a bot for all nine. Andy Mabbett (User:Pigsonthewing); Andy's talk; Andy's edits 22:39, 15 September 2008 (UTC)[reply]

Wiktionary-logo-en.png to Wiktionary-logo-en.svg

Can you change all Portal:* pages that use Wiktionary-logo-en.png to Wiktionary-logo-en.svg? Also Wikipedia:*/right panel if you could.

Most of the other pages that use it should use the wiktionary template anyway, so no reason to change them. Ariel. (talk) 13:02, 10 September 2008 (UTC)[reply]

Its like 100 pages, so I'll tab through it in AWB. MBisanz talk 13:04, 10 September 2008 (UTC)[reply]
Thanks! If you could check pages that start with Wikipedia:* too that would be great - but not all of them should change, some are archives or proposals. Ariel. (talk) 13:09, 10 September 2008 (UTC)[reply]

dump anaylsis request

I'm not sure whether this is the gith place for this request, as I want an analysis of the dump not a bot activity. If someone can redirect me to a more appropriate page, I'd appreciate it.

Could someone please list in a file every ns:0 article Foo that satisfies all the following criteria?

  • English Wiktionary (en.wiktionary) does not have the page en:wikt:foo.
  • If Foo is a redirect to Bar, then the page Bar contains the word foo in it, with first letter lowercase.
  • If Foo is not a redirect, then the page Foo contains the word foo in it, with first letter lowercase.

If possible, the following criterion should also be met:

  • The title Foo has a space (U+0020) in it.

Thanks much.—msh210 21:52, 10 September 2008 (UTC)[reply]

This is certainly doable, and it is at least a quasi-bot activity, since your first criterion requires accessing Wiktionary (or perhaps could be done from a Wiktionary dump?). Bear in mind that the most recent available Wikipedia dump is from July 23, 2008, so the results may be somewhat outdated. --Russ (talk) 14:25, 11 September 2008 (UTC)[reply]
Should be doable with dumps only. It seems the latest English Wiktionary dump is from mid-June, but, assuming pages get added more often than deleted, it could be used for a first pass and the results refined afterwards (either using the toolserver or just pasting it to a temp page on Wiktionary and looking for bluelinks). I could try it, but I've never tried working with the fulltext dumps before, so if someone else is already on it I'll gladly leave it to them. —Ilmari Karonen (talk) 16:01, 11 September 2008 (UTC)[reply]
I don't have time to do this myself, unfortunately, but my recommendation for parsing fulltext dumps is to use an xml library like expat. Here is an example framework in C from a project I did some time ago, which simply iterates through all pages in the dump and runs each page's text through a subroutine. — Carl (CBM · talk) 16:19, 11 September 2008 (UTC)[reply]
I'm working on it. Are you sure that your specifications are correct? The part about redirects worries me a little (since several misspelled versions of Foo may redirect to the same Bar). Let me give you a couple of examples:
  1. Beach ball has a space in the title, but wikt:beach ball exists, therefore Beach ball is not on your list.
  2. Class room is a redirect to Classroom; wikt:class room does not exist (your first criterion is based on the redirect, not the target); and Classroom does contain the words "class room" in the text, therefore Class room goes on your list.
  3. Classroom does not have a space in the title, so it does not go on your list.
  4. Articles with titles like John Jones will almost certainly not go on your list, because it is unlikely that the text "john Jones" will be found on the page.
  5. 1923 in aviation has a space in the title; wikt:1923 in aviation does not exist; and the text "1923 in aviation" exists on the page (in a category link), so it will go on your list (I strongly suspect this is not what you want, and there are going to be a lot of articles that are like this).
  6. Market correction has a space in the title; wikt:market correction does not exist; it is a redirect to Market trends which does have the text "market correction" on the page, so Market correction goes on your list.
Do you want to make any changes in your criteria before I proceed? --Russ (talk) 16:26, 11 September 2008 (UTC)[reply]
Hm, I guess also omit anything with [0-9][0-9][0-9] in the title, and anything with an open soft parenthesis (. Thanks much for working on this.—msh210 03:56, 14 September 2008 (UTC)[reply]

Template removal detection, user list maintenance (relisting)

(relisting of previous request, hoping to get discussion)

The following request is for a bot that can do a somewhat complicated task for the Guild of Copy Editors, in order to recognize those editors who have done the most to reduce the "copy edit needed" backlog (as per consensus). Basically, the purpose of the bot would be to maintain a list of users ordered by the number of times they have removed a template from an article. I have scanned through all the bots in Wikipedia:Bots/Status (that took a while) looking for a bot that could do this, and didn't see any, but please let me know if one already exists. In reading through those, it seems theoretically possible, though I don't know if anyone is willing to spend the time to make it. Anyway, this is how I've planned out how the bot could function, but feel free to go some other way if you prefer:

1. Detect when a {{copyedit}} or {{grammar}} template (and any possible variants, like with the "date" attribute) has been removed from an article (perhaps by using the same method that anti-vandalism bots use to detect vandalism).

2. If the editor who removed the template is on the blacklist (here, feel free to mess with that page however you want), stop. Otherwise, continue on...

3. Look at the list at Wikipedia:WikiProject_Guild_of_Copy_Editors/Left_panel#Most_prolific_copy_editors (which doesn't yet exist, see Wikipedia_talk:WikiProject_Guild_of_Copy_Editors#Most_prolific_copy_editors for an example of how this could be laid out). If the editor is in the list, do 4a-4c, otherwise skip to #5:

4a. Increment the count beside the editor's name by one.

4b. Add a link to the diff from #1 to the end of that editor's line.

4c. Move the editor's entire line above all the other editors with the same count or less (in other words, the list is sorted descendingly first by count, then descendingly by the most recent). Done.

5. Add a new line for that editor above all the other editors with the same count or less, in the following format (or something similar):

* '''1''' [[User:Example|Example]] [diff]

That's it. Would anyone be willing/able to do this? Any comments are welcome. -kotra (talk) 17:21, 11 September 2008 (UTC)[reply]

WikiProject Merseyside article tagging?

Hello team,

Would it be possible for someone to run a bot through the following categories and tag them with the Wikipedia:WikiProject Merseyside talk page banner (Template:WikiProject Merseyside)?

Thank you, hope you can help. :) --Jza84 |  Talk  21:21, 11 September 2008 (UTC)[reply]

Possible Possible : TinucherianBot can do this for you. But running recursively to all sub categories is very dangaerous. I will provide you with the entire list of subcategories , which you and other project members can have a serious look and remove all possible false positive categories. Once this is done, the bot will work on it soon. -- Tinu Cherian - 02:47, 12 September 2008 (UTC)[reply]
Thank you. Will you let me know here, at my talk or at the project page? If there's a choice, the project page would be preferable. Thanks again, --Jza84 |  Talk  12:23, 12 September 2008 (UTC)[reply]
Apologies for the delay in the response. I was away over the weekend. btw you can see the entire subcats under Category:Merseyside at Wikipedia:WikiProject Merseyside/Cats. Please double check each category and remove all the possibly wrong categories from the page. I will also leave a note on the project page. Once it is throughly checked and all the false positive categories are hopefully removed, kindly let us know. -- Tinu Cherian - 05:47, 15 September 2008 (UTC)[reply]
In case you didn't see Jza84's reply on the project talk page, we think we're ready to go. Nev1 (talk) 21:26, 15 September 2008 (UTC)[reply]
Doing... : Bot working on the tagging. Thanks -- Tinu Cherian - 05:52, 16 September 2008 (UTC)[reply]

Inactive users

Would it be plausible for a bot to subst some sort of template onto user/talk pages of inactive users? Or would that be a waste of time? Seegoon (talk) 09:01, 12 September 2008 (UTC)[reply]

Unnecessary. There are so many inactive accounts that would be a waste of resources, and wouldn't have very much purpose. Another factor is how can a bot decide who is inactive? Best left up to human judgement, looking at contributions etc. ~ Ameliorate! U T C @ 11:27, 12 September 2008 (UTC)[reply]

Replace this image

Use of the "Replace this image" images has now been deprecated. You can see a before and after example here. Can this be done by a bot? Dismas|(talk) 16:30, 13 September 2008 (UTC)[reply]

Sure can. I could do this with TINA but I will do a BRFA for it so that it's done on a bot flagged account, so not to clog recent changes. ~ Ameliorate! U T C @ 16:41, 13 September 2008 (UTC)[reply]
Thanks! And thanks for using a bot flagged account. I hate it when my watchlist is full of bot type edits made by non-bots. Dismas|(talk) 17:07, 13 September 2008 (UTC)[reply]
Actually you may want to hold off as the discussion called for the stoppage of either adding or removing them generally. There apparently is ongoing debate what to do presently with this summation coming close:
Considering the weight of arguments on both sides, the two-thirds majority in favor of removing image placeholders from article space cannot be considered a consensus in my opinion. Much of the opposition seems to be answerable by less drastic measures, namely, removing image placeholders from individual articles of private individuals where a free photo is unlikely to be found, and by improving the unappealing appearance of these images. The next steps need to include a list of proposed alternative designs, and a guideline to be added at Wikipedia:Image placeholders to define which biographies should include image placeholders.
I would get clarity that mass removing them won't cause a new round of conflict. I, for one, prefer them and have seen them work. -- Banjeboi 18:02, 13 September 2008 (UTC)[reply]
BRFA filed here. Although I am very confused as to if there is a consensus for this or not. ~ User:Ameliorate! (with the !) (talk) 16:16, 14 September 2008 (UTC)[reply]
My impression is that an effort to do a variety of things is underway although that is also unclear what and how and wheer it's being discussed. Wholesale removal, however, should likely not take place until there is agreement to do so. -- Banjeboi 17:50, 14 September 2008 (UTC)[reply]

Template:Female adult bio

The "orientation" field of the {{Female adult bio}} template has been removed from the template. Could this be removed from all the articles where the template is being used? See here for an example of a before and after. Dismas|(talk) 17:29, 13 September 2008 (UTC)[reply]

Coding... Shouldn't take long to write. Anomie 19:09, 13 September 2008 (UTC)[reply]
BRFA filed Wikipedia:Bots/Requests for approval/AnomieBOT 5. Took slightly longer than expected, mainly because of a bug in my template parsing code. Anomie 21:09, 13 September 2008 (UTC)[reply]

Index of unpatrolled, expired "New Pages"

Special:NewPages has a system where "unpatrolled pages" (see Wikipedia:New pages patrol/patrolled pages) are marked in yellow. If an editor clicks a yellow link at NewPages, there is some small text in the bottom right that says "[Mark this page as patrolled]"; if this is clicked, that article will no longer show up in yellow at NewPages.

After 30 days, all pages at NewPages disappear, regardless of whether they have been "patrolled" or not. Even with only 30 days' worth of new pages, there is always a backlog at NewPages. You can check this for yourself (Special:NewPages → "Hide patrolled edits" → "Earliest") and you will see that the earliest are almost exactly 30 days before the present (taking into account UTC). This means that articles are being created faster than they are being monitored at NewPages. In other words, many unpatrolled pages are slipping through because of the 30 day expiration at NewPages.

I would propose a bot that somehow identifies all of the unpatrolled pages that slipped through the NewPages system. I think the "new page patrollers" would appreciate such a facility. — Twas Now ( talkcontribse-mail ) 00:41, 14 September 2008 (UTC)[reply]

I would think a tool on the Toolserver would be better. LegoKontribsTalkM 17:01, 14 September 2008 (UTC)[reply]
The recentchanges table, which is replicated on the toolserver, is used to mark edits as patrolled and that table only contains edits made in the last month. So you'd have to do it some other way. This means having your own table with unpatrolled edits, but you can't mark those edits any more, because Wikipedia doesn't know about them in its recentchanges table. You'd then have to mark the pages in your tool and thus be able to identify users. What I'm trying to say is that this request is quite complicated even with toolserver access. Therefore, I'd think it's best to do this after discussion with new page patrollers about their wishes and not just code something. Even though it could indeed be useful. --Erwin(85) 20:27, 14 September 2008 (UTC)[reply]
It might be an idea to just use the Dutch method, see :nl:Wikipedia:Controlelijst vandalismebestrijding. They simply list each day and you mark an entire part of the day on that page, so you check each new page created in a certain time frame. The advantage is that you can still check new pages created more than a month ago. tools:~erwin85/newpages.php might also be of help to get a list of new pages in a given time frame. --Erwin(85) 20:32, 14 September 2008 (UTC)[reply]

Convert data from old cricketer infoboxes to new one

Need an automated script/bot that can convert the data present in the cricketer infoboxes (Category:Deprecated cricket templates) (for example, the one present on Allan Donald) to the newer Template:Infobox cricketer biography template. If this is possible, please drop in a line at Wikipedia talk:WikiProject Cricket. Thanks! =Nichalp «Talk»= 16:25, 14 September 2008 (UTC)[reply]

Why not just redirect them? LegoKontribsTalkM 18:25, 14 September 2008 (UTC)[reply]

Article page/talk page redirect mismatches

This has probably been asked before, and if so, my apologies. I occasionally will come across an article talk page which is a redirect but the associated talk page isn't. At some time in the past, the article has been moved but the talk page left unchanged. There must be many of these and I presume they'd be fairly easy to identify. I'd nearly have thought they'd be a list within Special:SpecialPages. But if not, could a Bot generate such a list? Moondyne 07:19, 15 September 2008 (UTC)[reply]

This seems possible for a special page (but I don't know what it'd be called). Xclamation point 04:16, 16 September 2008 (UTC)[reply]
I have generated a (very large) list at User:X!/splitredirects. Xclamation point 04:26, 16 September 2008 (UTC)[reply]