Wikipedia:Bots/Requests for approval/VoxelBot 2: Difference between revisions
→Discussion: cmt |
→Discussion: trial would be nice |
||
Line 55: | Line 55: | ||
***{{tlx|BAGAssistanceNeeded}} Any comments? [[User:Vacation9|<span style="color:green">Vaca</span>]][[User talk:Vacation9|<span style="color:teal">tion</span>]][[Special:Contributions/Vacation9|<span style="color:orange">9</span>]] 03:10, 7 February 2013 (UTC) |
***{{tlx|BAGAssistanceNeeded}} Any comments? [[User:Vacation9|<span style="color:green">Vaca</span>]][[User talk:Vacation9|<span style="color:teal">tion</span>]][[Special:Contributions/Vacation9|<span style="color:orange">9</span>]] 03:10, 7 February 2013 (UTC) |
||
****Sorry, I've had a cold and been otherwise busy in real life. Are you ready for a trial? '''[[User:MBisanz|<span style='color: #FFFF00;background-color: #0000FF;'>MBisanz</span>]]''' <sup>[[User talk:MBisanz|<span style='color: #FFA500;'>talk</span>]]</sup> 03:09, 9 February 2013 (UTC) |
****Sorry, I've had a cold and been otherwise busy in real life. Are you ready for a trial? '''[[User:MBisanz|<span style='color: #FFFF00;background-color: #0000FF;'>MBisanz</span>]]''' <sup>[[User talk:MBisanz|<span style='color: #FFA500;'>talk</span>]]</sup> 03:09, 9 February 2013 (UTC) |
||
*****It might be a couple days before the trial starts, but if we're approved for trial that would be great. I feel for you; managing both BAG and CHU? :O [[User:Vacation9 Public|<span style="color:#008B8B">Vacation</span>]]<sup>[[User talk:Vacation9 Public|<span style="color:#FF8C00">nine</span>]]</sup> <span style="color:#008B8B">Public</span> 13:40, 9 February 2013 (UTC) |
Revision as of 13:40, 9 February 2013
VoxelBot 2
Operator: Vacation9 (talk · contribs · SUL · edit count · logs · page moves · block log · rights log · ANI search)
Time filed: 02:56, Thursday January 31, 2013 (UTC)
Automatic, Supervised, or Manual: Supervised
Programming language(s): AutoWikiBrowser, Python
Source code available: AWB, Standard pywikipedia
Function overview: Replace substitutes for Romanian letters used before Unicode 3 was released (Ş, ş, Ţ, and ţ) with their proper letters in the Romanian alphabet: (Ș, ș, Ț, and ț). It will replace everywhere except for image links and interwiki/external links.
Links to relevant discussions (where appropriate): Wikipedia:Bot requests/Archive 52#Romanian_orthography
Edit period(s): One time run
Estimated number of pages affected: Hundreds of thousands. Working off the Geography of Romania Category
Exclusion compliant (Yes/No): Yes
Already has a bot flag (Yes/No): Yes
Function details: From a database scan of pages with the characters in their titles list of pages in categories related to Romanian Geography, move them to their correct name if they contain incorrect characters. Then, replace the characters defined above with their correct letters in the Romanian alphabet and fix the double redirects created.. The input pages will only be strictly Romanian, since the current letters are correct in non-Romanian languages. The input pages won't just be taken from the base category (Geography of Romania) but will be sub-categories recursed by AWB that are manually checked.
Discussion
- Will it do any other kinds of gen-fixes when making these edits? Will most users be able to see the difference in the letters or will it be purely an underlying language code change? MBisanz talk 03:23, 31 January 2013 (UTC)
- In some manual test edits ([1] for example) you can see that it does make a visible change. It is set up to also do genfixes (of course only if it matches the regex as well), but this can be disabled as you know. It's already editing so genfixes (which many of these pages need) are quite good in this case. Vacation9 03:42, 31 January 2013 (UTC)
I believe this must be supervised, not automatic. --MZMcBride (talk) 03:52, 31 January 2013 (UTC)
- What exactly do you mean by supervised? If you mean checking the edits as they are made, this is completely possible. Vacation9 04:08, 31 January 2013 (UTC)
- Yes, I mean checking the edits. "Supervised" instead of "automatic" (which is the equivalent to "unsupervised"). I think human review will be needed for each of these edits, as find and replace is notoriously tricky on any large body of work. The number of edge cases is simply overwhelming. --MZMcBride (talk) 18:54, 31 January 2013 (UTC)
- That's completely possible, and I've noted the bot as Supervised instead of Automatic. Vacation9 22:28, 31 January 2013 (UTC)
- Yes, I mean checking the edits. "Supervised" instead of "automatic" (which is the equivalent to "unsupervised"). I think human review will be needed for each of these edits, as find and replace is notoriously tricky on any large body of work. The number of edge cases is simply overwhelming. --MZMcBride (talk) 18:54, 31 January 2013 (UTC)
- Another problem has arisen: we have to move pages with incorrect characters in them as well. Thus, I came up with a three-step plan. First, a database scan for articles with incorrect characters in them, which outputs to a file. Then, (after review of the articles) using Pywikipediabot or a similar framework I mass move the pages. Then, when doing the AWB scan, we can both correct the redlinks and correct everything else in the page. This will need some interesting code but I think it would be worth it. Thoughts? Vacation9 13:34, 31 January 2013 (UTC)
- We can actually just use the API to find the list of the pages in the Category, then use our code to see if the title needs to be moved, and if it does then move it. Vacation9 13:45, 31 January 2013 (UTC)
{{BAGAssistanceNeeded}}
Any comments? Vacation9 03:10, 7 February 2013 (UTC)- Sorry, I've had a cold and been otherwise busy in real life. Are you ready for a trial? MBisanz talk 03:09, 9 February 2013 (UTC)
- It might be a couple days before the trial starts, but if we're approved for trial that would be great. I feel for you; managing both BAG and CHU? :O Vacationnine Public 13:40, 9 February 2013 (UTC)
- Sorry, I've had a cold and been otherwise busy in real life. Are you ready for a trial? MBisanz talk 03:09, 9 February 2013 (UTC)
- We can actually just use the API to find the list of the pages in the Category, then use our code to see if the title needs to be moved, and if it does then move it. Vacation9 13:45, 31 January 2013 (UTC)