Wikipedia:Wikipedia Signpost/2013-06-19/Arbitration report
The Farmbrough amendment request—automation and arbitration enforcement
Richard Farmbrough
- Editor's note: the "Arbitration report" invited Richard Farmbrough to comment on his recent request to the arbitration committee. In an effort to represent all sides of the issue, we also asked arbitrators T. Canens and Carcharoth if they would take the time to answer some questions about the case, since they both commented on the initial request. Carcharoth declined, but T. Canens agreed to talk to us from his own perspective.
Richard Farmbrough was set to have his day in court, but as events transpired, this was not to be so. On 25 March 2013, an accusation was made against Farmbrough at Arbitration Enforcement (AE), claiming that he violated the terms of an automated edit restriction. Within hours, Farmbrough had filed his own request with the arbitration committee, citing the newly filed AE request and claiming that the motion was being used "in an absurd way" in the filing of enforcement requests: "I have not made any edits that a sane person would consider automation."
The AE arm of the arbitration committee blocked Farmbrough for one year, after receiving a go-ahead from arbitrator T Canens and without waiting for input from either Farmbrough or the community. The committee, noting that Farmbrough was blocked, then declined to consider Farmbrough's request.
Meet Richard Farmbrough
Richard Farmbrough is something of an icon in the Wikipedia saga. In 2007, Smith Magazine interviewed him as one of the most prolific editors on Wikipedia. In 2011, he was cited by R. Stuart Geiger in "The Lives of Bots" as the creator of the {{nobots}} opt-out template and an advocate of the "bots are better behaved than people" philosophy of bot development. Farmbrough is also credited with coining the word "botophobia", to make the point that bot policy needs to be as responsive to public perceptions as to technical considerations. Farmbrough described himself to the Signpost as "a reader and sometime editor and administrator of the English Wikipedia ... [I've] contributed to and started many articles, worked on policy, edited templates, created and organised categories, participated in discussions, helped new users, run database extraction, created file lists and reports for Wikipedians, done anti-vandal work, and was a host at Tea-house. I also wrote and ran bots."
Genesis
All of the bots' tasks were approved by BAG, the Bot Approvals Group, "although in the less restrictive environment of 2007 a more liberal approach was taken to 'obviously' good extensions of existing tasks than was later the case." Before being submitted to BAG's testing regime, bot tasks underwent a significant amount of manual testing. In one typical case, Farmbrough manually checked and saved more than 3000 edits over the course of six or seven weeks.
None of Farmbrough's bots are currently running. Some of the code and data from his bots is used in other bots, such as AnomieBot and AWB-based bots. AnomieBot has taken over some of Helpful Pixie Bot's dating tasks, but the other general fixes are not being performed.
Dwarves vs gnomes?
So what went wrong? "In September 2010 I made some changes to the general clean-up, there was some opposition and I agreed to revert the changes ... However, an avalanche had been unleashed, and the matter was escalated to ANI. Subsequently I removed all custom general fixes, and rewrote the entire bot in perl, since AWB at that time could not meet the exacting standards that were being demanded. ... One would think that having agreed to do everything asked, and even gone beyond it, the matter would have rested there; but a series of ANI and ARB filings ensued, some rejected out of hand, others gaining traction until by mid-2012 it had become impossible to edit."
As one observer put it, "What we are seeing here is 'The War of the Dwarves and the Gnomes'. Dwarves are editors who work mainly on content, and typically put a lot of thought into each edit; gnomes are editors who work mainly on form, and tend to make large numbers of edits doing things like changing a - to a –. Richard is a Supergnome, and the comparatively small fraction of errors generated by his huge volume of automated edits ended up costing the dwarves who maintain articles an enormous amount of time. Eventually, after repeated failed attempts to rein him in, the outraged dwarves banded together to ban him."
An automation restriction
The outcome of the 2012 Rich Farmbrough arbitration case, along with its subsequent motions, was not at all in his favor. It contained the wording of the automation restriction that has become so controversial: "Rich Farmbrough is indefinitely prohibited from using any automation whatsoever on Wikipedia. For the purposes of this remedy, any edits that reasonably appear to be automated shall be assumed to be so." A later "amendment by motion" stated "Rich Farmbrough is directed ... to make only completely manual edits (i.e. by selecting the [EDIT] button and typing changes into the editing window)".
Is typing four tildes "automation"?
The Arbitration Enforcement administrator, however, stated that "it appears very improbable that this sort of repetitive change was made without some sort of automation, if only the copy/paste or search/replace functions (which are forbidden under the terms of the decision, which prohibits 'any automation whatsoever')", and defined "find and replace" as automation because "it produces the effect of many keystrokes with one or few keystrokes". If "search and replace" is automation, replied the commenters, then so is "copy and paste" or signing posts with four tildes. Farmbrough pointed out that caps-lock also fits the definition of producing the effect of many keystrokes with one keystroke.
Defining automation
What interpretation of "automated edits" is reasonable? We asked Farmbrough if some automated edits are potentially damaging and others not:
“ | In order to establish a useful definition of "automated" we should establish why we want it. The putative reason is that automation "gone wrong" allows creation of many many errors that us poor humans cannot deal with. This is false, though, as I have inspected about 176,000 edits of HPB looking for a particular error; it took maybe a few hours—try inspecting that many human edits. Moreover, automated edits can be rolled back and reapplied in an emergency. Nonetheless, granting that people have reservations, [the definition of automation] should clearly be looking edits that are repetitive, high speed and affect many pages. | ” |
Chilling effect on bot operators?
It has been suggested that this will have a chilling effect on other bot operators, that they will be afraid of making mistakes and getting banned. Says one talk page commenter, "A lot of bot ops and potential botops think twice before starting a bot. I have talked with several editors who want too but are afraid if they make mistakes that the zero defect mentality will get them banned."
Arbitrator T. Canens responded:
“ | Obviously bots cannot run if the botop is blocked/banned. However, nothing prevents other botops from taking over Rich's bots, provided that they comply with all relevant policies and guidelines, including promptly addressing any concerns about the bot raised by other members of the community. Bot defects are unavoidable (though I'm not sure if there's any statistics documenting exactly how frequent it is). The point is that botops need to be responsive to community concerns and promptly fix any reported defects. We have many bot operators, but to the best of my knowledge only a very small number were ever blocked/banned over bot-related issues (Rich and Betacommand are the only two that come to mind). | ” |
Does it matter if edits are beneficial?
We did not think to ask whether sub-optimal edits are beneficial, as long as they move the project forward, but both Farmbrough and T. Canens identified this as an issue.
Said T. Canens, "It is very clear to me that the committee in both the initial sanction and the subsequent motion intended to ban all forms of automated editing whatsoever from Rich, regardless of whether any particular automated edit is beneficial. In general, this happens when the Committee determines that 1) the disruption caused by the totality of the automated editing outweighs the benefits of said editing and 2) there is no less restrictive sanction that is both workable and capable of preventing further disruption. In this case, for instance, given the high volume of Rich's automated edits, a remedy that only prohibits him from making problematic edits would be impractical."
Farmbrough stated, "What we should be concerned about is the encyclopedic project, is something someone is doing damaging or benefiting the project? If it is damging we should look at steps to address that, if it is benefiting we should look at ways to improve it further."
Procedural issues about arbitration and enforcement
The Arbitration Enforcement request against Farmbrough was initiated at 10:29, 25 March 2013, and closed less than 13 hours later, at 23:04, with only the accuser and the AE administrator participating. After a request to leave the case open a little bit longer for discussion was declined, discussion continued on Sandstein's and Rich Farmbrough's talk pages.
Farmbrough's block at AE
T. Canens' statement at Farmbrough's Arbcom request that "I think the AE request can proceed as usual", and Richard's subsequent block, received comments at various talk pages ranging from "[it is] somewhat strange that T. Canens should encourage blocking of an editor who has made an appeal to ArbCom" to "the comments from arbitrators seem to say 'block him, we're not going to change the sanction' (T. Canens) and 'we're not going to change the sanction because he's blocked' (Carcharoth and Risker)."
"I was amazed that one arb suggesting Sandstein go ahead was considered authority to do so," Farmbrough told the Signpost. "Even more at the circular argument 'Rich is blocked so the request to remove the provision he was blocked under is moot'".
We asked arbitrator T. Canens why he had Farmbrough blocked while his Arbcom request was still open.
“ | The filing of an amendment request to lift a sanction is by itself insufficient to delay an AE request seeking the enforcement of said sanction; were it otherwise, people could file tons of meritless amendment requests in the hopes that they'll delay the AE request long enough to get it closed as stale. There needs to be at least a reasonable probability that a majority of arbitrators would in fact grant the appeal to justify delaying action on the AE thread, but Rich's appeal is very unlikely to be granted, as the committee views with disfavor 1) multiple appeals in a short time period, and 2) appeals to lift a sanction that has been recently violated, as Rich's appeal fits both. Note that any AE block would not prevent Rich from emailing the committee with any additional comments on his request. | ” |
Autonomy of Arbitration Enforcement administrators
There was also some disagreement over the intentions of the arbitration committee with regard to automation and role of AE.
According to one interpretation of the Farmbrough arbitration case, "it isn't the automated editing itself that is harmful/disruptive, and if there is no harm being done here then the 1 year block does not prevent any problems. So in that sense it is neither punitive nor preventative!" and "the Enforcement By block section says 'may be blocked...' which I can't read any other way than to imply that some discretion is given to administrators to not block or to block for a shorter period when, for example, the infraction was so exceedingly minor or when there is no or very little disruption."
According to another view, "the underlying decision of the Arbitration Committee to consider all automated editing of whatever nature by Rich Farmbrough to be harmful, and to ban all such editing. ... Because Arbitration Committee decisions are binding, AE admins in particular have no authority to question the Committee's decisions; they must limit themselves to executing the decisions."
We asked T. Canens if, under these circumstances, "the arbitration committee needs to clarify their intentions about automation and mass editing". Canen replied:
“ | Admins are volunteers and are free to refrain from taking action on any AE request if they do not want to, though they are not free to overturn another admin's AE action except under certain limited circumstances. For example, AE has, on occasion, declined to block for isolated 1RR violations when the edit being reverted was unquestionably problematic, even if it doesn't fall in one of the xRR exemptions. If Rich's edit were indeed completely correct, this fact might justify letting this particular violation go. On the other hand, the wilful nature of Rich's violation and his history of violations of the restriction would counsel against overlooking the violation. It is up to the AE admins to balance these competing considerations; arbcom generally does not interfere with AE in the absence of clear error or new developments. | ” |
Is there a way forward?
"I just want to get back to editing" says Farmbrough. "Wikipedians do not edit for thanks and barnstars, though they are both nice to receive. It is however a big disincentive to edit, and part of the hostile environment, when there's a constant (and I do mean constant) threat hanging over every editor's head that they're going to have to spend days and weeks fighting off ANI threads and Arbcom cases every time they do something that someone doesn't like."
Given the absence of any other formal mechanism for dealing with automation disputes, that may be exactly what will happen once the block is over.
Discuss this story
Code first. It seems clear to me that even the first version of AnomieBOT's TagDater.pm is in a distinctly AnomieBOT style, but direct code comparison is of course impossible since Rich hadn't and hasn't to my knowledge released his code publicly.
As for data, AnomieBOT's tag dating has two major sources: WP:AWB/DT for the list of templates, and the category hierarchy under Category:Wikipedia maintenance categories sorted by month to find pages with templates needing dating. Regarding the former, I note that, despite running his tag-dating bot, Rich appears to have made only two trivial edits[1][2] to WP:AWB/DT. Nor did he create Category:Wikipedia maintenance categories sorted by month, although he does seem to have played a part in some of the complex templates currently used in that hierarchy. I don't see that contributing to the templates used in an existing category hierarchy rises to the level implied here.
As for the many lesser inaccuracies and omissions in this piece, I think I'll refrain from specific comment. There are enough people who idolize Rich for various reasons (the saddest being, IMO, those who hold him as some sort of mascot for people "oppressed" by "Wikipedia's [ArbCom/admin] oligarchy") that I don't want to get into a drawn-out argument with. Anomie⚔ 02:29, 21 June 2013 (UTC)[reply]
Since there are some remarks here about neutrality, let me make it clear that none of the opinions expressed in the piece are my own. I will try to find time to say more about this later, but for now let me just say that if someone has a question about who or what is being cited, I would be happy to provide more detailed diffs. Neotarf (talk) 09:26, 21 June 2013 (UTC)[reply]
This is probably the worst-ever article I've read in the Signpost. If it was an essay by Rich about why he reckons he was unfairly treated that would be one thing, but it's basically an opinion article which cherry picks Rich's interpretation of events while leaving everything else out. Where's the coverage of his multiple attempts to push against his restrictions? (which is what contributed to the block). Some of the prose would be deleted from a stub article - for instance "It has been suggested that this will have a chilling effect on other bot operators, that they will be afraid of making mistakes and getting banned. Says one talk page commenter, "A lot of bot ops and potential botops think twice before starting a bot. I have talked with several editors who want too but are afraid if they make mistakes that the zero defect mentality will get them banned." - who has suggested this, and who is the mysterious "one talk page commenter"? Given the number of chances Rich was given there doesn't seem to be any imminent danger of them being "banned"(!) - are bot owners really so silly that this represents their consensus opinion? Nick-D (talk) 10:00, 21 June 2013 (UTC)[reply]
- I will keep my comment brief but as I have stated in other venues in the past, the blocking of Rich and his bots over minor edits was a decision of monumentally poor judgement. Even more so given his eventual banning from the project was due to the extrmely poor wording of his sanction. Sepcifically the decidedly poor use and interpretation of the terms "broadly construed" which allow an administrator unlimited discretion. An editor being banned over little more than a difference of opinion about the types of edits that should be done is nothing short of just plain dumb. Every month Rich and his bots aren't editing is directly equivelant to tens of thousands of useful edits not being done, setting the project back in an unmeasurable amount. Kumioko (talk) 13:55, 24 June 2013 (UTC)[reply]
- "Broadly construed" points both ways--it's easy to see why people who like to Wikilawyer hate it, because it's essentially IAR applied to sanctions: in any sanction that includes "broadly construed" it is very explicitly stating that the spirit counts, not just the letter. ArbCom tends to use a lot of "broadly construed", because people who are able to find consensus at lower levels of dispute resolution don't make it to ArbCom cases. Jclemens (talk) 02:44, 28 June 2013 (UTC)[reply]
- But your assumptions are that the wording assumes that the editors is guilty and that every one of the 1400+ admins who could block that user, would do so fairly or has the same understanding or interpretation of the rules. Unfortunately that is often not the case. It doesn't have to do with wikilawyering and fankly I only see that as an excuse by those who support a system of making it easy for admins to eliminate editors without due process. Broadly construed measn that any interpretation of the judgement is justified. It has nothing to do with wikilawyering and everything to do with fairness and common sense. Kumioko (talk) 17:05, 2 July 2013 (UTC)[reply]
Postscript. Several comments above have raised questions about neutrality in terms of the format of this arbitration report. As I said there, and will say again, comment was invited from both Mr. Farmbrough and the arbitrators. I also promised, somewhere upthread, to try to find time to comment about this at greater length, so I will say something more now about the writing of this report, and the departure from the usual arbitration report format.When we first started discussing the possibility of doing an interview for the arbitration report, I knew immediately that I wanted to interview Farmbrough. When a disruptive user is indeffed, there is often a sigh of relief that goes up from the community. But with Farmbrough, that wasn't what happened. Instead, there was a whole dialogue that started up about process--the process surrounding arbitration enforcement and bot development, and people stated bringing up points about the structure of WP. In fact, the working title of this report was "The Farmbrough amendment request - a closer look at automation and arbitration enforcement".
While I was putting the report together, I was also trying to decide whether Farmbrough was a hero or a villain. But you would probably have to edit in the same area with someone to really answer that question, and as far as I know, I have never had any interaction with Farmbrough on WP at all. And I suspect the answer to that question is not that simple. I also didn't look at any of the diffs for any of his cases until after I had submitted the report, so whatever I wrote came directly from discussion about the current case, and not from trying to judge for myself what had happened.
My primary interest in writing this was to find out something about how WP breathes and grows, and sometimes hiccups, in the context of real users, not in the abstract. But if the comments here are any indication, most readers are not interested in the meta-type issues I tried to bring out in the report. They are interested in Farmbrough himself; and they are not finished with talking about what happened.
Neotarf (talk) 11:59, 2 July 2013 (UTC)[reply]