Wikipedia:Arbitration Committee/Requests for comment/Article creation at scale: Difference between revisions
Line 1,115: | Line 1,115: | ||
=====Comments from ActivelyDisinterested ===== |
=====Comments from ActivelyDisinterested ===== |
||
I'd agree the BOT board isn't the right place for discussion, all mass creations should be discussed at the new board (as I see it that would be the primary concern of the new board). -- LCU '''[[User:ActivelyDisinterested|ActivelyDisinterested]]''' <small>''∆[[User talk:ActivelyDisinterested|transmissions]]∆'' °[[Special:Contributions/ActivelyDisinterested|co-ords]]°</small> 15:29, 12 October 2022 (UTC) |
I'd agree the BOT board isn't the right place for discussion, all mass creations should be discussed at the new board (as I see it that would be the primary concern of the new board). -- LCU '''[[User:ActivelyDisinterested|ActivelyDisinterested]]''' <small>''∆[[User talk:ActivelyDisinterested|transmissions]]∆'' °[[Special:Contributions/ActivelyDisinterested|co-ords]]°</small> 15:29, 12 October 2022 (UTC) |
||
=====Comments from Lurking Shadow===== |
|||
The bot noticeboard is the right place for these disputes if the other noticeboard doesn't form. |
|||
If you are creating articles on scale then there is some routine in the process. Bot-like editing can be addressed in a similar fashion to bot editing, only easier because you can ask the "bot" what it does directly. |
|||
=====Comments from Editor X ===== |
=====Comments from Editor X ===== |
Revision as of 16:17, 12 October 2022
Status as of 14:50 (UTC), Monday, 18 November 2024 (
)
- In August 2022 ArbCom requested a Request for comment regarding article deletions at scale.
- Initial community input indicated an RfC regarding article creations at scale would be a helpful precursor, and ArbCom confirmed this was within scope.
- Discussion of structure and scope of the workshopping phase to develop this first RfC happened here.
- The workshopping began August 31 and ended September 10 at Wikipedia talk:Arbitration Committee/Requests for comment/Article creation at scale.
- RfC on article creation at scale began October 3, 2022 and will run for at least 30 days, with closing at closers' discretion. It was announced at the articles for deletion talk page, the Arbitration Noticeboard, the administrators' noticeboard, the Bot policy talk page, Village pump (policy), Wikipedia talk:Notability and Centralized discussion.
- The workshopping for the RfC on article deletions at scale will be held here; timeline TBA.
Introduction
This is the first of two RfCs about article creation and deletion at scale. Per the rules below, please feel free to add to questions/proposed changes for the first seven days; other suggestions, comments, questions or replies should be made within your own section.
This RfC has been announced at the articles for deletion talk page, the Arbitration Noticeboard, the administrators' noticeboard, the Bot policy talk page, Village pump (policy), Wikipedia talk:Notability and Centralized discussion.
Background
Page-related actions done at scale can overwhelm the community's ability to adequately monitor and participate effectively. The issue is exacerbated in the case of article creation at scale because it escapes the normal notification system.
In the past, Wikipedia did not discourage article creation at scale under the assumption this was the best way to achieve broad coverage of vast subjects such as sports, plant and animal life, geography. There exists a policy that automated or semi-automated creation requires a bot request for approval. More recently, concerns have been raised in multiple venues that the continuing creation of such articles (or article creation at scale performed manually) has overwhelmed editors’ ability to track and assess these articles, and that the churn has become a waste of time and a cause of disruption. In a 2022 August decision, the Arbitration Committee (ArbCom) requested an RfC addressing "how to handle mass nominations at Articles for Deletion" (termed "AfD at scale").
A strong argument was made that the article creation at scale (sometimes known as mass, rapid, or large-scale creation) is one of the causes of dysfunction at AfD with regard to article deletions at scale, and that addressing this issue is a necessary precursor to the ArbCom-ordered RfC addressing AfD at scale.
For a list of proposed solutions other than those initially presented here, please see Archive 2 of WT:ACAS.
Statistics for mass creation
- Editors who have created more than seven articles in the past week, including lists and disambiguation pages
- Editors who have created more than seven articles in the past week, excluding lists and disambiguation pages
- Editors who have created more than ten articles in June
- Editors who have created more than ten articles in July
- Editors who have created more than ten articles in August
- Editors who have created more than 100 articles in the past year
- Editors who have created more than 100 articles in the past year, by month
- Editors who created more than than 10 articles in 2021, by month
- Editors who created more than than 10 articles in 2020, by month
- Editors who created more than than 10 articles in 2019, by month
- Editors by number of articles created in the past five years
Notes:
- None of these contain redirects that were converted into articles by the listed editor, but they do contain redirects that were converted into articles by other editors. I'm looking into fixing the latter; the former can be fixed for smaller datasets, but is too intensive for larger ones.
- External links counts can be suggestive about the quality of the article, it can also be meaningless - a low number may be because a large number of offline sources were used, while a high number may be because a template that provides links to a large number of database sources was added.
- Articles by editor by day over one year (1138 editor-days exceeded 10 articles; 163 exceeded 25)
- Articles by editor by week over one year (922 editor-weeks exceeded 20 articles, 150 exceeded 50)
- Articles by editor by month over one year (640 editor-months exceeded 40 articles, 123 exceeded 100)
- Articles by editor by year since 2020 (1156 editor-years exceeded 80 articles; 407 exceeded 200)
Note that these do attempt to exclude false positives from editors converting redirects created by the original editor, but some still exist, and this attempt does result in some false negatives. This is also the reason why a hard technical limit will be difficult; we will need some way to identify editors converting redirects into articles, and count those articles towards their count rather than towards the count of the original article creator. (Compiled by BilledMammal)
Purpose of this discussion
This RfC is to find and develop solutions to issues surrounding article creation at scale, partially in preparation for the RfC on article deletions at scale.
Rules
- All editors are required to maintain a proper level of decorum. Rudeness, hostility, casting aspersions, and battleground mentality will not be tolerated. Inappropriate conduct will result in a partial block (p-block) from this discussion.
- The sole purpose of this RfC is to determine consensus about policy going forward surrounding creation of articles at scale and to form consensus on those solutions. It is not a venue for personal opinion on past creation or creators of such articles or about previous tolerance of such creations, nor about past mass deletions, ditto. Editors posting off-topic may be p-blocked from this discussion.
- All comments must be about issues and proposed policy changes surrounding article creation at scale. Comments about any contributor are prohibited and will result in a p-block from this discussion. Any violations will be reverted, removed, or redacted.
- Please do not make changes in RfC questions that have already been posted. Anyone is permitted to post additional questions/proposals, below the existing ones. Moderators may at their discretion merge, edit, or condense questions at any point in the process. Any user may suggest such changes.
- Please make all additional proposals within seven days of the start of this discussion. Subsequent proposals may be brought up in an editor's own section for consideration and inclusion at the discretion of the moderators.
- Discussion is unthreaded. Please create your own comments section within the discussion section for each question, placing your username in the section header. Within your own section you may present your !votes, post questions to other editors, or respond to other editors; unthreaded discussions with other editors can be created on the talk page. Threaded discussion on the RfC will be moved to the talk page by moderators/clerk.
- Within a comment section each editor is limited to 300 words, including questions to and replies to other editors. (word count tool) Short quotes from other editors to provide clarity are excluded from the word count, but quoted material may be trimmed by moderators at their discretion. Moderators may at their discretion grant extensions following a request on the talk page that includes a brief explanation of why it is needed; please ping for such requests. Overlength statements will be collapsed until shortened.
- If you believe someone has violated these rules, please speak to a moderator on their user talk page. If you believe the moderators are behaving inappropriately, please speak to an arbcom member on their user talk page or by email.
- This discussion will be open for 30 days and will be closed by a panel of three editors with experience closing discussions and who will be appointed by the Arbitration Committee prior to the start of the RfC. The closing panel will summarize and evaluate what consensus, if any, exists within the community.
- Per their order and this amendment, any appeals of a moderator decision may only be made to the Arbitration Committee at Wikipedia:Arbitration/Requests/Clarification and Amendment. The community retains the ability to amend the outcomes of the RfC through a subsequent community-wide request for comment
Moderators of this discussion
The Arbitration Committee has appointed two moderators for this RfC:
Additional clerking help: MJL (talk · contribs)
Closers
The Arbitration Committee has appointed a panel of three closers for this RfC:
- KrakatoaKatie (talk · contribs)
- RoySmith (talk · contribs)
- TheSandDoctor (talk · contribs)
Proposals
Question 1: Should we develop a noticeboard where mass creations and sources used for them can be discussed?
Proposed: A noticeboard will be created to allow for obtaining consensus for, making reports of, and having other discussions of mass creations and the sources used for such creations. (Details to be developed there.)
Support (Create noticeboard)
- Thryduulf (talk) 19:04, 3 October 2022 (UTC)
- Per what I wrote in the pre-RfC stage, especially the process described here. — Rhododendrites talk \\ 19:38, 3 October 2022 (UTC)
- --Enos733 (talk) 20:13, 3 October 2022 (UTC)
- This is very needed, article creation at scale has highly disruptive potential.Lurking shadow (talk) 20:56, 3 October 2022 (UTC)
- I suspect it's the only proposal that will achieve consensus here, effectively punting all this nonsense to a new location. Nonetheless, there are situations where it will be necessary, and it ought to be a net positive. Vanamonde (Talk) 21:01, 3 October 2022 (UTC)
- Re-affirming support. Yeah, we have an overabundance of noticeboards, but there isn't a place where someone can ask "hey is mass-creating from this list a good idea" and have a meaningful discussion about it. The closest we have is perhaps WT:N, which isn't very active. Vanamonde (Talk) Vanamonde (Talk) 15:11, 6 October 2022 (UTC)
- Worth a try, though I'm uncertain of how much good it will in practice do. Seraphimblade Talk to me 21:11, 3 October 2022 (UTC)
- Open to giving this a try. HouseBlastertalk 21:18, 3 October 2022 (UTC)
- A good start NW1223<Howl at me•My hunts> 22:52, 3 October 2022 (UTC)
- Per Seraphimblade. Not sure how much it will help, but it can't hurt. ♠PMC♠ (talk) 01:58, 4 October 2022 (UTC)
- More community eyes will lead to better results. Pinguinn 🐧 04:55, 4 October 2022 (UTC)
Decent idea. Not sure why we need a mega-RfC to create a new, optional noticeboard, though? – Joe (talk) 10:20, 4 October 2022 (UTC)Moved to oppose. – Joe (talk) 10:11, 5 October 2022 (UTC)
- Echoing Seraphim. Don't have high hopes but its worth a shot. --WhoIs 127.0.0.1 ping/loopback 12:27, 4 October 2022 (UTC)
- A centralized place where consensus can be established and referred to for each mass creation sounds like the proper way to ensure that article quality is met. Each case will inevitably be unique no matter how well concerns about scale, rate or notability are codified. — HELLKNOWZ ∣ TALK 15:51, 4 October 2022 (UTC)
- * Pppery * it has begun... 18:00, 4 October 2022 (UTC)
- Sounds like a good idea. If mass creations are then questioned the noticeboard discussion can be referenced. -- LCU ActivelyDisinterested ∆transmissions∆ °co-ords° 18:25, 4 October 2022 (UTC)
- Anything to help tamp down the eternal firehose of article creation. Even if mass article creations are a small percentage of it, that would still help ease pressure on AfC/NPP. —Jéské Couriano v^_^v a little blue Bori 20:52, 4 October 2022 (UTC)
- Beyond My Ken (talk) 03:51, 5 October 2022 (UTC)
- JoelleJay (talk) 05:41, 5 October 2022 (UTC)
- It's worth a shot. –dlthewave ☎ 21:40, 5 October 2022 (UTC)
- It would be low-traffic of course, but I do think that it would be really helpful for, in a centralised location, mass article creation to be beforehand discussed. To have multiple people look over the source(s) used and the notability of the to-be-created articles would resolve most of the problems around mass creation. It could turn into some consensus-less hell what with the dichotomy on the merits of mass creation, but I reckon it's worth a try. J947 † edits 22:24, 5 October 2022 (UTC)
- Not sure how effective it would be but worth a try. Rlendog (talk) 14:59, 6 October 2022 (UTC)
- Support --Jax 0677 (talk) 23:22, 9 October 2022 (UTC)
- While I agree that current noticeboards may be able to handle this, there should at the least be a standardized method for notifying the community about mass creation requests. Reywas92Talk 21:41, 10 October 2022 (UTC)
- While the noticeboard itself may be another venue for "taking the wiki", those editors who actively avoid the place when creating articles at scale can more easily be considered not having the projects best interests at heart. Also, I disdain the suggestion that the creation of noticeboards at scale has any bearing on whether any one noticeboard is of practical use. LessHeard vanU (talk) 15:56, 11 October 2022 (UTC)
- Could be useful but may not always be needed though. Crouch, Swale (talk) 20:22, 11 October 2022 (UTC)
Oppose (Create noticeboard)
- Wikipedia already has 39 noticeboards and we should avoid adding more. A noticeboard related to something niche probably won't attract a broad audience and will be of limited usefulness. Hut 8.5 12:03, 4 October 2022 (UTC)
- Let's consider how the last time we tried adding a new noticeboard went. Article creation at scale is a problem, but do we really have distinct events often enough to justify a separate noticeboard? Why can't this be handled at AN or ANI? Trainsandotherthings (talk) 14:02, 4 October 2022 (UTC)
- We already have more noticeboards than we can keep track of. Any discussion about whether or not to embark upon a particular spree of article creation can be conducted at an existing page (e.g., Women in Red) and advertised centrally. Moreover, "article creation at scale" is sufficiently ill-defined and fuzzy around the edges that we'd only be inviting meta-arguments and wiki-lawyering about what belongs on the new noticeboard. XOR'easter (talk) 02:11, 5 October 2022 (UTC)
- The sort of topics which tend to be done at scale have quite different themes -- minor planets; species; athletes; settlements; &c. These all have existing projects which cover these specialist areas and so the particulars of the topic are best discussed there. If there are technical issues then these are addressed by existing discussion forums such as WP:BOTN. Per WP:NOTFORUM and WP:BIKESHED, we really don't need yet another talking shop. Andrew🐉(talk) 09:57, 5 October 2022 (UTC)
- Moved from support per XOR-easter and Andrew D. I do think it's a good idea to discuss mass creation projects with others before embarking on them, but they're absolutely right – it's better to have that discussion with editors who have subject-matter expertise. On reflection I can see how a mass creation noticeboard could easily be dominated by an unrepresentative subset of the community that is opposed to mass article creation in principle, much as Wikipedia:Village pump (WMF) ended up being dominated by people who have an ingrained dislike of the WMF. – Joe (talk) 10:11, 5 October 2022 (UTC)
- It's a good idea to discuss but we already have too many noticeboards. Projects or already-existing noticeboards should be preferred. RSN is a particularly good venue for discussing sources (CF the discussions we have already had there about GNIS, GeoNET Names Server, which are some of the worst sources for mass-creation). FOARP (talk) 12:24, 5 October 2022 (UTC)
- As to my knowledge there are only a few editors who masscreate not so much informative stubs. The issue was more that there has not been presented a viable solution to deal with it. To temporarily prohibit those few to release articles into mainspace and instead allow them to create drafts and submit them for review would be a better solution.Paradise Chronicle (talk) 12:41, 5 October 2022 (UTC)
- I'm not convinced the number of mass-creators is high enough to warrant a separate noticeboard and its incumbent bureaucracy. —David Eppstein (talk) 16:25, 5 October 2022 (UTC)
- I am leaning in opposition to this proposal because, while this question is in scope of this RfC, I think we're putting the cart in front of the horse. A noticeboard might make sense in the case that we have a special deletion procedure that relates to mass-created articles, or if we expect mass-creations to happen so frequently that we need a user conduct noticeboard, but I don't see evidence that ANI is unable to handle user conduct issues that pertain to mass creation. Absent a change in deletion policy or guidelines, the current proposal would essentially create a fork of ANI to deal with conduct issues that pertain to mass creation, and I'm not sure that a board that's going to only be visited by those who have an active interest in mass creation (or opposing mass creation) would be a good idea in terms of its ability to deal with conduct issues, especially since WP:CBAN generally restricts the discussion venues for community sanctions to AN and ANI, which I think is wise given that we want the general community to view proposed sanctions and discuss them before imposing sanctions in the name of the whole community. However, if there are changes in deletion policy, then a discussion to create a noticeboard might be apt at that time. — Red-tailed hawk (nest) 17:25, 5 October 2022 (UTC)
- We should be reducing the number of noticeboards, not increasing them. Article creation at scale can be either good or bad, depending on how it is done, but current procedures andnoticeboards should be able to handle this. Phil Bridger (talk) 18:43, 5 October 2022 (UTC)
- I'm worried that this would create a little-frequented corner where it would be easy for a small number of regulars to dominate, as occurs at some of the more-specialist deletion arenas. Given the breadth of opinion on this topic that seems unwise. I also agree that topic or source-specific expertise is more useful than an all-purpose board. Espresso Addict (talk) 19:30, 5 October 2022 (UTC)
- Please don't; the creation of WP:XRV comes to mind. Which was an idea I enthusiastically supported, and now not even I'm watching it regularly. There's one noticeboard I regularly visit (WP:AN) and one I occasionally stumble upon (WP:ANI), and that's about it. I do post to WP:BLPN when I feel it's needed, but I don't visit BLPN to help with other cases there. That's unfortunate and suboptimal of course, but probably not unusual. The existence of a separate noticeboard for an issue is more likely to be noticed by those who currently need it, rather than those who are currently needed by that noticeboard. ~ ToBeFree (talk) 21:31, 5 October 2022 (UTC)
- Per Hut 8.5, XOR'easter, Joe Roe and David Eppstein. We already have too many notice boards. We do not have the manpower for an additional noticeboard with such a narrow focus. James500 (talk) 00:25, 6 October 2022 (UTC)
- As Hut 8.5 noted, we already have 39 noticeboards with varying levels of traffic, and I think there's a high risk that the only regular contributors to a new noticeboard will be the same editors who have already been arguing back and forth about the mass-creation/SNG debates for the past several months. TheCatalyst31 Reaction•Creation 02:51, 6 October 2022 (UTC)
- As others have noted, we have too many noticeboards already; the last thing we need is another one. —pythoncoder (talk | contribs) 04:03, 6 October 2022 (UTC)
- Others have put this better than I could. We already have many noticeboards and the numbers of "mass creators" isn't that high to warrant a separate noticeboard. A noticeboard with less reports would be on fewer people's watchlist, and would be prone to gaming by a small subsection of the community. —CX Zoom[he/him] (let's talk • {C•X}) 08:27, 6 October 2022 (UTC)
- Per others. "Wenn Du nicht mehr weiter weißt, bilde einen Arbeitskreis" ("If you're ever stuck, just form a committee") is among the more infamous problem resolution strategies the German-speaking world has produced, and this very distinctly feels like the WP version of it. Dr. Duh 🩺 (talk) 08:50, 6 October 2022 (UTC)
- Per Espresso Addict and others above. In addition, I'm concerned about the lack of specifics here: AARV floundered partly because there was so much bickering about how to flesh out the details, and this proposal has even fewer details than that one did. Noticeboards that aren't well thought out ahead of time turn into drama sinks, and I'm afraid that's what would happen here. Extraordinary Writ (talk) 06:26, 7 October 2022 (UTC)
- Existing noticeboards should suffice. wjematherplease leave a message... 12:32, 7 October 2022 (UTC)
- There are existing places these issues can be raised - ANI for example. Those are preferable to having another obscure location that no one can find. I can see the point of this, but don't think it's, right now, necessary. Blue Square Thing (talk) 08:18, 8 October 2022 (UTC)
- Weak oppose - as above, but in particular because of the likelihood that it would be too easily dominated by a small group of editors with a particular agenda. Ingratis (talk) 09:34, 8 October 2022 (UTC)
- Unnecessary as it would rarely be of use and there are other forums available. BoJó | talk UTC 09:40, 8 October 2022 (UTC)
- I would oppose this proposal, unless such a noticeboard also encompasses mass actions in general. NotReallySoroka (talk) 05:22, 11 October 2022 (UTC)
- Noticeboards are magnets for drama and unnecessary fights. We already have bot noticeboards for the kinds of software-assisted creations that there is clear consensus to ask permission first. Otherwise, it's not necessary and will just deter good editors who end up having their every move stalked by people who have a problem with articles about a particular subject. Steven Walling • talk 21:00, 11 October 2022 (UTC)
Comments (Create noticeboard)
Comments from Thryduulf (Q1)
- I think this will provide clarity to everyone, help keep discussions focused and constructive. This VP:PRO discussion is something we should aim to improve on. Thryduulf (talk) 19:04, 3 October 2022 (UTC)
- @Hellknowz: A small number of examples work well for bots and would likely work well here also. Thryduulf (talk) 18:49, 4 October 2022 (UTC)
Comments from Rhododendrites (Q1)
If you meet the definition of "article creation at scale" (see my comments at Q3), then you must post a notice to this noticeboard with the following information:
- The approximate number of articles you will create.
- The approximate time frame for creation.
- A description of the overall topic/theme.
- Which notability criteria you will be using.
- What kind of sourcing you will use to demonstrate that each article meets the criteria (subject to the results of Q2).
Upon creation of the noticeboard, a subsequent RfC (or other discussion) will determine how long these discussions stay open, who approves them, if there's an appeals process, etc. — Rhododendrites talk \\ 19:48, 3 October 2022 (UTC)
Comments from Valereee
Trainsandotherthings, FWIW a commenter at the ArbCom case did this analysis and reported that "There does not appear to be any page of ANI archives that don't have at least one thread about AFD." 15:45, 4 October 2022 (UTC)
Comments from Hellknowz
@Rhododendrites: I feel like it would be much more useful to others if editors were to show/make an example draft/article. If they are serious about the work of creating a whole lot of articles, then surely they can just make one. Presumably, mass creation results in articles that are all basically "the same". So seeing an example would be so much more useful (and much easier to participate in a discussion about it) than a checklist of criteria. You could just look at it from the AfC / AfD perspective. — HELLKNOWZ ∣ TALK 16:22, 4 October 2022 (UTC)
Comments from Espresso Addict
I'm worried that this would create a little-frequented corner where it would be easy for a small number of regulars to dominate, as occurs at some of the more-specialist deletion arenas. Given the breadth of opinion on this topic that seems unwise. Espresso Addict (talk) 01:45, 5 October 2022 (UTC)
Question from Scolaire
How will a user who decides to mass-create articles know that such a noticeboard exists? Will a notice be sent to the talk page of every user? Will every Create page have a banner in red saying "If you intend to mass-create you must go to the noticeboard first"? Scolaire (talk) 15:00, 5 October 2022 (UTC)
Comments from Hut 8.5
Successful noticeboards usually fall into one of two groups:
- The noticeboard has a broad scope, or at least a scope which lots of people are interested in, and it attracts lots of people. These noticeboards can be used to establish community consensus for something and allow the community to scrutinise things. For example WP:ANI, WP:VPR, WP:RS/N.
- The noticeboard has a narrow scope and is of interest to a small audience of specialists. These noticeboards can be used to make sure these specialists are aware of developments in that area, or to get those specialists to fix a problem. For example WP:BOTN, WP:CP, WT:WPSPAM.
This noticeboard is neither. If the idea is for it to scrutinise mass article creation then it needs to attract lots of people, but it has a narrow scope and will probably only attract a handful of people who are very interested in the topic. It also probably won't get much use. It was mentioned above that WP:ANI has lots of threads about AfD, but most AfDs have nothing to do with mass article creation. Hut 8.5 11:58, 6 October 2022 (UTC)
Comments from Lurking Shadow
If we don't have a noticeboard for mass-creation, where should these talks take place?
WP:AFD is deletion. I have seen bulk deletion requests, they don't work.
WP:ANI is for conduct issues, not content issues. Mass-creation can be both. ANI would only address the conduct issues. It is ill-suited to solve content issues.
WP:BOTN is specialized on technical problems with bots, and not conduct issues. Not a broad audience.
WP:BRFA would be the right place for approvals, but not if there were unapproved mass-creations. Not focused on conduct either, and not a broad audience.Lurking shadow (talk) 16:22, 6 October 2022 (UTC)
Comments from ONUnicorn
- My thoughts largely echo those of Scolaire. How will users (especially new users) learn of the noticeboard? Regarding Lurking Shadow's question of "If we don't have a noticeboard for mass-creation, where should these talks take place?" Perhaps at WP:VPPRO. Users could say something along the lines of, "Hey, over the next few months I intend to create short articles on every corporation established in Delaware between the years of 1790-1850. I will use this database, that book, and those periodicals as sources." And a discussion could be had there. ~ ONUnicorn(Talk|Contribs)problem solving 19:58, 11 October 2022 (UTC)
Comments from Editor X
Please open your own section with username in the heading
Question 2: Should we require (a) source(s) that plausibly contribute(s) to WP:GNG?
Proposed: Modify the General notability guideline (GNG)/Subject-specific notability guidelines (SNG) at WP:Notability (as appropriate) to add: (Please rank your choices by listing, in order of preference from most preferred to least preferred; ranking all options you don't consider completely unsuitable will assist closers in determining consensus.)
A: All articles created under SNGs (other than those which confer notability) must be cited to at least one source which would plausibly contribute to GNG: that is, which constitutes significant coverage in an independent reliable secondary source.
A-2: At least two sources.
B: All articles (except those not required to meet GNG) must be cited to at least one source which would plausibly contribute to GNG: that is, which constitutes significant coverage in an independent reliable secondary source.
B-2: At least two sources.
C: All WP:MASSCREATEd articles (except those not required to meet GNG) must be cited to at least one source which would plausibly contribute to GNG: that is, which constitutes significant coverage in an independent reliable secondary source.
C-2: At least 2 sources.
D. No change.
Statements (Require GNG-quality source(s))
Please rank your choices by listing, in order of preference from most preferred to least preferred; ranking all options you don't consider completely unsuitable will assist closers in determining consensus. Sign as usual with 4 tildes.
- D only (I don't think change will improve things, especially given the very significant variety in SNGs) --Enos733 (talk) 20:16, 3 October 2022 (UTC)
- C2 > C > A2 > A > D > B2 > B. Much of the conflict that led to this RfC is driven fundamentally by a mismatch between criteria used for creation and for deletion. This is primarily the result of SNGs that do not independently confer notability being used to justify mass-creation using databases and lists. Neither such SNGs nor such sources are, at present, admissible as evidence for keeping at AfD, where such articles inevitably end up. Requiring the articles to include sources supporting GNG addresses this mismatch. I would prefer two sources to one, but requiring it of every single article is a bit of an over-reach. Vanamonde (Talk) 20:53, 3 October 2022 (UTC)
- B2 > B > A2 > A > C2 > C > D. Vanamonde's point about the disconnect between creation and deletion is a good one, and requiring editors to create articles that meet some (very low) quality standards will improve the quality of the encyclopedia while also naturally preventing problematic mass-creations of articles without affecting highly productive editors that also produce high quality articles. BilledMammal (talk) 00:08, 4 October 2022 (UTC)
- D (only). This proposal conflates two fundamentally different concepts: notability and verifiability. We cite sources in articles so that our readers can verify the information in them, not so that editors, and the best sources for verification are not always the sources that show notability. If there is doubt about the notability of a topic, it should be addressed with cleanup tags, talk page discussions, and/or an AfD nomination, not by shoehorning sources into articles where they're not needed and have no value to our readers. Notability is a property of topics, which does not depend on the state of sourcing of the article. I also have significant misgivings that options A and B are valid outcomes of this RfC (see below). – Joe (talk) 10:40, 4 October 2022 (UTC)
- D only. A and B are an attempt to make large changes to notability guidelines by the back door and are frankly not valid outcomes of this RfC. An RfC about large-scale article creation should restrict itself to proposals about large-scale article creation as people who have no interest in large-scale article creation probably won't participate, so support for these proposals here does not indicate there is community consensus for them. A and B will make big changes to numerous SNGs, including some which have always been understood to be independent of the GNG (such as WP:PROF and WP:GEOLAND). The GNG also does not require multiple sources. I could get behind Vanamonde's suggestion below that mass-created articles only should have evidence that the subject meets the GNG or a criterion which is independent of the GNG. Hut 8.5 12:12, 4 October 2022 (UTC) Amend per Vanamonde93's comment below: not opposed to C if it means something similar to the 4A proposal. Hut 8.5 17:57, 4 October 2022 (UTC)
- B2>C2>B>C>A(2). Two sources should be a minimum to show sufficient coverage. Although at the end of the day I'd support any of these changes. --WhoIs 127.0.0.1 ping/loopback 12:31, 4 October 2022 (UTC)
- C-# > D. For C specifically, the number of sources should be sufficient and varied to write an article. This is how all GNG works and I don't see why mass creation should lower this bar. From my experience, AfD sees 3 in-depth sources as barely enough. A and B options sound like they are outside the scope of the RfC and change how all articles are approached, so I cannot see how those are valid options here. — HELLKNOWZ ∣ TALK 16:01, 4 October 2022 (UTC)
- D > C > C2. I don't think change will improve matters, but C is acceptable. A and B are not. Thryduulf (talk) 16:08, 4 October 2022 (UTC)
- C,B,A,D (don't really care whether the requirement is 1 or 2 sources). This seems to have worked for WP:SPORTSCRIT, so it would be beneficial to expand it to here. I concur that A and B are kind of outside scope, so prefer to C to them. * Pppery * it has begun... 17:58, 4 October 2022 (UTC)
- C,B,A,D I don't see that having one reference that supports GNG is onerous, and should take the some of stress of AfD. -- LCU ActivelyDisinterested ∆transmissions∆ °co-ords° 18:29, 4 October 2022 (UTC)
- D only - the other options are tempting, but we the community should have no appetite for a repeat of the Pending Changes "trial". That that page even exists should be telling. —Jéské Couriano v^_^v a little blue Bori 20:56, 4 October 2022 (UTC)
- B2 > B > A2 > A > C2 > C > D. If an editor is creating an article that needs to meet GNG, then presumably they've already found sources (otherwise, how would they know that it meets GNG?) so simply including these sources in the article is not a big ask. This would take a lot of pressure off of the creation and deletion processes, both of which currently require other editors to search for sources to prove that it's not notable before challenging. –dlthewave ☎ 23:01, 4 October 2022 (UTC)
- D only. I'm extremely uncomfortable with this question as noted below and particularly per Hut 8.5 above. Espresso Addict (talk) 00:45, 5 October 2022 (UTC)
- D only, with a trout to whoever decided to insist upon ranking "all seven options". (Bikeshedding should not be obligatory.) This entire discussion is predicated upon the idea that the GNG is objective, unambiguous, and easily applicable, when really all it does is transfer the ambiguity to questions about what counts as "significant" coverage. Let's not make things worse than they already are. XOR'easter (talk) 02:17, 5 October 2022 (UTC)
- D only. WP:N notes quite explicitly that
It meets either the general notability guideline (GNG)... or the criteria outlined in a subject-specific notability guideline (SNG
provided the article is not barred by WP:NOT, so SNGs generally do confer notability except when they explicitly state that they don't (or, like WP:NCORP, explicitly state stricter-than-GNG source requirements per community consensus); WP:DEL-REASON#8 explicitly refers to consensus to deletearticles whose subjects fail to meet the relevant notability guideline
, rather than simply articles whose subjects fail some notability guideline. We have notability guidelines that do not so much as require significant coverage of the article subject to exist, such as WP:NGEO's guidance legally recognized populated places and WP:COMPOSER#1. Requiring GNG-quality sources is not warranted based on our current notability guidelines; if there exist guidelines that don't require SIGCOV, then requiring evidence of SIGCOV is not a good measure of whether mass creation is appropriate or not. Rather than looking for "GNG-quality sources", it would probably be better to frame this as something along the lines of "contains sourcing in the article that shows that the article subject satisfies at least one notability criterion", but this bumps into the wisdom of WP:NEXIST. An alternative might be to create a speedy deletion tag for all mass-created articles that do not contain a sourced claim of significance, which seems like a natural extension of WP:A7 to handle this sort of situation. But I don't think that GNG is the right framing here when the vast majority of controversial mass creations are under SNGs and this proposal's options basically excludes anything that could possibly claim SNG notability. — Red-tailed hawk (nest) 03:56, 5 October 2022 (UTC) - B2>B>A2>A>C2>C>D. This would only apply to subjects that already must have two+ GNG sources in existence; requiring those sources to actually be cited in the article from the start will, by definition, not affect whether the subject merits an article. So this would have zero change to notability guidelines aside from any special informal temporal leeway certain topics may currently receive when having to demonstrate notability. JoelleJay (talk) 05:53, 5 October 2022 (UTC)
- C 1 or 2. Vanamonde makes a good point between the mismatch in article creation expectations and article deletion expectations. If not a single reliable source can be found for mass created pages, then there are probably better ways to present the content. A and B are nice enough, but I find the arguments they are somewhat out of the scope of this RfC reasonable. CMD (talk) 08:19, 5 October 2022 (UTC)
- D only This has nothing to do with the issue of scale. WP:GNG is not a policy, explicitly provides for exceptions and so is not mandatory. Andrew🐉(talk) 10:04, 5 October 2022 (UTC)
- All articles should meet GNG>All articles should have at least one instance of SIGCOV>B2>B1>C2>C1>A2>A1. Strongest possible oppose of D. I do not see mass-creation problems in any area where GNG applies, it is entirely a creation of SNGs, particularly GEOLAND and NSPORTS (at least whe NSPORTs was being interpreted as if it were a stand-alone SNG). It is caused by just going through a database which does not provide enough coverage to actually write a meaningful article on and creating articles. I do not see what possible basis there is for GEOLAND being somehow kept out of this issue when it is far and away one of the worst areas for mass-creation problems. Requiring at least one instance of SIGCOV means that in every instance at least some kind of meaningful article can be written. FOARP (talk) 12:31, 5 October 2022 (UTC)
- B2 > B > A2 > A > C2 > C. At least two independent sources are necessary to write a neutral article; an article which doesn't have them is going to fail multiple policies in any case. And assuming they're not just making stuff up whole-cloth, the person who originally creates an article ought to have those sources on-hand in any case; it's logical to ask them to provide them at the start. While we allow unsourced text to exist until it is challenged, an entirely unsourced article is another matter entirely. And trying to apply this only to some formally-defined "article creation at scale" is doomed to fail - the simple fact is that if someone is not mass-creating articles, providing the two sources they used to write the article should be trivial for them. The argument that this would somehow change our existing notability guidelines (which already set this requirement) is nonsensical. --Aquillion (talk) 14:59, 5 October 2022 (UTC)
- C2, C. We should stick to making rules for mass-created articles. Two sources is preferable because the bar should be higher for an experienced user mass-creating new articles than for the ordinary Joe. Scolaire (talk) 15:05, 5 October 2022 (UTC)
- D > C > B > A. Per Scolaire and WP:CREEP, we should stick to making rules for mass-created articles. I don't think the wording of any of A, B, or C is clear enough to both have the desired effect of preventing mass creation from lines in databases, and to avoid damaging the creation of properly-sourced stubs. And in practice, something like this is already the case, with unsourced stubs generally quickly getting draftified by the new page patrollers, so we don't need extra rules saying that something must happen without providing any mechanism for making that thing happen. —David Eppstein (talk) 16:30, 5 October 2022 (UTC)
- C The best choice on this entire page. "A" and "B" are basically the elimination of SNG.....probably a good thing in the long run but not something that you do in a mass creat RFC. North8000 (talk) 19:06, 5 October 2022 (UTC)
- B>B2>A>A2>C>C2>D, although A and B may be out of scope, as noted multiple times above and below. The burden of deleting a page (WP:AFDBEFORE) is considerably higher than the burden of creating one (WP:BLPPROD?). I can create 1000 pages about non-notable subjects citing nothing but interviews, and deleting each of them requires others to check various search engines for material I should have provided in the first place. And if one of the 1000s is actually notable, I have even achieved a goal "positive to the encyclopedia" on behalf of others' free time. AFC/draftification and WP:A7 exist, but they don't entirely solve the problem. Requiring at least one independent reliable source as proposed, for whichever cases this RfC can actually decide about, perhaps even two, sounds fine to me. ~ ToBeFree (talk) 21:58, 5 October 2022 (UTC)
- D only per Joe Roe, Hut 8.5, XOR'easter, Red-tailed hawk and David Eppstein. Proposals A and B are completely outside of the scope of this RfC. C assumes that GNG is objective and unambiguous. C is not compatible with WP:NEXIST. James500 (talk) 00:26, 6 October 2022 (UTC)
- B2 > A2 > C2 > B > A > C > D. All articles, like all mainspace content, should be supported by multiple sources. I'll take as close to that ideal as I can get. Levivich (talk) 01:18, 6 October 2022 (UTC)
- D only A and B aren't related to mass creation in the first place, what counts as "would plausibly contribute to GNG" is too open to interpretation to be useful for a deletion or draftification criterion, and with the general trend away from SNGs that language seems a little odd. (I especially oppose any requirement for two sources; there are DYK-quality articles with only one non-database source.) TheCatalyst31 Reaction•Creation 03:03, 6 October 2022 (UTC)
- D ≫ C > C2: I don't think that change will help here.
Notable topics may suffer from scarcity of sources, while non-notable ones may have enough in the form of generic listings on web.(stricken as there is a carve-out for SNGs) Not every article has to be a GA. If the proposals covered one or two subject areas, I would have considered to support. But a blanket rule is too far reaching that will hurt content creation more than improve it. Even more far reaching are the A/B proposals, which I think shouldn't be enacted at any cost. —CX Zoom[he/him] (let's talk • {C•X}) 08:53, 6 October 2022 (UTC) - D > C (subject to a contingency) >> (A or B) - C only works if we have an adequate definition of "mass creation". Some of the definitions that have been proposed, like 50 over a month or 500 over a year amount to less than 2 a day, which is really not mass creation. Maybe something like 20 a day or 100 a week would qualify, and thus be appropriate to require some additional indication of notability upon creation. A and B might make sense at some point but are really outside the scope of this RfC. C2, B2 and A2 all go far beyond what should be required at the time of creation - there is time to add additional sources if there is a reliable source in the article at creation, and in addition B2 and A2 are way beyond the scope of an RfC on mass creation. Rlendog (talk) 15:06, 6 October 2022 (UTC)
- C, the vast gulf in effort required to mass-create versus mass delete articles means that some form of quality control has to be introduced, and one GNG-approaching source is an acceptable compromise here, the massive amount of wasted editor time over mass-created sports biographies showcases this excellently. Devonian Wombat (talk) 02:16, 7 October 2022 (UTC)
- C# with option C-2 > C. THis is because our problem addressed here is for masscreated articles, and this requirement should mean that the articles should pass GNG. And thus not be a problem for being useless or deletable. Graeme Bartlett (talk) 03:41, 7 October 2022 (UTC)
- B > C > A > B2 > C2 > A2, not D. Requiring at least one instance of significant coverage should be a bare minimum for any article, but is an absolute must in order to prevent disruptive mass-creation. We are well beyond the point of doing nothing, so no change is not an option. wjematherplease leave a message... 12:30, 7 October 2022 (UTC)
- Other: Depends if the article is a BLP. I feel that we need to raise standards incrementally, and the first standard we raise should be the one for biographies of living people. I feel that from now on, nobody should start a new biography of a living person in the mainspace unless it cites at least two GNG-quality sources, but if that's difficult we could make it one GNG-quality source and one other non-database source for the time being.—S Marshall T/C 19:02, 7 October 2022 (UTC)
- D only. This really won't help. WhatamIdoing (talk) 04:20, 8 October 2022 (UTC)
- C>C2>D. The others are out of scope of this RFC. We are tolerating these quality errors from newbies because we don't want to bite them away, but mass-created articles with severe problems like that coming from likely more experienced editors are just not tolerable.Lurking shadow (talk) 06:53, 8 October 2022 (UTC)
- D mainly; C is just about OK only. Of course articles should have reasonable sources. But trying to put in place "rules" which are really quite precise will cause more harm than good and lead to gaming Blue Square Thing (talk) 08:22, 8 October 2022 (UTC)
- C2 > B > A; NOT D. The problem is mass creation so options A and B are effectively academic, though it is worth making the point that any new article must have SIGCOV from at least one reliable non-database source. BoJó | talk UTC 09:37, 8 October 2022 (UTC)
- D Only The other options are inadequate, will leave a great opening for continued creation of poorly sourced stubs in areas covered by SNGs or not subject to GNG. - Donald Albury 14:55, 8 October 2022 (UTC)
- D only. Per several others. BeanieFan11 (talk) 17:52, 10 October 2022 (UTC)
- A2 > A> B2 > B > C2 > C, not D All articles, mass-created or not, should be expected to have a modicum of significant coverage that is not merely databases. The failure to include significant coverage in new articles is what leads to issues at AFD. Failure to include significant coverage in mass-created articles increases the probability of incorrect, low-quality, and non-notable content and moreover results in intractability of these issues as the scale of the articles make it daunting to attempt to address the problem at AFD as people call for procedural keeps. Reywas92Talk 21:47, 10 October 2022 (UTC)
- B > A > D > C - Requiring all articles to cite at least one source would ease many problems around here. Honestly, my ideal wording would be B, if it would leave out the parenthetical. ~ ONUnicorn(Talk|Contribs)problem solving 20:11, 11 October 2022 (UTC)
- D only. As said several times A and B are out of scope anyway, and C presupposes that GNG is clear and unambiguous, whereas much of AfD is actually occupied by squabbles between different subjective interpretations of what is or isn't GNG-worthy. Ingratis (talk) 06:55, 12 October 2022 (UTC)
Comments (Require GNG-quality source(s))
Comments from Thryduulf (Q2)
- Combine the massively variety in style, format and purpose of SNGs with the very subjective nature of what constitutes a source that "passes" the GNG and changes would not improve matters. Better imo to discuss things individually at the board proposed in Q1. 19:26, 3 October 2022 (UTC)
Comments from Rhododendrites (Q2)
Whoa. 2B (and to a much lesser extent 2A) extends far beyond the scope of this RfC IMO, applying to all articles. This would be a radical change and should be separated if anyone wants to really propose it.
Weak support for C, but really I think the guidance should go something like this: "Mass created articles must include sufficient sourcing to show notability, and cannot be based only on simple statistical databases. While there are no firm requirements about the level of quality an article must reach when created, many in the community have a strong preference for mass created articles to be more than one- or two-sentence stubs." (This obvious extends to quality, but doesn't mandate anything about length). — Rhododendrites talk \\ 19:52, 3 October 2022 (UTC)
Comments from Vanamonde
- I believe a cleaner way to do this would be to simply prohibit mass-creation that is based on criteria that do not independently grant notability. However, this idea has not made it into the RfC. Some of the proposals above do so indirectly, at least per my view of what mass-creation is, and so have my support. Vanamonde (Talk) 20:55, 3 October 2022 (UTC)
- @Hut 8.5: If I'm understanding you correctly you're not opposed to the C options, but you don't directly refer to them in your !vote; have I misunderstood? Vanamonde (Talk) 16:47, 4 October 2022 (UTC)
- Valereee, didn't receive your ping, but that's what 4A below is trying to achieve. I don't see it reaching consensus though. I have to admit the community's opinions about notability and deletion are even more off-the-wall than I expected. Vanamonde (Talk) 15:53, 5 October 2022 (UTC)
- @FOARP: I'm not sure why you struck your reply to me, because as it happens I agree with you; there are problematic creations under GEOLAND, including mass creations. However, that's an issue with GEOLAND that needs addressing; it doesn't mean we should permit mass-creation under criteria that are even looser. Vanamonde (Talk) 17:21, 7 October 2022 (UTC)
Comments from Enos733 (Q2)
I agree with the comments of Thryduff and Rhododendrites above (and can support Rhododendrites's proposal). We do want high-quality articles, but we must balance that with the idea that this project is "freely editable" and we should be hesitant to enact procedures that enact barriers towards the sharing of knowledge (we have procedures for dealing with vandalism and deleting articles that do not fit with this project). --Enos733 (talk) 21:11, 3 October 2022 (UTC)
Comments from Joe Roe
Options A through C represent a fundamental change to our core content policies (throwing out WP:NEXIST and modifying every single SNG) and options A and B would apply this to all articles. This is an absurd overreach of the stated scope of this RfC, which was already stretching ArbCom's request for comment on article deletion, and I don't think a local consensus on them here would be enforceable. The moderators should remove or modify it so we don't waste our time. – Joe (talk) 10:49, 4 October 2022 (UTC)
Comments from Valereee
Vanamonde93, if you can figure out a way to propose something around based on criteria that do not independently grant notability that you think can work for a general !vote, please do. In distilling the workshop suggestions, I couldn't. Anything I didn't include was for reasons of not being able to figure out how to distill something that addressed major concerns, but if you have an idea definitely add it!
- Vanamonde93, well, duh, I forgot to sign w/ping lol...yeah, there's certainly a lot of pushback on any proposal to make any changes to try to solve the creation-at-scale problems that many feel are a root of the AfD-at-scale problems. From my point of view as just-a-moderator, a creations RfC that cannot find consensus for any proposed solutions is still a successful effort, as it removes concerns about lack of due diligence from the deletions RfC. But it probably won't be pretty at that RfC. Valereee (talk) 16:01, 5 October 2022 (UTC)
- Scolaire, you can add another question at will for several more days. Since it's related to Q2, I'd suggest adding it as a Q2A. Try to word it in a way that doesn't cause a problem with mutual exclusivity. Valereee (talk) 16:14, 5 October 2022 (UTC)
Comments from Espresso Addict
Echoing above comments; some of these options appear a considerable overreach of the "Article creation at scale" RfC remit. They should be removed, before this poisons the whole RfC. Espresso Addict (talk) 00:15, 5 October 2022 (UTC)
@Dlthewave: I assume because some sources do exist but (1) are not online (eg print books not on Google Books); (2) are in old, offline/paywalled newspapers/magazines/journals; and/or (3) are not in English or any language the editor reads. Eg. the article I'm currently working on, a listed building, is primarily sourced to the listing description, which references two books neither of which is previewed on Google Books, one oop, the other rather expensive; I have found another potential piece of significant coverage but it is in a 1932 journal/magazine which has no online content listing, is all paywalled and may not be online back to 1932. As it is entirely impossible for me to visit an academic library, my personal solution has been to buy the two books at significant personal expense, and I may plan to ask the resources exchange for the journal/magazine. However, I do not think it is unreasonable for someone to start the article based purely on the listing. Espresso Addict (talk) 23:06, 6 October 2022 (UTC)
@FOARP: WP:GEOLAND is a very diverse guideline that covers a wide range of articles, including heritage-listed structures and substantial geographic entities such as mountains. Personally I find its settlements advice completely unhelpful (afaik, Scotland has no legally recognised places lower than cities?) which has contributed to my stopping working in this area altogether. Perhaps it would be better to try to refine the guidance further rather than require everything falling under GEOLAND to address GNG, which would potentially bring into question a huge number of relatively harmless articles, say on heritage-listed structures, where there's a single easy source but sparse other sources that are online & non-paywalled. Espresso Addict (talk) 01:56, 8 October 2022 (UTC)
@S Marshall: Agree BLPs are more of a problem than other topics. Espresso Addict (talk) 11:05, 8 October 2022 (UTC)
@WhatamIdoing: I like the facts approach, but fear it's gameable; consider a list of publications for a scientist. (Is each paper a fact? Or worse, is each paper n facts (co-author 1, co-author 2....)?)
@S Marshall: "a database of species or diseases or asteroids maintained by scientists is a good source, but a database of Olympians maintained by amateurs isn't" the key might be experts vs amateurs. Espresso Addict (talk) 01:00, 9 October 2022 (UTC)
@JoelleJay: There are plenty of article types that don't need secondary analysis to be useful. Espresso Addict (talk) 01:40, 9 October 2022 (UTC)
@Dlthewave: Agree good practice would be to list sources that can't be verified somehow; I use often use Further reading for this. Espresso Addict (talk) 03:33, 12 October 2022 (UTC)
Comments from JoelleJay
Responses to Red-tailed hawk
|
---|
@Red-tailed hawk (and others), I'm not sure I understand your reasoning? The proposed changes would only affect GNG-based articles, which includes around half of the SNGs, e.g. NSPORT -- so there would be no conflict with SNGs that don't ultimately require GNG. Currently the burden of verifying a subject is notable is almost entirely relegated to AfD and NPP, rather than the article creator (where the responsibility should lie), and the proposals aim to address this. 06:17, 5 October 2022 (UTC)
|
I think that it would be more relevant to say that the database record should contain 10+ facts that would belong in the article, if that article reached FA.
This would NOT be acceptable for GNG because it still doesn't contain secondary analysis. It doesn't matter how many isolated facts a database contains on a subject, if no person is actually discussing them specifically they are not DUE in an article, let alone a sufficient basis for an article. 01:12, 9 October 2022 (UTC)
- Policy:
Do not base an entire article on primary sources, and be cautious about basing large passages on them.
No articles should be based on primary sources. And if a whole source type is unacceptable as the basis of an article, it is not possible for it to contribute to notability. 04:23, 9 October 2022 (UTC)
- Policy:
- @S Marshall Whether a database is scientific/professional or not is irrelevant, the only factor that should be considered is whether someone actually discussed the subject specifically in their own words. That rules out ALL the problematic databases, which only contain facts with no interpretation/commentary, while permitting the ones that actually do have secondary coverage. Being able to write something on a subject (which we can always do by stringing together isolated "facts") is only half of the point of N; the other half is to be able to maintain NPOV/no OR/comply with NOT, which can only occur when other people have personally analyzed those facts and discussed the ones that are important, and why. If we just went by the first purpose we could have articles on every single child with a MaxPreps profile. JoelleJay (talk) 22:10, 9 October 2022 (UTC)
- @S Marshall, the problem with that is there is no indication of which info is encyclopedic, let alone that the topic is encyclopedic merely through being mentioned in a database. That page could never be used to support notability because no one could write an article based solely on it: it would require massive OR for an editor to interpret all of that info, distill what's important, and then contextualize it into a prose WP page. That's what makes it primary. JoelleJay (talk) 23:10, 9 October 2022 (UTC)
- This is the problem with all of the databases that do not contain prose interpretation, which makes it very easy to just exclude off the bat any such database entry from contributing to notability. All one has to do when checking a database entry citation is see whether the subject is discussed specifically, by a human, in prose. JoelleJay (talk) 23:18, 9 October 2022 (UTC)
- @S Marshall, databases that don't contain original prose written specifically about the subject are comparable to specialized web browsers or news aggregators: they may be reliable, but they are not synthesizing anything specifically for any entry, they are merely tools to autocurate and present facts without further analysis. Some of those facts might be from a secondary publication, but they are more likely to be cited to a primary research article or even uploaded directly by a researcher. For the facts that are referenced to secondary sources or have secondary coverage elsewhere, those publications should be the basis of the article rather than the database. For brevity, I'm also going to just quote some parts of NOT and OR (all emphases mine) and expand on those on a talk page:
Information should not be included in this encyclopedia solely because it is true or useful... Verifiable and sourced statements should be treated with appropriate weight
. AndA primary source may be used on Wikipedia only to make straightforward, descriptive statements of facts that can be verified by any educated person with access to the primary source but without further, specialized knowledge.
JoelleJay (talk) 19:28, 10 October 2022 (UTC)
- @S Marshall, databases that don't contain original prose written specifically about the subject are comparable to specialized web browsers or news aggregators: they may be reliable, but they are not synthesizing anything specifically for any entry, they are merely tools to autocurate and present facts without further analysis. Some of those facts might be from a secondary publication, but they are more likely to be cited to a primary research article or even uploaded directly by a researcher. For the facts that are referenced to secondary sources or have secondary coverage elsewhere, those publications should be the basis of the article rather than the database. For brevity, I'm also going to just quote some parts of NOT and OR (all emphases mine) and expand on those on a talk page:
- This is the problem with all of the databases that do not contain prose interpretation, which makes it very easy to just exclude off the bat any such database entry from contributing to notability. All one has to do when checking a database entry citation is see whether the subject is discussed specifically, by a human, in prose. JoelleJay (talk) 23:18, 9 October 2022 (UTC)
- @S Marshall, the problem with that is there is no indication of which info is encyclopedic, let alone that the topic is encyclopedic merely through being mentioned in a database. That page could never be used to support notability because no one could write an article based solely on it: it would require massive OR for an editor to interpret all of that info, distill what's important, and then contextualize it into a prose WP page. That's what makes it primary. JoelleJay (talk) 23:10, 9 October 2022 (UTC)
Comments from FOARP
Why exempt GEOLAND from this when it is far and away one of the worst areas for mass-creation? Just look at the mess that C46 left us with in Iran and California based on bare database sources. FOARP (talk) 12:57, 5 October 2022 (UTC)
Comments from Scolaire (Q2)
Perhaps we need a C3 and a C4 that don't specifically invoke GNG, and don't link to WP:MASSCREATE, e.g. "All mass-created articles (except those specifically excluded) must be cited to at least one (or two) source(s) which demonstrate(s) significant coverage in an independent reliable secondary source." Poorly thought out. I don't know how we would actually edit WP:N. Scolaire (talk) 16:23, 5 October 2022 (UTC)
Comments from Red-tailed hawk
For what it's worth, options A and B are clearly outside of the scope of this RfC, as Rule#2 states that the sole purpose of this RfC is to determine consensus about policy going forward surrounding creation of articles at scale and to form consensus on those solutions
. Simply put, options A and B are not narrowly tailored towards the mass creation of articles; while they could be something the community puts together as a sweeping change to WP:NEXIST, they are clearly out-of-scope for this RfC as they affect policy in a way that is much, much broader than the narrow focus of dealing with creations of articles at scale. That being said, Option C, as it only focuses on (a subset of) mass-created articles, is within the scope of this RfC, as is the option arguing for no change in policy. — Red-tailed hawk (nest) 16:38, 5 October 2022 (UTC)
- @JoelleJay: Would you please provide a list of the SNG pages you say require GNG to be met? I'm looking through them and I'm not getting anywhere near 50-50. — Red-tailed hawk (nest) 22:30, 5 October 2022 (UTC)
- @JoelleJay: With due respect the part where SNGs
are just effectively restating GNG
is somewhat nebulous, far too much so for the reach of this proposal. This proposal squarely asks about articles that are required to meet GNG, not meet something else that we think is similar to GNG but might have a hair of difference. It actually wouldn't have even stopped Lugnuts, since Lugnuts's mass creations were all in line with an SNG that, at the time, did not require GNG levels of coverage nor any evidence of a source that would cover it significantly. If we'd like to change notability guidelines, which I take it the statement RE NSPORT is getting at, there's a place for that, but this is not that place. — Red-tailed hawk (nest) 00:04, 6 October 2022 (UTC)
Comments from Dlthewave
CX Zoom, I'm having trouble imagining a topic that's notable yet suffers from a lack of sourcing. Notability is based on significant coverage, so generally if sourcing doesn't exist then it's not notable. And if the sourcing does exist, why not simply include it in the article from the get-go? The exception would be SNGs that presume notability, but this proposal has a carve-out for those. –dlthewave ☎ 16:02, 6 October 2022 (UTC)
@Espresso Addict: and others, I wonder if a good middle ground would be to include sources in a bibliography if they can't be cited directly for whatever reason. This would at least demonstrate the notability of the topic. –dlthewave ☎ 03:21, 12 October 2022 (UTC)
Comments from WhatamIdoing
None of these proposals will solve any of the actual problems. The problem isn't that the editors aren't providing sources; the problem is that some editors don't believe that the provided sources "plausibly contribute to WP:GNG", and there is no rule we can make that has the power to stop people from feeling that way.
Part of the problem is our unwillingness to settle at WP:N exactly what WP:SIGCOV means. I once had an editor tell me that it was SIGCOV if the whole fact of "<Person> got married today in <city>" could be read in a single source, and didn't have to be cobbled together from one source saying "<Person> got married today" and another saying "<Person> is in <city> today". I've also had editors say that SIGCOV is 100 consecutive words of prose in a single source, 300 words or prose, etc. My own interpretation relies on WP:WHYN: We have "significant coverage" when we have enough coverage to write a decent article. But several editors who like this "plausibly contribute to WP:GNG" phrase don't think that's relevant at all; they appear to think that SIGCOV is a matter of how much serious-souding prose you find, because when I show them that it's possible to write a decent bigger-than-stub article from a database, they persist in believing that databases are incapable of providing SIGCOV. And if you are inclined to believe that because some entries in some databases are insufficient, then all entries in all databases should be assumed insufficient, then I point out that https://omim.org/entry/609423 – a single database record – contains more consecutive sentences of prose than at least 90% of our articles.
The bottom line is that this proposal will not solve any actual problems. I believe these problems can be solved, but this won't contribute to it. A clear definition of SIGCOV would go much further than any of these efforts. WhatamIdoing (talk) 04:19, 8 October 2022 (UTC)
- @S Marshall, I agree with you about the value of being cautious wrt BLPs. I thought you might want to consider the work of Wikipedia:WikiProject Medicine/Dermatology task force 10+ years ago. It basically was a single editor who set himself a target like "I will turn every red link on the List of skin conditions blue" – and he did. He created hundreds of single-sentence, single-source stubs. All of them were notable subjects. (In practice, all recognized-by-mainstream-medicine human diseases are notable under GNG, because even if the cited source only contains a passing mention, others will go into depth.) Most editors have enough sense to know that certain categories should be assumed notable (e.g., heads of state, prescription drugs, human diseases, the latest iPhone...), but for those subjects that are less obvious to some reviewers, how could we explain to someone that this list/this source is okay, but that list/those sources aren't?
- Also, do the problematic aspects of mass creation of articles depend on whether the subject's notability is obvious? WhatamIdoing (talk) 22:56, 8 October 2022 (UTC)
- @S Marshall, I think that it would be more relevant to say that the database record should contain 10+ facts that would belong in the article, if that article reached FA. That would rule out a simple "List of Olympians from Ruritania" (number of facts: name+year+sport+country+score/ranking = 5) but would permit database entries that are actually useful for writing a whole article. (We could swipe the wording for such a provision right out of WP:ELNO#EL1.) WhatamIdoing (talk) 00:10, 9 October 2022 (UTC)
- @Espresso Addict, so long as editors keep using "gameable" as a synonym for "editors trying to follow the rules", then everything is gameable. Perhaps we should embrace that, and write the rules so take advantage of it. "We can't just require articles to have five top-quality sources, because then (*gasp*) people might actually do that! We can't have people going around and creating articles with multiple top-quality sources! What will become of Wikipedia!"
- In terms of your example ["consider a list of publications for a scientist. (Is each paper a fact? Or worse, is each paper n facts (co-author 1, co-author 2....)"], would each one of those papers be included in a Featured Article? Any paper that wouldn't be mentioned is irrelevant.
- If the paper would be mentioned, then I think most editors would interpret 'the fact' as being "He wrote a paper". I don't think they would stand for someone writing "1. He wrote a paper. 2. The co-author was Francis Crick. 3. The title was "Molecular Structure of Nucleic Acids: A Structure for Deoxyribose Nucleic Acid". 4. It was published in 1953. 5. It was published in Nature. 6. It was in volume 171. 7. It was two pages long." WhatamIdoing (talk) 01:24, 9 October 2022 (UTC)
- @S Marshall, I think that it would be more relevant to say that the database record should contain 10+ facts that would belong in the article, if that article reached FA. That would rule out a simple "List of Olympians from Ruritania" (number of facts: name+year+sport+country+score/ranking = 5) but would permit database entries that are actually useful for writing a whole article. (We could swipe the wording for such a provision right out of WP:ELNO#EL1.) WhatamIdoing (talk) 00:10, 9 October 2022 (UTC)
- @JoelleJay, yes, that's in the policy, but it was added by SlimVirgin, who argued for years that when journalists stood in front of a burning building and submitted a story based on what they saw with their own eyes, this was a "secondary source" on the grounds that the journalists were neither arsonists nor firefighters. More than a decade later, editors now (mostly) agree that Wikipedia:Secondary does not mean independent and that WP:PRIMARYNEWS exists, but SV represented the other end of that spectrum.
- Consequently, I'm not sure that it fully represents either the actual or the desired practice as manifested by the work of tens of thousands of editors each month. Strict enforcement of that rule would, for example, result in deleting a large fraction of articles about music albums, which are sourced either entirely to primary sources (e.g., the track listing) or to sources that are partly primary and partly secondary (music reviews). We have, and seem to want, such articles, and when they appear at AFD, it is rare for anyone to complain about NOR violations because the article only names primary sources. It may be one of our rules that does not mean quite what it says. WhatamIdoing (talk) 21:12, 9 October 2022 (UTC)
S Marshall
Espresso Addict, I'm thinking about mass creation in terms of articles about sportspeople and athletes, because that's the actual cause of the drama here. Mass creation often arises when an editor sets themselves a target like "I will turn every red link on this list blue". I think it matters what the list is: asteroids or roads or species of beetle are one thing, and biographies of living people are another. I do not have a problem with Whatamidoing's database source but I do have a problem with the database sources used by Lugnuts among others, and I think it's partly about who compiles the database and partly about its checkability and rigour.
I feel that we can't agree on what mass creation is and therefore we need to generate solutions that don't require a definition of mass creation. Hence my wording above.—S Marshall T/C 10:15, 8 October 2022 (UTC)
- @WhatamIdoing, I'm starting to wonder if it might be useful to distinguish scientific databases from non-scientific ones? Imagine a rule where a database of species or diseases or asteroids maintained by scientists is a good source, but a database of Olympians maintained by amateurs isn't. This idea might need some refinements because for example it makes our articles about roads and railway stations look vulnerable, and there's scope to wonder whether we feel that's a bad thing.—S Marshall T/C 23:39, 8 October 2022 (UTC)
- @JoelleJay, you and I often agree but I join issue with you on this particular point. I'm very persuaded by WhatamIdoing's remark about scientific databases. Take for example this database entry, which is incredibly condensed and contains zero prose sentences. It's also verifiable and well-referenced. It describes the star HIP 56948, and would unpack to a complete article. WhatamIdoing gives similar examples of databases about diseases and I'm sure there are equivalents for species—and, I should expect, court cases as well, although these last can always be supported by prose sources containing legal analysis because judgments are published, at least in Western democracies. I feel it would be an error to rule these out.—S Marshall T/C 22:37, 9 October 2022 (UTC)
- @JoelleJay, again I disagree. WhatamIdoing has already shown here how an editor who understands the science can unpack a scientific database entry into a full article. The only issue is that you do have to understand the science to do it, but that's settled policy already. Verifiability means verifiable by someone. It doesn't mean verifiable by anyone.
- So for example if I cite a source in a foreign language, I've cited the source. Randy from Boise might not be able to follow my citation but that's his problem. The burden is not on me to drive to his house and give him language lessons. If he wants to check my translation, the burden is on him to find an independent Wikipedian who speaks the language and ask them. (And not use Google Translate because it's unreliable.)
- In the case of scientific databases, you've got to, er, speak science to unpack them and not everyone does. That's OK. Randy from Boise's ignorance is, and rightly should be, his problem and not an article writer's problem.—S Marshall T/C 23:36, 9 October 2022 (UTC)
Comments from Donald Albury (Q2)
After thinking long and hard, I have decided that I do not support any of the alternatives under this question. As worded, the question variants exempt all articles that are subject to an SNG, or are not subject to the GNG. That leaves categories that have been particularly subject to mass creation of articles, such as populated places, outside the scope of this "solution". - Donald Albury 14:41, 8 October 2022 (UTC) Edited 14:42, 8 October 2022 (UTC)
Comments from Redfiona99
From my memory, the major problem was with people mass-creating from databases. Would there by anyway of requiring that, if an article was created from a database, the creator would also have to add at least one other, non-database, source? Red Fiona (talk) 17:14, 8 October 2022 (UTC)
Comments from XOR'easter
Redfiona99, I think I see where you're coming from, but that just leads us back to the problem that not all databases are created equal. A poorly-phrased rule could lead to unwarranted bias against perfectly good sources just because of how their websites happen to be formatted. XOR'easter (talk) 17:19, 8 October 2022 (UTC)
Comments from Editor X
Please open your own section with username in the heading. Please limit comments within a section to 300 words.
Question 3: Should we create a definition of "article creation at scale"? By rate, source, similarity, other?
Proposed: "Article creation at scale" is the creation of over 25 similar/similarly-structured articles per day or 50 per week or 100 per month or 200 per year using the same source.
This definition, once finalized, would be usable for establishing limits for the need to request consensus to create at scale, for requesting permission to create at scale, or for other discussions surrounding article creation at scale. (This proposal is intended to be refined and may not be finalizable here in this RfC but can be used for input for later proposals.)
Support (Create definition)
Thryduulf (talk) 19:06, 3 October 2022 (UTC)Reconsidered in light of others' comments. Thryduulf (talk) 07:40, 4 October 2022 (UTC)Supporting on principle, but it's odd to ask if we should have a definition while simultaneously making decisions about the undefined thing. — Rhododendrites talk \\ 19:45, 3 October 2022 (UTC)
Oppose (Create definition)
- I supported, but was confused by the difference between the heading, which asks "By rate, source, similarity, other?" and the actual proposal, which answers that question: "by rate, similarity, and source". I moved to oppose mainly because "similar/similarly-structured" needs more clarity and because "using the same source" should be one way in which they can be similar/similarly-structured. See my comments below for an alternative. — Rhododendrites talk \\ 19:58, 3 October 2022 (UTC)
- Support creating a definition, oppose this definition. As I see it, mass-creation is simply creating articles without individually checking them for notability. Period. It's sometimes justifiable; it sometimes draws from lists or databases whose entries are inevitably notable; and sometimes it doesn't, but the products may still be good. However, putting numbers on it misses the crux of the matter, and also allows for endless dispute about timing and rates (see how bad the wikilawyering can get just with respect to 1RR restrictions). Vanamonde (Talk) 21:00, 3 October 2022 (UTC)
- I'm not sure I support a direct numeric value as anything but guidance; hard numbers can be gamed. The idea here is "If you intend to create a whole bunch of very similar articles, get community feedback rather than just plowing ahead with it." That's not unnecessary at 49 in a week but suddenly essential at 50. Seraphimblade Talk to me 21:22, 3 October 2022 (UTC)
- Per Jacobellis v. Ohio, no need to get boxed in to definition that can be gamed. nableezy - 04:23, 4 October 2022 (UTC)
- No need to get bogged down in particulars. Mass creation can be mass creation even when done slowly. Pinguinn 🐧 04:56, 4 October 2022 (UTC)
- A definition for guidance is a good idea, but not rigid numbers. Thryduulf (talk) 07:40, 4 October 2022 (UTC)
- It's really more about vibe then about specific numbers. We should avoid at all cost the wikilawyering that will come with putting a specific number on it. Create a definition but avoid putting a box around it too strictly. --WhoIs 127.0.0.1 ping/loopback 12:33, 4 October 2022 (UTC)
- I support creating a general description of what "mass creation" would be. But not a rigid definition as such, because that would only lead to lawyering over details. More like a list of indicators of mass creation, like same process, high number, high rate, same source, same structure, etc. — HELLKNOWZ ∣ TALK 16:10, 4 October 2022 (UTC)
- As with other editor's concerns this sounds like something that would end up being wikilawyered, whether claiming that creations do or don't meet the standard. -- LCU ActivelyDisinterested ∆transmissions∆ °co-ords° 18:33, 4 October 2022 (UTC)
- Pointless. "We'll know it when we see it". Might as well try to create a numerical definition of what WP:SIGCOV is, or what a stub is, or any of the other concepts which people have differing ideas around - the existence of those differing idea is not and should never be a reason not to do anything about the issues they address. FOARP (talk) 12:33, 5 October 2022 (UTC)
- I've seen enough "I have made only 3 reverts, so I didn't edit war". ~ ToBeFree (talk) 22:02, 5 October 2022 (UTC)
- Impractical. Not the sort of thing you can define as each case needs to be considered individually for impact, need, etc. BoJó | talk UTC 09:44, 8 October 2022 (UTC)
- Better to focus on quality rather than quantity of new articles. - Donald Albury 14:45, 8 October 2022 (UTC)
- The definition, if any is eventually adopted, will likely come from experience using the noticeboard. LessHeard vanU (talk) 16:08, 11 October 2022 (UTC)
Comments (Create definition)
Comments from Thryduulf (Q3)
The definition will probably have to be slightly fuzzy, but I can only see this being a good thing. 19:08, 3 October 2022 (UTC)Definitions need to be fuzzy because the real world is messy, but this appears to be being treated as rigid which is not. Thryduulf (talk) 07:40, 4 October 2022 (UTC)
Comments from Rhododendrites (Q3)
[Turned comment into 3A per suggestion on the talk page]. See my comments at Q1 for what I think a request for permission should look like. — Rhododendrites talk \\ 20:56, 3 October 2022 (UTC)
- @Vanamonde93:
mass-creation is simply creating articles without individually checking them for notability. Period
- I suspect if that were the working definition, there would be consensus to completely prohibit it, but I've not seen anybody put forth a definition like that before. On the flip side, it would also allow for creating thousands of articles per year as long as you know they're notable, regardless of rate/sourcing, and those are things people are clearly concerned about. — Rhododendrites talk \\ 21:14, 3 October 2022 (UTC)
- @Vanamonde93:
politicians that unambiguously meet NPOL, legally recognized towns, etc.
- I agree that we should allow some forms of mass create at scale, but you just defined mass creation as limited to creation without checking the articles for notability. Creating articles that are unambiguously notable means they've been checked for notability. :) I'm not making a point about what should be allowed -- just that that's not a helpful definition. — Rhododendrites talk \\ 21:21, 3 October 2022 (UTC)
- @Vanamonde93:
Comments from Vanamonde
- @Rhododendrites: I don't think there will. There's plenty of mass-creation that's quite justifiable; described scientific species, politicians that unambiguously meet NPOL, legally recognized towns, etc. These are areas in which we've had community support not only for mass-creation, but for bot creation. I think the community is upset about mass-creation of non-notable pages, which, I believe, is the consequence of the GNG-SNG mismatch I mentioned above. Vanamonde (Talk) 21:18, 3 October 2022 (UTC)
- @Rhododendrites: When I say "checked for notability", I mean topics have individually been evaluated against a criterion that requires manual evaluation. You can quibble with the word "checked" if you'd like, but there's a qualitative difference between creating pages off of a list of MPs (or towns, or cricket players) and looking to see if each meet GNG. That's the fundamental feature of mass creation. Vanamonde (Talk) 04:09, 4 October 2022 (UTC)
Comments from Editor xaosflux
- Above only mentions a lower bound, however at some upper bound such an endeavour should fall to bot-flagged accounts to prevent flooding. Perhaps guidance related to when a bot task should be used is wise here. Such a mass-creation would still need to pass all other requirements, in addition to being pre-approved as uncontroversial. — xaosflux Talk 14:43, 4 October 2022 (UTC)
Comments from FOARP
Have to be honest, I am not at all jazzed by the supporters here !voting oppose on other proposals just because this proposal (and the other attempted definitions) are not being supported. Wikipedia has many contested but nonetheless valid concepts. Take WP:SIGCOV for example - why should we develop a numerical measure of what this is? Would anyone ever agree as to what it is? Yet it is still a very valid and useful concept even without a specific numerical definition. Please address the other proposals on their own merits. FOARP (talk) 12:26, 11 October 2022 (UTC)
Comments from Editor X
Please open your own section with username in the heading. Please limit comments within a section to 300 words.
Question 3A: Alternative three-part definition
An editor is engaged in "article creation at scale" if these three criteria are all met:
- Rate - More than [X] new articles in the span of a month or [Y] in the span of a year (with X and Y to be determined subsequently, if this proposal finds support).
- Related articles - The articles are on a similar topic, similar theme, or based on the same set of sources.
- Manually created - Rather than the use of a bot/script/tool (which requires going through a different process, bot authorization).
Anyone who answers "yes" to all three of these is engaging in "article creation at scale" of the sort that would require abiding by the rules set forth elsewhere in this RfC (such as posting a request to a noticeboard, if Q1 gets support). Even if an editor does not think they meet the criteria, an uninvolved administrator may determine that someone's editing fits within the spirit of these requirements, and instruct them to seek permission.
Support 3A (alternative three-part definition)
- As proposer. — Rhododendrites talk \\ 20:56, 3 October 2022 (UTC)
- I think we can change the numbers later, but this is a good start at defining mass-creation. --Enos733 (talk) 21:16, 3 October 2022 (UTC)
- I don't think a generic "create definition" will suffice. We need to decide on a concrete definition at this stage. Scolaire (talk) 15:25, 5 October 2022 (UTC)
- Support this kind of criteria, but the third requirement is not necessary, as it is still article creation at scale if a script is used, just that extra controls are needed. Graeme Bartlett (talk) 03:58, 7 October 2022 (UTC)
- Generally support, since if we are going to have certain procedures related to article creation at scale we need a definition. I agree with Graeme Bartlett that criterion #3 is not needed and so should be eliminated. I also would remove "same sources" from #2, since it doesn't really matter and if there are multiple sources we are getting to the point where the mass creation is probably not a problem anyway. Of course, #1 needs to be defined appropriately. The proposal of 50 per month or 500 per year is really nowhere near appropriate, as we should be able to handle 2 creations a day from any editor. Something more like 25 per day, 100 per week would be more like a mass creation that would be potentially problematic (although I would be fine increasing those numbers). Rlendog (talk) 17:54, 7 October 2022 (UTC)
- I support having a definition of mass-creation/article creation at scale, and I support that definition being based on all three of the criteria (rather than just some). The exact numbers (which have been removed?) do not matter to me, though I would like someone to be able to create more than one article per day without being hassled over it. Editors have done 30-day challenges to create articles, and if they like it enough to keep going, why shouldn't they? WhatamIdoing (talk) 04:27, 8 October 2022 (UTC)
Oppose 3A (alternative three-part definition)
- Support some guidelines in principle, but 500 a year before some community oversight kicks in is way too many. Seraphimblade Talk to me 21:19, 3 October 2022 (UTC)
- The method of creation is not relevant, not convinced with these numbers. Thryduulf (talk) 23:24, 3 October 2022 (UTC)
- I'm fundamentally opposed to a numerical definition, see comments below. Vanamonde (Talk) 04:11, 4 October 2022 (UTC)
- Moral support as a better direction than a rigid definition, but any numbers will inevitably lead to gaming the system and arguing over semantics rather than following the intention. Kind of like everyone is always arguing about bot-like editing. Also I should not that mass creation and assisted creation are not mutually-exclusive. Both, either or neither processes could apply to article creation. Bot approval would still ask for mass creation approval should it become a more codified requirement. Finally, one point I do agree with is that an uninvolved admin (or "mass creation clerk" or some such) should have the "authority" to label any multiple article creation as mass creation to avoid arguments about what is or isn't mass creation on individual level. — HELLKNOWZ ∣ TALK 16:30, 4 October 2022 (UTC)
- I oppose criteria 1. A definition of mass creation should be based on the amount of time spent writing the article. It should not be based the time between edits or the total number of articles. For example, a single article written in one minute is far more likely to be a problem than 500 articles each of which was written in 7.3 hours by an editor working ten hours a day for 365 days [i.e. (365 days x10 hours per day)÷500 articles=7.3 hours per article]. James500 (talk) 00:28, 6 October 2022 (UTC)
- I can't support without having an idea of X and Y. I don't object to the idea, but if we leave this open then at some point X might get set to 3 and Y to 10 - and that's clearly nonsense (I've seen a suggestion that's not a million miles from those numbers). If a sensible lower bound were put on X and Y I might be able to support. Blue Square Thing (talk) 08:26, 8 October 2022 (UTC)
- As with Q3 above, impractical. BoJó | talk UTC 09:46, 8 October 2022 (UTC)
- Again, this is focused on quantity, I think quality is the key. - Donald Albury 14:47, 8 October 2022 (UTC)
- Same as above: "We'll know it when we see it". Just like with WP:SIGCOV and other similar concepts that lack a numerical definition but are nonetheless useful. FOARP (talk) 12:29, 11 October 2022 (UTC)
- The worse offenders will game any given quantity or timescale (or sock around them). As commented in the prior definition discussion, examples will likely be self incrimminating to the casual observer. LessHeard vanU (talk) 16:17, 11 October 2022 (UTC)
Comments (alternative three-part definition)
Comments from Rhododendrites (alternative three-part definition)
@Enos733 and Seraphimblade: To move this forward, I've replaced 50 and 500 with "X" and "Y". The idea is to agree to some basic rules, and then deal with specific numbers afterward, rather than altogether. Does that make sense? Hope I'm not being too bold... — Rhododendrites talk \\ 21:27, 3 October 2022 (UTC)
@Thryduulf: ^^ The numbers had already been removed by the time you commented. I'm curious what you mean by "method of creation is not relevant". We're making rules for a specific activity. Up to now, a lot of people have filed all "mass creation" under WP:MASSCREATE and WP:MEATBOT, which are specifically rules about when one should seek bot authorization. The method is important, because use of tools/scripts is often frowned upon while manual creation is typically treated differently. — Rhododendrites talk \\ 00:44, 4 October 2022 (UTC)
@Seraphimblade: The Even if an editor does not think they meet the criteria, an uninvolved administrator may determine that someone's editing fits within the spirit of these requirements, and instruct them to seek permission.
part tries to do that. Is there a better way? — Rhododendrites talk \\ 01:56, 4 October 2022 (UTC)
I find the arguments that setting clear numbers (and allowing admins to go by the spirit of the numbers to avoid gaming the system) will encourage wikilawyering/gaming more than some subjective "I know it when I see it" meaning... bizarre. Like why did we implement 3RR? Because "don't edit war" means people wikilawyer over the meaning, blocks/unblocks get litigated, etc. Do people still wikilawyer over 3RR? Of course, but having a bright line sets expectations clearly, without removing the ability of admins to act on other forms of edit warring. — Rhododendrites talk \\ 16:54, 4 October 2022 (UTC)
Comments from Seraphimblade
Even without a particular number currently in place, I still think this is open to gaming. I would probably see a definition more along the lines of edit warring, where we effectively say "If you violate 3RR, you are pretty much certainly edit warring, but that doesn't mean you're not edit warring if you don't make more than 3 reverts. Now, here's what an edit war tends to look like." I think that would be a much more effective way of defining "mass creation" and much less susceptible to gaming. Seraphimblade Talk to me 01:46, 4 October 2022 (UTC)
Comments from Thryduulf (Q3A)
@Rhododendrites: Method of creation is not relevant because the only thing that matters to readers is the output. Tools, scripts, bots, manual can all produce output of good or bad quality. Thryduulf (talk) 07:34, 4 October 2022 (UTC)
Comments from Espresso Addict
I'd like to draw attention to WP:Women in Red's long-running #1day1woman initiative, which encourages daily article creation on what's arguably a single topic. Generally the outcomes have been favourable and have certainly resulted in positive press coverage. Espresso Addict (talk) 23:44, 6 October 2022 (UTC)
Question 3B: Should we create a definition of "article creation at scale"?
Proposed: Create a definition of "Article creation at scale".
This definition, once finalized, would be usable for establishing limits for the need to request consensus to create at scale, for requesting permission to create at scale, or for other discussions surrounding article creation at scale. (Details may not be finalizable here in this RfC but can be used for input for later proposals.)
Support (Q3B)
- If this us just "should we have a definition", that seems obvious to me. Or, well, it did seem obvious until I saw a couple people on this page argue against it. What in the world is the noticeboard people are supporting going to be for if there's no clear definition of when someone needs to use it and/or what considerations requests will be judged by? Many people have proposed things like CSD or special rules for deleting mass created pages -- how can we support or propose anything like that without starting from a clear definition? — Rhododendrites talk \\ 13:52, 4 October 2022 (UTC)
- I believe this would be useful in principle. I'm fairly certain it's not going to get anywhere in practice, given how wide the disparities are in the understanding of the problems we face. Vanamonde (Talk) 16:14, 4 October 2022 (UTC)
- We need some definition, but the placement of hard limits or certain methods will lead to wikilawyering. I think something that gives a basic definition, but only as far as the general idea, would be a better idea. We don't need a strict definition to recognise harassment, and we don't need one to recognise articles being created en masse. -- LCU ActivelyDisinterested ∆transmissions∆ °co-ords° 18:39, 4 October 2022 (UTC)
- We absolutely must agree on some sort of a definition. Without a definition, we're all talking past each other when it comes to mass creation. Espresso Addict (talk) 00:47, 5 October 2022 (UTC)
- Per ActivelyDisinterested. The definition should be set of characteristics that mass actions generally have and non-mass actions generally don't that act as guidance not rigid rules. Thryduulf (talk) 09:37, 5 October 2022 (UTC)
- Sure? The devil's in the details on these things, though. — Red-tailed hawk (nest) 22:12, 5 October 2022 (UTC)
- Support. I guess this is what the community needs so the parties don't have to explain it over and over again in discussions on editors who mass create articles.Paradise Chronicle (talk) 23:06, 5 October 2022 (UTC)
- Support, no real reason not to considering how much the topic is referenced. Devonian Wombat (talk) 02:17, 7 October 2022 (UTC)
- Yes, but make it fairly generic, and don't confuse the definition with the problems that may be there. The scale is just the rate or number, not the quality. Graeme Bartlett (talk) 03:59, 7 October 2022 (UTC)
- Support. We need a definition if we are going to do something about it. I think it needs to involve numbers, since only numbers can define whether the creation is "mass"; once we know we have a mass creation under the definition we can address whether the quality is adequate in any particular case. The numbers don't have to be rigid - if we say. for example, 25 per day and someone creates 24 per day for several days we can address that as mass creation, or even make the definition a bit fuzzy, like about X per [period]. Rlendog (talk) 17:58, 7 October 2022 (UTC)
- Support. This is the bare minimum we have to do to be able to take any action regarding article creation at scale at all. The definition might be somewhat fuzzy, but we need to have something to reference. --Aquillion (talk) 18:17, 7 October 2022 (UTC)
- Support. If we use numbers, then we can have a discretionary range - a minimum(e.g. more than 10 articles per week) and a maximum(e.g everyone doing more than 30 articles per week is engaging in mass-creation).Lurking shadow (talk) 19:40, 7 October 2022 (UTC)
- Yes, we need a definition. That definition needs to be clear enough that people who cry "mass creation" because someone created a handful of articles on subjects they dislike can be told to knock it off. We do not need another process in which there is one set of rules for me and another set of rules for thee. WhatamIdoing (talk) 04:29, 8 October 2022 (UTC)
- Yes, per my opposition to 3A above Blue Square Thing (talk) 08:27, 8 October 2022 (UTC)
- Though I still oppose numeric values being used as anything but very loose guidance (if at all), I think a clearer definition of what "mass creation" generally looks like would be helpful. Seraphimblade Talk to me 12:02, 9 October 2022 (UTC)
- As a statement more that a hard definition; "careless creation of quantities of low quality articles from poor sources are considered detrimental to the community cause" - if anyone can think of words beginning with c to replace article, poor sources, and detrimental, can substitute as they wish. LessHeard vanU (talk) 16:24, 11 October 2022 (UTC)
- Support - you can't make rules about something if you don't define your terms. ~ ONUnicorn(Talk|Contribs)problem solving 20:37, 11 October 2022 (UTC)
Oppose (Q3B)
- Per my support of 3A, I don't think a generic "create definition" will suffice. We need to decide on a concrete definition at this stage. Scolaire (talk) 15:27, 5 October 2022 (UTC)
- Definitions are impractical and will be circumvented. A smart mass creation artist will always stop just short of the limit. Consider each case on its individual merits. BoJó | talk UTC 09:51, 8 October 2022 (UTC)
- "Article creation at scale" is a granfalloon. Investing time into codifying a definition would just reify a wide variety of practices into a single thing. We've got people worried about the creation of 1000 sports stubs based on single lines in statistics tables from Gray's Sports Almanac, and other people pointing out that biomedical databases are rich with more than enough material to write articles, despite the "database" moniker. This isn't one argument; it's half a dozen different arguments, stemming from deep illusions that "GNG-level" and "SIGCOV" have meanings handed down from Sinai. XOR'easter (talk) 16:51, 8 October 2022 (UTC)
- Defining a threshold for the scale of articles that's okay to create is meaningless. In a vacuum, any particular number of articles—whether it's 10 or 10,000—could be spam or a major contribution to the encyclopedia. If we then try to create a more vague, non-numerical definition to avoid having the policy gamed, then it's not really an effective guideline. What matters is the quality of that encyclopedic content, in terms of verifiability and notability, which we already have well-established policies about which we can use to discuss the merits of articles and whether they should be deleted or not. If you think a particular subject or type of article is inherently not notable, then propose a new notability policy or guideline to specifically address it. A threshold based on the number and size of articles doesn't help us avoid crappy articles, and that blunt of a guideline will most definitely deter helpful editors from writing good articles. Steven Walling • talk 21:17, 11 October 2022 (UTC)
- Pure WP:BURO. I'm sure that this is not intended as such, but this is ultimately acting as just an excuse to do nothing. Wikipedia has many contested concepts (e.g., WP:SIGCOV, WP:BLUDGEON, WP:AUD etc.) but the fact that these are contested does not make them useless or counter-productive. What is an is not mass-creation is something that should be beaten out on a case-by-case basis and need not be predetermined here. Indeed, there are projects where mass creation already has been dealt with (particularly WP:NASTRO) and curiously enough they did not bother to define it explicitly beyond referring to the already-existing WP:MASSCREATE. FOARP (talk) 09:39, 12 October 2022 (UTC)
Comments (Q3B)
Comments from Joe Roe
I agree with Rhododendrites that you self-evidently have to have a shared definition of something if you're going to start creating rules and noticeboards about it. But what on earth does these questions mean in the context of an RfC? If there is no consensus to create a definition, does the ordinary English phrase "mass article creation" becomes undefined? If there is a consensus against creating a definition, is it officially decreed undefinable? In the (admittedly increasingly unlikely) event that any new processes come out of this RfC, are we prohibited from describing their scope unless we get the go-ahead here? This whole thing gives me a headache. – Joe (talk) 06:56, 5 October 2022 (UTC)
Comments from Espresso Addict
I'm liking the idea of a "set of characteristics that mass actions generally have and non-mass actions generally don't that act as guidance not rigid rules" per Thryduulf. I'm not sure how in this framework we might move forward and agree such a thing, particularly as I think the rules of the discussion prohibit mentioning actual cases. Espresso Addict (talk) 19:26, 5 October 2022 (UTC)
@ActivelyDisinterested: Everyone is certainly capable of recognising mass creation, the intractable problem is that we all appear to be recognising different things. Espresso Addict (talk) 01:43, 8 October 2022 (UTC)
Comments from Valereee
Espresso Addict, the rules aren't intended to prevent referring to a past case or cases to allow you to explain why something is necessary or reasonable or a good idea. They're primarily intended to keep people from bickering over past grudges. Please feel free to discuss past cases that are necessary to understand a proposal. Valereee (talk) 20:18, 5 October 2022 (UTC)
Comments from Andrew Davidson
I agree that a definition is required for practical purposes. Some examples may help in clarifying what people mean by this.
- The archetypal case is Rambot – an early bot that created stubs based on the US census. I suppose most everyone would agree that that was mass creation.
- Jesswade88 has been lauded for her steady creation of articles about women. She aims to create one every day and still seems to be going strong. I would myself say that this is not "mass" creation because the creations seem separate and discrete rather than being batched and "cookie cutter". The word I'd use for this is "sustained" rather than "mass".
I could go on but you get the idea.
Andrew🐉(talk) 17:25, 6 October 2022 (UTC)
Comments from ActivelyDisinterested
I think anyone is capable of recognising mass creation. If someone creates 25 well referenced articles in a day, that's a bit odd but it's fine. When a set of articles are all the same apart from small details and have one reference that is a database or list, that's obviously mass creation. -- LCU ActivelyDisinterested ∆transmissions∆ °co-ords° 17:49, 7 October 2022 (UTC)
Comments from WhatamIdoing 2
ActivelyDisinterested, I think we need a definition that is absolutely objective, precisely because not everyone is capable of recognizing mass creation. Under your "obvious" definition, I can't tell whether 25 well-referenced articles that happened to all be the same apart from small details is mass creation. I can't tell whether just two articles, which happen to be the same apart from small details and have one database reference, is "mass" creation. (Where's the "mass" part of mass creation, when only two articles are involved?) We shouldn't be leaving the definition up to each editor's personal opinion. That doesn't lead to sensible conversations and quick resolution of disputes. That leads to unfair enforcement: I have lots of wiki-friends, so I can create 100 weak articles on my Very Important™ Subjects, but you don't, so I'll get you stopped if you create five on your unimportant subject. That's not healthy for the community, and no matter how much we might wish it otherwise, that is the alternative to having a clear definition. WhatamIdoing (talk) 04:40, 8 October 2022 (UTC)
Comments from FOARP
Should a definition exist? Well, it already does. Indeed more than one definition already exists. I do not oppose coming up with a more formal one at some point - ideally after a period of case-by-case examination - but for Pete's sake let's not let this just become a barrier to taking any action at all in this RFC! FOARP (talk) 09:47, 12 October 2022 (UTC)
Comments from Editor X
Please open your own section with username in the heading. Please limit comments within a section to 300 words.
Question 4: Should we prohibit the creation of articles at scale?
This proposal would prohibit the creation of articles at scale based upon a rate definition to be separately decided.
Support (prohibit)
- The creation of encyclopedia articles must be understood as a matter of quality, not quantity, and that the rapid creation of articles almost certainly threatens our extant processes for article triage and improvement. Chris Troutman (talk) 20:37, 3 October 2022 (UTC)
- The main problem affecting Wikipedia's status as an encyclopedia is the general lack of reliability. This includes lack of wikitext factual verification, lack of NPOV verification, and (where they exist) lack of citation validation. The exceptions are few, and glaring, as they inadvertently spotlight the vast majority of articles' shortcomings. As a tertiary resource of knowledge, Wikipedia falls short and cannot be trusted. Any action that does not immediately and directly deal with this problem contributes to it by extending and enhancing the status quo. This RFC, and its purpose, are prime examples. Not a single article should be allowed in Wikipedia mainspace unless it is deemed trustworthy by uninvolved editors. A first step would be that it should conform to the relevant Wikipedia mainspace policies, not an extraordinary concept. Any articles not created in this way do not contribute knowledge fit for anybody. Their provenance and accuracy are at best uncertain, and may as well be fiction. Considering the site's history and current state, it is prudent they be considered so unless proven otherwise. 65.88.88.68 (talk) 15:56, 6 October 2022 (UTC)
Oppose (prohibit)
- Much too blunt of an instrument. At the most extreme, we're saying we don't want someone to create 51 GAs in a month on various topics? Or we're assuming they're all stubs? If the latter, a more precise question might be to ask whether we want a minimum level of quality for articles created at scale. Update: I've added Q5 accordingly. — Rhododendrites talk \\ 20:52, 3 October 2022 (UTC)
- There are circumstances where mass-creation is quite justifiable. I'd support the more specific prohibition of mass-creation where notability is not automatic. Vanamonde (Talk) 21:20, 3 October 2022 (UTC)
- I think what is needed here is regulation, not prohibition. Seraphimblade Talk to me 21:24, 3 October 2022 (UTC)
- Per everyone above. Thryduulf (talk) 23:24, 3 October 2022 (UTC)
- Mass creation without preapproval should be - and already is - forbidden, but this goes beyond that. BilledMammal (talk) 00:10, 4 October 2022 (UTC)
- Per Rhododendrites. Pinguinn 🐧 04:58, 4 October 2022 (UTC)
- A non-starter. Mass creation—manual or semi-automated—has been one of the most useful tools we have for expanding our coverage of repetitive but encyclopaedic topics (e.g. geographic places, species, sports) and will continue to be for the foreseeable future. We have occasionally had problems with editors doing it badly, yes, but this would be throwing the baby out with the bathwater. It would also be a disaster for efforts to address systematic bias, because we'd be curtailing efforts to expand coverage in under-represented areas, while retaining all the mass-created articles we have now (favouring the Anglosphere/Global North). – Joe (talk) 11:06, 4 October 2022 (UTC)
- Mass creation of articles is OK if done properly. Hut 8.5 12:20, 4 October 2022 (UTC)
- Echoing everyone else. — HELLKNOWZ ∣ TALK 16:36, 4 October 2022 (UTC)
- * Pppery * it has begun... 18:00, 4 October 2022 (UTC)
- Articles are good, bad articles are bad. We don't want to stop good articles, but minimise bad articles that have to be taken to AfD and clog up NPP. -- LCU ActivelyDisinterested ∆transmissions∆ °co-ords° 18:41, 4 October 2022 (UTC)
- Plenty of instances where mass creation is perfectly justified (after elections, for example). I do think some parts of this (wider) debate have lost sight of the fact that new articles are the lifeblood of an encyclopedia. Espresso Addict (talk) 01:37, 5 October 2022 (UTC)
- If an editor happens to find a batch of 10 notable topics in a day, and has the time to create properly sourced new articles on all 10 in a day, then that should be encouraged, not prohibited. The problem is the bad sourcing and cookie-cutter nature of some past binges of mass creation, not the volume itself. —David Eppstein (talk) 16:32, 5 October 2022 (UTC)
- On the
throwing the baby out with the bathwater
and theproblem is the bad sourcing and cookie-cutter nature of some past binges
rationales articulated above. Are short descriptions of individual species or once-inhabited locales the sort of thing I expect to find when I open an encyclopedia? Yes, they are. Do we benefit people by freeing information that had been locked up in specialized, harder-to-navigate references? Yes, we do. XOR'easter (talk) 16:47, 5 October 2022 (UTC) - No. We are supposed to be creating an encyclopedia, and it is much closer to its beginning than the end. There is no reason to prohibit the creation of any articles. Phil Bridger (talk) 18:51, 5 October 2022 (UTC)
- I've seen editors who can write 10+ sourced, non-stub articles in a day. There are definitely topics where Wikipedia is missing a lot of notable articles but it's also easy to find sources (e.g. American historic sites), and we want to encourage the creation of quality articles on those topics, not prohibit it. TheCatalyst31 Reaction•Creation 03:10, 6 October 2022 (UTC)
- No, if the articles are good then they are welcome. Graeme Bartlett (talk) 04:00, 7 October 2022 (UTC)
- No reason to prohibit mass creation of good articles that meet GNG and are well sourced. Rlendog (talk) 17:59, 7 October 2022 (UTC)
- Decent, well sourced articles are welcome, however they come about. We don't need to restrict this in such a way Blue Square Thing (talk) 08:29, 8 October 2022 (UTC)
- All that is needed is the requirement to cite reliable, non-database sources which provide significant coverage. Someone might have a couple of books which talk at length about numerous topics that are needed as individual articles. Those articles would then be mass-created but the sources would be reliable and coverage in each would be broad. We would welcome mass creation in that scenario. BoJó | talk UTC 09:56, 8 October 2022 (UTC)
- Wrong approach, why limit the rate if articles subjects demonstrate notability and sourcing. - Donald Albury 14:50, 8 October 2022 (UTC)
- This will start a war regarding WP:STUB, and their creation, which is an accepted guideline. LessHeard vanU (talk) 16:28, 11 October 2022 (UTC)
- No as long as the articles are still acceptable, if they aren't that's the same as if they were created individually other than perhaps the way the author may be dealt with namely the fact they have created more unsuitable pages quicker may make sanctions more likely. Crouch, Swale (talk) 20:24, 11 October 2022 (UTC)
- ~ ONUnicorn(Talk|Contribs)problem solving 20:40, 11 October 2022 (UTC)
- Blanket opposition to the creation of articles is the most boneheaded idea imaginable. Our mission is to create more encyclopedic content, not prevent it. Steven Walling • talk 21:02, 11 October 2022 (UTC)
- Not in these hopelessly wide terms. Ingratis (talk) 06:56, 12 October 2022 (UTC)
Comments (prohibit)
Comments from 50.74.109.2
Wikipedia has failed to apply standing policies regarding verification, neutrality etc. years after an uncounted number of offending single articles has been created. In light of such failure, codifying batch creation of articles is a fool’s errand. Despite this, the comments opposing prohibition seem to presuppose/imply that article batches will be somehow validated a priori. Without any proof other than wishful thinking like, "if a bunch of articles are ok [how?] let's publish 'em". This is inherently inapplicable: there is no corresponding tool for validation of article batches. It isn’t likely that any such tool can ever exist. Entities/people who deal professionally with validation and true fact-checking would be already using it if there was one. In the end, all articles in a batch will have to be validated singly, and as pointed in the beginning, anyone can see how this has worked out.
This RfC surreptitiously serves to make Wikipedia’s unreliability more palatable, by obscuring the real problem behind useless, minutely detailed procedure. Well-argued opinions regarding utopian proposals that ignore simple realities will never make sense no matter how politely they are expressed, and will not magically transform these proposals into workable solutions. I’m afraid this is just so much more pompous nonsense. 50.74.109.2 (talk) 01:32, 9 October 2022 (UTC)
Question 4A: Should we restrict creation of articles at scale?
Proposed: Mass-creation is permitted for a group of topics when all members of that group meet a criterion directly granting notability. All other mass-creation is prohibited unless supported by consensus at the mass-creation noticeboard (see Q1).
Addendum for clarity, not part of the proposal: criteria that have traditionally been held to grant notability include the GNG (of course), but also SNGs such as NPOL, NPROF, CREATIVE, GEOLAND, and a few others. Topics that meet these standards are not typically also required to meet GNG. The same is not true of other SNGs (notably NSPORTS). The distinction being made in the proposal is between these two categories of notability criteria.
Support (limit)
- As proposer, and per comments above. Mass-creation is a problem only when the notability of each created item is not demonstrated. There are cases where groups of topics do meet this criterion (under NPOL, for instance, or GEOLAND) where it isn't a problem. Vanamonde (Talk) 21:28, 3 October 2022 (UTC)
- That seems fair enough. Several SNGs are understood to be independent of the GNG (for example WP:NPROF: This guideline is... explicitly listed as an alternative to the general notability guideline). If the subject of an article meets one of those then there is no need to show that the subject passes the GNG. If you don't think any SNGs should work that way then make a proposal to change the SNGs which don't. Hut 8.5 12:20, 4 October 2022 (UTC)
- I'd See it as fair that if one creates 5 articles a day that they come under review and if they fairly elaborated articles, they should be allowed to create them on, but if they create one-two-three line stubs (with maybe an infobox) they should temporarily be only allowed to create drafts and submit them for review to see if they pass the AfC requirements.Paradise Chronicle (talk) 14:15, 5 October 2022 (UTC)
- Limiting mass creation allows time for proper scrutiny, especially with regards to SNGs that don't necessarily mean GNG is also met. Der Wohltemperierte Fuchs talk 22:07, 5 October 2022 (UTC)
- If the do meet the inclusion criteria, then that removes a problem. I also think that the articles created en masse should also prove that they meet the criteria. Graeme Bartlett (talk) 04:03, 7 October 2022 (UTC)
Oppose (limit)
- Opposing mainly because I don't understand. Following on the proposer's support !vote, how does passing a SNG like GEOLAND or NPOL not make something notable? I mean, there's the "presumed" language of those guidelines, but that's even in the GNG. What's the distinction being made here? If this boils down to "articles created at scale need to be notable", that's pretty uncontroversial because all articles already have to be notable. If it's that articles have to meet the GNG, it would be clearer to propose that explicitly. I'd also disagree that notability is the only problem with mass creation. Sourcing and rate are the issues I see come up most often. — Rhododendrites talk \\ 22:58, 3 October 2022 (UTC)
- Per Rhododendrites. Thryduulf (talk) 23:27, 3 October 2022 (UTC)
- Mass creation is a problem even when the articles meet SNG's, as SNG's are subject to change, and the mass created articles on the basis of SNG's are almost always low quality. BilledMammal (talk) 00:12, 4 October 2022 (UTC)
- SNGs should not be "granting" notability to start with, only pointing out when it is likely to exist. I certainly don't want anything that would further encourage that. Seraphimblade Talk to me 01:57, 4 October 2022 (UTC)
- Either it's notable or it isn't. SNGs are there to help clarify when articles should be created, not mandate what articles are created. Pinguinn 🐧 05:02, 4 October 2022 (UTC)
- Notability would be better handled by ensuring articles have some minimum referencing in place. -- LCU ActivelyDisinterested ∆transmissions∆ °co-ords° 18:48, 4 October 2022 (UTC)
- Per WP:CREEP. This seems to be pointless rule-making. I don't see any situation where making this into any kind of policy or guideline would change anything we do about mass creation. —David Eppstein (talk) 16:33, 5 October 2022 (UTC)
- This is the same question as above, just with a carve-out for SNGs. I don't think this would solve our problem, since (a) a lot of problematic mass creation has been in areas that ostensibly were covered by an SNG, like Carlossuarez46 and GEOLAND, and (b) a lot of productive mass-creation isn't covered by a formal SNG because the articles literally never get deleted at AfD anyway (species, sites on the National Register of Historic Places, etc.) TheCatalyst31 Reaction•Creation 03:14, 6 October 2022 (UTC)
- Per David Eppstein. Also, there is no reason to prohibit or even limit mass creations that are well-sourced and meet notability criteria, regardless of the status of other members of the "group". Rlendog (talk) 18:06, 7 October 2022 (UTC)
- Decent articles that are well sourced are decent, well sourced articles. We shouldn't restrict this in such a way - it might be that only 10% of <insert name of large topic> articles are notable; we should have those articles - and if they're mass created then that's fine Blue Square Thing (talk) 08:31, 8 October 2022 (UTC)
- Same as for Q4 above. Sources are what matter. This idea is making a rule for the sake of making a rule. Keep things simple. BoJó | talk UTC 09:59, 8 October 2022 (UTC)
- Same as above. Why limit the creation of acceptable articles? - Donald Albury 14:52, 8 October 2022 (UTC)
- Weakly for my reason above. Crouch, Swale (talk) 20:25, 11 October 2022 (UTC)
- Creating a rule where you have to ask permission to create encyclopedia articles is a surefire way to deter people from volunteering their time to expand and improve the encyclopedia. Steven Walling • talk 21:04, 11 October 2022 (UTC)
Comments (limit)
Comments from Vanamonde
This looks like it's headed for failure already, because the variation in standards among the SNGs is simply being ignored. Some SNGs are written such that they confer notability independent of GNG. Others are not. If a person meets NPOL, they are not required to meet GNG for an article to exist. The same is not true of NSPORTS. Seraphimblade, I'm particularly confused by your position; because not only do some SNGs already grant notability (to the same extent as GNG), mass-creation is currently being justified on the basis of SNGs that do even less. This proposal is intended to restrict that, not enable it. Vanamonde (Talk) 04:16, 4 October 2022 (UTC)
- @TheCatalyst31: I appreciate that the proposal won't solve the whole problem, believe me, but you imply that it would address some pieces of the problem; so why is it a bad thing? Vanamonde (Talk) 14:57, 6 October 2022 (UTC)
- @Rhododendrites: I added a clarification above in response to your comment, but realized you may have missed it in all the back-and-forth; does that address your concerns at all? Vanamonde (Talk) 17:35, 7 October 2022 (UTC)
- Rlendog The proposal doesn't do what you say it does though; mass-creations that meet GNG, for instance, are explicitly exempt. Vanamonde (Talk) 21:08, 7 October 2022 (UTC)
Comments from Seraphimblade
For clarity, my oppose is on the grounds that we never should have had SNGs like the "professor" and "populated place" ones, which purport to confer notability even in the absence of substantial good quality source material about the subject. I'm already seeing a growing amount of discontent regarding that, and don't want something which legitimizes that practice further. I hope relatively soon there, we will see a change in those areas and a cleanup of them like we once had to do with fiction and more recently have been doing with sports biographies. (Of course, not having it on a separate page entirely doesn't mean we shouldn't have anything about it; most populated places, for example, would fit quite nicely on a page like "List of populated places in Example County, Somestate".) Seraphimblade Talk to me 07:31, 4 October 2022 (UTC)
Comments from Editor Paradise Chronicle
Having reflected on my vote, I'd like to add that the very basic stubs (if properly sourced) are of course also a valuable contribution of the mass creators to the Wiki project, but maybe (probably) their edits would be more appreciated at Wikidata than Wikipedia.Paradise Chronicle (talk) 09:51, 6 October 2022 (UTC)
Comments from TheCatalyst31
@Vanamonde93: To be clear, I meant the problems with the original Question 4 as much as I did the problem of mass-creation. Excluding topics with SNGs would still leave this policy as a blunt instrument that would prevent productive mass creation (if a bit less so), but it would also move it away from addressing the core issue at hand. TheCatalyst31 Reaction•Creation 01:01, 7 October 2022 (UTC)
Comments from FOARP
Vanamonde - Mass-creation is ABSOLUTELY a problem for GEOLAND, precisely because of the supposed automatic notability it creates. Please look at this ARBCOM discussion, this AN discussion, and these AFDs 1 2 and this entire project that has been operating for years now to clean up the mess of mass-created GEOLAND articles JUST IN CALIFORNIA. Sorry to go all-caps but really, it's an issue. FOARP (talk) 08:31, 7 October 2022 (UTC)
Joe - The global south is possibly even more likely to be the victim of failing mass-created articles than the north. It's just easier to say that an entire class of things on a database all deserve articles when the number of people who will understand that they don't is much more limited. Carlossuarrez46 was only able to get away with mass-creating articles about Iranian "villages" that were not actually villages because no-one who spoke Farsi was there to correct him. Ditto Lugnut's articles about India/Pakistani cricketers. We are not helping people in the Global south by loading up Wiki with content-less articles of this kind. FOARP (talk) 08:31, 7 October 2022 (UTC)
Comments from Andrew Davidson
We don't have clarity on the definition yet. But supposing that we're talking about bulk creation of the Rambot sort then this might be restricted along similar lines to the existing procedures used at WP:RFBOT, WP:AUTOPATROL and WP:AWB in which editors are given trusted status to do high volume edits. Article creation is just a special case of editing in general. Andrew🐉(talk) 10:41, 7 October 2022 (UTC)
Comments from Joe Roe
@FOARP: I don't think anyone here is going to say that Carlossuarrez46 is someone to emulate. But hard cases make bad law and he didn't really "get away with it" in the end, did he? What I'm saying is that at this point we have articles on, for example, pretty much every single populated place in North America and Western Europe; I simply cannot see how we will ever approach that level of coverage for other parts of the world without some form of mass creation from databases. To cut off that possibility now, after using it so successfully to seed our coverage of the affluent, English-speaking world, would be fundamentally unjust. – Joe (talk) 11:59, 7 October 2022 (UTC)
Comments from Rlendog
Vanamonde93 The proposal as written does not explicitly exempt mass creations that meet GNG. In any case, whether they meet GNG can often be subjection - hence many debates at AfD. Which is why I think (per my comment in #2) the criteria should be at least one source that plausibly meets GNG. Rlendog (talk) 15:43, 8 October 2022 (UTC)
Question 5: Minimum article quality when created at scale
Articles created at scale should be required to meet a certain level of quality in addition to minimum sourcing requirements (see Q2).
For example: minimum number of sentences, article size, assessment, ORES score, etc.).
If you support this, you may suggest qualitative or quantitative standards, but a separate question will be required to find consensus for specific requirements.
Support (minimum quality)
# Support. Articles that: 1)were disapproved by the community at the noticeboard for article creation at scale or the Bot Approvals Group or 2)contain no sources or 3)contain only: deprecated sources or sources that are easily editable by unqualified people should be speedily deletable. 1) is equivalent to a deletion discussion, 2 and 3 suggest that the author didn't have any standards whatsoever and their creations cannot be trusted.Lurking shadow (talk) 21:19, 3 October 2022 (UTC)
Moved down.Lurking shadow (talk) 07:16, 5 October 2022 (UTC)
- Support. Mass created articles (in vast majority stubs) mainly live from their creators (or are notable for their creators) and then after, are seldomly read or expanded for years. I suggest to enable the draftification of stubs (sourced or not) per wikipedia guideline and if there is interest by their creators (or the community) they'll expand to meet the AfC requirements.Paradise Chronicle (talk) 07:36, 5 October 2022 (UTC)
- Tentative support, looking at the ideal that an article should be more than a database entry. The support is tentative given the potential difficulty in finding an actionable formulation for what is quite a qualitative consideration. CMD (talk) 08:24, 5 October 2022 (UTC)
- Mass-created articles are always undesirable unless they are of a certain quality. Requiring one or two RSs alone does not guarantee that articles will be encyclopaedic. Scolaire (talk) 15:33, 5 October 2022 (UTC)
- THe articles should be useful to a reader. They should also have enough referencing to show that they meet inclusion criteria. One sentence could be enough though, if it is useful to a reader. Graeme Bartlett (talk) 04:04, 7 October 2022 (UTC)
- See my essays at Wikipedia:Substantive content, @FOARP: would that kind of guidance work? Crouch, Swale (talk) 20:27, 11 October 2022 (UTC)
Oppose (minimum quality)
- IMO the basic requirement of containing a source with SIGCOV (as well as any rules that apply to non-mass-created articles) is sufficient. * Pppery * it has begun... 18:00, 4 October 2022 (UTC)
- Per Pppery. —David Eppstein (talk) 16:34, 5 October 2022 (UTC)
- There's no way to quantify "quality" that can't and won't be gamed. Nor is there a meaningful notion of "quality" that could extend over all topics on which articles can be created, beyond vague platitudes. XOR'easter (talk) 16:36, 5 October 2022 (UTC)
- Quality's a bit of a hard thing to systematically measure. There have been sanctions to restrict users so that they are only allowed to create articles of minimum length (such as 500 words), and maybe some concrete proposal along these lines would be rational. But, at the same time, there are some topics that are notable under current guidelines (such as legally recognized populated places) that can be quite easily created and described at a small prose size while not being terrible. I'm not principally opposed here, but I'm not able to do anything but oppose without a good mechanism for judging the nebulous character of "quality". — Red-tailed hawk (nest) 22:19, 5 October 2022 (UTC)
- Per XOR'easter and Red-tailed hawk. The number of sentences, and article size, are not necessarily a measure of quality. The grading of an article does not necessarily reflect the quality of that article due to human error in applying the (less than completely objective and unambiguous) criteria at WP:ASSESS. The ORES score of an article does not necessarily reflect the quality of that article due to the limits of articlequality model. James500 (talk) 01:23, 6 October 2022 (UTC)
- Per above. Measuring quality and setting a minimum standard of it is difficult and can be gamed. Basic requirements that apply to normal articles should be sufficient for mass created ones too. The problem we need solving is not that mass created articles are incomprehensible. The problem is that such articles (normal or mass created) would've been deleted or not deleted based on the inherent notability of the "subject", and that should be maintained no matter the rate of creation. —CX Zoom[he/him] (let's talk • {C•X}) 12:56, 6 October 2022 (UTC)
- Even a sourced one line stub can be beneficial to the encyclopedia, if for nothing else than a starting point for expansion. Rlendog (talk) 18:05, 7 October 2022 (UTC)
- This sort of thing can work as a sanction at ANI etc... when a problem editor is identified. It's not suitable as a one size fits all approach Blue Square Thing (talk) 08:33, 8 October 2022 (UTC)
- Quality is a matter of opinion and all articles can be improved. The key factor is sourcing. BoJó | talk UTC 10:03, 8 October 2022 (UTC)
- Per WP:STUB and WP:NOTABILITY, it is the hard to define attempt to simply create a mass of articles to inflate an editors creation listing that it the issue that needs addressing. LessHeard vanU (talk) 16:34, 11 October 2022 (UTC)
- We already have clear guidelines for notability and verifiability that cover all possible articles, and we also already have processes for developing a consensus about when articles need to be deleted. The last thing we need is more rules. Steven Walling • talk 21:06, 11 October 2022 (UTC)
Comments (minimum quality)
Comments from Rhododendrites (minimum quality)
Adding this question because it comes up frequently and would be useful to resolve one way or the other. Not supporting or opposing at this time. — Rhododendrites talk \\ 21:07, 3 October 2022 (UTC)
@Lurking shadow: I agree with those SD criteria for articles created as scale. Moving forward with creation despite rejection at the TBD venue seems straightforward but isn't really related to quality. My thinking with this question was "the community will only approve requests to mass create articles that meet X, Y, Z criteria for minimum quality," and specifically "in addition to minimum sourcing requirements" set forth in Q2. I imagine 2 and 3 would be covered by Q2, no? — Rhododendrites talk \\ 21:25, 3 October 2022 (UTC)
Comments from Pinguinn (minimum quality)
I don't see how this would be workable. Of the four suggested criteria, the first and second lend themselves to poor writing, as any student tasked with meeting a length requirement on an assignment can attest. The third is based on purely subjective criteria meant to be used by individual reviewers. The fourth is based on an algorithm based on machine learning, not objective criteria. Though there are undoubtedly other ways to measure articles, I strongly suspect they'll all fall short as either not objective or not indicative of article quality. Pinguinn 🐧 05:22, 4 October 2022 (UTC)
Comments from Abductive (minimum quality)
The problem is not an undefinable "quality" or lack thereof, but whether the articles are "encyclopedic". In the case of species, there are often users who mass-create stubs which are basically copies of the database entries that they are sourced to. These creations will meet (or can be made to meet) any of the above definitions of notability or quality, but are still useless. Why? Because they don't provide any more information to the reader than the database item that they would find if they Googled a species with no Wikipedia article. Fundamentally, the problem is users who are unclear on the scientific rationale for us humans calling something a species, and on the purpose of an encyclopedia. But this cannot be codified for even a straightforward topic such as species, let alone for all possible mass-creation endeavors. Abductive (reasoning) 14:08, 4 October 2022 (UTC)
Comments from ActivelyDisinterested
I'm not going to vote on this yet, as proposed it looks bureaucratic. However I wonder if something is needed in regard to WP:BLPs, were mass creation of stubs could be problematic. -- LCU ActivelyDisinterested ∆transmissions∆ °co-ords° 18:54, 4 October 2022 (UTC)
Comments from Espresso Addict
I feel this is the direction we ought to be heading, but I can't see how a definition can be made that won't just lead to deliberate bloating. That's certainly been my experience with creators wanting to get over the 1500-character DYK threshold. Espresso Addict (talk) 19:43, 5 October 2022 (UTC)
Comments from TheCatalyst31
I'm interested in this proposal, if only because it's similar to how WikiProject National Register of Historic Places solved its own mass creation problem years ago. The project essentially took a hard line against two-sentence stubs with only a database source that didn't explain why a historic site was historic, and while it took some effort and an ArbCom case to informally put it into practice (and we still haven't cleaned up all the substubs created beforehand), it's at least helped us cut down on new articles that don't state their importance. That being said, I would like to see how this would be implemented before I support; it's easier to do this for a single topic area than the entire project, and short stubs are more valid for some topics than others. TheCatalyst31 Reaction•Creation 03:36, 6 October 2022 (UTC)
Comments from WhatamIdoing (minimum quality)
The devil is in the details. In principle, I would support a low minimum quality (e.g., two sentences + two sources) but I would never support a minimum quality that is higher than the quality of the median Wikipedia article. The median article is a stub, and although we don't actually have exact stats yet (I asked for one recently), my guess is that the median article is about 8 sentences and 3 inline citations (including non-independent sources). A new article should never be required to be better than 50% of existing articles. WhatamIdoing (talk) 04:49, 8 October 2022 (UTC)
Comments from Scolaire (minimum quality)
Twenty years ago many articles were created as stubs and grew to be sizeable articles; this happens very rarely now. Stubs tend to be just left; in the case of creation at scale, they are left by the article creator who doesn't care that users are not going to volunteer to expand them. This harms the project, I believe. In the FritzpollBot poll in 2008 I said: "A paper encyclopaedia is biased towards what the editors want us to know. Wikipedia is biased towards what we ourselves want to know. Correction of that bias by creating articles about what we don't want to know serves no educational purpose while creating the potential for harm (accidental or deliberate misinformation etc.)." The result of that poll was that the bot was approved, but it never accomplished its task, to the relief of many. Mass-created substubs are doing the same thing in 2022, with the same undesirable consequences. Scolaire (talk) 15:06, 10 October 2022 (UTC)
Comments from Editor X (minimum quality)
Please open your own section with username in the heading. Please limit comments within a section to 300 words.
Question 6: New mass creator permission
Withdrawn
|
---|
Support (New mass creator permission)
Oppose (New mass creator permission)
Comments (New mass creator permission)Comments from Joe RoeWhat would the permission allow you to do that ordinary editors can't? – Joe (talk) 06:45, 5 October 2022 (UTC)
Comments from Espresso AddictJoe Roe: The idea is that only those editors with the permission would be able to do whatever it is we eventually define as mass creation without sanctions. Clearly I need to make that clearer somehow, though not sure how I am meant to do that. Espresso Addict (talk) 06:52, 5 October 2022 (UTC)
Comments from Thryduulf (Q6)The definition of what is and isn't "mass creation" is of necessity going to be blurry and not reducible to numbers. Even if it were it wouldn't be useful as what matters is not who is mass creating but what is being mass created. 09:43, 5 October 2022 (UTC) Comments from Editor XPlease open your own section with username in the heading. Please limit comments within a section to 300 words. |
Question 7: Should we adopt a new speedy deletion criterion that relates to mass-created articles that lack any sourced claims of importance?
This proposal would create a new speedy deletion criterion, A12, as follows:
Support (proposal name)
Oppose (proposal name)
Comments (proposal name)Comments from Thryduulf (Q7)Determining whether an article is part of a mass creation set may mean analysing months of contributions which cannot be done by admins patrolling speedy deletion categories, also one reason A7 is topic-limited is that it is not possible for a single admin to reliably determine what is and isn't a claim of significance in all subject areas. This is also out of scope for this RfC. 09:49, 5 October 2022 (UTC) Comments from Editor XPlease open your own section with username in the heading. Please limit comments within a section to 300 words. |
Question 7a:
For AfD RfC; Has been added to a section at WT:ADAS.
|
---|
Should we instead introduce the following speedy deletion criterion:
Support (proposal name)
Oppose (proposal name)
Comments (proposal name)
Comments from Editor XPlease open your own section with username in the heading. Please limit comments within a section to 300 words. |
Question 8: Delete ranking of editors by created articles
Delete the list of editors by numbers of articles created, Wikipedia:List of Wikipedians by article count.
Support (delete ranking)
- Support as nominator. One motivation for bulk article creation appears to be gaining perceived prestige as an editor. Much of this is difficult to combat, but simply deleting the ranking is one simple step that might help to change the culture without obvious negative effects. I note (which I had not realised) that in April 2021 the order of the top 100 editors was randomised in an effort to discourage mass stub creation. Espresso Addict (talk) 05:58, 5 October 2022 (UTC)
- This is a bold idea, and I find myself drawn to it. If needed, we already have a marker of an experienced article creator, the autopatrolled right. The list stems from a nice bit of code so I wouldn't want the information to be inaccessible, but certainly it doesn't need to be so prominent. CMD (talk) 08:29, 5 October 2022 (UTC)
- Reading other the many argumentative threads around mass creation this is a thought I've previously had. It seems a ridiculous measure, but the fact that action has previously been taken about it, as noted by Espresso Addict, hints towards this being necessary. -- LCU ActivelyDisinterested ∆transmissions∆ °co-ords° 13:09, 5 October 2022 (UTC)
- Creating new articles should be for the sake of the project, not for gaining plaudits. Scolaire (talk) 15:38, 5 October 2022 (UTC)
- Support for many reasons. One of them is the one brought forward by Espresso addict.Paradise Chronicle (talk) 16:07, 5 October 2022 (UTC)
- We already scramble the top 100, and while there are plenty of excellent and valuable editors in that group, there are also a number of editors who are banned for various reasons, including a few whose contributions were part of why we're having this RfC in the first place. Deleting the page isn't going to solve the problem on its own, but it seems worth doing. TheCatalyst31 Reaction•Creation 03:26, 6 October 2022 (UTC)
- Support - The ranking only encourages editors who are here to get a "high score" rather than build a quality encyclopedia, as shown by the number of Top 10 creators who have been banned and left a huge pile of stubs for others to sort through. We're better off without it. –dlthewave ☎ 03:56, 6 October 2022 (UTC)
- The top 100 is already scrambled as it is, making the page not even effective at its current job. Thus its only purpose right now is to send the wrong message about what Wikipedia is about. —pythoncoder (talk | contribs) 04:05, 6 October 2022 (UTC)
- I don't think this has any direct relevance to mass-creation, but I would go further and remove any ranking of editors (whether by number of articles created, edits made, DYKs, or anything else) that could in any way be thought to be "official". Such rankings make editors concentrate on quantity rather than quality. We recently had an editor whose reply to any concern about his editing was simply to point to the number of articles he had taken to GA. That sort of thing should be stopped. Phil Bridger (talk) 08:13, 6 October 2022 (UTC)
- Support — In academia, emphasis on citation count has led to a publish or perish culture, where researchers are encouraged to publish as many ground-breaking articles as possible, without regard to quality. This process has resulted in many cases of scientific misconduct and other questionable research practices, such as plagiarism and data fabrication, in an attempt to cut corners where possible. One of the most extreme cases is that of Yoshitaka Fujii, a Japanese anesthesiologist who was found in 2012 to have fabricated data in at least 183 papers. Any quantity-based status is prone to similar abuses. Several users near the top of the closely related Wikipedia:List_of_Wikipedians_by_number_of_edits, such as Rich Farmbrough, Koavf, and Lugnuts have specific sanctions targeting mass edits, such as unnecessary cosmetic changes and low-quality stubs. That does not mean that this other list should be deleted, since there are large-scale editing patterns with little risk of error due to their systematic nature — for example, the top-ranking user, Ser Amantio di Nicolao, is mostly involved in very large-scale category changes and talk page creeations using AWB or Cat-a-lot. However, mass creation of articles requires more careful discretion, such as ensuring that the database used is reliable and can be used to support WP:GNG; and is much more damaging to Wikipedia than say, mass talk-page creation, when it goes wrong, because we often have to scrutinize a large fraction of those articles which don't have obvious problems. –LaundryPizza03 (dc̄) 09:40, 11 October 2022 (UTC)
- Support Stat-padding, as it is known in other places on the Interwebz, is the bane of online gaming communities. By removing ranking it may well demotivate those who mass produce low quality articles, while hopefully those with the encyclopedias best interests at heart will pay it no need. Anyone can put the their stats up on their homepage if they feel the need. LessHeard vanU (talk) 07:32, 12 October 2022 (UTC)
Oppose (delete ranking)
- I don't disagree that there's the potential for harm, but it's already randomized to the point of being pretty useless and anyone can get an unrandomized list through a quick Quarry query (and widely publicize that query) exactly like those at the top of this page. Maybe having it in projectspace makes it look a little too official/endorsed/valued, in which case a move might be in order. Meh. — Rhododendrites talk \\ 13:08, 5 October 2022 (UTC)
- I don't think that the real cause of mass-creation is the fact that the list exists. Such numbers are also available on XTools and WikiScan, so if the point is to censor the amount of articles one has created so as to not allow someone to be overly prideful, then this is not really an effective implementation anyway. — Red-tailed hawk (nest) 16:25, 5 October 2022 (UTC)
- Pointless proposal. I'm an admin on pi:, and found several bot created articles, with a single word or sentence and nothing else. There is no bot generated report there, why did the creator do it? Furthermore, even though I'm a complete novice in coding, I was easily able to create a list of editors by move count at Quarry in about an hour's time. I could do it just as easily for article creation count. Someone who really wants to know their article creation count can find it directly from databases. I don't think, this proposal is going to change a single editor's mind let alone solve the problem of mass creation. —CX Zoom[he/him] (let's talk • {C•X}) 13:11, 6 October 2022 (UTC)
- No point, as that list would not be a motivation for creating substubs rather than useful pages. Graeme Bartlett (talk) 04:14, 7 October 2022 (UTC)
- I think rankings are silly and can incentivise unhelpful behaviour, but I'm not sure deleting this one would be a net benefit. First, because the list can be useful: for the overall statistics or as a maintenance list (if someone is very high up there, then their creations will most likely need scrutiny). Second, I don't think this particular list has a great influence on editor behaviour: my impression has been that, to a larger extent than with other rankings, people have tended to ignore the report and just kept their own manual counts on their user page. – Uanfala (talk) 13:00, 7 October 2022 (UTC)
- We have a process for deleting articles and it does not include 'arbitrary decisions taken on subpages of subpages of pages completely unrelated to the page in question'. If there's a policy-based reason to delete Wikipedia:List of Wikipedians by article count, it should be nominated at MfD. At best, a consensus here should be taken as a reason to have that MfD. Nobody's even bothered to notify the page's creator or watchers that this discussion is happening. – Joe (talk) 13:24, 7 October 2022 (UTC)
- Seems bizarre to me to remove a list of created articles. If there are issues with people mass producing articles, then this list would be helpful to know who's making the most and fix the issue. Without a list - there are no ideas. Lee Vilenski (talk • contribs) 14:01, 7 October 2022 (UTC)
- Disclaimer: list creator. Thank you Joe for the notification. There is no evidence this page is contributing to mass stub creation. There was evidence at one time involving a single user, who has since been permanently banned. That incident led to the top 100 scramble, which appears to be effective. Many users track their contributions this way, who are not bad actors. I don't mind moving it out of project space if so decided. And if that doesn't work, I can move it off-site as I have a wiki that can host it. There is demand for this list by many who are not bad actors. Some of the delete votes appear to dislike the egotistic nature of these lists which is fine they are not for everyone but these lists can also motivate and reward legitimate and positive behavior. I also don't think it's a good idea in principle to give up control of the list to offsite actors. -- GreenC 14:39, 7 October 2022 (UTC)
- Unlikely to make a difference, particularly now that the top 100 are randomized. (I also agree with the sentiment that RfCs are not for things covered by the deletion process, although that's an auxiliary issue.) Extraordinary Writ (talk) 17:49, 7 October 2022 (UTC)
- Don't see a need for this. Rlendog (talk) 18:01, 7 October 2022 (UTC)
- The rankings tool is a helpful way to find the most prolific article creators, whose work we should watch.—S Marshall T/C 18:56, 7 October 2022 (UTC)
- It seems pointless to remove something that would clearly just move to some other server and still get used in the same sort of way Blue Square Thing (talk) 08:40, 8 October 2022 (UTC)
- A case of "it ain't broke, so don't fix it". BoJó | talk UTC 10:06, 8 October 2022 (UTC)
- Oppose, that page was created to reduce the load on Wikiscan and doesn't cause people to go cray-cray with the stub-creation-ay. Abductive (reasoning) 18:33, 9 October 2022 (UTC)
- Oppose - Not needed. --Jax 0677 (talk) 23:33, 9 October 2022 (UTC)
- I highly doubt anybody is creating loads of articles in order to rank more highly on that rather obscure list. Hut 8.5 11:45, 11 October 2022 (UTC)
- Oppose as not necessary given the scrambling. Hut 8.5 - I know that a number of mass-creators were regular visitors to the talk-page at that page and obviously checked it regularly, so even applying AGF, not a stretch to say they were at least interested in who was ranking highly on it. FOARP (talk) 14:27, 11 October 2022 (UTC)
- No, its useful and interesting for other editors who may need it for research purposes. Crouch, Swale (talk) 20:29, 11 October 2022 (UTC)
Comments (delete ranking)
Comments from XOR'easter
I kinda feel that anyone who really gets their jollies from racking up their stub creation count would get much the same feeling from, e.g., listing their created stubs on their User page. Meh. I'm having a hard time seeing that there would really be a benefit or a loss for eliminating this list. XOR'easter (talk) 16:32, 5 October 2022 (UTC)
Comments from Espresso Addict
XOR'easter: I'm sure most editors get satisfaction from listing their creations in their user space but that does not make them readily findable (especially by new editors) nor easily comparable to others, nor does it send a meta-message that one's value as an editor can be measured in how many articles one creates. It also has the benefit of reminding the editor that they started that article, and might want to come back to update or improve it, which the ranking does not. I don't think the effect will be large but I think it might be a small positive step. Espresso Addict (talk) 19:00, 5 October 2022 (UTC)
Rhododendrites: I think most editors don't know how to use Quarry or similar. Having an "official" ranking available in project space and linked from various help guides seems to give it too great a prominence. Espresso Addict (talk) 19:10, 5 October 2022 (UTC)
Comments from Editor Paradise Chronicle
Going with the logic that the creators list encourages the creation of stubs, an expanders list could be thought of. I know the GA and FA list exists, but one for raising stubs to start articles or adding several sections to an article would help as well.Paradise Chronicle (talk) 04:52, 7 October 2022 (UTC)
@Lee Vilenski it is no about removing the list of created articles, but about removing the ranking Wikipedia:List of Wikipedians by article count. Maybe you wanted to vote in another discussion where this is discussed? And I'd be interested where this I discussed, I would also oppose the removal of a list of created articles.Paradise Chronicle (talk) 10:54, 8 October 2022 (UTC)
Comments from FOARP
I hate to be go WP:BURO, but this is really a discussion to have at WP:MFD and not here. FWIW I have plenty of concerns about this page which I've discussed a number of times on the talk page there, but I don't get how the scrambling hasn't already addressed this.FOARP (talk) 08:47, 7 October 2022 (UTC)
Comments from (Your Name)
Please open your own section with username in the heading. Please limit comments within a section to 300 words.
Question 9:Should mass-creators be forced to respond to issues?
If there is a discussion on the mass-creation noticeboard, should the mass-creator be required to address the concerns on that noticeboard if the mass-creator is editing elsewhere?
Support (proposal name)
- Support. People are responsible for their edits. It is much easier to spot issues if the author informs you about what and why they did it.(Communication is required). And mass-creation has the potential for huge damage to the encyclopedia.Lurking shadow (talk) 10:10, 5 October 2022 (UTC)
- Support in principle, although some refinement of what it means to "respond" and what happens if you don't require elaboration - I don't think we can literally force people to respond, but we can establish that a failure to respond means that editors can make negative inferences about their intent and process, and that this is sufficient to justify sanctions preventing them from mass-creating articles in the future. This is necessary because of the danger of WP:FAIT - there is too much risk that someone could mass-create a bunch of articles and then try to stall out the process by refusing to engage, hoping that the difficulty of reversing their action leads to it remaining in place. --Aquillion (talk) 15:03, 5 October 2022 (UTC)
Oppose (proposal name)
- Oppose - Our current noticeboards work just fine without this requirement. If an editor doesn't show up to explain/defend themselves, we make decisions based on the information that we have, and there's no need to make a written rule that failure to comment may be viewed negatively. –dlthewave ☎ 21:37, 5 October 2022 (UTC)
- Per WP:NOTMANDATORY. We cannot compel ordinary editors to contribute to a particular noticeboard or thread; the only people that we expect this of are holders of advanced permissions, who are generally required to maintain the trust and confidence of the community. On the other hand, the decision to actively edit and not respond to an WP:ANI thread can at times indicate that a user has little or no interest in working collaboratively, and we should not prohibit the community from analyzing a user's conduct along those lines. — Red-tailed hawk (nest) 22:11, 5 October 2022 (UTC)
- The noticeboard whose existence is implied in this question ideally shouldn't exist. That said, if it is actually created against the concerns voiced at #Oppose_(Create_noticeboard), there's no need for a specific ADMINACCT-like requirement to be codified for that one noticeboard. Users are already expected to respond to community concerns about their editing (WP:DISRUPTSIGNS: "Does not engage in consensus building", "Rejects or ignores community input"), and WP:NOTCOMPULSORY limits this RfC's ability to go beyond this. ~ ToBeFree (talk) 22:13, 5 October 2022 (UTC)
- Per WP:NOTCOMPULSORY. We cannot force editors to make edits. James500 (talk) 01:35, 6 October 2022 (UTC)
- Oppose If they don't show up, they'll probably show up if the decision taken was perceived negatively by them. Also, mass creators are not new editors, they are rather experienced editors and know how noticeboards and talk-page notifications function.Paradise Chronicle (talk) 09:39, 6 October 2022 (UTC)
- If they don't respond, that is not necessarily a serious problem. But if they continue with causing a problem with article creation, they can then be blocked, perhaps after AN/I discussion. There could be use of a noticeboard, or the user's talk page. Graeme Bartlett (talk) 04:12, 7 October 2022 (UTC)
- Oppose - We cannot force behaviour and should not try. The one thing I would request is that mass-creators at least do not battle to prevent clean-up - BTW this is typically the reason that mass-creators get banned from Wikipedia: not because of their mass-creation per se, but because they then get very angry when others start questioning the value of their X-thousand articles on Y-subject created manually from Z-database and engage in uncivil and disruptive behaviour against the people doing the clean-up. This then results in months of unpleasantness for the people doing the clean-up ending at ARBCOM (because ANI simply isn't up to the task of banning these typically long-tenured and influential editors). It is striking that only one of the top-ten article-creators (Ser Amantio di Nicolao) is still an active editor today, with all the rest being indeffed (3) retired under a cloud (3) semi-retired/just not very active (3), and that that one remaining editor has admirably stayed out of attempts to clean-up their articles (particularly the mass-created ones about non-notable geographical features in Antartica based solely on the GNIS database created back in 2008 when standards were more relaxed).FOARP (talk) 09:20, 7 October 2022 (UTC)
- Better dealt with at ANI and through a voluntary approach via sanction negotiation etc... Blue Square Thing (talk) 08:41, 8 October 2022 (UTC)
- You can't FORCE people to do anything. If they won't respond, WP:ENGAGE applies. BoJó | talk UTC 10:09, 8 October 2022 (UTC)
- Oppose - Oppose per WP:NOTMANDATORY. --Jax 0677 (talk) 23:36, 9 October 2022 (UTC)
- Oppose Being involved in a scandal is just humiliating — the only involvement one should have in cleaning up their act is to make defending statements to the public or in the courtroom. And we don't want to shame people to the point of a catastrophic mental health breakdown, or else we risk getting sued in real life. Carlossuarez46 quit after the mass-creation scandal about Iranian abadis broke out, declining even to comment at the resulting Arbcom case. –LaundryPizza03 (dc̄) 09:48, 11 October 2022 (UTC)
Comments (respond to issues)
Comments from BeanieFan11
You can't force someone to comment on a page. BeanieFan11 (talk) 18:20, 5 October 2022 (UTC)
Comments from ActivelyDisinterested
You can't force editors to do something, but if they continue with something that is considered distruptive then there is WP:ANI. -- LCU ActivelyDisinterested ∆transmissions∆ °co-ords° 17:52, 7 October 2022 (UTC)
Comments from Editor X (respond to issues)
Please open your own section with username in the heading. Please limit comments within a section to 300 words.
Question 10: add mass-creation as a reason to WP:BUNDLE
Next RfC; + to WT:ADAS
|
---|
Should we add mass-creation of articles by the same editor using substantially the same sources and format as a reason for bundling multiple articles into a single AFD at WP:BUNDLE? Support adding mass-creation as a reason to WP:BUNDLE
Oppose adding mass-creation as a reason to WP:BUNDLE
Comments on adding mass-creation as a reason to WP:BUNDLEComments from ActivelyDisinterestedI support this, but as other have said this is probably best discussed in the AfD part of the RFCs. -- LCU ActivelyDisinterested ∆transmissions∆ °co-ords° 13:21, 5 October 2022 (UTC) Comments from WjematherThis is an appropriate proposal for the next RFC, i.e. "Article deletion at scale", but not this one. wjematherplease leave a message... 13:48, 5 October 2022 (UTC) Comments from Editor XPlease open your own section with username in the heading. Please limit comments within a section to 300 words. |
Question 11: Should we allow (or disallow) mass-created articles that lack any sourced claims of importance?
This proposal establishes a principle that all mass-created articles must contain a reliably sourced claim of importance. This would apply to any mass-created article that does not indicate why its subject is important or significant. This is a lower standard than notability. This proposal does not imply that all mass creations of articles that contain reliably sourced claims of importance are acceptable; it merely identifies that mass creations that do not adhere to this principle are generally not acceptable. This is non-exclusive with any option from Question 2; should there be a rough affirmative consensus in favor of both this proposal and a proposal in Question 2, both shall take effect.
Support (sourced claims of importance)
- This establishes a bare minimum for mass-created articles, which is that they contain a source that verifies their claim of significance. This is non-exclusive with any particular option in Question 2 because this does not make any comments regarding sources that would necessarily contribute to GNG. This principle is eminently reasonable and more or less any mass creation that fails this would probably require a good bit of legwork to clean up if not resulting in much of the mass creation eventually being deleted. Failure to follow this principle could either be enforced as a simple disruptive editing case or, should the community decide in another RfC or question, through some modification to WP:DEL-REASON and/or the creation of a CSD criterion. — Red-tailed hawk (nest) 22:00, 5 October 2022 (UTC)
- I note that the heading here is ambiguous. But a claim of importance should be required, and the information should be sourced. A claim of importance may include SNG evidence. Enough good references also show GNG and importance. If no claim of importance then speedy delete criterion A7 may apply, and if its out of scope then claims will avert being an obvious AFD candidate. — Preceding unsigned comment added by Graeme Bartlett (talk • contribs) 04:19, 7 October 2022 (UTC)
- Support. We need to have some sort of bare-minimum standard for mass-created articles or discussions over them are just going to spin wheels endlessly. And this is a reasonable extension of WP:A7, which can be tweaked to be more in-line with it if necessary. --Aquillion (talk) 18:21, 7 October 2022 (UTC)
Oppose (sourced claims of importance)
- Without any definition of what constitutes a "sourced claim of importance" that is objective and applicable to all subjects this is just a recipe for arguments. Thryduulf (talk) 15:13, 6 October 2022 (UTC)
- "Sourced claim of importance" is one of those things that seems unambiguous but actually at the coalface is rather hard to agree on. Is being a populated place a claim of importance? A listed building? An athlete who competed at the Olympics? Or was in their national top 10? An academic with two papers with >100 citations? Also database sourcing is often (though not always) reasonably reliable, it's the "significant coverage" that is lacking. Espresso Addict (talk) 23:54, 6 October 2022 (UTC)
- The opposite of what is required. Sets the bar way too low and potentially re-opens the door to mass-creation of (for example) sports biography stubs based on a single appearance somewhere, sourced only to a database, that was closed in WP:NSPORTS2022. wjematherplease leave a message... 12:40, 7 October 2022 (UTC)
- This is attempting to apply the standard of WP:CSD#A7 to mass article creations, with the added requirement of a source for the claim. A7 has been restricted so that it's usually obvious whether an article has a claim of importance or not, and attempts to expand it usually fail because it wouldn't necessarily be clear what qualifies. However any admin is likely to give the benefit of the doubt to the article creator, and certainly any article which indicates the subject might meet an SNG or which includes some non-terrible references is unlikely to be deleted under this criterion, so it probably wouldn't make much difference anyway. Hut 8.5 16:40, 7 October 2022 (UTC)
- Oppose. What is a "claim of importance"? If it meets GNG it's fine. As per #2, I think if it has one decent source at creation, it should be considered to meet minimum standards for creation. Rlendog (talk) 18:03, 7 October 2022 (UTC)
- Importance is like significance – a value judgement – and different people have different values. For example, I would suppose that being included in a national register of historic buildings is implicitly a claim of importance but some seem to expect something more. As another example, consider a forthcoming FA: Daglish railway station. There doesn't seem to be a clear claim of importance for this – it all depends whether you think ordinary railway stations are important or not. And railway stations are the sort of topic that tend to be created systematically, en masse. We have lots of articles about them, such as this FA, and so they seem to be generally acceptable. We should be going by such precedents rather than requiring them to be proven afresh, every time, in a subjective way. Andrew🐉(talk) 18:24, 7 October 2022 (UTC)
- I assume this is about CSD's concept of Wikipedia:Credible claim of significance, which explicitly does not apply to many subjects (e.g., animal species). It's nice when a Wikipedia article gives me a reason to care about the subject, but subjects can be notable, and articles about them can be useful, even if I don't care. If I'm looking up the subject, I already know why I want to know what it is; I don't need the article to tell me nearly as much as I need the article to have a decent first sentence. WhatamIdoing (talk) 04:55, 8 October 2022 (UTC)
- Who judges the claim of importance? I've seen this used to defend articles about obviously non-notable subjects before. Not, in my view, the way to go Blue Square Thing (talk) 08:44, 8 October 2022 (UTC)
- We already have notability. Something important to one person is unimportant to another. BoJó | talk UTC 10:13, 8 October 2022 (UTC)
- Oppose Conflicts with WP:STUB and WP:Notability, which are well established guidelines. LessHeard vanU (talk) 07:34, 12 October 2022 (UTC)
Comments (sourced claims of importance)
Comments from BeanieFan11
How do we determine what counts as a "claim of importance"? Would playing in the NFL count? What about participating in the Olympics or playing MLB? Or does it have to be something much higher to count as "important"? BeanieFan11 (talk) 14:13, 6 October 2022 (UTC)
Comments from Thryduulf (Q11)
A7 is subject-limited in part because it is often not possible for someone unfamiliar with a topic to reliably judge what is and isn't a claim of significance or importance and many of the SNGs are about giving guidance about this, but disagreement and arguments about interpretation of SNGs is one of the reasons we are here so this is likely to be both duplicative and unproductive. Thryduulf (talk) 15:20, 6 October 2022 (UTC)
Comments from Wjemather (sourced claims of importance)
@BeanieFan11: My understanding would be that (for example) participating in the Olympics is a credible claim of importance even if it is already well established that it is not a credible claim of notability. wjematherplease leave a message... 12:46, 7 October 2022 (UTC)
Comments from ActivelyDisinterested
Claims of importance or notability just make the issue to clouded. We need a proposal that mass created articles must have at least one, non-database, reference that is not from an unreliable source. This should be the requirement for any stub article, but that is a different discussion. -- LCU ActivelyDisinterested ∆transmissions∆ °co-ords° 17:57, 7 October 2022 (UTC)
Comments from Editor X
Please open your own section with username in the heading. Please limit comments within a section to 300 words.
Question 12: Editors who mass-create stubs have a duty to expand the articles later
It's okay, and sometimes beneficial, to create a lot of stubs, but it's not okay to leave them languishing in the stub state for years on end. Other editors are unlikely to be interested in expanding them past the stub state; therefore, editors who mass create stubs are expected to expand those articles within some reasonably generous timespan. If they don't, we can require them to stop creating stubs until they have expanded most of the stubs they have previously created. 05:22, 8 October 2022 (UTC)
Support (duty to expand)
- support statement
Oppose (duty to expand)
- There's no way to enforce this. Stubs as seeds for more developed articles are, to me, a reasonable way forward so long as there's actually some way of developing them. And they're more likely to be developed if they exist as stubs than if they don't. Blue Square Thing (talk) 08:46, 8 October 2022 (UTC)
- The idea that articles which remain stubs are unacceptable is just wrong and not consistent with the view of the community. Just over half of our articles are stubs, and most of them have remained that way for years. A proposal to delete half of the articles on Wikipedia would be dead on arrival. Hut 8.5 09:05, 8 October 2022 (UTC)
- Nothing at all wrong with any stub that has a reliable source. Why pick on stubs when there are volumes of "start-class" articles that have been "expanded" by conversion of a useful stub into an incoherent mess? BoJó | talk UTC 10:22, 8 October 2022 (UTC)
- Its a volunteers project.Paradise Chronicle (talk) 10:39, 8 October 2022 (UTC)
- This can't be enforced, and is a more complicated version of requiring a bit more substantiveness at the point of creation, which is much easier to assess. CMD (talk) 04:26, 9 October 2022 (UTC)
- Oppose generally. There can be exceptions such as proving notability and ensuring compliance with copyright laws and WP:BLP but not for everything. Emir of Wikipedia (talk) 20:32, 9 October 2022 (UTC)
- Oppose - Oppose per WP:TOOSHORT. --Jax 0677 (talk) 23:37, 9 October 2022 (UTC)
- No as long as at the time of creation they are acceptable. Crouch, Swale (talk) 20:30, 11 October 2022 (UTC)
- You can't require someone to edit. We're all volunteers here. Steven Walling • talk 21:07, 11 October 2022 (UTC)
- Reluctatnt oppose, because this focus' upon the article and not the creator - who is the problem. We have criteria for the speedy deletion of poor quality content, while struggle to deal with problematic editors. LessHeard vanU (talk) 07:42, 12 October 2022 (UTC)
Comments (duty to expand)
Comments from WhatamIdoing (duty to expand)
This proposal is based on some comments from multiple editors in previous discussions. Note that 3.7 million out of our 6.5 million articles are rated as stubs. I think that argues against this idea, but others may think it argues for it. WhatamIdoing (talk) 05:22, 8 October 2022 (UTC)
Comments from Espresso Addict (duty to expand)
I'm conflicted on this one. I wish people who had sources in their hand would expand stubs more than they do, but one can't really make an editor do anything (except go away). There's a fundamental problem that's been nagging at me that articles do, actually, have an owner, because the notification system sends notifications to the original creator. This means that if I were to create 100 "articles" of the form "XYZ is a listed building in ABC.<ref_Listing, ref_Pevsner>", I might just get away with it, but I would stellarly annoy anyone who had been planning to create one of them when they'd just amassed enough sources to write a "proper" article. On the other hand, someone might read it and think, "I pass XYZ on the way to work every morning, I'll take a quick photo" or even "I used to live near XYZ, surely the listing says more than that [click]." Espresso Addict (talk) 06:08, 8 October 2022 (UTC)
Comments from Editor X (duty to expand)
Please open your own section with username in the heading. Please limit comments within a section to 300 words.
Question 13:
Neutral description of proposal.
Support (proposal name)
- support statement
Oppose (proposal name)
- oppose statement
Comments (proposal name)
Comments from JoelleJay (proposal 13)
I'd like to workshop this proposal before putting it to !vote. How about a baseline question: "Should there be a per-editor limit to the number of articles created after [X date] that do not contain sufficient sourcing to meet a relevant notability guideline?"
Further details could be:
"A single editor cannot have more than [Y number] articles on GNG-governed subjects created after [X date] that did not contain [a source of SIGCOV|2 sources of SIGCOV] at the time of creation. Surpassing [Y number] will trigger escalating warnings leading to a p-block from article creation. [Y number] can be offset by adding GNG sources to articles they created, or by adding GNG sources to other articles that have never met GNG. The latter option is available to all editors regardless of the status of their own creations, and can be built up as their [Z number]."
There is no need to hammer out what is "SIGCOV" in this RfC; if we don't need a clear definition for AfC/AfD/NPP/etc. to work, we don't need one here either.
- Honestly, I think we should be strongly discouraging putting articles into mainspace that don't have sources demonstrating notability, and having a limit any one editor can have of such articles would help. So I think this should be evaluated at the creation time. And while adding any sourcing to any article is great, for the purposes of restricting undersourced article accumulation we should stick to "Z credits" being earned only by bringing non-GNG-demonstrating articles to demonstrating-GNG status. 00:03, 9 October 2022 (UTC)
Comments from Espresso Addict (proposal 13)
- "at the time of creation": suggest within 48 hours/5 days of creation; many editors don't put all their sources in in the first edit. Also need clarity re creation in mainspace vs userspace or draftspace; it's ok to create drafts there that don't comply with some mainspace policies.
- I suspect that commenters will want more information on what [Y number] is in advance of voting. 1? 5? 10? 100?
- "to other articles that have never met GNG": is it necessary to limit it to these? Any sourcing work is valuable.
- There's going to be a major problem with how technically feasible this all is to determine. No bot is going to be able to determine what a GNG-compliant source is, and editors will need access to the source, which might be offline. Eg, if I add a source to one of the Pevsner guides, who's to say whether there's enough coverage therein? It could create a lot of work for experienced editors. Espresso Addict (talk) 21:55, 8 October 2022 (UTC)
Comments from WhatamIdoing (proposal 13)
no sources at all
but the real world has:
hundreds of non-trivial newspaper reports, magazine articles, and books
then the subject is notable.JoelleJay, I think this needs more work. To start with, sourcing doesn't make a subject notable, and the absence of sourcing doesn't make a subject non-notable. I proposed the box on the side here for the relevant section of WT:N, but editors were alternately concerned that it would encourage editors to be lazy about citing sources (I guess these editors think that if citing sources isn't absolutely mandatory on penalty of article deletion, then nobody will do it), or that it might be possible to have literally hundreds of non-trivial newspaper reports, magazine articles, and books written about a subject, but still somehow the subject wouldn't be notable (I'm not sure what they're thinking).
So the first thing is that you'd have to change the wording to be about demonstrating notability, rather than being notable: "Should there be a per-editor limit to the number of articles created after [X date] that do not contain sufficient sourcing to meet demonstrate compliance with a relevant notability guideline?"
Second, I think you might want to consider a much more fundamental proposal: "As of <date>, all new articles must name at least one Wikipedia:Independent source." Right now, only BLPs are required to cite any sources at all. You could move User:WhatamIdoing/Christmas candy to the mainspace, and the worst that anyone could do to it is to slap an {{unref}} tag on it. Per User:Espresso Addict, there should be a sensible grace period (WP:BLPPROD gives you seven days after being tagged, so it should be the same or longer), but if you want all new articles to identify a specific source, we actually need a rule that requires this. WhatamIdoing (talk) 23:25, 8 October 2022 (UTC)
Comments from Editor X (proposal 13)
Please open your own section with username in the heading. Please limit comments within a section to 300 words.
Question 14:
Neutral description of proposal.
Support (proposal name)
- support statement
Oppose (proposal name)
- oppose statement
Comments (proposal name)
Comments from JoelleJay (proposal 14)
Another potential proposal: Definition of a "database" for the purposes of GNG. "Databases/statistical websites that do not contain significant prose text, written by a human, discussing the subject in detail, do not contribute to GNG". The issue obviously isn't with database pages that also contain substantial secondary prose coverage clearly written by humans, for example OMIM entries like https://omim.org/entry/609423, which has a list of the individual people responsible for writing that specific entry. The issue is with databases that do not contain any human prose written specifically about the subject: that is by definition a primary source and can never contribute to GNG. 20:32, 8 October 2022 (UTC)
- @Espresso Addict, what is wrong with the OMIM example? I included it specifically because it has been used as an example of a "database that contains SIGCOV"; defining "database" for our uses to exclude anything where there is significant secondary prose coverage on the subject would eliminate the issue. 23:48, 8 October 2022 (UTC)
- WhatamIdoing, if a database entry also has its own reliably published embedded video recordings or audio files or some visualization of ASL or whatever that would normally count towards GNG then that's obviously fine. 01:01, 9 October 2022 (UTC)
- Espresso Addict, this isn't some random distinction I came up with just for this RfC, it's directly coming from our P&Gs. Just because one can cobble together verifiable facts from a source doesn't make the source secondary, which is required for GNG and OR. And secondary, per our policy, means analysis of the subject by an author. The database entry you linked clearly would not be sufficient basis for an article because it contains NO commentary on the subject: it is strictly the output of a database query. Just as much material could be collected for any high school athlete in the US.
- WhatamIdoing, the key point is being discussed by other people in secondary RS. And I'm confident the vast majority of editors do NOT interpret GNG as if
significant coverage in reliable sources that are independent of the subject
means "significant coverage achieved by adding together multiple RS that are not necessarily all independent or secondary". That would be nonsensical since just about every topic has some secondary, independent coverage and some primary, non-independent SIGCOV. 04:09, 9 October 2022 (UTC)
- Boca Jóvenes, the distinction between primary and secondary that we use is in the sense of whether information has been analyzed and discussed by someone directly (see the wording in WP:SECONDARY). So raw data that has been "interpreted" by tools (e.g. software that automatically identifies protein domains from a sequence; a db query that returns specific facts in a UX-friendly table), or even the output of human manipulation of data (someone manually compiling an athlete's performance stats) are still primary because we do not have anything that has been said about that information, it's ultimately merely been reproduced. If just curating or annotating data was sufficient to make it secondary, all experimental publications would be considered secondary for the novel research they present. Without people actually commenting on the subject directly, even when there are bountiful verifiable facts attached to it, we cannot discern the importance or encyclopedic merit of it (or of any of its facts). An editor can't just write an article on the relationship between mothers against decapentaplegic and Smurf based solely on string-db.org spitting out their predicted binding interactions; someone has to report that relationship specifically in RS, and then those findings must be discussed by an independent researcher before any of that info can be used on WP. JoelleJay (talk) 23:02, 9 October 2022 (UTC)
Comments from Espresso Addict (proposal 14)
JoelleJay: I wouldn't include the OMIM example; that isn't an edge case!! WhatamIdoing might be able to help? I don't think it should be limited to online databases; print almanacs or gazetteers are also problematic, perhaps even more so as it is more difficult to see what coverage they have. Espresso Addict (talk) 22:06, 8 October 2022 (UTC)
- JoelleJay: I can't do word counts on this machine but the OMIM entry you linked must be >>1000 words with 62 refs. It's actually possible to make a viable article based on something as sparse as [1], which has no prose at all at the top level, though there's a number of clickthroughs. I'm beginning to think the key is reliability (expert/authorative authorship) not amount of prose. Espresso Addict (talk) 01:28, 9 October 2022 (UTC)
Boca Jóvenes: "The problem is invariably the database's own programming which they use to produce statistical summaries of various kinds and I would argue that these outputs are potentially, if not probably, unreliable." I don't understand this at all. I've worked with databases (as an amateur) and the problem is data input errors, not output problems? I asked my OH, who has expert professional experience in this area, who agreed with my assessment. Espresso Addict (talk) 00:35, 10 October 2022 (UTC)
Comments from WhatamIdoing (proposal 14)
User:JoelleJay, why does coverage need to be in prose? Are you hoping to ban videos as sources? WhatamIdoing (talk) 00:17, 9 October 2022 (UTC)
- It sounds like the key points for JoelleJay are "secondary" and "clearly written by humans"; prose per se is not the key point.
- We never have been able to agree on whether the WP:SIRS rule (a single source, considered in isolation, must have all the qualities that suggest notability [independent, secondary, significant coverage, etc.]) should be applied generally, or if you should look at the independent sources in combination to decide whether, taken as a group, there is enough information to write an article (e.g., a page here about his family, a short article there about his early career, a brief analysis of how he compares to two rivals over there...). If you lean towards the latter, then adding one bona fide secondary (but perhaps brief) source to one detailed (but non-secondary) database entry would indicate notability. WhatamIdoing (talk) 01:36, 9 October 2022 (UTC)
- Joelle, every time I've tried to get the folks at WT:N to agree that all sources must be secondary and individually contain SIGCOV for GNG's purposes, we've been unable to reach consensus. We can reach consensus that all sources must be independent to count (as long as we are fairly superficial with the definition of independence; find my talk page if you want to talk about the impossibility of finding scientific sources covering experimental drugs that are truly independent from the would-be manufacturer of a drug candidate), but we have historically been unable to reach consensus that only sources individually containing SIGCOV should count at all, or that only sources that are primarily/entirely secondary should count at all. If this is the sort of obvious truth that No true Wikipedian would ever disagree with, then we must have a lot of fake Wikipedians. ¯\_(ツ)_/¯
- @Boca Jóvenes, you might be interested in the concept of secondary data, which is not exactly the same thing as a secondary source. It is at least true that some databases are primary sources; it might be true that all databases are (or at least all of the "isolated data point" parts are) primary.
- More generally, the question of effort might be relevant. If someone published an article that says "I have taken the list of basketball players in this group, along with their heights and the points they each scored. I have manually calculated the statistical contribution of height to points scored. I conclude that taller players score more points, and for players with an above-average height, I list the percentage of points scored that can be attributed to height for each individual", we would be happy to include this the athletes' articles. But if you dump it in a database (or a spreadsheet), and the software does this simple calculation for you, then the same information is suddenly not okay? There was still a human deciding that this was something worth analyzing. There was still a human making the analysis happen. I think we should not encourage a technology-specific understanding of analysis. WhatamIdoing (talk) 02:56, 10 October 2022 (UTC)
Comments from Blue Square Thing (proposal 14)
I'm not sure that the problem is actually the use of databases, rather the inappropriate use of individual database entries with no prose at all as the sole source. This is probably more important for the deletion RfC. You'll struggle to include very many databases if you want to define them at a entire source level - Olympedia clearly has some human created significant prose entries. Blue Square Thing (talk) 07:42, 9 October 2022 (UTC)
Comments from Boca Jóvenes (proposal 14)
Interesting one but I don't agree that databases are primary sources. They are definitely secondary. In the first place, the typical database has human input but it probably lacks adequate oversight and so transcription errors occur. That in itself doesn't mean they are unreliable because all sources contain errors. The problem is invariably the database's own programming which they use to produce statistical summaries of various kinds and I would argue that these outputs are potentially, if not probably, unreliable. I think the human input content should be welcome even if there is an occasional human error, but the database's auto-generated outputs should be treated with caution and, perhaps, classified as unreliable. BoJó | talk UTC 11:46, 9 October 2022 (UTC)
Question 15: How about to award the expanders of articles (stubs) with the notifications that usually only the creator receives?
Support (awards)
- as nominator. If it would only count for stubs or articles in general could be thought about, but the expanders of the articles are likely the ones who are also interested in maintaining the articles.Paradise Chronicle (talk) 23:28, 9 October 2022 (UTC)
- Support - --Jax 0677 (talk) 23:38, 9 October 2022 (UTC)
- I'd been thinking of trying to codify a proposal to suggest this. I don't know whether the underlying software can handle it, but there seems no reason in principle why anyone can't ask to receive notifications on any article. Espresso Addict (talk) 00:39, 10 October 2022 (UTC)
- Support some kind of additional incentive for expansion - Expansion of stubs needs to be encouraged more. The plain truth is that simply creating stubs does not seem to encourage their expansion into real articles in any kind of reasonable time-frame even where the content exists for such an expansion (and especially if it does not). FOARP (talk) 09:22, 11 October 2022 (UTC)
Oppose (awards)
- This will only end up being gamed for the same reasons. Also I can't think of anything that would put me off developing stubs more than this. If people really want bonus points they can take articles to DYK Blue Square Thing (talk) 06:35, 10 October 2022 (UTC)
- It wouldn't put me off creating stubs, but I will note that stubs rarely get improved after their creation. Abductive (reasoning) 11:35, 10 October 2022 (UTC)
Please open your own section with username in the heading. Please limit comments within a section to 300 words.
Comments (awards)
Question from WhatamIdoing (awards)
Paradise Chronicle, what do you mean by "to award with notifications"? Is this like "winning extra spam"? (Here's a reminder to others that Special:Preferences#mw-prefsection-echo is worth looking into.) WhatamIdoing (talk) 02:59, 10 October 2022 (UTC)
- @Espresso Addict, you (currently) can't get notifications such as "Page link" ("please bother me every time someone adds a link to any article the software thinks I created") for pages you haven't created. However, I posted that link for the benefit of experienced editors who would like to get fewer notifications. WhatamIdoing (talk) 04:52, 10 October 2022 (UTC)
Comments from Espresso Addict (awards)
WhatamIdoing: I've just looked at that -- it doesn't look as if you can get notifications for articles that you haven't created? Espresso Addict (talk) 03:29, 10 October 2022 (UTC)
- WhatamIdoing: I quite like them -- it reminds me when I'm on wikibreak that I ought to be getting back. It's also interesting to see which of "my" creations continue to get links, and might be worth trying to put some more effort into. Espresso Addict (talk) 05:11, 10 October 2022 (UTC)
Comments from Paradise Chronicle (awards)
I am rather grateful to receive the notification if a wl is added to an article I created. It encourages to expand the articles accordingly. Also over the user views one can get notified if some event is related to an article one has created. An option would be adding an other symbol for user views and wls beside the blue star for the watchlist. Could be made over a tool.Paradise Chronicle (talk) 20:33, 10 October 2022 (UTC)
- WhatamIdoing, the notifications only concern on changes of MY creations (among other mentions and thanks etc.) not the ones of articles I am potentially interested in. For example if one expands an article from a stub, the one who created the stub gets the notifications, not the one who expanded the article.Paradise Chronicle (talk) 20:33, 10 October 2022 (UTC)
Question 16: Develop metric to measure editor's engagement in article improvement
Develop a metric that attempts to measure the degree to which an editor significantly improves articles created by other editors, either by adding sentences or paragraphs of sourced text or by adding reliable references to source existing text. It might be as simple as a count of articles that the editor had improved, or attempt to differentiate between adding one reference vs ~100-folding the article & getting it featured. This would complement existing metrics such as edit count, number of articles created, number of GAs reviewed, number of DYKs, and would facilitate the creation of user boxes or similar stating "I've improved n articles on Wikipedia" and/or "My [article improvement index] = y"
Support (article improvement metric)
- Support as nominator. Espresso Addict (talk) 01:02, 10 October 2022 (UTC)
- Support a constructive and cooperative approach to the mass creation of articles.Paradise Chronicle (talk) 08:20, 10 October 2022 (UTC)
- Support in principle, although I wonder about the practicalities. Ingratis (talk) 07:31, 12 October 2022 (UTC)
Oppose (article improvement metric)
- This will only end up being gamed. And, as above, I have little interest in creating stats such as this. Blue Square Thing (talk) 06:36, 10 October 2022 (UTC)
- Oppose - This is pure solutionism. FOARP (talk) 07:24, 10 October 2022 (UTC)
- Oppose - While article improvement is commendable, it doesn't solve the underlying problem of articles that can't be improved because significant coverage doesn't exist. Editors often skip over these articles when they can't find adequate sourcing, with the assumption that someone else will. We need solutions that specifically address the creation of unsourceable articles. –dlthewave ☎ 03:10, 12 October 2022 (UTC)
Comments (article improvement metric)
Comments from Espresso Addict
No idea how feasible this is technically. Might be useful for evaluating editors, too. Espresso Addict (talk) 01:02, 10 October 2022 (UTC)
Comments from Editor X
Please open your own section with username in the heading
Question 17: Amend WP:MASSCREATE
- 1. Amend WP:MASSCREATE to explicitly recite WP:MEATBOT, and explicitly clarify that mass-creation through repetitive editing by hand is not different for policy purposes to automated/semi-automated mass-creation.
- 2. Make getting consensus for creation prior to mass creation per WP:MASSCREATE mandatory.
Support amending WP:MASSCREATE
- Support both 1 and 2 as proposer - An argument repeatedly deployed by mass-creators was that they were not mass-creating, even when it was at the rate of dozens of articles per day through simply filling spaces in a template, as they were performing their mass-creation through repetitive editing by hand. They were able to argue this due to a perceived ambiguousness of WP:MASSCREATE/WP:MEATBOT as to whether they included mass-creation through repetitive editing by hand. Point 1 is intended to clarify that mass-creation by hand is also mass-creation.
Point 2 is designed to promote engagement with mass-creators and the wider community at an early stage. I feel strongly that editors spending many hours of their day, every day, robotically creating articles is not good for their mental health or co-operative engagement with other editors, and is path that we should steer people away from where possible. Instead editors should be encouraged to discuss their planned mass-creation at an early stage with other editors, get feedback from them, and possibly even explore options for using tools/bots to speed up the process where there is a consensus to do so. This discussion can be in any appropriate forum (e.g., a relevant project). I think an earlier intervention in the cases of the mass-creators for whom we have had ARBCOM cases in recent years might have avoided the sad outcome of these proceedings. I think this is a minimalist solution based on existing policies and is not mutually exclusive with the other proposed solutions above.
The enforcement mechanism for Point 2 would be a subject for the next RFC.FOARP (talk) 11:07, 11 October 2022 (UTC) - Support. Problematic mass-creation is already semi-automated through the use of spreadsheets, scrapers, or templates, but it is a good idea to remove any ambiguity. I also support Point 2; at the moment this is an implicit requirement (Wikipedia:Bots/Requests for approval only grant approval for mass creation tasks when there is a broader consensus) and making it an explicit requirement also helps to remove ambiguity. BilledMammal (talk) 13:58, 11 October 2022 (UTC)
- Support both. Good and bad editors both can over-indulge in editing Wikipedia, but the issue is the content by bad editors. Give them an early warning that bot like editing may have consequences, so as to limit damage to the project. The analogy is WP:MEATPUPPET and WP:DUCK. LessHeard vanU (talk) 13:03, 12 October 2022 (UTC)
- Support for both proposals. If the issues at AfD are going to be solved mass creation needs to not be the norm. By getting edutors to discuss mass creation first AfD won't be clogged up later having to deal with any issues. -- LCU ActivelyDisinterested ∆transmissions∆ °co-ords° 15:27, 12 October 2022 (UTC)
- Support- Ideally we would have one policy and approval process that covers all mass creation tasks, automated or not, as the effect is the same. This would also eliminate the loophole for editors who copy-paste the output of offline/local automated processes and insist that they're doing it all by hand. –dlthewave ☎ 15:55, 12 October 2022 (UTC)
Oppose amending WP:MASSCREATE
Comments (amend WP:MASSCREATE)
Comments from FOARP (amend WP:MASSCREATE)
As a simple policy point, mass-creation by bot is probably actually better than mass-creation by hand, since it at least reduces the chance of human error and takes up less of the editor's time. At the very least there is no reason why one should be considered "bad" and the other "good". One of the editors in the previous ARBCOM case (I cannot remember who) had it very right when they said that if we really wanted to create the Turkish Mahalle articles that were the subject of the first big ANI about Lugnuts we could have simply contacted the Turkish government for their data directly and just run a bot to create the articles - there really was no reason at all to just go through a low-information database by hand to do it. FOARP (talk) 11:07, 11 October 2022 (UTC)
Espresso Addict - I tend to agree that BOTPOL is not the best place for this. Just like many of our other policies, MASSCREATE started at a particular place because it was the first place to actually tackle with the problem of mass creation (if I'm not mistaken, it was RAMBOT and Dr. Blofeld's mass-creation-through-Bot attempts), but it need not stay there and can easily be moved somewhere else or spun out as its own article - would you support this proposal with the proviso that this happens? I think the main point here is we avoid making radical changes, and instead develop on what's already there. FOARP (talk) 07:41, 12 October 2022 (UTC)
- Espresso Addict - I'm afraid this whole thing about definitions - something that really should be allowed to develop organically and beaten out case-by-case - has become just a reason to oppose any proposal at all. The forum for such discussions can also be allowed to develop organically. Well-received serieses of articles are unlikely to have any problems clearing this bar, but repeated flouting of a required process in creating large amounts of low-quality stubs for which their is no consensus by hand would be a behavioural issue, just as much as if someone repeatedly tried to use a bot to do so. We can see that indeed WP:NASTRO already implemented something like this - based explicitly on WP:MASSCREATION - when they cleaned up minor planets and is appears to work just fine now. Oddly, they did not feel the need to create a specific definition to achieve this. FOARP (talk) 09:22, 12 October 2022 (UTC)
Comments from Rhododendrites (amend WP:MASSCREATE)
Adding "also this applies to manual editing" gets at the main reason efforts to point people to those policies don't work, but there are still unresolved issues: apart from the bot policy not seeming like the right place to talk about manual editing (it should have a pointer elsewhere IMO), it doesn't sufficiently define what sort of manual editing it applies to. There's the number "25 or 50" but the language around it is weak, and it never defines "25 or 50 what" (i.e. over what span of time, with what similar characteristics, etc.). Point 2 is, I think, taken for granted throughout this RfC -- the areas where we're having trouble agreeing is under what conditions it should apply. — Rhododendrites talk \\ 13:05, 11 October 2022 (UTC)
Comments from Espresso Addict
The bot noticeboard isn't the right place to point people at. Espresso Addict (talk) 23:35, 11 October 2022 (UTC)
- FOARP: If we can agree any kind of usable definition for mass creation, a suitable venue for pre-approving (I'd be more in favour of relevant wikiprojects than a noticeboard), and a suitable place for the advisory message to be placed (not relating to bots), then I'm not strongly opposed.
- What would you envisage doing with ongoing series of articles? Eg Peter I. Vardy has been creating lists of English listed buildings for over a decade now, which seem generally well received; would he have to ask for permission to continue? ETA: I'm worried that valuable contributors doing the spadework of building the encyclopedia are going to feel unappreciated, while editors who create poor-quality stubs and don't tend them are unlikely to read the guidelines and seek advance permission. Espresso Addict (talk) 08:46, 12 October 2022 (UTC)
Comments from ActivelyDisinterested
I'd agree the BOT board isn't the right place for discussion, all mass creations should be discussed at the new board (as I see it that would be the primary concern of the new board). -- LCU ActivelyDisinterested ∆transmissions∆ °co-ords° 15:29, 12 October 2022 (UTC)
Comments from Lurking Shadow
The bot noticeboard is the right place for these disputes if the other noticeboard doesn't form. If you are creating articles on scale then there is some routine in the process. Bot-like editing can be addressed in a similar fashion to bot editing, only easier because you can ask the "bot" what it does directly.
Comments from Editor X
Please open your own section with username in the heading. Please limit comments within a section to 300 words.
Question 18:
Neutral description of proposal.
Support (proposal name)
- support statement
Oppose (proposal name)
- oppose statement
Comments (proposal name)
Comments from Editor X
Please open your own section with username in the heading. Please limit comments within a section to 300 words.
Discussion
Comments from ActivelyDisinterested
As per my comment on question 5, it could be useful to have some limitation with WP:BLPs. Personally I wonder whether allowing it at all is a good idea. But I'm unsure how to word any proposal correctly. -- LCU ActivelyDisinterested ∆transmissions∆ °co-ords° 19:00, 4 October 2022 (UTC)
Comments from JoelleJay
I brought this up in the earlier discussions, but given the difficulty in defining "mass creation", why don't we just have a limit on how many undersourced new articles a single person can have in their contributions (from X date)? If they go over that limit, they get escalating warnings up to a p-block from page creation, which can only be removed if they adequately source N other non-compliant articles. So it would be fine to create 100 microstubs if they included GNG sourcing from the start. And it wouldn't matter if other people expanded their articles after creation: the whole point is to discourage generating standalones for which other editors end up needing to find notability-confirming sources. 22:19, 6 October 2022 (UTC)
- I don't know anything about technical feasibility in tracking the state of article sourcing at creation, but if it's doable I could add it to the proposals. "Undersourced" could be defined by AfC norms or hashed out more fully later. I intended for the counter to start only after extensive global notification across some period of time, so it wouldn't apply to old articles. Adding sufficient sourcing to other editors' articles would also be an option for offsetting one's own unexpandable-microstub footprint, it just would be harder to keep track of it. 21:01, 7 October 2022 (UTC)
- Also agreed with FOARP that GEOLAND is huge mass-creation fodder and should be covered. 21:22, 7 October 2022 (UTC)
Comments from Espresso Addict
@JoelleJay: This is an interesting suggestion; I'd like further details on how it might work in practice. Is this technically feasible? What about articles which the contributor creates with sources but someone else removes the sources (this is reasonably common, someone "improves" the article by writing over it, losing all sourcing). Would you get points for sourcing other people's work? Anything that persuaded editors to work on the un(der)sourced articles heap would be positive. How is undersourced defined? My understanding of the way this RfC works is that if this is to be a proposal, it needs to be formally started in the next few days. ETA: Might want to include a date threshold; I think there might need to be a "statute of limitations" for very old contributions before referencing standards were anything like today's standards and when online sources were much more sparse. Espresso Addict (talk) 00:04, 7 October 2022 (UTC)
@Aquillion: Well, I tried to put forward proposals but both of them have/are failing. I don't personally feel this discussion format is at all helpful in bringing people together to find a solution to the undoubted problem. I wish I'd been able to participate in the brainstorming discussions but they happened when I was sufficiently injured to be unable to type. Espresso Addict (talk) 02:07, 8 October 2022 (UTC)
@FOARP: I, and perhaps others, would be a great deal happier with proposals that relate to populated settlements under GEOLAND, rather the entire guideline, which also covers heritage-listed buildings, mountains, and similar. Espresso Addict (talk) 01:12, 10 October 2022 (UTC)
- @FOARP: I had a look at Zykov Glacier and there is a little online about it beyond the existing text. After the expedition that photographed it there seem to have been at least two published independent studies of the rock there, which are reviewed in a handful of papers. A reader encountering Zykov Glacier in this literature might want to know where it was exactly. Someone who actually understood what the papers are talking about might be able to improve the article further. Espresso Addict (talk) 23:39, 10 October 2022 (UTC)
Comments from Dlthewave
@JoelleJay: This seems like a great idea - Besides removing the open-ended "someone will improve it in the future" assumption, it will limit the problem of articles that can't be improved because coverage doesn't exist, which are currently a massive time sink. Even if we have to tweak the wording, this seems like something that the community might support as it doesn't encroach on article creation quite as much as the GNG-sourcing-from-the-start requirement. Would love to see this added to the RfC. –dlthewave ☎ 02:09, 7 October 2022 (UTC)
Comments from FOARP
I am very concerned that people seem to not appreciate the extent to which mass-creation in the GEOLAND area is a really big problem, at least as big as that which was in NSPORTS. Exempting and excluding GEOLAND from a lot of the proposals above (why?) means there will essentially be no fix for the problems we are addressing. FOARP (talk) 08:40, 7 October 2022 (UTC)
@Valereee: - My GEOLAND-specific fix is that any fix for mass-creation also applies to GEOLAND articles. There is nothing special about GEOLAND articles that requires a carve-out for them. Going through a half-understood database and translating every line of it into its own article is as much a problem there as it is everywhere else. This whole thing started with Lugnuts mass-creating Turkish “village” articles that were likely neighbourhoods or individual farms. FOARP (talk) 14:33, 9 October 2022 (UTC)
- @Valereee: - Having looked at this, I think this is better stated as a simple oppose !vote for proposals that (inexplicably) contain carve-outs for subject-areas known to be heavily affected by mass-creation. Beyond saying that Agro-Industry Complex and Harry Oppenheim were both undesirable mass-creations, what more is to be said? Instead I will just propose something based on an already existing policy that already applies to all subject-areas anyway. FOARP (talk) 10:32, 11 October 2022 (UTC)
Espresso Addict - I cannot see how the rash of articles about geographical features in Antarctica mass-created in 2006 literally by word-for-word copying of single-sentence GNIS database entries (e.g., this one) were much better than the Iranian villages articles mass-created at the same time. By any definition these articles lack significant coverage and are never likely to be expanded because they are about features where no-one lives and which are almost never visited except fleetingly. The are straight NGEO-fails given that the only information about them is name and location. The one thing I will grant them is at least they are not simply the same single-sentence template repeated over and over, but this is because the people who created the GNIS data did not do that. FOARP (talk) 07:37, 10 October 2022 (UTC)
- Espresso Addict - Passing mentions in a long list of places that samples were collected in is not anything that will add content to an article. The same thing was true for minor planets (i.e., their observations are regularly recorded and many are regularly mentioned fleetingly in long lists) but we sensibly weeded out all the ones without SIGCOV years ago. FOARP (talk) 08:00, 11 October 2022 (UTC)
@Valereee, 127(point)0(point)0(point)1, and 127(point)0(point)0(point)1: - Engagement is lower than you might expect partly because the title is very anodyne and doesn't really say what it's about. This subject does cause major drama so I expect if we had a title that was closer to what it was actually about (what to do about mass creation of articles?) more people would be clicking through from CENT. FOARP (talk) 14:34, 11 October 2022 (UTC)
@AKAF - You are right that WP:MASSCREATE already exists - sharpening it up a bit is the subject of question 17. FOARP (talk) 10:47, 12 October 2022 (UTC)
Comments from Aquillion
I'm noticing at least a few people seem to be opposing virtually all proposals. Is this because they do not believe there is a problem, or because they believe that none of these are the correct solution to the problem? If it's the latter, do they have proposals of their own? --Aquillion (talk) 18:23, 7 October 2022 (UTC)
Comments from XOR'easter
This whole discussion seems to have come unmoored from any specifics, in a way that creates an illusory unity of problems. Are we worried about copyright violations? Inaccurate information? The inclusion of accurate information in chunks that might be too small? I don't think "article creation at scale" is one thing, and so I don't think there can be a single rulebook for it, or a single remedy for it going wrong. XOR'easter (talk) 19:08, 7 October 2022 (UTC)
Hut 8.5, yes, I think that many people are just used to Wikipedia having lots of tiny articles about obscure things. I'm a bit surprised that there hasn't been a proposal along the lines of, "Before you create a lot of stubs, get community input on whether the source you are using is reliable." XOR'easter (talk) 14:03, 9 October 2022 (UTC)
Comments from Blue Square Thing
@Aquillion: I seem to have found myself opposing almost all of the proposals. I didn't expect this to be the case - I expected to support some. But the arguments put forward by other people have led me to oppose much more frequently than I expected to. I'm not completely convinced that there is that much of a problem here than can't be dealt with in other ways; if there are other solutions I'm afraid that I'm not clever enough to see them just now. Blue Square Thing (talk) 08:51, 8 October 2022 (UTC)
Comments from Valereee
Espresso Addict, we'll be workshopping the AfD RfC soon at WT:ADAS (and are discussing the timeline for that now; we are considering starting workshopping as soon as things significantly slow here). If none of the proposed solutions here gains consensus, that RfC will by necessity still need to cover any mass creation issues that affect AfD at scale. We ran this first to see if the community could find solutions to this one major problem, because if it could be solved at least partially here, that RfC could build on those solutions.
XOR'easter, we decided to do this RfC first not because creation at scale was the only thing affecting dysfunction at AfD but because there was a strong feeling that it was a major contributor, and if we could find solutions here before the RfC about deletions at scale, it might actually make that RfC less of a train wreck. Valereee (talk) 14:43, 8 October 2022 (UTC)
FOARP, can you propose a GEOLAND-specific fix? Valereee (talk) 14:48, 8 October 2022 (UTC)
- @FOARP, can you create a proposal for that? Valereee (talk) 14:40, 9 October 2022 (UTC)
Ingratis, I'm not going to prevent something being proposed that is related to mass creation, if someone can come up with a proposal. The RfC has been announced in multiple places, including WT:N. Valereee (talk) 12:04, 10 October 2022 (UTC)
@127(point)0(point)0(point)1, the RfC has been announced at the articles for deletion talk page, the Arbitration Noticeboard, the administrators' noticeboard, the Bot policy talk page, Village pump (policy), Wikipedia talk:Notability and Centralized discussion. Some of those noticeboards have thousands of watchers. Valereee (talk) 14:17, 11 October 2022 (UTC)
- @127(point)0(point)0(point)1, we can't actually force people to be interested or understand. @FOARP, you really think it's the name? Don't answer here, let's take it to talk. Valereee (talk) 14:51, 11 October 2022 (UTC)
Comments from 127
I'm rather surprised by the participation numbers here. Problems of this scale and discussions of reworking fundamental processes and the highest number of editors in any one counting stack response is... 13 at time of writing. There are more well populated ANI threads. I know there's more time to run, but at the rate there are new voices coming in, I don't think we're at a place we can realistically proclaim a consensus for or against. This RfC needs more visibility. --WhoIs 127.0.0.1 ping/loopback 20:56, 8 October 2022 (UTC)
@User:Valereee I appreciate that it's been announced in all those places, and don't wan't this to be framed as some sort of lack of effort. But that still doesn't change the fact that we don't seem to have the participation we need for the kind of changes we're proposing here. Either those watching eyes aren't coming and aren't interested or they don't understand the scale of the RfC. -WhoIs 127.0.0.1 ping/loopback 14:36, 11 October 2022 (UTC)
Comments from Hut 8.5
I get the impression that this is an issue a small number of editors care about a lot but where the wider community doesn't care. That explains the low particpation and the fact that none of the proposals is doing very well. The largest problem I've seen in this area is with people creating lots of articles on populated places that don't exist by citing sources that aren't reliable or don't really indicate that something is a populated place. The proposals here either wouldn't address this or are far too heavyweight. Hut 8.5 10:50, 9 October 2022 (UTC)
Comments from Editor AKAF
This RFC seems a lot of faff to codify the current community position:
- Creation of "high quality" articles (whatever that means) is a net positive of which pretty much everyone approves.
- Creation of articles in a "bot-like manner", including manually populating templates from a database, is something to which editors appear indifferent or negative.
- A small percentage of articles created in a "bot-like manner" have been expanded to be "high quality" articles, and everybody likes these articles.
For bot-like edits, there is an existing route to approval (WP:BRFA), which could be enforced. AKAF (talk) 07:25, 10 October 2022 (UTC)
Comments from Ingratis
@Valereee I'm very concerned at the apparent danger that this RfC may be used as backdoor leverage on the issues surrounding GEOLAND. As you will be aware, there is a long and rather vicious history of many attempts to change it which have met with much strong objection. Because of the complexity of the issues they need to continue to be discussed fully at the SNG itself, not tacked onto this RfC, the context of which will certainly slant the discussion, so it seems to me inappropriate for any GEOLAND reform proposal to be put forward here. Ingratis (talk) 11:28, 10 October 2022 (UTC)
- @Valereee I understand that of course the focus here is on mass creation but there are many more issues involved in GEOLAND than that. Any proposal to draw its teeth as an SNG by removing its present criteria and making it subject to GNG (as advocated by for example FOARP) must surely require a separate RfC, which would doubtless attract a lot of participation, since there are many editors who have very firm views on whether GEOLAND should go the way of NSPORTS. Ingratis (talk) 08:19, 12 October 2022 (UTC)
Comments from Rhododendrites
I wish everyone responding to various proposals here with "that will just be wikilawyered" or "this will just be gamed" will consider whether the potential for wikilawyering/gaming would be greater if we have some of those basic rules/guidelines in place than the current situation. It seems like so many things are failing for two big reasons: this, and because there's a divide as to whether this is all about quality or all about quantity. On the latter, I don't know how you can say which creation projects the quality requirements would apply to without getting into quantity ("all new articles must have X sourcing/claims/whatever" is outside the scope of this RfC after all, so to what will it apply?). — Rhododendrites talk \\ 12:16, 10 October 2022 (UTC)
Comments from Editor X
Open additional comments sections below. Please limit comments within a section to 300 words.