Jump to content

Wikipedia:Weeding Wild Wikipedia: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
Larry_Sanger (talk)
Still at work...please don't edit yet...
 
Larry_Sanger (talk)
No edit summary
Line 1: Line 1:
The most meaningful and important thing anyone can do to contribute to Wikipedia is to write a long, accurate, well-referenced, meaty article.
The most meaningful and important thing anyone can do to contribute to Wikipedia is to write [[the perfect article|a long, accurate, well-referenced, meaty article]].






But it's also very important to do "Wikipedia weeding"--particularly when we have disproportionate influxes of new people who are eagerly, willy-nilly, contributing scads of new entries. We love these people, but they need teaching. Wikipedia weeding consists essentially of checking over work and making small but important edits and, occasionally, adding comments and questions. Here, for your edification, is some advice on how to perform Wikipedia weeding yourself. (Feel free to edit this!)
But it's also very important to do "Wikipedia weeding"--particularly when we have disproportionate influxes of new people who are eagerly, willy-nilly, contributing scads of new entries. We love these people, but some of them need teaching. (Not all of them. Some of them are, instantly, enormously useful and start doing some weeding themselves!) Wikipedia weeding consists essentially of checking over work and making small but important edits and, occasionally, adding comments and questions. Here, for your edification, is some advice on how to perform Wikipedia weeding yourself. (Feel free to edit this!)




Line 17: Line 17:
* '''The start of articles.''' Generally, Wikipedia articles begin with the subject of the article in <b>bold,</b> and part of a full sentence. There's a somewhat annoying tendency on the part of some to repeat the subject of an article by itself on its own line (it's already at the top of the page). Others make the first words of the article a partial sentence. So some of us (compulsively) convert such entries to begin with full sentences, with the subject of the article in <b>bold.</b> See [[G. E. Moore]] for an example of how this is done.
* '''The start of articles.''' Generally, Wikipedia articles begin with the subject of the article in <b>bold,</b> and part of a full sentence. There's a somewhat annoying tendency on the part of some to repeat the subject of an article by itself on its own line (it's already at the top of the page). Others make the first words of the article a partial sentence. So some of us (compulsively) convert such entries to begin with full sentences, with the subject of the article in <b>bold.</b> See [[G. E. Moore]] for an example of how this is done.


* '''Articles made to look like dictionary definitions.''' Another pet peeve of mine are entries that have different senses of the title word numbered--as if we were writing a dictionary (which [[Wikipedia is not a dictionary|we are not]]!!) and we needed to number the senses. Goddammit, I wish we had parentheses (Magnus, we need to get your software fixed!). Anyway, I remove the numbers, properly format the separate articles on the page, and put a line between them. I also, mercilessly, delete any brief appendages to the effect, "In English, this word is also used to mean X," where the word-in-the-sense-of-X ain't ''ever'' going to be an encyclopedia article. See [[/Why a list of the senses of a word is not an encyclopedia article]].
* '''Articles made to look like dictionary definitions.''' Some people confuse Wikipedia, which is an encyclopedia, with a dictionary. They add entries that have different senses of the title word numbered and briefly defined--as if we were writing a dictionary (which [[Wikipedia is not a dictionary|we are not]]) and we needed to number the senses. Soon (by February 2002) we should have parentheses working, which will help the [[naming conventions|disambiguation problem]]. Anyway, an appropriate approach to such entries is to remove the numbers, in most cases simply delete non-articles (i.e., mere definitions that will never become encyclopedia articles), properly format the separate articles on the page, and put lines between them.


* '''Copyediting.''' It might be unfair and silly, but Wikipedia ''is'' going to be judged based on how well we spell, punctuate, etc. So I clean articles up that way. Blatant copyediting mistakes are, even if trivial, nevertheless indefensible; mistakes in content can often be defended on grounds of ambiguity. So if somebody who cares about how English is used spots a bunch of copyediting mistakes, he'll easily be able to conclude the product is shoddy; on the other hand, if the same person sees few such mistakes, and content that is largely correct, with a few overgeneralizations and half-truths, his suspicions of shoddiness will be less certain. (Just some idle hypothesizing there.)
* '''Copyediting.''' It might be unfair and silly, but Wikipedia ''is'' going to be judged based on how well we spell, punctuate, etc. So we should clean articles up that way. Blatant copyediting mistakes are, even if trivial, nevertheless indefensible.


* '''Fix bad links.''' Some newbies, caught up in the excitement (and who can blame them), wikify everything in sight, including plurals, capitalized words that shouldn't be capitalized, ambiguous words and surnames only, etc. Hence I often make it my mission to either fix or remove bad links.
* '''Fix bad links.''' Some new contributors, caught up in the excitement (and who can blame them), wikify everything in sight, including plurals, capitalized words that shouldn't be capitalized, ambiguous words and surnames only, etc. So it's a good idea to either fix or remove those bad links.


* '''Remove [[patent nonsense]], etc.''' I sometimes find myself simply deleting entire sentences and even paragraphs. This has to be done carefully, though, of course. Sometimes it's just vandalism, and no excuses need to be made to remove ''that.'' Sometimes it's something that seems to have been written by a 14-year-old whose main concern is to express excitement about a hobby, but conveys literally zero information. The possibilities of useless text, indeed, are endless. Another possibility is completely, blatantly biased stuff. If I think I don't have time to correct it, and if the bias is extreme, but the content is useful, I'll move it to a [[talk page]] and say "this needs to be de-biased" or something like that.
* '''Remove [[patent nonsense]], etc.''' Sometimes we have the duty of simply deleting entire sentences and even paragraphs. This has to be done carefully, though, of course. Sometimes it's just vandalism, and no excuses need to be made to remove ''that.'' Sometimes it's something that seems to have been written by a 14-year-old whose main concern is to express excitement about a hobby, but conveys literally zero information. The possibilities of useless text, indeed, are endless. Another possibility is completely, blatantly biased stuff. Wikipedia weeders should be very familiar with the [[neutral point of view]] policy. If we lack the time to correct the bias, and if the bias is extreme, but the content is useful, we move it to a [[talk page]] and say "this needs to be de-biased" or something like that. Ax-grinders have no legitimate demands on the time of other contributors; other contributors shouldn't be asked to spend inordinate amounts of time debiasing other people's text.


* '''Check for and if necessary remove copyrighted stuff.''' Basically, if some new person (or a person who hasn't [http://www.wikipedia.com/wiki.cgi?action=editprefs signed in]) writes some fantastic prose, I instantly copy a string of four or five words from it and, in quotes, see if [http://www.google.com Google] recognizes it. If so (which happens more often than I'm comfortable with), I check on the source page for a copyright notice. If there is no indication that the text is public domain or released under the GNU FDL, I either remove the text on the spot, giving the URL where I found the text on a talk page, or (if there's some question) I append a note asking where it came from, or whether it's copyrighted, etc. If I receive no reply, I delete the text.
* '''Check for and if necessary remove copyrighted stuff.''' Dealing with copyright issues is more an art than a science. Basically, if some new person (or a person who hasn't [http://www.wikipedia.com/wiki.cgi?action=editprefs signed in]) writes some fantastic prose, we instantly copy a string of four or five words from it and, in quotes, see if [http://www.google.com Google] recognizes it. If so, we check on the source page for a copyright notice. If there is no indication that the text is public domain or released under the GNU FDL, we either remove the text on the spot, giving the URL where we found the text on a talk page, or perhaps (maybe if there's some question) we append a note asking where it came from, or whether it's copyrighted, etc. If we receive no reply, we delete the text. Wikipedia can't afford the liability risk.


* '''Light content editing.''' If, while doing the above, I come across some statement I can make factually correct, or I can add some essential piece of information or remove some clear error, etc., I'll do that.
* '''Light content editing.''' If, while doing the above, you come across some statement that you can make factually correct, or you can add some essential piece of information or remove some clear error, etc., do it.






I probably do this sort of editing more than any other simply because it seems to me that others aren't doing it enough (yet). It is, again, probably not the ''most'' important way one can use one's time on Wikipedia (there are many ways to help, of course)--but it is definitely essential work. If we don't do it, Wikipedia is going to start looking more and more like [http://www.wikipedia.com/wiki/Everything2 Everything2], and I swear I'll kill myself if that happens. :-)
When there are large influxes of traffic (see [[The Wikipedia Militia]]), it is very important that old hands be engaged in this sort of weeding. It is, again, probably not the ''most'' important way one can use one's time on Wikipedia (there are many ways to help, of course)--but it is definitely essential work. If we don't do it, Wikipedia is going to start looking more and more like [http://www.wikipedia.com/wiki/Everything2 Everything2], which would surely be a great tragedy.



--[[user:Larry_Sanger|Larry_Sanger]]

----

I ''am'' one of those newbies, but I found myself weeding (carefully) almost at once. I'd like to make a suggestion: If the /Talk page hasn't had any traffic for over a month AND the issue people were talking about has clearly been resolved and fixed, can we delete it? Or does eveyone feel that /Talk pages have significant historical value and should be held onto at all costs? -- [[clasqm]]



: It depends on the circumstances in my opinion. If the /Talk discussion resolved some trivial issue, say a typo, then it can be deleted. But if the /Talk discussion resolved some fundamental questions, then don't delete. In the latter case the 'historical' /Talk may help to avoid that the same discussion arises again. --[[css]]



I remove things from /Talk pages when they aren't relevant anymore. If a change or addition is discussed for a while and then done on the main page, then I think the discussion should be removed as well. --[[Pinkunicorn]]



----

I think I'm counted as a newbie (a week or so, now), and one of the things I've been doing, from the beginning, is this sort of tidying. Not on the "blatant nonsense" level, so far, but fixing spelling, grammar, and such. Then again, how many wikipedia newbies are copyeditors? This is something I find both easy and rewarding, so I do it.

----

Sorry, I didn't mean to imply that "newbies" couldn't do a lot of very useful weeding. Of course they can! --[[LMS]]

----

Back to [[Larry Sanger|Larry's columns]]



Revision as of 19:11, 15 January 2002

The most meaningful and important thing anyone can do to contribute to Wikipedia is to write a long, accurate, well-referenced, meaty article.


But it's also very important to do "Wikipedia weeding"--particularly when we have disproportionate influxes of new people who are eagerly, willy-nilly, contributing scads of new entries. We love these people, but some of them need teaching. (Not all of them. Some of them are, instantly, enormously useful and start doing some weeding themselves!) Wikipedia weeding consists essentially of checking over work and making small but important edits and, occasionally, adding comments and questions. Here, for your edification, is some advice on how to perform Wikipedia weeding yourself. (Feel free to edit this!)


  • The Recent Changes page. It's most effective to weed based on what comes up on Recent Changes. People are looking to see what other people have done to their articles; this is a prime opportunity to teach by example (and teaching by example is that than which nothing is more wiki). One can also weed by repeatedly following Random Page link and working on whatever comes up. This latter can be fun!
  • Look for new names and ISP numbers. It sounds like anti-newbieism to say so, and apologies to the new folks, but very often the people who add the most dross to the project are the newest people, who are not signed in. One of the finest services you can perform is follow these people around and clean up after them. But, in explaining your changes (if necessary), we do not want to make them feel unwelcome; please be as gentle as you can with them.
  • Bad titles. Titles can be improperly capitalized (should be lower case unless the word in the title is always capitalized), and they can be ambiguous. It can help to tell people to study naming conventions. If you don't know where a page should be located, you could simply append a small italicized note at the end of the page, or on a talk page. If you do know where the article should be located, make a redirection page.
  • The start of articles. Generally, Wikipedia articles begin with the subject of the article in bold, and part of a full sentence. There's a somewhat annoying tendency on the part of some to repeat the subject of an article by itself on its own line (it's already at the top of the page). Others make the first words of the article a partial sentence. So some of us (compulsively) convert such entries to begin with full sentences, with the subject of the article in bold. See G. E. Moore for an example of how this is done.
  • Articles made to look like dictionary definitions. Some people confuse Wikipedia, which is an encyclopedia, with a dictionary. They add entries that have different senses of the title word numbered and briefly defined--as if we were writing a dictionary (which we are not) and we needed to number the senses. Soon (by February 2002) we should have parentheses working, which will help the disambiguation problem. Anyway, an appropriate approach to such entries is to remove the numbers, in most cases simply delete non-articles (i.e., mere definitions that will never become encyclopedia articles), properly format the separate articles on the page, and put lines between them.
  • Copyediting. It might be unfair and silly, but Wikipedia is going to be judged based on how well we spell, punctuate, etc. So we should clean articles up that way. Blatant copyediting mistakes are, even if trivial, nevertheless indefensible.
  • Fix bad links. Some new contributors, caught up in the excitement (and who can blame them), wikify everything in sight, including plurals, capitalized words that shouldn't be capitalized, ambiguous words and surnames only, etc. So it's a good idea to either fix or remove those bad links.
  • Remove patent nonsense, etc. Sometimes we have the duty of simply deleting entire sentences and even paragraphs. This has to be done carefully, though, of course. Sometimes it's just vandalism, and no excuses need to be made to remove that. Sometimes it's something that seems to have been written by a 14-year-old whose main concern is to express excitement about a hobby, but conveys literally zero information. The possibilities of useless text, indeed, are endless. Another possibility is completely, blatantly biased stuff. Wikipedia weeders should be very familiar with the neutral point of view policy. If we lack the time to correct the bias, and if the bias is extreme, but the content is useful, we move it to a talk page and say "this needs to be de-biased" or something like that. Ax-grinders have no legitimate demands on the time of other contributors; other contributors shouldn't be asked to spend inordinate amounts of time debiasing other people's text.
  • Check for and if necessary remove copyrighted stuff. Dealing with copyright issues is more an art than a science. Basically, if some new person (or a person who hasn't signed in) writes some fantastic prose, we instantly copy a string of four or five words from it and, in quotes, see if Google recognizes it. If so, we check on the source page for a copyright notice. If there is no indication that the text is public domain or released under the GNU FDL, we either remove the text on the spot, giving the URL where we found the text on a talk page, or perhaps (maybe if there's some question) we append a note asking where it came from, or whether it's copyrighted, etc. If we receive no reply, we delete the text. Wikipedia can't afford the liability risk.
  • Light content editing. If, while doing the above, you come across some statement that you can make factually correct, or you can add some essential piece of information or remove some clear error, etc., do it.


When there are large influxes of traffic (see The Wikipedia Militia), it is very important that old hands be engaged in this sort of weeding. It is, again, probably not the most important way one can use one's time on Wikipedia (there are many ways to help, of course)--but it is definitely essential work. If we don't do it, Wikipedia is going to start looking more and more like Everything2, which would surely be a great tragedy.