
User:MER-C/Incubator: Difference between revisions

From Wikipedia, the free encyclopedia
{{User:MER-C/Directory}}

=Secondary crater=
[[Image:Mare Imbrium-Apollo17.jpg|thumb|200px|right|[[Mare Imbrium]] (foreground) is peppered with secondary craters from the impact that formed [[Copernicus (lunar crater)|Copernicus crater]] (upper center)]]
'''Secondary craters''' are [[impact crater]]s formed by the [[ejecta]] that was thrown out of a larger crater. They sometimes form radial [[crater chain]]s.

Some things to talk about:

*Formation rate (Zunil)
*Applicability to crater counting
*Depth/diameter is low (typically 0.11)
*Bonneville crater

==External links==
*[http://www.hq.nasa.gov/office/pao/History/SP-362/ch5.3.htm Some secondary craters and crater chains as seen by the Apollo missions]

<!--
{{crater-stub}}
[[Category:Craters]]
[[Category:Planetary geology]]
-->
<br clear=all>



=Paper dump=
* (athabasca.pdf) {{cite journal|title=Athabasca Valles, Mars: A Lava-Draped Channel System|journal=[[Science (journal)|Science]]|author=Jaeger, et al|volume=317|pages=1709 - 1711|doi=10.1126/science.1143315}}
* (bonnevillecrater.pdf) {{cite journal|title=Surficial Deposits at Gusev Crater Along Spirit Rover Traverses|journal=[[Science (journal)|Science]]|author=Squyres, et al|volume=305|issue=5685|pages=807 - 810|doi=10.1126/science.1099849}}
* (cratercounter.pdf) {{cite journal|title=Who can Read the Martian Clock?|journal=[[Science (journal)|Science]]|author=Kerr, R|volume=312|date=2006|pages=1132 - 1133|doi=10.1126/science.312.5777.1132}}
* (cloud1.pdf) {{cite journal|journal=[[Icarus (journal)|Icarus]]|title=Interannual variability of water ice clouds over major martian volcanoes observed by MOC|date=2006|author=Benson et al|volume=184|pages=365-371|doi=10.1016/j.icarus.2006.03.014}}
* (cloud2.pdf) {{cite journal|journal=[[Icarus (journal)|Icarus]]|title=The seasonal behavior of water ice clouds in the Tharsis and Valles Marineris regions of Mars: Mars Orbiter Camera Observations|date=2003|author=Benson et al|volume=165|pages=34-52|doi=10.1016/S0019-1035(03)00175-1}}
* (cloud3.pdf) [http://www.ohiolink.edu/etd/view.cgi?acc_num=toledo1163799718 Properties of Water Ice Clouds over Major Martian Volcanoes Observed by MOC] (PhD thesis)
*(eagle1.pdf) {{cite journal|author=Squyres, et al|title=In Situ Evidence for an Ancient Aqueous Environment at Meridiani Planum, Mars|journal=[[Science (journal)|Science]]|date=2004|volume=306|pages=1709-1714|doi=10.1126/science.1104559}}
*(eagle2.pdf) {{cite journal|author=Squyres, et al|title=Soils of Eagle Crater and Meridiani Planum at the Opportunity Rover Landing Site|journal=Science|date=2004|volume=306|pages=1723-1726|doi=10.1126/science.1105127}}
*(eagle3.pdf) {{cite journal|author=Squyres, et al|title=The Opportunity Rover's Athena Science Investigation at Meridiani Planum, Mars|journal=Science|date=2004|volume=306|pages=1698-1703|doi=10.1126/science.1106171}}
* (homeplate.pdf) {{cite journal|journal=Science|volume=320|pages=1063–1067|author=Squyres, et al|title=Detection of Silica-Rich Deposits on Mars|doi=10.1126/science.1155429}}
* (impactgardening1.pdf) {{cite journal|title=Martian Cratering 7: The role of Impact Gardening|journal=[[Icarus (journal)|Icarus]]|doi=10.1006/icar.2000.6532|author=Hartmann et al|year=2001|volume=149|pages=37-51}}
* (impactgardening2.pdf) {{cite journal|title=Asteroidal Regoliths|journal=[[Icarus (journal)|Icarus]]|volume=39|pages=317-351|author=Housen et al|year=1979|doi=10.1016/0019-1035(79)90145-3}}
* (jezerocrater.pdf) {{cite journal|url=http://www.nature.com/ngeo/journal/v1/n6/full/ngeo207.html|title=Clay minerals in delta deposits and organic preservation potential on Mars|journal=Nature Geoscience|year=2008|author=Ehlmann et al|doi=10.1038/ngeo207|volume=1|pages=355 - 358}}
* (minites.pdf) {{cite journal|title=Miniature thermal emission spectrometer for the Mars Exploration Rover|journal=[[Acta Astronautica]]|author=Silverman et al|volume=59|issue=8-11|pages=990-999|doi=10.1016/j.actaastro.2005.07.055|year=2005}}
* (northpolarbasin1.pdf) {{cite journal|title=The Borealis basin and the origin of the Martian crustal dichotomy|author=Andrews-Hanna, et al|journal=[[Nature (journal)|Nature]]|volume=453|pages=1212 - 1215|doi=10.1038/nature07011}}
* (northpolarbasin2.pdf) {{cite journal|title=Mega-impact formation of the Mars hemispheric dichotomy|author=Marinova, et al|journal=[[Nature (journal)|Nature]]|volume=453|pages=1216 - 1219|doi=10.1038/nature07070}}
* (northpolarbasin3.pdf) {{cite journal|title=Implications of an impact origin for the Martian hemispheric dichotomy|author=Nimmo, et al|journal=[[Nature (journal)|Nature]]|volume=453|pages=1220 - 1223|doi=10.1038/nature07025}}
* (planetx.pdf) {{cite journal|title=An Outer Planet beyond Pluto and the origin of the Trans-Neptunian Belt Architecture|url=http://harbor.scitec.kobe-u.ac.jp/~patryk/Patryk-Planetoid.pdf|author=Lykawka, P.|coauthors=Mukai, T|journal=[[Astronomical Journal]]|date=2008|doi=10.1088/0004-6256/135/4/1161|volume=135|pages=1161-1200}}
* (secondarycraterdating.pdf) {{cite journal|title=The importance of secondary cratering to age constraints on planetary surfaces|author=McEwen, A|coauthors=Bierhaus, E|year=2006|journal=Annual Review of Earth and Planetary Sciences|doi=10.1146/annurev.earth.34.031405.125018|volume=34|pages=535 - 567}}
* (tychosecondaries.pdf) {{cite journal|title=Rays and secondary craters of Tycho|author=Dundas, M.|coauthors=McEwen, A.|journal=[[Icarus (journal)|Icarus]]|doi=10.1016/j.icarus.2006.08.011|year=2007|volume=186|pages=31 - 40}}
* (zunilcrater.pdf) {{cite journal|author=McEwen, A.S. et al|url=http://www.mars.asu.edu/christensen/classdocs/mcewen_zunil_Icarus_2005.pdf|title=The rayed crater Zunil and interpretations of small impact craters on Mars|journal=[[Icarus (journal)|Icarus]]|volume=176|pages=351 - 381|date=2005}}
* (zunilcrater2.pdf) {{cite conference|author=McEwen et al|title=Discovery of a large rayed crater on Mars: Implications for recent volcanic and fluvial activity and the origin of Martian meteorites|conference=Lunar and Planetary Science Conference|date=2003|url=http://www.lpi.usra.edu/meetings/lpsc2003/pdf/2040.pdf}}

LPSC abstract dump: ftp://ftp.lpi.usra.edu/pub/outgoing Should probably read the descriptions at http://www.lpi.usra.edu/meetings/lpsc2007/lpsc2007.download.shtml (substitute relevant year) first.

=Spam tutorial=
==temp==
*What is spam? Spammers are <u>not</u> welcome on Wikipedia.

==Spam pages==
[[Image:Spam tutorial - spam page.png|thumb|300px|right|A spam page]]

A spam page, naturally, is one created with the sole intention of advertising some business or site. Spam pages tend to fall in three loose categories:

*Blatantly promotional spam pages look as though they're a verbatim copy of an "about us" section on a company website. These are relatively easy to spot, as they contain patently unencyclopedic phrases like "our products" or "for more information, visit http://example.com". You can use Google to find them, for example the query <tt>[http://www.google.com/search?q=site:en.wikipedia.org+%22our+products%22 site:en.wikipedia.org "our products"]</tt> throws up a number of pages that have been edited by spammers.

'''Exercise 1.1:''' Try finding a few spam pages for yourself. It shouldn't be that hard.

*The second type of spam page is best described as corporate vanity; see [http://www.nabble.com/Corporate-vanity-policy-enforcement-p6585535.html this mailing list post] for details. Essentially, if an account that has a business name as a username creates an article on said business, then everyone knows why they did it. There's a good chance the creator is a disallowed [[m:Role account|role account]]. You can easily find the two types of spam page above through [[WP:NPP|new pages patrol]].

*The third type of spam page is typically associated with linkspamming campaigns (see below). Here, the page is created for the purpose of housing external links to the spammed site(s). These often have salvageable content and aren't obviously advertising, so they aren't deletable as spam. However, these pages may still be copyvios, have non-notable subjects or suffer from other problems. For some examples, have a look at these article creations, more precisely the contents of the external link section in each: [http://en.wikipedia.org/w/index.php?title=Julia_Emilia_Vald%C3%A9s_Borrero&oldid=173818867] [http://en.wikipedia.org/w/index.php?title=Carlos_Trillo_Name&oldid=172853619] [http://en.wikipedia.org/w/index.php?title=Carlos_Rafael_Uribazo_Garrido&oldid=172848676] [http://en.wikipedia.org/w/index.php?title=Esterio_Segura_Mora&oldid=170374263] [http://en.wikipedia.org/w/index.php?title=Raul_Santoserpa&oldid=169538920].

When you find a deletable spam page (i.e. not of the third type), tag it for speedy deletion using {{tl|db-spam}} or (admins only) delete it yourself. Report the creator to the admins or block indefinitely because they're obviously not here to improve the encyclopedia. For images used exclusively on spam pages, check to see whether they are copyvios or have missing source/license information. If this is so, tag the image with {{tl|db-imagevio}}, {{tls|nsd}} and {{tls|nld}} for the respective problems.

==Link spam==

There are three things we need to consider in order to determine whether an external link addition is spam:

*The "what", or the content of the site. Does the site fall under any of the categories at [[WP:ELNO]] (i.e. links to avoid)?
*The "how" - how is the link added? Are the links added haphazardly or are the additions part of a systematic campaign to insert links to various sites?
*The "who" - who is adding the links? Are the links added by established editors or by [[WP:SPA|single purpose accounts]]? Are the usernames related to the site or its maintainers? Where do the IPs used resolve to?

I will focus on the "how" - identifying systematic campaigns and dealing with them, but the two other aspects will come into play. Firstly, some background reading.

'''Exercise 2.1:''' Familiarize yourself with the following policies, guidelines and articles.

*[[Wikipedia:External links]]
*[[Wikipedia:Spam]]
*[[Wikipedia:Conflict of interest]]
*[[Wikipedia:What Wikipedia is not]]
*[[Spamdexing]]
*[[Wikipedia:FAQ/Business]]

It is highly recommended that you have rollback abilities because spammers will add their links to multiple articles. If you don't, you may seek them at [[Wikipedia:Requests for permissions]].

===Trawling the wiki===
[[Image:Spam tutorial - RC feed.png|thumb|300px|right|RC feed]]

OK, you want to revert some spam, but how do you find it? The most effective way of trawling the wiki is through recent changes, or a filtered version of it. There are two [[Internet Relay Chat]] (IRC) channels you can connect to; choose one of them:

*irc://irc.freenode.net/wikipedia-en-spam (link additions only)
*irc://browne.wikimedia.org/en.wikipedia (everything, useful if you want to fight vandals at the same time)

It's time to install and set up some software. These channels are somewhat incompatible, which means software built for #en.wikipedia probably won't work on #wikipedia-en-spam and vice versa. For #wikipedia-en-spam, any decent IRC client will do. For #en.wikipedia, I recommend [[WP:VF|Vandal Fighter]] (requires Java). Find something that works for you. However, I know of no automated reversion tool that has the capability required for effective spam patrolling. In particular, do not use Huggle! (Read the tutorial once through and you'll see why.)

We're good to go, assuming you set up your software correctly. You may want to configure your software to highlight/show only changes to external link sections.

[snip]

Open the diff in your browser. If it's not a link addition, ignore it. If it is, does the link look like it could be spam? This can usually be determined with a quick glance, by looking at the TLD (see exercise below) or recognizing the reputation of the provider of the service, e.g. bbc.co.uk == [[British Broadcasting Corporation]], nature.com == ''[[Nature (journal)|Nature]]''. (These aren't likely to be spam). If you're not sure, do a quick assessment of the content of the link against [[WP:ELNO]].
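The "quick glance" at a link can be sketched in code. The following is only an illustration of pulling the host name and its ending out of a URL; the function name and the tiny list of two-label endings are hypothetical, and a real tool would need a complete public suffix list.

```python
from urllib.parse import urlsplit

# Hypothetical, deliberately tiny list of two-label endings (assumption);
# it is only here so that bbc.co.uk is not reported as plain ".uk".
TWO_LABEL_SUFFIXES = {"co.uk", "ac.uk", "gov.au", "com.au"}


def host_and_suffix(url):
    """Return (hostname, ending) for a URL, e.g. ('www.bbc.co.uk', 'co.uk')."""
    host = (urlsplit(url).hostname or "").lower()
    labels = host.split(".")
    last_two = ".".join(labels[-2:])
    # Prefer a known two-label ending when the host has more than two labels.
    if len(labels) > 2 and last_two in TWO_LABEL_SUFFIXES:
        return host, last_two
    return host, labels[-1]


if __name__ == "__main__":
    for link in ("http://www.bbc.co.uk/news", "http://www.nature.com/ngeo/"):
        print(host_and_suffix(link))
```

The ending alone never settles the question; it only tells you which links deserve the closer look against [[WP:ELNO]].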

'''Exercise 2.2:''' Which of the following TLDs are more likely to be spammy - [[.com]], [[.org]], [[.gov]], [[.edu]], [[.net]], [[.ac.uk]], [[.tv]], [[.info]], [[.biz]]?

'''Solution:''' (view source to reveal answer) <span style="display: none">.com, .org (non-profits are not exempt from our spam rules!), .net, .tv (although this TLD belongs to Tuvalu, it is marketed for television related sites), .info, .biz</span>

[[Image:Spam tutorial - contributions.png|thumb|300px|right|Contributions of a spammer]]

Bring up the user's contributions. Assess these against the following table on the suitability of link additions, which is an excerpt from [[MediaWiki:Spam-blacklisting]].

{| style="text-align:center; background: transparent"
|-
| width=14% | Criteria
| width=14% | '''Addition Frequency / Volume'''
| width=14% | '''Account / Intention'''
|-
| style="background:#efe;" | '''Links to potentially include'''
| style="background:#efe;" | Isolated event
| style="background:#efe;" | High-volume established editor
|-
| style="background:#ffe;" |
| style="background:#ffe; font-size:15pt; padding-bottom:6px" | ↕
| style="background:#ffe; font-size:15pt; padding-bottom:6px" | ↕
|-
| style="background:#fee;" | '''Links to exclude'''
| style="background:#fee;" | Campaign
| style="background:#fee;" | Spam only account
|-
| '''''Relevant authorities'''''
|
*[[WP:SPAM|Spam policy]]
*[[Wikipedia:Spam#Source_soliciting|Source soliciting]]
*[[Wikipedia:Spam#External_link_spamming|External link spamming]]
*[[WP:NOT|What Wikipedia is not]]
*[[WP:NOT#REPOSITORY|Not repository]]
*[[WP:NOT#DIR|Not a directory]]
*[[Wikipedia:What_Wikipedia_is_not#Wikipedia_is_not_a_soapbox|Not advertising]]
|
*[[WP:SOCK|Sock puppetry]]
*[[WP:SPA|Single-purpose account]]
*[[Wikipedia:Sock_puppetry#.27Role.27_accounts|'Role' accounts]]
*[[Wikipedia:Sock_puppetry#Inappropriate_uses_of_alternative_accounts|Inappropriate accounts]]
*[[Wikipedia:What_Wikipedia_is_not#Wikipedia_is_not_a_soapbox|Not advertising]]
*[[Wikipedia:Conflict of interest|Conflict of interest]]
*[[WP:EL#Advertising_and_conflicts_of_interest|Advertising and COI]]
*[[Wikipedia:COI#Blocks|Accounts used for promotion]]
|}

Once you've satisfied yourself that these are links to exclude and they were added in a systematic manner, revert the additions and warn the user.

===Warnings===
[[Image:Spam tutorial - warning spammer.png|thumb|right|300px|Warning a spammer]]
Since spammers are not welcome here, you should initially avoid using any template with the text "Welcome to Wikipedia". {{tl|uw-spam2}} is usually a good start. You can add {{tl|uw-coi}} if you suspect the user has a conflict of interest. If a suspected spammer turns out to be a good faith editor, you can always add a welcome template later.

Include in your warning a tracking URL, typically of the form http://spam.example.com . This allows other spam patrollers to determine when a domain was previously spammed and by whom (it shows up in the linksearch). Personally, I put the URL in the header, but you don't have to. If multiple sites were spammed, you can include multiple URLs or the URL of the company owning the sites.
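If you warn a lot of spammers, building the tracking URL by hand gets tedious. Here is a minimal sketch; the function names are hypothetical, and dropping a leading "www." is my assumption about what makes a tidy tracking URL rather than a documented rule.

```python
def tracking_url(domain):
    """Build a tracking URL of the form http://spam.<domain>."""
    domain = domain.strip().lower()
    # Assumption: a leading "www." adds nothing, so it is dropped.
    if domain.startswith("www."):
        domain = domain[len("www."):]
    return "http://spam." + domain


def warning_header(domains):
    """Section header carrying one tracking URL per spammed domain."""
    return "== " + " ".join(tracking_url(d) for d in domains) + " =="


if __name__ == "__main__":
    print(warning_header(["www.example.com", "example.org"]))
```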

If the user ignores your warning and continues to add links, follow up with a {{tl|uw-spam4}} and, if necessary, a block or a report to the admins. Ignoring warnings makes it almost certainly spam, so we move on to the next step.

===Investigating===
Now we have to determine whether the spamming campaign is just limited to the user you just caught. There are two angles of attack: one through existing links and the other through the sites themselves. Both need to be examined. You might find it easier if you investigate and prepare the spam report (see Reporting, below) concurrently.

====Looking through existing links====
[[Image:Spam tutorial - linksearch.png|thumb|right|300px|Looking for other spammers with [[Special:Linksearch]]]]

Before we start, I'd like to introduce some "tools of the trade".

'''Exercise 2.3:''' Familiarize yourself with {{tl|spamlink}}, {{tl|IPSummary}} and {{tl|UserSummary}}, their meta equivalents ([[m:Template:Spamlink]], [[m:Template:IPSummary]] and [[m:Template:UserSummary]]) and the function of each link in these templates. I may talk about some of them later. Here are three example uses for your convenience.

*{{spamlink|bom.gov.au}}
*{{IPSummary|127.0.0.2}}
*{{UserSummary|Jimbo_Wales}}

Start by performing linksearches on the spammed domain(s). It's the first item in {{tl|spamlink}} or, directly, [[Special:Linksearch/*.example.com]]. You should see the tracking URL you placed in your warning - that's why you did it.
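The same linksearch can, as far as I know, be run through the MediaWiki web API using its <tt>exturlusage</tt> list module. The sketch below only builds the request URL and fetches nothing; the function name and the default limit of 50 are my own choices.

```python
from urllib.parse import urlencode


def linksearch_api_url(domain, lang="en", limit=50):
    """Build an api.php URL that lists pages linking to *.<domain>."""
    params = {
        "action": "query",
        "list": "exturlusage",     # API counterpart of Special:Linksearch
        "euquery": "*." + domain,  # wildcard covers all subdomains
        "eulimit": limit,
        "format": "json",
    }
    return "https://%s.wikipedia.org/w/api.php?%s" % (lang, urlencode(params))


if __name__ == "__main__":
    print(linksearch_api_url("example.com"))
```

Passing a different <tt>lang</tt> (say, "de") points the same query at another language edition.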

Now for each article result, look for the link in the "references" (if no inline citations) and/or "external links" sections. If it's there, then you can use [http://wikipedia.ramselehof.de/wikiblame.php WikiBlame] to find who added the link. Whack in the description of the link, enter an appropriate amount of revisions (500 will do) and choose interpolated search for the fastest results. Bear in mind it doesn't work all of the time - there's always the page history.
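To see why a search like this is fast, and why it doesn't always work, here is a minimal sketch of bisecting a page history for the first revision that contains a string. This is my reading of the general idea, not WikiBlame's actual code. It assumes the text, once added, stays in every later revision; a link that was added, removed and re-added breaks that assumption.

```python
def first_revision_with(revisions, needle):
    """Index of the earliest revision containing needle, or None.

    revisions is a list of revision texts, oldest first. Assumes that once
    needle appears it is present in every later revision.
    """
    if not revisions or needle not in revisions[-1]:
        return None
    lo, hi = 0, len(revisions) - 1
    # Invariant: revisions[hi] contains needle.
    while lo < hi:
        mid = (lo + hi) // 2
        if needle in revisions[mid]:
            hi = mid        # added at mid or earlier
        else:
            lo = mid + 1    # added after mid
    return lo


if __name__ == "__main__":
    history = ["a", "a b", "a b http://example.com", "a b http://example.com c"]
    print(first_revision_with(history, "http://example.com"))
```

With 500 revisions this needs about nine lookups instead of up to 500, which is the whole appeal.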

'''Exercise 2.4:''' Who added a link to thejhelum.com to the article [[Jhelum River]]? Use WikiBlame.

'''Solution:''' (view source to reveal answer) <span style="display: none">{{User|202.61.63.10}} did, see [http://en.wikipedia.org/w/index.php?title=Jhelum_River&diff=prev&oldid=195496563]. I hope you didn't cheat and look at the page history because I've already removed the link. And besides, directly using the page history is usually much slower.</span>

[[Image:Spam tutorial - move links up.png|thumb|300px|right|A spammer moving links up.]]

For each editor who added the links, check their contributions and revert any other spamming. You should warn all IPs and users whose contributions are almost exclusively spam, no matter when they spammed. Check the edits for any other domains spammed and any bad-faith behaviour. This includes, but is not limited to:

*Vandalism when adding spam links, e.g. [http://en.wikipedia.org/w/index.php?title=Yiddish_language&diff=224099021&oldid=224053523]
*Vandalism of link records, e.g. [http://en.wikipedia.org/w/index.php?title=Wikipedia_talk%3AWikiProject_Spam&diff=224515796&oldid=224438306]
*Moving links up
*Replacing existing links
*Ignoring the meaning of the term "references", e.g. [http://en.wikipedia.org/w/index.php?title=Asher_D_%28British_rapper%29&diff=prev&oldid=221179412]
*Citation spamming, e.g. [[Special:Contributions/Webgeek]] (What do all the added citations have in common?)
*Creation of first person spam pages
*Using [[open proxy|open proxies]] to spam
*Cross-wiki spamming (see below)

Note all spammers in your report under an appropriate heading using {{tl|IPSummary}} or {{tl|UserSummary}} as appropriate, along with any bad-faith behaviour. Also list the sites spammed in a separate section with {{tl|spamlink}}, adding tracking URLs if you are not considering blacklisting.

The linksearch also helps us look for previous incidents through tracking URLs placed by other spam patrollers. By "previous incident" I mean prior [[WT:WPSPAM]] reports, blacklistings, deletion of spam pages through AFD or MFD and any other project-space discussion. [[Special:Linksearch/*.squidoo.com|Here is an egregious example]] (now blacklisted globally). List these in their own heading in the report.

You might also find the occasional spam page, especially on the user pages of registered spammers. You know how to deal with these. If you're an admin, have a look at the deleted contributions of any registered spammers to find previously deleted spam pages. List spam pages in the report with {{tl|la}} (articles), {{tl|li}} (uploaded images) or {{tl|lu}} (user pages).

====The sites themselves====
It's time to get your hands dirty and visit the sites in question. Every site is different, hence I can only give you a few pointers on what to look for. I suggest you keep your wits about you and your adblockers enabled because you never know what they're going to serve up before it's too late...

*Are the spammed sites owned by the same company or person? This is usually obvious from the content of the site, but you can look at [[WHOIS]] records to confirm this.

'''Exercise:''' Are dailygujrat.com and geokashmir.com owned by the same company? Hint: [http://web.archive.org The Internet Archive] may be handy here.

'''Solution:''' (view source to reveal answer) <span style="display: none">Yes, they are. Both sites are "project[s] of JhelumSoft", as said on the bottom of the page. jhelumsoft.net is offline, so this is where the Internet Archive comes in handy.

I also note they sell [[search engine marketing]] ([http://web.archive.org/web/20070505102036/www.jhelumsoft.net/internetmarketing.html]). What a coincidence.</span>

*Are the sites hosted on the same server? The tools [http://onsamehost.com OnSameHost.com] and [http://whosonmyserver.com Who's on my server?] (they're in {{tl|spamlink}}) fit the purpose.

*Look for Adsense ads. You don't have to disable Adblock in order to do this - just open up the page source and look for a string starting with "pub-", followed by many digits. This is the Adsense ID of the site and is owner-specific.

'''Exercise:''' What is the Adsense ID of dailypunjab.com?

'''Solution:''' (view source to reveal answer) <span style="display: none">pub-0371265814726923</span>
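Looking for the ID in the page source is easy to script once you have the source saved. A minimal sketch follows: the requirement of at least ten digits is my assumption standing in for "many digits", and the IDs in the sample are made up.

```python
import re

# "pub-" followed by many digits; ten or more is an assumed threshold.
ADSENSE_ID = re.compile(r"pub-\d{10,}")


def adsense_ids(page_source):
    """Return the distinct Adsense-style IDs found in a page's source."""
    return sorted(set(ADSENSE_ID.findall(page_source)))


if __name__ == "__main__":
    sample = 'google_ad_client = "pub-0000000000000000"; ca-pub-1111111111111111'
    print(adsense_ids(sample))
```

Two sites that turn up the same ID are very likely run by the same owner.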

*Do the sites look the same?

*Poke around the sites for any related domains. Publishers sometimes list their projects on each page for SEO purposes or have a page detailing their other projects. These should be listed with tracking URLs in the spam report under their own heading. The [http://url-info.appspot.com URL info tool] is handy for finding related domains in a link farm. You may want to invest some time in getting a text editor that has regex find and replace - it'll save you a lot of time in formatting long lists of related domains.

'''Exercise:''' List all domains related to (www.)webooks.co.uk.

'''Solution:''' (view source to reveal answer) <span style="display: none">See http://en.wikipedia.org/w/index.php?title=Wikipedia_talk:WikiProject_Spam&oldid=224336142#Webooks_Network_http://spam.webooks.co.uk . I hope you got all 154 domains. Usually they aren't that bad.</span>
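The regex find and replace for formatting a long list of related domains can equally be done in a few lines of script. This is a sketch only: the function name is hypothetical, and it assumes the input has one URL or domain per line.

```python
import re

# Optional scheme, optional "www.", then the domain; any path is discarded.
DOMAIN_LINE = re.compile(r"^(?:https?://)?(?:www\.)?([^/\s]+)\S*$")


def to_spamlink_lines(raw):
    """Turn one URL or domain per line into *{{spamlink|domain}} lines."""
    out = []
    for line in raw.splitlines():
        line = line.strip()
        if line:  # skip blank lines
            out.append(DOMAIN_LINE.sub(r"*{{spamlink|\1}}", line))
    return "\n".join(out)


if __name__ == "__main__":
    print(to_spamlink_lines("http://www.example.com/page\nexample.org\n"))
```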

*Another thing to consider is whether the site(s) employ [[spamdexing]], black-hat or overenthusiastic SEO techniques. Here are some examples:

:*tuanlinhtravel.com: meta tag stuffing, link farming, hidden links (blacklisted locally)
:*adorons.com: scraper site (banned by the devs)
:*ctkohl.googlepages.com: spam in blogs (e.g. [http://www.beppegrillo.it/eng/2007/09/the_maroni_law_1.html triple post] [http://www.cbs.columbia.edu/weblog/2007/10/the-ancient-ind.html]), comment spam (e.g. [http://mycommitment.org/poverty/day1/post1] [http://community.nytimes.com/article/comments/2008/03/18/world/asia/18china.html?s=1&pg=7]), wiki spam, splogs (e.g. nagarjunafreiburg.wordpress.com, buddhismandquantumphysics.blogspot.com), mirror websites (blacklisted globally). To find the spam here, we google search [[google:"christian+thomas+kohl"+blog|"christian thomas kohl" blog]].
:*howtoretireabroad.info: I can't think of a legitimate reason why a site would have links to 75 social networking sites. Five, yes, but 75? Umm, no.

===Reporting===
[[Image:Spam tutorial - complete report.png|thumb|right|300px|Complete report]]

You didn't gather all that information above for nothing - now it's time to tell people about it. Post a new section on [[Wikipedia talk:WikiProject Spam]], containing the following things (in handy checklist format):

*[ ] Previous incidents
*[ ] Spam pages
*[ ] Sites spammed
:*[ ] Adsense ID, if appropriate
:*[ ] Tracking URLs, if not considered for blacklisting
*[ ] Related domains
:*[ ] Tracking URLs, if not considered for blacklisting
*[ ] Spammers
:*[ ] Evidence of bad faith behavior, if appropriate
*[ ] Course of action, see "blacklisting" below

Domains should be listed with {{tl|spamlink}}, registered users with {{tl|UserSummary}} and IPs with {{tl|IPSummary}}. At the very least, you should list sites spammed and who spammed them. For examples of completed reports, see the archives of WT:WPSPAM.
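For the bare minimum of a report (sites spammed and who spammed them), the choice between templates can be automated. A minimal sketch: the function names and the "; heading" layout are my assumptions, not a required report format.

```python
import ipaddress


def is_ip(name):
    """True if name parses as an IPv4 or IPv6 address."""
    try:
        ipaddress.ip_address(name)
        return True
    except ValueError:
        return False


def spam_report(sites, spammers):
    """Wikitext listing sites with spamlink and spammers with the right summary template."""
    lines = ["; Sites spammed"]
    lines += ["*{{spamlink|%s}}" % site for site in sites]
    lines.append("; Spammers")
    for name in spammers:
        # IPs get IPSummary, registered users get UserSummary.
        template = "IPSummary" if is_ip(name) else "UserSummary"
        lines.append("*{{%s|%s}}" % (template, name))
    return "\n".join(lines)


if __name__ == "__main__":
    print(spam_report(["example.com"], ["192.0.2.7", "Example_user"]))
```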

Now if there are previous incidents or the scale of the spamming justifies it, proceed to the blacklisting phase.

===Blacklisting===
[[Image:Spam-fltr-NAS.PNG|thumb|right|300px|Attempts to insert blacklisted URLs are blocked]]
Wikipedia maintains two spam blacklists - [[m:Spam blacklist|one on meta]] and [[MediaWiki:Spam-blacklist|one locally]]. MediaWiki prevents the addition of new URLs that match either blacklist and are not on the [[MediaWiki:Spam-whitelist|spam whitelist]]. The meta blacklist affects all Wikimedia projects, all of Wikia and some other sites, while the local blacklist affects the English Wikipedia only. The blacklists are editable only by meta and local administrators respectively. There is also [[User:XLinkBot]], which is the ClueBot equivalent for spam.

There are five things to examine when considering a site for blacklisting:

<ol>
<li> Does the content of the site have any use to Wikimedia projects? Sites with no useful content, e.g. gambling sites, can be globally blacklisted despite the spamming being restricted to one project. Compare the website against [[WP:ELNO]]. If a site has useful content and other good-faith editors agree (say, by using it for references) consider using XLinkBot.

<li> What is the scale of the spamming? Widespread spamming can lead to immediate blacklisting.

[[Image:Spam tutorial - cross wiki searching.png|thumb|right|300px|Doing a cross-wiki linksearch.]]

<li> Whether the spamming is spread across multiple Wikimedia projects. You can use the [http://toolserver.org/~eagle/linksearch cross-wiki linksearch] to search up to 57 Wikipedias and/or the [http://toolserver.org/~luxo/contributions/contributions.php cross-wiki contributions tool] as a preliminary. If there are hits, check the articles to see who added the link and whether they are spammers. If there are any additional spammers, the domain(s) are eligible (and recommended) for global blacklisting. List any additional spammers at WT:WPSPAM with an annotation that they have spammed non-English wikis. If the spam is extremely widespread, you can ask me to run a search of all Wikimedia projects.

<li> Whether the site or related sites have been spammed before. The list of previous incidents you compiled earlier should come in handy.

<li> Whether the site attempts to install malware (these should be blacklisted globally).
</ol>

[[Image:Spam tutorial - local blacklisting.png|thumb|right|300px|Since x-wiki was empty, local blacklist it is]]

To file a blacklisting request, list the domains that were spammed at [[MediaWiki talk:Spam-blacklist]] or [[m:Talk:Spam blacklist]] under the appropriate section, along with a link to the WikiProject Spam report. If you are seeking global blacklisting, add the cross-wiki spammers to your request. A template, {{tl|WPSPAM}} (exists locally only), allows you to add permanent links to such reports. An administrator will come along and process your request.

If you're an admin then you can blacklist (locally) directly, see [[MediaWiki:Spam-blacklisting]] for details.

===Other courses of action===
Unfortunately, blacklisting doesn't stop all spammers. I won't go into this in detail for a [[WP:BEANS|fairly good reason]], but it's good to know what to do when these cases crop up.

*Firstly, you need to be able to find spam that slips in between the cracks. Identify a unique phrase or string (e.g. a company name) that is used almost exclusively by the spammer, then search for it. [[Special:Search|Wikipedia's own search engine]] is barely adequate for the purpose, especially in the case of cross-wiki spam. If you must use it, set it to search the article and user namespaces. Otherwise, just use Google. <tt>site:wikipedia.org X</tt> is a good starting point, where X is the unique string you identified. If multiple projects have been spammed you can use OR to add projects searched, e.g. <tt>site:wikipedia.org OR site:wiktionary.org X</tt> These Google searches are automatically cross-wiki.

*Block any new sockpuppets and blacklist any new sites promptly.

*If there is a large number of registered spammers, consider [[WP:RFCU|requesting checkuser]].

*Don't be afraid to ask for or make large and long rangeblocks. In the case of a company spamming, are the IPs registered to the company? Use a WHOIS, [[reverse DNS]] or a traceroute to find out. If they are, you can block for longer durations because they are likely static IPs.

*Is the spam focused on a couple of pages? Try getting them protected.

*Call in the wider community. This approach works best when you have a very widespread problem (hundreds of spam link additions).

*...
*PROFIT!
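The Google searches mentioned in the first point above are easy to assemble in code when you have several projects or phrases to check. A minimal sketch: the function names are hypothetical, and wrapping the phrase in quotes is my assumption that you want an exact match.

```python
from urllib.parse import urlencode


def cross_wiki_query(phrase, sites=("wikipedia.org",)):
    """Build a query such as: site:wikipedia.org OR site:wiktionary.org "X"."""
    site_part = " OR ".join("site:" + s for s in sites)
    return '%s "%s"' % (site_part, phrase)


def google_search_url(query):
    """Percent-encode the query into a Google search URL."""
    return "http://www.google.com/search?" + urlencode({"q": query})


if __name__ == "__main__":
    q = cross_wiki_query("Example Corp", ("wikipedia.org", "wiktionary.org"))
    print(q)
    print(google_search_url(q))
```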

==Being prepared for the usual complaints==
{{see also|Wikipedia:Grief}}

How you deal with complaints from suspected spammers about their links being removed or page being deleted is dependent on your personality and your current mood. The only way to find a "strategy" that works for you is through experience, though I can give you a few tips. Successfully dealing with suspected spammers involves using many of these techniques.

*State specifically and succinctly what's wrong with the user's edits. (This takes a bit of practice.) Refer to your notes (if applicable) about bad-faith activities. Tell them that systematic additions of external links look like spamming and, if they are a good-faith editor, point them to our tips on [[Wikipedia:Spam#How not to be a spammer|not setting off the spam radar]]. You can also comment on the content of the sites, in which case state specifically which points of [[WP:ELNO]] the spammed site(s) fail.

*Some complaints are abusive. It's best if you ignore these, unless they contain [[WP:NLT|legal threats]] or severe [[WP:NPA|personal attacks]] in which case you should get the user blocked.

*Be prepared to be blunt. Some users are utterly clueless. It can take you several goes to get the point across that they can contribute to the encyclopedia without adding links to a particular website. I've personally contemplated using big colorful blinking text many times, but they get it before I roll this technique out.

*Ask the hard questions, e.g. why this particular (group of) site(s)? [http://en.wikipedia.org/w/index.php?title=Wikipedia%3AAdministrators%27_noticeboard%2FWebgeek&diff=176358688&oldid=176345258 Example application] (note the spammer hasn't been seen since).

*You might want to inform the user about the consequences of spamming Wikipedia and that it can backfire spectacularly:

:*Our global blacklist affects [[m:List of Wikimedia projects|all Wikimedia projects]], all of [[Wikia]] and hundreds of third-party websites that use our blacklists for spam filtering. We generally do not remove sites from the blacklist at the requests of their owners, but only when high-volume editors can demonstrate a valuable encyclopedic use for the site. Therefore it is extremely difficult to get sites delisted.

:*Some search engines take Wikimedia's blacklists as [http://www.searchenginejournal.com/wikipedia-spam-resulting-in-google-yahoo-penalties/5854/ user submitted spam reports], which may result in spammed sites being penalized or delisted from search results.

:*Our records of the spamming may feature prominently in search results for the same reason spammers add links here - high PageRank. This is especially true for persistent spammers. In particular, [[Wikipedia talk:WikiProject Spam]] has a PageRank of 6.

:*Spamming may result in [[Wikipedia:Wikipedia Signpost/2007-08-20/WikiScanner|negative PR]] for your client or employer.

*If you deleted someone's spam page and they complain about it, don't waffle on about notability. You should explain simply why their page is not permissible in an encyclopedia and appeal to their common sense (and not their knowledge of Wikipedia policy) e.g. "Do [[Encyclopedia Britannica]] or [[World Book]] contain promotional blurbs about companies written by that company?"

*Spammers often go through a [[WP:Grief|grieving process]]. It's your job to get them through this as fast as possible.

*Tell them it's not worth the effort - "it took you several hours to add those links; it took me a minute to remove them."

And that's it. Wikipedia is not a restaurant full of [[Spam (Monty Python)|bloody Vikings]], so kick them out.

==Useful links==
*[[Wikipedia:WikiProject Spam]]
*[[Wikipedia talk:WikiProject Spam]] (spam reports go here)
*[[MediaWiki talk:Spam-blacklist]] (local spam blacklist)
*[[MediaWiki talk:Spam-whitelist]] (spam whitelist)
*[[m:Talk:Spam blacklist]] (global spam blacklist)
*[[User talk:XLinkBot/RevertList]] (XLinkBot revert list)
*[http://url-info.appspot.com/ URL-Info page analysis tool]
*[http://wikipedia.ramselehof.de/wikiblame.php WikiBlame]

Revision as of 08:06, 27 October 2008