
MediaWiki talk:Spam-blacklist: Difference between revisions

From Wikipedia, the free encyclopedia


After reviewing the spam report very carefully, I think it's '''possible''' that ASAPS may even be an innocent party. The spammers were using these two sites to blend in with their own spam links in an attempt to look legitimate. However, lacking more information, I will let someone else put in a request for ASAPS, or put the request in at another time, but ASPS has no reason at all to be included on the blacklist. They were an innocent bystander.[[User:Ssc-capricorn|ssc-capricorn]] ([[User talk:Ssc-capricorn|talk]]) 20:04, 17 December 2013 (UTC)

== tanners-wines.co.uk ==
{{Link summary|tanners-wines.co.uk}}
This site was blacklisted in February 2008 when it was hacked. However, it is now safe and well maintained. There is a Wikipedia page at https://en.wikipedia.org/wiki/Tanners_(company) which, from a usability perspective, would benefit from a link to the actual Tanners site. Given that the company has a genuine history and a good online reputation, and that it was originally blacklisted because of hacking issues, I think there is compelling reason to remove this site from the blacklist. [[User:CCarson789|CCarson789]] ([[User talk:CCarson789|talk]]) 16:24, 19 December 2013 (UTC)


=Completed Proposed removals=

Revision as of 16:24, 19 December 2013

    Mediawiki:Spam-blacklist is meant to be used by the spam blacklist extension. Unlike the meta spam blacklist, this blacklist affects pages on the English Wikipedia only. Any administrator may edit the spam blacklist. See Wikipedia:Spam blacklist for more information about the spam blacklist.


    Instructions for editors

    There are 4 sections for posting comments below. Please make comments in the appropriate section. These links take you to the appropriate section:

    1. Proposed additions
    2. Proposed removals
    3. Troubleshooting and problems
    4. Discussion

    Each section has a message box with instructions. In addition, please sign your posts with ~~~~ after your comment.

    Completed requests are archived. Additions and removals are logged, reasons for blacklisting can be found there.

    Adding the templates {{Link summary}} (for domains), {{IP summary}} (for IP editors) and {{User summary}} (for users with an account) causes the COIBot reports to be refreshed. See User:COIBot for more information on the reports.


    Instructions for admins
    Any admin unfamiliar with this page should probably read this first, thanks.
    If in doubt, please leave a request and a spam-knowledgeable admin will follow-up.

    Please consider using Special:BlockedExternalDomains instead, powered by the AbuseFilter extension. This is faster and more easily searchable, though only supports whole domains and not whitelisting.

    1. Does the site have any validity to the project?
    2. Have links been placed after warnings/blocks? Have other methods of control been exhausted? Would referring this to our anti-spam bot, XLinkBot be a more appropriate step? Is there a WikiProject Spam report? If so, a permanent link would be helpful.
    3. Please ensure all links have been removed from articles and discussion pages before blacklisting. (They do not have to be removed from user or user talk pages.)
    4. Make the entry at the bottom of the list (before the last line). Please do not do this unless you are familiar with regular expressions — the disruption that can be caused is substantial.
    5. Close the request entry on here using either {{done}} or {{not done}} as appropriate. The request should be left open for a week maybe as there will often be further related sites or an appeal in that time.
    6. Log the entry. Warning: if you do not log any entry you make on the blacklist, it may well be removed if someone appeals and no valid reasons can be found. To log the entry, you will need this number – 586805692 after you have closed the request. See here for more info on logging.


    Proposed additions


    marketsandmarkets.com

    marketsandmarkets.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com --Dennis Bratland (talk) 17:37, 21 November 2013 (UTC)[reply]

    Is a blacklist really needed when at least one of these additions was not spam, one was through the AFC process, few notices have been placed, and there is no shown evidence of ongoing spamming? Liamdavies (talk) 17:26, 12 December 2013 (UTC)[reply]
    This has been ongoing for quite some time, but seems to have stopped a couple of weeks ago - warnings have been issued/blocks handed out - let's see if they were heeded. As I said above, let's see if it continues. Declined for now (maybe use XLinkBot to hand out more warnings?). --Dirk Beetstra T C 08:38, 18 December 2013 (UTC)

    suzukicycles.org

    suzukicycles.org: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com Frequently cited, frequently plagiarized. Trove of copyrighted photos, books and text violates WP:COPYLINK and WP:SPS --Dennis Bratland (talk) 16:09, 17 September 2013 (UTC)[reply]

    Support blacklisting. Werieth (talk) 19:37, 27 September 2013 (UTC)[reply]


    Morning277 subjects

    These sites are being promoted by a publicity agency, banned from Wikipedia, which has been posting articles about them. After an article is deleted and the poster blocked, a new article with similar contents is posted from a different account, almost always under a different title. Since they keep using new accounts and new article titles, account blocking and page protection haven't been entirely effective.

    newyorkstay.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com

    youtube.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com

    justiceforall.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com

    kulaw.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com

    4cabling.com.au: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com

    aasted.eu: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com

    alsbridge.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com

    awaionline.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com

    bizible.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com

    rybec

    www.princeton.edu/~achaney/tmve/wiki100k

    princeton.edu: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com

    • The site is not reliable and should not be used in Article space per WP:CIRCULAR. It is a clone of Wikipedia (every real page there says "The article content of this page came from Wikipedia and is governed by CC-BY-SA."). Some Wikipedia editors may think the site is a good RS (it ranks near the top of Google results and the domain is .edu), but it isn't, and there should be some way to tell editors that the link should not be added to Wikipedia.
    • Recent example: diff
    • Currently there are 76 links to the site, some are from Article space: [1]:
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/1924_Summer_Olympics.html is linked from Albert Séguin
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/2D_computer_graphics.html is linked from 2D computer graphics
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Abba_Eban.html is linked from Abba Eban
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Adair_County,_Missouri.html is linked from Grand River (Missouri)
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Area_rule.html is linked from Sears–Haack body
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Banba.html is linked from LÉ Banba (CM11)
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Ben_Bova.html is linked from Ben Bova
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Bhavani.html is linked from Bhavani Peth
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Biface.html is linked from Hand axe
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Brightness_temperature.html is linked from Brightness temperature
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Bushyhead,_Oklahoma.html is linked from Dennis Bushyhead
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Byng,_Oklahoma.html is linked from Julian Byng, 1st Viscount Byng of Vimy
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/CDC_6600.html is linked from CDC 6600
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Camel_(band).html is linked from Camel (band)
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Camel_(band).html is linked from The Snow Goose (album)
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Critical_theory.html is linked from Critical theory
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Ctesiphon.html is linked from Iwan
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Du_hast.html is linked from Burkenburg
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Francesco_Redi.html is linked from Francesco Redi
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Jean_le_Rond_d_Alembert.html is linked from Louis-Camus Destouches
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Lagari_Hasan_%C3%87elebi.html is linked from Lagâri Hasan Çelebi
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Language_game.html is linked from Language game
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Local_Government_Areas_of_Australia.html is linked from Local Government Area
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Lord_Peter_Wimsey.html is linked from Lord Peter Wimsey
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Mohammed_Deif.html is linked from Mohammed Deif
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Mystic_Records.html is linked from Mystic Records
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Noise_weighting.html is linked from Psophometric weighting
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Phonograph_cylinder.html is linked from Early classical guitar recordings
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Pimsleur_language_learning_system.html is linked from Pimsleur method
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Pope_John_XXI.html is linked from History of Roman Catholicism in Portugal
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/QuarkXPress.html is linked from QuarkXPress
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Reconquista.html is linked from History of Roman Catholicism in Portugal
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Record_producer.html is linked from Executive producer
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Sacraments_of_the_Catholic_Church.html is linked from Catholic Church
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Simpson_s_paradox.html is linked from Edward H. Simpson
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Smokey_Robinson.html is linked from North End, Detroit
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Sunk_costs.html is linked from Sunk costs
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/The_Chemical_Brothers.html is linked from Alleyn's School
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Transport_in_Barbados.htm is linked from Transport in Barbados
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Tsui_Hark.html/ is linked from List of University of Texas at Austin alumni
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Wall.html is linked from Wall
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Warren,_Arkansas.html is linked from Warren, Arkansas
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Wis%C5%82awa_Szymborska.html is linked from Ironic precision
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Yom_Kippur_War.html is linked from United Nations Security Council Resolution 338
    PS Actually, there are other runs of "tmve/wiki100k" on different sites (google for "tmve/wiki100k" site:wikipedia.org), e.g. http://www.sccs.swarthmore.edu/users/08/ajb/tmve/wiki100k/docs/Bavarii.html http://www.sccs.swarthmore.edu/users/08/ajb/tmve/wiki100k/docs/Potentiometer.html and they are not only in en-wiki (move to meta spam list or create some filters for wiki100k?) `a5b (talk) 00:52, 25 August 2013 (UTC)
    I don't know where the links came from but that site is benign -- it's an experiment being done by a grad student at Princeton. For more information, see these web pages:
    http://www.cs.princeton.edu/~achaney/papers/ChaneyBlei2012.pdf
    I suggest emailing her at http://www.cs.princeton.edu/~achaney/email.html before any blacklisting to give her a heads up.
    Her work could be very useful to Wikipedia and the Wikimedia Foundation in the long-term.
    That said, we don't need any of these links since they circle back to our own content.
    --A. B. (talkcontribsglobal count) 16:23, 29 August 2013 (UTC)[reply]
    I suggest she get in touch with WikiProject Research
    --A. B. (talkcontribsglobal count) 16:25, 29 August 2013 (UTC)[reply]
    That Wikiproject looks moribund when I look at it closer. It looks like there's very active support and discussion of various research projects on Meta-Wiki at meta:Research:Index. I'd hate to see a diligent researcher run afoul of what might look BITE-y to an outsider.
    --A. B. (talkcontribsglobal count) 16:34, 29 August 2013 (UTC)[reply]
    A. B., the main problem with this site is that it takes texts from Wikipedia and republishes them. Copying text from the wiki is allowed, but what is not allowed (per WP:CIRCULAR) is to use Wikipedia texts as references (wikipedia is not Reliable source, so does any copy of wikipedia). E.g. if there is a link to http://www.princeton.edu/~achaney/tmve/wiki100k/docs/Pope_John_XXI.html in some article, we should replace it with Pope_John_XXI; and if such a link is in a <ref>, we should replace it with {{fact}}. I proposed adding this site to the spam list only to limit the effort of replacing links to the Princeton site with {{fact}}. With the site on the spam list, no new links to it would be added by good-faith users who may think that something from `.edu` is always reliable..... Ok, there is actually no need to include her site in the spam list, but we should delete all links to her site and periodically recheck the Linksearch. `a5b (talk) 21:39, 9 September 2013 (UTC)
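The cleanup described in this comment could be roughed out as below. This is only a sketch under stated assumptions, not an existing bot: the function names (`clean`, `to_wikilink`) and the regexes are hypothetical, the pattern keys on the "tmve/wiki100k/docs/" path so that it also covers the Swarthmore copies mentioned earlier, and the mapping from mirror file name to article title is lossy (for example, the mirror writes apostrophes as underscores), so every result would still need a human check.

```python
import re
from urllib.parse import unquote

# Hypothetical pattern: any host serving a "tmve/wiki100k/docs/<page>.htm(l)" mirror page.
# Group 1 captures the page name used by the mirror.
URL = r"(?:https?://)?[\w.-]+/[^\s\[\]<>|]*?tmve/wiki100k/docs/([^\s\[\]<>|/]+?)\.html?/?"
MIRROR_URL = re.compile(URL, re.IGNORECASE)
# The same URL inside a bracketed external link, with an optional label.
BRACKETED = re.compile(r"\[" + URL + r"(?:\s[^\]]*)?\]", re.IGNORECASE)
# A non-self-closing <ref>...</ref> pair.
REF = re.compile(r"<ref(?:\s[^>/]*)?>(.*?)</ref>", re.IGNORECASE | re.DOTALL)


def to_wikilink(match):
    """Turn the captured mirror page name into an internal link (approximate)."""
    title = unquote(match.group(1)).replace("_", " ")
    return "[[" + title + "]]"


def clean(wikitext):
    """Replace mirror citations with {{fact}} and other mirror links with wikilinks."""
    def fix_ref(m):
        # A reference that cites the mirror is circular, so it becomes {{fact}}.
        return "{{fact}}" if MIRROR_URL.search(m.group(1)) else m.group(0)

    text = REF.sub(fix_ref, wikitext)
    text = BRACKETED.sub(to_wikilink, text)
    return MIRROR_URL.sub(to_wikilink, text)
```

Running `clean` on wikitext only suggests a replacement; finding a source that meets WP:RS remains the better fix for anything that ends up as {{fact}}.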
    As the creator of these problematic pages, I'm sorry--I just became aware that this is an issue. Please let me know what I can do to help fix it or prevent bad citations in the future. I won't be following the discussion here, but please email me if you'd like me involved. — Preceding unsigned comment added by Absonant (talkcontribs) 13:23, 19 September 2013 (UTC)[reply]
    To fix it, stop using these links for citations, period. As A5b noted, "wikipedia is not Reliable source, so does any copy of wikipedia." Better yet, clean up the mess by replacing all of those links with [citation needed], or even better, find a source that meets WP:RS to do so. OhNoitsJamie Talk 21:46, 16 November 2013 (UTC)[reply]

    sourcesecurity.com

    Spammers

    Long term, persistent spamming on many IPs and users - above is a partial list of IPs and accounts. Main spam URL is sourcesecurity.com, but thebigredguide and yogawizard show some overlap in accounts. - MrOllie (talk) 18:37, 30 August 2013 (UTC)[reply]


    Here's a specific suggestion:

    ebscohost\.com(\.|.*(pdfviewer|EbscoContent))     #Block 3 kinds of unusable EBSCOHOST links but allow permalinks: Match proxies: there's a literal "." after "com", and temporary session links, which contain pdfviewer or EbscoContent
    

    ( This is a consolidation of these two simpler regexes:

    ebscohost\.com.*pdfviewer          #Block unusable [[wp:EBSCOHOST]] links but allow permalinks
    ebscohost\.com\.                   #Match proxies, which is where it's not the end of the hostname - there's a literal "." after "com".
    

    )

    Wikipedia has many apparently dead-on-arrival links (like this) intended to be to PDFs of the form ebscohost.com...pdfviewer...: all 7 of the 323 pages containing ebscohost and pdfviewer that I looked at had dead EBSCO links. These are NOT links that hit a paywall (like this). Rather, they bring up 404-like server error messages, and did from the day they were added; they're non-persistent URLs.

    A second problematic type of EBSCO link is the proxied URL, like the three added by a user's (sole ever) edit that are of the form hxxp://0-web.ebscohost.com.sculib.scu.edu/ehost/pdfviewer/pdfviewer?sid=[hex string]@sessionmgr13&vid=4&hid=13. (Note the bold portion!) These links work ONLY for subscribers who are ALSO at SCU. We shouldn't allow such links, and the blacklist (or a similarly functioning parallel system) would be a good solution.

    I've noticed that EBSCO staff has been heavily editing their own article. I solicited assistance, hoping they'd be available, willing, and able to help fix these links or suggest ways to deal with them systematically. note posted; no response. What EBSCOhost calls permalinks, like http://search.ebscohost.com/login.aspx?direct=true&db=ulh&AN=37698669&site=ehost-live&scope=site are acceptable, and so I've designed a regex that allows the permalinks but forbids the non-persistent URLs.

    Research suggests it's not possible to convert the non-persistent URLs to persistent URLs using the data in the former. --Elvey (talk) 21:26, 9 September 2013 (UTC)[reply]
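As a quick sanity check of the proposal, the consolidated regex can be run against the example URLs quoted above. This is a minimal sketch, not the blacklist extension itself: it assumes case-insensitive matching with a plain substring search, the helper names are hypothetical, the proxied URL is cut off before its elided session parameters, and the `SESSION` URL is an invented stand-in for a non-proxied pdfviewer link.

```python
import re

# The consolidated pattern from the proposal, plus the two simpler ones for comparison.
CONSOLIDATED = re.compile(r"ebscohost\.com(\.|.*(pdfviewer|EbscoContent))", re.IGNORECASE)
SIMPLE = [
    re.compile(r"ebscohost\.com.*pdfviewer", re.IGNORECASE),  # session-bound PDF viewer links
    re.compile(r"ebscohost\.com\.", re.IGNORECASE),           # proxies: "." follows "com"
]

# Permalink quoted in the proposal (should stay allowed).
PERMALINK = ("http://search.ebscohost.com/login.aspx?direct=true&db=ulh"
             "&AN=37698669&site=ehost-live&scope=site")
# Proxied link from the proposal, truncated before the elided query string.
PROXIED = "http://0-web.ebscohost.com.sculib.scu.edu/ehost/pdfviewer/pdfviewer"
# Hypothetical non-proxied session link, for illustration only.
SESSION = "http://web.ebscohost.com/ehost/pdfviewer/pdfviewer"


def blocked(url):
    """True if the consolidated pattern matches anywhere in the URL."""
    return CONSOLIDATED.search(url) is not None


def blocked_simple(url):
    """True if either of the two simpler patterns matches."""
    return any(p.search(url) is not None for p in SIMPLE)


if __name__ == "__main__":
    for name, url in [("permalink", PERMALINK), ("proxied", PROXIED), ("session", SESSION)]:
        print(name, blocked(url), blocked_simple(url))
```

Note that the consolidated pattern is slightly broader than the two simpler ones together, because it also matches links containing EbscoContent.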

    The second problem is the use of a proxied URL, ie, the link points to a institution's proxy server such as sculib.scu.edu. This is not specific to ebscohost - it happens with links to other subscription databases too. A search for "ezproxy", for example, will bring up hundreds of such links. They are a bad thing. Nurg (talk) 08:39, 12 June 2013 (UTC) (reposted)[reply]
    I am tempted to see these sites as redirects, and whether they work will be location-dependent. I would consider that these should typically be converted to direct links to the object (within educational institutions, one can generally use a web-proxy to get to literature - a direct link would either be the link on the server where the literature resides, or the DOI). <snip> Links through proxy servers have no place whatsoever. I am somewhat tempted to say that these need blanket blacklisting on meta, as they could possibly be abused to circumvent other blacklistings (for a relatively open proxy), and serve no function whatsoever to most readers except for the (few) ones that have access through the proxy - I doubt even if the url can be understood well enough to be able to figure out a real link from it. It is however going to be very obnoxious for the users that in good faith insert the proxy url they copy from their web-browser and then they can't save, and one could think of cases where it is appropriate (if information is only available to people who can pass the proxy and nowhere else in the world, it could still be a good reference for certain information - think of it as a book of which the single copy is in a nearly inaccessible library (the library in the Vatican), it is still verifiable by proxying through people who do have access to the library (ask the pope)).
    Note, that with creative regex rule-writing, we could blacklist the two 'bad' examples of Nurg (the non-persistent link and the institution proxies), still enabling good ones (the permalinks). --Dirk Beetstra T C 09:30, 12 June 2013 (UTC) (reposted, indented, and 1 sentence snipped)[reply]
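Since the proxy problem is not specific to ebscohost, one way to picture a more general check is to look at the hostname rather than the whole URL: a link is suspect when a database vendor's domain appears inside the hostname but is not its final part. The sketch below is illustrative only. `VENDOR_DOMAINS` and `is_proxied` are hypothetical names, the vendor list holds just the one domain discussed here, and proxies that pass the target as a query parameter would not be caught.

```python
from urllib.parse import urlsplit

# Assumption: an illustrative list; a real one would name every subscription database of concern.
VENDOR_DOMAINS = ("ebscohost.com",)


def is_proxied(url):
    """True if a vendor domain sits inside the hostname without ending it."""
    host = (urlsplit(url).hostname or "").lower()
    labels = host.split(".")
    for vendor in VENDOR_DOMAINS:
        wanted = vendor.split(".")
        # Only consider windows that stop before the last label, so a
        # hostname that simply ends with the vendor domain is not flagged.
        for i in range(len(labels) - len(wanted)):
            if labels[i:i + len(wanted)] == wanted:
                return True
    return False
```

For example, a hostname such as 0-web.ebscohost.com.sculib.scu.edu is flagged, while search.ebscohost.com is not.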
    We use the blacklist to limit examiner.com links, because they generally fail RS, so I think it's appropriate that we add regexes for the impermanent URLs. (Arguably it would be better to have a similarly functioning parallel system with its own error messages handle sites like examiner.com and this ebsco problem, but in the meantime, I say let's put in regexes to handle them.) I also match the ebscohost proxy URLs, but not by matching on 'ezproxy', because some of the ebscohost proxy URLs don't contain 'ezproxy'. (It could be considered as part of a future proposed blacklist addition.) Beetstra (talk · contribs) suggested blanket blacklisting on meta be considered, but at meta, though I see these links on other sites - e.g 'fr.', I was told firmly, "Deal with it at the local wiki level." (Discussion at https://meta.wikimedia.org/w/index.php?title=Talk:Spam_blacklist&oldid=5798048#Unusable_EBSCOHOST_links.) --Elvey (talk) 21:26, 9 September 2013 (UTC)[reply]
    Well? Do we need to run a bot to remove all the extant links first, or is there more that is holding this addition back? --Elvey (talk) 01:58, 19 September 2013 (UTC)[reply]
    3000 links to improve in one go - seems like a good idea to me, yes!--Elvey (talk) 02:17, 26 September 2013 (UTC)[reply]
    Given the fiasco of the many editors pissed off by the actions of Cyberpower678 (talk · contribs)'s bot Cyberbot II (talk · contribs), a non-spam blacklist (see the big text above) is urgently needed. If one of Cyberpower678's bots is set up to handle entries on this list appropriately, it would be appropriate to add the EBSCOHOST regex to it, and move the examiner.com and petition regexes to it.--Elvey (talk) 09:13, 26 September 2013 (UTC)
    Many? Don't see that yet. Elvey, most of the links we block are blocked because they are/were spammed (examiner.com was spammed and is a spam-problem, for most of the petition sites, that is also true - it is a spam problem .. your remark regarding that is wrong), we do not block because we don't like links, or because they are unusable or because they are unreliable sources .. Nonetheless, your suggestion to have a second similar list might have merit, but that is a mediawiki developer problem that should be solved at the bugzilla level .. and I do not have much hope since we are waiting for several blacklist-related 'bugs' (improvements) for years already. --Dirk Beetstra T C 09:26, 26 September 2013 (UTC)[reply]
    Thanks, I see what you're saying. Here is a good example of the problem (of the list being used for reasons other than to block spam and bot Cyberbot II (talk · contribs) pissing off many users) : Luke (talk · contribs) is ADAMANT: "If something gets tagged as being on the spam blacklist, I will remove it, pure and simple." He's saying he's NOT going to examine the link, or attempt to repair or replace it. He's going to ASSUME the blacklist maintainers are making sure that the blacklist pretty much only blocks spam (like the spamhaus SBL and XBL maintainers do, if you're familiar with those lists).


    It's a problem that this blacklist is not a Spamhaus caliber blacklist; it's more like some of the more aggressive blacklists that are willing to regularly include entries that can be expected to cause considerable collateral damage. And that wouldn't be so bad if this blacklist was not marketed/described as pretty much only blocking spam. I'm willing to bet that the typical editor who tries to add a legitimate reliable source that is blacklisted collateral damage ends up not adding it, because we don't have multiple lists. The bot and the blacklist description pages are wrong to say the link Luke removed was spam; he was misled. The solution isn't to threaten to block everyone who does what he intends to do. It's to fix the list by splitting it. ASAP.


    We should still be blocking no-ip.com and examiner.com by default. Just not with this list. Roughly how many have I seen/do I think have expressed/do I think are upset because of the bot? Me? Around 6/?/60+, based on the 6. What about you? Is effecting a new list a developer problem at all? I would expect replicating the existing system and changing the names of a few things would be a relatively trivial task for the right person (an admin, not a developer), compared to a real development project, such as a significant enhancement. Many good examiner.com-type links were not blocked because the links were a spam problem, but rather because they might have been one. But yes, I see your point - the examiner.com domain was blocked because it is a spam problem, I stand corrected. (BTW, is there a working 'spam' definition for use here? I usually refer to the definitions like the ones spamhaus proffers, tweaked to apply to this medium. I guess I'll go look for that …) I remember the first time I tried to save an edit and couldn't: for the longest time I had no idea why Wikipedia wouldn't let me save the edits I'd made to an article, which included adding an examiner.com source to it. I'm knowledgeable about spam blacklists, yet the error messages were so pathetic and inscrutable that they had me thoroughly confused. I have just gone through the same motions, which confirmed what I have seen others' comments suggest: the error messages shown to legit editors are still a pretty serious FAIL, though they are better than I remember. I recall they were worse than nothing, worse than useless. For no-ip.com links, the error message is still awfully misleading, as it describes my only options as:
    • If you feel the link is needed, you can:
      • Request that the entire website be allowed, that is, removed from the local or global spam blacklists (check both lists to see which one is affecting you).
      • Request that just the specific page be allowed, without unblocking the whole website, by asking on the spam whitelist talk page.
    This error message is unhelpful. The appropriate action is to request that the subdomain, rwservices.no-ip.info be whitelisted. It needs to be whitelisted. The error message is **misleading**.
    --Elvey (talk) 21:57, 26 September 2013 (UTC)[reply]
    PS: From current discussions: Surely you admit this listing wasn't because of a spam problem:

    \bjustjared\.buzznet\.com\b # Kanonkas # Gossip site/copyvio issues/speculation/not a reliable source used wrongly

    Perhaps Versageek (talk · contribs), creator of bot XLinkBot (talk · contribs) would be up to the task (of a non-spam blacklist system.) --Elvey (talk) 00:07, 23 October 2013 (UTC)[reply]

    sentuamsg.com

    These URLs are being spammed by 86.20.42.223 (talk • contribs • deleted contribs • blacklist hits • AbuseLog • what links to user page • COIBot • Spamcheck • count • block log • x-wiki • Edit filter search • WHOIS • RDNS • tracert • robtex.com • StopForumSpam • Google • AboutUs • Project HoneyPot) all over the place. Diffs: [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16]. --benlisquareTCE 05:56, 15 September 2013 (UTC)[reply]

    pinkangelsmokes.com

    Repeated additions of this - apparently, as I can't check it from a work machine - porn site link by at least 5 IP users I have been able to ID (only other reference I could find in a search was at Talk:Smoking fetishism requesting it be removed from the article in May of this year). User(s) using formatting in a - poorly executed - attempt at deception to masquerade it as a link to a government survey. besiegedtalk 00:51, 21 September 2013 (UTC)

    raveguide.co.uk

    raveguide.co.uk: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com Being added by user of the same name. The user has been reported as a promotional user, not (yet) as a vandal. Fiddle Faddle 22:45, 26 September 2013 (UTC)[reply]

    • Oppose, no active links, no user by the name indicated by nom have been active in years. Without further evidence of spamming or indications of alternative actions this shouldn't be blacklisted. Liamdavies (talk) 13:51, 12 December 2013 (UTC)[reply]

    Digitaldreamdoor.com

    This website accepts user contributions without editorial oversight, which means it fails the WP:Reliable sources guideline. Anybody can make a list of their favorites and publish it; a lot of what we see on Wikipedia is in the form of "100 Greatest Fusion Artists" or similar. The website has been added to Wikipedia as an external link or a reference many, many times, often by multiple good-faith users rather than by a single spamming account. If we blacklist the website, the unreliable references and links will stop. Examples of this website being used as a reference or an external link:[17][18][19][20][21][22][23]
    Back in 2007, Special:Contributions/65.2.112.232 added Digital Dream Door to seven articles, one in each of his seven edits: an example of spamming.
    A discussion about this general issue can be seen at Wikipedia:Reliable_sources/Noticeboard#digitaldreamdoor.com. Thank you. Binksternet (talk) 16:03, 27 September 2013 (UTC)

    Is there an active ongoing spam threat here? Liamdavies (talk) 13:54, 12 December 2013 (UTC)

    programarexcel.com

    Several links to this site were added to Microsoft Excel on Oct 9, 2013. The site is of dubious value and pops open full-screen ads.

    mobiles.sulekha.com

    A welter of spam links to this site today alone. It may be worth investigating the entire domain too. Fiddle Faddle 10:39, 12 October 2013 (UTC)

    bcl10.com

    gentaur-worldwide.com

    apoptosises.com

    Saw that IP 94.26.80.83 added apoptosises.com to Apoptosis, clicked it, and saw a long list of "Buy Now" buttons for pharmaceuticals on a shoddy website. The sites host barebones medical articles (almost certainly copy/paste jobs) with tons of links to buy their products. Also reporting the IP, as those links are all it has ever added in the period since February 2013. Undid all of his/her past additions, but these should probably be blacklisted. --Rhododendrites (talk) 10:50, 17 October 2013 (UTC)
    • 94.26.80.83 hasn't been active since October[25]; is this threat still active? Have other measures been undertaken to prevent spamming? If not, oppose. Liamdavies (talk) 14:05, 12 December 2013 (UTC)
    • The spammer was active from February until October and was warned on the last day. This is a case for a firm warning (spam4im) on the next occurrence (maybe with the help of XLinkBot), followed by a block if they choose to ignore it for a prolonged time (active from Feb-Oct, 8 months, so at least a 6 month block as they are static enough). Declined for now, though we may see this one returning (and linking to 'Buy now' pages is useless enough to just blacklist to avoid future abuse; we can't keep cleaning up). --Dirk Beetstra T C 06:32, 18 December 2013 (UTC)

    soccerdatabase.eu

    Back in May 2013 this link was mass removed from Wikipedia because it was deemed to be a copyright-violating mirror of the defunct 'playerhistory.com' website. As I understand it, the owner of 'playerhistory.com' is Polarman (talk · contribs), and he has been taking legal action against the owners of 'soccerdatabase.eu' for violating copyright. This website has no place on Wikipedia and should therefore be blocked. Note that a previous attempt to blacklist 'soccerdatabase.eu' fizzled out with no real decision either way. GiantSnowman 12:32, 29 October 2013 (UTC)

    qtrax.com

    Spammed by multiple accounts: Special:Contributions/Lilamey2013, Special:Contributions/79.179.194.98, Special:Contributions/Putting_an_end_to_tyrant_editors, Special:Contributions/Sivan_qtrax, Special:Contributions/109.64.177.36.

    Apparently using bots now too, according to [26]. Яehevkor 10:28, 10 November 2013 (UTC)

    No, this is incorrect. If you look at DVdm (talk), you will see that there isn't any current usage of bots, only a declaration of the desire to use one. Currently, there is a bot approved by Wikipedia that adds links to lyrics of notable songs and redirects readers to a website owned by CBS Interactive, called MetroLyrics. For some reason, this website has been approved and is spreading links all over Wikipedia. There isn't much difference between Qtrax and MetroLyrics, besides the fact that Qtrax also offers free streaming and downloads of the song. Yes, free. The whole model of Qtrax is based on providing LEGAL music for FREE to end users. Yet, in return, the artists are getting paid by Qtrax. Hence it is the only service in the world which is both free & legal.

    Our quest is eventually to fight piracy over the web. We offer everything for free, because this is what pirate sites offer. So if we want to fight piracy, we must offer music for free. If we would like to keep being legal, we must then have licenses with the music labels, and pay the artists for every song our users play or download on Qtrax. Which we happily do. Please help us fight piracy, don't stop us. -- Preceding unsigned comment added by 79.183.0.181 (talk) 14:10, 13 November 2013 (UTC)

    Note - According to About us:

    "In addition, the partnership between advertising and Qtrax delivers great potential for monetization by brokering branded deals with consumer advertisers around the world. These natural partnerships are sure to yield generous revenues for artists and labels alike."

    - DVdm (talk) 10:56, 10 November 2013 (UTC)
    I fully support this being blacklisted. The IP has clearly stated its intent is to promote the website and artists, not to better the Wikipedia project. Wikipedia is not here to use as your source of free advertisement. Sergecross73 msg me 14:16, 13 November 2013 (UTC)
    Please look at Requests_for_approval/LyricsBot. There's obviously a common interest in adding lyrics to song pages, which has been validated by the Wikipedia community and the administrators who approved the MetroLyrics bot. Just like Qtrax, MetroLyrics (owned by CBS) is a commercial entity. Since Wikipedia is unbiased, an equal approach should be taken towards both parties; that is, since MetroLyrics was allowed to add external links, Qtrax should be too. -- Preceding unsigned comment added by Gil.qtrax (talk • contribs) 15:30, 13 November 2013 (UTC)
    Qtrax should go through whatever MetroLyrics did for approval then. Until then, you shouldn't be adding Qtrax links to articles, as you've already shown your interest is in your own website and promoting artists, not bettering the Encyclopedia. If you keep spamming the website, you'll be blocked. Sergecross73 msg me 15:38, 13 November 2013 (UTC)
    Wikipedia is not a free advertising platform for your new company. In the absence of documented community consensus for mass-adding of the links, if I see any other single purpose accounts adding the links, the account will be blocked and your site will be blacklisted. OhNoitsJamie Talk 15:44, 13 November 2013 (UTC)
    According to WP:Village_pump_(proposals)/Archive_97#Linking_lyrics_from_legal_providers there is a documented community consensus for mass-adding of exactly such links. There is no difference between a Qtrax link and a MetroLyrics link. Since this has already been discussed and supported in the community, and approved for automation (LyricsBot) less than a year ago, there should be no problem with posting manual links until a bot is approved. -- Preceding unsigned comment added by Gil.qtrax (talk • contribs) 15:55, 13 November 2013 (UTC)
    Qtrax is a separate organization, so it needs separate consensus. It's as simple as that. You don't get to make the terms here. Follow protocol and the rules or get blocked. Sergecross73 msg me 16:04, 13 November 2013 (UTC)

    tefl-online-courses.com

    Being spammed to multiple articles by multiple IPs. [27] [28] Jackmcbarn (talk) 22:22, 16 November 2013 (UTC)

    en.softonic.com

    Part of a plague of snowshoe spam on WP and many, many areas elsewhere, to drive traffic to this software download site. The en. version redirects to the second version. I suspect other prefixes are in use as well as en. Fiddle Faddle 12:04, 28 November 2013 (UTC)

    nationalforum.com

    Regular spam of this site, and also of the author (and blacklist candidate string) "William Allan Kritsonis", across a range of often bizarrely unrelated articles, from a number of IPs. See [29] [30] and 72.48.212.34 (talk · contribs) for examples. Blocking is likely to be ineffective, as some are just commonplace AT&T IPs. Andy Dingley (talk) 00:37, 7 December 2013 (UTC)

    • Is this spam threat still active? Or has the ban worked? I see no active (fresh) links. Liamdavies (talk) 14:19, 12 December 2013 (UTC)
      • Liam, this editor was active in October, was warned several times in December for spamming, and continued until a block was applied. 5 1/2 days after the last spam run (and the block) you ask if the threat is still active, while you know that there are other IPs doing the same. 5 1/2 days, or even 3 weeks, of not seeing the link being added does not show whether the threat is still active (it may even have happened but been reverted without further notice). Part of the task of editors evaluating requests is to do that extra research as well.
      • Let's ask a return question: do we want editors to spend time on reverting, maybe getting frustrated by having to fight spammers, and do we want readers to have to read spammy Wikipedia pages?
      • Upon further research: there is an earlier block of an account regarding these spam issues, and this is now the second. A good example of persistence; the spammer has been active since 2006 (and did not get the message then). Added. --Dirk Beetstra T C 11:26, 18 December 2013 (UTC)
      • Addendum: the editor was active in 2006 and blocked; the editor returned in 2007 and was blocked and unblocked (after apparently promising to adapt, though with no further edits). --Dirk Beetstra T C 11:34, 18 December 2013 (UTC)

    Completed Proposed additions

    directory.tradeford.com

    rybec 18:52, 19 September 2013 (UTC)

    riocodes.com

    rybec 18:41, 19 September 2013 (UTC)

    Proposed removals

    Rentarasta.com

    Not understanding why this one in particular is on the list or even where to find out on my own. There is a screenshot archive of an article here that I want to reference. Docucopter (talk) 18:04, 25 September 2013 (UTC)

    Question: How would this link be useful for Wikipedia? OhNoitsJamie Talk 21:42, 16 November 2013 (UTC)

    cbronline.com

    Contains the archives of the Computer Business Review, an indispensable source of information about business computing of the 1980s and 90s. QVVERTYVS (hm?) 15:50, 29 September 2013 (UTC)

    Seconded. Computer Business Review is cited at Azul Systems and would seem to be a valuable source concerning a 2006 legal dispute between Azul and Sun Microsystems. As far as I can tell at [31], the domain has changed hands since the blacklist listing. Kingdon (talk) 14:43, 8 October 2013 (UTC)
    Agree, there are few online sources from this era, except a few in Google Books or a few other narrowly focused sites. Being able to cite them might help counter recentism. The cbronline pages do show up in Google searches near the top, perhaps because of spam-like behavior in the past, but also perhaps because they are useful. The cbronline story pages do not seem overly commercial themselves, unless I am missing another reason to keep them blacklisted. A bunch of pages on computer topics from that era still have the citations, but many others are uncited because of the blacklist. W Nowicki (talk) 22:25, 8 October 2013 (UTC)
    Seconded. I have used cbronline.com as a source for several articles about computer systems from the 1990s, especially Digital Equipment Corporation products. For a lot of facts and figures this was the only reasonable online source I could find at the time. Letdorf (talk) 22:02, 14 October 2013 (UTC).
    Please be aware that the usefulness of a source has no relevance to the blacklist. Past behavior and potential for future abuse are all that matters here. ~Amatulić (talk) 23:31, 20 October 2013 (UTC)
    OK, I'm a greenhorn when it comes to spam-blacklisting. I'm here because apparently a bot has recently begun tagging articles with {{Blacklisted-links}}. For example, Super video graphics array was tagged on September 24, 2013, and that's just the first one I noticed. An external links search finds some 357 articles linking to this so-called "spam" site. The information box at the top of this section tells me to familiarize myself with the reasons why this site was blacklisted, so I look at MediaWiki talk:Spam-blacklist/log/2010#April 2010 to see who blacklisted the link and when, and the reason given for blacklisting. There I find that user:Tedder blacklisted this site 21:28, 16 April 2010. So, apparently many readers and editors have been blissfully unaware that hundreds of articles have been linking reliable-source references to a blacklisted site for the past 3½ years. As for the reason, we are permalinked to Wikipedia talk:WikiProject Spam (the discussion can also be found at Wikipedia talk:WikiProject Spam/2010 Archive Apr 1#cleantechnology-business-review.com). There, I find many links to Template:Link summary and Template:User summary that aren't expanded for some technical reason. Notice that those templates work fine up to a certain point on the page, and then they don't. Maybe the expansion limit was exceeded? So, I've taken the liberty of copying the relevant section of the archive to Wikipedia talk:WikiProject Spam/2010 Archive Apr 1/cleantechnology-business-review.com. There you will see that the blacklist decision was a local consensus between the two editors Tedder and user:Beetstra. Beetstra helpfully mentions that "this is used as a reference as well, and I see many 'regulars' using these links" – well I'd guess that most of the 350+ links are legitimate links created by us 'regulars'. I mean, Wikipedia has a "massive" number of links to The New York Times I'm sure, but their massiveness doesn't make them spam.
    Sorry, but after all the time and effort I've put into this, I still don't understand why this site was blacklisted. Can Tedder or Beetstra please explain, for the benefit of us spam-blacklist newbies? Thanks, Wbm1058 (talk) 18:47, 23 November 2013 (UTC)
    I just recalled that my first visit to the MediaWiki talk: namespace, two years ago, was over this same issue. As I've only half a dozen edits in this namespace, to me this particular blacklist item really stands out. I can't recall any encounters with any other site on the blacklist. Relevant past discussions in the archive:
    You just hit it: SimonThird is one of them, with 78 edits, in most cases adding a reference to cbronline. We call that reference spamming. Sometimes he added his reference to already-referenced material, or just added a sentence with this reference. How many of the currently available 350+ articles that contain the links still have them because the spam was not appropriately cleaned out? Yes, it is a massive amount of work to get those 350 through the whitelist, but I have seen quite a number of them already declined because they were unnecessary or replaceable (4 of the 5 I just went through were in fact replaceable, and only one was granted). As this was a massive campaign to spam Wikipedia, and we are NOT a vehicle for that, I am very reluctant to remove it, and I will ask editors to go the extra mile and go through whitelisting for the individual links, including showing that there are no other sources for their requests. Defer to Whitelist. --Dirk Beetstra T C 07:45, 24 November 2013 (UTC)
    To be able to see the full record, I split the archive, please see: Wikipedia_talk:WikiProject_Spam/2010_Archive_Apr_1B#cleantechnology-business-review.com. --Dirk Beetstra T C 08:49, 24 November 2013 (UTC)

    So let me be sure I understand this last post… a small number of users including (only?) SimonThird started spamming the Wiki. Right? So instead of blocking them we blocked a reference used in hundreds of other articles? And the reason for this is that it would be too much effort to fix the actual problem? Maury Markowitz (talk)

    I checked another one of SimonThird's edits. Here he added a new section Releases to IBM's article, referenced with a link to the CBR site. Clearly this gives undue weight to a single product release, one of perhaps thousands of products released over IBM's 100+ year history. This one lasted a couple months before it was reverted. So some unknown percentage of this editor's contributions may have dubious motivations. I maintain that the first example I cited, if looked at by itself, with no knowledge of SimonThird's other edits, should be considered as both good-faith and helpful, were the site not blacklisted. I don't have the power to check this user's IP address to see if it could be associated with this website. I'm not really familiar with the publication, but my perception is that Computer Business Review was a British printed trade journal back in the day, perhaps similar to InfoWorld. A lot, perhaps most of these publications have dropped their paper editions, and if they're still operating, are now online-only. Unfortunately, unlike InfoWorld which Google has helpfully scanned so that we may directly link images of the paper-printed magazines, our only option with Computer Business Review is to link to this site. I doubt this site makes much, if any, money selling subscriptions, so obviously they need to draw traffic that views on-site advertising to survive. Our legitimate reference links to this site may help in some small way in that regard, helping them stay online so that the site is available for us to research and find more references. Now The New York Times does still make real money from selling subscriptions, but even they are becoming more dependent on online ads. 
So, what if, theoretically, some anonymous editors decided to help the Times out by focusing all their editing energies on clearing the Category:Articles with unsourced statements backlog by inserting mostly helpful citations to Times articles, but got somewhat over-enthusiastic about the project and let some dubious links like SimonThird's "Releases" link slip in as well. Would we then be forced to blacklist the Times?

    Dirk Beetstra, I see that you maintain a bot that generated a report: Wikipedia:WikiProject Spam/LinkReports/cbronline.com – can we get an updated report? Thanks, Wbm1058 (talk) 00:13, 25 November 2013 (UTC)

    I'll also point out that user:SimonThird has a clean block log, the only admonition on user talk:SimonThird was extremely vague and didn't indicate any specific edits or the nature of the alleged "promotional material", and by the time this site was blacklisted on 16 April 2010, SimonThird was long gone (last edit 11 December 2009). Wbm1058 (talk) 01:34, 25 November 2013 (UTC)

    Another relevant past discussion: Wikipedia:Reliable sources/Noticeboard/Archive 49#xxx-business-review.com as source – yes this site seems to have a lot of "articles" that are just regurgitated press releases, but the idea that these are unreliable sources is a bit ridiculous. You just need to be careful about what they reliably say. Press releases are primary sources, not the secondary sources preferred on Wikipedia. Primary sources are used to fact-check secondary sources. If you have a company Z press release dated March 1996 announcing the release of the product whiz-bang version 3.0 then that is indeed a reliable source for "Company Z announced the release of whiz-bang version 3.0 in March 1996". You might want to look for a reliable secondary source that confirmed that the product actually was released when they said it was, but the press release is a reliable source for "the company claimed the product was released." - Wbm1058 (talk) 03:23, 25 November 2013 (UTC)[reply]

    Nope, it was not only SimonThird; there were more. If it had been only SimonThird, then, as you say, it would likely have been a block for the user (with some exceptions for some sites). Wbm1058, I said it was a campaign: spammers do not stay with just one account, they use multiple accounts to spam multiple domains. There were 5 or 6 listed, but there are many, many more (some with just one or two edits, but of the same pattern). We may indeed not have bothered to block editors here, but warned the different accounts and moved straight on to blacklisting. Why bother blocking accounts if other socks will pop up? (Sometimes even warning them is pointless, as they will not read the warning on the old sock account when they are already on the new one.)
    Also, you speak of a reference used in hundreds of other articles; if I see it correctly, there are at least 5 accounts (and if I go through a handful of other IPs I am worried about those edits as well) who added and (between each other) re-added links that were removed. Since it took 2 years before it was uncovered, the reports are congested with regular editors who, in some cases, may have added the links back while reverting unrelated vandalism. There may also have been regulars adding the link in the past, but that means that there should also be regulars who have tried to add the link since. If it was significantly used, then there must be several regulars among them who know that when they run into a blacklist warning there are ways to discuss that problem (whitelist requests). Still, there have not been many discussions regarding it, suggesting that not many regulars have used the link. Moreover, most of the whitelisting requests I did see were declined as 'replaceable'. I don't think this site has been used by regulars a lot.
    The site was not blacklisted because it was an unreliable source (even a porn site is a reliable source if you use it correctly); it was blacklisted because it was spammed (and otherwise spammily abused) by multiple accounts (likely an SEO company, judging by the IP edits) in a campaign (or multiple campaigns), and it is not massively reliable anyway.
    So if it shows anything, I think it shows that most of these links were not cleaned out after the blacklisting. Manpower is a continued problem here.
    Regarding the Times, that is the interesting thing. First, a journal like the Times does not need spam to get its links out (so that says something about companies that do spam). Moreover, if a site like that engaged in a massive spamming campaign, we would indeed have a nice problem, which would likely be handled through the legal department of Wikimedia (we have had congressmen or their representatives spam Wikipedia; besides blocking, they have to be reported to the Foundation). I would however not exclude that, if such a site engaged in such massive spamming, blacklisting (though more likely an edit filter) may be needed to mitigate the problem, and it has happened for sites like that.
    And yes, cbronline or the ..-review sites may be a reliable source for some information, and that is why we have a whitelist for those cases where the information is unique, reliable and notable enough.
    Sorry, COIBot is down at the moment, but the old reports should already give you an idea. I just went through some IPs, and there are more engaging in spamming than the ones that precipitated the blacklisting. --Dirk Beetstra T C 08:11, 25 November 2013 (UTC)
    OK, fair enough. This has been a good discussion, although I feel that perhaps the burden of proof has been unfairly placed on the defense. There does seem to be a problem here, but the extent of the problem and the manageability of it is just a matter of opinion. I feel that no matter how much effort I put into showing it's manageable, you will still reply that what I've found is just the tip of the iceberg and we just haven't identified the rest of it. So there's no point in further analyzing what happened over three years ago before the blacklisting. This recent addition of bot-generated {{Blacklisted-links}} has introduced an eyesore that currently transcludes onto 3,738 articles, apparently 357 (nearly 10%) of which are caused by this reference. If the goal of this exercise was to twist the arms of busy gnomes into diverting from other backlogs they've been trying to clear for months, then it's succeeded. I'll familiarize myself with the white-listing process, which is something I haven't needed to do until now, and get to work on "cleaning out" the links, though I can just get started for now before I need to take a break. Wbm1058 (talk) 14:20, 25 November 2013 (UTC)
    Hmm, I'm surprised at how short that white-list is, now that I'm looking at it for the first time. Just nine entries for cbronline. But the "helpful hint" section does not feel helpful at all. In fact, it strikes me as rather hostile. The attitude that you are guilty until proven innocent comes through loud and clear. That probably explains why the list is so short. If you make something enough of a bureaucratic bother, then volunteers just won't bother. I suppose that is the goal. Excuse me for grumbling. Wbm1058 (talk) 15:03, 25 November 2013 (UTC)
    @Wbm1058: Regarding manageability, I was just pointed to a similar case of spamming, where many domains from a company were blacklisted back in 2008. 2 independent whitelisting requests led me to have a look, and it looks like the same spamming is still ongoing, with many single-purpose accounts creating and editing articles in the realm of the company: a clear case of paid advocacy (maybe SEO spam). There, blocking accounts and blacklisting their domains certainly did not stop the spam, and I have no belief that blocking the editors here would have stopped it either. Spamming pays their bills. --Dirk Beetstra T C 07:55, 28 November 2013 (UTC)
    Regarding the helpful hint: unfortunately that is behaviour that we have to put up with on a regular basis. Please understand that blacklisting is not done to annoy good-faith fellow editors; it is to stop spam, and even the blacklist does not do a good job at that. Paid advocacy is a serious, continuous problem, and Wikipedia is a massive spam target. I am sorry, but I get very nervous and non-cooperative when a regular comes with an attitude of 'you asshole, you blocked the domain that I need and now I can not save my page'. That approach should be reserved for editors who spam Wikipedia, but that aspect is often ignored.
    Unfortunately, the bureaucratic bother is needed; spam is not your run-of-the-mill vandalism. And the reverse is generally also the case: we are deemed guilty of blocking 'useful' sites (your post of 23 November, 18:47 suggests such assumptions as well: 'this so-called "spam" site'/'Wikipedia has a "massive" number of links to The New York Times I'm sure, but their massiveness doesn't make them spam.'), and we need to continuously defend that if a site is blacklisted, it was actually spammed. --Dirk Beetstra T C 08:10, 28 November 2013 (UTC)

    energy-business-review.com

    Why is this website blocked? It is used in articles like Project Hayes and Waitahora Wind Farm, and does not look like a spam website to me. --Pakaraki (talk) 17:34, 4 October 2013 (UTC)

    Looks like it was being spammed by this user and perhaps others. OhNoitsJamie Talk 18:14, 4 October 2013 (UTC)

    ccel.us

    This was added by User:Ckatz in the summer of 2011 [32], apparently in response to this spam taunt, but it's quite unlikely that this threat was honest since CCEL (now titled Evangelical Christian Library) is simply a repository site for well-known theological and religion-related texts, most of them PD. I would not be surprised to see links to its materials throughout religious topics on Wikipedia; the case which caught my eye involves a reference in J. Z. Knight to an on-line edition of a book by Russell Chandler, once a religion writer for the LA Times. This looks to be a perfectly reasonable reference, and an online copy is surely preferable for an online encyclopedia. Therefore I would like to ask that this entry be removed from the spam blacklist as unnecessary and inappropriate. Mangoe (talk) 02:37, 30 November 2013 (UTC)

    One correction: ccel.us and ccel.org are separate sites. However, the rest of my request remains the same: ccel.us has a number of references now, and as far as I can determine it was never actually spammed. Mangoe (talk) 22:19, 3 December 2013 (UTC)

    eHow.com

    The blacklist shows behow.com, so it appears to me that it is not really intended to block eHow.com. Spalding (talk) 16:05, 8 December 2013 (UTC)

    The entry is \behow\.com\b, which means (word boundary) ehow.com (word boundary). It is intentionally catching eHow.com. Jackmcbarn (talk) 16:41, 8 December 2013 (UTC)
    Indeed. A backslash changes the meaning of the character that follows it: '\b' is a word boundary, while '\.' is a literal '.' (an unescaped '.' has a function of its own: it matches any character; the same goes for '\?', '\/', '\$' ..., which match the literal characters). Please see Regular expression, which contains, or links through to, more information. --Dirk Beetstra T C 05:48, 9 December 2013 (UTC)
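
    For anyone who wants to try the entry out, here is a minimal sketch in Python of how \behow\.com\b behaves. The helper name is_blocked and the sample URLs are made up for illustration, and case-insensitive matching is an assumption on my part (it is consistent with the entry catching eHow.com, but this is not the blacklist's actual code).

```python
import re

# The blacklist entry exactly as quoted in this thread.
ENTRY = r"\behow\.com\b"

# Assumption: entries are matched case-insensitively, so "eHow.com" is caught.
pattern = re.compile(ENTRY, re.IGNORECASE)


def is_blocked(url: str) -> bool:
    """Hypothetical helper: True if the entry matches anywhere in the URL."""
    return pattern.search(url) is not None


# Illustrative URLs only; none of them come from the blacklist log.
samples = [
    "http://www.ehow.com/how_123.html",  # '.' before and '/' after are boundaries
    "http://eHow.com/",                  # same domain, different case
    "http://behow.com/",                 # the 'b' is a word character: no boundary
    "http://somehow.com/",               # 'ehow' is inside a longer word
    "http://ehowxcom/",                  # '\.' needs a literal dot, 'x' is not one
]

for url in samples:
    print(url, is_blocked(url))
```

    So the leading \b is an anchor, not the letter "b": a domain that really is spelled behow.com is not matched by this entry, while ehow.com and www.ehow.com are.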

    Cosmoetica.com

    This website was placed on the blacklist about four years ago because various SPAs (supposedly controlled by the site's proprietor, though of course he has denied this) were posting links to the site's movie reviews and essays on Wikipedia pages relating to the movies and subjects discussed. At present, however, the site contains in-depth interviews with 40 notable subjects (many of whom have Wikipedia pages), with more surely on the way, but these interviews cannot be cited because of the blacklist, even though they contain information that could be used to correct errors in the subjects' biographical information, views, etc. In addition, links to the website have been removed from the Wikipedia page of the site owner, Dan Schneider, even when posted by SouthernNights, a well-known WP editor who is certainly not a SPA, although the site is by far the best source of information about him. Given that it has been four years, with relatively little activity even on the site owner's own WP page in that time, I think it's reasonable to at least ask that the site be removed from the blacklist and be allowed to be the useful source of information that it can be. — Preceding unsigned comment added by 50.172.38.195 (talk) 21:37, 12 December 2013 (UTC)

    MoneyWeek.com

    This site was added in October 2008 with no clear explanation. MoneyWeek is a paper magazine and seems to be a reliable source. Blacklisting it prevents us from linking to it as a source. Pburka (talk) 02:35, 13 December 2013 (UTC)

    airtet.in

    • My website was blocked at an early stage because I kept linking to it from Wikipedia. At that time I did not know about the blacklist. I was a teenager and excited, and I wanted to see what would happen if I kept linking to my website. I also linked to related content, which lists regional government offices, universities and many other things. I submitted a request as well, but I could not see it when I visited this page. Being blocked from Wikipedia makes me feel guilty of being a spammer, but I did not know what spam was then; when I was linking to my website I thought I was linking to relevant content. I have finally realized that linking many times in a short period made me look like a spammer. — Preceding unsigned comment added by 124.123.0.103 (talk • contribs)

    plasticsurgery.org

    This is the official website of the American Society of Plastic Surgeons, the official organization for plastic surgeons who are board certified in plastic surgery by the American Board of Medical Specialties. They are heavily involved in plastic surgery education; for example, ABMS certification candidates must have gone through an accredited plastic surgery residency program. They are also involved in public outreach, such as educational material for the public. So a lot of articles involving plastic and reconstructive surgery may like to use this organization's website as a source, to verify information like procedure information, statistics, board certification status, etc. I can't even think of any other site that could provide this type of information in a neutral way, maybe the PRS Journal, but its articles are not public. I really have no idea why this entire site would be blanket blacklisted, but I think it may suffer from a topic bias. ssc-capricorn (talk) 19:29, 17 December 2013 (UTC)

    I see that in 2008 or so, there was some spamming from what appears to be the American Society of Aesthetic Plastic Surgeons (that is ASAPS, notably different from the American Society of Plastic Surgeons, ASPS). ASAPS is specific to cosmetic surgery, while ASPS covers general plastic surgery. The ASAPS domain is surgery.org and the ASPS domain is plasticsurgery.org, but for some reason the generalist ASPS was caught up in the blacklisting of spam from ASAPS members. ssc-capricorn (talk) 19:45, 17 December 2013 (UTC)

    After reviewing the spam report very carefully, I think it's possible that ASAPS may even be an innocent party. The spammers were using these two sites to blend in with their own spam links in an attempt to look legitimate. However, lacking more information, I will let someone else put in a request for ASAPS, or put the request in at another time, but ASPS has no reason at all to be included on the blacklist. They were an innocent bystander. ssc-capricorn (talk) 20:04, 17 December 2013 (UTC)

    tanners-wines.co.uk

    This site was blacklisted in February 2008 when it was hacked. However, it is now safe and well maintained. There is a Wikipedia page here: https://en.wikipedia.org/wiki/Tanners_(company) which from a usability perspective would benefit from a link to the actual Tanners site. Given that the company has a genuine history and a good online reputation, and that it was originally blacklisted because of hacking issues, I think there is compelling enough reason to remove this site from the blacklist. CCarson789 (talk) 16:24, 19 December 2013 (UTC)

    Completed Proposed removals

    Troubleshooting and problems

    Incomplete message for petition url

    An attempt to save http://petition.com/example only gives me the message:

    • The following link has triggered a protection filter: petition

    Either that exact link, or a portion of it (typically the root domain name) is currently blocked.

    It appears MediaWiki:Spamprotectionmatch doesn't get the full url in $1. Maybe it has something to do with the petition entry not having a domain:
    \bpetition(?:online|s)?\b

    {{int:Spamprotectionmatch|petition}} produces the message I got:
    The following link has triggered a protection filter: petition
    Either that exact link, or a portion of it (typically the root domain name) is currently blocked.

    Solutions:

    • If the URL used is a URL shortener/redirect, please use the full URL in its place, for example, use youtube.com rather than youtu.be,
    • If the URL is a Google URL, please look to use the (full) original source, not the Google shortcut or its alternative.
    • Look to find an alternative URL that is considered authoritative.

    {{int:Spamprotectionmatch|http://petition.com/example}} produces what I expected to get: A message with "The following link has triggered a protection filter: http://petition.com/example". I can see it in preview but not save it without nowiki, because the produced interface message contains the blacklisted link.

    My tests were based on a report at Wikipedia:Teahouse/Questions#I can't figure out what link is blacklisted? PrimeHunter (talk) 20:51, 17 December 2013 (UTC)
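One way to see why only "petition" could end up in the message: if the message is filled in with the text the regular expression matched rather than with the full URL (an assumption, not confirmed in this thread), an entry without a domain part yields just the matched word. A minimal sketch using Python's `re` module, with case-insensitive matching assumed and `matched_text` being a hypothetical helper:

```python
import re

# The domain-less entry quoted in the report above.
PETITION_ENTRY = re.compile(r"\bpetition(?:online|s)?\b", re.IGNORECASE)


def matched_text(url, pattern=PETITION_ENTRY):
    """Return the substring the pattern matched, or None if nothing matched."""
    match = pattern.search(url)
    return match.group(0) if match else None


# The URL from the report plus two made-up ones for comparison.
for url in [
    "http://petition.com/example",
    "http://petitiononline.com/x",
    "http://example.org/",
]:
    print(url, "->", matched_text(url))
```

Under that assumption, an entry that includes the domain, such as \behow\.com\b, would report "ehow.com", which fits the "typically the root domain name" wording of the message.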

    Logging / COIBot Instr

    Blacklist logging

    Full instructions for admins


    Quick reference

    For Spam reports or requests originating from this page, use template {{/request|0#section_name}}

    • {{/request|213416274#Section_name}}
    • Insert the oldid (213416274), a hash "#", and the Section_name (Underscoring_spaces_where_applicable):
    • Use within the entry log here.

    For Spam reports or requests originating from Wikipedia_talk:WikiProject_Spam use template {{WPSPAM|0#section_name}}

    • {{WPSPAM|182725895#Section_name}}
    • Insert the oldid (182725895), a hash "#", and the Section_name (Underscoring_spaces_where_applicable):
    • Use within the entry log here.
    Note: If you do not log your entries, they may be removed if someone appeals an entry and no valid reasons for it can be found.
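The format described in the quick reference can be sketched as a small helper. This is only an illustration: the function name `log_reference` is hypothetical, and the oldids are the sample values shown above.

```python
def log_reference(oldid, section_name, template="/request"):
    """Build a log reference such as {{/request|213416274#Section_name}}.

    Spaces in the section name become underscores, as the instructions ask.
    """
    anchor = section_name.strip().replace(" ", "_")
    return "{{" + template + "|" + str(oldid) + "#" + anchor + "}}"


# Request originating from this page.
print(log_reference(213416274, "Section name"))

# Request originating from Wikipedia_talk:WikiProject_Spam.
print(log_reference(182725895, "Section name", template="WPSPAM"))
```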

    Addition to the COIBot reports

    The lower list in the COIBot reports now has, after each link, four numbers in brackets (e.g. "www.example.com (0, 0, 0, 0)"):

    1. first number: how many links this user added (the same after each link)
    2. second number: how many times this link was added to Wikipedia (as far back as the linkwatcher database goes)
    3. third number: how many times this user added this link
    4. fourth number: to how many different Wikipedias this user added this link.

    If the third or the fourth number is high relative to the first or the second, the user has at least a preference for using that link. Be careful when drawing other conclusions from these numbers (e.g. a good user who adds a lot of links). If there are more statistics that would be useful, please notify me, and I will see whether I can get the info out of the database and report it. This data is available in real time on IRC.
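The comparison described above can be sketched in a few lines. The entry format is taken from the example "www.example.com (0, 0, 0, 0)"; the function and field names are hypothetical, and no threshold is implied, since what counts as "high" is left to the reader.

```python
import re
from typing import NamedTuple


class LinkCounts(NamedTuple):
    user_links_total: int      # first number: links added by this user
    link_additions_total: int  # second number: additions of this link overall
    user_added_this_link: int  # third number: times this user added this link
    wikis_for_this_link: int   # fourth number: wikis where this user added it


ENTRY_RE = re.compile(
    r"^\s*(\S+)\s*\(\s*(\d+)\s*,\s*(\d+)\s*,\s*(\d+)\s*,\s*(\d+)\s*\)\s*$"
)


def parse_entry(text):
    """Split 'domain (a, b, c, d)' into the domain and its four counts."""
    match = ENTRY_RE.match(text)
    if match is None:
        raise ValueError("not a 'link (n, n, n, n)' entry: " + repr(text))
    numbers = [int(group) for group in match.groups()[1:]]
    return match.group(1), LinkCounts(*numbers)


def share(part, whole):
    """Return part / whole, or None when whole is zero."""
    return part / whole if whole else None


def preference_ratios(counts):
    """Compare the third number with the first and with the second."""
    return {
        "share_of_user_links": share(
            counts.user_added_this_link, counts.user_links_total
        ),
        "share_of_link_additions": share(
            counts.user_added_this_link, counts.link_additions_total
        ),
    }


# Made-up numbers for illustration only.
domain, counts = parse_entry("www.example.com (10, 4, 4, 2)")
print(domain, preference_ratios(counts))
```

A ratio near 1 for either share only indicates a preference for the link; as noted above, a prolific good-faith editor can produce similar numbers.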

    Poking COIBot

    When adding {{LinkSummary}}, {{UserSummary}} and/or {{IPSummary}} templates to WT:WPSPAM, WT:SBL, WT:SWL and User:COIBot/Poke (the latter for privileged editors) COIBot will generate linkreports for the domains, and userreports for users and IPs.


    Discussion