
MediaWiki talk:Spam-blacklist: Difference between revisions

From Wikipedia, the free encyclopedia
Elvey (talk | contribs)
ebscohost\.com(\.|.*(pdfviewer|EbscoContent)) - block unusable EBSCOHOST links
Line 535: Line 535:
==500music.com==
* {{LinkSummary|500music.com}} - At various articles by various IPs, like [[Free music]]. ([https://en.wikipedia.org/w/index.php?title=Free_music&diff=prev&oldid=552733706]) [[User:Diego Moya|Diego]] ([[User talk:Diego Moya|talk]]) 10:56, 4 September 2013 (UTC)

== ebscohost\.com(\.|.*(pdfviewer|EbscoContent)) - block unusable EBSCOHOST links ==
* {{LinkSummary|ebscohost.com}}

Here's a specific suggestion:
<pre>
ebscohost\.com(\.|.*(pdfviewer|EbscoContent)) #Block 3 kinds of unusable EBSCOHOST links but allow permalinks: Match proxies: there's a literal "." after "com", and temporary session links, which contain pdfviewer or EbscoContent
</pre>
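A minimal sketch of how the proposed entry behaves against the example URLs quoted in this thread. It assumes a plain case-insensitive search with Python's `re` module; the blacklist extension wraps each entry in its own larger pattern, so this only approximates the real matching. The `is_blocked` helper and the sample names are hypothetical, and the proxy URL is shortened to its host so that only the "literal dot after com" branch is exercised.

```python
import re

# The proposed blacklist entry, copied verbatim from the suggestion above.
PROPOSED = re.compile(
    r"ebscohost\.com(\.|.*(pdfviewer|EbscoContent))",
    re.IGNORECASE,  # assumption: matching is case-insensitive
)

# Example URLs taken from this discussion (proxy URL shortened to its host).
SESSION_PDF = ("http://web.ebscohost.com/ehost/pdfviewer/pdfviewer"
               "?vid=4&hid=111&sid=b3387fad-2fe1-4ede-a963-0b5087cc9e5b%40sessionmgr115")
PROXY = "http://0-web.ebscohost.com.sculib.scu.edu/"
PERMALINK = ("http://search.ebscohost.com/login.aspx"
             "?direct=true&db=ulh&AN=37698669&site=ehost-live&scope=site")
PAYWALL = ("http://search.ebscohost.com/login.aspx"
           "?direct=true&db=neh&tg=UI&an=326730&site=novp-live")


def is_blocked(url: str) -> bool:
    """Return True if the proposed entry matches anywhere in the URL."""
    return PROPOSED.search(url) is not None


if __name__ == "__main__":
    for name, url in [("session PDF", SESSION_PDF), ("proxy", PROXY),
                      ("permalink", PERMALINK), ("paywall", PAYWALL)]:
        print(f"{name:12s} blocked={is_blocked(url)}")
```

Under these assumptions the session PDF link and the proxy link are caught, while the permalink and the paywall link pass through.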

<small>
(This is a consolidation of these two simpler regexes:
<pre>
ebscohost\.com.*pdfviewer #Block unusable [[wp:EBSCOHOST]] links but allow permalinks
ebscohost\.com\. #Match proxies, which is where it's not the end of the hostname - there's a literal "." after "com".
</pre>)</small>


Wikipedia has many apparently dead-on-arrival links (like [http://web.ebscohost.com/ehost/pdfviewer/pdfviewer?vid=4&hid=111&sid=b3387fad-2fe1-4ede-a963-0b5087cc9e5b%40sessionmgr115 this]) that were intended to point to PDFs and are of the form ebscohost.com...pdfviewer...:
All 7 of the [https://en.wikipedia.org/w/index.php?title=Special%3ASearch&search=ebscohost+pdfviewer 323 pages containing ebscohost and pdfviewer] that I looked at had dead EBSCO links. These are NOT links that hit a paywall (like [http://search.ebscohost.com/login.aspx?direct=true&db=neh&tg=UI&an=326730&site=novp-live this]). Rather, they bring up 404-like server error messages, ''and did from the day they were added''; they're non-persistent URLs.

A ''second problematic type'' of EBSCO link is the proxied URL, like the three added by [http://en.wikipedia.org/w/index.php?title=Open-source_movement&diff=prev&oldid=475497813 a user's (sole ever) edit], which are of the form hxxp://0-web.'''ebscohost.com.sculib.scu.edu'''/ehost/pdfviewer/pdfviewer?sid=[hex string]@sessionmgr13&vid=4&hid=13. (Note the '''bold''' portion!) These links work ONLY for subscribers who are ALSO at SCU. We shouldn't allow such links, and the blacklist (or a similarly functioning parallel system) would be a good solution.

I've noticed that EBSCO staff has been heavily editing their own article. I solicited assistance, hoping they'd be available, willing, and able to help fix these links or suggest ways to deal with them systematically. [https://en.wikipedia.org/wiki/Talk:EBSCO_Publishing#Link_problems note posted]; no response. What EBSCOhost calls permalinks, like http://search.ebscohost.com/login.aspx?direct=true&db=ulh&AN=37698669&site=ehost-live&scope=site are acceptable, and so I've designed a regex that allows the permalinks but forbids the non-persistent URLs.

Research suggests it's not possible to convert the non-persistent URLs to persistent URLs using the data in the former. --[[User:Elvey|Elvey]] ([[User talk:Elvey|talk]]) 21:26, 9 September 2013 (UTC)

:The second problem is the use of a proxied URL, i.e., the link points to an institution's proxy server such as sculib.scu.edu. This is not specific to ebscohost - it happens with links to other subscription databases too. A search for "ezproxy", for example, will bring up hundreds of such links. They are a bad thing. [[User:Nurg|Nurg]] ([[User talk:Nurg|talk]]) 08:39, 12 June 2013 (UTC) (reposted)

::I am tempted to see these sites as redirects, which work or fail depending on the reader's location. I would consider that these should typically be converted to direct links to the object (within educational institutions, one can generally use a web-proxy to get to literature - a direct link would either be the link on the server where the literature resides, or the DOI). <snip> Links through proxy servers have no place whatsoever. I am somewhat tempted to say that these need blanket blacklisting on meta, as they could possibly be abused to circumvent other blacklistings (for a relatively open proxy), and serve no function whatsoever to most readers except for the (few) ones that have access through the proxy - I doubt whether the url can even be understood well enough to figure out a real link from it. It is however going to be very obnoxious for the users who in good faith insert the proxy url they copy from their web-browser and then can't save, and one could think of cases where it is appropriate (if information is only available to people who can pass the proxy and nowhere else in the world, it could still be a good reference for certain information - think of it as a book of which the single copy is in a nearly inaccessible library (the library in the Vatican); it is still verifiable by proxying through people who do have access to the library (ask the pope)).

::Note, that with creative regex rule-writing, we could blacklist the two 'bad' examples of Nurg (the non-persistent link and the institution proxies), still enabling good ones (the permalinks). --[[User:Beetstra|Dirk Beetstra]] <sup>[[User_Talk:Beetstra|<span style="color:#0000FF;">T</span>]] [[Special:Contributions/Beetstra|<span style="color:#0000FF;">C</span>]]</sup> 09:30, 12 June 2013 (UTC) (reposted, indented, and 1 sentence snipped)

:::We use the blacklist to limit examiner.com links, because they generally fail RS, so I think it's appropriate that we add regexes for the impermanent URLs. (Arguably it would be better to have a similarly functioning parallel system with its own error messages handle sites like examiner.com and this ebsco problem, but in the meantime, I say let's put in regexes to handle them.) I also match the ebscohost proxy URLs, but not by matching on 'ezproxy', because some of the ebscohost proxy URLs don't contain 'ezproxy'. (It could be considered as part of a future proposed blacklist addition.) {{User|Beetstra}} suggested that blanket blacklisting on meta be considered, but at meta, though I see these links on other sites (e.g. 'fr.'), I was told firmly, "Deal with it at the local wiki level." (Discussion at https://meta.wikimedia.org/w/index.php?title=Talk:Spam_blacklist&oldid=5798048#Unusable_EBSCOHOST_links.) --[[User:Elvey|Elvey]] ([[User talk:Elvey|talk]]) 21:26, 9 September 2013 (UTC)


=Completed Proposed additions=

Revision as of 21:26, 9 September 2013

    Mediawiki:Spam-blacklist is meant to be used by the spam blacklist extension. Unlike the meta spam blacklist, this blacklist affects pages on the English Wikipedia only. Any administrator may edit the spam blacklist. See Wikipedia:Spam blacklist for more information about the spam blacklist.


    Instructions for editors

    There are 4 sections for posting comments below. Please make comments in the appropriate section. These links take you to the appropriate section:

    1. Proposed additions
    2. Proposed removals
    3. Troubleshooting and problems
    4. Discussion

    Each section has a message box with instructions. In addition, please sign your posts with ~~~~ after your comment.

    Completed requests are archived. Additions and removals are logged, reasons for blacklisting can be found there.

    Addition of the templates {{Link summary}} (for domains), {{IP summary}} (for IP editors) and {{User summary}} (for users with account) results in the COIBot reports to be refreshed. See User:COIBot for more information on the reports.


    Instructions for admins
    Any admin unfamiliar with this page should probably read this first, thanks.
    If in doubt, please leave a request and a spam-knowledgeable admin will follow-up.

    Please consider using Special:BlockedExternalDomains instead, powered by the AbuseFilter extension. This is faster and more easily searchable, though only supports whole domains and not whitelisting.

    1. Does the site have any validity to the project?
    2. Have links been placed after warnings/blocks? Have other methods of control been exhausted? Would referring this to our anti-spam bot, XLinkBot be a more appropriate step? Is there a WikiProject Spam report? If so, a permanent link would be helpful.
    3. Please ensure all links have been removed from articles and discussion pages before blacklisting. (They do not have to be removed from user or user talk pages.)
    4. Make the entry at the bottom of the list (before the last line). Please do not do this unless you are familiar with regular expressions — the disruption that can be caused is substantial.
    5. Close the request entry on here using either {{done}} or {{not done}} as appropriate. The request should be left open for a week maybe as there will often be further related sites or an appeal in that time.
    6. Log the entry. Warning: if you do not log any entry you make on the blacklist, it may well be removed if someone appeals and no valid reasons can be found. To log the entry, you will need this number – 572249788 after you have closed the request. See here for more info on logging.


    Proposed additions

    icax.co.uk

    icax.co.uk: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com

    Appears to be a keyword-spammed link farm designed for SEO positioning. It's a low-value link scattered around numerous articles related to alternative energy. Cantaloupe2 (talk) 12:48, 19 August 2013 (UTC)

    sgs.com

    sgs.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com

    Sockpuppet SPIs have been adding links to SGS press releases to numerous articles. See Wikipedia:Sockpuppet investigations/Schmetterling5/Archive. --Dennis Bratland (talk) 15:34, 17 July 2013 (UTC)

    angellis.net

    angellis.net: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com

    Various editors, mostly anonymous IPs, have been using this website as a reference on several articles about prehistoric animals, but the site is not a reputable source by any stretch, being a fanmade site filled with original research of very little value. --Mr Fink (talk) 16:54, 11 July 2013 (UTC)

    mosler-safe.de / .com

    IP range adding a dubious link to the German website of a "Mosler Safe Company" to various safe-related articles, including Mosler Safe Company, a historically famous American safe manufacturer that went bankrupt in 2001. --McGeddon (talk) 09:25, 1 July 2013 (UTC)

    This spammer is still going a month and a half later. --McGeddon (talk) 17:42, 19 August 2013 (UTC)

    sports-rings.com

    VQuakr (talk) 04:18, 9 July 2013 (UTC)

    obitree.com

    obitree.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com

    A minor 'obituary' website which is being spammed into the external links sections of articles on the recently deceased as a link to a 'memorial' or similar. There is no indication whatsoever that the site is operating in any way on behalf of relatives of the deceased. Furthermore, material on the website is frequently copied from our own articles without attribution (see for example Princess Fawzia Fuad of Egypt [1], James Gandolfini [2], Jim Kelly (martial artist) [3] etc.). Thus not only is this clear spam, but linking it might encourage circular referencing (it is also, incidentally, a breach of copyright). Note that after I warned an initial contributor regarding spamming (see User talk:Ikristin), new accounts have been created for each instance. AndyTheGrump (talk) 13:10, 14 July 2013 (UTC)

    Is anything going to be done about this website? This has been listed for almost two months now, and they are still spamming articles on recently dead persons. The latest trick, which I'd not seen before, is to replace a link with one of their own [4] - in this case a link to findagrave.com, which is arguably spam too, but I think we should be the judge of that, not boosters of a competing (?) website. Note that they are using disposable one-edit accounts - they clearly know what they are doing isn't approved of. AndyTheGrump (talk) 23:46, 7 September 2013 (UTC)
     Done. I've also blocked roughly thirty sockpuppets.... Reaper Eternal (talk) 11:59, 8 September 2013 (UTC)

    soulinterviews.com

    IP has been adding the same link since 2012 --Glaisher [talk] 16:22, 15 July 2013 (UTC)

    lawrenceindia.com

    lawrenceindia.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com

    First it was through the user name Royaltrains (talk · contribs · deleted contribs · blacklist hits · AbuseLog · what links to user page · count · COIBot · Spamcheck · user page logs · x-wiki · status · Edit filter search · Google · StopForumSpam) which was blocked from editing. After that another username

    adelaide-classifieds.info

    adelaide-classifieds.info: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com

     Done Reaper Eternal (talk) 12:13, 8 September 2013 (UTC)

    maharajasexpresstrains.com

    maharajasexpresstrains.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com

    Spammers

    thepalaceonwheels.com

    thepalaceonwheels.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com

    Spammers

    spywareloop.com

    Links being spammed

    spywareloop.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com

    Spammers

    Sourov0000 (talk · contribs · deleted contribs · blacklist hits · AbuseLog · what links to user page · count · COIBot · Spamcheck · user page logs · x-wiki · status · Edit filter search · Google · StopForumSpam)

    Diffs

    [5] [6] ... etc. User has been spamming this on dozens of articles today. -- Mesoderm (talk) 22:25, 27 July 2013 (UTC)

    • Well, Mesoderm, thank you for your concern about spamming. But the actual fact is that I am not involved in any kind of spamming. I am currently working on the virus and spyware sections, and that's why I added the references. The reason I was mentioned is that I have used the spywareloop.com link in several articles. I am also adding references and information from other websites to Wikipedia's spyware-related articles over time. So there is no need to see it as spam. I am well aware of Wikipedia's policies and am always against spamming. If what I did makes me a spammer, then I think I will have to be even more careful in future. Again, thank you for raising the matter. I really had no intention of spamming and never will. Thank you to all who have used their valuable time to read this. Sourov0000 (talk) 13:50, 28 July 2013 (UTC)

    thealternativepress.com

    thealternativepress.com

    This is no different from sites like the Examiner and Associated. As I am checking the WP refs, the links are copied press releases and things plagiarized from other sources. Maybe not all, but it looks like it so far. I noticed it popping up today in two seriously overlinked articles, Teresa Giudice and The Real Housewives of New Jersey. I read the linked reference and it was nothing more than reconstituted information that had already been published by news organizations. (Edited to add: it was actually copied word-for-word from The Star-Ledger!) Housewifehader (talk) 22:39, 31 July 2013 (UTC) There is no need for this site to be used as a reference here unless it contains original or exclusive content. I hope that I have formatted this correctly ;). Here is an example of the ref. as used in these articles: http://thealternativepress.com/towns/west-essex/sections/law-and-justice/articles/nj-housewives-star-theresa-guidice-and-husband-i Housewifehader (talk) 21:51, 31 July 2013 (UTC) Housewifehader (talk) 21:45, 31 July 2013 (UTC)

    Just a note to add that since I posted this earlier, I have checked this site out a little more closely, and there is plagiarism, copied press releases and broken links, but it also appears that there is some site-original content available. It looks like when they started they focused on an area in North Jersey with local writers, so I am hoping to see another opinion about this. TY Housewifehader (talk) 03:02, 1 August 2013 (UTC)

    gtax.co.uk

    This has been ongoing for some time from different IP addresses. The latest bout of spammers are: 195.144.63.18 (talk • contribs • deleted contribs • blacklist hits • AbuseLog • what links to user page • COIBot • Spamcheck • count • block log • x-wiki • Edit filter search • WHOIS • RDNS • tracert • robtex.com • StopForumSpam • Google • AboutUs • Project HoneyPot) 212.203.111.210 (talk • contribs • deleted contribs • blacklist hits • AbuseLog • what links to user page • COIBot • Spamcheck • count • block log • x-wiki • Edit filter search • WHOIS • RDNS • tracert • robtex.com • StopForumSpam • Google • AboutUs • Project HoneyPot) 85.3.200.64 (talk • contribs • deleted contribs • blacklist hits • AbuseLog • what links to user page • COIBot • Spamcheck • count • block log • x-wiki • Edit filter search • WHOIS • RDNS • tracert • robtex.com • StopForumSpam • Google • AboutUs • Project HoneyPot) (see the diffs from July 30 to August 3).

    Really disruptive spamming, since it replaces valid links and information with links to this company. Sjö (talk) 07:38, 3 August 2013 (UTC)

    And just to show that it's been going on for quite some time, Wikipedia:WikiProject Spam/LinkReports/gtax.co.uk is from 2009-2010. Sjö (talk) 20:23, 12 August 2013 (UTC)

    shaligram.com

    Google Analytics ID: UA-38591310 - (Track - Report - reverseinternet.com • Meta: Track - Report)

    Spammers

    MER-C 11:53, 8 August 2013 (UTC)

    elitetraveler.com

    Spammers

    Three spamruns from this website that I have noticed. The Banner talk 21:25, 13 August 2013 (UTC)

    Four spamruns (a small one the last time). The Banner talk 18:36, 2 September 2013 (UTC)
    • I have no opinion on the editors above, but I do want to report that the magazine itself qualifies as a reliable source according to Wikipedia standards (WP:RS). Like all glossy magazines, IMHO, they do pander to their advertisers to some degree, but it is a real print magazine with reporters, editors and fact checkers. --Nixie9 23:26, 3 September 2013 (UTC)

    Morning277's sources

    investmentunderground.com

    investmentunderground.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com This "source" is only used by the Morning277 a.k.a. Wiki-pr.com spammers. Examples found in a big box of junk:

    [8] [9] [10] [11] [12] [13] [14] [15] [16] [17] [18] [19] [20] [21] [22] [23] [24]

    As I write, there are 20 of these links in the article space of the English Wikipedia, and 30 elsewhere. This outfit has multiple people working on insertion of new articles, restoration of deleted articles, and maintenance of their preferred content in articles. I understand that the links are normally removed before blacklisting. If this is approved, I'm willing to remove the links from the English Wikipedia afterward.

    rybec 00:36, 14 August 2013 (UTC)

    There's no requirement for links to be removed before blacklisting. Before or after, it doesn't matter. Isn't this one of those sites that these socks are removing anyway to avoid detection? ~Amatulić (talk) 23:42, 14 August 2013 (UTC)
    I don't have a diff that shows them removing this particular one, but it's one of the sites I listed at Wikipedia:Long-term_abuse/Morning277#Habitual_behavior. There are now 12 articles that contain it [25] and 24 other pages [26]. I have a suspicion that the PR company arranges for stories to be written about its clients on the Investment Underground site; then the Investment Underground story lends an appearance of credibility to the Wikipedia article. New page patrollers see an article written in fluent English with inline references, and leave it alone. After examiner dot com was blacklisted, this outfit started just writing "Examiner" without providing a URL, so I'm not sure blacklisting will be entirely effective, but it may help. --rybec 23:56, 15 August 2013 (UTC)
    There are so many publications called "Examiner" that if the URL is missing, it's probably a decent sock detection signal.
    I'm uneasy about blacklisting a source that isn't unambiguously being used for abuse. This seems more indirect: citing possibly-paid-for articles on investmentunderground as a means to preserve articles on Wikipedia posted by the PR firm. Needs some discussion. It might be an easier decision if this site had little editorial oversight like examiner.com, but I'm not sure that's the case. ~Amatulić (talk) 00:21, 16 August 2013 (UTC)
    If there are good-faith uses for this source, but most links by new accounts are spam, would this be a good XLinkBot candidate? VQuakr (talk) 01:09, 16 August 2013 (UTC)

    (edit conflict)In the past, the Morning277 editors always became autoconfirmed before posting their articles. Recently, they've stopped doing that, but I'm sure they wouldn't mind going back to their previous behavior. If I understand correctly, XLinkBot does not affect autoconfirmed editors. Below, I contend that good-faith use of this URL occurs in at most one article on the English Wikipedia.

    The Wikipedia search says there are 13 results, but only shows 12 articles on the results page. I assume it's because one article contains two Investment Underground citations.

    User:Sublimeharmony/sandbox11 contains 75 distinct articles; of those, 17 linked to this site (22.7%). Of the 4,306,170 articles on the English Wikipedia, only 12 link to Investment Underground (0.00028%). Arithmetically, there's not much ambiguity, even if all 12 articles were unrelated to the spamming PR firm. However, I believe that all or nearly all were placed on Wikipedia by that PR firm. If the relationship between Investment Underground and the company placing these Wikipedia articles is what I suspect it is, it should be obvious why they would choose not to disclose that. For further evidence, look at the timing of when stories appear on investmentunderground.com and vatalyst.com, and the first appearance of the Wikipedia article.
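The shares quoted above can be rechecked with a few lines of arithmetic. This is only an illustrative sketch: the counts are the ones stated in this post, and the variable names are made up for the example. Rounded, the two shares come to about 22.7% and 0.00028%.

```python
# Counts as stated in the post above.
SANDBOX_ARTICLES = 75          # distinct articles in the sandbox list
SANDBOX_LINKING = 17           # of those, articles linking to the site
WIKI_ARTICLES = 4_306_170      # articles on the English Wikipedia
WIKI_LINKING = 12              # of those, articles linking to the site

# Share of articles linking to the site, in percent.
sandbox_share = 100 * SANDBOX_LINKING / SANDBOX_ARTICLES
wiki_share = 100 * WIKI_LINKING / WIKI_ARTICLES

# How many times more common the link is in the sandbox list.
over_representation = sandbox_share / wiki_share

if __name__ == "__main__":
    print(f"sandbox share: {sandbox_share:.1f}%")
    print(f"wiki share:    {wiki_share:.5f}%")
    print(f"ratio:         {over_representation:,.0f}x")
```

The ratio is the more telling figure: the link is tens of thousands of times more common in the sandbox list than across the wiki as a whole.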

    Here are the 12 articles which cite Investment Underground, and some of the reasons why I think they're all products of the same company:

    1. David Stewart (alternative medicine) cites Vatalyst (one of only 8 articles which do), CNN Ireport, an Andrew Moran article in Digital Journal
    2. Banc De Binary An article about its CEO is in User:Sublimeharmony/sandbox11 [27].
    3. Citadel EFT Its creator did minor edits to existing articles until autoconfirmed, pasted this article into a sandbox [28], and abandoned his account the same day he moved the article to the main space on the English Wikipedia. These habits were all characteristic of Morning277 prior to mid-July 2013.
    4. EFans (social network) This one is a little ambiguous, I admit. The editing habits and writing style differ from Morning277's usual. I do notice that it cites www.business2community.com, another of Morning277's preferred sources, and the refnames are two-letter, capitalized abbreviations. Also, it's about an obscure Web site (its Alexa rank is 415,792 globally and 149,308 for India). The efans.com domain is registered to someone in La Jolla, California. Morning277 activity has been seen from the San Diego area; La Jolla is a suburb of San Diego.
    5. Enigma NMS Cites California Business Journal and Business 2 Community, two more of Morning277's habitual sources. Had a minor edit by Tabithamuiru (talk · contribs), a Morning277 account which re-spammed Cleeng as Cleeng (Company)—see [29].
    6. Key Realty School LLC Previously spammed at Key Realty School, now recreated under a different name—a hallmark of Morning277.
    7. Linode Also cites California Business Journal, Technorati and Yahoo Voices, which are among Morning277's preferred sources. Heavily edited by Bluesman3145 (talk · contribs) and Afolson (talk · contribs), both blocked as MooshiePorkFace (talk · contribs) socks. MooshiePorkFace is believed to work with Morning277.
    8. Search Engine People another of the 8 that cites Vatalyst, just like David Stewart (alternative medicine) and Zorpia
    9. SmartFile (company) Also cites an Andrew Moran article in Digital Journal, just like Tim Grayem and David Stewart (alternative medicine).
    10. Tim Grayem Also cites an Andrew Moran article in Digital Journal, just like David Stewart (alternative medicine) and SmartFile (company).
    11. W Athletic (talent agency) Previously spammed at W Athletic, now recreated under a different name with unnecessary disambiguation—a hallmark of Morning277.
    12. Zorpia Is another of the 8 that cites Vatalyst, just like David Stewart (alternative medicine) and Search Engine People.

    To belabor a point, there are 4.3 million articles on the wiki, of which 12 cite Investment Underground and 8 cite Vatalyst, with an overlap of 3 articles that cite both! For that to happen by chance is like having three people in your city who each won a lottery grand prize, got struck by lightning and were bitten by sharks. Also, for two of them, there were stories by Andrew Moran in Digital Journal (he's not their only writer). It's as though the same shark came back and gave two of those people a second bite. It would make sense if the three subjects had a great deal in common, or if Investment Underground, Vatalyst, and Andrew Moran all produced a huge volume of stories, but they don't. The assumption that they all have the same publicist, who has an "in" with the ostensible journalists, is the only reasonable explanation.
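The "by chance" claim can be given a rough order of magnitude. The sketch below assumes, unrealistically, that the 12 and 8 articles are drawn independently and uniformly at random from all articles, and computes the hypergeometric probability of an overlap of 3 or more. The function name and the independence model are assumptions made for illustration; they are not part of the original argument.

```python
from math import comb

TOTAL_ARTICLES = 4_306_170   # articles on the English Wikipedia (from the post)
CITE_IU = 12                 # articles citing Investment Underground
CITE_VATALYST = 8            # articles citing Vatalyst
OBSERVED_OVERLAP = 3         # articles citing both


def overlap_tail_probability(total: int, group_a: int, group_b: int, k_min: int) -> float:
    """P(overlap >= k_min) when group_b articles are chosen uniformly at
    random from `total`, of which `group_a` are marked (hypergeometric tail)."""
    ways_total = comb(total, group_b)
    ways_hit = sum(
        comb(group_a, k) * comb(total - group_a, group_b - k)
        for k in range(k_min, min(group_a, group_b) + 1)
    )
    return ways_hit / ways_total


if __name__ == "__main__":
    p = overlap_tail_probability(TOTAL_ARTICLES, CITE_IU, CITE_VATALYST, OBSERVED_OVERLAP)
    print(f"P(overlap >= {OBSERVED_OVERLAP}) under the random model: {p:.2e}")
```

Under that simplified model the probability is far below one in a trillion. Real citation patterns are not independent (articles on similar topics share sources), so the figure only shows the scale of the coincidence and is not a measured likelihood.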

    Another thing I notice is that the domains wiki-pr.com, vatalyst.com, investmentunderground.com and cabusinessjournal.org are all registered with Godaddy and the whois information is private. While Godaddy is a very popular registrar, this is suspicious by itself.

    The only article where I see this site used that isn't very obviously a Morning277 write-up is the EFans article, and I think I've shown it's reasonable to have suspicion about that one. I just don't see this site used by non-spammers. Morning277 is hardly likely to send in a signed affidavit disclosing its relationship with Investment Underground, Vatalyst, and the California Business Journal. --rybec 02:50, 16 August 2013 (UTC)

    vatalyst.com

    vatalyst.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com The rationale is the same as for investmentunderground.com above.

    This site was referenced in 22 Sublimeharmony articles: [30] [31] [32] [33] [34] [35] [36] [37] [38] [39] [40] [41] [42] [43] [44] [45] [46] [47] [48] [49] [50] [51]

    This is currently used as a reference in 8 articles on the English Wikipedia:

    1. IDrive Inc. has two Vatalyst citations. Was created by 178Tint (talk · contribs), an editor in good standing.
    2. Search Engine People discussed in investmentunderground.com section
    3. Zorpia discussed in investmentunderground.com section
    4. Rivalus also cites Digital Journal, CNN Ireport, Yahoo Voices
    5. National Academy of Future Physicians and Medical Scientists also cites Business 2 Community twice, Yahoo Voices, CNN Ireport, and California Business Journal
    6. David Stewart (alternative medicine) discussed in investmentunderground.com section
    7. Krinos Foods Canada Ltd also cites California Business Journal, Business 2 Community, Yahoo Voices, Examiner
    8. Geoffrey Edelsten Philanthropy section with citations to one Vatalyst story was added [52] by Phrenology (talk · contribs), an editor in good standing.

    rybec 07:48, 16 August 2013 (UTC)

    dividendkings.com

    dividendkings.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com The rationale is the same as for investmentunderground.com and vatalyst.com above.

    This site was referenced in Sublimeharmony articles:

    1. [53] Sollensys
    2. [54] Echopass
    3. [55] Security Innovation
    4. [56] Loyaltyworks
    5. [57] Oren Laurent
    6. [58] NewYorkStay.com
    7. [59] Rev. Fr. Emmanuel Lemelson
    8. [60] American Writers and Artists Inc.

    It is not currently used in articles on the English Wikipedia, although there are three sandboxes which refer to it:

    1. User:Alcedine/sandbox about Casino.org. It cites Dividend Kings, Yahoo Voices, Vatalyst, pameladeter's blog and a Digital Journal piece by Andrew Moran as its only sources.
    2. User:Jvn mht/sandbox about QuoteWizard, a previous Morning277 subject. Jvn mht (talk · contribs) has been blocked. The draft cites Vatalyst, Dividend Kings and Yahoo Voices as its only sources.
    3. User:Gem 1981/sandbox about TestCountry, a previous Morning277 subject. It cites Vatalyst, Dividend Kings and a Digital Journal piece by Andrew Moran as its only sources.

    Besides the pameladeter blog, all the sources for these three sandboxes come from the short list I posted at Wikipedia:Long-term_abuse/Morning277. —rybec 15:51, 16 August 2013 (UTC)[reply]

    cabusinessjournal.org

    cabusinessjournal.org: The rationale is the same as for investmentunderground.com, vatalyst.com and dividendkings.com above.

    This site was referenced in Sublimeharmony articles:

    1. [61] ModelManagement.com
    2. [62] Brosix
    3. [63] NewYorkStay.com
    4. [64] Sweet Couch
    5. [65] Emmanuel Lemelson
    6. [66] American Writers and Artists Inc.

    The uses on the English Wikipedia [67] include many of the same articles I mentioned in regard to investmentunderground.com, vatalyst.com and dividendkings.com. You'll see those sites cited along with this one. All the uses look like Morning277's work, or discussion of it. Right now it's in only six main-space articles:

    rybec 19:00, 16 August 2013 (UTC)[reply]

    celebritiesheight.com

    This website is currently being spammed by numerous hit 'n run accounts. So far I have caught:

    1. Sushmaatamang
    2. Rupalithapalia
    3. Pramodkumarshah
    4. Dilkaparamda
    5. Bidurchhetri
    6. Durgalamsal
    7. Rashmiupreti
    8. Pimaggiea
    9. Putanakamatyadav
    10. Amranyadampo
    11. Millitochulati
    12. Bakanatamrakar
    13. Laymetamang
    14. Deepikasonnet
    15. Angeelarai
    16. Sumeetbohara
    17. Putanilamba
    18. Pitleungama
    19. Mayakobarima
    20. Phirwohiraat

    Looking at the search results when searching for the domain, there are probably at least 30 more accounts spamming the website. Nymf (talk) 17:06, 16 August 2013 (UTC)[reply]

    1. Melepatisujata
    2. Dilipamemorous
    3. Pimariaya
    4. Prichardshahi
    5. Stangeprasad
    6. Licianasharma
    7. Kritikabhanddari
    8. Pimaryenma
    9. Gregorykarki
    10. Lawatipranami
    11. Amrtinahipun
    12. Durchanarawat
    13. Sundarshyamgale
    14. Saritamulepatii
    15. Thapadeepika
    16. Prabhuadhikary
    17. Pkhannagarama
    18. Colemansuzata
    19. Baralanita
    20. Douglassumeet
    21. Umakantana
    22. Rosiemai
    23. Pitambarlama
    24. Pukartarahaume
    25. Anishwaglee
    26. Miswahkumaratankga
    27. Trishakapoor
    28. Surdarshanarlama
    29. Chumanshinggurung
    30. Unnatik -- stale for CU
    31. Amrittas -- stale for CU

    SPI filed. MER-C 04:06, 17 August 2013 (UTC)[reply]

     Done Reaper Eternal (talk) 20:36, 17 August 2013 (UTC)[reply]

    fncy.it

    fncy.it is a 302 redirect to bitly.com

    rybec 07:01, 17 August 2013 (UTC)[reply]

     Defer to Global blacklist URL shorteners go on the global blacklist. I've posted a request there. MER-C 07:53, 17 August 2013 (UTC)[reply]

    Morning277 subjects

    These sites are being promoted by a publicity agency, banned from Wikipedia, which has been posting articles about them. After an article is deleted and the poster blocked, a new article with similar contents is posted from a different account, almost always under a different title. Since they keep using new accounts and new article titles, account blocking and page protection haven't been entirely effective.

    newyorkstay.com

    www.youtube.com/user/newyorkstaycom

    www.justiceforall.com

    www.kulaw.com

    4cabling.com.au

    www.aasted.eu

    www.alsbridge.com

    www.awaionline.com

    www.bizible.com

    rybec

    epnrstatus.co.in

    • Regularly added to articles relating to Indian Railways, often by replacing legitimate links in the articles with the spamlink (both external links and reflinks, with official Indian Railways links being the prime target; sample diffs: [68],[69], [70], [71], [72]). The spamlinks are added by IP-users who all geolocate to India, using a new IP each time. Thomas.W talk to me 10:57, 22 August 2013 (UTC)[reply]

    hipromtech.com

    www.princeton.edu/~achaney/tmve/wiki100k

    • The site is not reliable and should not be used in article space, per WP:CIRCULAR. It is a clone of Wikipedia (every real page there says "The article content of this page came from Wikipedia and is governed by CC-BY-SA."). Some Wikipedia editors may take the site for a reliable source (it ranks near the top of Google results and the domain is .edu), but it is not one, and there should be some way to tell editors that the link should not be added to Wikipedia.
    • Recent example: diff
    • Currently there are 76 links to the site, some of them from article space [73]:
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/1924_Summer_Olympics.html is linked from Albert Séguin
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/2D_computer_graphics.html is linked from 2D computer graphics
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Abba_Eban.html is linked from Abba Eban
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Adair_County,_Missouri.html is linked from Grand River (Missouri)
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Area_rule.html is linked from Sears–Haack body
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Banba.html is linked from LÉ Banba (CM11)
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Ben_Bova.html is linked from Ben Bova
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Bhavani.html is linked from Bhavani Peth
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Biface.html is linked from Hand axe
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Brightness_temperature.html is linked from Brightness temperature
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Bushyhead,_Oklahoma.html is linked from Dennis Bushyhead
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Byng,_Oklahoma.html is linked from Julian Byng, 1st Viscount Byng of Vimy
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/CDC_6600.html is linked from CDC 6600
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Camel_(band).html is linked from Camel (band)
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Camel_(band).html is linked from The Snow Goose (album)
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Critical_theory.html is linked from Critical theory
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Ctesiphon.html is linked from Iwan
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Du_hast.html is linked from Burkenburg
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Francesco_Redi.html is linked from Francesco Redi
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Jean_le_Rond_d_Alembert.html is linked from Louis-Camus Destouches
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Lagari_Hasan_%C3%87elebi.html is linked from Lagâri Hasan Çelebi
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Language_game.html is linked from Language game
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Local_Government_Areas_of_Australia.html is linked from Local Government Area
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Lord_Peter_Wimsey.html is linked from Lord Peter Wimsey
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Mohammed_Deif.html is linked from Mohammed Deif
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Mystic_Records.html is linked from Mystic Records
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Noise_weighting.html is linked from Psophometric weighting
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Phonograph_cylinder.html is linked from Early classical guitar recordings
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Pimsleur_language_learning_system.html is linked from Pimsleur method
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Pope_John_XXI.html is linked from History of Roman Catholicism in Portugal
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/QuarkXPress.html is linked from QuarkXPress
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Reconquista.html is linked from History of Roman Catholicism in Portugal
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Record_producer.html is linked from Executive producer
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Sacraments_of_the_Catholic_Church.html is linked from Catholic Church
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Simpson_s_paradox.html is linked from Edward H. Simpson
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Smokey_Robinson.html is linked from North End, Detroit
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Sunk_costs.html is linked from Sunk costs
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/The_Chemical_Brothers.html is linked from Alleyn's School
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Transport_in_Barbados.htm is linked from Transport in Barbados
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Tsui_Hark.html/ is linked from List of University of Texas at Austin alumni
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Wall.html is linked from Wall
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Warren,_Arkansas.html is linked from Warren, Arkansas
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Wis%C5%82awa_Szymborska.html is linked from Ironic precision
      • www.princeton.edu/~achaney/tmve/wiki100k/docs/Yom_Kippur_War.html is linked from United Nations Security Council Resolution 338
    PS: There are other copies of "tmve/wiki100k" on different sites (google for "tmve/wiki100k" site:wikipedia.org), e.g. http://www.sccs.swarthmore.edu/users/08/ajb/tmve/wiki100k/docs/Bavarii.html and http://www.sccs.swarthmore.edu/users/08/ajb/tmve/wiki100k/docs/Potentiometer.html, and links to them are not only on en-wiki (move to the meta spam list, or create some filters for wiki100k?) `a5b (talk) 00:52, 25 August 2013 (UTC)[reply]
    I don't know where the links came from but that site is benign -- it's an experiment being done by a grad student at Princeton. For more information, see these web pages:
    http://www.cs.princeton.edu/~achaney/papers/ChaneyBlei2012.pdf
    I suggest emailing her at http://www.cs.princeton.edu/~achaney/email.html before any blacklisting to give her a heads up.
    Her work could be very useful to Wikipedia and the Wikimedia Foundation in the long-term.
    That said, we don't need any of these links since they circle back to our own content.
    --A. B. (talkcontribsglobal count) 16:23, 29 August 2013 (UTC)[reply]
    I suggest she get in touch with WikiProject Research
    --A. B. (talkcontribsglobal count) 16:25, 29 August 2013 (UTC)[reply]
    That Wikiproject looks moribund when I look at it closer. It looks like there's very active support and discussion of various research projects on Meta-Wiki at meta:Research:Index. I'd hate to see a diligent researcher run afoul of what might look BITE-y to an outsider.
    --A. B. (talkcontribsglobal count) 16:34, 29 August 2013 (UTC)[reply]
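
    The question raised in the PS above, about wiki100k copies hosted on other sites, could be approached by matching the shared path rather than any single domain. The following is only a minimal sketch: the pattern tmve/wiki100k is an assumption (no such entry was proposed in this thread), the helper name is made up, and it uses Python's re module rather than the blacklist's own matching.

```python
import re

# Hypothetical host-independent pattern; not an entry proposed in this discussion.
WIKI100K = re.compile(r"tmve/wiki100k", re.IGNORECASE)


def is_wiki100k_copy(url: str) -> bool:
    """Return True if the URL contains the shared wiki100k path (illustrative helper)."""
    return WIKI100K.search(url) is not None


# URLs quoted in the thread above.
URLS = [
    "www.princeton.edu/~achaney/tmve/wiki100k/docs/Abba_Eban.html",
    "http://www.sccs.swarthmore.edu/users/08/ajb/tmve/wiki100k/docs/Bavarii.html",
    "http://www.cs.princeton.edu/~achaney/papers/ChaneyBlei2012.pdf",
]

for url in URLS:
    print(is_wiki100k_copy(url), url)
```

    A path-based entry of this kind would cover the Princeton and Swarthmore copies alike while leaving the researcher's paper linkable; whether that trade-off is wanted is for the discussion to decide.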

    www.historyofnations.net

    historyofnations.net: This site uses Wikipedia content without attribution and is placed into the External links section of about 100 'History of ...' articles. Abductive (reasoning) 20:40, 27 August 2013 (UTC)[reply]

    sourcesecurity.com

    Spammers

    Long term, persistent spamming on many IPs and users - above is a partial list of IPs and accounts. Main spam URL is sourcesecurity.com, but thebigredguide and yogawizard show some overlap in accounts. - MrOllie (talk) 18:37, 30 August 2013 (UTC)[reply]

    thewebminer.com

    scrape4me.com

    500music.com

    Here's a specific suggestion:

    ebscohost\.com(\.|.*(pdfviewer|EbscoContent))     #Block 3 kinds of unusable EBSCOHOST links but allow permalinks: Match proxies: there's a literal "." after "com", and temporary session links, which contain pdfviewer or EbscoContent
    

    (This is a consolidation of these two simpler regexes:

    ebscohost\.com.*pdfviewer          #Block unusable [[wp:EBSCOHOST]] links but allow permalinks
    ebscohost\.com\.                   #Match proxies, which is where it's not the end of the hostname - there's a literal "." after "com".
    

    )


    Wikipedia has many apparently dead-on-arrival links (like this), intended to point to PDFs, of the form ebscohost.com...pdfviewer...: all 7 of the 323 pages containing ebscohost and pdfviewer that I looked at had dead EBSCO links. These are NOT links that hit a paywall (like this). Rather, they bring up 404-like server error messages, and did from the day they were added; they're non-persistent URLs.

    A second problematic type of EBSCO link is the proxied URL, like the three added by one user's sole edit, which are of the form hxxp://0-web.ebscohost.com.sculib.scu.edu/ehost/pdfviewer/pdfviewer?sid=[hex string]@sessionmgr13&vid=4&hid=13. (Note the bold portion!) These links work ONLY for subscribers who are ALSO at SCU. We shouldn't allow such links, and the blacklist (or a similarly functioning parallel system) would be a good solution.

    I've noticed that EBSCO staff has been heavily editing their own article. I solicited assistance, hoping they'd be available, willing, and able to help fix these links or suggest ways to deal with them systematically. I posted a note but got no response. What EBSCOhost calls permalinks, like http://search.ebscohost.com/login.aspx?direct=true&db=ulh&AN=37698669&site=ehost-live&scope=site, are acceptable, so I've designed a regex that allows the permalinks but forbids the non-persistent URLs.
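
    As a rough check of the behaviour described above, the proposed pattern can be tried against the example URLs from this request. This is only a sketch: it uses Python's re module with a plain case-insensitive search, which is an assumption and may differ from how the blacklist itself applies its entries, and the helper name is_blocked is made up for illustration.

```python
import re

# Proposed blacklist entry, copied from the request above.
PROPOSED = re.compile(r"ebscohost\.com(\.|.*(pdfviewer|EbscoContent))", re.IGNORECASE)


def is_blocked(url: str) -> bool:
    """Return True if the proposed pattern matches anywhere in the URL (illustrative helper)."""
    return PROPOSED.search(url) is not None


# Example URLs taken from this request.
SESSION_LINK = (
    "http://web.ebscohost.com/ehost/pdfviewer/pdfviewer"
    "?vid=4&hid=111&sid=b3387fad-2fe1-4ede-a963-0b5087cc9e5b%40sessionmgr115"
)
PROXY_LINK = (
    "hxxp://0-web.ebscohost.com.sculib.scu.edu/ehost/pdfviewer/pdfviewer"
    "?sid=[hex string]@sessionmgr13&vid=4&hid=13"
)
# Same proxy host with the path dropped (shortened for illustration),
# so that only the literal "." after "com" can trigger the match.
PROXY_HOST_ONLY = "hxxp://0-web.ebscohost.com.sculib.scu.edu/"
PERMALINK = (
    "http://search.ebscohost.com/login.aspx"
    "?direct=true&db=ulh&AN=37698669&site=ehost-live&scope=site"
)

for label, url in [
    ("session link", SESSION_LINK),
    ("proxied link", PROXY_LINK),
    ("proxy host only", PROXY_HOST_ONLY),
    ("permalink", PERMALINK),
]:
    print(label, "->", "blocked" if is_blocked(url) else "allowed")
```

    In this sketch the session link and both proxied forms are blocked, while the permalink is allowed because nothing after "ebscohost.com" in it is a literal "." or contains pdfviewer or EbscoContent.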

    Research suggests it's not possible to convert the non-persistent URLs to persistent URLs using the data in the former. --Elvey (talk) 21:26, 9 September 2013 (UTC)[reply]

    The second problem is the use of a proxied URL, i.e., the link points to an institution's proxy server such as sculib.scu.edu. This is not specific to ebscohost - it happens with links to other subscription databases too. A search for "ezproxy", for example, will bring up hundreds of such links. They are a bad thing. Nurg (talk) 08:39, 12 June 2013 (UTC) (reposted)[reply]
    I am tempted to see these sites as redirects, which work or fail depending on the reader's location. I think these should typically be converted to direct links to the object (within educational institutions, one can generally use a web-proxy to get to literature; a direct link would be either the link on the server where the literature resides, or the DOI). <snip> Links through proxy servers have no place whatsoever. I am somewhat tempted to say that these need blanket blacklisting on meta, as they could possibly be abused to circumvent other blacklistings (for a relatively open proxy), and serve no function whatsoever to most readers except for the (few) ones that have access through the proxy. I even doubt whether the URL can be understood well enough to figure out a real link from it. It is however going to be very obnoxious for users who in good faith insert the proxy URL they copied from their web browser and then can't save, and one could think of cases where it is appropriate (if information is only available to people who can pass the proxy and nowhere else in the world, it could still be a good reference for certain information. Think of it as a book of which the single copy is in a nearly inaccessible library (the library in the Vatican): it is still verifiable by proxying through people who do have access to the library (ask the pope)).
    Note, that with creative regex rule-writing, we could blacklist the two 'bad' examples of Nurg (the non-persistent link and the institution proxies), still enabling good ones (the permalinks). --Dirk Beetstra T C 09:30, 12 June 2013 (UTC) (reposted, indented, and 1 sentence snipped)[reply]
    We use the blacklist to limit examiner.com links, because they generally fail RS, so I think it's appropriate that we add regexes for the impermanent URLs. (Arguably it would be better to have a similarly functioning parallel system, with its own error messages, handle sites like examiner.com and this ebsco problem, but in the meantime, I say let's put in regexes to handle them.) I also match the ebscohost proxy URLs, but not by matching on 'ezproxy', because some of the ebscohost proxy URLs don't contain 'ezproxy'. (Matching on 'ezproxy' could be considered as part of a future proposed blacklist addition.) Beetstra (talk · contribs) suggested that blanket blacklisting on meta be considered, but at meta, even though I see these links on other sites (e.g. 'fr.'), I was told firmly, "Deal with it at the local wiki level." (Discussion at https://meta.wikimedia.org/w/index.php?title=Talk:Spam_blacklist&oldid=5798048#Unusable_EBSCOHOST_links.) --Elvey (talk) 21:26, 9 September 2013 (UTC)[reply]

    Completed Proposed additions

    blogintomystery.com

    See WikiProject Spam report. Of the 30 or so links I have removed, all of them were added by that /17. MER-C 08:09, 8 June 2013 (UTC)[reply]

     Done Reaper Eternal (talk) 19:14, 29 July 2013 (UTC)[reply]

    easybusinessposters.com

    Google Analytics ID: UA-23731192 - (Track - Report - reverseinternet.com • Meta: Track - Report)

    Spammers

    See WikiProject Spam report MER-C 06:52, 10 July 2013 (UTC)[reply]

     Done Reaper Eternal (talk) 19:21, 29 July 2013 (UTC)[reply]

    42.com

    42.com: Site redirects to malware, per Norton; added to Phrases from The Hitchhiker's Guide to the Galaxy (diff). - Pointillist (talk) 20:37, 4 August 2013 (UTC)[reply]

     Defer to Global blacklist .. if it damages the readers, it is of interest to all of mediawiki to block this. --Dirk Beetstra T C 05:49, 5 August 2013 (UTC)[reply]

    Proposed removals

    Free Web Town

    freewebtown.com/ Just need it because I can only access it if I make a link to it in the sandbox (most sites are blocked on my computer). Why was it blocked? Pubserv (talk) 19:20, 22 July 2013 (UTC)[reply]

    Could you please tell us which domain/link, just leave off the 'http://' from the beginning and we will see what you are talking about. Thanks! --Dirk Beetstra T C 07:18, 28 July 2013 (UTC)[reply]

    freewebtown.com/ Pubserv (talk) 19:34, 29 July 2013 (UTC)[reply]

    Not blacklisted here on Wikipedia. It's blacklisted globally. See meta:Talk:Spam blacklist/Archives/2008-12#www.freewebtown.com.
     Defer to Global blacklist if you want to contest this listing, but it's unlikely you will succeed because free web sites are highly unlikely ever to be appropriate for linking or referencing on Wikipedia. ~Amatulić (talk) 23:48, 31 July 2013 (UTC)[reply]
    Never mind. Defer means delay so that doesn't make sense. Why is it blacklisted globally? And how? Even at Google? Pubserv (talk) 19:15, 11 August 2013 (UTC)[reply]
    In this context it refers not to delay, but to submit to, i.e. "I'll defer to your judgement in this case". meta:Meta-Wiki is the central project that oversees global issues (Stewards, global blocks, global rights, crosswiki blacklisting, etc.). freewebtown was an issue for multiple projects (which may or may not have included enwiki) and it was deemed that blacklisting there was needed. Since it wasn't blacklisted locally we cannot remove it from the blacklist. You would need to file a request there. In regards to the term global, it refers to all meta:Wikimedia projects hosted by the Wikimedia Foundation. Werieth (talk) 19:23, 11 August 2013 (UTC)[reply]

    PowerSupplies.net

    Could you please remove this valuable resource site from the Wikipedia blacklist? The site started more than 10 years ago, it is a valuable source of information, for several years was #1 in Google search for “power supplies” words, never used link exchange to boost page ranking, does not send email to sell unsolicited products. — Preceding unsigned comment added by 143.166.255.115 (talkcontribs) 17:02, 30 July 2013 (UTC)[reply]

    Some information:
    Blacklisted in July 2012.
    Wikipedia talk:WikiProject_Spam/2012 Archive Jul 2#Long term SMPS Power Supplies related Spamming indicates this was an attempt to maximize Google AdSense revenue.
    COIbot report says the link was added 8 times.
    That said, the intent didn't seem to be to advertise. The web site itself is simply a browsable database of power supplies, and published technical spec sheets. It seems to be more of a portal, with no ads, no "buy it here" links, or anything spammy.
    I see it more as a tool to find specifications for individual power supplies than a possible reference site for citing on Wikipedia. If any of those spec sheets need to be referenced, powersupplies.net links to the external sites that host those spec sheets, so they can be referenced instead.
    Question: What possible use would this site have on Wikipedia? ~Amatulić (talk) 23:37, 31 July 2013 (UTC)[reply]
    There is a description here (www.smps.com/) of what appears to be their three websites. I can see using them as a reference for a specific power supply company, although I was trying to see if we even have any articles about any power supply companies. Power supplies are like circuit boards. They are inside your computer and no one knows they are there or what they do, but without them you would not be able to use your computer. The other two websites are more interesting though (both also blacklisted). Apteva (talk) 07:14, 4 August 2013 (UTC)[reply]
    If you go back to the original report, many of the IPs were either from Texas ISPs or they were Dell IPs; Dell is based in Austin, Texas as is the owner of some of these domains. The IP above is a Dell Computer IP.
    no Declined. If an established, trusted editor wants to link to a particular page; they can request it at MediaWiki talk:Spam-whitelist
    --A. B. (talkcontribsglobal count) 17:04, 29 August 2013 (UTC)[reply]

    prouty.org

    I am following up to VRTS ticket # 2013081210002123 sent by a representative of prouty.org, which was blacklisted in August 2011.

    The site wasn't blacklisted on the basis of any discussion on this page, but rather based on an ANI discussion (archived here). The blacklisting was apparently done in response to disruption caused by linking to an attack page at www.prouty.org/mcadams/

    I know that we don't de-list sites based on requests from people with a conflict of interest, and anyone who monitors this page will know I have declined such requests on many occasions.

    However, based on that ANI conversation, it seems that the blacklisting may have been done hastily. The final comment from bureaucrat Infrogmation (talk · contribs) suggests that this listing should be revisited.

    I suggest not de-listing, but modifying the entry to blacklist only that attack page. I would do it myself, but I prefer the transparency of discussion on this public page first, rather than back-room OTRS communications. ~Amatulić (talk) 22:17, 13 August 2013 (UTC)[reply]

    dyingscene.com

    Why is this site blacklisted? I don't see any reason in the log. I wanted to add some information to an article about Greg Hetson, but I couldn't add references, as dyingscene.com is blacklisted. Nazgul02 (talk)

    It appears this site was blacklisted back in 2010 for sourcing its own articles on Wikipedia. I have personally contacted the owner of the site and was assured they no longer contribute to wikipedia from their own site. Since it's been 3 years, my recommendation would be to unblock them in order to allow our contributors to reference them for articles related to punk music. - Dr.Music —Preceding undated comment added 18:39, 28 August 2013 (UTC)[reply]

    "They no longer contribute to Wikipedia from their own site." What does that have to do with anything? They're not blocked from editing (from their site or elsewhere). They're blacklisted. Two totally different things. They can edit, but they just can't add their site to Wikipedia. Naturally, if they're blacklisted, they have no motivation to contribute. That's hardly surprising.
     Defer to Whitelist to request white-listing of individual pages. ~Amatulić (talk) 06:09, 30 August 2013 (UTC)[reply]
    The request of a longtime contributor (Nazgul) is non-trivial, and three years is a long time to exclude a site for what may have been a one-time indiscretion. An assertion by the site's owner is also significant and it's worthwhile to assume good faith on the owner's part. I support removing this from the blacklist. -Pete (talk) 17:27, 30 August 2013 (UTC)[reply]

    petition

    I understand the desire to avoid links to online petition-gathering sites, as a likely spam source. However, the blanket banning of any URL with the word "petition" in it (as best as I can understand the Regex, it ain't my thing) has been causing false spam flags in multiple articles that I deal with, because they link to legitimate news articles dealing with someone petitioning the court (example) or to copies of such court petitions (example). -Nat Gertler (talk) 02:56, 26 August 2013 (UTC)[reply]

    Support. Way too many FPs to be useful (regex in question is \bpetition(?:online|s)?\b). @NatGertler: You're correct in your understanding. Also, JzG (talk · contribs) added what evolved into this filter here, but I can't find a log entry for it. Jackmcbarn (talk) 03:09, 26 August 2013 (UTC)[reply]
    Support due to false-positives for URLs of news-stories about petitions. For example, this flagging is what first caught my attention. How about tightening the filter to match only the hostname (before first "/", or also other specific known sites by hostname) rather than "anywhere in URL". DMacks (talk) 07:56, 26 August 2013 (UTC)[reply]
    Support - Another false positive here. The global filter already has a less restrictive entry that deals with petition sites (\bpetition(?:online|s24|site|spot|-?them)\.com\b) so I'm not sure the local one is even necessary. If it is, I agree with DMacks that something like \bpetition(?:online|s)?[A-Za-z0-9]*\.(com|org|net)\b would prevent a lot of the false positives. TDL (talk) 22:31, 26 August 2013 (UTC)[reply]
    hmm, this rule needs to be adapted. Petition sites should all be blacklisted per WP:SOAPBOX (they are at best a primary source, but that petition will only be notable enough to be mentioned in any article when there are secondary sources, making the primary source superfluous), not any link that contains the word petition in it (note that there are many domains without the word petition that are plain petition sites ...). --Dirk Beetstra T C 15:24, 28 August 2013 (UTC)[reply]
    Actually .... Maybe it is the bot misinterpreting and tagging wrongly .. There is no catch on 'petition' itself ... http://www.wired.com/wiredscience/2012/05/a-petition-for-free-online-access-of-taxpayer-funded-research/ ... <--- see! --Dirk Beetstra T C 18:31, 28 August 2013 (UTC)[reply]
    this link is reported as blacklisted on Access2Research, as mentioned on the meta blacklist talkpage. --Dirk Beetstra T C 18:34, 28 August 2013 (UTC)[reply]
    Support. Also in the meantime, I removed the banner from Access2Research and also (I hope correctly) added these URLs to the whitelist here: User:Cyberpower678/spam-exception.js -Pete (talk) 19:58, 28 August 2013 (UTC)[reply]
    Support I experienced problems as well. Blue Rasberry (talk) 20:09, 28 August 2013 (UTC)[reply]
    note these links are not blacklisted! --Dirk Beetstra T C 21:16, 28 August 2013 (UTC)[reply]
    Dirk, can you help me understand what caused the bot to make this edit? I must confess that I am not terribly well versed in Wikipedia's various anti-spam tools. Whatever caught that bot's attention, I think, is the thing that should be changed. -Pete (talk) 22:02, 28 August 2013 (UTC)[reply]
    The bot gathers and compiles the regexes exactly as the blacklist extension for Wikipedia does. It then validates the links against the blacklist regex. If it finds a positive match, it validates the link against the whitelist; if it doesn't find a positive match there, it then checks the exceptions list. If it finds a positive match there, it ignores the link, and if it doesn't, it stores the link in the blacklist database and proceeds to flag it.—cyberpower ChatOnline 22:33, 28 August 2013 (UTC)[reply]
    So it's working off the same Wikipedia-specific blacklist, and also the global blacklist? If that's the case, why does Dirk say these links are not blacklisted? Surely the bot caught them somehow -- I guess that's the part I'm not getting. I thought they were caught because of a regex line based on the word "petition" -- am I wrong? -Pete (talk) 22:39, 28 August 2013 (UTC)[reply]
    Yes. It is working off of both blacklists; regexes are compiled exactly as the Wikipedia software compiles them and are then used in the scan. Dirk is pointing out that the blacklist doesn't seem to be blocking the addition of links with petition in them for some reason, which would indicate a bug in the filter itself. I myself added links to pages, and the filter seems to only intermittently stop the edit. Pete, your understanding of what flagged this bot is correct. — Preceding unsigned comment added by Cyberpower678 (talkcontribs)
    I've figured out what the issue is. The MediaWiki spam-blacklist only matches within the domain name, while User:Cyberbot II matches anywhere in the url. I get the filter notice if I try to put in a url with petition before the .com. TDL (talk) 23:08, 28 August 2013 (UTC)[reply]
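    The difference TDL describes can be illustrated with a simplified model: one function searches the whole URL (as the bot is said to do), the other searches only the hostname. This is a sketch of the behaviour described in this thread, not the actual code of the blacklist extension or of the bot. The function names and the petitiononline.com path are made up for the example; the tightened pattern is the one TDL suggested earlier in this thread.

```python
import re
from urllib.parse import urlsplit

# Local rule under discussion, and the tightened variant suggested by TDL.
LOCAL_RULE = re.compile(r"\bpetition(?:online|s)?\b")
TIGHTENED = re.compile(r"\bpetition(?:online|s)?[A-Za-z0-9]*\.(com|org|net)\b")


def matches_anywhere(pattern: re.Pattern, url: str) -> bool:
    """Model of a matcher that searches the entire URL."""
    return pattern.search(url) is not None


def matches_host_only(pattern: re.Pattern, url: str) -> bool:
    """Model of a matcher that only looks at the domain name."""
    host = urlsplit(url).hostname or ""
    return pattern.search(host) is not None


# News article quoted in this thread: "petition" appears only in the path.
news = ("http://www.wired.com/wiredscience/2012/05/"
        "a-petition-for-free-online-access-of-taxpayer-funded-research/")
# Hypothetical petition-site URL (the path is invented).
petition_site = "http://www.petitiononline.com/example"

print(matches_anywhere(LOCAL_RULE, news))       # whole-URL search on the news link
print(matches_host_only(LOCAL_RULE, news))      # hostname-only search on the news link
print(matches_anywhere(TIGHTENED, news))        # tightened rule, whole URL
print(matches_anywhere(TIGHTENED, petition_site))
```

    In this model the news article is caught only by the whole-URL search with the original rule, while the petition site is caught either way.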
    It helps everyone, if you post in one spot and not post everywhere for me to have to follow you. You are tripping the global regex rule, not the local one.—cyberpower ChatOnline 00:22, 29 August 2013 (UTC)[reply]
    Thanks to a few editors, the problem has been traced to the bot using an old outdated regex generator from the blacklist extension.
    Do you have any way of going through all the articles that the bot flagged this run, rechecking them, and removing any false warnings that it generated? --Nat Gertler (talk) 15:09, 29 August 2013 (UTC)[reply]
    The bot untags any misplaced tags. The bot determines tags that are misplaced when the links on it don't register in the active buffer of the bot. The active buffer is a collection of links stored in array elements identified by their page. Inclusion to this buffer is when there is a positive match to the blacklist, and a negative match to the whitelist and exceptions list. Currently, the bot's buffer is still loaded with the old regex generator so it won't untag the false ones this round.—cyberpower ChatOnline 15:36, 29 August 2013 (UTC)[reply]
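    The flag and untag logic described above can be sketched as follows. This is only a reading of the description in this thread, not Cyberbot II's actual code: all function names are hypothetical, and the exceptions list is assumed to hold exact URLs (it may well hold patterns instead).

```python
import re
from collections import defaultdict


def is_flagged(url, blacklist, whitelist, exceptions):
    """Positive blacklist match, negative whitelist and exceptions match."""
    if not any(p.search(url) for p in blacklist):
        return False
    if any(p.search(url) for p in whitelist):
        return False
    # Assumption: exceptions are exact URLs.
    return url not in exceptions


def build_buffer(page_links, blacklist, whitelist, exceptions):
    """Collect flagged links per page (the 'active buffer')."""
    buffer = defaultdict(list)
    for page, links in page_links.items():
        for url in links:
            if is_flagged(url, blacklist, whitelist, exceptions):
                buffer[page].append(url)
    return dict(buffer)


def pages_to_untag(tagged_pages, buffer):
    """Tagged pages with no link in the buffer carry a misplaced tag."""
    return sorted(page for page in tagged_pages if page not in buffer)


wired = ("http://www.wired.com/wiredscience/2012/05/"
         "a-petition-for-free-online-access-of-taxpayer-funded-research/")
blacklist = [re.compile(r"\bpetition(?:online|s)?\b")]
whitelist = []
exceptions = {wired}
page_links = {
    "Access2Research": [wired],
    "Example page": ["http://www.petitiononline.com/example"],  # invented
}

buffer = build_buffer(page_links, blacklist, whitelist, exceptions)
print(buffer)
print(pages_to_untag({"Access2Research", "Example page"}, buffer))
```

    With the wired.com link on the exceptions list, Access2Research drops out of the buffer and would be untagged on the next run.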
    Cyberpower, I think it would help a good deal if the tags the bot leaves explicitly invite editors to remove the tag (instead of or in addition to leaving a comment here) if they believe it has been left in error. Even as an experienced user, I was reluctant to do so at Access2Research because the tag explicitly directs editors to the blacklist/whitelist process. However, many editors are not technically inclined and have no idea what a blacklist is. Would you consider tweaking the text at the top of the tag to include this suggestion? -Pete (talk) 17:30, 30 August 2013 (UTC)[reply]

    prodirectsoccer.com

    This is a website that I have used extensively as a source for the Nike Total 90 and Nike Mercurial Vapor articles. I have no idea whether anyone has used this site to spam Wikipedia in the past, but as you can tell, my intentions with it are purely encyclopaedic. I would appreciate this site being unblocked, since it is one of the leading resources on soccer equipment. – PeeJay 09:25, 5 September 2013 (UTC)[reply]

    Completed Proposed removals

    Pv-magazine

    'They have stopped ..', I am afraid that is because it is blacklisted. I would suggest to  Defer to Whitelist. --Dirk Beetstra T C 13:32, 7 July 2013 (UTC)[reply]
    There was a request in this section (by the same person) that sat here quite a while, unanswered, and it fell off this page during the June archiving. See MediaWiki talk:Spam-blacklist/archives/June 2013#pv-magazine.com.
    We make a point to say we consider de-listing requests from trusted, high-volume editors. Apteva isn't exactly "high volume" but certainly trusted (especially since Apteva is a legitimate alternative account), and I see no evidence of a COI as with a previous de-listing request from 2011. Apteva has posted a request twice now. It deserves some consideration and discussion.
    It seems that Pv-magazine is the authoritative reliable source for photovoltaic topics, and it is often difficult (I have tried) to find alternatives. Deferring to the whitelist may create an undue burden there if it needs to be sourced frequently. I suggest perhaps 'promoting' the domain from the blacklist to XLinkBot and monitoring it for a while. ~Amatulić (talk) 02:33, 8 July 2013 (UTC)[reply]
    I should have been more specific. They promised to stop. It was also requested removed by a second editor earlier, here. Apteva (talk) 07:17, 13 July 2013 (UTC)[reply]
    It seems pointless to put this domain on both the white and black lists, with the same effect that it is on neither. Apteva (talk) 01:47, 21 July 2013 (UTC)[reply]
    It's not the same as being on neither. The blacklist restricts the whole domain, the whitelist is where you would go to request use of a specific page that is hosted on the domain. This allows you to use it as a source without allowing the publishers to have their marketing interns link every article they write on every page they can think of. - MrOllie (talk) 02:39, 21 July 2013 (UTC)[reply]
    I have no interest in having to ask permission for each link. I think it is reasonable to believe that the publisher understands that they can not spam WP. Apteva (talk) 14:51, 21 July 2013 (UTC)[reply]
    Redemption is always possible. Do any other admins here object to my suggestion to move this site to XLinkBot and see what happens? ~Amatulić (talk) 23:51, 31 July 2013 (UTC)[reply]

    PresentViewer.com

    This site was added to the blacklist for some reason. I can't see why. The site provides information on products, so I reckon it could be used as a valuable resource. Thanks LumCel1 (talk) 17:10, 26 July 2013 (UTC)[reply]

    Seems dead to me. My browser reports "no data received." ~Amatulić (talk) 23:40, 31 July 2013 (UTC)[reply]
    See w:Wikipedia:Sockpuppet investigations/Lauriejackpot1. Short answer: no Declined. Ask for whitelisting of specific links, but this is better blacklisted. I would not even consider delisting if Jimbo himself would request de-listing. So many socks and so many spammed domains .. this is better blacklisted for some time so it does not get spammed again. --Dirk Beetstra T C 15:11, 2 August 2013 (UTC)[reply]

    Koolmuzone

    This site was blacklisted after spamming by multiple IP editors. I have found that it is a notable Music blog and I have started an article about it (Koolmuzone). But I was prompted by the spam filter when I entered this link in the infobox. Besides I find that the most of the ips who did spamming were on a single ip range and range block could have helped here. Also this spamming seems to have stopped now. So I request it to be removed from the blacklist. Thanks --SMS Talk 17:22, 5 August 2013 (UTC)[reply]

    "This spamming seems to have stopped" because the site is blacklisted, obviously. It was blacklisted just a couple months ago. It is highly likely that the spamming will resume if it is de-listed.
    I don't see a reason to de-list the whole site, since it's highly unlikely that a blog would ever be used as a reliable source for articles unrelated to the blog. Instead, you may request white-listing of a specific page (such as www.koolmuzone.pk/about/).  Defer to Whitelist. ~Amatulić (talk) 20:31, 6 August 2013 (UTC)[reply]
    My bad, I thought XLinkBot keeps an eye on these blacklisted link additions and that every addition is logged here. Perhaps you can also shed some light on this site's reliability as a source. I find that many news agencies of Pakistan (Daily Times, The Express Tribune, The News International) cite this blog for news related to the entertainment industry of the country. So can we also cite it here? --SMS Talk 17:02, 7 August 2013 (UTC)[reply]
    No, that log just records the successful additions. Once it's blacklisted, there's nothing to log. I could be mistaken, but I think the blacklist hits don't get logged at all. I'd be surprised if they weren't but I haven't seen such a log.
    As to your question about reliability, well, generally we don't cite blogs; see WP:ELNO. There are exceptions, such as if the blog author is writing on a topic for which he's a notable expert, or it's a news blog authored by a bona-fide journalist. Blogs are often WP:TERTIARY sources, meaning the information you find in them can usually be found elsewhere in a secondary source. If you just need one or two pages, the whitelist is the place to request that. ~Amatulić (talk) 04:24, 10 August 2013 (UTC)[reply]
    Thanks that helps. I will proceed with the request at Whitelist. --SMS Talk 08:14, 10 August 2013 (UTC)[reply]

    MicrostockGroup

    I am trying to add some additional references to the Yuri Arcurs page and this domain seems to be blacklisted. Yuri has been posting in various threads on this site and it would be useful to use some of his posts as the source of the information. Specifically I was trying to use this link microstockgroup.com/14213/14213/msg218934/#msg218934 where he mentions a decline in earnings.

    Generally, forum sites are not to be used as sources per WP:ELNO although in this case your purpose would be for quoting the subject of the article. I don't see a reason to completely remove it from the blacklist, but if you want to link a single page,  Defer to Whitelist to request white-listing of that page. ~Amatulić (talk) 04:13, 10 August 2013 (UTC)[reply]
    OK, will do — Preceding unsigned comment added by 23.29.200.210 (talk) 04:31, 10 August 2013 (UTC)[reply]

    Troubleshooting and problems

    CSS overflow

    style="overflow:auto... is recognized as a SPAM site/link; style="width:...;overflow:auto... isn't. –pjoef (talkcontribs) 08:19, 16 April 2013 (UTC)[reply]

    False positive?

    I'm trying to troubleshoot an issue reported via OTRS where a person tried to add a link to one of the Requested Articles sections (their personal website). The domain has a pattern like so: www.<name>-actor.com. I tried several combinations (e.g., myteethhurt-actor.com and foobar-actor.com) and it seems the issue is the -actor bit. The domain in question is not on either the local or global blacklists, and I can't find a pattern in either that would match "-actor" exactly. Should I request an exception or just ask the person to omit the link or is this something that we should fix? §FreeRangeFrogcroak 21:36, 19 June 2013 (UTC)[reply]

    Sorry for not getting back to you faster. The item in question is \bactor(?:suriya|arya)?\.com\b from meta, added at 23:39, 28 November 2009; the user was optimizing several regexes and goofed. The correct regex should be \bactor(suriya|arya)\.com\b Werieth (talk) 22:52, 28 June 2013 (UTC)[reply]
    No problem - sorry I also missed your reply. Glad you found it! §FreeRangeFrogcroak 18:55, 21 July 2013 (UTC)[reply]
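    The difference between the two patterns Werieth quotes can be checked directly. The trailing "?" makes the (suriya|arya) group optional, so the rule also matches a bare "actor.com" after any word boundary, including the one after a hyphen. A minimal sketch in Python's re (the blacklist itself uses PCRE; these constructs behave the same in both); the variable names are made up for the example.

```python
import re

# Rule as found on meta, and the corrected form given above.
GOOFED = re.compile(r"\bactor(?:suriya|arya)?\.com\b")
CORRECT = re.compile(r"\bactor(suriya|arya)\.com\b")

samples = [
    "foobar-actor.com",       # test domain from the report
    "myteethhurt-actor.com",  # test domain from the report
    "actorsuriya.com",        # intended target of the rule
    "actorarya.com",          # intended target of the rule
]

for domain in samples:
    print(domain,
          "goofed:", GOOFED.search(domain) is not None,
          "correct:", CORRECT.search(domain) is not None)
```

    The optional group is what produces the false positive on any "<name>-actor.com" domain; the corrected rule only matches the two intended sites.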

    Malformed entries

    I believe a few of these entries are missing the "b" part of the leading "\b":

    • \freegovernmentcellphones4u\.com\b
    • \securityguardtraining-hq\.com\b
    • \thekoreanroyal\.(?:com|org)\b

    RobinHood70 talk 06:07, 26 August 2013 (UTC)[reply]

    You're correct. As of right now, they don't work at all. Jackmcbarn (talk) 21:39, 4 September 2013 (UTC)[reply]
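    Why the three entries cannot match: without the "b", the backslash combines with the first letter of the domain, so "\f" is read as a form feed, "\s" as any whitespace character, and "\t" as a tab. A short sketch in Python's re, which treats these escapes the same way PCRE does; the ENTRIES table and the check helper are made up for the example.

```python
import re

# domain -> (entry as currently listed, entry with the leading "\b" restored)
ENTRIES = {
    "freegovernmentcellphones4u.com": (
        r"\freegovernmentcellphones4u\.com\b",
        r"\bfreegovernmentcellphones4u\.com\b",
    ),
    "securityguardtraining-hq.com": (
        r"\securityguardtraining-hq\.com\b",
        r"\bsecurityguardtraining-hq\.com\b",
    ),
    "thekoreanroyal.org": (
        r"\thekoreanroyal\.(?:com|org)\b",
        r"\bthekoreanroyal\.(?:com|org)\b",
    ),
}


def check(domain: str) -> tuple[bool, bool]:
    """Return (broken entry matches, repaired entry matches)."""
    broken, fixed = ENTRIES[domain]
    return (re.search(broken, domain) is not None,
            re.search(fixed, domain) is not None)


for domain in ENTRIES:
    print(domain, check(domain))
```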

    Logging / COIBot Instr

    Blacklist logging

    Full instructions for admins


    Quick reference

    For Spam reports or requests originating from this page, use template {{/request|0#section_name}}

    • {{/request|213416274#Section_name}}
    • Insert the oldid 213416274 a hash "#" and the Section_name (Underscoring_spaces_where_applicable):
    • Use within the entry log here.

    For Spam reports or requests originating from Wikipedia_talk:WikiProject_Spam use template {{WPSPAM|0#section_name}}

    • {{WPSPAM|182725895#Section_name}}
    • Insert the oldid 182725895 a hash "#" and the Section_name (Underscoring_spaces_where_applicable):
    • Use within the entry log here.
    Note: If you do not log your entries, it may be removed if someone appeals the entry and no valid reasons can be found.
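    The two log templates differ only in their name, so the entry can be assembled mechanically. A small sketch; log_reference is a hypothetical helper, not an existing tool, and the oldids are the sample values from the quick reference above.

```python
def log_reference(oldid: int, section_name: str, wpspam: bool = False) -> str:
    """Build the log template: oldid, a hash, then the section name
    with spaces replaced by underscores."""
    template = "WPSPAM" if wpspam else "/request"
    anchor = section_name.strip().replace(" ", "_")
    return "{{" + f"{template}|{oldid}#{anchor}" + "}}"


# Request originating from this page.
print(log_reference(213416274, "Section name"))
# Request originating from Wikipedia_talk:WikiProject_Spam.
print(log_reference(182725895, "Section name", wpspam=True))
```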

    Addition to the COIBot reports

    The lower list in the COIBot reports now has, after each link, four numbers between brackets (e.g. "www.example.com (0, 0, 0, 0)"):

    1. first number, how many links did this user add (is the same after each link)
    2. second number, how many times did this link get added to wikipedia (for as far as the linkwatcher database goes back)
    3. third number, how many times did this user add this link
    4. fourth number, to how many different wikipedias did this user add this link.

    If the third number or the fourth number are high with respect to the first or the second, then that means that the user has at least a preference for using that link. Be careful with other statistics from these numbers (e.g. good user who adds a lot of links). If there are more statistics that would be useful, please notify me, and I will have a look if I can get the info out of the database and report it. This data is available in real-time on IRC.
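    One way to read the four numbers programmatically is sketched below. The entry format follows the "www.example.com (0, 0, 0, 0)" example above; the field names, the parse_entry and shows_preference helpers, and the 0.5 threshold are all assumptions made for illustration, not part of COIBot.

```python
import re
from typing import NamedTuple


class LinkStats(NamedTuple):
    user_total: int        # 1. links added by this user
    link_total: int        # 2. times this link was added to wikipedia
    user_this_link: int    # 3. times this user added this link
    wikis: int             # 4. different wikipedias this user added it to


ENTRY = re.compile(r"^(?P<link>\S+)\s*\((\d+),\s*(\d+),\s*(\d+),\s*(\d+)\)$")


def parse_entry(text: str) -> tuple[str, LinkStats]:
    """Parse one 'link (a, b, c, d)' entry from a report."""
    m = ENTRY.match(text.strip())
    if m is None:
        raise ValueError(f"not a report entry: {text!r}")
    numbers = (int(n) for n in m.groups()[1:])
    return m.group("link"), LinkStats(*numbers)


def shows_preference(stats: LinkStats, threshold: float = 0.5) -> bool:
    """True if the third number is high relative to the first or second.
    The threshold is an arbitrary illustrative value."""
    ratios = []
    if stats.user_total:
        ratios.append(stats.user_this_link / stats.user_total)
    if stats.link_total:
        ratios.append(stats.user_this_link / stats.link_total)
    return any(r >= threshold for r in ratios)


print(parse_entry("www.example.com (0, 0, 0, 0)"))
print(shows_preference(LinkStats(10, 12, 9, 3)))
print(shows_preference(LinkStats(200, 5000, 2, 1)))
```

    As the text cautions, a ratio like this is only a hint: a good-faith editor who adds many links can still score high on a single domain.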

    Poking COIBot

    When adding {{LinkSummary}}, {{UserSummary}} and/or {{IPSummary}} templates to WT:WPSPAM, WT:SBL, WT:SWL and User:COIBot/Poke (the latter for privileged editors) COIBot will generate linkreports for the domains, and userreports for users and IPs.


    Discussion

    Erwin's tool on meta

    On meta, we use the gadget User:Erwin/SBHandler (m:MediaWiki:Gadget-SBHandler.js) to add items to the blacklist on meta. It works from the Spam-blacklist talkpage (m:Talk:Spam blacklist), and from the cross-wiki reports generated by COIBot ('m:User:COIBot/XWiki/example.org'). I think that this tool could also be handy here on en.wikipedia, knowing that we have here also the talkpage of the blacklist ('here'), and the local reports ('Wikipedia:WikiProject Spam/Local/example.org') where this could be enabled.

    Would there be interest to have this tool here, and people who are capable/interested to move the gadget here (I tried to hack and activate it through my local .js, but I could not get it to work)?

    (Not willingly wanting to complicate things .. but one could consider to expand the tool to also work on XLinkBot's revertlist - being capable to blacklist from there, or to revertlist from here). --Dirk Beetstra T C 12:32, 4 July 2013 (UTC)[reply]

    Proposed addition backlog

    Returning to check something that I'd reported at the start of July (because the spammer is still going), it looks like none of the proposed additions have been processed this month - the only additions to the blacklist have been from admins adding URLs directly. Is there a reason why these aren't being processed? --McGeddon (talk) 09:48, 29 July 2013 (UTC)[reply]

    The problem is generally a significant lack of manpower. There are only a very few admins active here; all the others (including editors 'selecting' new admins) find XfD's more important. See also Category:Open Local COIBot Reports (and those are just bot-flagged cases of what may be suspicious behaviour - the professional spam generally is less easy to detect - note that the bot closes/marks stale requests after some time of 'inactivity' of the link or when it got cleared up). --Dirk Beetstra T C 10:02, 29 July 2013 (UTC)[reply]