Talk:Wayback Machine

From Wikipedia, the free encyclopedia
Jump to: navigation, search

:See Wikipedia:Using the Wayback Machine for information on using the Wayback Machine with Wikipedia.

Banned in Russia[edit]

Not sure if this is newsworthy enough for main paragraph or needs a new subject or nothing. Maybe sites banned and unbanned all the time. — Preceding unsigned comment added by RonPaul573e (talkcontribs) 08:50, 27 October 2014 (UTC)

Still reading?[edit]

Is it still reading pages? Seems not. (talk) 14:40, 8 June 2010 (

Did you see the part of the article that reads: "Snapshots become available 6 to 18 months after they are archived." ? -- Quiddity (talk) 18:11, 8 June 2010 (UTC)

Where in Europe?[edit]

Later in the article it talks about how copyright law in 'Europe' could cause certain effects but it doesn't mention where in Europe! The Continent? If so, where on the continent? Is it the UK? There is no single copyright law within the region... Just curios!

Presumably this refers to the European Union (not all the countries of the European Peninsula/so-called continent), which has a very important governing role. --Eleanor1944 (talk) 02:55, 11 February 2013 (UTC)

Wayback Machine is Amazingly Slow[edit]

What surprises me time and time again is how incredibly slow the WayBackMachine is. Check Google for 'waybackmachine slow' and you'll see other people agree; even called "notoriously slow" by some folks.[1] I wonder if there's a reliable source somewhere so we could mention the service's speed in the article. -- (talk) 06:07, 19 June 2010 (UTC)

Still collecting pages?[edit]

I was able to see the page from October 22, 2009 (talk) 15:31, 9 September 2010 (UTC)

All of Wayback Machine's archived links are shut down![edit]

Why aren't all those archived links in the Wayback Machine working anymore?! Can't someone please fix the Wayback Machine?! --Angeldeb82 (talk) 20:30, 26 January 2012 (UTC)

Would you kindly explain to those of us who are not familiar with the term, what are "archived links"? Thanks in advance Ottawahitech (talk) 15:55, 3 March 2012 (UTC)
I took "archived links" to mean links to its old, archived pages, it's main function.
      As of June 30 it's still down. ERR: "The New Wayback Machine is having problems. Please try again later." Seeking help in forums etc, I could find no activity in recent months. I hope this historical treasure of history comes back, as I see evidence that Winston Smith's memory hole is gaining power —and coincidentally the historical treasure of Google's Usenet archive no longer seems cut in stone.
-- (talk) 17:53, 30 June 2012 (UTC)Doug Bashford
UPDATE my above: I've since used it, it's seemingly working fine.
-- (talk) 16:15, 27 July 2012 (UTC) Doug Bashford

Yahoo! Search provides links to Waybackmachine?[edit]

See: Talk:Yahoo!_Search#Waybackmachine Ottawahitech (talk) 16:00, 3 March 2012 (UTC)

Netbula v. Chordiant Software ? ...Jargon?[edit]

That section makes no sense. The first paragraph, I assume accurate, is meaningless. Probable jargon and/or insider-know presumptions. Suggest repair or deletion.
-- (talk) 16:56, 30 June 2012 (UTC)Doug Bashford

Not reliable anymore[edit]

A matter of location of the IP? — Preceding unsigned comment added by (talk) 02:05, 6 September 2012 (UTC)

Reliability in retrieving archived material[edit]

It would probably be miraculous if the WM could archive everything on the internet, but as an experienced user I know only too well that pages and images are often unavailable not because of robots.txt or legal reasons, but simply because WM failed to retrieve them properly. There is absolutely no mention of this in the article and there should be. Lee M (talk) 02:42, 1 July 2013 (UTC)

I agree it's only archive 10%~40% of whole pages specially if the site are above 500 pages , no need to mention sites had million of pages/link they almost store 10% max .--Salem F (talk) 01:12, 7 December 2015 (UTC)

Not well[edit]

Section Search engine links:

... began to provide links to other versions of pages archived on the Wayback Machine.

What does that even mean? That they use the Wayback Machine as a caching service? That it is possible to see not only the latest version of a page, but olders versions as well? Whatever it is, it ought to be described.

--Mortense (talk) 14:18, 15 February 2014 (UTC)

December 2014[edit]

This week it rained in San Francisco and the power immediately blew out. Your tech utopia • The Register

Internet Archive: The big storm in SF has knocked out power to our main data center, so the site will be down for a while. We'll keep you posted here! 7:59 AM - 11 Dec 2014

unintelligible sentence[edit]

Under the heading "Origin, growth and storage", this rather odd sentence appears: "This became a threat of abuse the service for hosting malicious binaries." Can anyone make sense of this? It would seem to be missing a few words. Bricology (talk) 06:40, 23 March 2015 (UTC)

 Done. I checked all three references the paragraph cites. I changed the sentence to, "This became a threat of abuse by the service for hosting malicious binaries." The sources support the assertion that potentially malicious executables and PDFs are currently archived at the site.  —Aladdin Sane (talk) 19:06, 25 March 2015 (UTC)

Storage capacity[edit]

At present this section is mainly a list of historical capacities. Can anyone add anything about the growth rate and future ability to store information? It would also be good to include information in the section on resilience i.e security of the data stored. LookingGlass (talk) 10:14, 12 September 2015 (UTC)

External links modified[edit]

Hello fellow Wikipedians,

I have just added archive links to one external link on Wayback Machine. Please take a moment to review my edit. You may add {{cbignore}} after the link to keep me from modifying it, if I keep adding bad data, but formatting bugs should be reported instead. Alternatively, you can add {{nobots|deny=InternetArchiveBot}} to keep me off the page altogether, but should be used as a last resort. I made the following changes:

When you have finished reviewing my changes, please set the checked parameter below to true or failed to let others know (documentation at {{Sourcecheck}}).

You may set the |checked=, on this template, to true or failed to let other editors know you reviewed the change. If you find any errors, please use the tools below to fix them or call an editor by setting |needhelp= to your help request.

  • If you have discovered URLs which were erroneously considered dead by the bot, you can report them with this tool.
  • If you found an error with any archives or the URLs themselves, you can fix them with this tool.

If you are unable to use these tools, you may set |needhelp=<your help request> on this template to request help from an experienced user. Please include details about your problem, to help other editors.

Cheers.—cyberbot IITalk to my owner:Online 13:36, 31 March 2016 (UTC)

Stanford version of Wayback Machine[edit]

I was just wondering if the Stanford version of the Wayback Machine is in any way related to the Internet Archive's Wayback Machine. And the Stanford Wayback Machine has a few pages, some dating to late 1991! So if anyone knows, make sure to reply.

Source(s): — Preceding unsigned comment added by (talk) 01:34, 25 April 2016 (UTC)

An error on Storage capabilities[edit]

At the start, we claim that in 2009 the site grew by 100 TB per month.

At the bottom, we claim that in 2014 the site grew by 20 TB per week, which is 80 TB per month - less than in 2009.

Is it possible? רן כהן (talk) 13:53, 11 May 2016 (UTC)

"Mass deletion of content"[edit]

"Beginning in 2015, mass deletions of previously archived content caused a number of critics to question the sincerity of this goal."

The cited sources don't support that assertion. The first source is confused and inaccurate. The second source contains an update to the effect the problem was specific to that user and fixed. Both are essentially self-published blogs. -- GreenC 21:45, 23 September 2016 (UTC)

@Green Cardamom: Then please go ahead and remove it (and put this info into the edit summary). --Fixuture (talk) 17:02, 29 September 2016 (UTC)
I was going to alert people to a seldom covered fact: the archive's own archive of itself claims to have 502 billion pages saved, not the current 278. However, I later saw that it's just a change in their counting definition. I hope the "bug" in the two sources you mention served as a wake-up call for certain people to get their act together. A site this important should be coded in such a way that bugs are likely to make it display more pages than desired. Connor Behan (talk) 02:53, 13 February 2017 (UTC)

Major problem with robots.txt[edit]

Hello, I just notice that since wayback machine won't archive pages AND also deletes the all previous archives of the webpage prior to the use of robots.txt, there is a flaw in it:

  • If a website went defunct, another site opens with the same URL later, and the second URL have robots.txt, can delete the previous defunct website. Even if the latest web owner does not technically own the dead website version of the URL.
  • If a site got hacked and robots.txt was applied, the same thing happened, all history is gone.

Check out a citation of an archive of SpySheriff, before, wayback machine does host the website, now since it now have robots.txt, the past versions archived are now deleted. I've assume hackers adjusted the website under that URL to include that file.

This is another threat to both wikipedia and wayback machine, as wayback machine does not have a "protection" to its archive. With things that can accidentally vanish by website replacement with robots.txt and hacked sites, it makes archiving virtually pointless in the very future.Joeleoj123 (talk) 05:12, 15 April 2017 (UTC)

@Joeleoj123: Thank you for bringing this up. Do you have any relevant references? However, from what I can see, there are good reasons to exclude malware-distributing websites which seems to be the case for "SpySheriff". Also it seems that as of last month they are exploring ignoring robots.txt more broadly (see: Wayback Machine#Website exclusion policy). --Fixuture (talk) 14:26, 18 May 2017 (UTC)