|4,767 (Jul 2015)|
archive.is (formerly archive.today) is a privately funded digital time capsule, with its data centre located in Nord-Pas-de-Calais, France. The archive runs on Apache Hadoop and Apache Accumulo. Like WebCite, it retrieves one page at a time, and each captured page must be smaller than 50 MB; pages from Google Maps and Twitter are supported.
Unlike crawlers such as the Wayback Machine, archive.is captures individual pages only in response to explicit user requests, and therefore does not obey the robots exclusion standard. As a result, websites have no way to prevent an archive.is user from mirroring their pages, which has drawn criticism of the site.
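For contrast, the check that a crawler honouring the robots exclusion standard would perform before fetching a page can be sketched as follows. This is a minimal illustration using Python's standard `urllib.robotparser` module; the robots.txt content, the `ExampleCrawler` user agent, the `example.com` URLs and the helper name `crawler_may_fetch` are all hypothetical and do not describe how archive.is or the Wayback Machine is actually implemented.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for an example site (an assumption, not from the article).
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""


def crawler_may_fetch(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if a crawler that obeys robots.txt may fetch the URL."""
    parser = RobotFileParser()
    # parse() accepts the lines of an already downloaded robots.txt file.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for url in (
        "https://example.com/index.html",
        "https://example.com/private/page.html",
    ):
        allowed = crawler_may_fetch(ROBOTS_TXT, "ExampleCrawler", url)
        print(url, "allowed" if allowed else "disallowed")
```

A service that captures pages only on explicit user request, as described above, would skip this check entirely, which is why a `Disallow` rule gives a site owner no control over such captures.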
The archive is most notably used for hacktivism:
- Gamergate controversy supporters have reportedly used it to view content on websites they disagree with without contributing to those sites' traffic.
- Julian Assange's book When Google Met WikiLeaks uses archive.today to preserve online citations.
- The Syrian Electronic Army uses it to provide proof of website defacements.
On July 21, 2015, the operators blocked access to the service from all Finnish IP addresses, stating on Twitter that they did this in order to avoid escalating a dispute they allegedly had with the Finnish government. They did not provide additional information regarding these claims.
In Russia, only the HTTP version of the website is accessible; the HTTPS version is blocked by the state firewall. In Mainland China the reverse applies: the HTTPS version is accessible, but the HTTP version is blocked by the state firewall.