Archive.today: Difference between revisions

From Wikipedia, the free encyclopedia
Undid revision 731988222 by 93.185.30.126 (talk) - not all piracy sites are blacklisted, only those sites that have a history of abuse on Wikipedia. Do not violate WP:COPYLINK again.
Undid revision 731994080 by Amatulic (talk)
Line 30:
  |title = The impact of JavaScript on archivability
  |url = http://link.springer.com/article/10.1007/s00799-015-0140-8
+ |archive-url = http://ocean.sci-hub.cc/f3a4d44e3501c024515b26de70a44d98/brunelle2015.pdf?download=true
  |archive-date = 2016-06-11
  |journal = International Journal on Digital Libraries

Revision as of 21:44, 28 July 2016


archive.is
Type of site: Web archiving
Available in: Multilingual
URL: https://archive.is
Commercial: No
Registration: No

archive.is (formerly archive.today) is a privately funded digital time capsule,[2] with its data centre located in Nord-Pas-de-Calais, France.[3] The archive runs on Apache Hadoop and Apache Accumulo. Like WebCite, it retrieves one page at a time, each smaller than 50 MB, but with support for Google Maps and Twitter.

archive.is uses headless browsing to record which embedded resources need to be captured to provide a high-quality memento, and it also takes a PNG snapshot of the representation to provide a static, non-interactive visualization.[4]

Unlike crawlers such as the Wayback Machine, archive.is captures individual pages only in response to explicit user requests, and so does not obey the robots exclusion standard.[5] As a result, website owners cannot unilaterally remove archived content, which makes it a "permanent" archive.[6]
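
For contrast, the check that a crawler honouring the robots exclusion standard performs before fetching a page can be sketched with Python's standard `urllib.robotparser` module. The rules, the `ExampleBot` user agent, the `may_crawl` helper and the example.com URLs are hypothetical and serve only as illustration; archive.is, as described above, does not perform this check.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt rules for an example site (not from any real site).
RULES = [
    "User-agent: *",
    "Disallow: /private/",
]


def may_crawl(rules, user_agent, url):
    """Return True if `rules` allow `user_agent` to fetch `url`."""
    parser = RobotFileParser()
    parser.parse(rules)  # parse in-memory lines; no network access needed
    return parser.can_fetch(user_agent, url)


# A compliant crawler would skip the first URL and fetch the second.
print(may_crawl(RULES, "ExampleBot", "http://example.com/private/page"))
print(may_crawl(RULES, "ExampleBot", "http://example.com/public/page"))
```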

Since July 2013, archive.is has supported the Memento Project application programming interface (API),[7][10] and plugins are available for Firefox[8] and Chrome.[9]
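
As a rough illustration of how a Memento client asks for a capture near a given date, the sketch below builds (but does not send) a TimeGate request with the `Accept-Datetime` header defined by the Memento protocol (RFC 7089). The `TIMEGATE_PREFIX` value, the `build_timegate_request` helper and the example.com target are assumptions made for this example and are not taken from the article.

```python
from datetime import datetime, timezone
from email.utils import format_datetime

# Assumption: illustrative TimeGate prefix; check the service's own
# Memento documentation for the actual endpoint.
TIMEGATE_PREFIX = "https://archive.is/timegate/"


def build_timegate_request(target_url, when):
    """Return (url, headers) for a Memento TimeGate lookup."""
    # Accept-Datetime uses the HTTP date format and is expressed in GMT.
    when_utc = when.astimezone(timezone.utc)
    headers = {"Accept-Datetime": format_datetime(when_utc, usegmt=True)}
    return TIMEGATE_PREFIX + target_url, headers


url, headers = build_timegate_request(
    "http://example.com/", datetime(2013, 7, 9, tzinfo=timezone.utc)
)
print(url)
print(headers)
```

A TimeGate that follows the protocol would typically answer such a request by redirecting to the capture closest to the requested date.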

Worldwide availability

The website is entirely unreachable from Kazakhstan[11] and Iran. In Russia, only HTTP access is possible; HTTPS connections are blocked.[12][13] In Mainland China, HTTPS access is possible and HTTP access is blocked.

See also

Notes

  1. ^ "Archive.is Site Info". Site Info. Alexa Internet. Retrieved 3 July 2015.
  2. ^ Martin Brinkmann (22 April 2015). "Create publicly available web page archives with Archive.is". Ghacks. Retrieved 13 June 2015.
  3. ^ "Archive.is status". Stat Radar. Archived from the original on 27 July 2013. Retrieved 8 May 2013.
  4. ^ Brunelle, Justin F.; Kelly, Mat; Weigle, Michele C.; Nelson, Michael L. (25 January 2015). "The impact of JavaScript on archivability" (PDF). International Journal on Digital Libraries. 17 (2). Springer-Verlag Berlin Heidelberg: 95–117. doi:10.1007/s00799-015-0140-8. Archived from the original on 11 June 2016.
  5. ^ Dascalescu, Dan (18 February 2013). "Web page archiving – Dan Dascalescu's Wiki (review)". Wiki.dandascalescu.com. Retrieved 3 October 2013.
  6. ^ "Dear GamerGate: Please Stop Stealing Our Shit". Motherboard.
  7. ^ a b Nelson, Michael L. (9 July 2013). "Archive.is Supports Memento". Research and Teaching Updates. Web Science and Digital Libraries Research Group at Old Dominion University. Archived from the original on 27 July 2013. Retrieved 17 September 2013.
  8. ^ "Archiveror". mozilla.org.
  9. ^ "archive.is Button". google.com.
  10. ^ "archive.is" Memento Protocol Information. Memento Development Group. Retrieved 17 September 2013.
  11. ^ "Alexey Chernyavskiy". Twitter.
  12. ^ "Роскомнадзор заблокировал сервис archive..., хранящий копии веб-сайтов". 29 January 2016. Retrieved 30 January 2016.
  13. ^ "Russia Blocks Another Archive Site Because It Might Contain Old Pages About Drugs". Techdirt.