Wikipedia:List of web archives on Wikipedia

Web Archives on Wikipedia
List of known web archive services in use on English Wikipedia, sorted roughly by number of uses from most to least. The Wayback Machine accounts for about 80% of the total. Data initially compiled by User:GreenC as of March 2017; updates and corrections welcome.

Archive services

Wayback Machine
  • Domain: archive.org, waybackmachine.org
  • Hostname: <none>, web, wayback, liveweb, www, www.web, classic-web, web-beta, replay, replay.web, web.wayback
  • Path: <none>, web
  • Timestamp: A number of 4-14 digits, "*", "?", or a combination of these. May also carry trailing characters such as "re_". If the timestamp is missing, the best available page is returned.
  • Examples:
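As a rough illustration of how the Wayback Machine fields above combine, the following Python sketch matches a URL against the listed domains, hostnames, and path, and pulls out the timestamp and the original URL. The regular expression and the function name `parse_wayback` are assumptions made for this example and are not part of the list. It reads "combination" as digits followed by wildcard characters, and it does not handle URLs where the timestamp is missing.

```python
import re

# Hostnames listed above; the bare domain (no hostname) is also allowed.
_HOSTS = (
    "web", "wayback", "liveweb", "www", "www.web", "classic-web",
    "web-beta", "replay", "replay.web", "web.wayback",
)
# Longest names first so "web.wayback" is tried before "web".
_HOST_RE = "|".join(re.escape(h) for h in sorted(_HOSTS, key=len, reverse=True))

WAYBACK_RE = re.compile(
    r"^https?://(?:(?:" + _HOST_RE + r")\.)?"
    r"(?:archive\.org|waybackmachine\.org)"
    r"(?:/web)?"                                # path: <none> or "web"
    r"/(?P<ts>[0-9]{4,14}[*?]*|[*?]+)"          # digits, wildcards, or both (assumed order)
    r"(?P<mod>[a-z]{2}_)?"                      # trailing chars such as "re_"
    r"/(?P<url>.+)$",
    re.IGNORECASE,
)


def parse_wayback(url):
    """Return (timestamp, modifier, original_url), or None if there is no match."""
    m = WAYBACK_RE.match(url)
    if m is None:
        return None
    return m.group("ts"), m.group("mod"), m.group("url")
```

A caller might use `parse_wayback(link)` to decide whether a citation already points at the Wayback Machine before trying any other service in the list.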
WebCite
  • Domain: webcitation.org
  • Hostname: <none>, www
  • Path: base62ID, query, cache, getfile.php, <number>
  • Timestamp: None in the path. The query?url form carries &date=2012-06-01+21:40:03; the short ID is base62 and converts to Unix time.
  • Examples:
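The base62 conversion mentioned for the WebCite short ID can be sketched as a plain positional decode. The list does not say which symbol order WebCite uses or which time unit the decoded number is in, so the alphabet below and the name `base62_decode` are placeholders for illustration only; the alphabet is a parameter so it can be swapped once the real order is confirmed.

```python
import string

# Assumed symbol order (0-9, a-z, A-Z). The list does not state the
# alphabet WebCite actually uses, so treat this as a placeholder.
ASSUMED_ALPHABET = string.digits + string.ascii_lowercase + string.ascii_uppercase


def base62_decode(short_id, alphabet=ASSUMED_ALPHABET):
    """Decode a base62 string into a non-negative integer."""
    if len(alphabet) != 62 or len(set(alphabet)) != 62:
        raise ValueError("alphabet must contain 62 distinct symbols")
    index = {ch: i for i, ch in enumerate(alphabet)}
    value = 0
    for ch in short_id:
        if ch not in index:
            raise ValueError(f"invalid base62 character: {ch!r}")
        # Shift the running value one base62 place and add the new digit.
        value = value * 62 + index[ch]
    return value
```

The returned integer still has to be interpreted as a Unix time in whatever unit WebCite encodes, which this sketch leaves to the caller.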
Archive.is
  • Domain: archive.is, archive.today, archive.fo, archive.li
  • Hostname: <none>, www
  • Path: <none>
  • Timestamp: 4-14 digits; or digits + characters (see example)
  • Examples:
National Archives UK
  • Domain: nationalarchives.gov.uk
  • Hostname: webarchive, yourarchives
  • Path: <none>
  • Timestamp: 4-14 digits
  • Examples:
NLA Australia
  • Domain: nla.gov.au
  • Hostname: pandora, trove, webarchive, content.webarchive
  • Path: see examples. The /pan/ regex should be /pan/[0-9]{4,7}/
  • Timestamp: Three types (20120727-0512, S2000-Dec-5, 20120326012340)
  • Examples:
  • Note: Not to be confused with non-webarchive URLs that appear similar:
Note: No Memento access
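The /pan/ path rule and the three NLA timestamp styles above can be checked with a few regular expressions. This is a minimal sketch: the /pan/ pattern is taken from the list, while the timestamp patterns, their labels, and the function name `classify_nla_timestamp` are inferred from the three samples and are assumptions.

```python
import re

# Path rule quoted in the list.
PAN_PATH_RE = re.compile(r"/pan/[0-9]{4,7}/")

# One pattern per timestamp style; digit counts are inferred from the
# three samples in the list and may be too strict or too loose.
NLA_TIMESTAMP_RES = {
    "date-time": re.compile(r"^[0-9]{8}-[0-9]{4}$"),                    # 20120727-0512
    "named-month": re.compile(r"^S[0-9]{4}-[A-Z][a-z]{2}-[0-9]{1,2}$"),  # S2000-Dec-5
    "wayback": re.compile(r"^[0-9]{14}$"),                              # 20120326012340
}


def classify_nla_timestamp(ts):
    """Return the label of the first matching style, or None."""
    for name, pattern in NLA_TIMESTAMP_RES.items():
        if pattern.match(ts):
            return name
    return None
```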
Freezepage.com
  • Domain: freezepage.com
  • Hostname: <none>, www
  • Path: <none>
  • Timestamp: <none> (only available via web scrape)
  • Examples:
Note: If the account that created the snapshot expires for lack of activity (no login to Freezepage), the snapshot is deleted from freezepage.com.
Note: No Memento access
Library of Congress
  • Domain: loc.gov
  • Hostname: webarchive
  • Path: all, lcwa####
  • Timestamp: 4-14 digits
  • Examples:
Arquivo.pt (Portugal)
  • Domain: arquivo.pt
  • Hostname: <none>
  • Path: wayback, wayback/wayback
  • Timestamp: 4-14 digits
  • Examples:
Stanford Edu
  • Domain: stanford.edu
  • Hostname: swap, sul-swap-prod
  • Path: <none>
  • Timestamp: 4-14 digits
  • Examples:
Archive-It
  • Domain: archive-it.org
  • Hostname: wayback
  • Path: all, or a 4-digit number
  • Timestamp: 4-14 digits
  • Examples:
BibAlex
  • Domain: bibalex.org:80
  • Hostname: web.archive, web.petabox
  • Path: web
  • Timestamp: 4-14 digits
  • Examples:
WikiWix
  • Domain: wikiwix.com
  • Hostname: archive
  • Path: cache
  • Timestamp: 4-14 digits
  • Examples:
Note: Does not support HTTPS or Memento.
Note: API access was added in March 2018 and is enabled by appending &apiresponse=1 to the end of the URL (http://archive.wikiwix.com/cache/?url=http://www.linterweb.fr&apiresponse=1). Any other & inside the url= section may need to be encoded.
Note: Supports an optional &title argument at the end of the URL, separate from the source URL (similar to &apiresponse). It gives the name of the Wikipedia article in which the link is used.
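The WikiWix notes above can be turned into a small URL builder. This is a sketch under stated assumptions: the function name `wikiwix_api_url` is hypothetical, only "&" in the source URL is encoded (as the note suggests), and placing &title before &apiresponse=1 is a guess, since the list does not fix the order of the two arguments.

```python
from urllib.parse import quote

WIKIWIX_CACHE = "http://archive.wikiwix.com/cache/"


def wikiwix_api_url(source_url, title=None):
    """Build a WikiWix cache URL that requests the API response."""
    # Encode "&" inside the source URL so it is not read as a separator
    # for the trailing arguments.
    query = "?url=" + source_url.replace("&", "%26")
    if title is not None:
        # Optional: name of the Wikipedia article using the link.
        # Argument order relative to apiresponse is an assumption.
        query += "&title=" + quote(title, safe="")
    return WIKIWIX_CACHE + query + "&apiresponse=1"
```

Called with "http://www.linterweb.fr" and no title, this reproduces the sample URL given in the note.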
National Archives US
  • Domain: webharvest.gov
  • Hostname: <none>
  • Path: <variable>
  • Timestamp: 4-14 digits
  • Examples:
National Archives Iceland
  • Domain: vefsafn.is
  • Hostname: wayback
  • Path: wayback
  • Timestamp: 4-14 digits
  • Examples:
Europa Archives (Ireland)
  • Domain: europarchive.org
  • Hostname: collection
  • Path: nli
  • Timestamp: 4-14 digits
  • Examples:
Perma CC
  • Domain: perma-archives.org, perma.cc
  • Hostname: <none>
  • Path: <none>, warc
  • Timestamp: 4-14 digits for perma-archives.org, or snapshot ID
  • Examples:
Proni Web Archives
  • Domain: proni.gov.uk
  • Hostname: webarchive
  • Path: <none>
  • Timestamp: 4-14 digits
  • Examples:
Parliament UK
  • Domain: parliament.uk
  • Hostname: webarchive
  • Path: <none>
  • Timestamp: 4-14 digits
  • Examples:
UK Web Archive (British Library)
  • Domain: webarchive.org.uk
  • Hostname: www
  • Path: wayback/archive
  • Timestamp: 4-14 digits
  • Examples:
Canada
  • Domain: collectionscanada.gc.ca
  • Hostname: www
  • Path: archivesweb, webarchives
  • Timestamp: 4-14 digits
  • Examples:
  • Note: Not to be confused with other close URL variants. Only capture "/webarchives/" or "/archivesweb/"
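The restriction to "/webarchives/" and "/archivesweb/" can be enforced by anchoring the path segment in the pattern. The sketch below assumes that the timestamp directly follows that segment; the pattern and the name `is_canada_archive` are illustrative and not taken from the list.

```python
import re

CANADA_RE = re.compile(
    r"^https?://www\.collectionscanada\.gc\.ca"
    r"/(?:webarchives|archivesweb)"        # only these two path segments
    r"/(?P<ts>[0-9]{4,14})"                # assumed position of the timestamp
    r"/(?P<url>.+)$"
)


def is_canada_archive(url):
    """True only for URLs under /webarchives/ or /archivesweb/."""
    return CANADA_RE.match(url) is not None
```

Because the slash after the segment name is part of the match, near-variants of the path are rejected rather than captured by accident.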
Catalonian Archive
  • Domain: padi.cat(:8080)?
  • Hostname: <none>, www
  • Path: wayback
  • Timestamp: 4-14 digits
  • Examples:
Singapore Archives
  • Domain: nlb.gov.sg
  • Hostname: eresources
  • Path: webarchives/wayback
  • Timestamp: 4-14 digits
  • Examples:
  • Note: Not to be confused with other close URL variants. Only capture "/webarchives/wayback/"
Slovenian Archives
  • Domain: nuk.uni-lj.si:8080
  • Hostname: nukrobi2 (may change)
  • Path: wayback
  • Timestamp: 4-14 digits
  • Examples:
Estonia Archives
  • Domain: digar.ee
  • Hostname: veebiarhiiv
  • Path: a
  • Timestamp: 4-14 digits
  • Examples:
Bavarian Archives
  • Domain: bib-bvb.de
  • Hostname: langzeitarchivierung
  • Path: wayback
  • Timestamp: 4-14 digits
  • Examples:
York University Digital Library
  • Domain: yorku.ca
  • Hostname: digital.library
  • Path: wayback
  • Timestamp: 4-14 digits
  • Examples:

Other

Memento
Note: Redirects to an external archive service based on cached data in the Memento database which can fluctuate and/or be inaccurate due to the cache going out of sync with the client service.
Google
Note: Links quickly expire.
Note: No Memento access