Wikipedia:List of web archives on Wikipedia
Jump to navigation
Jump to search
Web Archives on Wikipedia
- List of known web archive services in-use on English Wikipedia. Sorted roughly by number of uses from most to least. Wayback Machine is about 80% of the total. Data initially compiled by User:GreenC as of March 2017, updates and corrections welcome.
Archive services[edit]
- Domain: archive.org, waybackmachine.org
- Hostname: <none>, web, wayback, liveweb, www, www.web, classic-web, web-beta, replay, replay.web, web.wayback
- Path: <none>, web
- Timestamp: Number 4-14 digits. Or "*". Or "?". Or combination. May also contain trailing chars like "re_". If timestamp missing returns best available page.
- Examples:
- Domain: webcitation.org
- Hostname: <none>, www
- Path: base62ID, query, cache, getfile.php, <number>
- Timestamp: None. Uses &date=2012-06-01+21:40:03 in query?url ; the short ID is base62 which converts to unix time
- Examples:
- http://www.webcitation.org/gT64fd
- http://www.webcitation.org/66lmEkpE8?url=http://www.ariacharts.com.au/pages/charts_display_album.asp?chart%3D1G50
- http://www.webcitation.org/query?id=1138911916587475
- http://www.webcitation.org/query?url=http..&date=2012-06-01+21:40:03
- http://www.webcitation.org/1138911916587475
- http://www.webcitation.org/cache/73e53dd1f16cf8c5da298418d2a6e452870cf50e
- http://www.webcitation.org/getfile.php?fileid=1c46e791d68e89e12d0c2532cc3cf629b8bc8c8e
- Domain: .is, .today, .fo, .li
- Hostname: <none>, www
- Path: <none>
- Timestamp: 4-14 digits; or digits + characters (see example)
- Examples:
- National Archives UK
- Domain: nationalarchives.gov.uk
- Hostname: webarchive, yourarchives
- Path: <none>
- Timestamp: 4-14 digits
- Examples:
- NLA Australia
- Domain: nla.gov.au
- Hostname: pandora, trove, webarchive, content.webarchive
- Path: see examples. The /pan/ regex should be
/pan/[0-9]{4,7}/ - Timestamp: Three types (20120727-0512, S2000-Dec-5, 20120326012340)
- Examples:
- http://pandora.nla.gov.au/pan/14231/20120727-0512/www.howlspace.com.au/en2/inxs/inxs.htm
- http://pandora.nla.gov.au/pan/128344/20110810-1451/www.theaureview.com/guide/festivals/bam-festival-2010-ivorys-rock-qld.html
- http://pandora.nla.gov.au/nph-wb/20010328130000/http://www.howlspace.com.au/en2/arenatina/arenatina.htm
- http://pandora.nla.gov.au/nph-arch/2000/S2000-Dec-5/http://www.paralympic.org.au/athletes/athleteprofile60da.html
- http://webarchive.nla.gov.au/gov/20120326012340/http://news.defence.gov.au/2011/09/09/army-airborne-insertion-capability/
- http://content.webarchive.nla.gov.au/gov/wayback/20120326012340/http://news.defence.gov.au/2011/09/09/army-airborne-insertion-capability
- Note: Not to be confused with non-webarchive URLs that appear similar:
- Note: No memento access
- Domain: freezepage.com
- Hostname: <none>, www
- Path: <none>
- Timestamp: <none> (only available via web scrape)
- Examples:
- Note: If the account ID which created the snapshot expires for lack of activity (no login to freezepage), the snapshot is deleted from freezepage.com
- Note: No memento access
- Library of Congress
- Domain: loc.gov
- Hostname: webarchive
- Path: all, lcwa####
- Timestamp: 4-14 digits
- Examples:
- Arquivo.pt (Portugal)
- Domain: arquivo.pt
- Hostname: <none>
- Path: wayback, wayback/wayback
- Timestamp: 4-14 digits
- Examples:
- Stanford Edu
- Domain: stanford.edu
- Hostname: swap, sul-swap-prod
- Path: <none>
- Timestamp: 4-14 digits
- Examples:
- Archive-It
- Domain: archive-it.org
- Hostname: wayback
- Path: all, a 4 digit number
- Timestamp: 4-14 digits
- Examples:
- BibAlex
- Domain: bibalex.org:80
- Hostname: web.archive, web.petabox
- Path: web
- Timestamp: 4-14 digits
- Examples:
- WikiWix
- Domain: wikiwix.com
- Hostname: archive
- Path: cache
- Timestamp: 4-14 digits
- Examples:
- Note: Does not support https. Does not support Memento
- Note: API access added in March 2018. By appending &apiresponse=1 to the end of the URL. (http://archive.wikiwix.com/cache/?url=http://www.linterweb.fr&apiresponse=1). This may require encoding of any other & in the url= section
- Note: Supports &title argument at end of URL not part of the source URL (similar to &apiresponse). Gives the name of the Wikipedia article the link is being used in (optional).
- National Archives US
- Domain: webharvest.gov
- Hostname: <none>
- Path: <variable>
- Timestamp: 4-14 digits
- Examples:
- National Archives Iceland
- Domain: vefsafn.is
- Hostname: wayback
- Path: wayback
- Timestamp: 4-14 digits
- Examples:
- Europa Archives (Ireland)
- Domain: europarchive.org
- Hostname: collection
- Path: nli
- Timestamp: 4-14 digits
- Examples:
- Perma CC
- Domain: perma-archives.org, perma.cc
- Hostname: <none>
- Path: <none>, warc
- Timestamp: 4-14 digits for perma-archives.org, or snapshot ID
- Examples:
- Proni Web Archives
- Domain: proni.gov.uk
- Hostname: webarchive
- Path: <none>
- Timestamp: 4-14 digits
- Examples:
- Parliament UK
- Domain: parliament.uk
- Hostname: webarchive
- Path: <none>
- Timestamp: 4-14 digits
- Examples:
- UK Web Archive (British Library)
- Domain: webarchive.org.uk
- Hostname: www
- Path: wayback/archive
- Timestamp: 4-14 digits
- Examples:
- Canada
- Domain: collectionscanada.gc.ca
- Hostname: www
- Path: archivesweb, webarchives
- Timestamp: 4-14 digits
- Examples:
- Note: Not to be confused with other close URL variants. Only capture "/webarchives/" or "/archivesweb/"
- Catalonian Archive
- Domain: padi.cat(:8080)?
- Hostname: www, (none)
- Path: wayback
- Timestamp: 4-14 digits
- Examples:
- Singapore Archives
- Domain: nlb.gov.sg
- Hostname: eresources
- Path: webarchives/wayback
- Timestamp: 4-14 digits
- Examples:
- Note: Not to be confused with other close URL variants. Only capture "/webarchives/wayback/"
- Slovenian Archives
- Domain: nuk.uni-lj.si:8080
- Hostname: nukrobi2 (may change)
- Path: wayback
- Timestamp: 4-14 digits
- Examples:
- Estonia Archives
- Domain: digar.ee
- Hostname: veebiarhiiv
- Path: a
- Timestamp: 4-14 digits
- Examples:
- Bavarian Archives
- Domain: bib-bvb.de
- Hostname: langzeitarchivierung
- Path: wayback
- Timestamp: 4-14 digits
- Examples:
- York University Digital Library
- Domain: yorku.ca
- Hostname: digital.library
- Path: wayback
- Timestamp: 4-14 digits
- Examples:
Other[edit]
- Memento
- Note: Redirects to an external archive service based on cached data in the Memento database which can fluctuate and/or be inaccurate due to the cache going out of sync with the client service.
- Note: Links quickly expire.
- Note: No memento access