Webarchive

From Wikipedia, the free encyclopedia
  (Redirected from WebArchive)
Jump to: navigation, search
This article is about webarchive file format. For web archiving, see web archiving. For web.archive.org website, see Internet Archive.
Web archive
Filename extension .webarchive
Internet media type application/x-webarchive
Uniform Type Identifier (UTI) com.apple.webarchive
Type of format web page file archive
Container for websites
Extended from Apple Binary Property List

The webarchive file format is available on Mac OS X and Windows for saving and reviewing complete web pages using the Safari web browser.[1] Support for webarchive documents was added in Safari 4 Beta on Windows and is included in subsequent versions. Safari for iOS (iPhone and iPad) does not support web archive files, however a third party app[2] provides this functionality.

The webarchive format is a concatenation of source files with filenames saved in the binary plist format using NSKeyedEncoder.[citation needed] The API uses webarchives to simplify using cutting-and-pasting with whole or partial web pages.[citation needed]

A version of the webarchive format is used to bundle whole music albums and movies with extra content and menus inside iTunes LP and Extras.[citation needed]

Converting for other browsers[edit]

Workarounds to allow the file to be viewed in other browsers are possible, though specific webpage contents may hinder this process:

Alternatives[edit]

MAFF is an open format (with a published specification) that enables saving of whole webpages in a single file. It is currently supported by Firefox, using an extension.[4] Other web browsers use the MHTML format or do the equivalent by saving a directory of inline resources (usually images) alongside the HTML file, sometimes compressed, like the .war format used by Konqueror (tar+gzip or tar+bzip2). Safari does not support these alternative archive formats.

For archiving entire websites, the Internet Archive has developed the Web ARChive (WARC) format which was standardized by ISO and must not be confused with Safari's webarchive format.

HTMLD (HTML Directory) is a NeXT-developed format for saving web pages and their dependencies in a bundle that may also be served by a web server.[5]

References[edit]

  1. ^ a b De-archive Web Archives
  2. ^ Web Archive Viewer
  3. ^ WebArchive Extractor
  4. ^ "Mozilla Archive Format, with MHT and Faithful Save". Retrieved 8 December 2011. 
  5. ^ ".htmld Discussion".