Digital dark age

The digital dark age is a lack of historical information in the digital age as a direct result of outdated file formats, software, or hardware that become corrupt, scarce, or inaccessible as technologies evolve and data decays.[1] Future generations may find it difficult or impossible to retrieve electronic documents and multimedia because they were recorded in an obsolete or obscure file format. The name derives from the term Dark Ages, in the sense that there could be a relative lack of records in the digital age as documents are transferred to digital formats and original copies are lost. An early mention of the term was at a conference of the International Federation of Library Associations and Institutions (IFLA) in 1997.[1] The term was also mentioned in 1998 at the Time and Bits conference,[2][3] which was co-sponsored by the Long Now Foundation and the Getty Conservation Institute.

Proprietary and obsolete file formats

The problem is not limited to text documents, but applies equally to photos, video, audio and other kinds of electronic documents.[4] One concern leading to the use of the term is that documents are stored on physical media that require special hardware to be read, and that this hardware will no longer be available a few decades after the document was created. For example, disk drives capable of reading 5¼-inch floppy disks are already not readily available.[5]

The digital dark age also covers the problems that arise from obsolete file formats. In such cases, it is the lack of the necessary software that prevents stored documents from being retrieved. This is especially problematic when proprietary formats are used, because their specifications may never have been published, in which case it might be impossible to write appropriate software to read the file.
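A first step in recovering a file of unknown origin is often to identify its format from the leading bytes (its "signature"). The following Python sketch illustrates the idea under stated assumptions: the signature table is a small illustrative sample, and the names SIGNATURES, identify_format and identify_file are hypothetical, not part of any preservation tool. A file whose signature is not in any registry is exactly the case the paragraph above describes.

```python
# Illustrative signature table (assumption: a tiny sample; real format
# registries used by archives are far larger and more precise).
SIGNATURES = [
    (b"%PDF-", "PDF"),
    (b"\x89PNG\r\n\x1a\n", "PNG"),
    (b"GIF87a", "GIF"),
    (b"GIF89a", "GIF"),
    (b"PK\x03\x04", "ZIP container (e.g. OpenDocument, Office Open XML)"),
]


def identify_format(data: bytes) -> str:
    """Return a best-guess format name from the leading bytes, or 'unknown'."""
    for magic, name in SIGNATURES:
        if data.startswith(magic):
            return name
    return "unknown"


def identify_file(path: str) -> str:
    """Read only the first bytes of a file and identify its format."""
    with open(path, "rb") as f:
        return identify_format(f.read(16))
```

Identification only tells a reader which software to look for; it does not by itself make an undocumented format readable.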

Magnetic tape data storage

Magnetic tape data storage is a method of storing data on magnetic tape. It is used as a backup medium for digital data and is one way of mitigating a possible digital dark age. For example, in 2011 hundreds of thousands of Google accounts were reset and the data in those accounts went missing. Google was able to restore the data to the email accounts from copies stored on magnetic tape.[2] Magnetic tape storage is also used by financial institutions, hospitals, movie studios, and manufacturing companies to back up content.[3] Magnetic tape can hold hundreds of terabytes of data.[4]

Archiving the internet

The Internet Archive has stated that one of its goals is to prevent the digital dark age.[5]

Vinton Cerf, a vice president of Google, voiced his concerns about data preservation at the annual meeting of the American Association for the Advancement of Science: "As the way that we store information about ourselves develops, memories stored in files that use older technology are becoming harder to access. That could mean that historians of the future are unable to learn about our lives". His suggested solution is to preserve a sample of every piece of software and hardware that has ever existed so that it never becomes obsolete. He proposed taking an "X-ray snapshot" of the content, the application and the operating system, along with a description of the machine. This information would then be stored on servers in the cloud rather than in a museum.[6]
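The idea of storing a description of the software environment alongside the content can be illustrated with a much-simplified sketch. This is not Cerf's proposal itself, which involves preserving the software and a machine description in full; it only records a few descriptive fields plus a checksum that lets a later reader confirm the bytes have not decayed. The function names build_manifest and verify, the field names, and the application string are all hypothetical.

```python
import hashlib
import json
import platform


def build_manifest(content: bytes, file_name: str, application: str) -> dict:
    """Describe a piece of content and the environment it was created in."""
    return {
        "file_name": file_name,
        "size_bytes": len(content),
        # SHA-256 digest of the content, used later as a fixity check.
        "sha256": hashlib.sha256(content).hexdigest(),
        # Supplied by the caller; the sketch cannot detect this itself.
        "application": application,
        # Values reported by the machine running this code.
        "operating_system": platform.system(),
        "os_release": platform.release(),
        "machine": platform.machine(),
    }


def verify(content: bytes, manifest: dict) -> bool:
    """Return True if the content still matches the recorded checksum."""
    return hashlib.sha256(content).hexdigest() == manifest["sha256"]


if __name__ == "__main__":
    data = b"Minutes of the 1997 meeting"
    manifest = build_manifest(data, "minutes.txt", "ExampleWriter 1.0")
    print(json.dumps(manifest, indent=2))
    print("intact:", verify(data, manifest))
```

The manifest is written as JSON here only because it is a plain-text format that can be read without special software.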

Historical examples

A famous example is NASA, whose early space records have suffered from a dark age issue more than once. For over a decade, magnetic tapes from the 1976 Viking Mars landing were unprocessed. When later analyzed, the data was unreadable as it was in an unknown format and the original programmers had either died or left NASA. The images were eventually extracted following many months of puzzling through the data and examining how the recording machines functioned.[7]

Another example is the BBC Domesday Project, in which a survey of the nation was compiled 900 years after the Domesday Book was published. While the original Domesday Book is still readable today, there were great fears that the discs of the Domesday Project would become unreadable as software and disk drives capable of reading the format became increasingly rare. However, in 2002 the CAMiLEON project migrated the information to a system called DomesEm, allowing it to be accessed on modern computers.[8]

Encryption and data preservation

Encryption may exacerbate the problem of preserving data, since decoding adds complexity even when the relevant software is available.[9] Historically, encrypted data has been quite rare, but even the very simple means available throughout history have provided many examples of documents that can only be read with great effort. For example, it took the capacity of a distributed computing project to break the mechanically generated code of a single brief World War II submarine tactical message.[10] Modern encryption is used in many more documents and media because publishers want the protections promised by digital rights management (DRM).

Open source file formats

As more records are stored in digital form, there have been several measures to standardize electronic file formats so software to read them is widely available and can be re-implemented on new platforms if necessary.

PDF/A is an open standard based on Adobe Systems' PDF format.[11] It has been widely adopted by governments and archives around the world, such as those of the United Kingdom.[12]

The Open Document Format for Office Applications (OpenDocument) was standardized by OASIS in 2005 and by ISO in 2006. Since then, support for OpenDocument has been implemented in a large number of open source and proprietary applications. Using OpenDocument is therefore one option for archiving editable documents from office applications. More broadly, the use of open source software is a preventive measure.[13] Since the source code for reading and writing a file format is open, the code can be used as a base for future implementations. In 2007, the chief information officer of the UK's National Archives stated "We welcome open-source software because it makes our lives easier".[14]
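One practical consequence of an openly specified format is that a file can be inspected with generic tools. An OpenDocument file is a ZIP package containing an entry named "mimetype" that states the document's media type, so a few lines of standard-library Python suffice to read it. The sketch below is illustrative only: odf_media_type and make_minimal_package are hypothetical helper names, and the package built for the demonstration contains nothing but the "mimetype" entry, so it is not a complete, valid document.

```python
import io
import zipfile


def odf_media_type(package: bytes) -> str:
    """Read the media type declared in an OpenDocument package."""
    with zipfile.ZipFile(io.BytesIO(package)) as zf:
        return zf.read("mimetype").decode("ascii").strip()


def make_minimal_package(media_type: str) -> bytes:
    """Build an in-memory ZIP holding only a 'mimetype' entry (demo only)."""
    buf = io.BytesIO()
    # ZIP_STORED keeps the entry uncompressed, so it stays human-readable.
    with zipfile.ZipFile(buf, "w", zipfile.ZIP_STORED) as zf:
        zf.writestr("mimetype", media_type)
    return buf.getvalue()


if __name__ == "__main__":
    pkg = make_minimal_package("application/vnd.oasis.opendocument.text")
    print(odf_media_type(pkg))
```

Because both ZIP and the XML files inside the package are publicly documented, a reader for the format could be rewritten on a future platform even if today's office applications were no longer available.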

Data storage standardization

In 2007, Microsoft entered into a partnership with the UK's National Archives to prevent the digital dark age and "unlock millions of unreadable stored computer files".[15][16][17] The UK's National Archives now accepts various file formats for long-term preservation, including Office Open XML, PDF and OpenDocument.[18]

References

  1. "Data Reawakening". Science Friday. Retrieved 2018-03-01.
  2. "Thousands of Gmail users find emails missing". 2011-02-28. Retrieved 2018-02-28.
  3. "Ghosts In The Reels". Science Friday. Retrieved 2018-02-28.
  4. "Scientists warn we may be creating a 'digital dark age'". Public Radio International. Retrieved 2018-03-01.
  5. "About the Internet Archive". Archived from the original on 2 October 2013. Retrieved 5 October 2013.
  6. Ghosh, Pallab (2015). "Google's Vint Cerf warns of 'digital Dark Age'". BBC News, Science & Environment.
  7. Blakeslee, Sandra (20 March 1990). "Lost on Earth: Wealth of Data Found in Space". New York Times. Archived from the original on 9 November 2012. Retrieved 7 July 2013.
  8. McKie, Robin; Thorpe, Vanessa (3 March 2002). "Digital Domesday Book lasts 15 years not 1000". The Observer. Archived from the original on 20 January 2013.
  9. Digital Preservation Coalition (2012). "Media and Formats - Compression and Encryption". Digital Preservation Handbook. Archived from the original on 29 July 2012. Retrieved 17 August 2013.
  10. Wearden, Graeme (27 February 2006). "Distributed computing cracks Enigma code". CNET News. Archived from the original on 19 December 2010.
  11. "Adobe Acrobat Engineering:PDF Standards". Adobe. 12 March 2013. Archived from the original on 7 July 2013. Retrieved 7 July 2013.
  12. "Viewing government documents". GOV.UK. Cabinet Office. 6 August 2015. Retrieved 10 September 2015.
  13. Cassia, Fernando (March 28, 2007). "Open Source, the only weapon against 'planned obsolescence'". The Inquirer. Retrieved August 2, 2012.
  14. Donoghue, Andrew (19 July 2007). "Defending against the digital dark age". ZDNet. Archived from the original on 23 October 2012.
  15. Kennedy, Maev (4 July 2007). "National Archive project to avert digital dark age". News:Technology. The Guardian. Archived from the original on 17 July 2010. Retrieved 7 October 2009.
  16. Ferguson, Tim (5 July 2007). "Microsoft Helps Archives Save the Past". Technology. Business Week. Archived from the original on 10 July 2007. Retrieved 7 October 2009.
  17. Colvile, Robert (5 July 2007). "How to stave off a digital 'dark age'". Telegraph. Archived from the original on 24 April 2012. Retrieved 7 October 2009.
  18. "File formats for transfer - The National Archives".

External links

Video: Digital Dark Age (Computer History Museum, 2011)