Internet Memory Foundation
|Industry||Web archiving and preservation|
|Founded||2004 as European Archive; 2010 as Internet Memory|
|Headquarters||Amsterdam, The Netherlands|
The Internet Memory Foundation (formerly the European Archive Foundation) is a non-profit foundation whose purpose is to archive content of the World Wide Web. It supports projects and research on the preservation and protection of digital media content in various forms, with the aim of forming a digital library of cultural content.
The European Archive Foundation, a non-profit institution, was incorporated in 2004 in Amsterdam. An announcement at the opening of the Cross Media Week in Amsterdam in September 2006 included a quote from Brewster Kahle, founder of the Internet Archive. Julien Masanès was the foundation's first director. Operating from Amsterdam and Paris, the foundation said it would make public domain collections and web archives freely accessible. Masanès, previously at the Bibliothèque nationale de France, edited a book on Web archiving in 2007. The Paris organization is called Internet Memory Research and operates a service known as ArchiveTheNet.
In December 2010, the Foundation changed its name to Internet Memory to indicate a wider scope of activities.
In 2011, the foundation archived dozens of terabytes of data per month. The foundation is a member of the International Internet Preservation Consortium. It developed collaborations with both cultural institutions (such as the UK National Archives) and research teams, including the Max Planck Institute, TU Berlin, the University of Southampton and the Institut Telecom Paris Tech.
The foundation was involved in research projects to improve technologies for web-scale crawling, data extraction, text mining and preservation, in support of the growth and use of the Internet memory. The following projects were funded by the European Commission under its Seventh Framework Programme for Research and Technological Development:
- Living Web Archives (LiWA, contract 216267) ran from February 2008 through January 2011 with about 2.7 million Euro in funding. LiWA developed a set of Web archiving methods and tools.
- LivingKnowledge (contract 231126) ran from February 2009 through January 2012 with about 4.8 million Euro in funding. The goal was to improve navigation and search in large multimodal datasets.
- Longitudinal Analytics of Web Archive data (LAWA, contract 258105) ran from September 2010 through August 2013 with about 2.8 million Euro of funding. It said it would develop analytics for use in the Future Internet Research and Experimentation project.
- Collect-All ARchives to COmmunity MEMories (ARCOMEM, contract 270239) ran from January 2011 through December 2013 with 6 million Euro in funding. It studied the cost and risk of ephemeral information, such as that found on social networking sites.
- Scalable Preservation Environments (SCAPE, contract 270137) ran from February 2011 through July 2014 with about 8.6 million Euro in funding.
With funding from the Living Web Archives project, in December 2010 Internet Memory carried out a survey on Web archiving among European institutions. Results based on the 74 responses received were published.
The Web crawler used by the project is Heritrix version 3. Heritrix stores the resources it captures in a “container” file, the ARC file (.arc). The ARC format was later extended into the Web ARChive file format (.warc), which was approved as an international standard in June 2009 (ISO 28500:2009).
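As a rough illustration of the container idea, the sketch below builds a single WARC-style record in memory and reads it back, using only the Python standard library. It follows the general layout of a WARC record (a version line, named header fields, a blank line, then the content block). The function names `build_record` and `parse_record`, the example URI and the payload are hypothetical and chosen for illustration; this is not Heritrix's or the foundation's own code, and it covers only a small subset of the format.

```python
import uuid
from datetime import datetime, timezone
from typing import Dict, Tuple

CRLF = b"\r\n"


def build_record(target_uri: str, payload: bytes,
                 record_type: str = "resource") -> bytes:
    """Serialize one record: version line, named fields, blank line, block."""
    headers = [
        ("WARC-Type", record_type),
        ("WARC-Record-ID", "<urn:uuid:{}>".format(uuid.uuid4())),
        ("WARC-Date",
         datetime.now(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")),
        ("WARC-Target-URI", target_uri),
        ("Content-Type", "text/plain"),
        ("Content-Length", str(len(payload))),
    ]
    lines = [b"WARC/1.0"]
    lines += ["{}: {}".format(k, v).encode("utf-8") for k, v in headers]
    # A blank line ends the header; two CRLFs follow the content block.
    return CRLF.join(lines) + CRLF + CRLF + payload + CRLF + CRLF


def parse_record(raw: bytes) -> Tuple[str, Dict[str, str], bytes]:
    """Split a single serialized record into version, header fields, block."""
    head, _, rest = raw.partition(CRLF + CRLF)
    lines = head.decode("utf-8").split("\r\n")
    version = lines[0]
    fields = {}
    for line in lines[1:]:
        # Split on the first colon only: values such as URIs contain colons.
        name, _, value = line.partition(":")
        fields[name.strip()] = value.strip()
    # Content-Length says how many bytes of the remainder are the block.
    length = int(fields["Content-Length"])
    return version, fields, rest[:length]


if __name__ == "__main__":
    raw = build_record("http://example.org/", b"hello archive")
    version, fields, block = parse_record(raw)
    print(version, fields["WARC-Type"], fields["WARC-Target-URI"], block)
```

Real archive files concatenate many such records, often compressed, so production tools read them as a stream rather than holding a whole file in memory as this sketch does.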
Audio and video
Before focusing on web archiving, the European Archive Foundation collected one of the largest free online collections of classical music (more than 800 pieces, from Mozart to Dvořák) and Public Information Films from the British Government, in collaboration with the Netherlands Institute for Sound and Vision and the UK National Archives.
Selective web collection
The foundation archived a snapshot of the Italian web domain, made in collaboration with the National Library of Italy, and an archive of political websites of the 25 EU member states captured during the European constitutional debate. It also archived websites for, among others:
- The National Archives (United Kingdom)
- National Library of Ireland
- CERN, Organisation européenne pour la recherche nucléaire (Switzerland)
- Parliament of the United Kingdom
- Public Record Office of Northern Ireland