Digital curation

From Wikipedia, the free encyclopedia

Digital curation is the selection,[1] preservation, maintenance, collection, and archiving of digital assets.[2][3][4][5] Digital curation establishes, maintains, and adds value to repositories of digital data for present and future use.[4] This is often accomplished by archivists, librarians, scientists, historians, and scholars.[6] Enterprises are starting to use digital curation to improve the quality of information and data within their operational and strategic processes.[7] Successful digital curation will mitigate digital obsolescence, keeping the information accessible to users indefinitely.[8] Digital curation includes digital asset management, data curation, digital preservation, and electronic records management.[9]

Word History[edit]

Much like the word archive has layered meanings and uses, the word curation is both a noun and a verb, used originally in the field of museology to represent a wide range of activities, most often associated with collection care, long-term preservation, and exhibition design. Curation can be a reference to physical repositories that store cultural heritage or natural resource collections (e.g., a curatorial repository) or a representation of varied policies and processes involved with the long-term care and management of heritage collections, digital archives, and research data (e.g, curatorial/collections management plans, curation life-cycle, and data curation). Yet curation is also associated with short-term objectives and processes of selection and interpretation for the purposes of presentation, such as for gallery exhibitions and websites, which contribute to knowledge creation. It has also been applied to interaction with social media including compiling digital images, web links, and movie files.

The term curation entered the legal framework through federal historic preservation laws, starting with the National Historic Preservation Act of 1966,[10] and was further defined and coded into federal regulations through 36 CFR Part 79: Curation of Federally-owned and Administered Archaeological Collections.[11] Curation has since permeated into an array of disciplines but remains closely tied to heritage and information management.

Core Principles and Activities[edit]

The term "digital curation" was first used in the e-science and biological science fields as a means of differentiating the additional suite of activities ordinarily employed by library and museum curators to add value to their collections and enable its reuse[12][13][14] from the smaller subtask of simply preserving the data, a significantly more concise archival task.[12] Additionally, the historical understanding of the term "curator" demands more than simple care of the collection. A curator is expected to command academic mastery of the subject matter as a requisite part of appraisal and selection of assets and any subsequent adding of value to the collection through application of metadata.[12]


There are five commonly accepted principles that govern the occupation of digital curation:

  • Manage the complete birth-to-retirement life cycle of the digital asset.[5]
  • Evaluate and cull assets for inclusion in the collection.[5]
  • Apply preservation methods to strengthen the asset’s integrity and reusability for future users.[5]
  • Act proactively throughout the asset life cycle to add value to both the digital asset and the collection.[5]
  • Facilitate the appropriate degree of access to users.[5]


The Digital Curation Center offers the following step-by-step life cycle procedures for putting the above principles into practice:[15]

Sequential Actions:

  • Conceptualize: Consider what digital material you will be creating and develop storage options. Take into account websites, publications, email, among other types of digital output.[4][15]
  • Create: Produce digital material and attach all relevant metadata, typically the more metadata the more accessible the information.[4][15]
  • Appraise and select: Consult the mission statement of the institution or private collection and determine what digital data is relevant. There may also be legal guidelines in place that will guide the decision process for a particular collection.[4]
  • Ingest: Send digital material to the predetermined storage solution. This may be an archive, repository or other facility.[4][15]
  • Preservation action: Employ measures to maintain the integrity of the digital material.[4][15]
  • Store: Secure data within the predetermined storage facility.[4][15]
  • Access, use, and reuse: Determine the level of accessibility for the range of digital material created. Some material may be accessible only by password and other material may be freely accessible to the public.[4][15] Routinely check that material is still accessible for the intended audience and that the material has not been compromised through multiple uses.[15]
  • Transform: If desirable or necessary the material may be transferred into a different digital format.[15]

Occasional Actions:

  • Dispose: Discard any digital material that is not deemed necessary to the institution.[4][15]
  • Reappraise: Reevaluate material to ensure that is it still relevant and is true to its original form.[4][15]
  • Migrate: Migrate data to another format in order to protect data for using better in the future.[15]

Related terms[edit]

The term "digital curation" is sometimes used interchangeably with terms such as "digital preservation" and "digital archiving."[2][16] While digital preservation does focus a significant degree of energy on optimizing reusability, preservation remains a subtask to the concept of digital archiving, which is in turn a subtask of digital curation.[12][14] For example, archiving is a part of curation, but so are subsequent tasks such as themed collection-building, which is not considered an archival task. Similarly, preservation is a part of archiving, as are the tasks of selection and appraisal that are not necessarily part of preservation.[14]

Data curation is another term that is often used interchangeably with digital curation, however common usage of the two terms differs. While "data" is a more all-encompassing term that can be used generally to indicate anything recorded in binary form, the term "data curation" is most common in scientific parlance and usually refers to accumulating and managing information relative to the process of research.[17] Data-driven research of education request the role of information professional gradually develop tradition of digital service to data curation particularly at the management of digital research data.[18] So, while documents and other discrete digital assets are technically a subset of the broader concept of data,[12] in the context of scientific vernacular digital curation represents a broader purview of responsibilities than data curation due to its interest in preserving and adding value to digital assets of any kind.[13]


Rate of creation of new data and data sets[edit]

The ever lowering cost and increasing prevalence of entirely new categories of technology has led to a quickly growing flow of new data sets.[19] These come from well established sources such as business and government, but the trend is also driven by new styles of sensors becoming embedded in more areas of modern life.[13] This is particularly true of consumers, whose production of digital assets is no longer relegated strictly to work. Consumers now create wider ranges of digital assets, including videos, photos, location data, purchases, and fitness tracking data, just to name a few, and share them in wider ranges of social platforms.[13]

Additionally, the advance of technology has introduced new ways of working with data. Some examples of this are international partnerships that leverage astronomical data to create "virtual observatories," and similar partnerships have also leveraged data resulting from research at the Large Hadron Collider at CERN and the database of protein structures at the Protein Data Bank.[14]

Storage format evolution and obsolescence[edit]

By comparison, archiving of analog assets is notably passive in nature, often limited to simply ensuring a suitable storage environment.[2] Digital preservation requires a more proactive approach.[20] Today’s artifacts of cultural significance are notably transient in nature and prone to obsolescence when social trends or dependent technologies change.[13] This rapid progression of technology occasionally makes it necessary to migrate digital asset holdings from one file format to another in order to mitigate the dangers of hardware and software obsolescence which would render the asset unusable.[15][8]

Underestimation of human labor costs[edit]

Modern tools for program planning often underestimate the amount of human labor costs required for adequate digital curation of large collections. As a result cost-benefit assessments often paint an inaccurate picture of both the amount of work involved and the true cost to the institution for both successful outcomes and failures.[13]

The concept of cost in business field would be more obvious. Varieties of business systems are running for daily operations. For example, human resources systems deal with recruitment and payroll, communication systems manage internal and external email, and administration systems handle finance, marketing, and other aspects. However, business systems in institutions are not designed for long-term information preservation initially.[21] In some instances, business systems are revised to become Digital Curation systems for preserving transaction information due to cost consideration. The example of business systems are Enterprise Content Management (ECM) applications, which are used by designated group people such as business executives, customers for information management that support key processes organizationally. In the long run, to transfer digital content from ECM applications to Digital Curation (DC) applications would be a trend in large organizations domestically or internationally. The improvement of maturity models of ECM and DC may add value to information that request cost deduction and extensive use for further modification.[21]

Standardization and coordination between institutions[edit]

An absence of coordination across different sectors of society and industry in areas such as the standardization of semantic and ontological definitions,[22] and in forming partnerships for proper stewardship of assets has resulted in a lack of interoperability between institutions, and a partial breakdown in digital curation practice from the standpoint of the ordinary user.[13] The example of coordination is Open Archival Information System (OAIS).[2]

OAIS Reference Model allows professionals and many other organizations and individuals to contribute efforts to the OAIS open forums for developing international standards of archival information in long-term access.[2][23]

Digitization of analog materials[edit]

The curation of digital objects is not limited to strictly born-digital assets. Many institutions have engaged in monumental efforts to digitize analog holdings in an effort to increase access to their collections.[2] Examples of these materials are books, photographs, maps, audio recordings, and more.[13] The process of converting printed resources into digital collections has been epitomized to some degree by librarians and related specialists. For example, The Digital Curation Centre is claimed to be a "world leading centre of expertise in digital information curation"[24] that assists higher education research institutions in such conversions.

Material Types[edit]


Built Cultural Heritage[edit]

Monuments or Architectural Assets[edit]

Nowadays, with the development in ICT and computer-based visualisation, curators benefit from the 3D Reconstruction methods and Digital Twin to not only represent their updated and authentic cultural heritage data sets but also assist conservation architects and the other experts in further practices on the assets.[25]

Intangible Cultural Heritage[edit]


New representational formats[edit]

For some topics, knowledge is embodied in forms that have not been conducive to print, such as how choreography of dance or of the motion of skilled workers or artisans is difficult to encode. New digital approaches such as 3D holograms and other computer-programmed expressions are developing.[citation needed]

For mathematics, it seems possible for a new common language to be developed that would express mathematical ideas in ways that can be digitally stored, linked, and made accessible. The Global Digital Mathematics Library is a project to define and develop such a language.[26][27]


The ability of the intended user community to access the repository’s holdings is of equal importance to all the preceding curatorial tasks. This must take into account not only the user community’s format and communication preferences, but also a consideration of communities that should not have access for various legal or privacy reasons.[28]

Access can be increased by providing information about open access status with open data and open source methods such as the OAI-PMH endpoints of an open archive, which are then aggregated by databases and search engines like BASE, CORE, and Unpaywall for academic papers.[29]

Responses to challenges[edit]

  • Specialized research institutions[30][31]

There are three elements for essential needs of institutions dealing with issues of digital curation: Leadership, Resources, and Collaboration. Three elements related to the role of advance-guards for librarians and archivists working with open approaches to technology, standardized process and scholarly communication. The archivist with leadership, who needs to be a dynamic and active role to embrace technology, standardized process, and scholarly communication. In addition, Archivist leader might adopt the business concept and methods to deal with their workflow such as raise funds, invest technology system, and comply with industry standards, in order to obtain more resources. Collaboration in archives and digital curation community could provide and share training, technologies, standards, and tools to help institutions on challengeable issues of digital curation. Digital Preservation Coalition (DPC), the Open Preservation Foundation or novel partnerships offer collaboration opportunity to institutions facing similar challenges in digital curation issues.[32]

  • Academic courses

Information field especially in libraries, archives, and museums significantly need to bring knowledge of new technologies. Traditional graduate school education is not enough to meet that demand; training program for current staffs in cultural repository would be an efficient supplement for that request, such as professional workshops, and MOOCs (Massively Open Online Courses) in data curation and management.[33]

International Digital Curation Conference (IDCC) is an established annual event since 2005, aiming to collaborate with individuals, organizations, and institutions facing challenges, supporting development, and exchanging ideas in the field.[36]

  • Peer reviewed technical and industry journals[37]

The International Journal of Digital Curation (IJDC) is administered by IJDC Editorial Board including the Editor-in-Chief, Digital Curation Center (DCC), and the following members. IJDC dedicate to provide scholarly platform for sharing, discussing, and improving knowledge and information of digital curation within the worldwide community. IJDC has two types of submission under editorial guidelines, which are peer-reviewed papers and general articles base on original research, the field information, and relevant events in digital curation. IJDC is published by the University of Edinburgh for the Digital Curation Centre in electronic form on a rolling basis two times a year. The open access to the public supports knowledge exchangeable in digital curation worldwide.[38]


Many approaches to digital curation exist and have evolved over time in response to the changing technological landscape. Two examples of this are sheer curation[12] and channelization.[citation needed]

Sheer curation is an approach to digital curation where curation activities are quietly integrated into the normal work flow of those creating and managing data and other digital assets. The word sheer is used to emphasize the lightweight and virtually transparent nature of these curation activities. The term sheer curation was coined by Alistair Miles in the ImageStore project,[39] and the UK Digital Curation Centre's SCARP project.[40] The approach depends on curators having close contact or 'immersion' in data creators' working practices. An example is the case study of a neuroimaging research group by Whyte et al., which explored ways of building its digital curation capacity around the apprenticeship style of learning of neuroimaging researchers, through which they share access to datasets and re-use experimental procedures.[41]

Sheer curation depends on the hypothesis that good data and digital asset management at the point of creation and primary use is also good practice in preparation for sharing, publication and/or long-term preservation of these assets. Therefore, sheer curation attempts to identify and promote tools and good practices in local data and digital asset management in specific domains, where those tools and practices add immediate value to the creators and primary users of those assets. Curation can best be supported by identifying existing practices of sharing, stewardship, and re-use that add value, and augmenting them in ways that both have short-term benefits, and in the longer term reduce risks to digital assets or provide new opportunities to sustain their long-term accessibility and re-use value.[citation needed]

The aim of sheer curation is to establish a solid foundation for other curation activities which may not directly benefit the creators and primary users of digital assets, especially those required to ensure long-term preservation. By providing this foundation, further curation activities may be carried out by specialists at appropriate institutional and organisation levels, whilst causing the minimum of interference to others.[citation needed]

A similar idea is curation at source used in the context of Laboratory Information Management Systems LIMS. This refers more specifically to automatic recording of metadata or information about data at the point of capture and has been developed to apply semantic web techniques to integrate laboratory instrumentation and documentation systems.[42] Sheer curation and curation-at-source can be contrasted with post hoc digital preservation, where a project is initiated to preserve a collection of digital assets that have already been created and are beyond the period of their primary use.[citation needed]

Channelization is curation of digital assets on the web, often by brands and media companies, into continuous flows of content, turning the user experience from a lean-forward interactive medium, to a lean-back passive medium. The curation of content can be done by an independent third party, that selects media from any number of on-demand outlets from across the globe and adds them to a playlist to offer a digital "channel" dedicated to certain subjects, themes, or interests so that the end user would see and/or hear a continuous stream of content.[citation needed]

See also[edit]


  1. ^ Erin Scime (8 December 2009). "The Content Strategist as Digital Curator". A List Apart.
  2. ^ a b c d e f "Digital Curation". Smithsonian Institution Archives. Archived from the original on October 13, 2022. Retrieved November 14, 2022.
  3. ^ Rusbridge, C.; Buneman, P.; Burnhill, P.; Giaretta, D.; Ross, S.; Lyon, L.; Atkinson, M. (2005). "The Digital Curation Centre: A Vision for Digital Curation" (PDF). 2005 IEEE International Symposium on Mass Storage Systems and Technology (PDF). p. 31. doi:10.1109/LGDI.2005.1612461. ISBN 978-0-7803-9228-1. S2CID 20810596.
  4. ^ a b c d e f g h i j k "What is Digital Curation?". Digital Curation Centre. Retrieved 2008-04-01.
  5. ^ a b c d e f Elizabeth Yakel (2007). "Digital curation". OCLC Systems & Services: International Digital Library Perspectives. 23 (4): 335–340. doi:10.1108/10650750710831466. S2CID 33219560.
  6. ^ "Digital Curation - MLIS Career Pathway". SJSU School of Information. 5 October 2020. Archived from the original on October 19, 2021. Retrieved November 4, 2022.
  7. ^ E. Curry, A. Freitas, and S. O'Riáin, "The Role of Community-Driven Data Curation for Enterprises," Archived 2012-01-23 at the Wayback Machine in Linking Enterprise Data, D. Wood, Ed. Boston, MA: Springer US, 2010, pp. 25-47.
  8. ^ a b Post, Colin (2021). "The Art of Digital Curation: Co-operative Stewardship of Net-Based Art" (PDF). Archivaria. 92: 6–47.
  9. ^ Yakel, Elizabeth (2013-04-13). "Digital curation". OCLC Systems & Services: International Digital Library Perspectives. 23 (4): 335–340. doi:10.1108/10650750710831466. S2CID 33219560.
  10. ^ United States. 1993. National Historic Preservation Act of 1966, as amended. Washington, DC: Produced by the Advisory Council on Historic Preservation, in conjunction with the National Conference of State Historic Preservation Officers, and the National Park Service.
  11. ^ United States. 1990. 36 CFR Part 79: Curation of Federally-owned and Administered Archaeological Collections. Washington, DC: U.S. Dept. of the Interior, National Park Service.
  12. ^ a b c d e f Dallas, Costis (2016-12-01). "Digital curation beyond the "wild frontier": a pragmatic approach". Archival Science. 16 (4): 421–457. doi:10.1007/s10502-015-9252-6. ISSN 1389-0166. S2CID 143306405.
  13. ^ a b c d e f g h Committee on Future Career Opportunities and Educational Requirements for Digital Curation; Board on Research Data and Information; Policy And Global, Affairs; National Research Council (2015). Preparing the Workforce for Digital Curation. Committee on Future Career Opportunities Educational Requirements for Digital Curation / Board on Research Data Information / Policy and Global Affairs / National Research Council. doi:10.17226/18590. ISBN 9780309296946. PMID 25950079. {{cite book}}: |last3= has generic name (help)
  14. ^ a b c d Beagrie, Neil (2008). "Digital Curation for Science, Digital Libraries, and Individuals". International Journal of Digital Curation. 1: 3–16. doi:10.2218/ijdc.v1i1.2.
  15. ^ a b c d e f g h i j k l m Higgins, Sarah (2008). "The DCC Curation Lifecycle Model". International Journal of Digital Curation. 3: 134–140. doi:10.2218/ijdc.v3i1.48. hdl:1842/3350. S2CID 11415404. Retrieved 2018-02-19.
  16. ^ Lazorchak, Butch (August 23, 2011). "Digital Preservation, Digital Curation, Digital Stewardship: What's in (Some) Names?". Library of Congress. Archived from the original on September 30, 2022. Retrieved November 14, 2022.
  17. ^ Borgman, Christine L. Big data, little data, no data : scholarship in the networked world. Cambridge, Massachusetts. ISBN 9780262327862. OCLC 900409008.
  18. ^ Weber, Nicholas; Palmer, Carole L; Chao, Tiffany (October 2012). "Current Trends and Future Directions in Data Curation Research and Education". Journal of Web Librarianship. 6 (4): 305–320. doi:10.1080/19322909.2012.730358. hdl:2142/48759. S2CID 62237710.
  19. ^ Ray, Joyce (2009). "Sharks, digital curation, and the education of information professionals". Museum Management and Curatorship. 24 (4): 357–368. doi:10.1080/09647770903314720. S2CID 143147784.
  20. ^ Higgins, Sarah (2011). "Digital Curation: The Emergence of a New Discipline". International Journal of Digital Curation. 6 (2): 78–88. doi:10.2218/ijdc.v6i2.191.
  21. ^ a b Katuu, Shadrack. (2012). Enterprise Content Management and Digital Curation Applications Maturity Model Connections.
  22. ^ Paul Watry (November 2007). "Digital Preservation Theory and Application: Transcontinental Persistent Archives Testbed Activity". The International Journal of Digital Curation. 2 (2). Archived from the original on March 15, 2008. Retrieved September 12, 2022.
  23. ^ A Lee, Christopher. (2011). Open Archival Information System (OAIS) Reference Model. 10.1201/b11499-56.
  24. ^ "About the DCC". Website. Digital Curation Centre. Retrieved 6 March 2013.
  25. ^ Parsinejad, H.; Choi, I.; Yari, M. (2021). "Production of Iranian Architectural Assets for Representation in Museums: Theme of Museum-Based Digital Twin". Body, Space and Technology. 20 (1): 61–74. doi:10.16995/bst.364.
  26. ^ Committee on Planning a Global Library of the Mathematical Sciences (2014). Read "Developing a 21st Century Global Library for Mathematics Research" at arXiv:1404.1905. doi:10.17226/18619. ISBN 978-0-309-29848-3. S2CID 119149004.
  27. ^ Ion, Patrick (2016), "The Effort to Realize a Global Digital Mathematics Library", Mathematical Software – ICMS 2016, Lecture Notes in Computer Science, vol. 9725, Springer International Publishing, pp. 458–466, doi:10.1007/978-3-319-42432-3_59, ISBN 9783319424316
  28. ^ Lavoie, Brian (2014). "The Open Archival Information System (OAIS) Reference Model: Introductory Guide". doi:10.7207/twr14-02. Retrieved September 12, 2022. {{cite journal}}: Cite journal requires |journal= (help)
  29. ^ Dhakal, Kerry (15 April 2019). "Unpaywall". Journal of the Medical Library Association. 107 (2): 286–288. doi:10.5195/jmla.2019.650. PMC 6466485.
  30. ^ Digital Curation Centre
  31. ^ Digital Preservation Coalition
  32. ^ "We must fight to preserve digital information". The Economist. 2019-02-21. ISSN 0013-0613. Retrieved 2019-05-01.
  33. ^ Tibbo, Helen R. (2015). "Digital Curation Education and Training: From Digitization to Graduate Curricula to MOOCs". International Journal of Digital Curation. 10: 144–153. doi:10.2218/ijdc.v10i1.352.
  34. ^ DigCCurr 2007 - an international symposium on Digital Curation, April 18-20, 2007
  35. ^ "1st African Digital Management and Curation Conference and Workshop". African Digital Curation Conference. 13 February 2008. Retrieved April 25, 2022.
  36. ^ "Homepage". DCC because good research needs good data. Retrieved April 25, 2022.
  37. ^ "Current Issue". International Journal of Digital Curation. Retrieved April 25, 2022.
  38. ^ "International Journal of Digital Curation". Retrieved 2019-05-01.
  39. ^ "The ImageStore Project - ImageWeb". Archived from the original on 2009-05-19. Retrieved 2009-05-18.
  40. ^ Digital Curation Centre: DCC SCARP Project
  41. ^ Whyte, Angus; Job, Dominic; Giles, Stephen; Lawrie, Stephen (July 2008). "Meeting Curation Challenges in a Neuroimaging Group". The International Journal of Digital Curation. 3 (1): 171–181. doi:10.2218/ijdc.v3i1.53. Retrieved April 25, 2022.
  42. ^ Frey, J. 'Sharing and Collaboration' keynote presentation at UK e-Science All Hands Meeting Archived 2008-08-20 at the Wayback Machine, 8–11 September 2008, Edinburgh

External links[edit]