DataONE

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Derek R Bullamore (talk | contribs) at 16:07, 21 April 2016 (Filling in 3 references using Reflinks). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Data Observation Network for Earth (DataONE)[1] is a project supported by the National Science Foundation under the DataNet program. DataONE will provide scientific data archiving for ecological and environmental data produced by scientists worldwide. DataONE's stated goal is to preserve and provide access to multi-scale, multi-discipline, and multi-national data. The community of users for DataONE includes scientists, ecosystem managers, policy makers, students, educators, librarians, and the public.

DataONE will link together existing cyberinfrastructure to provide a distributed framework, sound management, and robust technologies that enable long-term preservation of diverse multi-scale, multi-discipline, and multi-national observational data. The distributed framework will be composed of Coordinating Nodes, currently located at the Oak Ridge Campus, the University of California, Santa Barbara, and the University of New Mexico, and many Member Nodes located around the world. DataONE will also provide an Investigator Tool Kit[2] that gives the DataONE user community tools for accessing and using DataONE efficiently.
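The division of labor in this distributed framework can be illustrated with a toy model. The sketch below is not DataONE software: the class names, the `publish` helper, the node names, and the sample dataset are all hypothetical. It only assumes that a Member Node holds the data itself, while every Coordinating Node keeps a copy of the descriptive metadata.

```python
from dataclasses import dataclass, field


@dataclass
class MemberNode:
    """Hypothetical Member Node: stores the data objects themselves."""
    name: str
    objects: dict = field(default_factory=dict)  # identifier -> raw bytes

    def store(self, identifier, data):
        self.objects[identifier] = data


@dataclass
class CoordinatingNode:
    """Hypothetical Coordinating Node: keeps a network-wide metadata catalogue."""
    name: str
    catalogue: dict = field(default_factory=dict)  # identifier -> metadata

    def register(self, identifier, metadata):
        # Copy the record so each node holds its own independent mirror.
        self.catalogue[identifier] = dict(metadata)


def publish(member, coordinators, identifier, data, metadata):
    """Store data on one Member Node and mirror its metadata to every Coordinating Node."""
    member.store(identifier, data)
    record = {**metadata, "member_node": member.name}
    for node in coordinators:
        node.register(identifier, record)


# Illustrative names only; these are not real DataONE node identifiers.
coordinators = [CoordinatingNode(n) for n in ("cn-a", "cn-b", "cn-c")]
member = MemberNode("example-member")
publish(
    member,
    coordinators,
    "dataset-001",
    b"site,temp\nA,12.5\n",
    {"title": "Example stream temperatures"},
)
```

After `publish` runs, any of the three catalogues can answer where `dataset-001` is held, which is the point of keeping full metadata copies on each Coordinating Node.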

Coordinating nodes

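Coordinating Nodes will provide network-wide services to Member Nodes. They will be geographically replicated, with mirrored content and full copies of science metadata. The three Coordinating Nodes are located at the Oak Ridge Campus, the University of California, Santa Barbara, and the University of New Mexico.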

Member nodes

Member Nodes will consist of Earth observing institutions, projects, and networks. They will provide resources for their own data and replicated data, and focus on serving their specific constituencies. These member nodes are geographically distributed and consist of diverse implementations. Current Member Nodes include:

Investigator Tool Kit

The Tool Kit will provide tools for researchers to access DataONE. These will include both general-purpose and discipline-specific tools, and DataONE developers will adapt existing tools where possible. The Tool Kit will include Java and Python libraries, an R programming language plug-in for analysis, extensions for Excel, and support for the VisTrails and Kepler scientific workflow systems.

Data management

DataONE will provide a place for scientists to store data and its associated metadata. The metadata will make the data searchable and accessible to other scientists. Data management practices include:

  • Data management planning
  • Data acquisition (techniques, protocols, methods)
  • Data protection (backing up)
  • Data entry and manipulation (naming files, organization)
  • Quality control on data
  • Data analysis
  • Workflow tools (VisTrails, Kepler scientific workflow system)
  • Data documentation (metadata)
  • Data sharing, citation, and discovery
  • Data preservation & curation
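Two of the practices above, documentation with metadata and data protection, can be sketched in a few lines. This is a minimal illustration under stated assumptions, not DataONE's metadata schema or search service: the record fields, the helper names (`describe`, `search`, `is_intact`), and the sample datasets are hypothetical. The checksum uses SHA-256 from Python's standard `hashlib` module.

```python
import hashlib


def checksum(data: bytes) -> str:
    """Return the SHA-256 digest of a data object as a hex string."""
    return hashlib.sha256(data).hexdigest()


def describe(identifier, title, keywords, data):
    """Build a hypothetical metadata record documenting one data object."""
    return {
        "identifier": identifier,
        "title": title,
        "keywords": [k.lower() for k in keywords],
        "sha256": checksum(data),
    }


def search(records, term):
    """Return identifiers whose title or keywords match the search term."""
    term = term.lower()
    return [
        r["identifier"]
        for r in records
        if term in r["title"].lower() or term in r["keywords"]
    ]


def is_intact(record, data):
    """Check that stored data still matches the checksum in its metadata."""
    return checksum(data) == record["sha256"]


# Made-up sample data for illustration only.
lake_data = b"date,depth_m,temp_c\n2016-04-21,1.0,8.4\n"
bird_data = b"date,species,count\n2016-04-21,robin,3\n"

records = [
    describe("ds-1", "Lake temperature profiles", ["limnology", "temperature"], lake_data),
    describe("ds-2", "Spring bird counts", ["ornithology"], bird_data),
]

matches = search(records, "temperature")
intact = is_intact(records[0], lake_data)
damaged = is_intact(records[0], lake_data + b"corrupted")
```

Here the data files are never opened during the search; only the metadata records are consulted, which is why good documentation is what makes a dataset discoverable. The stored checksum lets a later reader detect whether a backup copy has changed.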

DataONE collaborates with other institutions to bring together tools that support good data management practices. One of those tools, developed in collaboration with other organizations and hosted by the University of California Curation Center (UC3), is the DMPTool for data management planning.[1]

Additional data management resources include a primer on best practices, a database of best practices in data management, educational modules and tutorials, webinars, and an investigator toolkit. Many of these resources have been used or adapted under a Creative Commons license by organizations and institutions that seek to educate other communities about data and research management.

DataONE community

The DataONE community includes research networks, professional societies, libraries, academic institutions, data centers, data repositories, environmental observatory networks, educators, scientists, policy makers, administrators, citizen scientists, international organizations, NGOs, ecosystem managers, students, private companies and the public.

DataONE has an active worldwide users group, the DataONE Users Group (DUG), that represents a wide range of stakeholders. The DUG meets annually and provides user feedback that guides areas of interest for future work and helps DataONE reach its stated goals.[32]

References

  1. ^ "DataONE". DataONE. Retrieved 2016-04-21.
  2. ^ "Investigator Toolkit". DataONE. Retrieved 2016-04-21.
  3. ^ "New Mexico's Flagship University | The University of New Mexico". Unm.edu. Retrieved 2016-04-21.
  4. ^ "Home - University of California, Santa Barbara". Ucsb.edu. Retrieved 2016-04-21.
  5. ^ "Welcome to eBird". eBird.org. Retrieved 2016-04-21.
  6. ^ "Dryad Digital Repository - Dryad". Datadryad.org. Retrieved 2016-04-21.
  7. ^ "Earth Data Analysis Center | Center for Geospatial & Information Technology Services". Edac.unm.edu. Retrieved 2016-04-21.
  8. ^ "Environmental Data for the Oak Ridge Area : Search". Mercury-ops2.ornl.gov. Retrieved 2016-04-21.
  9. ^ "ESA Data Registry". Data.esa.org. Retrieved 2016-04-21.
  10. ^ "Taking Europe's pulse - Research for our continent's future — LTER in Europe". Lter-europe.net. Retrieved 2016-04-21.
  11. ^ "GLEON". GLEON. Retrieved 2016-04-21.
  12. ^ "Gulf of Alaska Data Portal". Portal.aoos.org. Retrieved 2016-04-21.
  13. ^ "The IARC Data Archive at UAF, an AA/EO employer and educational institution". Climate.iarc.uaf.edu. 2007-08-23. Retrieved 2016-04-21.
  14. ^ "Cumulative human impacts data (2008 and 2013) Halpern B, et al. 2015" (JSP). Knb.ecoinformatics.org. Retrieved 2016-04-21.
  15. ^ The Long Term Ecological Research Network. "The Long Term Ecological Research Network | Long-term, broad-scale research to understand our world". Lternet.edu. Retrieved 2016-04-21.
  16. ^ "UC3 Merritt Home :". Merritt.cdlib.org. Retrieved 2016-04-21.
  17. ^ "MPC Data Projects". Ipums.org. Retrieved 2016-04-21.
  18. ^ "Current Member Nodes". DataONE. Retrieved 2016-04-21.
  19. ^ "Nevada Research Data Center". Sensor.nevada.edu. Retrieved 2016-04-21.
  20. ^ "Current Member Nodes". DataONE. Retrieved 2016-04-21.
  21. ^ "Dash". Oneshare.cdlib.org. Retrieved 2016-04-21.
  22. ^ "ORNL DAAC for Biogeochemical Dynamics". Daac.ornl.gov. doi:10.1016/j.foreco.2008.11.016. Retrieved 2016-04-21.
  23. ^ "Pisco | Pisco". Data.piscoweb.org. Retrieved 2016-04-21.
  24. ^ "Regional and Global Data Available Through Mercury". Daac.ornl.gov. 2010-03-18. doi:10.1016/j.foreco.2008.11.016. Retrieved 2016-04-21.
  25. ^ "South African National Parks - SANParks - Official Website - Accommodation, Activities, Prices, Reservations". SANParks.org.za. Retrieved 2016-04-21.
  26. ^ "SEAD | A Knowledge Network for Collaboration, Data Curation, and Discovery". Sead-data.net. Retrieved 2016-04-21.
  27. ^ "TFRI Metacat Data Catalog". Metacat.tfri.gov.tw. Retrieved 2016-04-21.
  28. ^ "Terrestrial Ecosystem Research Network: Home". TERN. Retrieved 2016-04-21.
  29. ^ "KU Biodiversity Institute & Natural History Museum". Biodiversity.ku.edu. Retrieved 2016-04-21.
  30. ^ "USA National Phenology Network | USA National Phenology Network". Usanpn.org. 2016-04-15. Retrieved 2016-04-21.
  31. ^ "U.S. Geological Survey Science Data Catalog". Data.usgs.gov. Retrieved 2016-04-21.
  32. ^ "Users Group". DataONE. Retrieved 2016-04-21.