Jump to content

Genomic Standards Consortium

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by InternetArchiveBot (talk | contribs) at 20:11, 25 December 2019 (Rescuing 2 sources and tagging 0 as dead.) #IABot (v2.0). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

The Genomic Standards Consortium (GSC) is an initiative working towards richer descriptions of our collection of genomes, metagenomes and marker genes. Established in September 2005, this international community includes representatives from a range of major sequencing and bioinformatics centres (including NCBI, EMBL, DDBJ, JCVI, JGI, EBI, Sanger, FIG) and research institutions. The goal of the GSC is to promote mechanisms for standardizing the description of (meta)genomes, including the exchange and integration of (meta)genomic data. The number and pace of genomic and metagenomic sequencing projects will only increase as the use of ultra-high-throughput methods becomes common place and standards are vital to scientific progress and data sharing.

Mission

Community-driven standards have the best chance of success if developed within the auspices of international working groups. Participants in the GSC include biologists, computer scientists, those building genomic databases and conducting large-scale comparative genomic analyses, and those with experience of building community-based standards. The mission of the GSC is to work with the wider community towards:

  • the implementation of a new genomic standards
  • methods of capturing and exchanging metadata
  • harmonization of metadata collection and analysis efforts across the wider genomics community

MIGS/MIMS/MIMARKS and other projects

The GSC has published a “Minimum Information about a (Meta)Genome Sequence” specification and has now completed a "Minimum Information about an ENvironmental Sequence" specification. MIGS/MIMS/MIMARKS provides an extension of the minimum information already captured by the primary nucleotide sequence archives (INSDC or DDBJ/ENA/GenBank). The development of any checklist must be an open and iterative process that involves a balanced group of participants. Further, this development process must be supported by providing mechanisms for achieving compliance if a checklist is to be adopted as a tool for the standardization of a particular area of knowledge. Work towards this goal has spawned a set of interlocking projects that are described in more detail here: GSC projects. These include The Genomic Contextual Data Markup Language (GCDML), Genomic Rosetta Stone (GRS), Habitat-Lite. Newer projects include the M5 project.

Linkages to other groups

The GSC is interested in making and building links with other communities. As stated above, the GSC is engaged in ontology development within the OBO Foundry. The GSC is also a founding member community of the Minimum Information about a Biomedical or Biological Investigation (MIBBI), an umbrella community for supporting and co-ordinating the development of checklists describing Minimum Information Standards.

Publications

The GSC maintains a list of publications on its wiki - GSC Publications. This list includes reports from all workshops, articles from the special issue of the journal OMICS on data standards, and the publications describing the MIGS/MIMS and MIMARKS specifications in the journal Nature Biotechnology (May 2008 and May 2011 respectively).