GWAS Central

From Wikipedia, the free encyclopedia
Jump to: navigation, search

GWAS Central (previously HGBASE, HGVbase and HGVbaseG2P) is a publicly available database of summary-level findings from genetic association studies in humans, including genome-wide association studies (GWAS).

It is funded through the GEN2PHEN project by the European Union under their Seventh Framework Programme.


GWAS Central contains the most comprehensive collection of summary-level p-value GWAS data. The web resource employs powerful graphical and text based data presentation methods for discovery of and simultaneous visualisation and co-examination of many studies, at genome-wide and region-specific levels. Studies of interest can now be identified using chromosomal regions/genes and markers; there is also the facility for researchers to view their own data alongside selected studies.

Current content includes ‘top’ p-values from collections; supplementary data; direct researcher submissions; and publicly available data. Consequently, the database now hosts >21 million p-values and 708 studies (vs 3,948 p-values and 798 studies in the NHGRI GWAS catalog), representing ~5% of all such data yet produced. GWAS Central makes parts of its data freely available for download by the research community. However, only parts of the data may be downloaded freely, the whole database content can be accessed as part of a collaboration.


The Human Genome Bi-Allelic SEquence (HGBASE) database[1] was the first version of what is now GWAS Central.[2] It was first released in August 1998, focusing on providing a centralized collection of known human single nucleotide polymorphisms and other simple DNA variants. It was the first publicly available SNP database. The project was expanded over the next year by a consortium including the Karolinska Institute, the European Bioinformatics Institute and the European Molecular Biology Laboratory. Corporate support was provided by Pfizer and GlaxoSmithKline. The version released in November 2001 was renamed the Human Genome Variation database (HGVbase),[3] as this was a better reflection of the scope of the database and its emphasis on collection from many different laboratories. In addition this also highlighted its new role as a central repository for data collection efforts in collaboration with the Human Genome Variation Society.

HGVbase was scaled back in 2004 to simply provide an alternative representation of the full marker list from dbSNP, but development continued on its successor: the Human Genome Variation Genotype-to-Phenotype database (HGVbaseG2P),[4] in many ways the natural evolution of HGVbase into a central database for summary-level genetic association data. The work was originally funded by GlaxoSmithKline, the University of Leicester, and the European Community's Sixth Framework Programme ('INFOBIOMED' Network of Excellence), but the GEN2PHEN project became the main source of funds in 2008. Early work in the project involved devising a powerful way of modeling phenotype and genotype- phenotype data, which itself was adopted and adapted to become the global standard Phenotype And Genotype Experiment Object model.[5] In 2008 HGVbaseG2P went live and extended the project's content and scope by adding a far broader and more comprehensive range of markers (i.e., SNPs, structural variants, and STSs), along with association data from many genetic association studies.

In February 2010, the project was once again renamed to GWAS Central, to reflect the growing focus on genome-wide association studies. GWAS Central is a core component of the GEN2PHEN project and intends to provide an operational model, plus an open-source software package, so others can create similar databases across the world. These will be hosted by institutes, consortia, and even individual laboratories; providing those groups a toolkit for publicising and publishing their genetic association findings on the web and examine their data alongside others data from similar valuable resources.


  1. ^ Brookes, AJ; Lehväslaiho, H; Siegfried, M; Boehm, JG; Yuan, YP; Sarkar, CM; Bork, P; Ortigao, F (2000-01-01). "HGBASE: a database of SNPs and other variations in and around human genes.". Nucleic Acids Research. 28 (1): 356–60. doi:10.1093/nar/28.1.356. PMC 102467Freely accessible. PMID 10592273. 
  2. ^ Beck, T; Hastings, RK; Gollapudi S; Free, RC; Brookes, AJ. (2014-07-01). "GWAS Central: a comprehensive resource for the comparison and interrogation of genome-wide association studies.". Eur J Hum Genet. 22 (7): 949–52. doi:10.1038/ejhg.2013.274. PMC 4060122Freely accessible. PMID 24301061. 
  3. ^ Fredman, D; Siegfried, M; Yuan, YP; Bork, P; Lehväslaiho, H; Brookes, AJ (2002-01-01). "HGVbase: a human sequence variation database emphasizing data quality and a broad spectrum of data sources.". Nucleic Acids Research. 30 (1): 387–91. doi:10.1093/nar/30.1.387. PMC 99093Freely accessible. PMID 11752345. 
  4. ^ Thorisson, GA; Lancaster, O; Free, RC; Hastings, RK; Sarmah, P; Dash, D; Brahmachari, SK; Brookes, AJ (Jan 2009). "HGVbaseG2P: a central genetic association database.". Nucleic Acids Research. 37 (Database issue): D797–802. doi:10.1093/nar/gkn748. PMC 2686551Freely accessible. PMID 18948288. 
  5. ^ Brookes AJ, Lehvaslaiho H, Muilu J, Shigemoto Y, Oroguchi T, Tomiki T, Mukaiyama A, Konagaya A, Kojima T, Inoue I, Kuroda M, Mizushima H, Thorisson GA, Dash D, Rajeevan H, Darlison MW, Woon M, Fredman D, Smith AV, Senger M, Naito K, Sugawara H (31 May 2009). "The phenotype and genotype experiment object model (PaGE-OM): a robust data structure for information related to DNA variation". Human Mutation. 30 (6): 968–977. doi:10.1002/humu.20973. PMID 19479963. 

External links[edit]