Jump to content

Serratus (virology): Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
Ababaian (talk | contribs)
{{connected contributor}} Initialize Serratus virology page with basic outline of the project
Tag: possible conflict of interest
(No difference)

Revision as of 21:12, 13 January 2023

Serratus is a large scale viroinformatics project for uncovering the total genetic diversity of Earth's virome. Originating with the goal of uncovering novel coronaviruses which may have been incidentally sequenced by other researchers, the project expanded to encompass all RNA viruses, those which encode a viral RNA-dependent RNA polymerase (RdRp).

At the end of 2020 there were approximately 15,000 distinct RNA virus sequences known from public databases, measured by the number of distinct RdRp (greater than 10% difference in amino acid sequence). Using a bioinformatics workflow optimized for large-scale cloud computing, the research team analyzed 5.7 million freely available sequencing datasets (20.4 petabytes of raw data) in the Sequence Read Archive in only 11 days and a computing cost of USD$23,900. This analysis yielded 132,000 novel viral RdRp, representing nearly an order of magnitude increase in the known genetic diversity of RNA viruses [1].

All Serratus data is made freely-available on the serratus.io website such that it can be used by researchers, clinicians, and epidemiologists to better understand the viral world.

References

  1. ^ Edgar RC, Taylor J, Lin V, Altman T, Barbera P, Meleshko D; et al. (2022). "Petabase-scale sequence alignment catalyses viral discovery". Nature. 602 (7895): 142–147. doi:10.1038/s41586-021-04332-2. PMID 35082445.{{cite journal}}: CS1 maint: multiple names: authors list (link)