Published July 28, 2015 | Version v1
Dataset Open

Data from: Viral dark matter and virus–host interactions resolved from publicly available microbial genomes

  • 1. University of Arizona
  • 2. University of British Columbia
  • 3. Joint Genome Institute

Description

The ecological importance of viruses is now widely recognized, yet our limited knowledge of viral sequence space and virus-host interactions precludes accurate prediction of their roles and impacts. Here we mined publicly available bacterial and archaeal genomic datasets to identify 12,498 high‑confidence viral genomes linked to their microbial hosts. These data augment public datasets 10-fold, provide first viral sequences for 13 new bacterial phyla including ecologically abundant phyla, and help taxonomically identify 7-38% of 'unknown' sequence space in viromes. Genome- and network-based classification was largely consistent with accepted viral taxonomy and suggested that (i) 264 new viral genera were identified (doubling known genera) and (ii) cross-taxon genomic recombination is limited. Further analyses provided empirical data on extrachromosomal prophages and co‑infection prevalences, as well as evaluation of in silico virus-host linkage predictions. Together these findings illustrate the value of mining viral signal from microbial genomes.

Files

VirSorter_Curated_Dataset_genbank-files.zip (254.5 MB)
md5:861a8bc37fdcaebbe7cd7f41a38024c7
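
The MD5 checksum listed for the archive can be used to confirm that a download completed without corruption. The following is a minimal sketch using only the Python standard library. It assumes the archive was saved in the current directory under the file name shown in the listing. The helper names `md5_of_file` and `verify_download` are hypothetical and not part of any Zenodo tooling.

```python
import hashlib
from pathlib import Path

# Checksum copied from the file listing for the archive.
EXPECTED_MD5 = "861a8bc37fdcaebbe7cd7f41a38024c7"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns empty bytes (EOF).
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed local file name; adjust to wherever the archive was saved.
    archive = Path("VirSorter_Curated_Dataset_genbank-files.zip")
    if archive.exists():
        print("checksum OK" if verify_download(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

Reading in chunks keeps memory use flat for a file of this size (254.5 MB). A mismatch usually means the download was truncated and should be repeated.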

Additional details

Related works

Is cited by
10.7554/eLife.08490 (DOI)