Published February 4, 2021 | Version v3
Journal article Open

Biosynthetic potential of the global ocean microbiome

  • 1. Department of Biology, Institute of Microbiology and Swiss Institute of Bioinformatics, ETH Zürich, Zürich 8093, Switzerland

Description

This dataset is part of the supplementary data to the manuscript 'Biosynthetic potential of the global ocean microbiome', DOI to come soon (preprint available here: https://doi.org/10.1101/2021.03.24.436479).

As part of this work, we established the Ocean Microbiomics Database (OMD), which can be interactively accessed at: https://www.microbiomics.io/ocean/. This repository versions some of the file that make up the database while other (larger) files can be accessed at ENA under the project PRJEB45951 (https://www.ebi.ac.uk/ena/browser/view/PRJEB45951).

  • Metagenomic assemblies:
    • antismash-bgcs-metagenomes-unfiltered.tar.gz: antiSMASH annotations of the metagenomic assemblies (unfiltered, i.e. includes scaffolds below 5kbp) 
    • assemblies-MGE-predictions.tsv.gz: Mobile genetic element (including plasmids, prophages, viruses etc) annotations for the metagenomic assemblies. 
    • antismash-bgcs-metagenomes-with-MGEs.tsv: Summary table combining both antiSMASH and MGE annotations. 
  • Genomes:
    • antismash-bgcs-genomes-unfiltered.tar.gz: antiSMASH annotations of the genomes in the OMD (unfiltered, i.e. includes scaffolds below 5kbp)
    • gecco-bgcs-genomes-summary.tsv.gz: GECCO annotations of the genomes in the OMD.
    • antismash-bgcs-genomes-with-MGEs.tsv: Summary table combining both antiSMASH and MGE annotations for the genomes reconstructed in this study (subset of the OMD). MGE annotations are derived from the metagenomic assemblies.
    • genomes-usage.tsv: Summary table of all the tables and their usage for the different analyses performed in this work. 
    • motus-profiles.tsv.gz: Metagenomic and metatranscriptomic taxonomic profiles. All genomes with enough marker genes were used to extend the mOTUs database, which was then use to generate these profiles.
  • Genes: (from the OMD genomes)
    • gene-catalog-membership.tsv.gz: Map between a gene and its representative in the 95% identity gene catalog.
    • gene-catalog-profile.tsv.gz: Abundance profiles of the representative genes of the gene catalog.
  • Candidatus Eudoremicrobiaceae:
    • Eudoremicrobiaceae-profiling-raw.tsv.gz
    • E.taraoceanii-metatranscriptomics-processed.tsv.gz

Any question or further requests can be addressed as specified at https://www.microbiomics.io/ocean/ or to the corresponding authors of the associated manuscript. 

Files

Files (8.7 GB)

Name Size Download all
md5:17add5b3a7494e4be68d93408e2188f8
698.3 MB Download
md5:cd5d951cee706ba5471fb00f335a7373
4.7 MB Download
md5:6440feae7b7ba1742fda21ec258d7d79
684.0 MB Download
md5:ccd9380ca0c2f8ddfe1ae05d9a06441d
4.7 MB Download
md5:7b9a342c1c7123df7c245dff64e75d1c
2.2 GB Download
md5:db0a9ecabdb073e36a5301c9938aa901
7.4 MB Download
md5:e5f09b769351be53d00a4127a4738a19
8.1 MB Download
md5:924ab1b01113aa3b4113e2e64056703b
43.0 MB Download
md5:aac53aa07adf6fe82060297964ebf280
600.8 MB Download
md5:e759389f806f7691f14c45269996fbea
4.5 GB Download
md5:349fe2a50e0d93b4c3c2e366dadcaa1c
2.9 MB Download
md5:568f9171bfd588c3cef1645841271b69
2.5 MB Download

Additional details

Related works

Is supplement to
Journal article: 10.1101/2021.03.24.436479 (DOI)