Dataset Open Access

AMPSphere : the worldwide survey of prokaryotic antimicrobial peptides

Santos-Júnior, Célio Dias; Schmidt, Thomas S.B.; Fullam, Anthony; Duan, Yiqian; Bork, Peer; Zhao, Xing-Ming; Coelho, Luis Pedro


JSON-LD (schema.org) Export

{
  "inLanguage": {
    "alternateName": "eng", 
    "@type": "Language", 
    "name": "English"
  }, 
  "description": "<p>&nbsp;</p>\n\n<p><strong>INTRODUCTION</strong></p>\n\n<p>AMPSphere is a comprehensive catalog of antimicrobial peptides predicted using Macrel (DOI: <a href=\"https://peerj.com/articles/10555/\">10.7717/peerj.10555</a>) from 63,410 public metagenomes, <a href=\"http://progenomes.embl.de/\">ProGenomes v2.2 database</a> (82,400 high-quality microbial genomes), and c.a. 4k non-whitelisted microbial genomes from NCBI.</p>\n\n<p>&nbsp;</p>\n\n<p><strong>GENERATION</strong></p>\n\n<p>Peptides were predicted using Macrel. Singleton peptides were removed, except those with a direct hit to the <a href=\"http://dramp.cpu-bioinfor.org/\">DRAMP database</a>.</p>\n\n<p>Redundant peptides were hierarchically clustered using CD-HIT (version 4.6) at 100%, 85%, 75%, and 50% of amino acid identity (and 90% of overlap of the shorter peptide). The obtained clusters were sorted by decrescent size and<br>\nnumbered as families. Each level of clustering was called SPHERE and was used to understand the AMPs&#39; structure accordingly to their orthology.</p>\n\n<p>Nucleotide sequences from the most frequent variants per AMP also were included in this version of AMPSphere.</p>\n\n<p>&nbsp;</p>\n\n<p><strong>STATISTICS</strong></p>\n\n<p>AMPSphere v.2021-03 contains 863,498 sequences (avg length: 36 amino acids, range 8-98). DRAMP database was used to find confirmed sequences with strict homology to reference. This approach showed that 2,488 peptides were previously confirmed in our dataset.</p>\n\n<p>&nbsp;</p>\n\n<p><strong>IDENTIFIERS</strong></p>\n\n<p>Peptides are named:</p>\n\n<pre><code>&gt;AMP10.XXX_XXX</code></pre>\n\n<p>Where `XXX_XXX` is a unique numerical identifier (starting at zero). Numbers were assigned in order of increasing<br>\nnumber of copies. So that the lower the number, the higher the number of copies of that peptide was present in the input data.</p>\n\n<p>Annotations were also provided as separated fields in the fasta file, containing their:</p>\n\n<p>- SPHERE families at level 4 (corresponding to hierarchically obtained clusters using 100%, 85%, and 75% of identity with a minimum overlap of 90% of the shorter gene).</p>\n\n<p>Example of the header:</p>\n\n<pre><code>&gt;AMP10.000_000 | SPHERE-III.001_396</code></pre>\n\n<p>&nbsp;</p>\n\n<p><strong>VERSION</strong></p>\n\n<p>This version <strong>v.2021-03</strong> brings the validated SPHERE families. Using the families from level III, containing 8 sequences or more, we calculated the alignments and trees in Newick format, which were included in this version too. No major changes were made in the other files besides the naming.</p>\n\n<p>&nbsp;</p>\n\n<p><em>Files included:</em></p>\n\n<p>AMPsphere_v.2021-03_families_tree_nwk.tar</p>\n\n<p>AMPsphere_v.2021-03_families_alignment.tar</p>\n\n<p>SPHERE_v.2021-03_levels.assessment.tsv</p>\n\n<p>&nbsp;</p>\n\n<p><em>Files changed:</em></p>\n\n<p>AMPsphere_v.2021-03.faa&nbsp; -- Included SPHERE lv.III information</p>\n\n<p>README -- update the information about the new files</p>\n\n<p>&nbsp;</p>\n\n<p><em>Files eliminated:</em></p>\n\n<p>No files were eliminated from this version when compared to the previous.</p>\n\n<p>&nbsp;</p>\n\n<p>&nbsp;</p>\n\n<p><strong>FILES</strong></p>\n\n<p><em>README.md</em><br>\nThis file.</p>\n\n<p>&nbsp;</p>\n\n<p><em>AMPSphere_v.2021-03.fna</em><br>\nMulti-fasta with AMPSphere gene sequences (nucleotide).</p>\n\n<p>&nbsp;</p>\n\n<p><em>AMPSphere_v.2021-03.faa</em><br>\nMulti-fasta with AMPSphere peptide sequences (amino acid).</p>\n\n<p>&nbsp;</p>\n\n<p><em>AMPSphere_v.2021-03.features.tsv</em><br>\nTable relating AMP name, and the features used for its prediction.<br>\nColumns:</p>\n\n<ol>\n\t<li>AMP accession</li>\n\t<li>tinyAA</li>\n\t<li>smallAA</li>\n\t<li>aliphaticAA</li>\n\t<li>aromaticAA</li>\n\t<li>nonpolarAA</li>\n\t<li>polarAA</li>\n\t<li>chargedAA</li>\n\t<li>basicAA</li>\n\t<li>acidicAA</li>\n\t<li>charge</li>\n\t<li>pI</li>\n\t<li>aindex</li>\n\t<li>instaindex</li>\n\t<li>boman</li>\n\t<li>hydrophobicity</li>\n\t<li>hmoment</li>\n\t<li>SA.Group1.residue0</li>\n\t<li>SA.Group2.residue0</li>\n\t<li>SA.Group3.residue0</li>\n\t<li>HB.Group1.residue0</li>\n\t<li>HB.Group2.residue0</li>\n\t<li>HB.Group3.residue0</li>\n</ol>\n\n<p>For more details about these features see the <a href=\"https://doi.org/10.7717/peerj.10555\">Macrel manuscript</a></p>\n\n<p><br>\n<em>AMPSphere_v.2021-03.origin_samples.tsv</em><br>\nTSV table relating AMP accession, sequence, and their origins in terms of prokaryotic genome or metagenome sample.</p>\n\n<p>Columns:</p>\n\n<ol>\n\t<li>AMP accession</li>\n\t<li>GMSC accession&nbsp; (comma separated list)</li>\n\t<li>metagenome samples&nbsp; (comma separated list)</li>\n\t<li>proGenomes2 genomes&nbsp; (comma separated list)</li>\n</ol>\n\n<p>&nbsp;</p>\n\n<p><em>AMPSphere_v.2021-03.species.tsv</em><br>\nTSV table relating AMP name, sequence, and the species from which they were detected.</p>\n\n<p>Columns:</p>\n\n<ol>\n\t<li>AMP accession</li>\n\t<li>proGenomes2 genomes</li>\n\t<li>SpecI cluster</li>\n</ol>\n\n<p>Note that AMPSphere was generated from the complete <a href=\"http://progenomes.embl.de/\">proGenomes v2</a> database.</p>\n\n<p>However, after the initial release, many genomes were removed due to quality-control issues, leading to version 2.2 used for constructing this table.</p>\n\n<p>&nbsp;</p>\n\n<p><em>DRAMP_anno_AMPSphere_v.2021-03.parsed.tsv</em><br>\nTSV table relating AMP name as query and the hits obtained with Blast against DRAMP database. The format is blast `outfmt6`.</p>\n\n<p>Columns:</p>\n\n<ol>\n\t<li>query</li>\n\t<li>target</li>\n\t<li>identity</li>\n\t<li>alignment length</li>\n\t<li>misalignment</li>\n\t<li>gaps</li>\n\t<li>query start</li>\n\t<li>query end</li>\n\t<li>target start</li>\n\t<li>target end</li>\n\t<li>e-value</li>\n\t<li>score</li>\n\t<li>target annotation</li>\n\t<li>target function</li>\n\t<li>target biochemical targets</li>\n\t<li>target origin reference</li>\n</ol>\n\n<p>&nbsp;</p>\n\n<p><em>AMPSphere_v.2021-03.hosts.tsv</em><br>\nTSV table relating AMPs with the hosts of host-associated metagenomes via metadata.</p>\n\n<p>Columns:</p>\n\n<ol>\n\t<li>AMP accession</li>\n\t<li>host common name</li>\n\t<li>host scientific name</li>\n\t<li>host NCBI taxid&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; &nbsp;</li>\n\t<li>counts</li>\n</ol>\n\n<p>Column 5 (counts) measures are in the number of identical variants of a given peptide assigned to a common host.<br>\n&nbsp; &nbsp;<br>\n&nbsp;</p>\n\n<p><em>AMPSphere_v.2021-03.locations.tsv</em><br>\nTSV table relating AMP name and their geographic location from metadata annotation of metagenome samples.</p>\n\n<p>Columns:</p>\n\n<ol>\n\t<li>AMP accession,</li>\n\t<li>geographic location,</li>\n\t<li>copies</li>\n</ol>\n\n<p>Geographic location refers to the locale where the gene was found through metagenomics. It was assigned as a broad location such as country, ocean, continent (e.g. US, Atlantic Ocean, Arctic, Australia).<br>\n&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; &nbsp;<br>\nCounts are the number of identical variants of a given peptide assigned to a common location.</p>\n\n<p>&nbsp;</p>\n\n<p><em>AMPSphere_v.2021-03.microontology.tsv</em><br>\nTable relating AMP name and their habitat of origin.</p>\n\n<p>Columns:</p>\n\n<ol>\n\t<li>AMP accession,</li>\n\t<li>microontology,</li>\n\t<li>counts</li>\n</ol>\n\n<p>Microontology is a scheme used to annotate environments, it has different levels of complexity separated by &#39;:&#39;<br>\n&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; &nbsp;<br>\nCounts measures are in the number of identical variants of a given peptide assigned to a common habitat.</p>\n\n<p>&nbsp;</p>\n\n<p>&nbsp;</p>\n\n<p><em>SPHERE_v.2021-03_levels.assessment.tsv</em><br>\nTSV table relating AMP name, sequence, and the hierarchically obtained clusters per level.</p>\n\n<p>Columns:</p>\n\n<ol>\n\t<li>AMP accession</li>\n\t<li>evaluation vs. representative</li>\n\t<li>SPHERE_fam level I</li>\n\t<li>SPHERE_fam level II</li>\n\t<li>SPHERE_fam level III</li>\n</ol>\n\n<p>Levels of each SPHERE family:</p>\n\n<p>I: contains clusters obtained with 100% of identity cut-off and 90% of overlap of the shorter sequence;</p>\n\n<p>II: contains clusters obtained with the unclustered sequences and the representatives from level I at 85% of identity and 90% of overlap of the shorter sequence;</p>\n\n<p>III: contains clusters obtained with the unclustered sequences and the representatives from level II at 75% of identity and 90% of overlap of the shorter sequence;</p>\n\n<p>`evaluation vs. representatives` shows the percent of identity the sequence has in an alignment against the cluster representative, and also the overlap in percent.</p>\n\n<p>Example:</p>\n\n<p>* -- This means: this sequence is the cluster representative.</p>\n\n<p>OR something like this:</p>\n\n<p>77.50%,1:40:1:40 -- This means: alignment identity against the representative of the cluster equals 77.5% and the alignment start and end position for the query (1 and 40, respectively), and target (1 and 40, respectively).</p>\n\n<p>&nbsp;</p>\n\n<p>&nbsp;</p>\n\n<p><em>AMPsphere_v.2021-03_families_alignment.tar</em></p>\n\n<p>Compressed folder (`.tar.gz`) containing peptide alignments in FASTA format used for phylogenetic reconstruction. These alignments were built with Muscle for all SPHERE families larger than 7 sequences.</p>\n\n<p>For more information, see the <a href=\"https://github.com/BigDataBiology/AMPsphere\">AMPsphere code</a>, where you can find all parameters used in this step.</p>\n\n<p>&nbsp;</p>\n\n<p>&nbsp;</p>\n\n<p><em>AMPsphere_v.2021-03_families_tree_nwk.tar</em><br>\nCompressed folder (`.tar.gz`) containing phylogenetic trees in Newick (`.nwk`) calculated from the peptide alignments built with Muscle per SPHERE family larger than 7 sequences.<br>\n&nbsp;&nbsp; &nbsp;<br>\nTrees were built using FastTree with 1000 bootstraps as node support.</p>\n\n<p>For more information, see the <a href=\"https://github.com/BigDataBiology/AMPsphere\">AMPsphere code</a>, where you can find all parameters used in this step.</p>\n\n<p>&nbsp;</p>\n\n<p>&nbsp;</p>\n\n<p><strong>CONTACT</strong></p>\n\n<p>You can contact us via our <a href=\"https://groups.google.com/g/ampsphere-users\">discussion group</a>.</p>\n\n<p>AMPSphere main developers:</p>\n\n<p>- <a href=\"mailto:celio@big-data-biology.com?subject=AMPSphere%20dataset%20--%20Zenodo%20communication&amp;body=Dear%20Celio%2C%20%0A%0AThis%20message%20was%20composed%20from%20Zenodo%20repository%20containing%20AMPSphere%20dataset%3A%0A%0Aversion%20%3D%3D%20_______%0Auser%20%3D%3D%20_________%0A%0AMessage%3A%0A%0A\">C&eacute;lio Dias Santos J&uacute;nior</a><br>\n- <a href=\"mailto:yiqian@big-data-biology.org?subject=AMPSphere%20dataset%20--%20Zenodo%20communication&amp;body=Dear%20Yiqian%2C%20%0A%0AThis%20message%20was%20composed%20from%20Zenodo%20repository%20containing%20AMPSphere%20dataset%3A%0A%0Aversion%20%3D%3D%20_______%0Auser%20%3D%3D%20_________%0A%0AMessage%3A%0A%0A\">Yiqian Duan</a><br>\n- <a href=\"mailto:luispedro@big-data-biology.com?subject=AMPSphere%20dataset%20--%20Zenodo%20communication&amp;body=Dear%20Luis%2C%20%0A%0AThis%20message%20was%20composed%20from%20Zenodo%20repository%20containing%20AMPSphere%20dataset%3A%0A%0Aversion%20%3D%3D%20_______%0Auser%20%3D%3D%20_________%0A%0AMessage%3A%0A%0A\">Luis Pedro Coelho</a></p>\n\n<p>&nbsp;</p>\n\n<p><strong>COPYRIGHT NOTICE</strong></p>\n\n<p><em>AMPSphere - the worldwide survey of prokaryotic antimicrobial peptides.</em></p>\n\n<p>This work is a joint effort of the Big Data Biology group from the Institute of<br>\nScience and Technology for Brain-Inspired Intelligence (ISTBI) - Fudan<br>\nUniversity, Shanghai, China, and the Structural and Computational Biology Unit<br>\n(Heidelberg) - European Molecular Biology Laboratory (EMBL).</p>\n\n<p>Copyright (C) 2019-2021 The Authors</p>\n\n<p>&nbsp;&nbsp; AMPSphere IS PROVIDED &quot;AS IS&quot;, WITHOUT WARRANTY OF ANY KIND,<br>\n&nbsp;&nbsp; EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES<br>\n&nbsp;&nbsp; OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT.<br>\n&nbsp;&nbsp; IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM,<br>\n&nbsp;&nbsp; DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR<br>\n&nbsp;&nbsp; OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE<br>\n&nbsp;&nbsp; USE OR OTHER DEALINGS IN THE SOFTWARE.</p>\n\n<p>&nbsp;&nbsp; This database is free; you can redistribute it and/or modify it<br>\n&nbsp;&nbsp; as you wish, under the terms of the CC BY 4.0 license.</p>\n\n<p>&nbsp;&nbsp; You are allowed to:</p>\n\n<p>&nbsp;&nbsp; Share &mdash; copy and redistribute the material in any medium or format</p>\n\n<p>&nbsp;&nbsp; Adapt &mdash; remix, transform, and build upon the material for any purpose,<br>\n&nbsp; &nbsp; &nbsp; &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; even commercially.</p>\n\n<p>&nbsp;&nbsp; You may also obtain a copy of the CC BY 4.0 license <a href=\"https://creativecommons.org/licenses/by/4.0/\">here.</a></p>\n\n<p>&nbsp;</p>\n\n<p><strong>REFERENCES CITED</strong></p>\n\n<ol>\n\t<li><strong>Macrel:</strong> <em>Santos-J&uacute;nior CD, Pan S, Zhao X, Coelho LP. 2020. Macrel: antimicrobial peptide screening in genomes and metagenomes. PeerJ 8:e10555. </em><a href=\"https://doi.org/10.7717/peerj.10555\">https://doi.org/10.7717/peerj.10555</a></li>\n\t<li><strong>ProGenomes:</strong> <em>Mende DR, Letunic I, Maistrenko OM et al. 2020. proGenomes2: an improved database for accurate and consistent habitat, taxonomic and functional annotations of prokaryotic genomes.&nbsp; Nucleic Acids Research</em> <em>48(D1): D621&ndash;D625.</em> <a href=\"https://doi.org/10.1093/nar/gkz1002\">https://doi.org/10.1093/nar/gkz1002</a></li>\n\t<li><strong>DRAMP:</strong> <em>Kang X, Dong F, Shi C et al. 2019. DRAMP 2.0, an updated data repository of antimicrobial peptides. Sci Data 6, 148.</em> <a href=\"https://doi.org/10.1038/s41597-019-0154-y\">https://doi.org/10.1038/s41597-019-0154-y</a></li>\n</ol>", 
  "license": "https://creativecommons.org/licenses/by/4.0/legalcode", 
  "creator": [
    {
      "affiliation": "Institute of Science and Technology for Brain-Inspired Intelligence - ISTBI, Fudan University, Shanghai, China", 
      "@id": "https://orcid.org/0000-0002-1974-1736", 
      "@type": "Person", 
      "name": "Santos-J\u00fanior, C\u00e9lio Dias"
    }, 
    {
      "affiliation": "Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany", 
      "@id": "https://orcid.org/0000-0001-8587-4177", 
      "@type": "Person", 
      "name": "Schmidt, Thomas S.B."
    }, 
    {
      "affiliation": "Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany", 
      "@id": "https://orcid.org/0000-0002-0884-8124", 
      "@type": "Person", 
      "name": "Fullam, Anthony"
    }, 
    {
      "affiliation": "Institute of Science and Technology for Brain-Inspired Intelligence - ISTBI, Fudan University, Shanghai, China", 
      "@id": "https://orcid.org/0000-0002-8046-9831", 
      "@type": "Person", 
      "name": "Duan, Yiqian"
    }, 
    {
      "affiliation": "Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany", 
      "@id": "https://orcid.org/0000-0002-2627-833X", 
      "@type": "Person", 
      "name": "Bork, Peer"
    }, 
    {
      "affiliation": "Institute of Science and Technology for Brain-Inspired Intelligence - ISTBI, Fudan University, Shanghai, China", 
      "@id": "https://orcid.org/0000-0002-4531-3970", 
      "@type": "Person", 
      "name": "Zhao, Xing-Ming"
    }, 
    {
      "affiliation": "Institute of Science and Technology for Brain-Inspired Intelligence - ISTBI, Fudan University, Shanghai, China", 
      "@id": "https://orcid.org/0000-0002-9280-7885", 
      "@type": "Person", 
      "name": "Coelho, Luis Pedro"
    }
  ], 
  "url": "https://zenodo.org/record/4606582", 
  "datePublished": "2021-03-16", 
  "version": "v.2021-03", 
  "keywords": [
    "antimicrobial peptides", 
    "metagenomics", 
    "progenomes", 
    "AMPSphere", 
    "macrel"
  ], 
  "@context": "https://schema.org/", 
  "distribution": [
    {
      "contentUrl": "https://zenodo.org/api/files/6ef0d3ed-ce19-4f73-9d3a-d471f177be02/AMPSphere_v.2021-03.faa.gz", 
      "encodingFormat": "gz", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/6ef0d3ed-ce19-4f73-9d3a-d471f177be02/AMPSphere_v.2021-03_families_alignment.tar.gz", 
      "encodingFormat": "gz", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/6ef0d3ed-ce19-4f73-9d3a-d471f177be02/AMPSphere_v.2021-03_families_tree_nwk.tar.gz", 
      "encodingFormat": "gz", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/6ef0d3ed-ce19-4f73-9d3a-d471f177be02/AMPSphere_v.2021-03.fna.xz", 
      "encodingFormat": "xz", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/6ef0d3ed-ce19-4f73-9d3a-d471f177be02/AMPSphere_v.2021-03.hosts.tsv.gz", 
      "encodingFormat": "gz", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/6ef0d3ed-ce19-4f73-9d3a-d471f177be02/AMPSphere_v.2021-03.locations.tsv.gz", 
      "encodingFormat": "gz", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/6ef0d3ed-ce19-4f73-9d3a-d471f177be02/AMPSphere_v.2021-03.microontology.tsv.gz", 
      "encodingFormat": "gz", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/6ef0d3ed-ce19-4f73-9d3a-d471f177be02/AMPSphere_v.2021-03.origin_samples.tsv.gz", 
      "encodingFormat": "gz", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/6ef0d3ed-ce19-4f73-9d3a-d471f177be02/AMPSphere_v.2021-03.species.tsv.gz", 
      "encodingFormat": "gz", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/6ef0d3ed-ce19-4f73-9d3a-d471f177be02/DRAMP_anno_AMPSphere_v.2021-03.parsed.tsv.gz", 
      "encodingFormat": "gz", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/6ef0d3ed-ce19-4f73-9d3a-d471f177be02/README.md", 
      "encodingFormat": "md", 
      "@type": "DataDownload"
    }, 
    {
      "contentUrl": "https://zenodo.org/api/files/6ef0d3ed-ce19-4f73-9d3a-d471f177be02/SPHERE_v.2021-03.levels_assessment.tsv.gz", 
      "encodingFormat": "gz", 
      "@type": "DataDownload"
    }
  ], 
  "identifier": "https://doi.org/10.5281/zenodo.4606582", 
  "@id": "https://doi.org/10.5281/zenodo.4606582", 
  "@type": "Dataset", 
  "name": "AMPSphere : the worldwide survey of prokaryotic antimicrobial peptides"
}
236
47
views
downloads
All versions This version
Views 23629
Downloads 4714
Data volume 1.1 GB248.0 MB
Unique views 18524
Unique downloads 144

Share

Cite as