Poster Open Access

LMAS: Evaluating metagenomic de novo assembly methods through defined communities

Catarina Inês Mendes; Pedro Vila-Cerqueira; Yair Motro; Jacob Moran-Gilad; João André Carriço; Mário Ramirez


DataCite XML Export

<?xml version='1.0' encoding='utf-8'?>
<resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-4" xsi:schemaLocation="http://datacite.org/schema/kernel-4 http://schema.datacite.org/meta/kernel-4.1/metadata.xsd">
  <identifier identifierType="DOI">10.5281/zenodo.5578327</identifier>
  <creators>
    <creator>
      <creatorName>Catarina Inês Mendes</creatorName>
      <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0002-3090-7426</nameIdentifier>
      <affiliation>Instituto de Microbiologia, Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Lisboa, Portugal</affiliation>
    </creator>
    <creator>
      <creatorName>Pedro Vila-Cerqueira</creatorName>
      <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0002-6121-8906</nameIdentifier>
      <affiliation>Instituto de Microbiologia, Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Lisboa, Portugal</affiliation>
    </creator>
    <creator>
      <creatorName>Yair Motro</creatorName>
      <affiliation>Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer-Sheva, Israel</affiliation>
    </creator>
    <creator>
      <creatorName>Jacob Moran-Gilad</creatorName>
      <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0001-9134-050X</nameIdentifier>
      <affiliation>Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer-Sheva, Israel</affiliation>
    </creator>
    <creator>
      <creatorName>João André Carriço</creatorName>
      <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0002-5274-2722</nameIdentifier>
      <affiliation>Instituto de Microbiologia, Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Lisboa, Portugal</affiliation>
    </creator>
    <creator>
      <creatorName>Mário Ramirez</creatorName>
      <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0002-4084-6233</nameIdentifier>
      <affiliation>Instituto de Microbiologia, Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Lisboa, Portugal</affiliation>
    </creator>
  </creators>
  <titles>
    <title>LMAS: Evaluating metagenomic de novo assembly methods through defined communities</title>
  </titles>
  <publisher>Zenodo</publisher>
  <publicationYear>2021</publicationYear>
  <subjects>
    <subject>de novo assembly</subject>
    <subject>metagenomics</subject>
    <subject>benchmark</subject>
    <subject>quality assessment</subject>
    <subject>bioinformatics</subject>
    <subject>workflow</subject>
  </subjects>
  <dates>
    <date dateType="Issued">2021-10-21</date>
  </dates>
  <language>en</language>
  <resourceType resourceTypeGeneral="Text">Poster</resourceType>
  <alternateIdentifiers>
    <alternateIdentifier alternateIdentifierType="url">https://zenodo.org/record/5578327</alternateIdentifier>
  </alternateIdentifiers>
  <relatedIdentifiers>
    <relatedIdentifier relatedIdentifierType="DOI" relationType="IsVersionOf">10.5281/zenodo.5578326</relatedIdentifier>
  </relatedIdentifiers>
  <rightsList>
    <rights rightsURI="https://creativecommons.org/licenses/by/4.0/legalcode">Creative Commons Attribution 4.0 International</rights>
    <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights>
  </rightsList>
  <descriptions>
    <description descriptionType="Abstract">&lt;p&gt;&lt;strong&gt;Background &lt;/strong&gt;Short-read shotgun metagenomics can offer comprehensive microbial detection and characterisation of complex clinical samples. The de novo assembly of this data into draft genomes is key in metagenomic analysis, yielding longer sequences that offer contextual information and afford a more complete picture of the microbial community. The assembly process represents a major bottleneck in obtaining trustworthy, reproducible results.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Methods &lt;/strong&gt;LMAS is an automated workflow developed as a flexible platform to evaluate the performance of traditional and metagenomic-dedicated prokaryotic de novo assembly software given known standard communities. Its implementation in Nextflow ensures the transparency and reproducibility of the results obtained, and the use of Docker containers provides further flexibility. The results are presented in an interactive HTML report where global and reference-specific performance metrics can be explored. Currently, 10 assemblers are implemented in LMAS, with the possibility for expansion as novel algorithms are developed.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Results &lt;/strong&gt;The eight bacterial genomes and four plasmids of the ZymoBIOMICS Microbial Community Standards were used as reference. Raw sequence data of the mock communities, with an even and logarithmic distribution of species, and matching simulated samples were used as input. The resulting LMAS report is available at &lt;a href="https://lmas-demo.herokuapp.com"&gt;https://lmas-demo.herokuapp.com&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Discussion &lt;/strong&gt;Overall, k-mer de Bruijn graph assemblers outperform the alternative approaches but come with a greater computational cost. Metagenomic-dedicated algorithms produce fewer misassembly errors than standard genomic assemblers. The performance of each assembler varied depending on the species of interest and its abundance in the sample, with less abundant species presenting a significant challenge for all assemblers. No assembler stood out as an undisputed all-purpose choice for short-read metagenomic prokaryote genome assembly, highlighting that efforts are still needed to further improve metagenomic assembly performance. Using LMAS could underpin this development process. The LMAS workflow is available at &lt;a href="https://github.com/cimendes/LMAS"&gt;https://github.com/cimendes/LMAS&lt;/a&gt;.&lt;/p&gt;</description>
    <description descriptionType="Other">Presented at ICCMg6</description>
  </descriptions>
</resource>
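
A DataCite export like the one above can be read programmatically. The following is a minimal sketch, not part of the original record, that uses Python's standard `xml.etree.ElementTree` to pull out the DOI, creators, and subjects. The `summarise` function name and the trimmed `SAMPLE` string (two creators and two subjects copied from the record) are assumptions made for illustration; only the element names and the `http://datacite.org/schema/kernel-4` namespace come from the export itself.

```python
import xml.etree.ElementTree as ET

# Default namespace declared on the <resource> root element of the export.
NS = {"dc": "http://datacite.org/schema/kernel-4"}

# Trimmed, illustrative copy of the record so the sketch is self-contained.
SAMPLE = """<resource xmlns="http://datacite.org/schema/kernel-4">
  <identifier identifierType="DOI">10.5281/zenodo.5578327</identifier>
  <creators>
    <creator>
      <creatorName>Catarina Inês Mendes</creatorName>
      <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="http://orcid.org/">0000-0002-3090-7426</nameIdentifier>
    </creator>
    <creator>
      <creatorName>Yair Motro</creatorName>
    </creator>
  </creators>
  <subjects>
    <subject>de novo assembly</subject>
    <subject>metagenomics</subject>
  </subjects>
</resource>"""


def summarise(xml_text):
    """Return the DOI, (name, ORCID) pairs, and subjects of a DataCite record."""
    root = ET.fromstring(xml_text)
    doi = root.findtext("dc:identifier[@identifierType='DOI']", namespaces=NS)
    creators = []
    for creator in root.findall("dc:creators/dc:creator", NS):
        name = creator.findtext("dc:creatorName", namespaces=NS)
        # Creators without an ORCID entry yield None here.
        orcid = creator.findtext(
            "dc:nameIdentifier[@nameIdentifierScheme='ORCID']", namespaces=NS
        )
        creators.append((name, orcid))
    subjects = [s.text for s in root.findall("dc:subjects/dc:subject", NS)]
    return {"doi": doi, "creators": creators, "subjects": subjects}


if __name__ == "__main__":
    info = summarise(SAMPLE)
    print(info["doi"])
    for name, orcid in info["creators"]:
        print(name, orcid or "(no ORCID listed)")
```

Every element lookup needs the namespace prefix because the export declares a default namespace on `<resource>`; an unprefixed path such as `creators/creator` would match nothing.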