[VM] BIOCOM-PIPE: a new user-friendly metabarcoding pipeline for the characterization of microbial diversity from 16S, 18S and 23S rRNA gene amplicons
Description
Summary:
Installing the complete BIOCOM-PIPE pipeline can be difficult. To address this, we provide a virtual machine in which BIOCOM-PIPE is fully integrated into an Ubuntu 20.04 LTS system, together with the example dataset.
Background:
The ability to compare samples or studies easily using metabarcoding so as to better interpret microbial ecology results is an upcoming challenge. There exists a growing number of metabarcoding pipelines, each with its own benefits and limitations. However, very few have been developed to offer the opportunity to characterize various microbial communities (e.g., archaea, bacteria, fungi, photosynthetic microeukaryotes) with the same tool.
Results:
BIOCOM-PIPE is a flexible and independent suite of tools for processing data from high-throughput sequencing technologies (Roche 454 and Illumina platforms). It focuses on the diversity of archaeal, bacterial, fungal, and photosynthetic microeukaryote amplicons. Various original methods were implemented in BIOCOM-PIPE to (i) remove chimeras based on read abundance, (ii) align sequences with structure-based alignments of RNA homologs using covariance models, and (iii) re-assign OTUs against a reference OTU database with a post-clustering tool (ReClustOR). A comparison with two other pipelines (FROGS and mothur) showed that BIOCOM-PIPE was better at discriminating land use groups.
Conclusions:
The BIOCOM-PIPE pipeline makes it possible to analyze 16S, 18S and 23S rRNA genes with a single tool. Its innovative approach defines a biological database from previously analyzed samples and then post-clusters reads against this reference database using open-reference clustering, which makes it easier to compare projects from different sequencing runs. Advanced users can easily add or modify the components, the databases and the bioinformatics tools.
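To illustrate the general idea of open-reference clustering mentioned above, here is a toy sketch in Python. It is not ReClustOR's algorithm or code: the function names, the position-by-position identity measure on equal-length sequences, and the 0.85 threshold are all assumptions chosen for illustration. Reads that match a reference OTU closely enough are assigned to it, and reads that match nothing seed a new OTU.

```python
def identity(a, b):
    """Fraction of matching positions between two equal-length sequences.

    Hypothetical similarity measure for this sketch; real pipelines
    compare aligned sequences rather than raw strings.
    """
    if len(a) != len(b) or not a:
        return 0.0
    return sum(x == y for x, y in zip(a, b)) / len(a)


def open_reference_cluster(reads, reference_otus, threshold=0.85):
    """Assign each read to the best-matching OTU, or seed a new OTU.

    reads: dict mapping read id -> sequence
    reference_otus: dict mapping OTU name -> representative sequence
    Returns (assignments, otus), where otus includes any new OTUs.
    """
    otus = dict(reference_otus)  # copy so the caller's reference is untouched
    assignments = {}
    new_count = 0
    for read_id, seq in reads.items():
        best_name, best_score = None, 0.0
        for name, representative in otus.items():
            score = identity(seq, representative)
            if score > best_score:
                best_name, best_score = name, score
        if best_name is not None and best_score >= threshold:
            # Closed-reference step: the read joins an existing OTU.
            assignments[read_id] = best_name
        else:
            # Open-reference step: the read becomes a new OTU seed.
            new_count += 1
            new_name = f"new_OTU_{new_count}"
            otus[new_name] = seq
            assignments[read_id] = new_name
    return assignments, otus


if __name__ == "__main__":
    reference = {"ref_OTU_1": "ACGTACGTAC"}
    reads = {"r1": "ACGTACGTAC", "r2": "ACGTACGTAA", "r3": "TTTTTTTTTT"}
    assignments, otus = open_reference_cluster(reads, reference)
    print(assignments)
```

Because every project is clustered against the same reference OTU names, reads from different sequencing runs end up with comparable labels, which is the property the paragraph above describes.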
Files
(46.9 GB)
| Name | Size |
|---|---|
| md5:78040f790cdec087865779cb1dd4b818 | 46.9 GB |
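With a 46.9 GB download, it is worth checking the file against the MD5 checksum listed above before importing the virtual machine. The following is a minimal sketch using Python's standard `hashlib` module. The file name `biocom-pipe-vm.ova` is a hypothetical placeholder, since the actual file name is not shown in this record.

```python
import hashlib

# Checksum listed in the Files table of this record.
EXPECTED_MD5 = "78040f790cdec087865779cb1dd4b818"


def md5_of_file(path, chunk_size=1024 * 1024):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks
    so that a multi-gigabyte image never has to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Hypothetical file name: replace with the path of the downloaded file.
    vm_image = "biocom-pipe-vm.ova"
    try:
        ok = verify_download(vm_image)
        print("checksum OK" if ok else "checksum MISMATCH, download again")
    except FileNotFoundError:
        print(f"file not found: {vm_image}")
```

A mismatch usually means the download was truncated or corrupted, in which case the file should be fetched again before use.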