Published October 5, 2018 | Version 1.0
Dataset Open

SILVA v128 and v132 dada2 formatted 18s 'train sets'

  • 1. UBC Botany

Contributors

Contact person:

Researcher:

  • 1. UBC Botany

Description

These are species-level taxonomy classification training sets for the assignTaxonomy function from the dada2 R package.

The v132 training set includes every eukaryotic organism from SILVA's v132 database, clustered at 99% similarity.

The v128 training set includes every eukaryotic organism from SILVA's v128 database, clustered at 99% similarity. It also includes corrected species labels for the Blastocystis clade and 37 Entamoeba sequences, sourced from GenBank, that are not present in the original v128 database. These modifications are intended to improve species-level assignments for those two clades in mammalian gut microbiome studies.

Files

Files (35.9 MB)

Checksum                                Size
md5:8112bff028267a061e30cb379318a684    14.3 MB
md5:9a3c977a8f9d427d5502ba2d4de553a8    21.5 MB
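
The MD5 checksums listed above can be used to confirm that a download completed intact. The following is a minimal sketch in Python using only the standard library. The helper names `md5_of_file` and `matches_record` are illustrative and not part of this record or of dada2, and the record does not say which checksum belongs to which SILVA version, so the check only tests membership in the listed set.

```python
import hashlib

# MD5 digests as listed in the Files section of this record.
EXPECTED_MD5 = {
    "8112bff028267a061e30cb379318a684",  # listed with the 14.3 MB file
    "9a3c977a8f9d427d5502ba2d4de553a8",  # listed with the 21.5 MB file
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_record(path):
    """Return True if the file's MD5 equals one of the digests listed above."""
    return md5_of_file(path) in EXPECTED_MD5
```

Calling `matches_record` with the local path of a downloaded training set should return `True` if the file is intact; a `False` result suggests a truncated or altered download.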