Dataset Open Access

SILVA v128 and v132 dada2 formatted 18s 'train sets'

Morien, Evan; Parfrey, Laura W.

Contact person(s)
Morien, Evan
Researcher(s)
Parfrey, Laura W.

These are species-level taxonomy classification training sets for the assignTaxonomy function from the dada2 R package.

The v132 training set includes every Eukaryotic organism from SILVA's v132 database, clustered at 99% similarity.

The v128 training set includes every Eukaryotic organism from SILVA's v128 database, clustered at 99% similarity. Additionally, it includes corrected species labels for the Blastocystis clade, and 37 Entamoeba sequences sourced from GenBank not present in the original v128 db. The v128 training set is modified specifically to allow for better species-level assignments for those two clades in mammalian gut microbiome studies.

Files (35.9 MB)
Name Size
silva_128.18s.99_rep_set.dada2.fa.gz
md5:8112bff028267a061e30cb379318a684
14.3 MB Download
silva_132.18s.99_rep_set.dada2.fa.gz
md5:9a3c977a8f9d427d5502ba2d4de553a8
21.5 MB Download
4,724
8,704
views
downloads
All versions This version
Views 4,7244,724
Downloads 8,7048,701
Data volume 183.8 GB183.8 GB
Unique views 4,1034,103
Unique downloads 8,1718,168

Share

Cite as