Published August 30, 2023
| Version 1.0.uniq
Dataset
Open
16S V4-V5 metabarcoding reference databases and weighted naive-bayes classifiers, dereplicated
Description
16S metabarcoding databases and naive-bayes classifiers specific to the V4-V5 region. Built from the Silva 138.1 SSU Ref NR 99 database using Qiime2 (version 2023.2) and the q2-clawback plugin. Includes weighted classifiers for two Earth Microbiome Project Ontology (EMPO) 3 habitat types: "sediment (saline)" and "water (saline)" , with data downloaded from Qiita. Sequences were dereplicated with Rescript --p-mode 'uniq' , retaining identical sequence records that have differing taxonomies.
Primers used:
EMP 16S 515f: GTGYCAGCMGCCGCGGTAA
EMP 16S 926r: CCGYCAATTYMTTTRAGTTT
Stats
286,948 unique sequences
309,567 total sequences
46,254 unique taxa (Level 7)
|
---|
Notes
Files
Silva 16S V4-V5 with weights, dereplicated.md
Files
(699.5 MB)
Name | Size | Download all |
---|---|---|
md5:c05a3f39f2797792163a785d5edc449b
|
4.4 kB | Preview Download |
md5:de8886bb2c059b1e8752255d271f3010
|
97.1 MB | Download |
md5:f12d5b78bf4b1519721fe52803581c3d
|
6.9 MB | Download |
md5:11162a28d5d70b81920ad1a41b03f304
|
162.3 MB | Download |
md5:1f583a0be1e498cb5e417f1cab30f166
|
21.7 MB | Download |
md5:87dcb9a926a9ba7a8614f204cde15076
|
37.9 MB | Download |
md5:a8cd81353196ada23fd1a14bd2312a90
|
27.6 MB | Download |
md5:f7b7f00f46e0cd361522c9ba2b5fac7c
|
162.3 MB | Download |
md5:caf45ac871ca592f675056b232ca03c0
|
21.6 MB | Download |
md5:aa00bccba0850f064eb8e8e0cd4e9cdd
|
162.1 MB | Download |
Additional details
References
- Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T, Yarza P, Peplies J, Glöckner FO (2013) The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucl. Acids Res. 41 (D1): D590-D596.
- Yilmaz P, Parfrey LW, Yarza P, Gerken J, Pruesse E, Quast C, Schweer T, Peplies J, Ludwig W, Glöckner FO (2014) The SILVA and "All-species Living Tree Project (LTP)" taxonomic frameworks. Nucl. Acids Res. 42:D643-D648
- Bokulich NA, Kaehler BD, Rideout JR, Dillon M, Bolyen E, Knight R, Huttley GA, Caporaso JG. 2018. Optimizing taxonomic classification of marker gene sequences. Microbiome 6(1): 90. doi: https://doi.org/10.1186/s40168-018-0470-z.
- Kaehler BD, Bokulich NA, McDonald D, Knight R, Caporaso JG, Huttley GA. 2019. Species-level microbial sequence classification is improved by source-environment information. Nature Communications 10: 4643. https://doi.org/10.1038/s41467-019-12669-6
- Robeson MS 2nd, O'Rourke DR, Kaehler BD, Ziemski M, Dillon MR, Foster JT, Bokulich NA. RESCRIPt: Reproducible sequence taxonomy reference database management. PLoS Comput Biol. 2021 Nov 8;17(11):e1009581. doi: 10.1371/journal.pcbi.1009581
- Gonzalez, A. et al. Qiita: rapid, web-enabled microbiome meta-analysis. Nat. methods 15, 796–798 (2018).