Published June 1, 2017 | Version v1
Dataset Open

RDP taxonomic training data formatted for DADA2 (RDP trainset 16/release 11.5)

  • 1. NC State University

Description

These DADA2-formatted training FASTA files were derived from the Ribosomal Database Project's (RDP) Training Set 16 and release 11.5 of the RDP database.

The files were generated with the following commands (dada2 R package, version 1.5.1):

path <- "~/Desktop/RDP/RDPClassifier_16S_trainsetNo16_rawtrainingdata"
# Build the taxonomy training FASTA (":::" reaches a non-exported dada2 function)
dada2:::makeTaxonomyFasta_RDP(file.path(path, "trainset16_022016.fa"), file.path(path, "trainset16_db_taxid.txt"), "~/tax/rdp_train_set_16.fa.gz")

# Build the species-assignment FASTA from the unaligned RDP bacterial sequences
dada2:::makeSpeciesFasta_RDP("~/Desktop/RDP/current_Bacteria_unaligned.fa", "~/tax/rdp_species_assignment_16.fa.gz")
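
For orientation, the two output files named in the commands above are the kind of input that dada2's exported `assignTaxonomy()` and `addSpecies()` functions accept. The following is a minimal sketch, not part of the original record: the wrapper name `classify_with_rdp` is hypothetical, the paths in the usage comment are simply the output paths from the commands above, and `seqs` is assumed to be a character vector of sequences (or a DADA2 sequence table) supplied by the caller.

```r
# Hypothetical helper (not part of dada2): runs taxonomy assignment with the
# training FASTA, then adds species-level matches from the species FASTA.
classify_with_rdp <- function(seqs, train_fasta, species_fasta, min_boot = 50) {
  # Taxonomic assignment against the training set
  taxa <- dada2::assignTaxonomy(seqs, train_fasta, minBoot = min_boot)
  # Species assignment by exact matching against the species references
  dada2::addSpecies(taxa, species_fasta)
}

# Usage (assumes dada2 is installed and the files exist at these locations):
# taxa <- classify_with_rdp(
#   seqs,
#   "~/tax/rdp_train_set_16.fa.gz",
#   "~/tax/rdp_species_assignment_16.fa.gz"
# )
```

The default of 50 for `min_boot` mirrors the default of `minBoot` in `assignTaxonomy()`; whether it suits a given dataset is left to the user.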

Files

Files (14.7 MB)

MD5 checksum                             Size
md5:d68d4980326be10c58aaaa74cc6cdb6e     10.9 MB
md5:cac51b436f1679fefc9a1db1d3b24686     3.8 MB
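
The MD5 values listed for the files can be used to check a download for corruption. Below is a minimal sketch using base R's `tools::md5sum()`; the helper name `verify_md5` is hypothetical, and because the listing here does not pair each checksum with a file name, the caller is assumed to supply the correct pairing. The demonstration runs on a temporary file rather than on the record's files.

```r
# Hypothetical helper: TRUE if the file's MD5 digest equals the expected value.
verify_md5 <- function(path, expected) {
  # Accept either "md5:<hex>" (as listed on the record) or a bare hex digest
  expected <- sub("^md5:", "", tolower(expected))
  actual <- unname(tools::md5sum(path.expand(path)))
  identical(actual, expected)
}

# Self-contained demonstration on a temporary file
tmp <- tempfile()
writeLines("ACGT", tmp)
demo_digest <- unname(tools::md5sum(tmp))
stopifnot(verify_md5(tmp, paste0("md5:", demo_digest)))
```

For a real download, pass the local path of the file together with the checksum shown next to it on the record page.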