(annotation names) MMETSP re-assemblies
Description
Corresponding .csv table of annotation names for all MMETSP (columns) by transcript ID (rows) in the Trinity assembly files (https://doi.org/10.5281/zenodo.251828), generated from the dammit pipeline (https://github.com/camillescott/dammit). One annotation name was chosen for each transcript ID by sorting by e-value (if < 1e-05) then choosing the best (lowest) e-value. Some transcripts were dropped because there was no 'Name' entry in the .gff or e-value < 1e-05.
The Marine Microbial Eukaryotic Transcriptome Sequencing Project (MMETSP) data set contains cultured samples of pelagic and endosymbiotic marine eukaryotic species representing more than 40 phyla (Keeling et al. 2014).
Methods for the de novo transcriptome assembly are described in the Eel pond khmer protocols (Brown et al. 2015).
Scripts available on github:
https://github.com/dib-lab/dib-MMETSP
References:
C. Titus Brown, Camille Scott, and Leigh Sheneman. 2015. The Eel Pond mRNAseq Protocol. https://khmer-protocols.readthedocs.io/en/ctb/mrnaseq/
Keeling et al. 2014. The Marine Microbial Eukaryote Transcriptome Sequencing Project (MMETSP): Illuminating the Functional Diversity of Eukaryotic Life in the Oceans through Transcriptome Sequencing. http://dx.doi.org/10.1371/journal.pbio.100188
Files
Files
(72.1 MB)
Name | Size | Download all |
---|---|---|
md5:cec0ff0925ec8445b6082993db8d4fc8
|
72.1 MB | Download |