Published February 19, 2017 | Version v1
Dataset Open

(annotation names) MMETSP re-assemblies

  • 1. University of California, Davis

Description

A .csv table of annotation names for all MMETSP samples (columns) by transcript ID (rows), corresponding to the Trinity assembly files (https://doi.org/10.5281/zenodo.251828). Annotations were generated with the dammit pipeline (https://github.com/camillescott/dammit). One annotation name was chosen for each transcript ID by keeping only hits with an e-value < 1e-05, sorting them by e-value, and choosing the best (lowest) e-value. Some transcripts were dropped because they had no 'Name' entry in the .gff or no hit with an e-value < 1e-05.
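The selection rule described above can be sketched in a few lines of Python. This is only an illustration of the stated rule, not the script used to build the table (those are in the linked repository). The `Hit` record, its field names, and the transcript IDs and annotation names in the example are hypothetical, and parsing the .gff itself is left out.

```python
from typing import Dict, Iterable, NamedTuple, Optional


class Hit(NamedTuple):
    """One annotation hit for a transcript (hypothetical record layout)."""
    transcript_id: str
    name: Optional[str]  # value of the 'Name' entry in the .gff, None if absent
    evalue: float


EVALUE_CUTOFF = 1e-05  # threshold stated in the description


def best_annotation_per_transcript(
    hits: Iterable[Hit], cutoff: float = EVALUE_CUTOFF
) -> Dict[str, str]:
    """Return {transcript_id: name} using the lowest e-value below the cutoff."""
    best: Dict[str, Hit] = {}
    for hit in hits:
        # Skip hits with no 'Name' entry or an e-value at or above the cutoff.
        if hit.name is None or hit.evalue >= cutoff:
            continue
        current = best.get(hit.transcript_id)
        # Keep the hit with the lowest e-value seen so far for this transcript.
        if current is None or hit.evalue < current.evalue:
            best[hit.transcript_id] = hit
    return {tid: hit.name for tid, hit in best.items()}


# Made-up hits for illustration only.
example_hits = [
    Hit("t1", "geneA", 1e-30),
    Hit("t1", "geneB", 1e-50),  # lower e-value, so this one is chosen for t1
    Hit("t2", "geneC", 1e-03),  # dropped: e-value not below the cutoff
    Hit("t3", None, 1e-40),     # dropped: no 'Name' entry
    Hit("t4", "geneD", 1e-06),
]
chosen = best_annotation_per_transcript(example_hits)
```

In this sketch, transcripts `t2` and `t3` do not appear in `chosen`, which mirrors how transcripts without a usable hit are absent from the table.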

The Marine Microbial Eukaryotic Transcriptome Sequencing Project (MMETSP) data set contains cultured samples of pelagic and endosymbiotic marine eukaryotic species representing more than 40 phyla (Keeling et al. 2014).

Methods for the de novo transcriptome assembly are described in the Eel Pond khmer protocols (Brown et al. 2015).

Scripts are available on GitHub:

https://github.com/dib-lab/dib-MMETSP

References:

C. Titus Brown, Camille Scott, and Leigh Sheneman. 2015. The Eel Pond mRNAseq Protocol. https://khmer-protocols.readthedocs.io/en/ctb/mrnaseq/

Keeling et al. 2014. The Marine Microbial Eukaryote Transcriptome Sequencing Project (MMETSP): Illuminating the Functional Diversity of Eukaryotic Life in the Oceans through Transcriptome Sequencing. http://dx.doi.org/10.1371/journal.pbio.100188

Files

Files (72.1 MB)

md5:cec0ff0925ec8445b6082993db8d4fc8 (72.1 MB)
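After downloading, the file can be checked against the md5 checksum listed above. The following is a minimal sketch using Python's standard `hashlib` module. The function name `md5_of_file` is an illustrative choice, and the local file path must be supplied by the user because the file name is not given in this record.

```python
import hashlib

# Checksum as listed in the Files section of this record.
EXPECTED_MD5 = "cec0ff0925ec8445b6082993db8d4fc8"


def md5_of_file(path: str, chunk_size: int = 1024 * 1024) -> str:
    """Compute the md5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so a large file is never fully held in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# Usage, with the path replaced by wherever the download was saved:
#   md5_of_file("path/to/downloaded_file") == EXPECTED_MD5
```

A mismatch usually indicates an incomplete or corrupted download.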