Published November 13, 2022 | Version 1
Dataset Open

Genetic Features of the Marine Polychaete Sirsoe methanicola from Metagenomic Data

Description

WebAUGUSTUS input and output data used for eukaryotic gene prediction in Sirsoe methanicola:

WebAUGUSTUS input data:

  1. capitella.fa - Nucleotide genomic sequence of Capitella teleta (NCBI accession: GCA_000328365.1)
  2. capitella-protein.faa - Protein sequences in the Capitella teleta genome (NCBI accession: GCA_000328365.1)
  3. big-contigs-wrapped.fa - Contigs >= 3,000 bp long assembled from the S. methanicola metagenomes (NCBI BioProject ID PRJNA689840) that did not bin into any bacterial MAGs
  4. small-contigs-wrapped.fa - Contigs < 3,000 bp long assembled from the S. methanicola metagenomes (NCBI BioProject ID PRJNA689840) that did not bin into any bacterial MAGs
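
The 3,000 bp cutoff that separates the two contig files can be illustrated with a minimal sketch. This is not the authors' pipeline; the function names `read_fasta` and `split_by_length` are hypothetical, and the sketch only assumes plain FASTA input whose sequences may be wrapped over several lines.

```python
from typing import Dict, Iterable, Iterator, Tuple


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from FASTA-formatted lines."""
    header = None
    chunks = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            # Wrapped sequence lines are joined back into one string.
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def split_by_length(
    records: Iterable[Tuple[str, str]], threshold: int = 3000
) -> Tuple[Dict[str, str], Dict[str, str]]:
    """Separate contigs into >= threshold bp and < threshold bp groups."""
    big: Dict[str, str] = {}
    small: Dict[str, str] = {}
    for name, seq in records:
        target = big if len(seq) >= threshold else small
        target[name] = seq
    return big, small
```

For a real file, the lines can be supplied by an open file handle, for example `split_by_length(read_fasta(open("contigs.fa")))`, where `contigs.fa` is a placeholder name.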

WebAUGUSTUS output data:

  1. augustus.bigcontigs.gff - WebAUGUSTUS GFF output file for contigs >= 3,000 bp
  2. augustus.smallcontigs.gff - WebAUGUSTUS GFF output file for contigs < 3,000 bp
  3. augustus.all.faa - All protein sequences predicted from the S. methanicola metagenomes using WebAUGUSTUS
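
A quick way to inspect the GFF output files is to tally the feature types they contain. The sketch below is an illustration only: it assumes the usual nine-column, tab-separated GFF layout with the feature type in the third column and comment lines starting with `#`, and the function name `count_features` is hypothetical.

```python
from collections import Counter
from typing import Iterable


def count_features(lines: Iterable[str]) -> Counter:
    """Count GFF records per feature type (third column)."""
    counts: Counter = Counter()
    for raw in lines:
        line = raw.rstrip("\n")
        # Skip blank lines and comment lines.
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        # Ignore anything that is not a full nine-column record.
        if len(fields) < 9:
            continue
        counts[fields[2]] += 1
    return counts
```

Passing an open handle on `augustus.bigcontigs.gff` or `augustus.smallcontigs.gff` to `count_features` returns a counter keyed by feature type, from which the number of `gene` records can be read directly.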

Files (1.9 GB)

Checksum                                Size
md5:8a00229f16ef50c87757c856e54513cb    19.8 MB
md5:9adb36152c6cdc8c25453c801ba88e7e    117.5 MB
md5:d53d7b41bacd620d1c2fd16e9f9eca2d    35.3 MB
md5:761a9cccad54a7a3fae61fc699764bbb    1.0 GB
md5:51b258b122db160170b13581b6ba1360    11.8 MB
md5:051ed4b873292e22f8e96d4f08a6f6f5    337.9 MB
md5:07f39a61ca165c7bd60a9ab412d6b627    376.7 MB
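
The MD5 checksums listed above can be used to confirm that a download is intact. The following is a minimal sketch using Python's standard `hashlib` module; the helper names `md5_of_stream`, `matches_listing`, and `file_matches` are hypothetical, and which checksum belongs to which file has to be taken from the record itself.

```python
import hashlib
from typing import BinaryIO


def md5_of_stream(stream: BinaryIO, chunk_size: int = 1 << 20) -> str:
    """Return the hexadecimal MD5 digest of a binary stream, read in chunks."""
    digest = hashlib.md5()
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


def matches_listing(stream: BinaryIO, listed: str) -> bool:
    """Compare a stream's MD5 with a listed value such as 'md5:8a00...'."""
    # Drop an optional 'md5:' prefix and normalise case before comparing.
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_stream(stream) == expected


def file_matches(path: str, listed: str) -> bool:
    """Open a downloaded file and check it against a listed checksum."""
    with open(path, "rb") as handle:
        return matches_listing(handle, listed)
```

Reading in chunks keeps memory use low for the larger files in this record. A call such as `file_matches("downloaded_file", "md5:...")` uses placeholder arguments that should be replaced with a local path and the matching checksum from the list.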