Published May 30, 2023 | Version v1
Dataset Open

Supplementary data for: Chromosome-level genome assembly and circadian gene repertoire of the Patagonia blennie Eleginops maclovinus

  • 1. University of Illinois at Urbana Champaign

Description

This dataset contains the genome assembly and associated annotation of the Patagonian Blennie (Eleginops maclovinus), the closest extant taxon to the Antarctic notothenioid radiation. In addition to the characterization of the E. maclovinus genome, the dataset includes a description of circadian rhythm orthologs for E. maclovinus, other notothenenioid taxa, and teleost outgroups, as well as a copy of the bioinformatic scripts used for the assembly, annotation, and other downstream analysis.

Notes

All assembly and annotation files are gzipped, but are otherwise standard bioinformatic formats (i.e., FASTA for genome assembly and coding/amino acid sequences, GTF for annotation, AGP for scaffolding). In addition, bioinformatic scripts for data generation and analysis are in Python (*.py) or Bash (*.sh, but might require the installation of additional, open-source software (e.g., wtdbg2, BRAKER)

See links for a description of the FASTA (http://www.ncbi.nlm.nih.gov/blast/fasta.shtml), and GTF (https://useast.ensembl.org/info/website/upload/gff.html), and AGP (https://www.ncbi.nlm.nih.gov/assembly/agp/AGP_Specification/) file format specifications.

File format Specification
File Suffix1 Description
*.fa Genome assembly in nucleotide FASTA format. 
*.agp Assembly structure in AGP format.
*.gtf Genome annotation in GTF format.
*.cds.fa Genomic sequence for all annotated protein-coding genes in nucleotide FASTA format.
*.protein.fa Protein sequence for all annotated protein-coding genes in amino acid FASTA format.

1Does not include the gzipped compression suffix (*.gz).

Funding provided by: National Science Foundation
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100000001
Award Number: 1645087

Funding provided by: National Science Foundation
Crossref Funder Registry ID: http://dx.doi.org/10.13039/100000001
Award Number: 11-42158

Files

README.md

Files (211.2 MB)

Name Size Download all
md5:ec7b2aaf551bf6ed75b04b89c6f4ab80
565.5 kB Download
md5:0159b4a64964c4a99d3afa120835a011
210.3 MB Download
md5:1dbebdd8fc8e141f403fcb57cd11ee9f
20.4 kB Preview Download
md5:06266e686855f4fe0d2cba98755ee604
299.7 kB Preview Download

Additional details

Related works

Is derived from
10.5281/zenodo.7829978 (DOI)