Gene annotations for greater hornwrack bryozoan (Flustra foliacea) and repeat annotations for selected species
Description
Here we provide the gene annotations for greater hornwrack bryozoan (Flustra foliacea). We provide these for both convenience and because some of the functional annotations of genes/proteins are removed when we prepare these for uploading to ENA. We also provide the FASTA files for the assemblies we have made.
We annotated the genome assemblies using a pre-release version of the EBP-Nor genome annotation pipeline (https://github.com/ebp-nor/GenomeAnnotation). Predicted proteins from Bugulina stolonifera were downloaded from https://datadryad.org/dataset/doi:10.5061/dryad.76hdr7t3f and miniprot (Li 2023) was used to align the proteins to the curated assemblies. UniProtKB/Swiss-Prot (UniProt Consortium 2023) release 2023_03 in addition to the metazoa part of OrthoDB v11 (Kuznetsov et al. 2023) were also aligned separately to the assemblies. Red (Girgis 2015) was run via redmask (https://github.com/nextgenusfs/redmask) on the assemblies to mask repetitive areas. GALBA (Brůna et al. 2023; Buchfink, Xie, and Huson 2015; Hoff and Stanke 2019; Li 2023; Stanke et al. 2006) was run with the B. stolonifera proteins using the miniprot mode on the masked assemblies. The funannotate-runEVM.py script from Funannotate was used to run EvidenceModeler (Haas et al. 2008) on the alignments of GRCh38 proteins, UniProtKB/Swiss-Prot proteins, vertebrata proteins and the predicted genes from GALBA. The resulting predicted proteins were compared to the protein repeats that Funannotate distributes using DIAMOND blastp and the predicted genes were filtered based on this comparison using AGAT. The filtered proteins were compared to the UniProtKB/Swiss-Prot release 2023_03 using DIAMOND (Buchfink, Xie, and Huson 2015) blastp to find gene names and InterProScan (Jones et al. 2014) was used to discover functional domains. AGATs agat_sp_manage_functional_annotation.pl was used to attach the gene names and functional annotations to the predicted genes.
We also ran EarlGrey (Baril, Galbraith, and Hayward 2024; https://github.com/TobyBaril/EarlGrey) on all species investigated and provide the summary files folder here. These species were included: Bugulina stolonifera (Bryozoa, Gymnolaemata, Cheilostomatida; GCA_935421135.1), Cristatella mucedo (Bryozoa, Phylactolaemata, Plumatellida; GSM5182733), Cryptosula pallasiana (Bryozoa, Gymnolaemata, Cheilostomatida, GCA_945261195.1), Membranipora membranacea (Bryozoa, Gymnolaemata, Cheilostomatida; GCA_914767715.1), and Waterispora subatra (Bryozoa, Gymnolaemata, Cheilostomatida; GCF_963576615.1). We also included two outgroup species, Pecten maximus (Mollusca; GCF_902652985.1) and Lineus longissimus (Nemertea; GCF_910592395.1).
List of files provided here and their description:
tzFluFoli1.1.hap1.fa.gz - genome assembly of greater hornwrack bryozoan (hap1)
tzFluFoli1.1.hap1.proteins.fa.gz - predicted proteins greater hornwrack bryozoan (hap1)
tzFluFoli1.1.hap1.gff.gz - genome annotation of greater hornwrack bryozoan (hap1)
tzFluFoli1.1.hap2.fa.gz - genome assembly of greater hornwrack bryozoan (hap2)
tzFluFoli1.1.hap2.proteins.fa.gz - predicted proteins greater hornwrack bryozoan (hap2)
tzFluFoli1.1.hap2.gff.gz - genome annotation of greater hornwrack bryozoan (hap2)
BugStol_summaryFiles.tar.gz - repeat annotation summary files from EarlGrey for Bugulina stolonifera
CryPall_summaryFiles.tar.gz - repeat annotation summary files from EarlGrey for Cryptosula pallasiana
CriMuce_summaryFiles.tar.gz - repeat annotation summary files from EarlGrey for Cristatella mucedo
FluFoli_summaryFiles.tar.gz - repeat annotation summary files from EarlGrey for Flustra foliacea (hap1)
LinLong_summaryFiles.tar.gz - repeat annotation summary files from EarlGrey for Lineus longissimus
MemMemb_summaryFiles.tar.gz - repeat annotation summary files from EarlGrey for Membranipora membranacea
PecMax_summaryFiles.tar.gz - repeat annotation summary files from EarlGrey for Pecten maximus
WatSuba_summaryFiles.tar.gz - repeat annotation summary files from EarlGrey for Watersipora subatra
Files
Files
(716.3 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:c29e1cea067c9c0a28068c78e400139b
|
9.4 MB | Download |
|
md5:7c9e55fc9c00dbb96d061b50b06754b5
|
40.4 MB | Download |
|
md5:e201bd462bae3f1de07593419d9f2b69
|
25.5 MB | Download |
|
md5:c62559cd0df0103ebc6bfd37c70f5355
|
44.2 MB | Download |
|
md5:9344dbb50b101b507500f502325e7b29
|
22.0 MB | Download |
|
md5:7ff8c5e9687d0dcdb380921540d764ae
|
17.5 MB | Download |
|
md5:a929a903a399e76f87e06ad437ed73f9
|
43.8 MB | Download |
|
md5:25216799707a37d4f0139430dcae523d
|
7.1 MB | Download |
|
md5:67f977a928a4bfd8031186dc9a601435
|
230.1 MB | Download |
|
md5:ac718dff47d8ce45bcff07cc65510917
|
4.5 MB | Download |
|
md5:9e6c6fc7fa72b5d8626458eafed3a895
|
4.6 MB | Download |
|
md5:e1f664da7e3cf3d463d815d3ca3239a2
|
7.1 MB | Download |
|
md5:5e7bc2afa71e8b05b2432ea23f0a6695
|
212.1 MB | Download |
|
md5:7c3a7a2d49a1729bfddf85fe90d70ed1
|
4.5 MB | Download |
|
md5:55da8c6d384e0c30dd56a61724817ae6
|
4.6 MB | Download |
|
md5:cd5251e54da764138506b0772745f2f2
|
39.0 MB | Download |