Published July 1, 2025 | Version v2
Dataset Open

Gene annotations for wild alpine reindeer (Rangifer tarandus tarandus)

  • 1. ROR icon University of Oslo
  • 2. Streitlievegen 131, 2580 Folldal
  • 3. Norwegian Institute for Nature Research (NINA), P. O. Box 5685 Torgarden, NO-7485 Trondheim, Norway.
  • 4. Department of Bioinformatics and Genetics, Swedish Museum of Natural History, Stockholm, Sweden.
  • 5. ROR icon Science for Life Laboratory
  • 6. University of Oslo Centre for Ecological and Evolutionary Synthesis

Description

Here we provide the gene annotations for wild alpine reindeer (also called mountain reindeer; Rangifer tarandus tarandus). In addition, we provide the gene annotations for an updated genome assembly of Svalbard reindeer (Rangifer tarandus platyrhynchus). We provide these for both convenience and because some of the functional annotations of genes/proteins are removed when we prepare these for uploading to ENA. We also provide the FASTA files for the assemblies we have made.

We annotated the genome assemblies using a pre-release version of the EBP-Nor genome annotation pipeline (https://github.com/ebp-nor/GenomeAnnotation). First, AGAT (https://zenodo.org/record/7255559) agat_sp_keep_longest_isoform.pl and agat_sp_extract_sequences.pl were used on the GRCh38 genome assembly and annotation to generate one protein (the longest isoform) per gene. Miniprot (Li, 2023) was used to align the proteins to the curated assemblies. UniProtKB/Swiss-Prot (Consortium et al., 2023) release 2024_04 in addition to the Vertebrata part of OrthoDB v11 (Kuznetsov et al., 2022) were also aligned separately to the assemblies. Red (Girgis, 2015) was run via redmask (https://github.com/nextgenusfs/redmask) on the assemblies to mask repetitive areas. GALBA (Brůna et al., 2023; Buchfink et al., 2015; Hoff and Stanke, 2018; Li, 2023; Stanke et al., 2006) was run with the GRCh38 proteins using the miniprot mode on the masked assemblies. The funannotate-runEVM.py script from Funannotate was used to run EvidenceModeler (Haas et al., 2008) on the alignments of GRCh38 proteins, UniProtKB/Swiss-Prot proteins, Vertebrata proteins and the predicted genes from GALBA. The resulting predicted proteins were compared to the protein repeats that Funannotate distributes using DIAMOND blastp, and the predicted genes were filtered based on this comparison using AGAT. The filtered proteins were compared to the UniProtKB/Swiss-Prot release 2024_04 using DIAMOND (Buchfink et al., 2015) blastp to find gene names, and InterProScan was used to discover functional domains. AGATs agat_sp_manage_functional_annotation.pl was used to attach the gene names and functional annotations to the predicted genes.

List of files provided here and their description:

mRanTar1.2.hap1.fa.gz  - genome assembly of Svalbard reindeer (hap1)

mRanTar1.2.hap1.gff.gz - genome annotation of Svalbard reindeer (hap1)

mRanTar1.2.hap1.proteins.fa.gz - predicted proteins of Svalbard reindeer (hap1)

mRanTar1.2.hap2.fa.gz - genome assembly of Svalbard reindeer (hap2)

mRanTar1.2.hap2.gff.gz - genome annotation of Svalbard reindeer (hap2)

mRanTar1.2.hap2.proteins.fa.gz - predicted proteins of Svalbard reindeer (hap2)

mRanTar2.1.hap1.fa.gz - genome assembly of wild alpine reindeer (hap1)

mRanTar2.1.hap1.gff.gz - genome annotation of wild alpine reindeer (hap1)

mRanTar2.1.hap1.proteins.fa.gz - predicted proteins of wild alpine reindeer (hap1)

mRanTar2.1.hap2.fa.gz - genome assembly of wild alpine reindeer (hap2)

mRanTar2.1.hap2.gff.gz - genome annotation of wild alpine reindeer (hap2)

mRanTar2.1.hap2.proteins.fa.gz - predicted proteins of wild alpine reindeer (hap2)

Files

Files (3.3 GB)

Name Size Download all
md5:15fb281a797e47578c75fed43430bc9a
842.9 MB Download
md5:50ccf77929e01ff5c17ddf35e1bb472d
7.8 MB Download
md5:207ca3cbb835f238ed36b00266dad515
7.4 MB Download
md5:b23b48f387b77ae44803cf6c5951803b
786.0 MB Download
md5:fcd1fd93d82e7ac4793ec7a7a13aac75
7.8 MB Download
md5:af0bad99f0d2720bc677c3c69c2b0458
7.4 MB Download
md5:e0e8b10faabbc9920d3ff1f6191d3dcc
844.8 MB Download
md5:0a1ea6792258be43e1138f014f2eb88f
7.7 MB Download
md5:2d7dc0be1d8779b0e7915cdebd0a2895
7.2 MB Download
md5:fa22c038e5fb09e9031637270f9eab10
766.9 MB Download
md5:94d68fc3aceef16b189433cbfae6f6b6
8.4 MB Download
md5:9fbba6ba76de2ab7bccbc416e5034274
8.0 MB Download

Additional details

Funding

The Research Council of Norway
Earth Biogenome Project Norway 326819
Norwegian Environment Agency
HelRein