Genome and repeat annotation of the phased telomere-to-telomere assembly of Moroccan argane tree (Argania spinosa)
Authors/Creators
- 1. Biotechnology Unit, Regional Center of Agronomic Research of Rabat, National Institute of Agricultural Research, Rabat 10090, Morocco
- 2. Department of Biology, Faculty of Science, Mohammed V University in Rabat, Rabat, Morocco
- 3. International Center for Biosaline Agriculture (ICBA), Dubai, UAE
- 4. Regional Center of Agricultural Research of Errachidia, National Institute of Agricultural Research, Errachidia, Morocco
- 5. Center for Biotechnology and Genomics, Texas Tech University, Lubbock, TX, United States
- 6. International Center for Agricultural Research in the Dry Areas (ICARDA), Rabat 10100, Morocco
Description
This dataset provides the full genome annotation supporting the publication:
"Phased T2T reference genome assembly of Moroccan Argane (Argania spinosa)"
Hanane El Idrissi, Anestis Gkanogiannis, Driss Iraqi, Siham Khoulassa, Mohamed Fokar, Bouabid Badaoui, Rachid Moussadek, Rachid Mentag and Slimane Khayi (2025)
We present the genome annotation files for the phased, telomere-to-telomere (T2T), chromosome-scale genome assembly of Argania spinosa, an ecologically and economically important tree endemic to Morocco. The assembly comprises two fully phased haplotypes, each organized into 11 pseudochromosomes.
This Zenodo entry includes:
-
Structural gene annotation (GFF3) generated using the Funannotate pipeline
-
Predicted protein sequences (FASTA)
-
Repeat annotations:
-
GFF3 files for simple, complex, and combined repeat annotations
-
Soft-masked genome FASTA (simple and complex repeats masked in lowercase)
-
Hard-masked genome FASTA (repeats replaced with Ns)
-
Gene prediction was performed using transcript evidence (RNA-Seq from root and leaf tissues), protein homology from Ericales and SwissProt, and de novo ab initio models (AUGUSTUS, GeneMark-ES). Functional annotations were assigned using InterProScan, eggNOG, Pfam, and Gene Ontology databases. Repeat annotation was performed using RepeatModeler and RepeatMasker in a multi-round strategy incorporating both lineage-specific and de novo repeat libraries.
NCBI BioProject: PRJNA1223813
Files
2.Sideroxylon_spinosum_functional_annotation.zip
Files
(612.8 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:56d1229027a8ae594855b3e59060b051
|
303.0 MB | Preview Download |
|
md5:de498cdb784ce7bb2edc62325ab87ae6
|
309.9 MB | Preview Download |