There is a newer version of the record available.

Published March 18, 2024 | Version v1
Dataset Open

A phased genome of the highly heterozygous 'Texas' almond uncovers patterns of allele-specific expression linked to heterozygous structural variants

Description

# Genomic datasets associated with the publication

# Gene-ID conversion with previous genome version

Texasv3_vs_Texasv2_GeneID.txt -- gene ID conversion between Texasv3 and Texasv2 (https://www.rosaceae.org/analysis/295)

pdulcis26_to_F1_liftoff_polished.gff3 -- Texasv2 gene annotation liftoff on Phase-1 assembly (Phase-1 coordinates)

# Phase-1

Texas_F1_K80_chr.fasta -- genome assembly, Phase-1
Texas_F1_gene_models.gff3 -- Phase-1 gene annotation (de novo annotation)
Functional_annotation_TexasF1.csv -- Phase-1 gene functions
Texas_F1_ref_SV.vcf -- structural variants relative to Phase-0 (this file uses Phase-1 as the reference)
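
As a quick sanity check after download, the gene annotation files can be summarized by feature type. The following is a minimal sketch, assuming the `.gff3` files follow the standard nine-column, tab-separated GFF3 layout with the feature type in column 3; the function name `count_feature_types` is illustrative and not part of the dataset.

```python
from collections import Counter
from typing import Iterable


def count_feature_types(lines: Iterable[str]) -> Counter:
    """Count GFF3 feature types (column 3) from an iterable of lines."""
    counts: Counter = Counter()
    for line in lines:
        line = line.rstrip("\n")
        # An embedded sequence section, if present, ends the feature table.
        if line.startswith("##FASTA"):
            break
        # Skip blank lines, directives, and comments.
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        # Assumption: well-formed GFF3 records have exactly nine columns.
        if len(fields) != 9:
            continue
        counts[fields[2]] += 1
    return counts


# Example usage (the path assumes the file was extracted locally):
# with open("Texas_F1_gene_models.gff3", encoding="utf-8") as handle:
#     for feature_type, n in count_feature_types(handle).most_common():
#         print(f"{feature_type}\t{n}")
```

The same function should work unchanged on the Phase-0 annotation and on the TE annotation files, since they share the `.gff3` extension.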

# Phase-0

Texas_F0_K80_chr.fasta -- genome assembly, Phase-0
Texas_F0_gene_models.gff3 -- Phase-0 gene annotation (liftoff from Phase-1)
Functional_annotation_TexasF0.csv -- Phase-0 gene functions
Texas_F0_ref_SV.vcf -- structural variants relative to Phase-1 (this file uses Phase-0 as the reference)
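
The two structural variant files describe the same haplotype differences from opposite reference points, so a per-type tally is a simple way to compare them. The sketch below assumes the records carry the `SVTYPE` key in the INFO column (column 8), which the VCF specification reserves for structural variants; whether these particular files populate it has not been verified here, and records without it are counted as `unknown`. The function name `tally_sv_types` is illustrative.

```python
from collections import Counter
from typing import Iterable


def tally_sv_types(lines: Iterable[str]) -> Counter:
    """Count VCF records per SVTYPE value found in the INFO column."""
    counts: Counter = Counter()
    for line in lines:
        # Skip blank lines and header lines (## meta lines and #CHROM).
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        # A VCF data line has at least eight fixed columns.
        if len(fields) < 8:
            continue
        sv_type = "unknown"
        for entry in fields[7].split(";"):
            key, _, value = entry.partition("=")
            if key == "SVTYPE" and value:
                sv_type = value
                break
        counts[sv_type] += 1
    return counts


# Example usage (the path assumes the file was extracted locally):
# with open("Texas_F0_ref_SV.vcf", encoding="utf-8") as handle:
#     print(tally_sv_types(handle))
```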

# Transposable element annotation

Texas_F0_HiConf_TE_v3.gff3 -- TE annotation in Phase-0
Texas_F1_HiConf_TE_v3.gff3 -- TE annotation in Phase-1
Texasv3_TElib.fa -- TE library of Texasv3 (non-redundant repeat consensus sequences, taking both genome phases into account)

Files

Phase-0.zip

Files (258.2 MB)

Checksum -- Size
md5:96b007e802f7703e5f863ffda3b828b0 -- 78.2 MB
md5:9b222a6de974d332378cbb184909f7a7 -- 92.1 MB
md5:90cc27ceb7279722af96669780dc1f1f -- 81.7 MB
md5:6919327851e9686b4db4128b0d3710c3 -- 630.0 kB
md5:cfbf9c2b0524ebf00015bbd2df3acc94 -- 5.6 MB
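
The MD5 checksums listed above can be used to confirm that a download completed intact. This is a minimal sketch using Python's standard `hashlib` module; the helper names `md5_of_file` and `matches_listing` and the file path in the usage comment are hypothetical, and the listing does not say which checksum belongs to which file, so the pairing has to be taken from the record page itself.

```python
import hashlib


def md5_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path: str, listed: str) -> bool:
    """Compare a file's MD5 with a listed value such as 'md5:96b0...'."""
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected


# Example usage (hypothetical local path; pair it with the checksum shown
# next to that file on the record page):
# print(matches_listing("downloaded_archive.zip",
#                       "md5:96b007e802f7703e5f863ffda3b828b0"))
```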