Telomere-to-Telomere assembly and annotation of PN40024
Description
PN40024, a highly homozygous Pinot Noir inbred line, was used for the T2T genome assembly. In total, we generated 39.12 Gb (~65X coverage) of HiFi reads on the PacBio platform. The preliminary assembly was built from the HiFi reads with Hifiasm, and MUMmer was used to order and orient the contig-level assembly against the PN40024.v3 reference genome, yielding 169 contigs representing 19 chromosomes.
The PN_T2T genome (494.87 Mb) is larger than that of 12X.v0 (426.18 Mb). Owing to the accuracy of HiFi long reads, the N50 length of PN_T2T (26.89 Mb) is about 260 times that of PN_v3 (~102 Kb). Whereas the 12X.v0 assembly contains 9,423 gaps, the PN_T2T assembly is a gap-free grape genome.
To validate the quality of our assembly, we ran k-mer and BUSCO analyses. K-mer analysis was used to evaluate genomic heterozygosity, giving an estimate of 99.8%. BUSCO was used to evaluate genomic completeness: about 98.5% of the core conserved plant genes were found complete in the genome assembly.
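For readers unfamiliar with the N50 statistic quoted above, the following is a minimal sketch of how it is commonly computed: sort the sequence lengths from longest to shortest and report the length at which the running total first reaches half of the assembly size. The function name `n50` and the toy lengths are illustrative assumptions, not values or code from this record.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L
    together cover at least half of the total assembly size.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("lengths must not be empty")


# Toy example (hypothetical lengths, not taken from the PN40024 assembly).
toy_contigs = [80, 70, 50, 40, 30, 20, 10]  # total = 300
print(n50(toy_contigs))  # 80 + 70 = 150 reaches half of 300, so this prints 70
```

Applied to the figures in the description, the ratio 26.89 Mb / ~102 Kb comes to roughly 264, consistent with the "about 260 times" stated above.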
The PN40024.T2T genome assembly: PN40024.T2T.fa
The PN40024.T2T gene annotation: PN40024.gff3
The PN40024.T2T TE annotation: PN40024.TE.gff
The PN40024.T2T centromere annotation: PN40024.trf.gff3
The PN40024.T2T protein sequence: PN40024.protein.fa
The PN40024.T2T CDS sequence: PN40024.cds.fa
Comparison of gene annotation between PN_T2T and 12X.v0, 12X.v2, PN40024.v4, PN40024.v4.1: correlation.list
Files
(885.1 MB)
Checksum | Size |
---|---|
md5:43f667069b460e7741a3b4bcb86e4193 | 3.1 MB |
md5:3cb62ad5a1f40e385f110c1eec9748b3 | 52.1 MB |
md5:2f4c0c3901470e7d085c47ddd49db147 | 64.0 MB |
md5:f05a9ec9ca3a9d0e566a112813a31a17 | 17.8 MB |
md5:41e9b15711e8ed66803caa23aafa3da3 | 494.9 MB |
md5:1c54d8328c25ee6de6a1808a7806c2dd | 70.7 MB |
md5:d81e776f8086b03f5aa0fac74d67eccf | 182.6 MB |
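The md5 values listed with each file can be used to confirm that a download is intact. Below is a minimal sketch using Python's standard `hashlib` module. The helper names `md5_of_file` and `matches_listing` are hypothetical, and because this listing does not show which checksum belongs to which file name, the pairing has to be taken from the record page itself.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed):
    """Compare a local file against a listed value such as 'md5:43f6...'."""
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

A call such as `matches_listing("PN40024.T2T.fa", "md5:...")`, with the checksum copied from the matching row, returns `True` when the local file is identical to the deposited one. Reading in chunks keeps memory use low for the larger files.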