Telomere-to-Telomere assembly and annotation of Pinot Noir 40024
Creators
Description
PN40024, a highly homozygous Pinot Noir inbred line, was used for T2T genome assembly. In total, we generated 39.12 Gb (~65X coverage) of HiFi reads on the PacBio platform. The preliminary assembly was built from the HiFi reads with Hifiasm, and Mumer was used to order and orient the contig-level assemblies against the PN40024.v3 reference genome, yielding 169 contigs representing 19 chromosomes.
The PN_T2T genome (494.87 Mb) is larger than PN40024.v3 (426.18 Mb). Owing to the accuracy of HiFi long reads, the N50 of PN_T2T (26.89 Mb) is about 260 times that of PN_v3 (~102 kb). Whereas the PN_v3 assembly contains 9,423 gaps, the PN_T2T assembly is a gap-free grape genome. To validate the quality of our assembly, we performed K-mer and BUSCO analyses. K-mer analysis was used to evaluate genomic heterozygosity (estimated value: 99.8%). BUSCO was used to evaluate genomic completeness: about 98.5% of the core conserved plant genes were found complete in the genome assembly.
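The N50 and gap statistics quoted above can be recomputed from any FASTA file. The following is a minimal sketch, not the pipeline used for this record: the helper names `read_fasta`, `n50`, and `count_gaps` are hypothetical, and a gap is assumed to be any maximal run of `N` characters.

```python
import re


def read_fasta(path):
    """Yield (name, sequence) pairs from a plain-text FASTA file."""
    name, chunks = None, []
    with open(path) as handle:
        for line in handle:
            line = line.strip()
            if line.startswith(">"):
                if name is not None:
                    yield name, "".join(chunks)
                # Keep only the first word of the header as the name.
                name, chunks = (line[1:].split() or [""])[0], []
            elif line:
                chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def n50(lengths):
    """Largest length L such that sequences of length >= L
    together cover at least half of the total length."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    return 0  # empty input


def count_gaps(sequence):
    """Count maximal runs of N/n (assumed here to mark assembly gaps)."""
    return len(re.findall(r"[Nn]+", sequence))
```

Applied to a downloaded assembly, one would collect `len(seq)` and `count_gaps(seq)` for each record returned by `read_fasta("PN40024.T2T.fa")`. Note that this computes N50 over the sequences as stored in the file, which may differ from a contig-level N50.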
The PN40024.T2T genome assembly: PN40024.T2T.fa
The PN40024.T2T gene annotation: PN40024.gff3
The PN40024.T2T TE annotation: PN40024.TE.gff
The PN40024.T2T centromere annotation: PN40024.trf.gff3
Comparison of gene annotation among PN_T2T and 12X.v0, 12X.v2, PN40024.v4, PN40024.v4.1: Gene connected list.xlsx
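For a quick look at what the annotation files contain, the feature types in a GFF3 file can be tallied from its third column. This is a minimal sketch assuming the files follow the standard nine-column, tab-separated GFF3 layout; the function name `count_feature_types` is hypothetical.

```python
from collections import Counter


def count_feature_types(lines):
    """Tally column 3 (feature type) over GFF3 feature lines."""
    counts = Counter()
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith("##FASTA"):
            break  # embedded sequence section follows; no more features
        if not line or line.startswith("#"):
            continue  # skip blank lines, comments, and directives
        fields = line.split("\t")
        if len(fields) != 9:
            continue  # not a well-formed feature line
        counts[fields[2]] += 1
    return counts
```

For example, `count_feature_types(open("PN40024.gff3"))` would return a `Counter` keyed by feature type, such as `gene` or `mRNA` if those types are present in the file.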
Files
Gene_connected_list.txt
Total size: 847.8 MB
MD5 checksum | Size |
---|---|
md5:8edc34f05bc8131c7402d3a485bcb38e | 200 Bytes |
md5:2f4c0c3901470e7d085c47ddd49db147 | 64.0 MB |
md5:41e9b15711e8ed66803caa23aafa3da3 | 494.9 MB |
md5:eea1e045d8b7f666fd036e3f96e7683c | 106.3 MB |
md5:d81e776f8086b03f5aa0fac74d67eccf | 182.6 MB |
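The MD5 checksums listed above can be used to confirm that a download is intact. The sketch below uses Python's standard `hashlib` module; the helper names `md5_of_file` and `matches_listed_md5` are hypothetical, and which checksum belongs to which file must be taken from the record itself.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks
    so that large files need not fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_md5(path, listed):
    """Compare a file against a checksum written as 'md5:<hex>' or '<hex>'."""
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

A call such as `matches_listed_md5(local_path, "md5:...")` returns `True` only when the local file's digest equals the listed value; a mismatch usually indicates an incomplete or corrupted download.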