There is a newer version of the record available.

Published April 11, 2023 | Version v5
Dataset Open

Telomere-to-Telomere assembly and annotation of PN40024

Creators

Description

    PN40024, a highly homozygous Pinot Noir inbred line, was used for T2T genome assembly. In total, we generated 39.12Gb (~65X coverage) HiFi reads by the PacBio platform. The preliminary assembly were conducted using Hifiasm on HiFi reads, and Mumer was used to order and orient the contig-level assemblies using the PN40024.v3 genome as the reference, forming 169 contigs representing 19 chromosomes.

     The PN_T2T genome size (494.87M) is longer than that of 12X.v0 (426.18M). Due to the accuracy of HiFi long reads, the N50 length PN_T2T of (26.89 Mb) is 260 times higher than PN_v3 (~102Kb). For all 9423 gaps in 12X.v0 assembly, PN_T2T assembly is the gap-free grape genome.To validate the quality of our assembly, K-mer and BUSCO were conducted. We used K-mer to evaluate genomic heterozygosity, estimated 99.8%. BUSCO to evaluate genomic completeness, about 98.5% of the core conserved plant genes were found complete in the genome assembly.

The PN40024.T2T genome assembly: PN40024.T2T.fa

The PN40024.T2T gene annotation: PN40024.gff3

The PN40024.T2T TE annotation: PN40024.TE.gff

The PN40024.T2T centromere annotation: PN40024.trf.gff3

The PN40024.T2T protein sequence: PN40024.protein.fa

The PN40024.T2T cds sequence: PN40024.cds.fa

Comparison of gene annotation among PN_T2T and 12X.v0, 12X.v2, PN40024.v4, PN40024.v4.1: correlation.list

Files

Files (885.1 MB)

Name Size Download all
md5:43f667069b460e7741a3b4bcb86e4193
3.1 MB Download
md5:3cb62ad5a1f40e385f110c1eec9748b3
52.1 MB Download
md5:2f4c0c3901470e7d085c47ddd49db147
64.0 MB Download
md5:f05a9ec9ca3a9d0e566a112813a31a17
17.8 MB Download
md5:41e9b15711e8ed66803caa23aafa3da3
494.9 MB Download
md5:1c54d8328c25ee6de6a1808a7806c2dd
70.7 MB Download
md5:d81e776f8086b03f5aa0fac74d67eccf
182.6 MB Download