There is a newer version of the record available.

Published April 11, 2023 | Version v2
Dataset Open

Telomere-to-Telomere assembly and annotation of Pinot Noir 40024

Description

    PN40024, a highly homozygous Pinot Noir inbred line, was used for T2T genome assembly. In total, we generated 39.12 Gb (~65X coverage) of HiFi reads on the PacBio platform. The preliminary assembly was conducted with Hifiasm on the HiFi reads, and MUMmer was used to order and orient the contig-level assemblies against the PN40024.v3 reference genome, yielding 169 contigs representing 19 chromosomes.

     The PN_T2T genome (494.87 Mb) is larger than PN40024.v3 (426.18 Mb). Owing to the accuracy of HiFi long reads, the N50 of PN_T2T (26.89 Mb) is about 260 times that of PN_v3 (~102 kb). Whereas the PN_v3 assembly contains 9,423 gaps, the PN_T2T assembly is a gap-free grape genome. To validate the quality of our assembly, we performed K-mer and BUSCO analyses. K-mer analysis was used to evaluate genomic heterozygosity, giving an estimate of 99.8%. BUSCO was used to evaluate genomic completeness: about 98.5% of the core conserved plant genes were found complete in the genome assembly.
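The summary figures quoted above (total length, N50, gap count) can be recomputed from any FASTA file. The following is a minimal sketch, not the authors' pipeline: the function names are hypothetical, gaps are assumed to be encoded as runs of `N`, and N50 is computed over whatever sequences the file contains (the description does not say whether its N50 refers to contigs or scaffolds).

```python
import re
from typing import Dict, Iterable, Iterator, List, Tuple


def read_fasta(path: str) -> Iterator[Tuple[str, str]]:
    """Yield (name, sequence) pairs from a plain-text FASTA file."""
    name = None
    chunks: List[str] = []
    with open(path) as handle:
        for line in handle:
            line = line.strip()
            if line.startswith(">"):
                if name is not None:
                    yield name, "".join(chunks)
                parts = line[1:].split()
                name = parts[0] if parts else ""
                chunks = []
            elif line:
                chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def n50(lengths: List[int]) -> int:
    """Length L such that sequences of length >= L cover at least half the total."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length
    return 0


def assembly_stats(records: Iterable[Tuple[str, str]]) -> Dict[str, int]:
    """Total length, N50, and number of gaps (assumed to be runs of N/n)."""
    lengths: List[int] = []
    gaps = 0
    for _, seq in records:
        lengths.append(len(seq))
        gaps += len(re.findall(r"[Nn]+", seq))
    return {"total_length": sum(lengths), "n50": n50(lengths), "gaps": gaps}
```

With the assembly downloaded, `assembly_stats(read_fasta("PN40024.T2T.fa"))` would report these values for the T2T genome. Each sequence is held in memory one at a time, which is acceptable for chromosome-scale records of this size.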

The PN40024.T2T genome assembly: PN40024.T2T.fa

The PN40024.T2T gene annotation: PN40024.gff3

The PN40024.T2T TE annotation: PN40024.TE.gff

The PN40024.T2T centromere annotation: PN40024.trf.gff3

Comparison of gene annotation among PN_T2T and 12X.v0, 12X.v2, PN40024.v4, PN40024.v4.1: Gene connected list .xlsx
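The annotation files listed above can be inspected without special tooling. As a minimal sketch, assuming they follow the standard nine-column, tab-separated GFF3 layout (the function name is hypothetical, and the feature types actually present in these files are not stated in the record), the snippet below counts how many features of each type a file contains.

```python
from collections import Counter
from typing import Iterable


def count_feature_types(lines: Iterable[str]) -> Counter:
    """Count GFF3 features by the value of column 3 (feature type)."""
    counts: Counter = Counter()
    for line in lines:
        line = line.rstrip("\n")
        # An embedded FASTA section, if present, ends the feature table.
        if line.startswith("##FASTA"):
            break
        # Skip blank lines, directives, and comments.
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        if len(fields) != 9:
            continue  # not a well-formed feature line
        counts[fields[2]] += 1
    return counts
```

Passing an open file handle, for example `count_feature_types(open("PN40024.gff3"))`, returns a `Counter` keyed by feature type, which gives a quick check that an annotation downloaded completely.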

Files

Gene_connected_list.txt

Files (847.8 MB)

Checksum Size
md5:8edc34f05bc8131c7402d3a485bcb38e 200 Bytes
md5:2f4c0c3901470e7d085c47ddd49db147 64.0 MB
md5:41e9b15711e8ed66803caa23aafa3da3 494.9 MB
md5:eea1e045d8b7f666fd036e3f96e7683c 106.3 MB
md5:d81e776f8086b03f5aa0fac74d67eccf 182.6 MB
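The MD5 checksums in the listing can be used to confirm that a download is intact. A minimal sketch follows; the function names are hypothetical, and because this listing does not show which checksum belongs to which file, the pairing has to be taken from the record itself.

```python
import hashlib


def md5sum(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_record(path: str, listed: str) -> bool:
    """Compare a file against a listed checksum such as 'md5:8edc...'."""
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5sum(path) == expected
```

Reading in chunks keeps memory use flat even for the larger files. A call such as `matches_record("PN40024.T2T.fa", "md5:<value from the record>")` returns `True` only when the local copy matches.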