There is a newer version of the record available.

Published November 29, 2023 | Version v1
Dataset Open

Chromosome-level genome assembly of Triticum turgidum var 'Kronos'

  • 1. ROR icon University of California, Berkeley

Description

 

This data is made available under the Toronto Agreement. 

All of the data listed here is available under the prepublication data sharing principle of the Toronto agreement (1). By using this data, you agree to:

  • respect the rights of the data producers and contributors to analyze and publish the first global analyses and certain other reserved analyses of this data set in a peer-reviewed publication.
  • not redistribute, release, or otherwise provide access to the data to anyone outside of the group, until the data has been published & submitted to the public data repositories.
  • contact the authors to discuss any plans to publish data or analyses that utilize this data to avoid the overlap of any planned analyses.
  • fully cite the prepublication data along with any applicable versioning details.
  • understand that this data as accessed is precompetitive and is not patentable in its present state.

This agreement does not expire by time but only upon publication of the first global analysis by the data producers and contributors.
(1) Toronto International Data Release Workshop Authors. Prepublication data sharing. Nature 461, 168–170 (2009). https://doi.org/10.1038/461168a

 

  • If you have questions about the use of this dataset, please contact Ksenia Krasileva: kseniak [at] berkeley.edu

 

Summary of the datasets

We produced 526 Gbp of high-fidelity (HiFi) reads for Kronos. As Kronos typically self-pollinates in the field and its residual heterozygosity is low, these reads were assembled with hifiasm v0.19.5-r587 (-l0) to produce haplotype-collapsed assembly. Primary and associated contigs were concatenated into a single file. These contigs are in the files with the prefix 'Kronos.contigs'

The concatenated primary and associated contigs were further scaffolded with chromosome conformation capture sequencing (Hi-C) data. We used yahs v1.2a.2. The resulting 14 largest scaffolds were greater than 600 Mbp in size, representing 14 chromosomes (7 x AB). These scaffolds were renamed based on the similarity to the bread wheat reference genome from the IWGSC. After plasmid genomes were separated, the rest of the contigs or scaffolds, which were all smaller than 4 Mbp, were concatenated into a single sequenced named 'Un' (for unplaced). These sequences can be found in the files with the prefix 'Kronos.collapsed'. 

 

  • We plan to release additional data, such as annotations, as they become ready. If you have questions or suggestions about this dataset, please contact Ksenia Krasileva and Kyungyong Seong: s.kyungyong [at] berkeley.edu

 

Acknowledgement

This work has been funded by the United States Department of Agriculture - National Institute for Food and Agriculture Award (2021-67013-35726). 

Files

Files (6.0 GB)

Name Size Download all
md5:d5e8a858d73acdecfcfed3b8a8160412
15.9 MB Download
md5:ba99881361f3369ff52134e5d800338a
2.8 GB Download
md5:dc6654c1acd74a796af6b1cdcabe8004
2.8 MB Download
md5:f0c130f7523bbaedc978442df9f3334a
16.7 MB Download
md5:d8f389185969b3630344593b73c4c86a
3.1 GB Download
md5:5423c91b3ab7ed1f8aae2709eb1c7d93
3.1 MB Download