There is a newer version of the record available.

Published October 18, 2024 | Version v1
Dataset Open

Data From: Oatk - a de novo assembly tool for complex plant organelle genomes

  • 1. ROR icon University of Cambridge
  • 2. ROR icon Wellcome Sanger Institute
  • 3. Anglia Ruskin University

Description

This reposity hosts the data for 195 plant organelle genome assemblies generated in the manuscript "Oatk: a de novo assembly tool for complex plant organelle genomes". The sequence data were produced by the Tree of Life programme at the Sanger Institute, mostly from the Darwin Tree of Life (DToL) project, including 24 monocots, 154 eudicots, 16 mosses and one liverwort. See SAMPLE_LIST file for descriptions of these species.

In each species subfolder, below files are included.

  1. PLTD.fasta                   Plastome assembly file in FASTA format
  2. PLTD.annot.bed          Plastome assembly annotation file in BED format
  3. MITO.fasta                   Mitogenome assembly file in FASTA format
  4. MITO.annot.bed          Mitogenome assembly annotation file in BED format
  5. MBG.gfa                         Genome assembly file in GFA format generated with MBG
  6. PMAT.gfa                       Genome assembly file in GFA format generated with OATK
  7. OATK.gfa                       Genome assembly file in GFA format generated with PMAT (may not exist)

Files

DATA.zip

Files (260.2 MB)

Name Size Download all
md5:d2af6b5ed179ffd8126438ca09e43e57
260.2 MB Preview Download
md5:20730dc20ad04aa4e813abbcdcfa890c
13.1 kB Preview Download

Additional details

Software