Published July 20, 2022 | Version v2
Dataset Open

Lathyrus sativus LS007 genome assembly and annotation Rbp1.0

  • 1. John Innes Centre
  • 2. Queensland University of Technology
  • 3. National Institute of Agricultural Botany
  • 4. Earlham Institute
  • 5. University of Nottingham
  • 6. Biology Centre CAS
  • 7. Istituto Agrario di San Michele all'Adige – IASMA
  • 8. Whitehead Institute for Biomedical Research
  • 9. Global Crop Diversity Trust
  • 10. ICARDA
  • 11. University of East Anglia

Description

Genome assembly of grass pea (Lathyrus sativus L.) genotype LS007, assembled from PromethION nanopore data and polished using Illumina HiSeq PE data. The assembly was annotated using the mikado-minos pipeline developed by the Earlham Institute. Also included is a separate annotation track for repeat sequences produced using the DANTE pipeline.

 

For any questions regarding this dataset, contact peter.emmrich@jic.ac.uk

 

Note: ctg14433 has been manually corrected based on sequenced amplicon data. Files have been updated accordingly.

 

Assembly files:

Lsativus_LS007_Rbp1.0.7z - compressed complete assembly without scaffolding. The annotation refers to this assembly.

Rbp_9 largest HiC scaffolds.7z - compressed fasta file of the largest 9 scaffolds following HiC scaffolding

Lsat_LS007_Rbp_chloroplast.fasta - fasta file of the complete LS007 chloroplast genome

Lsat_LS007_Rbp_mitochondrion.fasta - fasta file of the complete LS007 mitochondrial genome

 

Annotation tracks:

LATSA3860_EIv1.0.annotation.gff3

DANTE_transposable_element_protein_domains.gff3

Full_length_LTR_retrotransposons.gff3

Repeat_annotation_classI_classII_satellites.gff3

 

Annotation FASTA files:

LATSA3860_EIv1.0.annotation.gff3.cds.fasta

LATSA3860_EIv1.0.annotation.gff3.cdna.fasta

LATSA3860_EIv1.0.annotation.gff3.pep.fasta

 

Summaries and statistics:

LATSA3860_EIv1.0.annotation.gff3.final_table.tsv

LATSA3860_EIv1.0.annotation.gff3.mikado_stats.txt

LATSA3860_EIv1.0.annotation.gff3.biotype_conf.summary

LATSA3860_EIv1.0.annotation.gff3.pep.fasta.functional_annotation.tsv

NOT_UPDATED_LATSA3860_EIv1.0.annotation.gff3.metrics.tsv *

Blobtools_passed_contigs.txt - list of all contigs of the assembly that pass the BlobTools filter (Streptophyta, 20-100x coverage, >50 kbp) 
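The contig list can be used to restrict the unscaffolded assembly to the contigs that passed the filter. The following is a minimal sketch, not part of the dataset: it assumes that Blobtools_passed_contigs.txt holds one contig ID per line and that FASTA headers begin with the same ID. The function names are hypothetical.

```python
from typing import Iterable, Iterator, Set


def read_contig_ids(lines: Iterable[str]) -> Set[str]:
    """Collect contig IDs, assuming one ID per line (first token is the ID)."""
    ids = set()
    for line in lines:
        line = line.strip()
        # Skip blank lines and comment lines, if any are present.
        if line and not line.startswith("#"):
            ids.add(line.split()[0])
    return ids


def filter_fasta(lines: Iterable[str], keep: Set[str]) -> Iterator[str]:
    """Yield only the FASTA records whose header ID is in `keep`."""
    emit = False
    for line in lines:
        if line.startswith(">"):
            # The record ID is the first whitespace-separated token after '>'.
            header = line[1:].split()
            emit = bool(header) and header[0] in keep
        if emit:
            yield line
```

Both functions accept any iterable of lines, so open file handles can be passed directly, for example `filter_fasta(open(assembly_path), read_contig_ids(open("Blobtools_passed_contigs.txt")))`, where `assembly_path` stands for the FASTA file extracted from Lsativus_LS007_Rbp1.0.7z.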

 

* This file has not been updated to reflect the correction to ctg14433.

Files (6.2 GB)

Checksum Size
md5:71a3bc1efdf5dcb956aa411ab44fcc41 1.5 MB
md5:ffa42278010f45f186d52d4f496ac7b3 485.7 MB
md5:95fc2d3ee6093cf7482fbcaa8d077e94 28.6 MB
md5:800a56ae1ae23dd47f313fe86e5183e1 171.3 MB
md5:cf5915ab9b2be465d1347e3934f63930 237 Bytes
md5:c2b778df00c26aeb2ab3d1c3cfeabf8f 177.3 MB
md5:77bc005195869086b167f4361a179f6b 108.1 MB
md5:a1aaecf009423acb40b014684cbef8b8 13.9 MB
md5:d27a536a0d87e5e939e381f08336378b 2.4 kB
md5:8d399188a4299ad7d1631c6aa528c3b5 557 Bytes
md5:bcaa57152d85669003a44e3db1025af1 43.7 MB
md5:e3e7ee441969173d03c2b065ac2114df 22.4 MB
md5:5c67ce0100061a88fae2ad9fc41d4f42 122.1 kB
md5:edf60710ac9277bad50222f3eefbe226 299.6 kB
md5:46dccd8176920ce8aa0088399c158bf6 1.3 GB
md5:21ee9426da6ef3ff66381264d3ac6154 17.1 MB
md5:967900eec58909571e4102c501c3a78e 620.6 MB
md5:4c80c6f6f2455b8cf8ea935c39deab97 2.7 GB
md5:aa711d6532eb21bb664104a2192686bf 437.3 MB
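
A downloaded file can be checked against the md5 values listed above. The following is a minimal sketch using Python's standard hashlib module. The helper names are hypothetical, and because the listing does not say which checksum belongs to which file, the pairing of file and checksum must come from the record's download page.

```python
import hashlib


def md5_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Reading in fixed-size chunks keeps memory use low for multi-GB files.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path: str, listed: str) -> bool:
    """Compare a file against a listed value such as 'md5:71a3bc...'."""
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

For example, `matches_listing(local_path, "md5:4c80c6f6f2455b8cf8ea935c39deab97")` returns True only if the file at `local_path` (a placeholder for wherever the download was saved) has that digest.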