Published October 11, 2021 | Version 1.0.1
Dataset Open

Discoba protein sequences for protein structure predictions

  • 1. University of Oxford

Description

A comprehensive database of Discoba protein sequences, gathered to improve AlphaFold and RoseTTAFold protein structure predictions for Discoba species (including Trypanosoma and Leishmania). Originally gathered for use with: https://github.com/zephyris/discoba_alphafold

Notes

Updated to add two new genomes, add a summary statistics file, and transition to a standardised protein sequence naming scheme in the FASTA file: [isolate/species name]_P[protein index]
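
The naming scheme above can be split back into its two parts when reading the FASTA file. The following is a minimal sketch, not the dataset author's code. It assumes the identifier is the first whitespace-delimited token of the header line, that the protein index is a plain integer, and that everything before the final "_P" is the isolate/species name. The function name `parse_sequence_id` and the example identifier are hypothetical.

```python
import re
from typing import Optional, Tuple

# Assumed pattern: "<isolate/species name>_P<integer index>".
# The name part is matched greedily, so underscores inside the name are kept
# and only the final "_P<digits>" suffix is treated as the protein index.
_ID_PATTERN = re.compile(r"^(?P<name>.+)_P(?P<index>\d+)$")


def parse_sequence_id(header: str) -> Optional[Tuple[str, int]]:
    """Return (name, protein_index) for a FASTA header, or None if it does not match."""
    tokens = header.lstrip(">").split()
    if not tokens:
        return None
    match = _ID_PATTERN.match(tokens[0])
    if match is None:
        return None
    return match.group("name"), int(match.group("index"))


if __name__ == "__main__":
    # Hypothetical identifier, used only to illustrate the scheme.
    print(parse_sequence_id(">Example_isolate_P12"))
```

Headers that do not follow the scheme return None rather than raising, so a caller can count or log them instead of stopping partway through a large file.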

Files

discobaStats.csv

Files (759.2 MB)

Checksum                               Size
md5:86baa9946e1a4fa1673e644bb4a0fa50   759.1 MB
md5:ec91d1e6501b2f0d6fded2e958f8a59b   22.2 kB
md5:fa5353703a2d3d1c5871917a5fb12169   149 Bytes

Additional details

Funding

Wellcome Trust, award 211075
Making it through the life cycle: Motility for pathogenicity in Leishmania parasites