There is a newer version of the record available.

Published April 17, 2023 | Version 1.0.0
Dataset Open

Dataset of sequences, alignments and structural models generated for the structural prediction of complexes mediated by intrinsically disordered regions.

  • 1. Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France

Description

This archive contains input and ouput files used and generated for the scanning of intrinsically disordered region and the prediction of their binding sites to receptor proteins. The archive contains 3 directories and a README file detailing their contents :

  • the initial raw sequence and alignment data for every chain        -> DIRECTORY fasta_msa/
  • the input and output data of every Alphafold run for every complex   -> DIRECTORY af2_runs/
  • the native reference structures    -> DIRECTORY ref_capri_curated/

The protein-peptide complex cases have been assigned a distinct index number, from 1 to 42, consistent across the several directories of the archive. Their corresponding directories are labelled as <index>_<pdbcode>.


These data can be used to rerun specific sections of the pipeline and scripts provided in: https://github.com/i2bc/SCAN_IDR

 

Files

Files (1.2 GB)

Name Size Download all
md5:dc438a8ddec8131d5d4221fac5727935
1.2 GB Download

Additional details

Funding

Agence Nationale de la Recherche
PPIMei - Protein-Protein Interactions in Meiosis ANR-21-CE44-0009