There is a newer version of the record available.

Published January 5, 2024 | Version v2

PRESCOTT/ESCOTT/iGEMME mutational effect predictions of all single point mutations for ~3000 proteins

  • 1. ROR icon Sorbonne Université
  • 2. ROR icon Centre International de Recherche en Infectiologie

Description

This dataset contains all necessary data to reproduce our analyses on ~3000 proteins:

It is made up of 4 compressed datasets. 

  1. All colabfold MSAs and structures for ~3000 proteins: colabfold-sequences-structures-3000-proteins.tar.bz2
  2. All escott prediction: escott-v-1-6-0-max-two-components-colabfold-msas-entire-single-point-mutations-cvRC7.tgz
  3. All igemme predictions: escott-v-1-6-0-tjet-only-colabfold-msas-entire-single-point-mutations.tgz
  4. All gnomad v4.0.0 csv files used for prescott predictions: gnomadv4-0-0-csv-files.tgz

Files

Files (40.5 GB)

Name Size
md5:e926667f5c6c56d710459729cc11be1d
16.7 GB Download
md5:fdbda059fa886a9f9bab511782bfd8d0
11.4 GB Download
md5:cea2e12a561ebfe4ad7ba299a68b80ce
12.0 GB Download
md5:dad9725c4a82697733189f8c77ef0636
490.5 MB Download