Published January 5, 2024
| Version v2
Dataset
Open
PRESCOTT/ESCOTT/iGEMME mutational effect predictions of all single point mutations for ~3000 proteins
Authors/Creators
Description
This dataset contains all necessary data to reproduce our analyses on ~3000 proteins:
It is made up of 4 compressed datasets.
- All colabfold MSAs and structures for ~3000 proteins: colabfold-sequences-structures-3000-proteins.tar.bz2
- All escott prediction: escott-v-1-6-0-max-two-components-colabfold-msas-entire-single-point-mutations-cvRC7.tgz
- All igemme predictions: escott-v-1-6-0-tjet-only-colabfold-msas-entire-single-point-mutations.tgz
- All gnomad v4.0.0 csv files used for prescott predictions: gnomadv4-0-0-csv-files.tgz
Files
Files
(40.5 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:e926667f5c6c56d710459729cc11be1d
|
16.7 GB | Download |
|
md5:fdbda059fa886a9f9bab511782bfd8d0
|
11.4 GB | Download |
|
md5:cea2e12a561ebfe4ad7ba299a68b80ce
|
12.0 GB | Download |
|
md5:dad9725c4a82697733189f8c77ef0636
|
490.5 MB | Download |