ESCOTT Missense Mutational Effect Predictions for Entire Human Proteome
Creators
Description
This dataset contains ESCOTT single point mutation predictions of about ~19000 human proteins.
Description of the data and file structure
Data of each human protein is in a folder named after its uniprotID. Inside uniprotID folder, there is a subfolder called results that contain all input and output. An example results folder for uniprotID A0A0B4J245 will contain the following files:
-
Raw escott predictions (output file): A0A0B4J245_normPred_evolCombi_escott.txt
-
Ranksorted (between 0-1) escott predictions in csv format (output file): A0A0B4J245_normPred_evolCombiTransposedRanksorted_escott.csv
-
Colabfold MSA file (input file): aliA0A0B4J245.fasta
-
Bzipped pdb file (input file): AF-A0A0B4J245-F1-model_v4.pdb.tar.bz2
-
JET2 file containing JET, PC and CV scores for each amino acid (output file) : A0A0B4J245_jet_escott.res
-
Configuration file containing default parameters (output file): default.conf
-
Log file (output file): escott.log
Files
ESCOTT-protein-list-v3.txt
Files
(66.0 GB)
Name | Size | Download all |
---|---|---|
md5:a36d85f4bf52de743f56f996dfb568a0
|
136.3 kB | Preview Download |
md5:fe928a798debd76ad5ddc6d6af07564b
|
2.0 GB | Download |
md5:18b4bfab01ecac950f7297b5a1d9540d
|
3.8 GB | Download |
md5:328050b7e7398951309b3da90024736a
|
6.7 GB | Download |
md5:026c1b25d603cbb80a62c338a7f49826
|
7.3 GB | Download |
md5:9ddc7cc8989adadd0b0f0bb1e0757ee8
|
9.2 GB | Download |
md5:e164b0cfa014edf30fb055df275a705c
|
11.2 GB | Download |
md5:2a1ccadf4584bf3fa6abbe8d883db2c0
|
13.2 GB | Download |
md5:5a17b2ba5c3faf53ddd3be1135c87e57
|
12.5 GB | Download |
Additional details
Related works
- Is published in
- Publication: 10.1186/s13059-025-03581-y (DOI)
Dates
- Available
-
2024-02-16