ESCOTT Missense Mutational Effect Predictions for Entire Human Proteome
Creators
Description
This dataset contains ESCOTT single point mutation predictions of about ~19000 human proteins.
Description of the data and file structure
Data of each human protein is in a folder named after its uniprotID. Inside uniprotID folder, there is a subfolder called results that contain all input and output. An example results folder for uniprotID A0A0B4J245 will contain the following files:
-
Raw escott predictions (output file): A0A0B4J245_normPred_evolCombi_escott.txt
-
Ranksorted (between 0-1) escott predictions in csv format (output file): A0A0B4J245_normPred_evolCombiTransposedRanksorted_escott.csv
-
Colabfold MSA file (input file): aliA0A0B4J245.fasta
-
Bzipped pdb file (input file): AF-A0A0B4J245-F1-model_v4.pdb.tar.bz2
-
JET2 file containing JET, PC and CV scores for each amino acid (output file) : A0A0B4J245_jet_escott.res
-
Configuration file containing default parameters (output file): default.conf
-
Log file (output file): escott.log
Files
Files
(66.0 GB)
Name | Size | Download all |
---|---|---|
md5:fb1897bffd8f20362e9af8544aa5905d
|
3.8 GB | Download |
md5:2eec7cf66325edbe8bc3414297a8c891
|
9.6 GB | Download |
md5:1e6411e9aa67a486a8e49704ee324016
|
12.9 GB | Download |
md5:264c61df3df5da30f27d024edb23839c
|
17.5 GB | Download |
md5:9ef42f4757d331aa73e445852de23aa0
|
11.2 GB | Download |
md5:9fc180d1fec4926062edf48e35c2a3ca
|
9.9 GB | Download |
md5:f08906f9de81d802412fa7679131e149
|
1.1 GB | Download |