iGEMME Missense Mutational Effect Predictions for Entire Human Proteome
Description
This dataset contains iGEMME single-point mutation predictions for approximately 19,000 human proteins. iGEMME predictions use only evolutionary information derived from multiple sequence alignment files.
Description of the data and file structure
This dataset contains iGEMME predictions for all human proteins.
Data for each human protein are stored in a folder named after its UniProt ID. Inside each UniProt ID folder, a subfolder called results contains all input and output files. For example, the results folder for UniProt ID A0A0B4J245 contains the following files:
- Raw iGEMME predictions (output file): A0A0B4J245_normPred_evolCombi_igemme.txt
- Rank-sorted (between 0 and 1) iGEMME predictions in CSV format (output file): A0A0B4J245_normPred_evolCombiTransposedRanksorted_igemme.csv
- ColabFold MSA file (input file): aliA0A0B4J245.fasta
- JET2 file containing JET scores for each amino acid (output file): A0A0B4J245_jet_igemme.res
- Configuration file containing default parameters (output file): default.conf
- Log file (output file): igemme.log
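The naming scheme above can be turned into a small path helper. The following is a minimal sketch, assuming the archive has been extracted so that each UniProt ID folder sits directly under one root directory. The function names, the dictionary keys, and the root location are hypothetical and are not part of the dataset.

```python
from pathlib import Path


def igemme_result_paths(root, uniprot_id):
    """Return the expected files in <root>/<uniprot_id>/results, keyed by role.

    The file names follow the pattern listed in the dataset description;
    the dictionary keys are illustrative labels only.
    """
    results = Path(root) / uniprot_id / "results"
    return {
        "raw_predictions": results / f"{uniprot_id}_normPred_evolCombi_igemme.txt",
        "ranksorted_predictions": results
        / f"{uniprot_id}_normPred_evolCombiTransposedRanksorted_igemme.csv",
        "msa": results / f"ali{uniprot_id}.fasta",
        "jet_scores": results / f"{uniprot_id}_jet_igemme.res",
        "config": results / "default.conf",
        "log": results / "igemme.log",
    }


def missing_files(root, uniprot_id):
    """List the roles whose expected file is not present on disk."""
    paths = igemme_result_paths(root, uniprot_id)
    return sorted(role for role, path in paths.items() if not path.is_file())
```

Calling `missing_files("extracted_dataset", "A0A0B4J245")` with a hypothetical extraction directory returns an empty list when all six files are present, which gives a quick sanity check before any parsing is attempted.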
Files
Total size: 15.7 GB

Size | MD5 checksum |
---|---|
15.7 GB | 9a1d9a0a141064cdfbcc6191afad9456 |
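The MD5 checksum listed for the archive can be used to confirm that a 15.7 GB download completed intact. The following is a minimal sketch using Python's standard `hashlib` module. The archive file name is not given in this record, so any path passed to these functions is an assumption, and the function names are hypothetical.

```python
import hashlib

# MD5 checksum as listed in the Files section of this record.
EXPECTED_MD5 = "9a1d9a0a141064cdfbcc6191afad9456"


def md5_of_file(path, chunk_size=1024 * 1024):
    """Compute the MD5 hex digest of a file, reading it in chunks
    so that a multi-gigabyte archive is never loaded into memory at once."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest equals the expected value."""
    return md5_of_file(path) == expected.lower()
```

If `matches_checksum` returns `False` for the downloaded archive, the file is most likely truncated or corrupted and should be downloaded again before extraction.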