Published March 4, 2022 | Version 1.0
Dataset Open

IV-KAPhE kinase-substrate assignments for the entire human phosphoproteome

  • 1. University of Exeter

Description

This data set includes the full, all-vs-all kinase-substrate assignments by the IV-KAPhE method for the entire human phosphoproteome (union of the PhosphoSitePlus human phosphosite database and the Ochoa et al. 2020 high-confidence human phosphoproteome). This is an unfiltered version of Supplemental Table S1 from Invergo BM (2022) "Accurate, high-coverage assignment of in vivo protein kinases to phosphosites from in vitro phosphoproteomic specificity data".

The data set also includes files that facilitate scoring new human phosphosites, in particular with the in vitro half of the IV-KAPhE model. "naive-bayes-plus-model.tar.gz" is an archive of HDF5 files comprising the "Naive Bayes+" multi-label, in vitro kinase-substrate assignment model used in the IV-KAPhE model, as described in the manuscript. These files are intended for use with the motif-kit software package to score new sites. "kinase-int-domains-sig.tsv" and "kinase-sub-domains-sig.tsv" contain the Pfam domains enriched among each kinase's interacting partners and substrates, respectively. Finally, "human-kinase-interactions.tsv" and "human-kinase-2nd-interactions.tsv" contain physical interactions and indirect ("2 hop") interactions, respectively, between human protein kinases and other proteins, as described in the manuscript.
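As a rough starting point for exploring these files after download, the sketch below lists the contents of the model archive and previews the first rows of one of the TSV files. It uses only the Python standard library and makes no assumptions about column names or about how the archive is organised internally; the function names `list_archive_files` and `peek_tsv` are illustrative and are not part of motif-kit.

```python
import csv
import tarfile
from itertools import islice


def list_archive_files(archive_path):
    """Return the names of all regular files stored in a gzip-compressed tar archive."""
    with tarfile.open(archive_path, mode="r:gz") as archive:
        return [member.name for member in archive.getmembers() if member.isfile()]


def peek_tsv(tsv_path, n_rows=5):
    """Return the first n_rows rows of a tab-separated file as lists of strings.

    No header is assumed: if the file has one, it is simply the first row returned.
    """
    with open(tsv_path, newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle, delimiter="\t")
        return list(islice(reader, n_rows))
```

For example, `list_archive_files("naive-bayes-plus-model.tar.gz")` would show which HDF5 files the archive contains without extracting it, and `peek_tsv("human-kinase-interactions.tsv")` would show how the table is laid out before it is loaded into an analysis.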

Files (2.4 GB)

Checksum                                Size
md5:d6ed847816fd15efa77369412172fd9a    1.8 MB
md5:393ff42f1e7ddeeef7fd566e7e6d7b71    735.9 kB
md5:aa2d61cfd8b817160edc9f0650921170    414.9 kB
md5:f853f8fe4a29d6de407e2f808ad26a16    266.0 kB
md5:8325bdd12643f4e6e862c30e69cca44c    3.4 MB
md5:848e33cca500c6ce05ee8997e5a7946d    2.4 GB
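
The md5 values listed above can be used to check that a download completed intact, which matters most for the 2.4 GB file. The following is a minimal sketch using Python's standard `hashlib` module; the helper names are illustrative, and the listing does not say which checksum belongs to which file name, so the pairing of a local file with a checksum is left to the reader.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, listed):
    """Compare a file's MD5 digest with a listed value such as 'md5:d6ed...'."""
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected
```

A call such as `matches_listed_checksum(local_path, "md5:848e33cca500c6ce05ee8997e5a7946d")` returns `True` only if the file at `local_path` (a placeholder for wherever the download was saved) has exactly that digest.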

Additional details

Related works

Is documented by
Preprint: 10.1101/2021.08.31.458376 (DOI)