CPT-1 whole-proteome variant effect prediction
Creators
- University of California, Berkeley
Description
Cross-protein transfer learning for variant effect prediction
This repository contains the CPT-1 variant effect predictions for 18,602 human proteins, initially released with the manuscript "Cross-protein transfer learning substantially improves zero-shot prediction of disease variant effects". The proteins are split across three files:
CPT1_score_EVE_set.zip: Proteins in the EVE set (Frazer et al., 2021)
CPT1_score_no_EVE_set_1.zip and CPT1_score_no_EVE_set_2.zip: Proteins not in the EVE set. Predictions for these proteins use imputed values for the features that depend on the EVE MSA.
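As a starting point for working with the three archives, the following is a minimal Python sketch that lists what each downloaded zip file contains. It assumes the archives sit in a local directory under the file names given above. The helper names (`list_archive_members`, `summarize`) and the `download_dir` parameter are hypothetical and introduced only for illustration. The sketch makes no assumption about the format or naming of the files inside the archives, which this description does not specify.

```python
import zipfile
from pathlib import Path

# Archive names as listed in this record.
ARCHIVES = [
    "CPT1_score_EVE_set.zip",
    "CPT1_score_no_EVE_set_1.zip",
    "CPT1_score_no_EVE_set_2.zip",
]


def list_archive_members(archive_path):
    """Return the names of the files (not directories) stored in one zip archive."""
    with zipfile.ZipFile(archive_path) as zf:
        return [info.filename for info in zf.infolist() if not info.is_dir()]


def summarize(download_dir="."):
    """Map each archive found in download_dir to the list of its member files.

    Archives that have not been downloaded are skipped silently.
    """
    summary = {}
    for name in ARCHIVES:
        path = Path(download_dir) / name
        if path.exists():
            summary[name] = list_archive_members(path)
    return summary


if __name__ == "__main__":
    # Print a file count per archive for whatever is present locally.
    for archive, members in summarize().items():
        print(f"{archive}: {len(members)} files")
```

Inspecting the member names first is a cautious way to find out how the per-protein predictions are organized before choosing a parser for the individual files.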
Citation
Jagota, M.*, Ye, C.*, Rastogi, R., Albors, C., Koehl, A., Ioannidis, N., and Song, Y.S.†
"Cross-protein transfer learning substantially improves zero-shot prediction of disease variant effects", bioRxiv (2022)
*These authors contributed equally to this work.
†To whom correspondence should be addressed: yss@berkeley.edu