SaLT&PepPr: An Interface-Predicting Language Model for Designing Peptide-Guided Protein Degraders
Creators
- Brixi, Garyk (Researcher)1
- Ye, Tianzheng (Researcher)1
- Hong, Lauren (Researcher)1
- Wang, Tian (Researcher)1
- Monticello, Connor (Researcher)2
- Lopez-Barbosa, Natalia (Researcher)2
- Vincoff, Sophie (Researcher)1
- Yudistyra, Vivian (Researcher)1
- Zhao, Lin (Researcher)1
- Haarer, Elena (Researcher)1
- Chen, Tianlai (Researcher)1
- Pertsemlidis, Sarah (Researcher)1
- Palepu, Kalyan (Researcher)1
- Bhat, Suhaas (Researcher)1
- Christopher, Jayani (Researcher)1
- Li, Xinning (Researcher)1
- Liu, Tong (Researcher)1
- Zhang, Sue (Researcher)1
- Petersen, Lillian (Researcher)1
- DeLisa, Matthew P. (Supervisor)2
- Chatterjee, Pranam (Contact person)1
Description
Protein-protein interactions (PPIs) are critical for biological processes and predicting the sites of these interactions is useful for both computational and experimental applications. We present a novel Structure-agnostic Language Transformer and Peptide Prioritization (SaLT&PepPr) pipeline to predict interaction interfaces from a protein sequence alone for subsequent generation of peptidic binding motifs. Our model fine-tunes the state-of-the-art ESM-2 protein language model (pLM) with a per-position prediction task to identify PPI sites using data from the PDB, and prioritizes motifs which are most likely to be involved within intra-chain binding. By only using amino acid sequence as input, the model is competitive with structural homology-based methods, but exhibits reduced performance compared with deep learning models that input both structural and sequence features, such as ScanNet. Inspired by our previous results using co-crystal structures to engineer target-binding "guide" peptide motifs derived from interaction sites on a binding partner, we use curated PPI databases to identify binding partners for subsequent peptide derivation. We specifically focus on several critical cancer-regulating proteins and prioritize guide peptides from either known or dataset-identified interacting partners. Fusing these guide peptides to an E3 ubiquitin ligase domain, CHIPΔTPR, we demonstrate degradation of endogenous β-catenin, 4E-BP2, and TRIM8, and specifically highlight the nanomolar binding affinity, low off-targeting propensity, and function-altering capability of our best performing degraders in cellular models of cancer. In total, our study suggests that prioritizing binders from natural interactions via pLMs can enable programmable protein targeting and modulation.
Other
The work was further supported by the National Science Foundation (grant CBET-1605242) and the Defense Threat Reduction Agency (grant HDTRA1-20-10004) to the lab of M.P.D. at Cornell University.
Files
FCS files_ZipGFP.zip
Files
(250.6 MB)
Name | Size | Download all |
---|---|---|
md5:b36f4a5307d43b72b4822d51f9a2c860
|
166.0 kB | Download |
md5:7fbb3a8be1fc58ac894877a8bc78a13f
|
48.5 MB | Preview Download |
md5:9f8cdda8b431396f187003cd0447a545
|
64.4 MB | Download |
md5:d5f73d6936dc63c91927bafd43b61687
|
8.2 MB | Preview Download |
md5:d60960f7add00206f4a65d8c5f5c28e3
|
123.3 MB | Preview Download |
md5:3305e5a1f1923f8fbd9f21ac2a2535b3
|
6.1 MB | Preview Download |