Published October 16, 2023 | Version v1
Journal article Open

SaLT&PepPr: An Interface-Predicting Language Model for Designing Peptide-Guided Protein Degraders

Description

Protein-protein interactions (PPIs) are critical for biological processes and predicting the sites of these interactions is useful for both computational and experimental applications. We present a novel Structure-agnostic Language Transformer and Peptide Prioritization (SaLT&PepPr) pipeline to predict interaction interfaces from a protein sequence alone for subsequent generation of peptidic binding motifs. Our model fine-tunes the state-of-the-art ESM-2 protein language model (pLM) with a per-position prediction task to identify PPI sites using data from the PDB, and prioritizes motifs which are most likely to be involved within intra-chain binding. By only using amino acid sequence as input, the model is competitive with structural homology-based methods, but exhibits reduced performance compared with deep learning models that input both structural and sequence features, such as ScanNet. Inspired by our previous results using co-crystal structures to engineer target-binding "guide" peptide motifs derived from interaction sites on a binding partner, we use curated PPI databases to identify binding partners for subsequent peptide derivation. We specifically focus on several critical cancer-regulating proteins and prioritize guide peptides from either known or dataset-identified interacting partners. Fusing these guide peptides to an E3 ubiquitin ligase domain, CHIPΔTPR, we demonstrate degradation of endogenous β-catenin, 4E-BP2, and TRIM8, and specifically highlight the nanomolar binding affinity, low off-targeting propensity, and function-altering capability of our best performing degraders in cellular models of cancer. In total, our study suggests that prioritizing binders from natural interactions via pLMs can enable programmable protein targeting and modulation.

Other

The work was further supported by the National Science Foundation (grant CBET-1605242) and the Defense Threat Reduction Agency (grant HDTRA1-20-10004) to the lab of M.P.D. at Cornell University.

Files

FCS files_ZipGFP.zip

Files (250.6 MB)

Name Size Download all
md5:b36f4a5307d43b72b4822d51f9a2c860
166.0 kB Download
md5:7fbb3a8be1fc58ac894877a8bc78a13f
48.5 MB Preview Download
md5:9f8cdda8b431396f187003cd0447a545
64.4 MB Download
md5:d5f73d6936dc63c91927bafd43b61687
8.2 MB Preview Download
md5:d60960f7add00206f4a65d8c5f5c28e3
123.3 MB Preview Download
md5:3305e5a1f1923f8fbd9f21ac2a2535b3
6.1 MB Preview Download