There is a newer version of the record available.

Published December 14, 2022 | Version 1.0
Dataset Open

Proteome-wide structure-based accessibility analysis of ligandable and detectable cysteines in chemoproteomic datasets

  • 1. Imperial College London
  • 2. Imperial College London, London Institute of Medical Sciences
  • 3. Imperial College London, Francis Crick Institute

Description

Covalent drug discovery, in particular targeting reactive cysteines, has undergone a resurgence over the past two decades, demonstrated by recent clinical successes of covalent inhibitors for high-priority cancer targets. Reactive cysteine profiling, first pioneered by the Cravatt lab, has emerged in parallel as a powerful approach for proteome-wide on- and off-target profiling. Thus far however, structural analysis of liganded cysteines has been restricted to experimentally determined protein structures. We combined AlphaFold-predicted amino acid side chain accessibilities for >95% of the human proteome with a meta-analysis of thirteen public cysteine profiling datasets, totalling 40,070 unique cysteine residues, revealing accessibility biases in sampled cysteines primarily dictated by warhead chemistry. Analysis of >3.5 million cysteine-fragment interactions further suggests that exposed cysteine residues are preferentially targeted by elaborated fragments and drug-like compounds. We finally propose a framework for benchmarking coverage of ligandable cysteines in future cysteine profiling approaches, considering both selectivity for high-priority residues and quantitative depth. All analysis and produced resources (freely available at www.github.com/TateLab) are readily extendable to reactive amino acids beyond cysteine, and related questions in chemical biology.

Files

20221214_CysteineAccessibility_bioRxiv_DataRepository.zip

Files (657.9 MB)

Additional details

Related works

Is supplement to
Preprint: 10.1101/2022.12.12.518491 (DOI)