PPB-Affinity: Protein-Protein Binding Affinity dataset for AI-based protein drug discovery

Liu, Huaqing

doi:10.5281/zenodo.13054646

Published July 27, 2024 | Version v1.2

Dataset Open


Liu, Huaqing (Contact person)¹

1. Research Institute of Tsinghua, Pearl River Delta

Prediction of protein-protein binding (PPB) affinity plays an important role in large-molecule drug discovery. Deep learning (DL) has been adopted to predict the change in PPB affinity upon mutation, but few studies have predicted PPB affinity itself. The major reason is the paucity of open-source datasets on PPB affinity. This study therefore introduces and releases a PPB affinity dataset (PPB-Affinity), intended to support the development of practical DL models that predict PPB affinity. The PPB-Affinity dataset contains key information such as crystal structures of protein-protein complexes (with or without protein mutation patterns), PPB affinity, the receptor protein chain, and the ligand protein chain. To the best of our knowledge, this is the largest publicly available PPB affinity dataset, and it may help industry screen for new large-molecule drugs more efficiently.

Code for PPB-Affinity database preparation is available at https://github.com/Huatsing-Lau/PPB-Affinity-DataPrepWorkflow.
Code for the benchmark algorithm is available at https://github.com/ChenPy00/PPB-Affinity.

Files are organized as follows:

- PPB-Affinity.xlsx
- samples_deleted.zip
- PDB/
  - Affinity Benchmark v5.5/
    - file1.pdb
    - file2.pdb
    - ...
    - filek.pdb
  - ATLAS/
  - PDBbind v2020/
  - SAbDab/
  - SKEMPIv2.0/
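As a quick sanity check after unpacking, the layout above can be walked to count structure files per source. The following is a minimal sketch, assuming that PDB.zip extracts to a `PDB/` directory with one subfolder per source dataset, as the listing suggests. The function name `count_pdb_files` is hypothetical and not part of the dataset's own code.

```python
from collections import Counter
from pathlib import Path


def count_pdb_files(pdb_root):
    """Count .pdb files per source subfolder under an extracted PDB/ directory.

    Assumes (not confirmed by the record) that each source dataset has its
    own subfolder directly below ``pdb_root``.
    """
    root = Path(pdb_root)
    counts = Counter()
    for path in root.rglob("*.pdb"):
        parts = path.relative_to(root).parts
        if len(parts) < 2:
            # File sits directly in the root, not in a source subfolder; skip it.
            continue
        # The first path component below the root is taken as the source name,
        # e.g. "SKEMPIv2.0" or "PDBbind v2020".
        counts[parts[0]] += 1
    return dict(counts)
```

Calling `count_pdb_files("PDB")` from the directory where the archive was unpacked would return a dictionary keyed by subfolder name. The path `"PDB"` is an assumption and should be adjusted to the actual extraction location.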

Files


Files (3.1 GB)

Name	MD5	Size
PDB.zip	88ba34c314b2820435afa7ccb8005b1a	3.1 GB
PPB-Affinity.xlsx	032a5ce1f24212aef5cdedf8117ef084	1.2 MB
samples_deleted.zip	43872a607961598860b5a45f66afc35a	2.1 MB
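The MD5 checksums listed for the three files can be used to confirm that a download completed intact, which is worth doing for the 3.1 GB archive. The following is a minimal sketch using only the Python standard library. The helper names `md5_of_file` and `verify_downloads` are hypothetical, and it assumes the files keep their original names in a single download directory.

```python
import hashlib
from pathlib import Path

# MD5 checksums as listed for the files of this record.
EXPECTED_MD5 = {
    "PDB.zip": "88ba34c314b2820435afa7ccb8005b1a",
    "PPB-Affinity.xlsx": "032a5ce1f24212aef5cdedf8117ef084",
    "samples_deleted.zip": "43872a607961598860b5a45f66afc35a",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(download_dir):
    """Map each expected file name to True (match), False (mismatch), or None (missing)."""
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(download_dir) / name
        results[name] = (md5_of_file(path) == expected) if path.is_file() else None
    return results
```

A result of `False` for any file suggests a truncated or corrupted download, and `None` means the file was not found in the given directory.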

	All versions	This version
Views	2,483	570
Downloads	2,124	352
Data volume	3.4 TB	317.0 GB
