There is a newer version of the record available.

Published July 25, 2024 | Version v7
Dataset Open

PPIRef

Description

PPIRef is a dataset of 3D structures of protein-protein interfaces. See the GitHub repository for more details.

File description

ppi_6A.zip stores the PPIRef dataset: .pdb files with all interfaces extracted from PDB (as of January 2024) using a 6 Å distance cutoff.

ppi_6A_stats.zip stores statistics about the dataset: the indexes for fast search with MMseqs2 and iDist, and a .csv file with the main statistics for all interfaces.

ppi_10A.zip stores .pdb files with all interfaces extracted from PDB (downloaded in June 2024) using a 10 Å distance cutoff.

pdb_redo_ppi_10A.zip stores .pdb files with all interfaces extracted from PDB-REDO (downloaded in June 2024) using a 10 Å distance cutoff.



How to use

It is recommended to download and extract the files into the PPIRef/ppiref/data/ppiref directory. This can be done automatically via the ppiref package. For example, to download and extract the ppi_6A.zip archive, run:
from ppiref.utils.misc import download_from_zenodo
download_from_zenodo('ppi_6A.zip')
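If an archive was downloaded manually instead, it can be inspected before extraction. The following is a minimal sketch using only the Python standard library; the helper name list_pdb_members and the archive location are assumptions for illustration and are not part of the ppiref package.

```python
import zipfile
from pathlib import Path


def list_pdb_members(archive_path):
    """Return the sorted names of all .pdb entries stored in a zip archive."""
    with zipfile.ZipFile(archive_path) as archive:
        return sorted(
            name for name in archive.namelist()
            if name.lower().endswith(".pdb")
        )


if __name__ == "__main__":
    # Hypothetical location; adjust to wherever the archive was saved.
    archive = Path("ppi_6A.zip")
    if archive.exists():
        members = list_pdb_members(archive)
        print(f"{len(members)} .pdb files in {archive}")
```

Listing the members first avoids unpacking several gigabytes only to find that the archive is incomplete or saved in the wrong place.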

Files

Files (32.7 GB)

Checksum Size
md5:05f2605d4d7ab326e28b61000c2559ca 9.2 GB
md5:eca65e5b07c27cc7d657dd7876fe0c69 47.8 MB
md5:a41aeb496cdc58c0e89fab87bfd5f34b 13.3 GB
md5:00442fc07e10541a5b17aa1baf7d0841 70.2 MB
md5:198b99214247843b5c29c1b5c9fede04 6.9 GB
md5:4152f220384ded2f1a67989c39ea77ef 3.1 GB
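The MD5 checksums listed above can be used to confirm that a large download completed without corruption. The following is a minimal sketch using Python's hashlib; the function names md5_of_file and matches_checksum are hypothetical helpers, not part of the ppiref package.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file against a checksum given as 'md5:<hex>' or '<hex>'."""
    # Drop an optional 'md5:' prefix and normalize case and whitespace.
    expected_hex = expected.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected_hex
```

Reading in chunks keeps memory use constant even for the multi-gigabyte archives. The listing above does not show which checksum belongs to which file, so the expected value should be taken from the entry for the downloaded file on the record page.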

Additional details

Related works

Is published in
Dataset: arXiv:2310.18515 (arXiv)

Software

Repository URL
https://github.com/anton-bushuiev/PPIRef
Programming language
Python