Published February 24, 2024 | Version v3
Dataset Open

PPIRef

Description

PPIRef is a dataset of 3D structures of protein-protein interfaces. See the GitHub repository for more details.

 

File description

ppi_6A.zip stores the PPIRef dataset: .pdb files with all 6A-distance interfaces from PDB as of Jan 2024.

ppi_6A_stats.zip stores the statistics about the dataset. This includes the indexes for fast search with MMseqs2 and iDist, as well as a .csv file with main statistics for all interfaces.

ppi_10A.zip TBD

ppi_10A_stats.zip TBD



How to use

It is recommended to download and extract the files in the PPIRef/ppiref/data/ppiref directory. This can be done automatically via the ppiref package. For example, to download and extract the ppi_6A.zip archive run:
from ppiref.utils.misc import download_from_zenodo
download_from_zenodo('ppi_6A.zip')

Files

ppi_6A_stats.zip

Files (10.0 GB)

Name Size Download all
md5:198b99214247843b5c29c1b5c9fede04
6.9 GB Preview Download
md5:4152f220384ded2f1a67989c39ea77ef
3.1 GB Preview Download

Additional details

Related works

Is published in
Dataset: arXiv:2310.18515 (arXiv)

Software

Repository URL
https://github.com/anton-bushuiev/PPIRef
Programming language
Python