Published October 18, 2025 | Version v1.0.0
Dataset Open

Pragmatic analysis with knowledge-guided for unraveling peptide-protein pairwise non-covalent mechanisms

Description

This dataset accompanies the research presented in the paper "Pragmatic analysis with knowledge-guided for unraveling peptide-protein pairwise non-covalent mechanisms".

Understanding peptide-protein interactions is vital for decoding cellular signaling and developing targeted therapies. However, the complexity of multi-molecular associations and the diversity of non-covalent interactions make accurate prediction and site-specific annotation challenging. Here, we propose KGIPA, a knowledge-guided pragmatic analysis framework that brings pragmatic concepts from natural language into life science, capturing the influence of biological environments on non-covalent interactions. KGIPA integrates intra- and extra-linguistic contextual information to combine multimodal single-molecule features and build residue-level interaction maps. It also uses biological prior knowledge to coordinate the various non-covalent interaction types. Benchmark tests demonstrate that KGIPA outperforms state-of-the-art methods in evaluating molecular binding, including the prediction of protein binding residues, peptide binding residues, and residue-pair interactions. Furthermore, KGIPA performs strongly in peptide-protein binding affinity prediction and peptide virtual screening. Wet-lab experiments validate its reliability, revealing high consistency between predicted and experimentally measured binding behaviors. These results highlight KGIPA’s potential to accelerate peptide drug discovery and establish pragmatic analysis as an effective paradigm for decoding the molecular language of interactions.

Files

Files (2.2 GB)

Checksum Size
md5:20cf721eed994a736613fd55b636b46d 1.8 GB
md5:96c7c97a9486fa9ba48968befb3f2103 315.0 MB
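
The MD5 digests in the file listing can be used to confirm that a download completed without corruption. The following is a minimal sketch in Python, not part of the dataset itself: the function names `md5_of` and `matches_listing` are illustrative, and because the listing does not show file names, the path passed in is whatever name the downloaded file has locally.

```python
import hashlib

# MD5 digests copied from the file listing on this record.
EXPECTED_MD5 = {
    "20cf721eed994a736613fd55b636b46d",  # listed as 1.8 GB
    "96c7c97a9486fa9ba48968befb3f2103",  # listed as 315.0 MB
}


def md5_of(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks
    so that multi-gigabyte files are never loaded into memory at once."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest is one of the listed digests."""
    return md5_of(path) in expected
```

Calling `matches_listing` on the local path of a downloaded file returns `True` when its digest equals one of the two listed values. A `False` result suggests a truncated or altered download, and the file should be fetched again.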

Additional details

Additional titles

Alternative title (English)
KGIPA