Protein ligand benchmark dataset to "Current state of open source force fields in protein-ligand binding affinity predictions"
Description
This record contains the structure files and Gromacs topologies for the simulations carried out for the publication:
David F. Hahn, Vytautas Gapsys, Bert L. de Groot, David L. Mobley, and Gary Tresadern, Current state of open source force fields in protein-ligand binding affinity predictions, 2024.
The data is organized as followed:
├── targets.yml # list of all targets and their directories ├── <date>_<target_name_1> # directory for target 1│ ├── 00_data # metadata for target 1│ │ ├── edges.yml # edges/perturbations│ │ ├── ligands.yml # ligands and activities│ │ └── target.yml # target│ ├── 01_protein # protein data│ │ ├── crd # coordinates│ │ │ ├── cofactors_crystalwater.pdb # cofactors and cyrstal waters (might be empty if there are none) │ │ │ └── protein.pdb # aminoacid residues │ │ └── top # topology(s)│ │ │ └── amber99sb-star-ildn-mut.ff # force field spec. │ │ │ ├── cofactors_crystalwater.top# Gromacs TOP file of cofactors and crystal water (might be empty if there are none)│ │ │ ├── protein.top # Gromacs TOP file of amino acid residues│ │ │ └── *.itp # Gromacs ITP file(s) to be included in TOP files│ └── 02_ligands # ligands│ ├── lig_<name_1> # ligand 1 │ │ ├── crd # coordinates│ │ │ └── lig_<name_1>.sdf # SDF file│ │ └── top # topology(s)│ │ └── openff-1.0.0.offxml # force field spec. │ │ ├── fflig_<name_1>.itp # Gromacs ITP file : atom types │ │ ├── lig_<name_1>.itp # Gromacs ITP file │ │ ├── lig_<name_1>.top # Gromacs TOP file │ │ └── posre_lig_<name_1>.itp # Gromacs ITP file : position restraint file │ ├── lig_<name_2> # ligand 2 │ … │ └── 03_hybrid # edges (perturbations)│ ├── edge_<name_1>_<name_2> # edge between ligand 1 and ligand 2 │ │ └── water # edge in water │ │ ├── crd # coordinates │ │ │ ├── mergedA.pdb # merged conf based on coords of ligand 1 │ │ │ ├── mergedB.pdb # merged conf based on coords of ligand 2 │ │ │ ├── pairs.dat # atom mapping │ │ │ └── score.dat # similarity score │ │ └── top # topology(s) │ │ └── openff-1.0.0.offxml # force field spec. │ │ ├── ffmerged.itp # Gromacs ITP file │ │ ├── ffMOL.itp # Gromacs ITP file │ │ └── merged.itp # Gromacs ITP file
│ │ └── complex # edge in complex│ │ ├── crd # coordinates │ │ │ ├── complex.pdb # merged ligand conformation in complex │ │ │ ├── ions1.pdb # solvated complex with water and non xtal ions│ │ │ ├── ions2 # 2nd solvated complex (differing water and ion coordinates) │ │ │ | │ │ │ |│ │ └── top # topology(s) │ │ └── openff-1.0.0.offxml # force field spec. with gromacs topology files │ … ├── <date>_<target_name_2> # directory for target 2 …
More details and later versions can be found in the protein-ligand benchmark github repository:
https://github.com/openforcefield/protein-ligand-benchmark/
Original publications using the targets in free-energy calculations with the same starting conformations:
- V. Gapsys et al., Large scale relative protein ligand binding affinities using non-equilibrium alchemy, Chem. Sci., 2020,11, 1140-1152
- Christina E. M. Schindler et al., Large-Scale Assessment of Binding Free Energy Calculations in Active Drug Discovery Projects, J. Chem. Inf. Model. 2020, 60, 11, 5457–5474
- Laura Perez Benito et al., Predicting Activity Cliffs with Free-Energy Perturbation, J. Chem. Theory Comput. 2019, 15, 3, 1884–1895
Files
Files
(5.0 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:3f37668f1ccb76b1455ecf5280176192
|
5.0 GB | Download |