Published June 16, 2025 | Version v1
Other Open

Supplementary data for "scalable emulation of protein equilibrium ensembles with generative deep learning"

Description

Supplementary data for Biomolecular Emulator

Overview

This repository contains supplementary data for Biomolecular Emulator:

Lewis, S.; Hempel, T.; Jiménez-Luna, J.; Gastegger, M.; Xie, Y.; Foong, A. Y. K.; Satorras, V. G.; Abdin, O.; Veeling, B. S.; Zaporozhets, I.; Chen, Y.; Yang, S.; Foster, A. E.; Schneuing, A.; Nigam, J.; Barbero, F.; Stimper, V.; Campbell, A.; Yim, J.; Lienen, M.; Shi, Y.; Zheng, S.; Schulz, H.; Munir, U.; Sordillo, R.; Tomioka, R.; Clementi, C.; Noé, F. Scalable Emulation of Protein Equilibrium Ensembles with Generative Deep Learning. Science 2025. https://doi.org/10.1126/science.adv9817.
 

Supplementary code

This repository contains a snapshot of the bioemu inference and benchmark code,

We strongly recommend using the linked GitHub repositories for up-to-date code, installation instructions, and further information. This Zenodo repository represents a static snapshot of supplementary data for the publication.

The code is licensed under the MIT License.
 

Model checkpoints

The model checkpoints that were used to produce the results for the publication are given in
  • bioemu_finetuned_step_4421376000_epoch_00100.ckpt: Default BioEmu model used throughout the publication
  • desres-models.zip: fast folder model, Fig. 3 A (for developers, please do not use for inference unless explicitly intended)
The model checkpoints are licensed under the MIT License.
 

Supplementary structures

This repository contains the molecular structures that were used to generate the results in the above publication.

The supplementary structures are licensed under Community Data License Agreement - Permissive - Version 2.0.

Figure panel data assignment:

The samples_*.zip files map to the paper figures as follows:

  • samples_cath.zip: Fig. 3 B, Fig. S10 A
  • samples_cath-scaling-001.zip: Fig. 3 B; Fig. S10 C
  • samples_cath-scaling-010.zip: Fig. 3 B; Fig. S10 C
  • samples_cath-scaling-100.zip: Fig. 3 B; Fig. S10 C
  • samples_delta_G.zip: Fig. 4 B, C and F
  • samples_desres_model.zip: Fig. 3 A; Fig. S9
  • samples_e2e_multiconf_crypticpocket.zip: Fig. S6
  • samples_e2e_multiconf_domainmotion.zip: Fig. S6
  • samples_e2e_multiconf_ood60_local.zip: Fig. S6
  • samples_e2e_singleconf_localunfolding.zip: Fig. S6
  • samples_examples_fig3.zip: Fig. 3 C and D; Fig 12
  • samples_multiconf_crypticpocket.zip: Fig. 2 C; Fig. S4
  • samples_multiconf_domainmotion.zip: Fig. 2 A; Fig. S2
  • samples_multiconf_ood60_local.zip: Fig. S1
  • samples_singleconf_localunfolding.zip: Fig. 2 B; Fig. S3
  • samples_stable_proteins.zip: Fig 4 D
  • samples_unstable_proteins.zip: Fig 4 E

Data loading

The data can be loaded with common molecular dynamics tools, e.g.,
with mdtraj:

import mdtraj
samples = mdtraj.load("path/to/samples.xtc", top="path/to/topology.pdb")

Files

bioemu-benchmarks_code_release.zip

Files (9.0 GB)

Name Size Download all
md5:dd8f50a68aa8e01751069f0845966d66
14.6 MB Preview Download
md5:8c81538d61c286c4a11fb2a038fb75ca
5.1 MB Preview Download
md5:4a6af470b7be40ceeabdc44e266d3ce5
312.1 MB Download
md5:474bfdef0dfc1e74ee85d7859fcf9ffb
1.5 GB Preview Download
md5:95f5b84c57f6bdf7ba07abe6ff341762
221.5 MB Preview Download
md5:d68847f319742b71faa52be51d19a228
221.5 MB Preview Download
md5:7884666ddde9f90fb946296d9ed34a8e
221.5 MB Preview Download
md5:51617c0c1b17b57ad921087b61a8ceb3
221.5 MB Preview Download
md5:30c285d2a503fb3436cefda86c75a5c6
2.0 GB Preview Download
md5:5d5522bb8e7536f427e9392752865324
111.4 MB Preview Download
md5:a8b88a474d377885c38de1512cc2a4bf
800.5 MB Preview Download
md5:e9c4401ab45900af06908ae1e11cafaf
784.9 MB Preview Download
md5:7b743a8d00270ebd855b3599d53d26a1
334.7 MB Preview Download
md5:5a05aa4e474054c63c56ea3d5b56f7c1
291.0 MB Preview Download
md5:2fc0dca334257735d2e15d3890690a16
70.8 MB Preview Download
md5:ecdbe83823362d494744902222fcb59f
800.0 MB Preview Download
md5:64a86bf1c07cf0ac8c16d9ada10b4ad5
784.5 MB Preview Download
md5:d1f8b5792281e2186d85cf389daf2f2e
334.4 MB Preview Download

Additional details

Software