Published November 14, 2023 | Version v1
Dataset Open

CryoSTAR Tutorial Dataset

  • 1. ROR icon ByteDance (China)
  • 2. ByteDance Research

Description

This dataset accompanies the CryoSTAR tutorial and serves as a demonstration dataset for the CryoSTAR framework. It provides a minimal, reproducible example used to illustrate the workflow of the proposed method.

The dataset is organized into three main sub-directories:
pdbs/ – Contains 50 PDB files (1akeA_{i}.pdb, i = 1 … 50), representing interpolated conformations between the adenylate kinase structures 4ake and 1ake. Both share identical amino-acid sequences but adopt different conformations. The interpolations were generated using PyMOL’s morph tool.
mrcs/ – Contains 50 MRC density maps (1akeA_{i}.mrc), each corresponding to its paired PDB file. The densities were generated from the atomic models using EMAN2’s e2pdb2mrc utility.
uniform_snr0-0001_ctf/ – Contains particle stacks simulated from the MRC densities with random orientations. CTF distortions and Gaussian noise were applied to mimic realistic cryo-EM acquisition conditions.

Additionally, reconstructed volumes are included for demonstration:
rln.mrc – Reconstruction from all particles combined.
rln_reconstruct/rln{i}.mrc – Reconstruction from particles generated from the i-th MRC density.

This dataset is intended solely for educational and methodological demonstration of the CryoSTAR pipeline. It illustrates each step of the generative reconstruction process, but does not represent experimentally acquired cryo-EM data.

Files

tutorial_data_1ake.zip

Files (3.4 GB)

Name Size Download all
md5:455d7b0991754cd26975e8fa46a65106
3.4 GB Preview Download

Additional details

Related works

Is supplement to
Journal article: 10.1038/s41592-024-02486-1 (DOI)

Software

Repository URL
https://github.com/bytedance/cryostar
Programming language
Python
Development Status
Active