Published May 1, 2025 | Version v1
Dataset Open

Dataset for the paper "Advancing Biomolecule Understanding and Design Following Human Instructions"

Creators

  • 1. Zhejiang University

Description

Overview

This dataset was created and released to support the research presented in the following academic paper:

Paper Title: Advancing Biomolecule Understanding and Design Following Human Instructions

Author List: Xiang Zhuang, Keyan Ding, Tianwen Lyu, Yinuo Jiang, Xiaotong Li, Zhuoyi Xiang, Zeyuan Wang, Ming Qin, Kehua Feng, Jike Wang, Qiang Zhang, and Huajun Chen

A preprint of this paper is available on arXiv: https://arxiv.org/abs/2410.07919

This dataset includes data used for training and evaluating our proposed model.

Related Code Repository

The code associated with this dataset and the paper is open-sourced on GitHub, with examples for data loading, preprocessing, model implementation, and evaluation:

GitHub Repository Link: https://github.com/HICAI-ZJU/InstructBioMol

Files

molecule-text.zip

Files (41.8 GB)

Checksum                              Size
md5:d3c903d8e4ab5ed607090d0a599e956a  9.1 GB
md5:7a5bc783de89bcfeb55a0bc655f55efe  7.2 MB
md5:a4444bf6fa40e4700c1bc26df6351a08  122.5 MB
md5:b9ebe757429f915bf5726d328afea9cc  16.0 MB
md5:698d75872b59aeb3bbf8999281edf5bf  344.2 MB
md5:c65b3202e57d9c5e9993e1d33576022d  197.0 MB
md5:c3b8ab31513e82ed5465e3cbfe2b8cee  31.8 GB
md5:35662609e2961f0449326358751c62a6  91.2 MB
md5:8b1add7c28ccf92816534a1d9a36dd11  47.7 MB
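
Because several of these archives are multi-gigabyte downloads, it may be worth checking each file against its listed MD5 checksum before unpacking it. The following is a minimal sketch of such a check using only the Python standard library. The helper names md5_of_file and matches_listing are hypothetical, as is the local path in the usage comment; the mapping from each checksum to a file name must be taken from the record page itself.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks
    so that large archives never have to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed):
    """Compare a file's digest with a listed value such as 'md5:d3c9...'.

    The optional 'md5:' prefix and letter case are ignored.
    """
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected


# Example usage (hypothetical local path; substitute the checksum that the
# record page lists for the file you actually downloaded):
#
#     ok = matches_listing("downloads/molecule-text.zip", "md5:<listed value>")
#     print("checksum OK" if ok else "checksum MISMATCH, re-download the file")
```

A mismatch usually indicates a truncated or corrupted download rather than a problem with the dataset, so re-downloading the affected file is the first thing to try.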