Dataset for the paper "Advancing Biomolecule Understanding and Design Following Human Instructions"
Description
Overview
This dataset was created and released to support the research presented in the following academic paper:
Paper Title: Advancing Biomolecule Understanding and Design Following Human Instructions
Author List: Xiang Zhuang, Keyan Ding, Tianwen Lyu, Yinuo Jiang, Xiaotong Li, Zhuoyi Xiang, Zeyuan Wang, Ming Qin, Kehua Feng, Jike Wang , Qiang Zhang, and Huajun Chen
A preprint of this paper is available on ArXiv: https://arxiv.org/abs/2410.07919
This dataset includes data used for training and evaluating our proposed model.
Related Code Repository
The code associated with this dataset and the related paper "Advancing Biomolecule Understanding and Design Following Human Instructions" is open-sourced on GitHub, providing examples for data loading, preprocessing, and model implementation and evaluation:
GitHub Repository Link: https://github.com/HICAI-ZJU/InstructBioMol
Files
molecule-text.zip
Files
(41.8 GB)
Name | Size | Download all |
---|---|---|
md5:d3c903d8e4ab5ed607090d0a599e956a
|
9.1 GB | Preview Download |
md5:7a5bc783de89bcfeb55a0bc655f55efe
|
7.2 MB | Preview Download |
md5:a4444bf6fa40e4700c1bc26df6351a08
|
122.5 MB | Preview Download |
md5:b9ebe757429f915bf5726d328afea9cc
|
16.0 MB | Preview Download |
md5:698d75872b59aeb3bbf8999281edf5bf
|
344.2 MB | Preview Download |
md5:c65b3202e57d9c5e9993e1d33576022d
|
197.0 MB | Preview Download |
md5:c3b8ab31513e82ed5465e3cbfe2b8cee
|
31.8 GB | Preview Download |
md5:35662609e2961f0449326358751c62a6
|
91.2 MB | Preview Download |
md5:8b1add7c28ccf92816534a1d9a36dd11
|
47.7 MB | Preview Download |