Published November 10, 2025 | Version v1
Dataset Open

A Conformation-Centric Generative Foundation Model for Linear Polymer Modeling and Design

Description

Please note that the polymer conformation dataset for pretraining is compiled entirely within the .lmdb files  (i.e., train.lmdb, valid.lmdb, and test.lmdb).

The isolated .sdf files explicitly visible in certain directories (e.g., data_1010) are merely structural arrangements for the test set, provided solely to facilitate the convenient calculation of various metrics during the evaluation phase.

Files

datasets.zip

Files (3.2 GB)

Name Size Download all
md5:b41b438b80920e3491048bbfa5bd4bbe
3.2 GB Preview Download