There is a newer version of the record available.

Published September 20, 2024 | Version v1
Dataset Open

Datasets for understanding the importance of conformation in property prediction models

  • 1. ROR icon Nara Institute of Science and Technology
  • 2. Nara Institute of Science and Technology(NAIST)

Description

Descriptor and conformer data sets for molecular property and reaction selectivity prediction tasks. The PQC data set was created based on a part of the PubChemQC PM6 dataset (J. Chem. Inf. Model. 2020, 60, 12, 5891–5899), which contains two- and three-dimensional descriptors and conformers. The APTC data sets are based on the data sets for asymmetric phase transfer catalysts with enantio-selectivity (https://github.com/Laboratoire-de-Chemoinformatique/3D-MIL-QSSR/tree/main/datasets). They contained the descriptors and conformers to train and validate machine learning models.

Detailed explanations on how to use these datasets are found in the Github repository: https://github.com/YuHamakawa/Conformation-Importance-ML-Models

 

 

Files

dataset.zip

Files (3.3 GB)

Name Size Download all
md5:b810f472ab829be963ee7aebb4372a40
3.3 GB Preview Download