Published January 30, 2026
| Version v1
Dataset
Open
QC Fitting Datasets for OpenFF SMIRNOFF Sage 2.3.0
Description
Description
A quantum chemical (QC) optimization and torsiondrive datasets generated at the OpenFF default level of theory, B3LYP-D3BJ/DZVP, and curated to train parameters in OpenFF 2.3.0 Sage with NAGL partial charge model AshGC v1.0.
The records compiled in this dataset are a subset sourced from the QCArchive datasets listed in the Technical Information section, were there is overlap with the datasets used to train previous versions of Sage (see references).
Additional details can be found in the GitHub dataset repository and the Force field repository.
General Information
* Date: 2026-01-27
* Name: OpenFF SMIRNOFF Sage 2.3.0
* Dataset submitter: Jennifer A Clark
* Dataset curator: Lily Wang
* Class: OpenFF Optimization Dataset
* Dataset Type: optimization
* Purpose: B3LYP-D3BJ/DZVP conformers for training OpenFF 2.3.0 Sage with AshGC v1.0 NAGL partial charge model.
* Number of unique molecules: 4696
* Number of conformers: 4696
* Number of conformers (min, mean, max): 1.00, 1.00, 1.00
* Molecular weight (min, mean, max): 32.05, 207.67, 878.25
* Charges: -4.0, -3.0, -2.0, -1.0, 0.0, 1.0, 2.0
* Class: OpenFF TorsionDrive Dataset
* Dataset Type: torsiondrive
* Purpose: B3LYP-D3BJ/DZVP conformers for training torsion drives for OpenFF 2.3.0 Sage with AshGC v1.0 NAGL partial charge model.
* Number of unique molecules: 1265
* Number of driven torsions: 1371
* Number of conformers: 1265
* Number of conformers (min, mean, max): 1, 1, 1
* Molecular weight (min, mean, max): 32.05, 165.33, 511.27
* Charges: -3.0, -2.0, -1.0, 0.0, 1.0, 2.0
Notes
Technical info
Optimization Datasets
The optimization records were selected to maximize chemical diversity using a selection of record IDs listed in the Sage 2.3.0 repository. These records came from the following datasets:
Torsiondrive Datasets
The optimization records were selected to maximize chemical diversity using a selection of record IDs listed in the Sage 2.3.0 repository. These records came from the following datasets:
Files
README.txt
Additional details
Software
- Repository URL
- https://github.com/openforcefield/ash-sage-rc2
References
- Boothroyd, S., Maat, J., Jang, H., Stern, C. D., Behara, P. K., Qiu, Y., Tjanaka, B., Madin, O., Hahn, D., Gapsys, V., Horton, J. T., Dotson, D., Gokey, T., Jennifer, C., Mitchell, J., Wang, L., Shirts, M., Cole, D., Chodera, J., & Mobley, D. (2025). QC Fitting Datasets for OpenFF SMIRNOFF Sage 2.2.0 [Data set]. Zenodo. https://doi.org/10.5281/zenodo.15635099
- Boothroyd, S., Maat, J., Jang, H., Stern, C. D., Behara, P. K., Qiu, Y., Tjanaka, B., Madin, O., Hahn, D., Gapsys, V., Horton, J. T., Dotson, D., Gokey, T., Clark, J., Mitchell, J., Wang, L., Shirts, M., Cole, D., Chodera, J., & Mobley, D. (2025). QC Fitting Datasets for OpenFF SMIRNOFF Sage 2.1.0 [Data set]. Zenodo. https://doi.org/10.5281/zenodo.15633037
- Boothroyd, S., Maat, J., Jang, H., Behara, P. K., Madin, O., Hahn, D., Gapsys, V., Horton, J. T., Dotson, D., Gokey, T., Clark, J., Mitchell, J., Wang, L., Shirts, M., Cole, D., & Mobley, D. (2025). QC Fitting Datasets for OpenFF SMIRNOFF Sage 2.0.0 [Data set]. In Journal of Chemical Theory and Computation (Vol. 19, pp. 3251–3275). Zenodo. https://doi.org/10.5281/zenodo.15611784