Published December 1, 2024 | Version v2
Software Open

TheJacksonLab/OMG_PhysicalProperties: OMG-Property-Database

  • 1. University of Illinois at Urbana-Champaign

Description

Monomer-level properties for 12M synthetically accessible polymers in the Open Macromolecular Genome derived via quantum chemistry.

[1] Duplicated data in QM calculated OMG monomers (Feb 14, 2025)

Please apply "drop_duplicates" for "methyl_terminated_product" before constructing a dataset for your purpose. 

There are 10 duplicated methyl terminated monomers among 32,529 QM-calculated OMG methyl terminated geometries (pareto greedy) and 5 duplicates among 15,147 (test). There are 15 overlapping methyl terminated monomers between the train and test dataset. Regardless of duplicates in "methyl_terminated_product", these duplicates do not have a significant impact on the active learning test performance.

There are no duplicates in "product" (ended with asterisks to indicate the repeating points), but some duplicates in "methyl terminated monomers" (asterisks replaced by methyl groups). This can happen when added methyl groups make different repeating units have the same methyl terminated structure. 

Files

OthersWithREADME.zip

Files (52.1 GB)

Name Size Download all
md5:90e597ce2c0b33482f79ce63fb0e4f4e
6.0 GB Preview Download
md5:ca66dcf39a69414a2152a8df805124ab
9.5 GB Preview Download
md5:d5c8e2c624e337b91181163bf9e86e29
12.8 GB Preview Download
md5:3dcf94fc45ce2a24b79742f08fc9f830
5.6 GB Preview Download
md5:5c703d4cca39ee2b4a488ce9d7535b16
5.4 GB Preview Download
md5:2aa885150cd762a62c9c39af7d3336d0
12.6 GB Preview Download
md5:eaa567b954db0c3abeafc2e6d5e7af42
190.5 MB Preview Download

Additional details

Software

Programming language
Python
Development Status
Active