TheJacksonLab/OMG_PhysicalProperties: OMG-Property-Database
Description
Monomer-level properties for 12M synthetically accessible polymers in the Open Macromolecular Genome derived via quantum chemistry.
[1] Duplicated data in QM calculated OMG monomers (Feb 14, 2025)
Please apply "drop_duplicates" for "methyl_terminated_product" before constructing a dataset for your purpose.
There are 10 duplicated methyl terminated monomers among 32,529 QM-calculated OMG methyl terminated geometries (pareto greedy) and 5 duplicates among 15,147 (test). There are 15 overlapping methyl terminated monomers between the train and test dataset. Regardless of duplicates in "methyl_terminated_product", these duplicates do not have a significant impact on the active learning test performance.
There are no duplicates in "product" (ended with asterisks to indicate the repeating points), but some duplicates in "methyl terminated monomers" (asterisks replaced by methyl groups). This can happen when added methyl groups make different repeating units have the same methyl terminated structure.
Files
OthersWithREADME.zip
Files
(52.1 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:90e597ce2c0b33482f79ce63fb0e4f4e
|
6.0 GB | Preview Download |
|
md5:ca66dcf39a69414a2152a8df805124ab
|
9.5 GB | Preview Download |
|
md5:d5c8e2c624e337b91181163bf9e86e29
|
12.8 GB | Preview Download |
|
md5:3dcf94fc45ce2a24b79742f08fc9f830
|
5.6 GB | Preview Download |
|
md5:5c703d4cca39ee2b4a488ce9d7535b16
|
5.4 GB | Preview Download |
|
md5:2aa885150cd762a62c9c39af7d3336d0
|
12.6 GB | Preview Download |
|
md5:eaa567b954db0c3abeafc2e6d5e7af42
|
190.5 MB | Preview Download |
Additional details
Related works
- Is supplement to
- Software: https://github.com/TheJacksonLab/OMG_PhysicalProperties/tree/OMG_property_v1.0 (URL)
Software
- Programming language
- Python
- Development Status
- Active