Computation-Ready Experimental Metal-Organic Framework (CoRE MOF) 2019 Dataset
Creators
- 1. Pusan National University
- 2. University of Minnesota Twin Cities
- 3. Northwestern University
- 4. IMDEA
- 5. University of Nottingham
- 6. Georgia Institute of Technology
- 7. University College London
Description
High-throughput computational screening of metal-organic frameworks rely on the availability of atomic coordinate files which can be used as input to simulation software packages. CoRE MOF Datasets are derived from Cambridge Structural Database (CSD) and also from the World Wide Web.
Nomenclatures:
LCD (Largest Cavity Diameter), PLD (pore limiting diameter), LFPD (Largest Sphere along the Free Path), ASA (Accessible Surface Area), NASA (Non-accessible surface area), AV_VF (Void Fraction, 0 - 1), NAV (Non Accessible Volume)
Dataset Directory Organization
CoREMOF2019_public_v2.zip: dataset with CR and NCR classifications
1. ASR: all solvent removed
- CR dataset: computaion-ready (N = 6,603)
- NCR dataset: not computaion-ready (N = 5,417)
- Both: NCR determined by Chen_Manz and mofchecker (N = 2,597)
- Chen_Manz: NCR determined by Chen_Manz (N = 958)
- mofchecker: NCR determined by mofchecker (N = 1,859)
- PACMAN_fail: NCR determined by Chen_Manz and mofchecker, and fail to predict PACMAN charges (N = 3)
2. FSR: free solvent removed
- CR: computaion-ready (N = 3,764)
- NCR: not computaion-ready (N = 3,297)
- Both: NCR determined by Chen_Manz and mofchecker (N = 1,646)
- Chen_Manz: NCR determined by Chen_Manz (N = 463)
- mofchecker: NCR determined by mofchecker (N = 1,185)
- PACMAN_fail: NCR determined by Chen_Manz and mofchecker, and fail to predict PACMAN charges (N = 3)
2. NCR_detail.xlsx: details of all structures by mofchecker and Chen_Manz for each NCR cases
November, 13 2024
- Classification of Computation-Ready (CR) and Not Computation-Ready (NCR) Structures based on Chen & Manz (RSC Adv., 2020,10, 26944-26951) and MOFChecker program by Kevin M. Jablonka)
- ML-predicted DDEC6 partial atomic charges based on PACMAN
March, 2 2023
- Added topology data from ToposPro for part of the ASR dataset (10,024)
February, 21 2020
- Unmodified filenames are available in ASR-full_unmodified.csv and FSR-full_unmodified.csv
November, 4 2019
- Added a spreadsheet for the full internal version of ASR datasets (N = 14,142).
- Moved additional disordered structures to disorder folder (thanks to Matthew Witman and Martin Hutereau)
August, 17 2019
- FIJDOS_freeONLY has been corrected.
Acknowledgements
- This reserach is supported by the National Research Foundation of Korea (No. 2016R1D1A1B3934484, NRF-2020R1C1C1010373, RS-2024-00449431)
- This research is supported by the U.S. Department of Energy, Office of Basic Energy Sciences, Division of Chemical Sciences, Geosciences and Biosciences under Award DE-FG02-17ER16362 (Predictive Hierarchical Modeling of Chemical Separations and Transformations in Functional Nanoporous Materials: Synergy of Electronic Structure Theory, Molecular Simulations, Machine Learning, and Experiment)
Files
ASR-full_unmodified.csv
Files
(86.7 MB)
Name | Size | Download all |
---|---|---|
md5:bdba0d3b80f0680cd0cf57cb64383849
|
263.3 kB | Preview Download |
md5:5a56771b09eaeac517cc0b993d6bc343
|
1.7 MB | Preview Download |
md5:8493218da5f97725a1fa887815d171b2
|
2.0 MB | Preview Download |
md5:23964294d098314c2a3c411d93fb712f
|
81.0 MB | Preview Download |
md5:23b331647df0ea7dc40c86fcd10351e3
|
197.4 kB | Preview Download |
md5:79c3a3c14a3d852de40fcc0e28bf4727
|
987.0 kB | Preview Download |
md5:c13f9f1bc00221b579cf060b25283e86
|
521.4 kB | Download |
Additional details
Related works
- Is supplemented by
- 10.1021/acs.jctc.4c00434 (DOI)
- 10.1039/D0RA02498H (DOI)