Published March 20, 2025 | Version 1.1
Dataset Open

Computation-Ready Experimental Metal-Organic Framework (CoRE MOF) 2024 Dataset

  • 1. ROR icon Pusan National University
  • 2. ROR icon Georgia Institute of Technology
  • 3. ROR icon University of Minnesota
  • 4. ROR icon University of California, Berkeley
  • 5. ROR icon University of Toronto
  • 6. ROR icon Northwestern University
  • 7. ROR icon Massachusetts Institute of Technology
  • 8. ROR icon Institut de Recherche de Chimie Paris
  • 9. Campus Innovation Paris d'Air Liquide
  • 10. École Normale Supérieure / PSL University
  • 11. Centre National de la Recherche Scientifique
  • 12. Chimie ParisTech, PSL University
  • 13. ROR icon IMDEA Materials
  • 14. ROR icon Oak Ridge National Laboratory

Description

Updates

March 20th, 2025: Added 12089-recommended-screening-list.csv file, which lists unique CR MOFs (ASR, FSR, and ION) from SI, CSD-modified, and CSD-unmodified datasets. While originating from the same source file, the ASR and FSR data differ because ASR structures have the coordinated solvent removed from the open metal sites (OMS), whereas FSR structures retain the solvent coordinated with the OMS.

 

Web interface for the CoRE MOF SI dataset

https://mof-db.pusan.ac.kr

Full CoRE MOF DB (40,837) = CoRE MOF SI (8,300) + CoRE MOF CSD-modified (20,276) + CoRE MOF CSD-unmodified (12,261)

The dataset is the public version of the CoRE MOF database updated in 2024 ("CoRE MOF SI"), which includes 2,664 computation-ready (CR) and 5,636 not computation-ready (NCR) MOF CIF files (total = 8,300 structures) and precomputed material properties. The dataset includes structures reported up to 12/31/2023 (manuscript acceptance date).

The dataset, based on the structures obtained from the Cambridge Structural Database (CSD) updated in 2024 ("CoRE MOF CSD"), is split into two datasets (unmodified CIFs and modified CIFs).

 

1. To obtain modified CIFs from CoRE MOF CSD (9,835 CR and 10,441 NCR), please go:

https://www.ccdc.cam.ac.uk/support-and-resources/downloads/

You will need a valid email to log in to the CCDC website to download the dataset for free.

2. To obtain unmodified CIFs from CoRE MOF CSD (4,703 CR and 7,558 NCR), please go:

https://www.ccdc.cam.ac.uk/support-and-resources/downloads/

You will need a CCDC license to obtain the unmodified CIFs.

 

Precomputed properties: pore limiting diameter (PLD), largest cavity diameter (LCD), pore volume (PV), framework dimensions, accessible surface area, crystal density, topology, open metal site, MOFidv1, MOFidv2, DDEC06 partial atomic charges from PACMAN model, heat capacity, decomposition temperature, probability of solvent removal stability, probability of water stability, hydrophobic classification based on GEMC 

 

Dataset Directory Organization

1. CoREMOF2024DB_SI_20250204.zip: dataset with computation-ready (CR) and not-computation-ready (NCR) classifications come from supporting information

  • CR dataset: 2,664
    •     ASR (all solvent removed): 1,372
    •     FSR (free solvent removed): 1,192
    •     Ion (with ions): 100
  • NCR dataset: 5,636
    • Both Chen_Manz and mofchecker: 3,692
    • Chen_Manz: 562
    • mofchecker: 1,073
    • occupancy of a single atom is less than 1: 309

2. ASR_data_SI.csv, FSR_data_SI.csv and ION_data_SI.csv: the information of CoRE MOF 2024 ASR, FSR and Ion datasets.

3. NCR_ASR_SI.xlsx, NCR_FSR_SI.xlsx and NCR_ION_SI.xlsx: details of all structures by Chen_Manz and mofchecker for each NCR cases.

4. unmodified_check_for_NCR_SI.xlsx: whether the NCR structures are unmodified according to comparing with the original structure

5. mofid-v2.zip: XYZ files of linkers and metal nodes, errors (which is an "unknown" MOFid)

6. water.zip: GEMC water isotherm data of CR dataset

7. TSA.zip: Single isotherms of 35 MOFs used in TSA; TSA results and adsorption data at different feed conditions

8. ASR_FSR_check.csv: duplicated MOFs from ASR & FSR datasets. We recommend that the researcher remove the structures from this list (one of the columns) for high-throughput screening

9. 12089-recommended-screening-list.csv: lists unique CR MOFs from ASR, FSR, and ION datasets.

Files

12089-recommended-screening-list.csv

Files (335.5 MB)

Name Size Download all
md5:7887c53f0ebeea1142dfc5bf1403f7e2
330.7 kB Preview Download
md5:c02235ce5e3843257bb561911b07d0fa
1.1 MB Preview Download
md5:47b074978286715566d5987e44efaaf7
217.4 kB Preview Download
md5:240444c92c1868ee131ab7b059f45b05
44.0 MB Preview Download
md5:5a7737add7b5e01f3cbf35aa8ca6fbb4
649.3 kB Preview Download
md5:15a86925553b95d3ac81d0c8da1f07c7
55.5 kB Preview Download
md5:e797a8250c57ab6a39c4d47ee2b10032
4.5 MB Preview Download
md5:6b647e538213b41ca4dcf97d8f542e69
172.1 kB Download
md5:6dddbd239c84b20f9e9ef3c336dcf6c9
187.3 kB Download
md5:c7f2978769aa6cd6dfb2008c00cb2ec7
23.0 kB Download
md5:db3c6c2955eb1149c81ea09d3f28773d
271.7 MB Preview Download
md5:bd2d204ba48e6b1cdd5336718c5fed60
154.9 kB Download
md5:c358323e014ff217fa8ed859703c2236
12.5 MB Preview Download

Additional details

Software