Published February 6, 2024 | Version 2.0
Dataset Open

AIMEl-DB: Atomic Properties for 44K small organic molecules

Description

AIMEl-DB: Atomic Properties for 44K small organic molecules

This dataset comprises atomic properties of 44K (44 470) molecules selected from the QM9 database. The file names are based on the same indexing system used for QM9. 

This dataset includes four types of files:

  • .com Files
    Input files for Gaussian 16. Simple-point energy calculations were carried out using the keywords
    # B3LYP/6-31G(2df,p) scf=(maxcycle=9999) nosymm output=wfx

  • .log Files
    Output files from Gaussian 16 calculation with the aformentioned parameters.

  • .wfx Files
    Wave function files from Gaussian 16 calculation. These files were used as inputs for QTAIM calculations. 

  • .sumviz Files
    Output file from AIMAll software. The keywords used for the calculations were
    aimqb -nogui -scp=false -nproc=8 -naat=4 input.wfx
    Each .sumviz file contains more than 30 properties based on the Quantum Theory of Atoms in Molecules (QTAIM).

  • .csv Files
    These files contain the results of a in-house treament of .sumviz data. They cointain two calculated atomic properties:

    1. Total magnitude of the dipole moment, |mu|
    2. Total magnitude of the quadrupole moment, |Q|

            and two extracted atomic properties:

                  3.  Electronic Population, N
                  4. Atomic Energy, E

 

The aimel_merged_44k.csv presents the concatenation of the 44 470 csv Files.

Additionaly, the aimel_merged_38k.csv presents the concatenation of the 38 876 csv Files. This file corresponds to the version 1.0 of the dataset. 


If you find this dataset useful, please cite the original paper:

Meza-González, B., Ramírez-Palma, D.I., Carpio-Martínez, P. et al. Quantum Topological Atomic Properties of 44K molecules. Sci Data 11, 945 (2024). https://doi.org/10.1038/s41597-024-03723-0

 

 

Files

aimel_merged_44k.csv

Files (28.4 GB)

Name Size Download all
md5:45b18bb2a7da7195e2a26ec156c0c6d2
8.3 MB Download
md5:f9ef10ad5bdfc1b4afde0b63974d5d7d
36.6 MB Download
md5:93c7445d4a6e426c0e3546568b8f7a6f
178.8 MB Download
md5:a653785605ce90979c856ab0754742ed
17.8 GB Download
md5:60ceb52415b6395114164ba7cfd6e3e3
5.1 GB Download
md5:88934ab31ec3092b8249ba3650df32d0
5.1 GB Download
md5:b34749c6b2adabfdf0a5c9b845b41242
81.6 MB Preview Download
md5:f2adc3bcd95d04eaa37294de6e82943f
100.7 MB Preview Download

Additional details

Related works

Is published in
Data paper: 10.1038/s41597-024-03723-0 (DOI)

Funding

Consejo Nacional de Humanidades, Ciencias y Tecnologías
Estructura Electrónica e Inteligencia Artificial Aplicada a Problemas Actuales de Tecnología Química en México CF2019-185 1561802/2020
Universidad Nacional Autónoma de México
Desarrollo de predictores de la reactividad química de moléculas orgánicas PAPIIT IN207822
Universidad Nacional Autónoma de México
Estudio de fenómenos de transferencia electrónica en procesos químicos, utilizando como descriptor la densidad de espín y su laplaciano LANCAD-UNAM-DGTIC-194

Dates

Updated
2024-06-23
44K molecules, including all scaffolds from QM9 dataset