Published November 8, 2024 | Version v1
Dataset Open

CsSn(Cl/Br/I)3 Perovskite Alloy DFT Dataset for Machine Learning

  • 1. Aalto University
  • 2. Technical University of Munich

Description

This upload contains density functional theory (DFT) calculations of the CsSn(Cl/Br/I)3 perovskite alloy. The calculations were performed for a study in which the DFT data was used to train an energy-predicting machine learning model for CsSn(Cl/Br/I)3. The code related to the study is available on GitLab (https://gitlab.com/cest-group/learnsolar-cssnclbri).

The data is divided into four data sets. For each set, the atomic structures, together with their total energies and forces, are provided in ASE (Atomic Simulation Environment) extended XYZ format. Additional information on the atomic structures (e.g. space groups) is provided in JSON format. The data sets are:

sp_train_set
Single-point DFT calculations of 16 000 algorithmically generated CsSn(Cl/Br/I)3 structures in four space groups: Pm-3m, P4/mbm, I4/mcm, and Pnma. Lattice parameters and atomic positions are determined through Vegard's law, with random deviations added to the atomic positions, the tilting angles of the Sn coordination octahedra, the cell volume, the cell height-to-width ratio, and some lattice vector angles. Cl/Br/I configurations are randomized. This data set was used to fit an initial machine learning model. The atomic structures included were selected using a clustering algorithm to accelerate learning.

sp_test_set
Single-point DFT calculations of 2 600 atomic structures similar to those in sp_train_set. The Cl/Br/I compositions are uniformly represented, with two atomic structures per composition and space group. This data set was used for testing the machine learning model.

al_data
DFT relaxation structure snapshots from the active learning run that was performed to improve the machine learning model's structure relaxation accuracy. There are 4230 structure snapshots in total.

relax_test_set
100 DFT relaxations used for testing the relaxation accuracy of the machine learning model. There are 2881 structure snapshots in total. Both initial (relax_test_set_initial.xyz) and final (relax_test_set_relaxed.xyz) atomic geometries are included.
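
As a quick sanity check after downloading, the number of structures in each extended XYZ file can be compared with the counts given above (16 000, 2 600, 4230, and 2881). The following is a minimal sketch that uses only the Python standard library and assumes the files follow the usual XYZ layout: one line with the atom count, one comment line holding the per-structure metadata, then one line per atom. The function name count_xyz_frames is hypothetical and is not part of the study's code. To obtain full structure objects with energies and forces, the files can instead be read with ASE, for example with ase.io.read(path, index=":").

```python
def count_xyz_frames(path):
    """Return the atom count of every frame in an (extended) XYZ file.

    Assumes the standard layout: an atom-count line, a comment line,
    then that many atom lines, repeated for each structure.
    """
    atom_counts = []
    with open(path, encoding="utf-8") as handle:
        while True:
            header = handle.readline()
            if not header:          # end of file
                break
            if not header.strip():  # tolerate blank lines between frames
                continue
            n_atoms = int(header)
            handle.readline()       # comment line (lattice, energy, ...)
            for _ in range(n_atoms):
                if not handle.readline():
                    raise ValueError(f"truncated frame in {path}")
            atom_counts.append(n_atoms)
    return atom_counts


# Example usage (file name taken from the description above):
# frames = count_xyz_frames("relax_test_set_initial.xyz")
# print(len(frames), "structures,", set(frames), "atoms per structure")
```

The length of the returned list is the number of structures in the file, and the individual entries show whether all structures share the same cell size.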

Files


Files (105.2 MB)

Checksum Size
md5:5610bb4529f1a30a561aad4f167d5810 456.9 kB
md5:516019b2774c6104b769d08b3210998e 19.1 MB
md5:8fe5b6709df9a5f6df2a735ddf13d95d 2.0 kB
md5:2b35585d40909c110b628f6c87f02d62 8.4 kB
md5:6cec001aaadc3fef274ee3ff63b74fb0 450.2 kB
md5:e1d83c0bedb693c9df48046e38e9c7ac 450.6 kB
md5:672052e616fb2ebca668b3566bf0a622 174.8 kB
md5:661fe42b2d454dcba229fce3276c63e8 11.6 MB
md5:43d2ac7be984808861271abba239f679 1.4 MB
md5:436a09244e484feec9a47790dbb4a754 71.6 MB
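
The MD5 checksums listed above can be used to confirm that a download completed without corruption. The following is a minimal sketch using Python's standard hashlib module. The function names md5_of_file and matches_listed_md5 are hypothetical, and because the listing here does not show which checksum belongs to which file, the pairing of file and checksum has to be taken from the original record page.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_md5(path, listed):
    """Compare a file against a checksum written as 'md5:<hex digest>'."""
    expected = listed.strip().lower()
    if expected.startswith("md5:"):
        expected = expected[len("md5:"):]
    return md5_of_file(path) == expected


# Example usage (the file/checksum pairing below is a placeholder):
# ok = matches_listed_md5("downloaded_file", "md5:436a09244e484feec9a47790dbb4a754")
# print("checksum matches" if ok else "checksum mismatch, download again")
```

Reading in chunks keeps memory use low for the larger files in the record, the largest of which is 71.6 MB.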

Additional details

Related works

Is derived from
Dataset: 10.17172/NOMAD/2024.11.08-1 (DOI)