There is a newer version of the record available.

Published May 21, 2025 | Version v2
Dataset Open

SHNITSEL - Surface Hopping Nested Instances Training Set for Excited-state Learning

  • 1. ROR icon Leipzig University
  • 2. ROR icon Friedrich-Alexander-Universität Erlangen-Nürnberg

Description

SHNITSEL

The Surface Hopping Nested Instances Training Set for Excited-State Learning (SHNITSEL) is a comprehensive data repository designed to support the development and benchmarking of excited-state dynamics methods.

Configuration Space

SHNITSEL contains datasets for nine organic molecules that represent a diverse range of photochemical behaviors. The following molecules are included in the dataset:

  • Alkenes: ethene (A01), propene (A02), 2-butene (A03)
  • Ring structures: fulvene (R01), 1,3-cyclohexadiene (R02), tyrosine (R03)
  • Other molecules: methylenimmonium cation (I01), methanethione (T01), diiodomethane (H01)

Property Space

These datasets provide key electronic properties for singlet and triplet states, including energies, forces, dipole moments, transition dipole moments, nonadiabatic couplings, and spin-orbit couplings, computed at the multi-reference ab initio level. The data is categorized into static and dynamic data, based on its origin and purpose.

  • Static data (#147,169 data points in total) consists of sampled molecular structures without time-dependent information, covering relevant vibrational and conformational spaces. These datasets are provided for eight molecules: A01, A02, A03, R01, R03I01, T01, and H01
  • Dynamic data (#271,700 data points in total) originates from surface hopping simulations and captures the evolution of molecular structures and properties over time, as they propagate on potential energy surfaces according to Newton’s equations of motion. These datasets are provided for five molecules: A01, A02A03, R02, and I01

Data Structure and Workflow

The data is stored in xarray format, using xarray.Dataset objects for efficient handling of multidimensional data. Key dimensions include electronic states, couplings, atoms, and time frames for dynamic data. The dataset is scalable and compatible with large datasets, stored in NetCDF4 format within HDF5 for optimal performance.

An overview of the molecular structures and visualizations of key properties (from trajectory data) are compiled on the SHNITSEL webpage (https://shnitsel.github.io/).

Files

A01_75731.zip

Files (690.8 MB)

Name Size Download all
md5:fab7b159843bc097bfe036131a0ec535
42.3 MB Preview Download
md5:c5590fc35fa2e7f5e9c981b35841554b
23.5 MB Preview Download
md5:f1725c66822552477985f43439a36113
91.0 MB Preview Download
md5:fa0e5650731eedb20393b7bcb03762a8
150.9 MB Preview Download
md5:4beaa442fcadc1cc78dd8ea1e46e750b
36.5 MB Preview Download
md5:7671d1d6475435e4ca769a0107a6805a
17.0 MB Preview Download
md5:55ca7780613712a88ceaea7ea414eb4a
175.8 MB Preview Download
md5:71f012889a2f73c09ddcdf5ffe75ac17
150.6 MB Preview Download
md5:9f4b37df02e8dd32fe06a04bf799920e
3.2 MB Preview Download

Additional details

Funding

Friedrich-Alexander-Universität Erlangen-Nürnberg
Emerging Talents Initiative ETI_2024-2_Nat_09_Mueller
Friedrich-Alexander-Universität Erlangen-Nürnberg
EAM Starting Grant EAM-SG24-01

Software

Repository URL
https://github.com/SHNITSEL/shnitsel-tools
Programming language
Python
Development Status
Active