Published December 21, 2021 | Version v2
Dataset Open

Equivariant analytical mapping of first principles Hamiltonians to accurate and transferable materials models

  • 1. University of British Columbia
  • 2. University of Warwick
  • 3. CNRS
  • 4. Indian Institute of Engineering Science and Technology-Shibpur

Description

Supporting data for https://arxiv.org/abs/2111.13736.

ACEhamiltonians.jl code

This is an archived copy of the ACEhamiltonians.jl code to accompany the paper arXiv:2111.13736.

See https://github.com/ACEsuit/ACEhamiltoniansExamples for examples of how to use this code.

The code is written in Julia and requires v1.6 or later. To install the Julia depenendencies:

$ cd ACEhamiltonians.jl
$ julia
julia> import Pkg
julia> Pkg.activate(".")
julia> Pkg.instantiate()

The scripts test/plots.jltest/fcc-to-bcc.jl and test/vacancy.jl which produce all the plots in the paper can then run as, e.g.

julia --project=. test/plots.jl

Training data

The training_data folder contains the atomic structure, Hamiltonian and overlap matrices stored in HDF5 format with the following schema:

  • Data Group : aitb/
  • Datasets :
    • H : Real-space Hamiltonian Matrix. Type: Float64. Shape: Tensor(# of TB Cells, # of Rows, # of Columns)
    • S : Real-space Overlap Matrix. Type: Float64. Shape: Tensor(# of TB Cells, # of Rows, # of Columns)
    • energy : Energy. Unit: eV. Type: Float64. Shape: Scalar
    • freeenergy : Free Energy. Unit: eV. Shape: Scalar
    • unitcell : Unit cell vectors. Type: Float64. Shape: Matrix(3,3)
    • positions : Atom positions. Type: Float64. Shape: Array(3)
    • forces : (Optional, if available) Forces. Type: Float64. Shape: Array(3)
    • metadata : JSON String including dictionary of information of FHIaims calculation (k-points, basis sets), TB Cells, Cutoff, Orbital definitions.,

The molecular dynamics and FHI-aims parameters are described in the manuscript.

On-site models

The onsite_models_ord2 folder contains our correlation order 2 models for the on site blocks of the Hamiltonian, in a JSON format readable by the ACE.jl and ACEhamiltonians.jl Julia packages. There are separate files for the Hamiltonian (*_H.json) and overlap (*_S.json) models. The JSON files also contain training and test sets and associated errors as plotted in Figure 3 in our manuscript.

Models have a unique identifier (UUID) which is a hash of the input parameters and training data. The mapping from (order, max_degree) to UUID is as follows:

(2,4)  - 13427527590286463256
(2,5)  - 10538156191357510769
(2,6)  - 1646489440533135164
(2,7)  - 12130775482127724115
(2,8)  - 12487060958610974041
(2,9)  - 2653067664384673997
(2,10) - 1143382251563115664
(2,11) - 4564001820340015372
(2,12) - 9474261500251782658

Off-site models

The offsite_models_ord1 and offsite_models_ord2 folders contain our order 1 and order 2 offsite models for Hamiltonian and overlap matrices. The mapping from (H_order, H_max_degree) + (S_order, S_max_degree) to UUID is as follows:

(1,6) + (1,8)    -    7014526518680934587
(1,7) + (1,9)    -    8594416159488562244
(1,8) + (1,10)   -    10204186688118368371
(1,9) + (1,11)   -    13078304848585360574
(1,10)+ (1,12)   -    14750835312950641338
(1,11)+ (1,13)   -    9883802224093245794
(1,12)+ (1,14)   -    3907899412408606585
(1,13)+ (1,15)   -    201683837542179657
(1,14)+ (1,16)   -    277744202775070779
(2,6) + (1,8)    -    4699475053563592071
(2,7) + (1,9)    -    489637409713831432
(2,8) + (1,10)   -    18034631670613263469
(2,9) + (1,11)   -    720654516759450160
(2,10)+ (1,12)   -    15214900801060024044
(2,11)+ (1,13)   -    13798832597295943078
(2,12)+ (1,14)   -    13162803789413134473

FCC only

Onsite models:

2  6 5311732756869418284
2  7 13030014632886405308
2  8 5820099621734447846
2  9 10161014511878227635
2 10 11298425190201843107
2 11 9932031839231628354
2 12 9447261873515969583

Optimised FCC model 16110190062237887798

BCC only

Onsite models:

2  6 8949023800586845770
2  7 8045797268444730200
2  8 6919809282139600809
2  9 9935027806122780319
2 10 6376963380608532713
2 11 5001375576268070883
2 12 9678585765722197901

Optimised BCC model 10293566074413000591

FCC+BCC optimised models

Onsite models

2  6 2154760103892646619
2  7 6450474921309693835
2  8 14227277988574899288
2  9 476820595195218567
2 10 5364136683220082110
2 11 14619519825012606580
2 12 14181614899005838824

Offsite FCC+BCC optimised model - 4570230078043807257

Model errors

The model_errors directory contains summarised model errors for the training and testing errors for the models listed above.

Reference data

Reference electronic structure data computed for the BCC and FCC crystals, along the Bain path and for the relaxed vacancy is stored in the reference_data folder. The Hamiltonian and overlap matrices are stored as compressed binary HDF5 files. The format and metadata can be viewed with the h5dump utility, or read in using the supplied Julia code (or indeed from other languages).

Predicted data

The predicted_data/FCC and predicted_data/BCC folders contain HDF5 files with the results of all model predictions shown in the manuscript on the FCC and BCC crystal structures. predicted_data/FCC-to-BCC contains the results of predictions along the Bain path with the optimized model described in the manuscript and predicted_data/vacancy contains the vacancy calculations.

Files

ACE-hamiltonians-supporting-data.zip

Files (9.2 GB)

Name Size Download all
md5:0dfc94a212c832e80f301f0e99ed34e8
9.2 GB Preview Download

Additional details

Funding

UK Research and Innovation
Boundary Conditions for Atomistic Simulation of Material Defects EP/R043612/1
UK Research and Innovation
Proposal for a Tier 2 Centre - HPC Midlands Plus EP/P020232/1
European Commission
NOMAD CoE - Novel Materials Discovery 951786
UK Research and Innovation
Support for the UKCP consortium EP/P022065/1
UK Research and Innovation
Computational prediction of hot-electron chemistry: Towards electronic control of catalysis MR/S016023/1