Published July 4, 2024 | Version v1
Dataset Open

Training data for ARPESNet

  • 1. Aarhus University

Description

Datasets used for training the ARPESNet autoencoder.

Datasets:

The training and test data are stored in the .zip files. Unzipping these will provide the files created by saving PyTorch tensors. These can be loaded with the python code:

import torch

data = torch.load("filename.pt")

 where filename can be any of the files in this repository.

train_data.zip & test_data.zip: training and testing data, respectively. Collection of 256x256 ARPES spectra (images) obtained by randomly slicing 3D of 28 (train) and 18 (test) high resolution angle scans, covering 19 material systems: Au(110), Au(111), Bi2Se3(111), CoO2, on Au(111), CrSBr(001), Graphene on Ir(111), Graphene on Ru(0001), single-layer MoS2 on Au(111), single-layer NbSe2 on bilayer graphene, NdTe3(010), Pd(100), Pd(111), Pt(111), Rb-doped Bi2Se3(111), Ru(0001), P-δ-layer on Si(001), single-layer TaS2 on Au(111), single-layer WS2 on Ag(111) and single-layer WS2on Au(111).

Each file corresponds to one material system, for which 500 images were generated, resulting in a tensor of shape 500x256x256 each.

test_imgs.pt: 6 ARPES spectra used for visual inspection and performance test of the ARPESNet autoencoder.

cluster_centers.pt: ARPES spectra extracted by slicing an angle scan obtained measuring a Bi2Se3 crystal. These are used to generate simulated nanoARPES maps for testing clustering performance.

dataset_info.csv: tabluar data describing the single datasets, their use in test or training and appropriate citation to the source publication wherre the data was first published.

This repository contains the data related to the publication

Steinn Ýmir Ágústsson, Mohammad Ahsanul Haque, Thi Tam Truong, Marco Bianchi, Nikita Klyuchnikov, Davide Mottin, Panagiotis Karras, Philip Hofmann; An autoencoder for compressing angle-resolved photoemission spectroscopy data. Mach. Learn.: Sci. Technol. 6 015019 (2025) DOI: 10.1088/2632-2153/ada8f2

Please cite the paper above in case of re-use of these data in a scientific publication.

Files

dataset_info.csv

Files (10.3 GB)

Name Size Download all
md5:b49d05b71663596c30ce938f2c8c5974
1.3 MB Download
md5:795ff81a3230bc5fb94369dd86d4e53c
6.7 kB Preview Download
md5:3601e54277c106646543411c1029885b
4.0 GB Preview Download
md5:760a13db48091e3c90e3fd6376de0b34
2.6 MB Download
md5:6f6abb115b068fa0a9a955e623512886
6.3 GB Preview Download

Additional details

Related works

Is derived from
Dataset: 10.5281/zenodo.12665275 (DOI)
Is published in
Publication: 10.1088/2632-2153/ada8f2 (DOI)

Software

Repository URL
https://github.com/ARPES-on-the-fly/arpesnet
Programming language
Python
Development Status
Active