Training data for ARPESNet
Authors/Creators
- 1. Aarhus University
Description
Datasets used for training the ARPESNet autoencoder.
Datasets:
The training and test data are stored in the .zip files. Unzipping these will provide the files created by saving PyTorch tensors. These can be loaded with the python code:
import torch
data = torch.load("filename.pt")
where filename can be any of the files in this repository.
train_data.zip & test_data.zip: training and testing data, respectively. Collection of 256x256 ARPES spectra (images) obtained by randomly slicing 3D of 28 (train) and 18 (test) high resolution angle scans, covering 19 material systems: Au(110), Au(111), Bi2Se3(111), CoO2, on Au(111), CrSBr(001), Graphene on Ir(111), Graphene on Ru(0001), single-layer MoS2 on Au(111), single-layer NbSe2 on bilayer graphene, NdTe3(010), Pd(100), Pd(111), Pt(111), Rb-doped Bi2Se3(111), Ru(0001), P-δ-layer on Si(001), single-layer TaS2 on Au(111), single-layer WS2 on Ag(111) and single-layer WS2on Au(111).
Each file corresponds to one material system, for which 500 images were generated, resulting in a tensor of shape 500x256x256 each.
test_imgs.pt: 6 ARPES spectra used for visual inspection and performance test of the ARPESNet autoencoder.
cluster_centers.pt: ARPES spectra extracted by slicing an angle scan obtained measuring a Bi2Se3 crystal. These are used to generate simulated nanoARPES maps for testing clustering performance.
dataset_info.csv: tabluar data describing the single datasets, their use in test or training and appropriate citation to the source publication wherre the data was first published.
This repository contains the data related to the publication
Steinn Ýmir Ágústsson, Mohammad Ahsanul Haque, Thi Tam Truong, Marco Bianchi, Nikita Klyuchnikov, Davide Mottin, Panagiotis Karras, Philip Hofmann; An autoencoder for compressing angle-resolved photoemission spectroscopy data. Mach. Learn.: Sci. Technol. 6 015019 (2025) DOI: 10.1088/2632-2153/ada8f2
Please cite the paper above in case of re-use of these data in a scientific publication.
Files
dataset_info.csv
Files
(10.3 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:b49d05b71663596c30ce938f2c8c5974
|
1.3 MB | Download |
|
md5:795ff81a3230bc5fb94369dd86d4e53c
|
6.7 kB | Preview Download |
|
md5:3601e54277c106646543411c1029885b
|
4.0 GB | Preview Download |
|
md5:760a13db48091e3c90e3fd6376de0b34
|
2.6 MB | Download |
|
md5:6f6abb115b068fa0a9a955e623512886
|
6.3 GB | Preview Download |
Additional details
Related works
- Is derived from
- Dataset: 10.5281/zenodo.12665275 (DOI)
- Is published in
- Publication: 10.1088/2632-2153/ada8f2 (DOI)
Software
- Repository URL
- https://github.com/ARPES-on-the-fly/arpesnet
- Programming language
- Python
- Development Status
- Active