There is a newer version of the record available.

Published October 18, 2025 | Version 1.0.1
Dataset Open

The AutoCorrelation Integral Drill (ACID) Test Set

  • 1. Ghent University
  • 2. Labo Soete, Ghent University, Technologiepark-Zwijnaarde 46, B-9052, Ghent, Belgium
  • 3. FlandersMake@UGent, Core Lab EEDT-MP, 3001 Leuven, Belgium
  • 4. Center for Molecular Modeling (CMM), Ghent University, Technologiepark-Zwijnaarde 46, B-9052, Zwijnaarde, Belgium

Description

This repository contains the scripts and StepUp workflows to regenerate the "AutoCorrelation Integral Drill" (ACID) test set. The ACID test set comprises a diverse collection of algorithmically generated time series designed to evaluate the performance of algorithms that compute the autocorrelation integral. The set contains 15360 test cases in total, each consisting of one or more time series. The cases differ in the kernel characterizing the time correlations, the number of time series, and the length of the time series. For each combination of kernel, number of sequences, and sequence length, 64 test cases are generated with different random seeds (15360 / 64 = 240 combinations), which allows a systematic validation of uncertainty estimates. Once generated, the complete dataset is about 80 GB in size.
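The role of the 64 seeds per combination can be illustrated with a toy calibration check. The sketch below is not the ACID generator and does not use STACIE: it assumes a stand-in estimator (the sample mean of white noise, whose true value is 0 by construction) and a hypothetical series length of 1000 steps. For each seed it computes an estimate and its estimated standard error, then looks at the spread of the normalized errors across all seeds.

```python
import math
import random
import statistics

N_SEEDS = 64     # replicates per combination, as stated in the description
N_STEPS = 1000   # hypothetical series length, chosen only for this sketch


def estimate_mean(seed, n_steps=N_STEPS):
    """Return (estimate, estimated standard error) for one white-noise series."""
    rng = random.Random(seed)
    series = [rng.gauss(0.0, 1.0) for _ in range(n_steps)]
    estimate = statistics.fmean(series)
    std_error = statistics.stdev(series) / math.sqrt(n_steps)
    return estimate, std_error


def normalized_error_spread(n_seeds=N_SEEDS):
    """Standard deviation of (estimate - truth) / std_error over all seeds."""
    true_value = 0.0  # known by construction, as in a synthetic test set
    z_scores = []
    for seed in range(n_seeds):
        estimate, std_error = estimate_mean(seed)
        z_scores.append((estimate - true_value) / std_error)
    return statistics.stdev(z_scores)


if __name__ == "__main__":
    # A value close to 1 suggests well-calibrated error bars; well above 1
    # means they are too small, well below 1 means they are too large.
    print(f"spread of normalized errors: {normalized_error_spread():.2f}")
```

With only 64 replicates the spread itself carries a statistical uncertainty of roughly 9%, so moderate deviations from 1 are expected even for a well-calibrated estimator.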

In addition to the ACID test set, this repository also contains scripts and workflows to validate STACIE, a software package for the computation of the autocorrelation integral. The results of this analysis are discussed in the following paper:

Gözdenur Toraman, Dieter Fauconnier, and Toon Verstraelen, "STable AutoCorrelation Integral Estimator (STACIE): Robust and accurate transport properties from molecular dynamics simulations", Journal of Chemical Information and Modeling 2025, 65 (19), 10445–10464, doi:10.1021/acs.jcim.5c01475, arXiv:2506.20438.

This dataset is distributed under a choice of license: either the Creative Commons Attribution-ShareAlike 4.0 International license (CC BY-SA 4.0) or the GNU Lesser General Public License, version 3 or later (LGPL-v3+). The SPDX License Expression for the documentation is CC-BY-SA-4.0 OR LGPL-3.0-or-later.

You should have received a copy of the CC BY-SA 4.0 and LGPL-v3+ licenses along with the data set. If not, see:

Files

acid-dataset.pdf

Files (14.2 MB)

Checksum                                Size
md5:69bb0383a2e5a7a774d0fadab4005232    195.0 kB
md5:22449197f5884b3a25aac08965bd83a3    20.1 kB
md5:3000208d539ec061b899bce1d9ce9404    7.7 kB
md5:1771672be736b82396051a6b2f10c629    55.3 kB
md5:8de51e2bfcda4dab6010d3f819726e08    355.0 kB
md5:e878a0db06dc877dd4a96588835f23e1    13.6 MB
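The MD5 checksums listed above can be used to verify a download. The following is a minimal sketch using only the Python standard library. The helper names `md5_of_file` and `matches_listing` are hypothetical, and because the listing does not show which checksum belongs to which file, the path passed in is whatever file you downloaded.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large files do not need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, listed_checksum):
    """Compare a file against a listed checksum such as 'md5:69bb...'."""
    # Strip the optional 'md5:' prefix used in the file listing.
    expected = listed_checksum.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

For example, `matches_listing("downloaded_file", "md5:e878a0db06dc877dd4a96588835f23e1")` returns `True` only if the local file (a placeholder path here) has exactly that digest.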

Additional details

Related works

Cites
Software documentation: https://molmod.github.io/stacie (URL)
Compiles
Dataset: https://github.com/molmod/acid (URL)
Is described by
Journal article: 10.1021/acs.jcim.5c01475 (DOI)
Preprint: arXiv:2506.20438 (arXiv)
Requires
Software: https://github.com/molmod/stacie (URL)

Funding

Ghent University
Molecular Dynamics Modelling of Lubricants at Ultra-High Pressures with Force Fields derived Ab Initio
BOF/24J/2021/118