Published July 1, 2025 | Version v2
Dataset Open

Research data supporting: "Relevant, hidden, and frustrated information in high-dimensional analyses of complex dynamical systems with internal noise"

  • 1. ROR icon Polytechnic University of Turin

Description

This repository contains the set of data shown in the paper "Relevant, hidden, and frustrated information in high-dimensional analyses of complex dynamical systems with internal noise", published on JCTC (DOI: 10.1021/acs.jctc.5c00374).

The scripts contained herein are:

  1.  PCA-Analysis.py:  python script to calculate the SOAP descriptor, denoising it, and compute the Principal Component Analysis
  2. SOAP-Component-Analysis.py: python script to calculate the variance of the single SOAP components
  3. Hierarchical-Clustering.py: python script to compute the hierarchical clustering and plot the dataset
  4. OnionClustering-1d.py: script to compute the Onion clustering on a single SOAP component or principal component
  5. OnionClustering-2d.py: script to compute bi-dimensional Onion clustering
  6. OnionClustering-plot.py: script to plot the Onion plot, removing clusters with population <1%
  7. UMAP.py: script to compute the UMAP dimensionality reduction technique
  8. TiCA.py: script to compute the TiCA dimensionality reduction technique
  9. VAMPnets.py: script to compute the VAMPnets dimensionality reduction technique

To reproduce the data of this work you should start form SOAP-Component-Analysis.py to calculate the SOAP descriptor and select the components that are interesting for you, then you can calculate the PCA with PCA-Analysis.py, and applying the clustering based on your necessities (OnionClustering-1d.py, OnionClustering-2d.py, Hierarchical-Clustering.py). Further modifications of the Onion plot can be done with the script: OnionClustering-plot.py. UMAP can be calculated with UMAP.py, TiCA with TiCA.py, and VAMPnets with VAMPnets.py.

 

Additional data contained herein are:

  1. starting-configuration.gro: gromacs file with the initial configuration of the ice-water system
  2. traj-ice-water-50ns-sampl4ps.xtc: trajectory of the ice-water system sampled every 4 ps
  3. traj-ice-water-50ns-sampl40ps.xtc: trajectory of the ice-water system sampled every 40 ps
  4. some files containing the SOAP descriptor of the ice-water system: ice-water-50ns-sampl40ps.hdf5, ice-water-50ns-sampl40ps_soap.hdf5, ice-water-50ns-sampl40ps_soap.npy, ice-water-50ns-sampl40ps_soap-spavg.npy
  5. PCA-results: folder that contains some example results of the PCA
  6. UMAP-results: folder that contains some example results of UMAP
  7. TiCA-results: folder that contains some example results of TiCA
  8. VAMPnets-results: folder that contains some example results of VAMPnets

The data related to the Quincke rollers can be found here: https://zenodo.org/records/10638736

Files

HiddenInformation-v2.zip

Files (22.0 GB)

Name Size Download all
md5:aeaa07e31495c8e90657b8f8f5e226b5
22.0 GB Preview Download

Additional details

Related works

Is supplement to
Publication: 10.1021/acs.jctc.5c00374 (DOI)

Funding

European Commission
DYNAPOL - Modeling approaches toward bioinspired dynamic materials 818776

Software

Programming language
Python