There is a newer version of the record available.

Published December 19, 2024 | Version v1
Dataset Open

Research data supporting: "Relevant, hidden, and frustrated information in high-dimensional analyses of complex dynamical systems with internal noise"

  • 1. ROR icon Polytechnic University of Turin

Description

This repository contains the set of data shown in the paper "Relevant, hidden, and frustrated information in high-dimensional analyses of complex dynamical systems with internal noise", published on arXiv (DOI: 10.48550/arXiv.2412.09412).

The scripts contained herein are:

  1.  PCA-Analysis.py:  python script to calculate the SOAP descriptor, denoising it, and compute the Principal Component Analysis
  2. SOAP-Component-Analysis.py: python script to calculate the variance of the single SOAP components
  3. Hierarchical-Clustering.py: python script to compute the hierarchical clustering and plot the dataset
  4. OnionClustering-1d.py: script to compute the Onion clustering on a single SOAP component or principal component
  5. OnionClustering-2d.py: script to compute bi-dimensional Onion clustering
  6. OnionClustering-plot.py: script to plot the Onion plot, removing clusters with population <1%
  7. UMAP.py: script to compute the UMAP dimensionality reduction technique

To reproduce the data of this work you should start form SOAP-Component-Analysis.py to calculate the SOAP descriptor and select the components that are interesting for you, then you can calculate the PCA with PCA-Analysis.py, and applying the clustering based on your necessities (OnionClustering-1d.py, OnionClustering-2d.py, Hierarchical-Clustering.py). Further modifications of the Onion plot can be done with the script: OnionClustering-plot.py. Umap can be calculated with UMAP.py.

 

Additional data contained herein are:

  1. starting-configuration.gro: gromacs file with the initial configuration of the ice-water system
  2. traj-ice-water-50ns-sampl4ps.xtc: trajectory of the ice-water system sampled every 4 ps
  3. traj-ice-water-50ns-sampl40ps.xtc: trajectory of the ice-water system sampled every 40 ps
  4. some files containing the SOAP descriptor of the ice-water system: ice-water-50ns-sampl40ps.hdf5, ice-water-50ns-sampl40ps_soap.hdf5, ice-water-50ns-sampl40ps_soap.npy, ice-water-50ns-sampl40ps_soap-spavg.npy
  5. PCA-results: folder that contains some example results of the PCA
  6. UMAP-results: folder that contains some example results of UMAP

The data related to the Quincke rollers can be found here: https://zenodo.org/records/10638736

Files

HiddenInformation.zip

Files (21.9 GB)

Name Size Download all
md5:28a9759ba19e185a51ac13290c23fc44
21.9 GB Preview Download

Additional details

Funding

European Commission
DYNAPOL – Modeling approaches toward bioinspired dynamic materials 818776

Software

Programming language
Python