Published October 30, 2023 | Version v1
Journal article Open

Unsupervised Discovery of Extreme Weather Events Using Universal Representations of Emergent Organization

  • 1. ROR icon Pacific Northwest National Laboratory

Description

A record of code and data used to produce results for manuscript "Unsupervised Discovery of Extreme Weather Events Using Universal Representations of Emergent Organization" by Adam Rupe, Karthik Kashinath, Nalini Kumar, and James P. Crutchfield.

Includes DisCo source code, original run scripts used on the Cori supercomputer at NERSC, LBNL, as well as conda environments with required dependencies. The local causal state segmentation fields computed on Cori are included so that figures and analyses can be reproduced without HPC resources. 

Contents

Data

  • netcdf_data.zip
    Contains CAM5.1 netcdf files with core climate variables.
  • IVT_netcdfs.zip
    Contains netcdf files with Integrated Vapor Transport fields computed from the fields in netcdf_data.zip
  • twodimturb.zip
    A netcdf file with the vorticity field from two-dimensional free-decay turbulence.
  • reduced_npy_data.zip
    Numpy arrays of 4x reduced-resolution climate data (integrated vapor field and mid-column velocity components)
  • jupyter_trim.npy
    Numpy ndarray of integer grayscale of interpolated RGB data from the NASA Cassini spacecraft
  • IVT_alt-result-16.zip
    Local causal state segmentation results used as ".../IVT_alt/result-16/fields/" in climate notebooks. 
  • IVT-result-8.zip
    Local causal state segmentation results used as ".../IVT/result-8/fields/" in climate notebooks.
  • TC_seg_field.npy
    Numpy ndarray of TECA TC segmentation output. 
  • TECA_BARD.zip
    Netcdf files of TECA BARD AR segmentation output. 
  • LCS_1deg_reduced-full-3.zip
    Numpy arrays of local causal state segmentation output for the low-resolution climate data.
  • turb-result-15.zip
    Numpy arrays of local causal state segmentation output for the two-dimensional turbulence data.
  • vortex_counts-15.npy
    Output of union-find algorithm counting individual vortices from turb-results-15.zip turbulence segmentation output. 

Notebooks

  • climate_figs.ipynb
    Reproduces majority of climate related figures in the manuscript.
  • climate_old.ipynb
    Reproduces some older figures, some of which are used in Supplementary Information.
  • climate-low-res.ipynb
    Creates the low-resolution numpy arrays from the CAM5.1 netcdf files, and also creates related figures.
  • extreme_precipitation.ipynb
    Performs extreme precipitation analysis.
  • jupyter_figs.ipynb
    Creates figures for Jupiter atmosphere segmentation. 
  • TECA-compare.ipynb
    Creates figures from TECA segmentations. 
  • turbulence_figs.ipynb
    Creates turbulence figures and vortex decay analyses. 

Src

  • pdisco.py
    Python source code with core DisCo algorithms for distributed reconstruction of local causal states. 
  • visuals.py
    Python source code for visualizing spacetime fields using matplotlib.

Test-single-node

  • turb_small.npy
    Small sample of turbulence data used to test the DisCo local causal state reconstruction pipeline on a single machine. 
  • single-node-turb.ipynb
    A full pipeline test of local causal state reconstruction that can run on single machine (e.g. a laptop). 
  • single-node-turb.py
    A python script version of single-node-turb.ipynb. 

Scripts

Collection of python and SLURM run scripts used for experiments run on the Cori supercomputer. 

Param-logs

Logs of parameters and metadata used for various experiments run on the Cori supercomputer. The run numbers used for particular figures or analyses are indicated in the relevant notebook by the data being loaded.

Conda environment files

  • disco-deps.yml
    Simple list of basic dependencies needed to run DisCo local causal state reconstruction code. 
  • disco-env.yml
    Full conda environment details for running DisCo code on a single machine at time of this record's publication. 
  • disco-cori-env.yml
    Record of conda environment used on the Cori supercomputer for producing the segmentation results shown in the manuscript.

Files

IVT-netcdfs.zip

Files (34.0 GB)

Name Size Download all
md5:408305a29649b3c01da266d9cc6f2002
1.9 kB Download
md5:60b2f810943f3f2de324c9c18dcae2c5
634 Bytes Download
md5:b92c14014921573b06dad7e29440f715
16.1 kB Download
md5:cff34c33b826a44100a825a9f43fe95e
7.8 GB Preview Download
md5:4a796372bd2c3ea555e6f2159ce39568
90.9 MB Preview Download
md5:0cacf1d504b0c6cc899e74be55a391e3
24.9 MB Preview Download
md5:bccff88d2795b887231066776ca6d3fe
484.5 MB Download
md5:9049e4fad17d2276b32f84f2c14a35ba
2.7 MB Preview Download
md5:25e77da872718d530f58d7fd979a2d3b
23.4 GB Preview Download
md5:85afbe6e828be77eb013664368728444
15.9 MB Preview Download
md5:eb5cc0a22041073b6848af2eda57c698
39.4 kB Preview Download
md5:e992c066fc592b90f51fbf1b499f6add
308.6 MB Preview Download
md5:71e5a69be3bb19a70906d613caf02e7b
28.4 kB Preview Download
md5:b659d84efdba4a7cad31885b036cfcc5
36.9 kB Preview Download
md5:4d22cbb83d64f14121ec76ac00dab5e5
442.4 MB Download
md5:770ba440fadbfb94e66926d055a28a8d
99.2 MB Preview Download
md5:bf97d2a54d1cba19a7218c6577df1634
76.8 MB Preview Download
md5:1c857ee51ad8f9986ecff66bc4263c65
35.0 MB Preview Download
md5:812cecf3abc6b06a7d83566b70c57645
1.2 GB Preview Download
md5:4a9581ef202718a67600c81d9826b0d7
6.0 kB Download

Additional details

Related works

Is described by
Preprint: arXiv:2304.12586 (arXiv)