Published March 25, 2026 | Version v1
Dataset Open

DAN One Layer Retrievals - Sols 751-4050

  • 1. Arizona State University

Description

DAN One-Layer Retrieval Data Release (Sols 751–4050)

This repository contains processed Dynamic Albedo of Neutrons (DAN) one-layer retrieval products and supporting summary products for Curiosity observations spanning sols 751–4050. The release is organized into region-level summary products and per-location retrieval products. File names use a consistent `site / drive / start_sol / stop_sol` convention so that products from different subdirectories can be matched directly. Retrievals are performed on coadds of all observations at a given site/drive, producing one retrieval per location.

Repository structure

1_layer_DAN_sols_751_4050/
├── regional_summaries/
└── retrieval_products/

regional_summaries/

This directory contains region-scale summary tables and regional overlay figures. Each region in the data volume has three files: one stores per-observation summary statistics, the second stores bulk region summary statistics, and the third is a plot of the region's retrieval results.

Example:

- region_<region>_per_observation_stats_snr5_sub50_hdi68.csv
- region_<region>_region_summary_stats_snr5_sub50_hdi68.csv
- region_<region>_overlay_COMBINED_snr5_sub50.png

The same naming pattern is used across regions.

retrieval_products/

This directory contains the observation-level retrieval products and diagnostics.

retrieval_products/
├── background_subtracted_coadd/
├── coadd_observations/
├── coadd_product/
├── corner_plots/
├── GMMs/
│   ├── unmix_2/
│   │   └── gmm_mix/
│   └── unmix_variable/
│       ├── gmm_mix/
│       └── gmm_selection/
├── MCMC_backend/
├── SNR/
├── times/
└── walker_plots/

Naming convention

Most files share a common stem:

site_<SITE>_drive_<DRIVE>_start_sol_<START>_stop_sol_<STOP>

For example:

site_026_drive_1274_start_sol_542_stop_sol_542

This stem is followed by a product-specific suffix, for example:

- _bg_dat.npy
- _label_matched.txt
- _coadded.npy
- _corner_plot.png
- _gmm_mix.png
- _gmm_selection.png
- _MCMC.h5.zip
- _CETN_SNR.npy
- _CTN_SNR.npy
- _times.npy
- _walker_plot.png

This convention allows all products associated with a given coadded retrieval to be aligned by filename.
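As an illustration of aligning products by filename, the stem can be split into its fields with a regular expression. This is a minimal sketch: the function name `parse_product_name` and the `ProductName` tuple are hypothetical helpers, not part of the release, and the pattern assumes the stem fields are always plain integers.

```python
import os
import re
from typing import NamedTuple, Optional

# Pattern for the shared stem; everything after the stop sol is the suffix.
STEM_RE = re.compile(
    r"site_(?P<site>\d+)_drive_(?P<drive>\d+)"
    r"_start_sol_(?P<start_sol>\d+)_stop_sol_(?P<stop_sol>\d+)"
    r"(?P<suffix>.*)"
)


class ProductName(NamedTuple):
    """Hypothetical container for the parsed filename fields."""
    site: int
    drive: int
    start_sol: int
    stop_sol: int
    stem: str
    suffix: str


def parse_product_name(path: str) -> Optional[ProductName]:
    """Parse a product filename; return None if it does not follow the stem."""
    name = os.path.basename(path)
    m = STEM_RE.match(name)
    if m is None:
        return None
    return ProductName(
        site=int(m["site"]),
        drive=int(m["drive"]),
        start_sol=int(m["start_sol"]),
        stop_sol=int(m["stop_sol"]),
        stem=name[: m.start("suffix")],  # text before the suffix
        suffix=m["suffix"],
    )
```

Two files belong to the same coadded retrieval when their parsed `stem` values are equal, regardless of which subdirectory they come from.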

Retrieval content

The products in this release correspond to a one-layer retrieval framework. The primary retrieved physical parameters are:

- WEH: water-equivalent hydrogen, in wt%
- Σ_abs: bulk macroscopic neutron absorption cross section (BNACS), in cm^2/g

Posterior diagnostics may also include an additional logf[counts] fit parameter in the MCMC products.

Observation-level products

Each retrieval generally includes one file in each of the subdirectories below.

background_subtracted_coadd/

Files ending in _bg_dat.npy contain the background-subtracted count data used in retrieval processing.

coadd_observations/

Files ending in _label_matched.txt list the DAN observation label or labels contributing to the coadded retrieval product. These files provide the traceability link between the retrieval and the original contributing observation set.

coadd_product/

Files ending in _coadded.npy contain the coadded observation-space product used as the retrieval input.

corner_plots/

Files ending in _corner_plot.png are posterior diagnostic figures showing parameter distributions and pairwise covariances.

For one-layer retrievals, these figures are expected to show the posterior structure for WEH, Σ_abs, and logf[counts].

GMMs/

This directory contains Gaussian mixture model post-processing products.

- unmix_2/ contains fixed two-component mixture summaries.
- unmix_variable/ contains variable-component mixture exploration products.

Within these directories:

- gmm_mix/ contains posterior unmixing summary plots.
- gmm_selection/ contains model-selection plots used to compare mixture counts.

These products support interpretation of multimodal posterior structure.

MCMC_backend/

Files ending in _MCMC.h5.zip are compressed HDF5 backends containing the archived MCMC chains. These are the primary reproducibility products for users who want to regenerate posterior summaries, diagnostics, or alternate post-processing results.
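Before a backend can be opened it has to be unpacked. The sketch below uses only the Python standard library to extract one archive; the function name `extract_backend` is a hypothetical helper, and the number and names of members inside each zip are not stated in this description, so the code simply returns whatever it finds.

```python
import zipfile
from pathlib import Path


def extract_backend(zip_path, out_dir):
    """Extract all members of one *_MCMC.h5.zip archive.

    Returns the paths of the extracted files (directories are skipped).
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as zf:
        members = [n for n in zf.namelist() if not n.endswith("/")]
        zf.extractall(out_dir)
    return [out_dir / n for n in members]
```

The internal layout of the HDF5 file is not documented here. If the chains were written with emcee's `HDFBackend` (an assumption, not confirmed by this description), the extracted file could likely be read with `emcee.backends.HDFBackend(path, read_only=True).get_chain()`; otherwise a generic HDF5 reader such as h5py can be used to inspect the groups and datasets.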

SNR/

This directory contains per-observation signal-to-noise products:

- *_CETN_SNR.npy
- *_CTN_SNR.npy

These arrays store time-bin-level SNR values associated with the retrieval input data.

times/

Files ending in _times.npy contain the time-bin definitions associated with the DAN die-away measurement.

walker_plots/

Files ending in _walker_plot.png show the evolution of the MCMC walkers as a function of iteration number and are intended for quality control and convergence assessment.

Regional summary products

Per-observation regional statistics

Files named region_<region>_per_observation_stats_snr5_sub50_hdi68.csv contain one row per parameter per observation for all observations assigned to a region. These tables provide per-observation posterior summary statistics and identifying metadata such as sol, site, drive, start/stop sol, and source retrieval filename. Coadded observations are only included in regional summaries if their SNR is greater than or equal to 5.

Summary statistics include quantities such as:

- number of posterior samples
- mean
- standard deviation
- median
- KDE mode
- percentile-based intervals
- mode-centered interval terms
- 68% highest-density interval bounds and width
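For readers unfamiliar with the last item, a highest-density interval can be estimated from posterior samples as the narrowest window that contains the requested fraction of them. The sketch below shows one common sample-based estimator in pure Python; it is not necessarily the procedure used to produce the tables in this release, and it returns a single interval, which is only appropriate for unimodal posteriors.

```python
import math


def hdi(samples, mass=0.68):
    """Narrowest interval containing a fraction `mass` of the samples.

    Returns (lower, upper, width). Assumes a unimodal distribution.
    """
    xs = sorted(samples)
    n = len(xs)
    if n == 0:
        raise ValueError("hdi() needs at least one sample")
    # Number of consecutive sorted samples the interval must cover.
    k = max(1, math.ceil(mass * n))
    # Slide a window of k samples and keep the one with the smallest span.
    best = min(range(n - k + 1), key=lambda i: xs[i + k - 1] - xs[i])
    lower, upper = xs[best], xs[best + k - 1]
    return lower, upper, upper - lower
```

Unlike a percentile-based interval, this window is not forced to be symmetric in probability about the median, so it shifts toward the mode when the posterior is skewed.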

These files are intended for downstream analysis, filtering, and custom regional comparisons.
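Because the per-observation tables hold one row per parameter per observation, a first step in downstream analysis is often to split the rows by parameter. The sketch below does this with the standard `csv` module. The column names are not listed in this description, so the name of the parameter column is passed in as an argument rather than assumed; `group_rows_by_parameter` is a hypothetical helper.

```python
import csv
from collections import defaultdict


def group_rows_by_parameter(csv_file, parameter_column):
    """Group the rows of a per-observation table by parameter.

    `csv_file` is an open text file (or any iterable of lines) and
    `parameter_column` is the header of the column that names the
    parameter; inspect the file header to find the actual name.
    """
    reader = csv.DictReader(csv_file)
    if parameter_column not in (reader.fieldnames or []):
        raise KeyError(f"column {parameter_column!r} not found in header")
    groups = defaultdict(list)
    for row in reader:
        groups[row[parameter_column]].append(row)
    return dict(groups)
```

A typical call would open one `region_<region>_per_observation_stats_snr5_sub50_hdi68.csv` file with `open(path, newline="")` and pass the handle in. All values are returned as strings, so numeric columns need an explicit `float()` conversion.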

Region-level summary statistics

Files named region_<region>_region_summary_stats_snr5_sub50_hdi68.csv contain region-level aggregate summaries for each retrieval parameter. These include statistics derived from the distribution of per-observation medians, modes, interval widths, and mixture-distribution summaries.

These products are designed for compact regional characterization without requiring users to reprocess all observation-level posterior samples.

Regional overlay figures

Files named region_<region>_overlay_COMBINED_snr5_sub50.png show the combined regional posterior-density overlays for the one-layer retrieval parameters. These figures provide a visual summary of how posterior structure varies across all observations within a region.

Processing tags in filenames

Several filenames include processing tags that encode how the products were generated:

- snr5: products generated using an SNR threshold of 5
- sub50: products generated using a 50% posterior subsampling step
- hdi68: summary statistics reported using 68% highest-density intervals

These tags are part of the product identity and should be preserved when referencing or redistributing derivative products.
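When derivative products are generated programmatically, the tags can be read back out of a filename so they are carried forward unchanged. A minimal sketch, assuming the tags always appear as `snr`, `sub`, or `hdi` followed by an integer and delimited by underscores or the file extension; `parse_processing_tags` is a hypothetical helper and `example` is a placeholder region name.

```python
import re

# A tag is one of the three known prefixes followed by digits, bounded by
# an underscore (or string start) on the left and "_", "." or end on the right.
TAG_RE = re.compile(r"(?:^|_)(snr|sub|hdi)(\d+)(?=_|\.|$)")


def parse_processing_tags(filename):
    """Return the processing tags found in a filename as {tag: value}."""
    return {tag: int(value) for tag, value in TAG_RE.findall(filename)}


# parse_processing_tags("region_example_per_observation_stats_snr5_sub50_hdi68.csv")
# returns {'snr': 5, 'sub': 50, 'hdi': 68}
```

Overlay figures carry only the `snr` and `sub` tags, so the returned dictionary for those files has no `hdi` key.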

Recommended use

A typical use pattern is:

1. Start with regional_summaries/ to review region-scale behavior and identify observations of interest.
2. Use coadd_observations/ to trace a retrieval back to the contributing DAN observation labels.
3. Use coadd_product/, background_subtracted_coadd/, SNR/, and times/ for observation-space analysis.
4. Use corner_plots/, walker_plots/, and GMMs/ for posterior interpretation and quality control.
5. Use MCMC_backend/ when full reproducibility or custom post-processing is required.
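Steps 2 through 5 all start from a single filename stem, so it can help to list every expected product path for that stem up front. The sketch below builds those paths from the directory tree and suffix list given earlier. The pairing of suffixes with subdirectories follows the descriptions in this document, but two points are assumptions: that each file sits directly inside its subdirectory, and that `_gmm_mix.png` and `_gmm_selection.png` files live in the `gmm_mix/` and `gmm_selection/` folders of the same name. `product_paths` is a hypothetical helper.

```python
from pathlib import Path

# Subdirectory (relative to retrieval_products/) -> suffixes expected there.
PRODUCT_LAYOUT = {
    "background_subtracted_coadd": ["_bg_dat.npy"],
    "coadd_observations": ["_label_matched.txt"],
    "coadd_product": ["_coadded.npy"],
    "corner_plots": ["_corner_plot.png"],
    "GMMs/unmix_2/gmm_mix": ["_gmm_mix.png"],              # assumed location
    "GMMs/unmix_variable/gmm_mix": ["_gmm_mix.png"],        # assumed location
    "GMMs/unmix_variable/gmm_selection": ["_gmm_selection.png"],  # assumed
    "MCMC_backend": ["_MCMC.h5.zip"],
    "SNR": ["_CETN_SNR.npy", "_CTN_SNR.npy"],
    "times": ["_times.npy"],
    "walker_plots": ["_walker_plot.png"],
}


def product_paths(root, stem):
    """List the expected product paths for one stem under the release root."""
    base = Path(root) / "retrieval_products"
    return [
        base / subdir / f"{stem}{suffix}"
        for subdir, suffixes in PRODUCT_LAYOUT.items()
        for suffix in suffixes
    ]
```

Filtering the result with `Path.exists()` shows which products are actually present for a retrieval, which is useful given that each retrieval only generally includes one file per subdirectory.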

Acknowledgment

If you use these products in published work, please cite the associated data release and the relevant scientific publications describing the DAN retrieval methodology.

Files

1_layer_DAN_sols_751_4050.zip (43.6 GB)
md5:272f7285817aab4c4b55bc6a674a8f09