DAN One Layer Retrievals - Sols 751-4050
Description
DAN One-Layer Retrieval Data Release (Sols 751–4050)
This repository contains processed Dynamic Albedo of Neutrons (DAN) one-layer retrieval products and supporting summary products for Curiosity observations spanning sols 751–4050. The release is organized into region-level summary products and per-location retrieval products. File names use a consistent `site / drive / start_sol / stop_sol` convention so that products from different subdirectories can be matched directly. Retrievals are performed on coadds of all observations at a given site/drive, producing one retrieval per location.
Repository structure
1_layer_DAN_sols_751_4050/
├── regional_summaries/
└── retrieval_products/
regional_summaries/
This directory contains region-scale summary tables and regional overlay figures. Each region contained in the data volume contains three files. One file stores per-observation summary statistics, the second stores bulk region summary statistics and the third is a plot of the regions retrieval results.
Example:
- region_<region>_per_observation_stats_snr5_sub50_hdi68.csv
- region_<region>_region_summary_stats_snr5_sub50_hdi68.csv
- region_<region>_overlay_COMBINED_snr5_sub50.png
The same naming pattern is used across regions.
retrieval_products/
This directory contains the observation-level retrieval products and diagnostics.
retrieval_products/
├── background_subtracted_coadd/
├── coadd_observations/
├── coadd_product/
├── corner_plots/
├── GMMs/
│ ├── unmix_2/
│ │ └── gmm_mix/
│ └── unmix_variable/
│ ├── gmm_mix/
│ └── gmm_selection/
├── MCMC_backend/
├── SNR/
├── times/
└── walker_plots/
Naming convention
Most files share a common stem:
site_<SITE>_drive_<DRIVE>_start_sol_<START>_stop_sol_<STOP>
For example:
site_026_drive_1274_start_sol_542_stop_sol_542
This stem is followed by a product-specific suffix, for example:
- _bg_dat.npy
- _label_matched.txt
- _coadded.npy
- _corner_plot.png
- _gmm_mix.png
- _gmm_selection.png
- _MCMC.h5.zip
- _CETN_SNR.npy
- _CTN_SNR.npy
- _times.npy
- _walker_plot.png
This convention allows all products associated with a given coadded retrieval to be aligned by filename.
Retrieval content
The products in this release correspond to a one-layer retrieval framework. The primary retrieved physical parameters are:
- WEH: water-equivalent hydrogen, in wt%
- Σ_abs: bulk macroscopic neutron absorption cross section (BNACS), in cm^2/g
Posterior diagnostics may also include an additional logf[counts] fit parameter in the MCMC products.
Observation-level products
Each retrieval generally includes one file in each of the subdirectories below.
background_subtracted_coadd/
Files ending in _bg_dat.npy contain the background-subtracted count data used in retrieval processing.
coadd_observations/
Files ending in _label_matched.txt list the DAN observation label or labels contributing to the coadded retrieval product. These files provide the traceability link between the retrieval and the original contributing observation set.
coadd_product/
Files ending in _coadded.npy contain the coadded observation-space product used as the retrieval input.
corner_plots/
Files ending in _corner_plot.png`are posterior diagnostic figures showing parameter distributions and pairwise covariances.
For one-layer retrievals, these figures are expected to show the posterior structure for WEH and Σ_abs, and logf[counts].
GMMs/
This directory contains Gaussian mixture model post-processing products.
- unmix_2/ contains fixed two-component mixture summaries.
- unmix_variable/ contains variable-component mixture exploration products.
Within these directories:
- gmm_mix/ contains posterior unmixing summary plots.
- gmm_selection/ contains model-selection plots used to compare mixture counts.
These products support interpretation of multimodal posterior structure.
MCMC_backend/
Files ending in _MCMC.h5.zip are compressed HDF5 backends containing the archived MCMC chains. These are the primary reproducibility products for users who want to regenerate posterior summaries, diagnostics, or alternate post-processing results.
SNR/
This directory contains per-observation signal-to-noise products:
- *_CETN_SNR.npy
- *_CTN_SNR.npy
These arrays store time-bin-level SNR values associated with the retrieval input data.
times/
Files ending in _times.npy contain the time-bin definitions associated with the DAN die-away measurement.
walker_plots/
Files ending in _walker_plot.png show the evolution of the MCMC walkers as a function of iteration number and are intended for quality control and convergence assessment.
Regional summary products
Per-observation regional statistics
Files named region_<region>_per_observation_stats_snr5_sub50_hdi68.csv contain one row per parameter per observation for all observations assigned to a region. These tables provide per-observation posterior summary statistics and identifying metadata such as sol, site, drive, start/stop sol, and source retrieval filename. Coadded observations are only included in regional summaries if their SNR is greater than or equal to 5.
Summary statistics include quantities such as:
- number of posterior samples
- mean
- standard deviation
- median
- KDE mode
- percentile-based intervals
- mode-centered interval terms
- 68% highest-density interval bounds and width
These files are intended for downstream analysis, filtering, and custom regional comparisons.
Region-level summary statistics
Files named region_<region>_region_summary_stats_snr5_sub50_hdi68.csv contain region-level aggregate summaries for each retrieval parameter. These include statistics derived from the distribution of per-observation medians, modes, interval widths, and mixture-distribution summaries.
These products are designed for compact regional characterization without requiring users to reprocess all observation-level posterior samples.
Regional overlay figures
Files named region_<region>_overlay_COMBINED_snr5_sub50.png show the combined regional posterior-density overlays for the one-layer retrieval parameters. These figures provide a visual summary of how posterior structure varies across all observations within a region.
Processing tags in filenames
Several filenames include processing tags that encode how the products were generated:
- snr5: products generated using an SNR threshold of 5
- sub50: products generated using a 50% posterior subsampling step
- hdi68: summary statistics reported using 68% highest-density intervals
These tags are part of the product identity and should be preserved when referencing or redistributing derivative products.
Recommended use
A typical use pattern is:
1. Start with regional_summaries/ to review region-scale behavior and identify observations of interest.
2. Use coadd_observations/ to trace a retrieval back to the contributing DAN observation labels.
3. Use coadd_product/, background_subtracted_coadd/, SNR/, and times/ for observation-space analysis.
4. Use corner_plots/, walker_plots/, and GMMs/ for posterior interpretation and quality control.
5. Use MCMC_backend/ when full reproducibility or custom post-processing is required.
Acknowledgment
If you use these products in published work, please cite the associated data release and the relevant scientific publications describing the DAN retrieval methodology.
Files
1_layer_DAN_sols_751_4050.zip
Files
(43.6 GB)
| Name | Size | Download all |
|---|---|---|
|
md5:272f7285817aab4c4b55bc6a674a8f09
|
43.6 GB | Preview Download |