ESFlow: source code, sample data, and benchmark outputs
Description
This Zenodo record is a frozen snapshot of ESFlow, a module-grounded agentic AI framework for Earth system model (ESM) analysis, together with the sample data and full per-model benchmark outputs required to reproduce the results reported in the companion manuscript:
Zhou, T., Qian, Y., and Leung, L. R.: Can We Trust LLMs for Complex Earth System Model Analysis? Silent Failure and Evidence from Module-Grounded Benchmarking, submitted to Geoscientific Model Development (GMD), 2026.
The record contains two zip files:
- esflow-gmd-v1.zip: the ESFlow framework, the 24-tool library for E3SM land-surface and hydrology analysis, the benchmark runner, the self-debug experiment runner, the structural grading pipeline, benchmark task definitions, reference workflows, and per-model benchmark outputs.
- esflow_sample_data.zip: the sample data required to execute the reference workflows, including a representative subset of E3SM model output (land and river components), GRDC streamflow observations and basin polygons, and the ILAMB-derived observation files used in the benchmark workflows:
- GPCCv2018 precipitation: esflow_sample_data/ilamb/pr/GPCCv2018/pr.nc
- MODIS evapotranspiration: esflow_sample_data/ilamb/evspsbl/MODIS/et_0.5x0.5.nc
- LORA runoff: esflow_sample_data/ilamb/mrro/LORA/LORA.nc
The ILAMB-derived files are archived here to provide a frozen reproducibility record; their original source paths and embedded dataset metadata should be used for provenance and licence information.
Reproduction instructions are provided in the repository README.md.
Files
esflow_sample_data.zip
Additional details
Dates
- Submitted
-
2026-04-16