Published January 23, 2026 | Version v1
Model Open

Model and Code for publication 'Inferring the Controlling Factors of Ice Aggregation from Targeted Cloud Seeding Experiments'

Description

# Inferring the Controlling Factors of Ice Aggregation from Targeted Cloud Seeding Experiments

> **Authors:** Huiying Zhang , Fabiola Ramelli, Christopher Fuchs , Nadja Omanovic , Anna J. Miller , Robert Spirig, Zhaolong Wu , Yunpei Chu , Xia Li, Ulrike Lohmann , and Jan Henneberger
> **Affiliation:** ETH Zurich
> **Contact:** huiying.zhang@env.ethz.ch

## πŸ“– Introduction

This repository contains the source code, machine learning models, and datasets associated with the research project/paper: **"Inferring the Controlling Factors of Ice Aggregation from Targeted Cloud Seeding Experiments"**.

It includes:
1. **Machine Learning**: aggregation rate prediction models (XGBoost, AdaBoost, etc.)
2. **Visualization**: Correlation plots and Probability Density Function (PDF) analysis etc.

---

## πŸ“‚ Repository Structure

The project is organized as follows:

```text
.
β”œβ”€β”€ README.md
β”œβ”€β”€ casual_analysis
β”‚   └── casual.ipynb
β”œβ”€β”€ correlation_plots
β”‚   β”œβ”€β”€ EDR_temp.ipynb
β”‚   β”œβ”€β”€ ICNCt0_IWC.ipynb
β”‚   β”œβ”€β”€ ICNCt0_Temp.ipynb
β”‚   └── residenceTime_Temp.ipynb
β”œβ”€β”€ correlation_subplots
β”‚   └── agg_rate
β”‚   └── ICNCt0_AggRate.ipynb
β”œβ”€β”€ environment.yml
β”œβ”€β”€ ml_models
β”‚   β”œβ”€β”€ AdaBoost.pkl
β”‚   β”œβ”€β”€ BayesianRidge.pkl
β”‚   β”œβ”€β”€ CatBoost.pkl
β”‚   β”œβ”€β”€ DecisionTree.pkl
β”‚   β”œβ”€β”€ ExtraTrees.pkl
β”‚   β”œβ”€β”€ GradientBoosting.pkl
β”‚   β”œβ”€β”€ KNN.pkl
β”‚   β”œβ”€β”€ LightGBM.pkl
β”‚   β”œβ”€β”€ LinearRegression.pkl
β”‚   β”œβ”€β”€ RandomForest.pkl
β”‚   β”œβ”€β”€ Ridge.pkl
β”‚   β”œβ”€β”€ XGBoost.pkl
β”‚   └── models_performance.csv
β”œβ”€β”€ models_training
β”‚   β”œβ”€β”€ model_training.ipynb
β”‚   └── physical_ml_models_2.0_withoutEDR_groupBased.ipynb
└── pdf
β”œβ”€β”€ icnc_pdf.ipynb
β”œβ”€β”€ irre_rim_pdf.ipynb
└── majsiz_pdf.ipynb

```

---

## πŸš€ Getting Started

### 1. Prerequisites

The code relies on Python and several data science libraries. We recommend using **Conda** to manage the environment.

### 2. Installation

To set up the environment and reproduce the results, please follow these steps:

```bash
# 1.Create the environment from the provided file
conda env create -f environment.yml

# 2. Activate the environment
conda activate ice

```

---

## πŸ“Š Data Usage

The data is located in `casual_analysis/data/`.

* **Main Dataset:** `merged_with_edr_postprocessed.csv`
* **Metadata:** Please refer to `data_description.md` in the data folder for a detailed explanation of column names, physical units, and measurement techniques.

---

## πŸ€– Model Usage

Models are stored in `ml_models/`. You can load them using Python's `joblib` or `pickle` library to make predictions on new data.

**Example (Python):**

```python
import joblib
import pandas as pd

# 1. Load a trained model (e.g., XGBoost)
model_path = 'ml_models/XGBoost.pkl'
model = joblib.load(model_path)

# 2. Load your data (ensure feature columns match the training data)
# data = pd.read_csv('your_new_data.csv')

# 3. Predict
# predictions = model.predict(data)

```

To retrain the models or view the training process, run the notebook:

`models_training/model_training.ipynb`

---

## πŸ“ˆ Visualizations (Notebooks)

To view the analysis plots (PDFs and Correlations), start the Jupyter Notebook server:

```bash
jupyter notebook

```

* **Correlation Analysis:** Go to `correlation_plots/` (e.g., `EDR_temp.ipynb`).
* **PDF Analysis:** Go to `pdf/` (e.g., `icnc_pdf.ipynb`).

---

## πŸ“ Citation

If you use this code or data in your research, please cite our paper:

```bibtex
@article{zhang2025inferring,
title={Inferring the Controlling Factors of Ice Aggregation from Targeted Cloud Seeding Experiments},
author={Zhang, Huiying and Ramelli, Fabiola and Fuchs, Christopher and Omanovic, Nadja and Miller, Anna J and Spirig, Robert and Wu, Zhaolong and Chu, Yunpei and Li, Xia and Lohmann, Ulrike and others},
journal={EGUsphere},
volume={2025},
pages={1--31},
year={2025},
publisher={Copernicus Publications G{\"o}ttingen, Germany}
}

```

## πŸ“„ License

This project is licensed under the [CC-BY](https://www.google.com/search?q=LICENSE) - see the https://www.google.com/search?q=LICENSE file for details.

```

```

Files

models_and_code.zip

Files (23.1 MB)

Name Size Download all
md5:af1b94b6d952b07cc7645bd005b36595
23.1 MB Preview Download

Additional details

Related works

Is supplement to
Dataset: 10.5281/zenodo.18348125 (DOI)

Funding

European Commission
CLOUDLAB - Using clouds as a natural laboratory to improve precipitation forecast skills 101021272

References

  • Zhang H, Ramelli F, Fuchs C, et al. Inferring the Controlling Factors of Ice Aggregation from Targeted Cloud Seeding Experiments[J]. EGUsphere, 2025, 2025: 1-31.