Published March 12, 2026 | Version v1
Dataset Open

Parameter-Efficient Adaptation of Generative-Foundation (Flux, Qwen) vs. Zero-Shot (Gemini, SAM3) Models for Aerial Image Segmentation

Description

Dataset Overview:

This dataset and the associated LoRA-adapted model weights (FLUX.1 Kontext, FLUX.2, and Qwen) provide the experimental framework and results for the study: 'Parameter-Efficient Adaptation of Generative-Foundation (Flux, Qwen) vs. Zero-Shot (Gemini, SAM3) Models for Aerial Image Segmentation', published in Buildings (MDPI), 2026, 16(7), 1369 https://doi.org/10.3390/buildings16071369 . The records include the Brisbane metropolitan training/testing tiles and the fine-tuned adapters that achieved a peak mean Intersection over Union (IoU) of 89% and 96% pixel accuracy, demonstrating the efficacy of generative foundation models for high-precision spatial analysis.

Technical Specifications:

·         Geographic Focus: Brisbane, Australia (Metropolitan regions).

·         Task: Binary Semantic Segmentation (Rooftop vs. Non-rooftop).

·         Data Split: 121 total tiles (80/20 split)- Training: 97 tiles.

·         Testing: 24 tiles.Format: High-resolution aerial imagery paired with dense binary annotations.

Benchmarked Models & Performance:

This dataset serves as the benchmark for several state-of-the-art architectures evaluated under a unified protocol:

Model Category

Name

IoU

Zero-Shot

Gemini 3 Pro

85%

Segmentation Baseline

SAM3 (Segment Anything Model)

84%

LoRA-Adapted Diffusion

FLUX.1-Kontext

89%

 

 

Key Research Insights:

·         Diffusion as Segmentors: The dataset demonstrates that with only 250–5000 steps of LoRA fine-tuning, generative models like FLUX.1-Kontext can outperform dedicated segmentation models (SAM3), reaching a mean IoU of 89%.

·         Generalization: While Gemini 3 Pro provides a powerful zero-shot baseline, fine-tuned diffusion models offer superior boundary fidelity in heterogeneous urban morphologies.

·         Efficiency: The results highlight that parameter-efficient tuning is a viable path for transforming general-purpose AI into scalable spatial analysis tools for urban monitoring and solar potential assessment.

 

Citation

  • Dataset could be cited from:   https://explore.openaire.eu/search/result?pid=10.5281%2Fzenodo.18571111
  • APA style ciataion: "Shata, D., Denman, S., Omrani, S., Drogemuller, R., Ali, H., & Wagdy, A. (2026). Parameter-Efficient Adaptation of Generative-Foundation (Flux, Qwen) vs. Zero-Shot (Gemini, SAM3) Models for Aerial Image Segmentation. https://doi.org/10.5281/zenodo.18571111"
  • APA style Publication ciataion: "Shata, D., Denman, S., Omrani, S., Drogemuller, R., Ali, H., & Wagdy, A. (2026). Parameter-Efficient Adaptation of Generative-Foundation (Flux, Qwen) vs. Zero-Shot (Gemini, SAM3) Models for Aerial Image Segmentation. Buildings16(7), 1369. https://doi.org/10.3390/buildings16071369"

Files

Files (26.1 GB)

Name Size Download all
md5:7d1a84526badadd487d2c5a228a40915
37.7 MB Download
md5:cfe7f70ed7358ddcc95449f5e90684e4
157.0 MB Download
md5:32fa97e7060c08b5e6d82a56c66f650e
1.4 MB Download
md5:eb29f333d4bd9861926f537e8b7c9329
25.9 GB Download

Additional details

Related works

Is supplement to
Journal article: 10.3390/buildings16071369 (DOI)