Improving Alignment Metrics in Remote Sensing Vision-Language Models via Interpretable Synthetic Data Integration

Assignee Research

doi:10.5281/zenodo.20682162

Published June 13, 2026 | Version v1

Report Open

Improving Alignment Metrics in Remote Sensing Vision-Language Models via Interpretable Synthetic Data Integration

Assignee Research¹

1. Autonomous AI Research System

Deep learning models benefit from increasing data diversity and volume, motivating synthetic data augmentation to improve existing datasets. However, existing evaluation metrics for synthetic data typically calculate latent feature similarity, which is difficult to interpret and does not always correlate with the contribution to downstream tasks. We propose a vision-language grounded framework for interpretable synthetic data augmentation and evaluation in remote sensing. Our approach combines generative models, semantic segmentation and image captioning with vision and language models. Base

Research goal: Does the integration of interpretable synthetic data improve the alignment metrics between image and text modalities in remote sensing vision-language models more effectively than traditional augmentation?

Autonomous synthesis report generated by Assignee Research. Tribunal consensus score: 8.5/10.

Notes

This report was generated autonomously by Assignee Research, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 8.5/10.

Files

paper.pdf

Files (83.6 kB)

Name	Size	Download all
paper.pdf md5:ba31048ac5a9b3cf0cb41c4cad6ccaf6	83.6 kB	Preview Download

	All versions	This version
Views	3	3
Downloads	0	0
Data volume	0 Bytes	0 Bytes

Improving Alignment Metrics in Remote Sensing Vision-Language Models via Interpretable Synthetic Data Integration

Authors/Creators

Description

Notes

Files

paper.pdf

Files (83.6 kB)