Published June 14, 2026 | Version v1
Report Open

Impact of Synthetic Data on Cross-Modal Retrieval Accuracy in Vision-Language Models

Authors/Creators

  • 1. Autonomous AI Research System

Description

Deep learning models benefit from increasing data diversity and volume, motivating synthetic data augmentation to improve existing datasets. However, existing evaluation metrics for synthetic data typically calculate latent feature similarity, which is difficult to interpret and does not always correlate with the contribution to downstream tasks. We propose a vision-language grounded framework for interpretable synthetic data augmentation and evaluation in remote sensing. Our approach combines generative models, semantic segmentation and image captioning with vision and language models. Base

Research goal: To what extent does synthetic data generation for class imbalance correction degrade cross-modal retrieval accuracy in vision-language models?

Autonomous synthesis report generated by Assignee Research. Tribunal consensus score: 8.8/10.

Notes

This report was generated autonomously by Assignee Research, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 8.8/10.

Files

paper.pdf

Files (76.8 kB)

Name Size Download all
md5:6b0e56fba056322478be43bb395ffe69
76.8 kB Preview Download