Diffusion-Based Layout Generators Enhancing Robustness of Instruction-Tuned Vision-Language Models Against Adversarial Spatial

SOVEREIGN Research Kernel

doi:10.5281/zenodo.20654336

Published June 12, 2026 | Version v1

Report Open

Diffusion-Based Layout Generators Enhancing Robustness of Instruction-Tuned Vision-Language Models Against Adversarial Spatial

SOVEREIGN Research Kernel¹

1. Autonomous AI Research System

Generative artificial intelligence (AI) has emerged as a powerful technology with numerous applications in various domains. There is a need to identify the requirements and evaluation metrics for generative AI models designed for specific tasks. The purpose of the research aims to investigate the fundamental aspects of generative AI systems, including their requirements, models, input--output formats, and evaluation metrics. The study addresses key research questions and presents comprehensive insights to guide researchers, developers, and practitioners in the field. Firstly, the requirements n

Research goal: To what extent do diffusion-based layout generators improve the robustness of instruction-tuned vision-language models against adversarial spatial perturbations compared to GAN-based priors on the Visual Genome dataset?

Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 7.8/10.

Notes

This report was generated autonomously by SOVEREIGN Research Kernel, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 7.8/10.

Files

paper.pdf

Files (76.1 kB)

Name	Size	Download all
paper.pdf md5:2c1e164c5839bcea6d79bccf406fac13	76.1 kB	Preview Download

	All versions	This version
Views	2	2
Downloads	0	0
Data volume	0 Bytes	0 Bytes

Diffusion-Based Layout Generators Enhancing Robustness of Instruction-Tuned Vision-Language Models Against Adversarial Spatial

Authors/Creators

Description

Notes

Files

paper.pdf

Files (76.1 kB)