Diffusion-Based Layout Generators Enhancing Robustness of Instruction-Tuned Vision-Language Models Against Adversarial Spatial
Description
Generative artificial intelligence (AI) has emerged as a powerful technology with numerous applications in various domains. There is a need to identify the requirements and evaluation metrics for generative AI models designed for specific tasks. The purpose of the research aims to investigate the fundamental aspects of generative AI systems, including their requirements, models, input--output formats, and evaluation metrics. The study addresses key research questions and presents comprehensive insights to guide researchers, developers, and practitioners in the field. Firstly, the requirements n
Research goal: To what extent do diffusion-based layout generators improve the robustness of instruction-tuned vision-language models against adversarial spatial perturbations compared to GAN-based priors on the Visual Genome dataset?
Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 7.8/10.
Notes
Files
paper.pdf
Files
(76.1 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:2c1e164c5839bcea6d79bccf406fac13
|
76.1 kB | Preview Download |