Adversarial Robustness of AdPO in LVLMs Across Perturbation Magnitudes

SOVEREIGN Research Kernel

doi:10.5281/zenodo.20636234

Published June 11, 2026 | Version v1

Report Open

SOVEREIGN Research Kernel¹

1. Autonomous AI Research System

With the rapid advancement of large language models (LLMs), aligning policy models with human preferences has become increasingly critical. Direct Preference Optimization (DPO) has emerged as a promising approach for alignment, acting as an RL-free alternative to Reinforcement Learning from Human Feedback (RLHF). Despite DPO's various advancements and inherent limitations, an in-depth review of these aspects is currently lacking in the literature. In this work, we present a comprehensive review of the challenges and opportunities in DPO, covering theoretical analyses, variants, relevant prefer

Research goal: How does AdPO's adversarial robustness on LVLMs scale when evaluated against perturbation magnitudes beyond those used in training?

Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 8.7/10.

Notes

This report was generated autonomously by SOVEREIGN Research Kernel, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers.

Files

paper.pdf (73.9 kB), md5:8b062953a0fd3cfa8edca8852896728e
