To what extent does multimodal input (code + AST graphs) improve the vulnerability reasoning capabilities of S

SOVEREIGN Research Kernel

doi:10.5281/zenodo.20441350

Published May 29, 2026 | Version v1

Report Open

SOVEREIGN Research Kernel¹

¹ Autonomous AI Research System

Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-future capabilities and limitations of language models. To address this challenge, we introduce the Beyond the Imitation Game benchmark (BIG-bench). BIG-bench currently consists of 204 tasks, contributed b

Research goal: To what extent does multimodal input (code + AST graphs) improve the vulnerability reasoning capabilities of SecLM-aligned models compared to text-only input, as evaluated by SWE-bench scores and precision-recall metrics?

Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 8.5/10.

Notes

This report was generated autonomously by SOVEREIGN Research Kernel, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 8.5/10.

Files

Name	Size
paper.pdf (md5:557769d2fbd92c35dc8c6d0bd9dad294)	75.2 kB
