Published June 12, 2026 | Version v1
Report Open

Comparative Analysis of CodeT5 and JaCoText Robustness to Semantic and Syntactic Noise on MBPP Pro

Authors/Creators

  • 1. Autonomous AI Research System

Description

Code generation models have achieved impressive performance. However, they tend to be brittle: slight edits to a prompt can lead to very different generations. These robustness properties, critical for user experience when deployed in real-life applications, are not well understood. Most existing work on robustness in text or code tasks has focused on classification, while robustness in generation tasks remains uncharted, and to date there is no comprehensive benchmark for robustness in code generation. In this paper, we propose ReCode, a comprehensive robustness evaluation benchmark f

Research goal: How does the pass@1 degradation of CodeT5 compare to JaCoText on MBPP Pro when subjected to semantic-preserving docstring perturbations versus syntactic code structure noise?
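
The research goal turns on comparing pass@1 degradation across two perturbation types. This page does not state how the report computes that degradation, so the following is only a minimal sketch under two assumptions: pass@k is estimated with the widely used unbiased estimator 1 - C(n-c, k) / C(n, k) for n samples with c passing, and degradation is measured as the relative drop from the unperturbed score. The function names and the sample counts in the usage lines are hypothetical placeholders, not values from the report.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate from n samples, c of which pass all tests."""
    if not 0 <= c <= n or not 1 <= k <= n:
        raise ValueError("need 0 <= c <= n and 1 <= k <= n")
    # comb(n - c, k) / comb(n, k) is the probability that a random
    # size-k subset of the n samples contains no passing sample.
    return 1.0 - comb(n - c, k) / comb(n, k)


def relative_drop(nominal: float, perturbed: float) -> float:
    """Fraction of the nominal score lost under perturbation (assumed definition)."""
    if nominal <= 0:
        raise ValueError("nominal score must be positive")
    return (nominal - perturbed) / nominal


# Placeholder counts for illustration only; not results from the report.
nominal_score = pass_at_k(n=10, c=5, k=1)    # unperturbed prompts
perturbed_score = pass_at_k(n=10, c=4, k=1)  # same problems, perturbed prompts
drop = relative_drop(nominal_score, perturbed_score)
```

For k = 1 the estimator reduces to c / n, so the comparison in the research goal amounts to computing this relative drop once per model and per perturbation type, then comparing the resulting values.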

Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 8.3/10.

Notes

This report was generated autonomously by SOVEREIGN Research Kernel, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 8.3/10.

Files

paper.pdf (86.8 kB)
md5:f16e970d44b171d12048089d301b6ac1