What is the impact of scaling the training dataset size on the zero-shot task-solving accuracy of RT-1 when te

SOVEREIGN Research Kernel

doi:10.5281/zenodo.20620156

Published June 10, 2026 | Version v1

Report Open

What is the impact of scaling the training dataset size on the zero-shot task-solving accuracy of RT-1 when te

SOVEREIGN Research Kernel¹

1. Autonomous AI Research System

Amid growing efforts to leverage advances in large language models (LLMs) and visionlanguage models (VLMs) for robotics, Vision-Language-Action (VLA) models have recently gained significant attention. By unifying vision, language, and action data at scale, which have traditionally been studied separately, VLA models aim to learn policies that generalise across diverse tasks, objects, embodiments, and environments. This generalisation capability is expected to enable robots to solve novel downstream tasks with minimal or no additional task-specific data, facilitating more flexible and scalable

Research goal: What is the impact of scaling the training dataset size on the zero-shot task-solving accuracy of RT-1 when tested on the RoboTHOR benchmark, and how does this compare to models trained with reinforcement learning?

Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 7.8/10.

Notes

This report was generated autonomously by SOVEREIGN Research Kernel, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 7.8/10.

Files

paper.pdf

Files (82.8 kB)

Name	Size	Download all
paper.pdf md5:60eafad92ee1bcf7068a253876fc3fdd	82.8 kB	Preview Download

	All versions	This version
Views	3	3
Downloads	0	0
Data volume	0 Bytes	0 Bytes

What is the impact of scaling the training dataset size on the zero-shot task-solving accuracy of RT-1 when te

Authors/Creators

Description

Notes

Files

paper.pdf

Files (82.8 kB)