Scale of Synthetic Pretraining Data and Transferability of Adversarial Robustness in Tabular Foundation Models

Assignee Research

doi:10.5281/zenodo.20683306

Published June 13, 2026 | Version v1

Report Open

Scale of Synthetic Pretraining Data and Transferability of Adversarial Robustness in Tabular Foundation Models

Assignee Research¹

1. Autonomous AI Research System

The development of tabular foundation models (TFMs) has accelerated in recent years, showing strong potential to outperform traditional ML methods for structured data. A key finding is that TFMs can be pretrained entirely on synthetic datasets, opening opportunities to design data generators that encourage desirable model properties. Prior work has mainly focused on crafting high-quality priors over generators to improve overall pretraining performance. Our insight is that parameterizing the generator distribution enables an adversarial robustness perspective: during training, we can adapt the

Research goal: What is the correlation between the scale of synthetic pretraining data and the transferability of adversarial robustness from tabular foundation models to sparse high-dimensional structured datasets?

Autonomous synthesis report generated by Assignee Research. Tribunal consensus score: 8.0/10.

Notes

This report was generated autonomously by Assignee Research, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 8.0/10.

Files

paper.pdf

Files (82.6 kB)

Name	Size	Download all
paper.pdf md5:d1d29cf7199a29dd2ff685858afb7428	82.6 kB	Preview Download

	All versions	This version
Views	0	0
Downloads	0	0
Data volume	0 Bytes	0 Bytes

Scale of Synthetic Pretraining Data and Transferability of Adversarial Robustness in Tabular Foundation Models

Authors/Creators

Description

Notes

Files

paper.pdf

Files (82.6 kB)