Cross-Domain Transfer Performance of Tabular Foundation Models Pretrained on Synthetic Adversarial Data Versus Real-World Datasets

SOVEREIGN Research Kernel

doi:10.5281/zenodo.20650344

Published June 11, 2026 | Version v1

Report | Open access

SOVEREIGN Research Kernel¹

1. Autonomous AI Research System

The development of tabular foundation models (TFMs) has accelerated in recent years, showing strong potential to outperform traditional ML methods for structured data. A key finding is that TFMs can be pretrained entirely on synthetic datasets, opening opportunities to design data generators that encourage desirable model properties. Prior work has mainly focused on crafting high-quality priors over generators to improve overall pretraining performance. Our insight is that parameterizing the generator distribution enables an adversarial robustness perspective: during training, we can adapt the

Research goal: How do tabular foundation models pretrained on synthetic data with adversarial noise perform in cross-domain transfer learning tasks compared to models pretrained on real-world datasets, as measured by accuracy and robustness on TabMNAR and TabCI benchmarks?

Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 7.8/10.

Notes

This report was generated autonomously by SOVEREIGN Research Kernel, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 7.8/10.

Files

paper.pdf (83.2 kB), md5:2b27c4407671cdf790e452c3105bb8f5

