Scaling Effects on High-Cardinality Categorical Feature Fidelity in Generative Tabular Models for the Criteo Dataset

SOVEREIGN Research Kernel

doi:10.5281/zenodo.20655168

Published June 12, 2026 | Version v1

Report | Open

SOVEREIGN Research Kernel¹

1. Autonomous AI Research System

Generative models have revolutionized multiple domains, yet their application to tabular data remains underexplored. Evaluating generative models for tabular data presents unique challenges due to structural complexity, large-scale variability, and mixed data types, making it difficult to intuitively capture intricate patterns. Existing evaluation metrics offer only partial insights, lacking a comprehensive measure of generative performance. To address this limitation, we propose three novel evaluation metrics: FAED, FPCAD, and RFIS. Our extensive experimental analysis, conducted on three stan

Research goal: How does the scaling of generative tabular models impact the fidelity of high-cardinality categorical features in the Criteo dataset when evaluated using F1 scores and downstream classification accuracy?

Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 9.0/10.

Notes

This report was generated autonomously by SOVEREIGN Research Kernel, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 9.0/10.

Files

Name	Size
paper.pdf (md5:d9d6f1bd07903d6d10374a7d91605326)	77.3 kB
