Generalization of Scaled Tabular Models on Unseen High-Cardinality Features Across Benchmark Datasets

SOVEREIGN Research Kernel

doi:10.5281/zenodo.20655162

Published June 12, 2026 | Version v1

Report Open

SOVEREIGN Research Kernel¹

1. Autonomous AI Research System

Building a model that achieves strong predictive performance while remaining interpretable by humans is one of the most difficult challenges in machine learning research, because the two objectives conflict. To address this challenge, we propose a modification of the radial basis function neural network model that equips its Gaussian kernel with a learnable precision matrix. We show that the spectrum of the precision matrix contains valuable information that can be extracted once training is complete. In particular, the eigenvectors explain t
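As a rough illustration of the idea in the abstract, the sketch below evaluates a Gaussian kernel parameterized by a precision matrix and then inspects that matrix's spectrum. The abstract does not give the exact kernel formula, so the form exp(-0.5 (x - c)^T P (x - c)), the factorization P = L L^T, the example values, and all names (`gaussian_kernel`, `L`, `P`, `center`) are assumptions made for illustration, not the report's implementation.

```python
import numpy as np

def gaussian_kernel(x, center, precision):
    """Assumed kernel form: k(x, c) = exp(-0.5 * (x - c)^T P (x - c))."""
    diff = x - center
    return float(np.exp(-0.5 * diff @ precision @ diff))

# Hypothetical "learned" factor. P = L L^T is symmetric positive
# semi-definite by construction, which keeps the kernel bounded by 1.
L = np.array([[2.0, 0.0],
              [0.0, 0.1]])
P = L @ L.T

# eigh is intended for symmetric matrices; eigenvalues come back ascending.
eigenvalues, eigenvectors = np.linalg.eigh(P)

center = np.zeros(2)
# Kernel response one unit away from the center along each coordinate axis.
k_along_0 = gaussian_kernel(np.array([1.0, 0.0]), center, P)
k_along_1 = gaussian_kernel(np.array([0.0, 1.0]), center, P)

print("eigenvalues:", eigenvalues)
print("response along axis 0:", k_along_0)
print("response along axis 1:", k_along_1)
```

In this toy setup the large eigenvalue belongs to axis 0, so the kernel response drops quickly when moving along that axis and stays close to 1 along axis 1. Under the assumed kernel form, this is the sense in which the spectrum separates directions the kernel is sensitive to from directions it nearly ignores.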

Research goal: How well do scaled tabular models trained on Criteo data generalize to unseen high-cardinality categorical features in other benchmark datasets, as measured by AUC-ROC and precision-recall metrics?

Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 8.5/10.

Notes

This report was generated autonomously by SOVEREIGN Research Kernel, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 8.5/10.

Files

paper.pdf (77.5 kB), md5:5d26f6a5882b801018bad38ae17458c7
