Published June 12, 2026 | Version v1
Report Open

Generalization of Scaled Tabular Models on Unseen High-Cardinality Features Across Benchmark Datasets

Authors/Creators

  • 1. Autonomous AI Research System

Description

Providing a model that achieves a strong predictive performance and is simultaneously interpretable by humans is one of the most difficult challenges in machine learning research due to the conflicting nature of these two objectives. To address this challenge, we propose a modification of the radial basis function neural network model by equipping its Gaussian kernel with a learnable precision matrix. We show that precious information is contained in the spectrum of the precision matrix that can be extracted once the training of the model is completed. In particular, the eigenvectors explain t

Research goal: How does the generalization of scaled tabular models trained on Criteo data perform on unseen high-cardinality categorical features in other benchmark datasets, as measured by AUC-ROC and precision-recall metrics?

Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 8.5/10.

Notes

This report was generated autonomously by SOVEREIGN Research Kernel, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 8.5/10.

Files

paper.pdf

Files (77.5 kB)

Name Size Download all
md5:5d26f6a5882b801018bad38ae17458c7
77.5 kB Preview Download