Adaptive Noise Injection Strategies for Enhanced TabPFN Calibration on Sparse Tabular Datasets
Description
Recent deep learning models for tabular data now compete with traditional ML models based on gradient-boosted decision trees (GBDT). Unlike GBDT, deep models can additionally benefit from pretraining, a workhorse of DL for vision and NLP. For tabular problems, several pretraining methods have been proposed, but it is not entirely clear whether pretraining provides consistent, noticeable improvements and which method should be used, since the methods are often not compared to each other or the comparison is limited to the simplest MLP architectures. In this work, we aim to identify the best practices to pr
Research goal: Can adaptive noise injection strategies during TabPFN pretraining outperform fixed-magnitude Gaussian noise in improving model calibration on sparse tabular datasets?
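To make the comparison in the research goal concrete, the following is a minimal sketch of the two noise schemes and of one calibration metric. The record does not define "adaptive", so `adaptive_gaussian_noise` is a hypothetical variant (noise scaled per feature by the spread of its nonzero entries, zeros left untouched), and the binned calibration error is one common choice of metric, not necessarily the one used in the report. All function names are illustrative.

```python
import random


def fixed_gaussian_noise(rows, sigma, rng):
    """Baseline: add N(0, sigma^2) noise to every entry, including zeros."""
    return [[x + rng.gauss(0.0, sigma) for x in row] for row in rows]


def adaptive_gaussian_noise(rows, base_sigma, rng):
    """Hypothetical adaptive scheme (an assumption, not the report's method).

    Noise for feature j is scaled by the standard deviation of that feature's
    nonzero entries, and zero entries are left untouched so that the sparsity
    pattern is preserved.
    """
    n_features = len(rows[0])
    scales = []
    for j in range(n_features):
        nonzero = [row[j] for row in rows if row[j] != 0.0]
        if len(nonzero) < 2:
            scales.append(0.0)  # too few observations to estimate a scale
            continue
        mean = sum(nonzero) / len(nonzero)
        var = sum((v - mean) ** 2 for v in nonzero) / len(nonzero)
        scales.append(base_sigma * var ** 0.5)
    return [
        [x + rng.gauss(0.0, scales[j]) if x != 0.0 else 0.0
         for j, x in enumerate(row)]
        for row in rows
    ]


def expected_calibration_error(probs, labels, n_bins=10):
    """Binned calibration error for binary predictions.

    probs are predicted P(y = 1); labels are 0 or 1. Each bin contributes the
    gap between its mean predicted probability and its observed positive rate,
    weighted by the fraction of samples that fall in the bin.
    """
    bins = [[] for _ in range(n_bins)]
    for p, y in zip(probs, labels):
        idx = min(int(p * n_bins), n_bins - 1)
        bins[idx].append((p, y))
    total = len(probs)
    ece = 0.0
    for members in bins:
        if not members:
            continue
        avg_prob = sum(p for p, _ in members) / len(members)
        frac_pos = sum(y for _, y in members) / len(members)
        ece += len(members) / total * abs(avg_prob - frac_pos)
    return ece
```

Under these assumptions, an experiment would apply each noise function to the pretraining inputs, train or fine-tune the model, and compare the resulting calibration error on held-out sparse datasets. The design point of the adaptive variant is that fixed-magnitude noise turns every structural zero into a nonzero value, which may distort sparse inputs more than it regularizes them.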
Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 7.8/10.
Notes
Files
paper.pdf (84.3 kB)
md5:80ba1ea3e64e67888552331b62ecf95a