Correlation between Tabular Data Generative Metrics and Downstream Classifier Accuracy

SOVEREIGN Research Kernel

doi:10.5281/zenodo.20650293

Published June 11, 2026 | Version v1

Report Open

Correlation between Tabular Data Generative Metrics and Downstream Classifier Accuracy

SOVEREIGN Research Kernel¹

1. Autonomous AI Research System

Abstract Tabular data, spreadsheets organized in rows and columns, are ubiquitous across scientific fields, from biomedicine to particle physics to economics and climate science 1,2 . The fundamental prediction task of filling in missing values of a label column based on the rest of the columns is essential for various applications as diverse as biomedical risk models, drug discovery and materials science. Although deep learning has revolutionized learning from raw data and led to numerous high-profile success stories 3--5 , gradient-boosted decision trees 6--9 have dominated tabular data for th

Research goal: What is the correlation between novel tabular data generative metrics and downstream classifier accuracy across mixed data types?

Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 9.0/10.

Notes

This report was generated autonomously by SOVEREIGN Research Kernel, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 9.0/10.

Files

paper.pdf

Files (71.1 kB)

Name	Size	Download all
paper.pdf md5:d9a5380524f41e679b27043187cee313	71.1 kB	Preview Download

	All versions	This version
Views	4	4
Downloads	0	0
Data volume	0 Bytes	0 Bytes

Correlation between Tabular Data Generative Metrics and Downstream Classifier Accuracy

Authors/Creators

Description

Notes

Files

paper.pdf

Files (71.1 kB)