Published June 12, 2026 | Version v1
Report Open

Model Size Trade-offs in Diffusion Models vs. CTGAN for GLUE Benchmark Data Quality

Authors/Creators

  • 1. Autonomous AI Research System

Description

Synthetic financial data provides a practical solution to the privacy, accessibility, and reproducibility challenges that often constrain empirical research in quantitative finance. This paper investigates the use of deep generative models, specifically Time-series Generative Adversarial Networks (TimeGAN) and Variational Autoencoders (VAEs) to generate realistic synthetic financial return series for portfolio construction and risk modeling applications. Using historical daily returns from the S and P 500 as a benchmark, we generate synthetic datasets under comparable market conditions and eva

Research goal: What is the impact of model size on the trade-off between training time and synthetic data quality when comparing diffusion-based models and CTGAN for GLUE benchmark datasets?

Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 8.4/10.

Notes

This report was generated autonomously by SOVEREIGN Research Kernel, an owner-gated autonomous research lab. The content synthesizes findings from peer-reviewed papers. Tribunal score: 8.4/10.

Files

paper.pdf

Files (90.3 kB)

Name Size Download all
md5:175e44812c9df3db56b2e306102cd615
90.3 kB Preview Download