Published May 20, 2026 | Version v1.0.0 | Software | Open
morgan-heisler/DeckBench: v1.0.0 — DECKBench Initial Release (KDD 2026)
Description
DECKBench v1.0.0
Official benchmark release accompanying the KDD 2026 paper "DECKBench: Benchmarking Multi-Agent Frameworks for Academic Slide Generation and Editing".
What's Included
- 294 paper–slide pair URLs spanning CVPR, ECCV, ICLR, and ICML across multiple years
- Evaluation code for both tasks: paper-to-slide generation (Task 1) and multi-turn iterative editing (Task 2)
- User simulation pipeline with three personas (Granular Analyst, Balanced Editor, Executive) for generating realistic multi-turn editing instructions
- Baseline editor agent built on the OpenAI Agents framework, with an abstract base class for plugging in custom agents
- Full metric suite: perplexity, faithfulness, text/figure similarity, layout quality, DTW, transition similarity, ΔDTW, ΔTransSim
- Tutorial notebook for the user simulator (user_simulator_tutorial.ipynb)
- Pre-generated baseline decks available on HuggingFace
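For readers unfamiliar with the DTW entry in the metric list above, the following is a minimal sketch of the textbook dynamic time warping distance between two sequences. It is not taken from the DECKBench code: the function name `dtw_distance`, the `cost` callback, and the toy numeric inputs are assumptions for illustration, and the benchmark's own implementation may differ in its cost function and normalization.

```python
import math
from typing import Callable, Sequence, TypeVar

T = TypeVar("T")


def dtw_distance(
    a: Sequence[T],
    b: Sequence[T],
    cost: Callable[[T, T], float],
) -> float:
    """Classic O(len(a) * len(b)) dynamic time warping distance.

    Hypothetical helper for illustration only; not the DECKBench API.
    """
    n, m = len(a), len(b)
    # table[i][j] = minimal cumulative cost of aligning a[:i] with b[:j]
    table = [[math.inf] * (m + 1) for _ in range(n + 1)]
    table[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = cost(a[i - 1], b[j - 1])
            table[i][j] = step + min(
                table[i - 1][j],      # a[i-1] also matches an earlier element of b
                table[i][j - 1],      # b[j-1] also matches an earlier element of a
                table[i - 1][j - 1],  # one-to-one match
            )
    return table[n][m]


def abs_diff(x: float, y: float) -> float:
    """Toy element-wise cost used only in this example."""
    return abs(x - y)
```

Under this sketch, two sequences that differ only by a repeated element, such as `[1, 2, 3]` and `[1, 2, 2, 3]`, align at zero cost, which is the property that makes DTW tolerant of decks with different slide counts covering the same content in the same order.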
Citation
If you use DECKBench in your research, please cite:
@misc{jang2026deckbench,
title = {DECKBench: Benchmarking Multi-Agent Frameworks for Academic Slide Generation and Editing},
author = {Daesik Jang and Morgan Lindsay Heisler and Linzi Xing and Yifei Li
and Edward Wang and Ying Xiong and Yong Zhang and Zhenan Fan},
year = {2026},
eprint = {2602.13318},
archivePrefix = {arXiv},
primaryClass = {cs.AI},
url = {https://arxiv.org/abs/2602.13318}
}
Files
| Name | Size | Checksum |
|---|---|---|
| morgan-heisler/DeckBench-v1.0.0.zip | 296.8 kB | md5:994afb4758bc3c0b266a8439a5fbfaa2 |
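The MD5 checksum listed for the archive can be used to confirm that a download is intact. The following is a minimal sketch using only the Python standard library; the local file name `DeckBench-v1.0.0.zip` and the helper name `md5_of_file` are assumptions, so adjust the path to wherever the archive was saved.

```python
import hashlib
from pathlib import Path

# Checksum as listed on this record for the v1.0.0 archive.
EXPECTED_MD5 = "994afb4758bc3c0b266a8439a5fbfaa2"


def md5_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


if __name__ == "__main__":
    # Assumed local file name; change it to match your download location.
    archive = Path("DeckBench-v1.0.0.zip")
    if archive.exists():
        actual = md5_of_file(str(archive))
        status = "OK" if actual == EXPECTED_MD5 else "MISMATCH"
        print(f"{archive.name}: {actual} ({status})")
    else:
        print(f"{archive} not found; download it first.")
```

MD5 is adequate here for detecting a corrupted or truncated download, but it is not a defense against deliberate tampering.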
Additional details
Related works
- Is supplement to: Software: https://github.com/morgan-heisler/DeckBench/tree/v1.0.0 (URL)
Software
- Repository URL: https://github.com/morgan-heisler/DeckBench