Published May 16, 2025
| Version v1
Dataset
Restricted
Finance Agent Benchmark
Authors/Creators
Description
We present the Finance Agent Benchmark, featuring challenging and diverse real-world finance research problems which require LLMs to perform complex analysis with the use of of recent SEC filings. We construct the benchmark using a taxonomy of nine financial task categories, developed in consultation with experts from banks, hedge funds, and private equity firms. The dataset includes 537 expert-authored questions, covering tasks from information retrieval to complex financial modeling, where each question was validated through a rigorous review process to ensure accuracy and relevance.
Files
Additional details
Software
- Repository URL
- https://github.com/vals-ai/finance-agent
- Programming language
- Python