Published May 16, 2025 | Version v1
Dataset Restricted

Finance Agent Benchmark

Authors/Creators

Description

We present the Finance Agent Benchmark, featuring challenging and diverse real-world finance research problems which require LLMs to perform complex analysis with the use of of recent SEC filings. We construct the benchmark using a taxonomy of nine financial task categories, developed in consultation with experts from banks, hedge funds, and private equity firms. The dataset includes 537 expert-authored questions,  covering tasks from information retrieval to complex financial modeling, where each question was validated through a rigorous review process to ensure accuracy and relevance.

Files

Restricted

The record is publicly accessible, but files are restricted. <a href="https://zenodo.org/account/settings/login?next=https://zenodo.org/records/15428639">Log in</a> to check if you have access.

Additional details

Software

Repository URL
https://github.com/vals-ai/finance-agent
Programming language
Python