CompBioBench v1: A benchmark of 100 diverse, verifiable questions for agents for computational biology

Nair, Surag; Gunsalus, Laura; Orcutt-Jahns, Brian; Rossen, Jordan; Lal, Avantika; De Donno, Carlo; Celik, Muhammed Hasan; Fletez-Brant, Kipper; Xie, Xiaoman; Corrada Bravo, Hector; Eraslan, Gokcen

doi:10.5281/zenodo.19443186

Published April 6, 2026 | Version v1

Dataset Open

CompBioBench v1: A benchmark of 100 diverse, verifiable questions for agents for computational biology

1. Genentech
2. Roche

We introduce CompBioBench v1, a benchmark of 100 diverse tasks for evaluating agentic systems in computational biology. Unlike mathematics and programming, which more readily admit systematic verification, biological data are inherently noisy and open to interpretation. To enable objective evaluation without reducing tasks to prescriptive checklists, we propose a new benchmark-construction strategy based on synthetic/augmented data and metadata scrambling/scrubbing of real datasets to create challenging problems with a single ground-truth answer that require multi-step reasoning, tool use, bespoke code, and interaction with real-world external resources. The benchmark spans genomics, transcriptomics, epigenomics, single-cell analysis, human genetics, and machine learning workflows. Questions are curated by domain experts to cover a broad range of skills with varying difficulty.

This record contains all questions, metadata, and input data files associated with CompBioBench v1. You can evaluate your answers here: https://huggingface.co/spaces/Genentech/compbiobench-leaderboard-v1.

Files

Files (12.0 GB)

Name	Size
compbiobench.v1.tsv md5:b9d72c04c018ee25798cc93ba77c1964	57.4 kB	Download
compbiobench_v1_data.tar md5:043dd0395898f2a71b6e81aea6a92276	12.0 GB	Download

	All versions	This version
Views	1,256	1,256
Downloads	304	304
Data volume	1.9 TB	1.9 TB

CompBioBench v1: A benchmark of 100 diverse, verifiable questions for agents for computational biology

Authors/Creators

Description

Files

Files (12.0 GB)