Execution-Feedback Driven Test Generation from SWE Issues

Blinded

doi:10.5281/zenodo.16307058

There is a newer version of the record available.

Published July 22, 2025 | Version v4

Dataset Open

Execution-Feedback Driven Test Generation from SWE Issues

Blinded

We have shared 8 folders containing tests for 2 benchmarks (SWT-Bench Lite and TDD-Bench Verified), 2 models (GPT-4o and Claude-3.7-Sonnet), and 2 approaches (Otter and e-Otter). Each folder contains 10 tests using different prompting techniques (e.g., planner, full, standard) and associated logs.

Files

e-otter_lite_claude.zip

Files (2.8 GB)

Name	Size
e-otter_lite_claude.zip md5:99331605793fab49344635c48b195ad3	289.0 MB	Preview Download
e-otter_lite_gpt4o.zip md5:006ab1c91b5c3fbad72213c5fc3e3407	287.2 MB	Preview Download
e-otter_verified_claude.zip md5:e94da042bc591f858d5ef29b87aab110	387.4 MB	Preview Download
e-otter_verified_gpt4o.zip md5:420bb3d2e38aec5dcbc07e24b89a6a61	412.6 MB	Preview Download
otter_lite_claude.zip md5:30e3ce3e187c57d31802ee8fd1acf1f6	292.2 MB	Preview Download
otter_lite_gpt4o.zip md5:6237d2dde3b997c72703a1f1ed80e42b	289.1 MB	Preview Download
otter_verified_claude.zip md5:efa67759a49030169e745ee3296f613f	384.9 MB	Preview Download
otter_verified_gpt4o.zip md5:57946c7226907d2af86f02dcd4f84e84	411.1 MB	Preview Download

360

Views

707

Downloads

Show more details

	All versions	This version
Views	360	32
Downloads	707	43
Data volume	187.4 GB	17.5 GB

More info on how stats are collected....

DOI

Resource type

Dataset

Publisher

Zenodo

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: July 22, 2025
Modified: July 22, 2025

Execution-Feedback Driven Test Generation from SWE Issues

Authors/Creators

Description

Files

e-otter_lite_claude.zip

Files (2.8 GB)