Published June 21, 2025
| Version v2
Project deliverable
Open
Halligan
Creators
Description
Introduction
This repository provides the official artifact for the paper:
"Are CAPTCHAs Still Bot-Hard? Generalized Visual CAPTCHA Solving with Agentic Vision Language Models"
Our work explores the effectiveness of Vision-Language Model (VLM) agents in solving modern visual CAPTCHAs by leveraging reasoning, abstraction, and code synthesis capabilities.
Contents
- benchmark.zip: An interactive offline benchmark suite designed to evaluate VLM agents on their ability to solve visual CAPTCHA challenges.
- halligan.zip: The implementation of Halligan, our proposed VLM agent introduced in the paper
Files
benchmark.zip
Files
(2.6 GB)
Name | Size | Download all |
---|---|---|
md5:80854a4947265de9ebfcc56cf9e70501
|
298.4 MB | Preview Download |
md5:b71dd0cb06d2f69ca56820b63cc158c3
|
2.3 GB | Preview Download |