Published June 21, 2025 | Version v2
Project deliverable Open

Halligan

Creators

Description

Introduction

This repository provides the official artifact for the paper:
"Are CAPTCHAs Still Bot-Hard? Generalized Visual CAPTCHA Solving with Agentic Vision Language Models"

Our work explores the effectiveness of Vision-Language Model (VLM) agents in solving modern visual CAPTCHAs by leveraging reasoning, abstraction, and code synthesis capabilities.

Contents

  • benchmark.zip: An interactive offline benchmark suite designed to evaluate VLM agents on their ability to solve visual CAPTCHA challenges.
  • halligan.zip: The implementation of Halligan, our proposed VLM agent introduced in the paper

Files

benchmark.zip

Files (2.6 GB)

Name Size Download all
md5:80854a4947265de9ebfcc56cf9e70501
298.4 MB Preview Download
md5:b71dd0cb06d2f69ca56820b63cc158c3
2.3 GB Preview Download