Patcher: Post-Hoc Patching of Backdoored Large Language Models

Gao, Anjun

doi:10.5281/zenodo.20362596

Published May 26, 2026 | Version v1

Software Open

Patcher: Post-Hoc Patching of Backdoored Large Language Models

Gao, Anjun

This project is official implementation from the paper "Patcher: Post-Hoc Patching of Backdoored Large Language Models". A security research framework for localizing and removing backdoor attacks from Large Language Models (LLMs) by patching the models. This project implements the attack, patching, and evaluation pipeline using gradient saliency analysis to identify trigger tokens and patch compromised models. For more details, see the README.md in the files.

Files

Patcher.zip

Files (34.1 kB)

Name	Size	Download all
Patcher.zip md5:0e7db0dc963df27ab6a0db374df96638	34.1 kB	Preview Download

Views

Downloads

Show more details

	All versions	This version
Views	31	31
Downloads	5	5
Data volume	170.6 kB	170.6 kB

More info on how stats are collected....

DOI

Resource type

Software

Publisher

Zenodo

License: MIT License

A short and simple permissive license with conditions only requiring preservation of copyright and license notices. Licensed works, modifications, and larger works may be distributed under different terms and without source code. Read more

Technical metadata

Created: May 27, 2026
Modified: May 27, 2026

Patcher: Post-Hoc Patching of Backdoored Large Language Models

Authors/Creators

Description

Files

Patcher.zip

Files (34.1 kB)