There is a newer version of the record available.

Published January 24, 2025 | Version v1
Software Restricted

JBShield: Defending Large Language Models from Jailbreak Attacks through Activated Concept Analysis and Manipulation

Authors/Creators

  • 1. ROR icon Wuhan University

Description

This is the artifact for USENIX Security 2025 accepted paper "JBShield: Defending Large Language Models from Jailbreak Attacks through Activated Concept Analysis and Manipulation." To test the artifact, please download and uncompress the zip file. 

Files

Restricted

The record is publicly accessible, but files are restricted. <a href="https://zenodo.org/account/settings/login?next=https://zenodo.org/records/14732884">Log in</a> to check if you have access.