Published March 31, 2025 | Version v4
Software Open

JBShield: Defending Large Language Models from Jailbreak Attacks through Activated Concept Analysis and Manipulation

  • 1. ROR icon Wuhan University

Description

This is the artifact for USENIX Security 2025 accepted paper "JBShield: Defending Large Language Models from Jailbreak Attacks through Activated Concept Analysis and Manipulation." To test the artifact, please download and uncompress the zip file. You can also refer to https://github.com/NISPLab/JBShield.

Files

JBShield-final.zip

Files (32.7 MB)

Name Size Download all
md5:0c3588b6d7fae0c8a0b0ab9edbaae69c
32.7 MB Preview Download