Published March 31, 2025
| Version v4
Software
Open
JBShield: Defending Large Language Models from Jailbreak Attacks through Activated Concept Analysis and Manipulation
Description
This is the artifact for USENIX Security 2025 accepted paper "JBShield: Defending Large Language Models from Jailbreak Attacks through Activated Concept Analysis and Manipulation." To test the artifact, please download and uncompress the zip file. You can also refer to https://github.com/NISPLab/JBShield.
Files
JBShield-final.zip
Files
(32.7 MB)
Name | Size | Download all |
---|---|---|
md5:0c3588b6d7fae0c8a0b0ab9edbaae69c
|
32.7 MB | Preview Download |