Published June 7, 2025 | Version v1
Dataset
Open
PrivacyXray: Detecting Privacy Breaches in LLMs through Semantic Consistency and Probability Certainty
Authors/Creators
Description
This artifact contains the code and dataset used in our paper to analyze and classify privacy leakage behaviors in LLMs. It includes scripts for model fine-tuning, hidden state extraction, and classification, as well as training data.
Files
| Name | Size | MD5 |
|---|---|---|
| privacyxray.zip | 27.1 MB | 26cf7963e2e3d091e06f7770edc8b0d3 |
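The MD5 digest listed for privacyxray.zip can be used to confirm that a download is intact. The following is a minimal sketch using only the Python standard library; the helper names `md5_of_file` and `matches_checksum` are illustrative and are not part of the artifact, and the local path to the archive is an assumption.

```python
import hashlib
from pathlib import Path

# MD5 digest listed for privacyxray.zip on this record.
EXPECTED_MD5 = "26cf7963e2e3d091e06f7770edc8b0d3"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with Path(path).open("rb") as fh:
        # Read fixed-size chunks so the whole archive is never held in memory.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 equals the expected digest (hypothetical helper)."""
    return md5_of_file(path) == expected.lower()
```

Assuming the archive was saved to the current directory, `matches_checksum("privacyxray.zip")` should return `True` for an uncorrupted download. MD5 is suitable here for detecting transfer errors, not for defending against deliberate tampering.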