PrivacyXray: Detecting Privacy Breaches in LLMs through Semantic Consistency and Probability Certainty

He, Jinwen; Lu, Yiyang; Lin, Zijin; Zhao, Yue; Chen, Kai

doi:10.5281/zenodo.15615045

There is a newer version of the record available.

Published June 7, 2025 | Version v1

Dataset Open


This artifact contains the code and dataset used in our paper to analyze and classify privacy leakage behaviors in LLMs. It includes scripts for model fine-tuning, hidden state extraction, and classification, as well as training data.

Files


Name	Size
privacyxray.zip (md5:26cf7963e2e3d091e06f7770edc8b0d3)	27.1 MB
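
The record publishes an MD5 checksum for the archive, so a download can be checked before it is unpacked. The following is a minimal Python sketch, assuming the archive was saved locally as privacyxray.zip in the working directory; the helper names md5_of_file and verify_download are hypothetical and are not part of the artifact.

```python
import hashlib
from pathlib import Path

# MD5 digest published on the record page for privacyxray.zip.
EXPECTED_MD5 = "26cf7963e2e3d091e06f7770edc8b0d3"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True when the file's MD5 matches the expected digest."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    archive = Path("privacyxray.zip")  # assumed local download location
    if not archive.exists():
        print(f"{archive} not found; download it from the record first")
    elif verify_download(archive):
        print("checksum OK")
    else:
        print("checksum mismatch; re-download the archive")
```

MD5 is adequate for catching a truncated or corrupted download, but it is not collision-resistant, so a match should not be treated as proof of authenticity.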


	All versions	This version
Views	187	45
Downloads	80	15
Data volume	2.4 GB	433.1 MB


Resource type: Dataset
Publisher: Zenodo

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

Technical metadata

Created: June 7, 2025
Modified: June 7, 2025
