JailFact-Bench: A Comprehensive Analysis of Jailbreak Attacks vs. Hallucinations in LLMs

Nambiar, Sanjana; Poepper, Christina

doi:10.5281/zenodo.15318905

Published May 1, 2025 | Version v3

Dataset Open

1. New York University Abu Dhabi
2. New York University Abu Dhabi (NYUAD)
3. Ruhr University Bochum

JailFact-Bench is a curated benchmark dataset for analyzing jailbreak attacks and hallucination patterns in Large Language Models (LLMs). It contains semantically aligned jailbreak and factuality prompts, along with metadata including toxicity shifts, similarity scores, and annotation strategies. Developed at NYU Abu Dhabi under Professor Christina Pöpper, this dataset accompanies the paper accepted at the SiMLA 2025 Workshop, co-located with the 23rd International Conference on Applied Cryptography and Network Security (ACNS).

Files

Files (24.5 kB)

Name	Size	MD5
jailfact-bench.xlsx	22.0 kB	3b2456c4cda0b65710b4984cadd54b24
README.md	2.5 kB	28060159ebec18c06210c82c0113d2fc
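
The MD5 checksums in the file listing above can be used to confirm that a download is intact. The following is a minimal sketch, not part of the record itself: it assumes both files were saved under their original names in a single local directory, and the helper names `md5_of` and `verify` are hypothetical.

```python
import hashlib
from pathlib import Path

# Checksums copied from the record's file listing.
EXPECTED_MD5 = {
    "jailfact-bench.xlsx": "3b2456c4cda0b65710b4984cadd54b24",
    "README.md": "28060159ebec18c06210c82c0113d2fc",
}


def md5_of(path, chunk_size=65536):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(directory="."):
    """Map each expected file name to True (match), False (mismatch),
    or None (file not found in the given directory)."""
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        if path.is_file():
            results[name] = (md5_of(path) == expected)
        else:
            results[name] = None
    return results
```

Calling `verify("path/to/download/dir")` returns one entry per file. A value of `False` suggests a corrupted or modified download, and `None` means the file was not found under the assumed name.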

Additional details

Created: 2025-04-30

Dataset creation and submission date

Statistic	All versions	This version
Views	567	453
Downloads	170	88
Data volume	2.1 MB	1.1 MB

Resource type

Dataset

Publisher

Zenodo

Conference

SiMLA 2025 Workshop: Securing Intelligent Machines through Language Alignment (SiMLA 2025), LMU Munich, Germany, 2025-06-26

Languages

English

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows redistribution and reuse of a licensed work on the condition that the creator is appropriately credited.

Technical metadata

Created: May 1, 2025
Modified: May 1, 2025
