Anonymized Dataset for: The Paternalistic Filter in LLM-Mediated History Education

Anonymous, Author(s)

doi:10.5281/zenodo.19891086

Published April 29, 2026 | Version v1

Dataset | Open

This repository contains the data and methodology files for a double-blind peer-reviewed study evaluating Large Language Model (LLM) bias in history education. The dataset captures 1,800 API responses from four models (GPT-OSS, LLaMA, DeepSeek, Kimi K2) acting as history tutors discussing the 1989 Romanian Revolution. Each response is associated with one of five student personas, which vary by socio-economic tier and ethnicity.

Files included in this dataset:

Dataset_All_Prompts.csv: The consolidated dataset containing the raw API responses across all three prompt structures: general explanations (P1), causes and consequences (P2), and epistemic justification scores from 1 to 10 (P3). This data supports the study's complete textual analysis (Type-Token Ratio, Agency Theft, Coup Gap) and hesitation metrics.
Personas.txt: The complete definitions of the five student profiles (Baseline, Roma Minority, Hungarian Minority, Top Tier, Low Tier) used in the system prompts.
Prompts.txt: The exact system instructions and user prompts used to evaluate the models.
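
The description names Type-Token Ratio among the textual analyses but does not give the tokenization or normalization the study used. As a minimal sketch of the standard definition (unique tokens divided by total tokens), assuming lowercase word tokens made of letters and optional apostrophes, one response could be scored as follows. The function name `type_token_ratio` and the tokenization rule are illustrative assumptions, not the authors' implementation.

```python
import re


def type_token_ratio(text):
    """Return unique tokens / total tokens for one response.

    Assumption: tokens are lowercase runs of Unicode letters, optionally
    joined by an apostrophe (e.g. "didn't"). The study's actual
    tokenization is not documented in this record.
    """
    tokens = re.findall(r"[^\W\d_]+(?:'[^\W\d_]+)?", text.lower())
    if not tokens:
        # An empty response has no tokens; report 0.0 instead of dividing by zero.
        return 0.0
    return len(set(tokens)) / len(tokens)
```

Type-Token Ratio falls as texts get longer, so comparisons between personas or models are most meaningful when responses are of similar length or when a length-normalized variant is used.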

Files

Files (4.2 MB)

Name	MD5 checksum	Size
Dataset_All_Prompts.csv	62db153e476b1eb1110f9474cf37c97c	4.2 MB
Personas.txt	4dd0dc6e314edd8df51ab8f4f186dce4	1.3 kB
Prompts.txt	36a8f264e7d155a9c1b4a2287021aa24	1.1 kB
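
The MD5 checksums listed for the three files can be used to confirm that a download is intact. The following is a minimal sketch, assuming the files were saved under their original names in a single local directory; the helper names `md5_of_file` and `verify_downloads` are hypothetical and not part of the dataset.

```python
import hashlib
from pathlib import Path

# Checksums as listed on this record.
EXPECTED_MD5 = {
    "Dataset_All_Prompts.csv": "62db153e476b1eb1110f9474cf37c97c",
    "Personas.txt": "4dd0dc6e314edd8df51ab8f4f186dce4",
    "Prompts.txt": "36a8f264e7d155a9c1b4a2287021aa24",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory="."):
    """Map each expected file name to True if present with a matching MD5."""
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        results[name] = path.is_file() and md5_of_file(path) == expected
    return results
```

Calling `verify_downloads("path/to/downloads")` returns a dictionary such as `{"Personas.txt": True, ...}`; a `False` entry means the file is missing or its contents differ from the published version.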

Resource type

Dataset

Publisher

Zenodo

Published in

Conversational Agents for Teaching and Support (CATS) 2026, 2026.

Conference

27th International Conference on Artificial Intelligence in Education (AIED 2026) (CATS 2026), Hybrid / Online, June 2026

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

Technical metadata

Created: April 29, 2026
Modified: April 29, 2026