Published January 19, 2026
| Version v1
Dataset
Open
Sample Dataset for AI-Generated Scientific Storytelling
Authors/Creators
Description
This repository contains the dataset used to fine-tune an AI Scientific Storyteller .
The released data represents the training material and is provided to illustrate the structure, format, and intermediate representations used in the scientific storytelling pipeline.
Contents:
- new_dataset.json: metadata describing scientific papers and associated narrative sources.
- new_parsed_output.json: parsed text of scientific papers used as model input.
- new_stories_with_text.json: narrative texts used as supervision for story generation.