Published August 13, 2025 | Version v1
Dataset Open

Wikipedia Dec 2021 Text List (100-sec chunks)

Description

A processed Wikipedia dump from December 2021, split into ~100-second chunks of text for natural language processing (NLP) and information retrieval (IR) research.

Files (20.9 GB)

Size: 20.9 GB
Checksum: md5:744c287f51fe8f60eac3e3f6b883c294
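
Because the archive is large, it may be worth checking the download against the listed MD5 checksum before using it. The following is a minimal sketch in Python, not part of the dataset itself. The function names `md5_of_file` and `verify_download` are illustrative, and the local file path is an assumption, since the listing above does not show the file name.

```python
import hashlib

# Checksum copied from the file listing (without the "md5:" prefix).
EXPECTED_MD5 = "744c287f51fe8f60eac3e3f6b883c294"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB blocks
    so that a ~20 GB file never has to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for block in iter(lambda: handle.read(chunk_size), b""):
            digest.update(block)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.strip().lower()
```

Calling `verify_download("path/to/downloaded_file")` (a hypothetical path) returns `True` only when the local copy matches the published checksum. A `False` result usually points to a truncated or corrupted download.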