Published August 13, 2025 | Version v1
Dataset
Open
Wikipedia Dec 2021 Text List (100-sec chunks)
Creators
Description
A processed Wikipedia dump from December 2021, split into ~100-second chunks of text for NLP and IR research.
Files
(20.9 GB)
Name | Size |
---|---|
md5:744c287f51fe8f60eac3e3f6b883c294 | 20.9 GB |
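Because the archive is about 20.9 GB, it may be worth checking a finished download against the MD5 checksum in the file listing before using it. The following is a minimal sketch using only the Python standard library. The local path `wikipedia_dec_2021.download` and the helper names `md5_of_file` and `verify_download` are hypothetical placeholders, since the record does not give the file name; substitute the name of the file you actually downloaded.

```python
import hashlib
from pathlib import Path

# Checksum copied from the file listing of this record.
EXPECTED_MD5 = "744c287f51fe8f60eac3e3f6b883c294"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks
    so that a multi-gigabyte download is never loaded into memory at once."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for block in iter(lambda: handle.read(chunk_size), b""):
            digest.update(block)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected hex string."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Hypothetical local file name; the record does not state the real one.
    local_path = Path("wikipedia_dec_2021.download")
    if local_path.exists():
        print("checksum OK" if verify_download(local_path) else "checksum MISMATCH")
    else:
        print(f"{local_path} not found; set local_path to your downloaded file")
```

A mismatch usually points to an interrupted or corrupted transfer, in which case downloading the file again is the simplest remedy. MD5 is adequate here as an integrity check against accidental corruption, not as a security guarantee.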