Published May 19, 2020 | Version 1.0
Dataset Open

OpenWebText

  • 1. Brown University

Description

An open-source replication of the WebText dataset from OpenAI.

For more information, see https://skylion007.github.io/OpenWebTextCorpus/

@misc{Gokaslan2019OpenWeb,  
	title={OpenWebText Corpus},
	author={Aaron Gokaslan* and Vanya Cohen* and Ellie Pavlick and Stefanie Tellex},
	howpublished={\url{http://Skylion007.github.io/OpenWebTextCorpus}},
	year={2019}
}

Files

Size: 12.9 GB
MD5 checksum: de5b8055484816de0640e928cf81099d
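
The MD5 checksum listed above can be used to check that a 12.9 GB download arrived intact. The following is a minimal sketch, not part of the original record: the function names are illustrative, and the archive path in the usage note is a placeholder, because the file name is not shown in the listing. The file is read in chunks so the whole archive never has to fit in memory.

```python
import hashlib

# Checksum copied from the file listing on this page.
EXPECTED_MD5 = "de5b8055484816de0640e928cf81099d"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of the file at `path`, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns empty bytes (EOF).
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest equals the expected hex string."""
    return md5_of_file(path) == expected.lower()
```

For example, `matches_expected("path/to/downloaded_archive")` (placeholder path) would return `True` only if the downloaded file is byte-for-byte identical to the one the checksum was computed from. MD5 is adequate for detecting accidental corruption, though it is not a safeguard against deliberate tampering.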