Dataset Open Access
This repository contains 25 Wikitext-103 LSTM models and 25 LSTM models trained on a 100 million token subset of the OpenWebTextCorpus. Training/validation/test data is included with the Web models. By-epoch validation perplexity is given in the logs (within the directory for the models). Please write to me if you have any questions :)
Name | Size | |
---|---|---|
openwebtextcorpus-25-models.tar.gz
md5:21616173d195a6c1f19fd447fad41c65 |
2.3 GB | Download |
wikitext103-25-models.tar.gz
md5:189990cac92603769d9d2c4531e5aa9b |
2.1 GB | Download |
All versions | This version | |
---|---|---|
Views | 100 | 100 |
Downloads | 25 | 25 |
Data volume | 54.7 GB | 54.7 GB |
Unique views | 90 | 90 |
Unique downloads | 16 | 16 |