Published September 27, 2020 | Version v1
Dataset Open

Wikitext-103 and OpenWebText Models

  • 1. Cornell University


This repository contains 25 LSTM models trained on Wikitext-103 and 25 LSTM models trained on a 100-million-token subset of the OpenWebTextCorpus. Training, validation, and test data are included with the OpenWebText models. Per-epoch validation perplexity is recorded in the logs, which are in the same directory as the models. Please write to me if you have any questions :)
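For readers comparing the logged validation numbers against their own evaluation runs, here is a minimal sketch of how perplexity is conventionally computed from a language model's loss. It assumes the logs report the standard per-token perplexity, i.e. the exponential of the mean negative log-likelihood in nats; the record does not describe the log format, so nothing here parses the logs. The function name `perplexity` and the toy numbers are illustrative only.

```python
import math


def perplexity(total_nll_nats: float, num_tokens: int) -> float:
    """Return exp(mean negative log-likelihood per token), with NLL in nats."""
    if num_tokens <= 0:
        raise ValueError("num_tokens must be positive")
    return math.exp(total_nll_nats / num_tokens)


# Hypothetical example (not taken from the repository's logs):
# a model that assigns probability 1/4 to each of 10 tokens
# has a total NLL of 10 * ln(4), so its perplexity is approximately 4.
example_nll = 10 * math.log(4)
print(perplexity(example_nll, 10))
```

If an evaluation script reports the loss in bits rather than nats, the equivalent computation is 2 raised to the mean per-token loss.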


Files (4.4 GB)

Name Size
2.3 GB
2.1 GB