Published September 27, 2020 | Version v1
Dataset Open

Wikitext-103 and OpenWebText Models

  • 1. Cornell University

Description

This repository contains 25 LSTM models trained on Wikitext-103 and 25 LSTM models trained on a 100-million-token subset of the OpenWebTextCorpus. Training, validation, and test data are included with the Web models. By-epoch validation perplexity is given in the logs, which are located within the models' directory. Please write to me if you have any questions :)

Files (4.4 GB)

Checksum                                Size
md5:21616173d195a6c1f19fd447fad41c65    2.3 GB
md5:189990cac92603769d9d2c4531e5aa9b    2.1 GB
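
Since the two archives are large, it may be worth checking a download against the MD5 checksums listed above before unpacking it. The following is a minimal sketch, not part of the original record: the helper names `md5_of_file` and `is_listed_checksum` are hypothetical, and because the listing does not say which checksum belongs to which archive, the check only tests whether a file's digest matches either listed value.

```python
import hashlib

# MD5 values copied from the file listing above. The listing does not
# name the archives, so the two values are kept as an unordered set.
LISTED_MD5 = {
    "21616173d195a6c1f19fd447fad41c65",
    "189990cac92603769d9d2c4531e5aa9b",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks
    so that a multi-gigabyte archive is never held in memory at once."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_listed_checksum(path):
    """Return True if the file's MD5 digest matches one of the listed values."""
    return md5_of_file(path) in LISTED_MD5
```

For example, calling `is_listed_checksum` on the local path of a downloaded archive should return `True` if the transfer completed without corruption. The path is whatever name the file was saved under; no file names are given in the listing.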