Published March 26, 2024 | Version v1
Dataset Open

MAD Benchmark

  • 1. Stanford University

Contributors

Contact person:

  • 1. Stanford University

Description

Mechanistic Architecture Design (in-short "MAD") represents a simple framework to accelerate the deep learning architecture design process. MAD uses simple synthetic tasks that can be implemented quickly and without much compute to predict how well new candidate architectures will perform at scale in sequence modeling. Each synthetic task is specifically designed to probe skills of a model relevant for squence modeling, such as compression and recall.

Here, we host a set of benchmark datasets for MAD. For more details, see our accompanying GitHub repository: github.com/athms/mad-lab

Files

data.zip

Files (335.5 MB)

Name Size Download all
md5:b0a4541443c7e30fc1063daaa7457a79
335.5 MB Preview Download

Additional details

Software