There is a newer version of the record available.

Published March 26, 2021 | Version v1
Dataset Open

NaturalProofs Dataset

  • 1. University of Washington
  • 2. Allen Institute for Artificial Intelligence
  • 3. New York University

Description

The NaturalProofs Dataset is a large-scale dataset for studying mathematical reasoning in natural language. NaturalProofs consists of roughly 20,000 theorem statements and proofs, 12,500 definitions, and 1,000 additional pages (e.g. axioms, corollaries) derived from ProofWiki, an online compendium of mathematical proofs written by a community of contributors. 

Please see the description in [Welleck et al 2021] for more details.

Files

dataset.json

Files (111.2 MB)

Name Size Download all
md5:2c709a9513b16b4c631b3b94e8893716
111.2 MB Preview Download

Additional details

Related works