Published March 26, 2021 | Version v1
Dataset
Open
NaturalProofs Dataset
- 1. University of Washington
- 2. Allen Institute for Artificial Intelligence
- 3. New York University
Description
The NaturalProofs Dataset is a large-scale dataset for studying mathematical reasoning in natural language. NaturalProofs consists of roughly 20,000 theorem statements and proofs, 12,500 definitions, and 1,000 additional pages (e.g. axioms, corollaries) derived from ProofWiki, an online compendium of mathematical proofs written by a community of contributors.
Please see the description in [Welleck et al., 2021] for more details.
Files
Name | Size |
---|---|
dataset.json (md5:2c709a9513b16b4c631b3b94e8893716) | 111.2 MB |
Additional details
Related works
- Is documented by
- Preprint: https://wellecks.github.io/naturalproofs/welleck2021naturalproofs.pdf (URL)
- Is supplemented by
- Other: https://github.com/wellecks/naturalproofs (URL)