Journal article Open Access

Appendix to "Simple Dataset for Proof Method Recommendation in Isabelle/HOL (Dataset Description)"

Nagashima, Yutaka

Recently, a growing number of researchers have applied machine learning to assist users of interactive theorem provers.
However, the expressive nature of underlying logics and esoteric structures of proof documents impede machine learning practitioners,
who often do not have much expertise in formal logic, let alone Isabelle/HOL, from achieving a large scale success in this field. 
In this data description, we present a simple dataset that contains data on over 400k proof method applications along with over 100 extracted features for each in a format that can be processed easily without any knowledge about formal logic.
Our simple data format allows machine learning practitioners to try machine learning tools to predict proof methods in Isabelle/HOL without requiring domain expertise in logic.

This is an appendix to our paper "Simple Dataset for Proof Method Recommendation in Isabelle/HOL (Dataset Description)" accepted at the 13th Conference on Intelligent Computer Mathematics (CICM 2020). This work was supported by the European Regional Development Fund under the project AI & Reasoning (reg. no.CZ.02.1.01/0.0/0.0/15_003/0000466) and by NII under NII-Internship Program 2019-2nd call.
Files (283.7 kB)
Name Size
283.7 kB Download
All versions This version
Views 5959
Downloads 2626
Data volume 7.4 MB7.4 MB
Unique views 5454
Unique downloads 2525


Cite as