Published April 23, 2017 | Version v1
Conference paper Open

Automating Formalization by Statistical and Semantic Parsing of Mathematics

  • 1. Czech Technical University in Prague, Czech Republic

Description

We discuss the progress in our project which aims to automate formalization by combining natural language processing with deep semantic understanding of mathematical expressions. We introduce the overall motivation and ideas behind this project, and then propose a context-based parsing approach that combines efficient statistical learning of deep parse trees with their semantic pruning by type checking and large-theory automated theorem proving. We show that our learning method allows efficient use of large amount of contextual information, which in turn significantly boosts the precision of the statistical parsing
and also makes it more efficient. This leads to a large improvement of our first results in parsing theorems from the Flyspeck corpus.

Files

10.1007/978-3-319-66107-0_2.pdf

Files (359.4 kB)

Name Size Download all
md5:38021aff387f368fccfa425ca23aace6
359.4 kB Preview Download

Additional details

Funding

European Commission
AI4REASON - Artificial Intelligence for Large-Scale Computer-Assisted Reasoning 649043
European Commission
SMART - Strong Modular proof Assistance: Reasoning across Theories 714034