Published March 14, 2020 | Version 1.0.0
Dataset Open

Superposition for Lambda-Free Higher-Order Logic — Supplementary Material for the Journal Article

  • 1. Vrije Universiteit Amsterdam
  • 2. Aesthetic Integration
  • 3. Max-Planck-Institut für Informatik, Saarland Informatics Campus

Description

We provide the following supplementary material for our article.

Zipperposition

Compilation instructions for Zipperposition, including instructions for compiling it for StarExec, can also be found in the Zipperposition readme. We used OCaml 4.07.0 and the branch lmcs2020 at commit 2031e216c1941acd76187882a073e8f1e53383f2.

Problems

For the evaluation, we used the following first-order (TFF) and higher-order (THF) TPTP (v7.3.0) problems: TFF problem list, THF problem list. These lists were obtained by excluding all problems that contain arithmetic, the symbols (@@+)(@@-)(@+)(@-)(&), or tuples; the SYN000 problems, which are only intended to test the parser; and problems whose clausal normal form takes longer than 15 s to compute or falls outside the lambda-free fragment. The following archive contains instructions on how the benchmarks were selected: Benchmark selection

Note that Zipperposition is not aware that our calculi are complete for this fragment, so it always reports "GaveUp" instead of "CounterSatisfiable" when the calculus saturates.
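When post-processing the prover output, the reported status can be read from the status line. The following is a minimal sketch, assuming the output contains a line of the form "% SZS status <Status> for <problem>" as in the TPTP SZS convention; the function names and the set of statuses counted as proofs are illustrative assumptions, not part of the run scripts.

```python
import re
from typing import Optional

# Assumed line format (SZS convention): "% SZS status <Status> for <problem>".
_SZS_LINE = re.compile(r"^\s*[%#]\s*SZS status\s+(\w+)", re.MULTILINE)

# Assumption: these two statuses are the ones counted as a successful proof.
PROOF_STATUSES = {"Theorem", "Unsatisfiable"}


def szs_status(output: str) -> Optional[str]:
    """Return the first SZS status word found in the output, or None."""
    match = _SZS_LINE.search(output)
    return match.group(1) if match else None


def is_proved(output: str) -> bool:
    """True if the output reports one of the assumed proof statuses."""
    return szs_status(output) in PROOF_STATUSES
```

With this sketch, a saturated run shows up as the status "GaveUp", not "CounterSatisfiable", in line with the note above.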

The selection of TPTP problems and the problems generated by Isabelle/Sledgehammer can be downloaded here: Benchmarks

Run scripts

We used the following run scripts on StarExec. This archive also contains the Zipperposition binary, compiled for StarExec: StarExec run scripts

The scripts use the following command-line options for Zipperposition:

  • First-order mode:
    ./zipperposition.exe --mode=fo-complete-basic
  • Applicative encoding mode (intensional):
    ./zipperposition.exe --mode=fo-complete-basic --app-encode=intensional
  • Applicative encoding mode (extensional):
    ./zipperposition.exe --mode=fo-complete-basic --app-encode=extensional
  • Nonpurifying intensional calculus:
    ./zipperposition.exe --mode=lambda-free-intensional
  • Nonpurifying extensional calculus:
    ./zipperposition.exe --mode=lambda-free-extensional
  • Purifying intensional calculus:
    ./zipperposition.exe --mode=lambda-free-purify-intensional
  • Purifying extensional calculus:
    ./zipperposition.exe --mode=lambda-free-purify-extensional
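For scripting runs over all seven configurations, the list above can be written as a lookup table. This is a minimal sketch: the flags are copied from the list, while the dictionary keys and the function name `mode_command` are hypothetical labels chosen for illustration.

```python
from typing import Dict, List

# Flags copied from the list above; the keys are hypothetical labels.
MODE_FLAGS: Dict[str, List[str]] = {
    "first-order": ["--mode=fo-complete-basic"],
    "app-encode-intensional": ["--mode=fo-complete-basic", "--app-encode=intensional"],
    "app-encode-extensional": ["--mode=fo-complete-basic", "--app-encode=extensional"],
    "nonpurifying-intensional": ["--mode=lambda-free-intensional"],
    "nonpurifying-extensional": ["--mode=lambda-free-extensional"],
    "purifying-intensional": ["--mode=lambda-free-purify-intensional"],
    "purifying-extensional": ["--mode=lambda-free-purify-extensional"],
}


def mode_command(config: str, binary: str = "./zipperposition.exe") -> List[str]:
    """Return the argument vector for one configuration, without problem file."""
    return [binary, *MODE_FLAGS[config]]
```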

As additional command-line arguments, we provided the problem's filename, the order (--ord=lambdafree_rpo or --ord lambdafree_kbo or --ord epo), and the following parameters for heuristics that were obtained by optimizing the first-order mode in preliminary experiments:

--kbo-weight-fun=modarity \
-q "7|prefer-sos|pnrefined(2,1,1,1,2,2,2)" \
-q "4|prefer-short-trail|pnrefined(1,1,1,2,2,2,0.5)" \
-q "1|prefer-processed|fifo" \
-q "7|prefer-ground|conjecture-relative-var(1,l,f)" \
-q "6|prefer-goals|conjecture-relative-var(1,s,f)" \
--select=e-selection7
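The pieces above (mode options, problem file, order, heuristic parameters) can be assembled into one command line as in the following sketch. The position of the arguments, the function name `full_command`, and the file name `problem.p` in the usage note are assumptions for illustration; the actual invocation is the one in the StarExec run scripts.

```python
import shlex
from typing import List

# Order names as given in the text above.
ORDERS = ("lambdafree_rpo", "lambdafree_kbo", "epo")

# Heuristic parameters copied from the listing above.
HEURISTIC_ARGS = [
    "--kbo-weight-fun=modarity",
    "-q", "7|prefer-sos|pnrefined(2,1,1,1,2,2,2)",
    "-q", "4|prefer-short-trail|pnrefined(1,1,1,2,2,2,0.5)",
    "-q", "1|prefer-processed|fifo",
    "-q", "7|prefer-ground|conjecture-relative-var(1,l,f)",
    "-q", "6|prefer-goals|conjecture-relative-var(1,s,f)",
    "--select=e-selection7",
]


def full_command(mode_flags: List[str], problem: str, order: str,
                 binary: str = "./zipperposition.exe") -> List[str]:
    """Build the argument vector; the argument order is an assumption."""
    if order not in ORDERS:
        raise ValueError(f"unknown order: {order}")
    return [binary, *mode_flags, problem, f"--ord={order}", *HEURISTIC_ARGS]


def as_shell_line(cmd: List[str]) -> str:
    """Quote the arguments so that '|' and parentheses survive the shell."""
    return shlex.join(cmd)
```

For example, `as_shell_line(full_command(["--mode=lambda-free-intensional"], "problem.p", "lambdafree_kbo"))` yields a single quoted line that can be pasted into a shell. Passing the list form directly to a process launcher avoids shell quoting altogether.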

On StarExec, we chose a wall-clock timeout of 360 s, a CPU timeout of 180 s, and a memory limit of 128 GB. StarExec's machine specifications are:

Intel(R) Xeon(R) CPU E5-2609 0 @ 2.40GHz (2393 MHZ)
10240  KB Cache
263932744 kB main memory
OS:       CentOS Linux release 7.7.1908 (Core)
kernel:   3.10.0-1062.4.3.el7.x86_64
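To approximate the time limits above when rerunning a single problem outside StarExec, something like the following sketch could be used. It is an assumption-laden local stand-in, not how StarExec enforces its limits: it works on Unix-like systems only, the function name `run_with_limits` is hypothetical, and the 128 GB memory limit is not enforced here.

```python
import resource
import subprocess
from typing import List, Optional

WALLCLOCK_LIMIT_S = 360  # wall-clock timeout from the text above
CPU_LIMIT_S = 180        # CPU timeout from the text above


def _limit_cpu() -> None:
    # Runs in the child process before exec: lower the soft CPU limit.
    # The hard limit is left unchanged so that the call cannot fail by
    # trying to raise it.
    _, hard = resource.getrlimit(resource.RLIMIT_CPU)
    soft = CPU_LIMIT_S if hard == resource.RLIM_INFINITY else min(CPU_LIMIT_S, hard)
    resource.setrlimit(resource.RLIMIT_CPU, (soft, hard))


def run_with_limits(cmd: List[str]) -> Optional[subprocess.CompletedProcess]:
    """Run cmd under both limits; return None on wall-clock timeout."""
    try:
        return subprocess.run(
            cmd,
            capture_output=True,
            text=True,
            timeout=WALLCLOCK_LIMIT_S,
            preexec_fn=_limit_cpu,
        )
    except subprocess.TimeoutExpired:
        return None
```

A process that exceeds the CPU limit is terminated by a signal, which shows up as a negative `returncode` on the returned object.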

Results

Download the raw output of the evaluation and the .csv files created by StarExec here:

Examples

We tested the examples given in our paper in Zipperposition. Here are the problem files we used. Some are in TPTP format (.p) and some are in Zipperposition format (.zf).

Files

benchmark_selection.zip

Files (200.6 MB)

MD5 checksum                            Size
md5:af7dae6d406ebfa63f76e503913492e0    195.8 kB
md5:791f9e583fca037adddf7fa29e4b9913    88.1 MB
md5:4bde314f3278f3c14a6d5acfe220e2ae    2.2 kB
md5:d6c27bbd92c6821025896faf55e7b0eb    656 Bytes
md5:5693ba6fe577492441cd65cc77349b1e    755 Bytes
md5:7359d1d798c82d0c4bc9c7e1e631df73    1.1 kB
md5:4de47a8f5bca4942c9555685d4674c1d    682 Bytes
md5:ec7b250b2ef96d1a90531d97f683bfab    10.6 kB
md5:4b0178fb15b660d97873b165c8840236    10.8 kB
md5:66408491dbd066dc95216795f1bd277d    1.0 MB
md5:d59d66d2f09068873a2dd145814a1ce4    63.6 MB
md5:ec152495f1ce08a67e30f34c615967af    18.0 MB
md5:a2f574287b1dd276a4452416666d9694    10.5 MB
md5:bf0dfe48b625fc3dce3e43b45f32691f    15.1 MB
md5:802cac73ec3197ce130cbb8b24fccffc    4.0 MB
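The MD5 checksums listed above can be used to check a downloaded file for corruption. A minimal sketch follows; the function names are hypothetical, and the caller supplies the local path and the checksum entry, since the listing does not pair checksums with file names.

```python
import hashlib


def md5_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path: str, listed: str) -> bool:
    """Compare a file against an entry such as 'md5:<32 hex digits>'."""
    expected = listed[4:] if listed.startswith("md5:") else listed
    return md5_of_file(path) == expected.strip().lower()
```

Reading in chunks keeps memory use flat for the larger archives.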

Additional details

Funding

Matryoshka – Fast Interactive Verification through Strong Higher-Order Automation 713999
European Commission