There is a newer version of the record available.

Published August 21, 2019 | Version v4
Dataset Open

Learning Lenient Parsing & Typing via Indirect Supervision

  • 1. UC Davis

Description

Validation Set: We have shared 3 CSV files containing human-annotated validation sets of our paper (Validation Data.zip).

AST and Student Code Correction:  For generating AST and code correction, please check the two files, AST.py and Top1.py ( in AST & Top-1.zip ). In AST.py, we present the output of different parts of the program with an example. Please read that one before Top1.py. We follow the implementation of https://github.com/Lsdefine/attention-is-all-you-need-keras. Please check the remaining code in the above link. We made a minor correction in the dataloader.py to use two separate vocabulary cutoffs for input and output. dataloader1.py, transformer1.py, etc. are an exact replication of dataloader.py and transformer.py. Since we are using two models, we did it that way to avoid any conflict.

TypeFix: Check the code in TypeFix.zip.

We have also published data set for FragFix and BlockFix.

Edit (adding complete code): Please find the complete FragFix+BlockFix pipeline in FragFix_BlockFix.zip. Please use Tensorflow 1.13 and Keras 2.2.4 for the shared code.

 

Files

AST & Top-1.zip

Files (914.0 MB)

Name Size Download all
md5:6df2d1ac98e8d9201b1b475c7d01bea6
16.5 kB Preview Download
md5:8f58054ff346ed7f3e73ca1a408d2f38
117.1 MB Preview Download
md5:9df26570e575743f0b849f6730797690
103.3 MB Preview Download
md5:01de2b5a37e790f97cc3c8aa076bc6f4
693.5 MB Preview Download
md5:44881dfef8d95148d93984d7c46626de
10.4 kB Preview Download
md5:71aabd74ee5d4682357b14595a8ba212
44.0 kB Preview Download