Published September 2, 2021 | Version v1
Dataset Open

Automating Code Review Activities 2.0 (datasets, models and results)

Description

Resources related to the research work "Automating Code Review Activities 2.0".

  • automating_code_review.zip contains the material to successfully run our Colab notebooks;
  • dataset.zip contains all the preprocessed datasets used in our work;
  • generate_prediction.zip contains the material needed to generate predictions using a T5 model checkpoint;
  • models.zip contains the (best) checkpoints of the fine-tuned T5 models;
  • results.zip contains our results;
  • tokenizer.zip contains the SentencePiece model and vocabulary trained on our pre-training dataset.

More information is available in the replication package of our work: code_review_autmoation

Files

Files (2.9 GB)

MD5 checksum                              Size
md5:51e8cbe20ec150ba03d853526dd31689      713.2 MB
md5:63dc5c774b5e20ccdb3c0aae19e16c75      614.4 MB
md5:a56b70e5e5ddb908b7a711bb7e2c9a0d      365.0 MB
md5:67adafb6c10796c81a2cb43dac98470b      1.1 GB
md5:1130d23c9c147bfd3b0002bc014a5522      157.0 MB
md5:976cc134497e2c38fea869c340d0d811      639.2 kB
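
The MD5 checksums listed above can be used to confirm that an archive downloaded intact. The following is a minimal sketch, not part of the original record: the function names `md5_of` and `matches_listed_md5` are hypothetical, and it assumes the archive has already been saved locally. The listing above does not pair file names with checksums, so the checksum for a given archive has to be taken from the record page itself.

```python
import hashlib


def md5_of(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in chunks so multi-gigabyte archives are never loaded whole.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_md5(path, listed):
    """Compare a file against a checksum written as 'md5:<hex>' or '<hex>'."""
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of(path) == expected
```

For example, `matches_listed_md5("models.zip", listed_checksum)` returns `True` only when the local file hashes to the value copied from the record into `listed_checksum`; both the local path and the variable are placeholders here.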