Published April 12, 2024 | Version v2
Conference paper Open

Mining Factors in Review Comment Generation

Creators

  • 1. Anonymous

Description

Mining Factors in Review Comment Generation

The appendix of complete results are shown in this repository as previewed.

1. Brief Introduction

Link: Zenodo Repository

Paper Title: Enhancing Code Review Automation by Mining Factors in Review Comment Generation

This study conducts a detailed comparison of the following aspects:

  • Paradigms:

    • Pre-training and Fine-tuning (CodeT5)

    • Zero-shot Prompt Learning (GPT-4)

  • Input Settings:

    • Exploration of 29 diverse input settings, formulated by combining representations of code changes, review tags, and more.

  • Structural Information:

    • Introduction of a change-aware GAT component that consolidates information from both the old and the new ASTs.

Additionally, the study provides:

  • A comprehensive evaluation methodology incorporating METEOR and BERTScore.

  • A versatile and traceable dataset named CodeReviewCommentNet (CRCN).

  • Insightful observations regarding existing gaps and suggesting potential paths forward.

2. Artifact Structure

The artifact is organized into the following main components:

In this Repository (Codes and Results):

  • CRCN (codes): Scripts for generating datasets.

  • pretraining_and_finetuning (codes): Scripts and results related to the "pretraining and finetuning" paradigm, including experiments with the graph component and baselines.

  • zero_short_prompt_learning (codes): Scripts related to the "zero-shot prompt learning" paradigm.

  • evaluation: Codes and results of various evaluations.

Dataset Repository:

  • CRCN (dataset_short): The processed short dataset.

  • CRCN (dataset_long): The processed long dataset.

  • CRCN (raw_data): The raw crawled dataset.

Model Repositories:

Note 1: Our "zero-shot prompt learning" implementation is based on the APIs of GPT-4, hence it does not necessitate concrete models.

Note 2: Please refer to the comments in the specific files to determine the name and order of each individual experiment.

Files

Appendix.pdf

Files (1.5 GB)

Name Size Download all
md5:823ab879dceea8bacd7b2a51426b01fa
217.7 kB Preview Download
md5:f475b6efefec111ffc50ed4b6d2e7c6b
371.4 MB Preview Download
md5:7a9aaa7a409b27f19b6277555523f138
522.3 MB Preview Download
md5:a639b35402909a66afdb499b5f239c30
585.9 MB Preview Download
md5:e0a2705c1d9016d79c3342393c5af6c0
4.6 kB Preview Download