Mining Factors in Review Comment Generation

Published April 12, 2024 | Version v2

Conference paper Open

The appendix of complete results are shown in this repository as previewed.

1. Brief Introduction

Link: Zenodo Repository

Paper Title: Enhancing Code Review Automation by Mining Factors in Review Comment Generation

This study conducts a detailed comparison of the following aspects:

Paradigms:
- Pre-training and Fine-tuning (CodeT5)
- Zero-shot Prompt Learning (GPT-4)
Input Settings:
- Exploration of 29 diverse input settings, formulated by combining representations of code changes, review tags, and more.
Structural Information:
- Introduction of a change-aware GAT component that consolidates information from both the old and the new ASTs.

Additionally, the study provides:

A comprehensive evaluation methodology incorporating METEOR and BERTScore.
A versatile and traceable dataset named CodeReviewCommentNet (CRCN).
Insightful observations regarding existing gaps and suggesting potential paths forward.

2. Artifact Structure

The artifact is organized into the following main components:

In this Repository (Codes and Results):

CRCN (codes): Scripts for generating datasets.
pretraining_and_finetuning (codes): Scripts and results related to the "pretraining and finetuning" paradigm, including experiments with the graph component and baselines.
zero_short_prompt_learning (codes): Scripts related to the "zero-shot prompt learning" paradigm.
evaluation: Codes and results of various evaluations.

Model Repositories:

Note 1: Our "zero-shot prompt learning" implementation is based on the APIs of GPT-4, hence it does not necessitate concrete models.

Note 2: Please refer to the comments in the specific files to determine the name and order of each individual experiment.

Files

Name	Size	Download all
Appendix.pdf md5:823ab879dceea8bacd7b2a51426b01fa	217.7 kB	Preview Download
CRCN(codes).zip md5:f475b6efefec111ffc50ed4b6d2e7c6b	371.4 MB	Preview Download
evaluation.zip md5:7a9aaa7a409b27f19b6277555523f138	522.3 MB	Preview Download
pretraining_and_finetuning(codes).zip md5:a639b35402909a66afdb499b5f239c30	585.9 MB	Preview Download
zero_short_prompt_learning(codes).zip md5:e0a2705c1d9016d79c3342393c5af6c0	4.6 kB	Preview Download