Published March 16, 2021 | Version v1
Dataset Open

Using Grammar Patterns to Interpret Test Method Name Evolution

  • 1. Rochester Institute of Technology
  • 2. Tufts University
  • 3. Stony Brook University

Description

This is the dataset that accompanies the study "Using Grammar Patterns to Interpret Test Method Name Evolution." The study has been accepted for publication at the 29th IEEE/ACM International Conference on Program Comprehension.

Following is the abstract of the study:
It is good practice to name test methods such that they are comprehensible to developers; they must be written in such a way that their purpose and functionality are clear to those who will maintain them. Unfortunately, there is little automated support for writing or maintaining the names of test methods. This can lead to inconsistent and low-quality test names and increase the maintenance cost of supporting these methods. Due to this risk, it is essential to help developers in maintaining their test method names over time. In this paper, we use grammar patterns, and how they relate to test method behavior, to understand test naming practices. This data will be used to support an automated tool for maintaining test names.

Following are the contents of the dataset:

  • ICPC2021-Public.sqlite -- A SQLite database containing the raw dataset used in this project
  • ICPC2021-Public.xlsx -- An Excel spreadsheet containing the complete listings for the tables in the paper

---------------------------------------------------------------------------------------------------------------------------
Contents of ICPC2021-Public.sqlite

Table Name ---- Table Description
"gitCommit" ---- The commit log for all projects
"refactoring" ---- Mined refactoring operations from RefactoringMiner
"refactoring_renamedMethod" ---- Mined Rename Method refactoring operations 
"detected_testfiles" ---- Detected unit test files 
"detected_testfiles_refactored" ---- Refactored unit test files
"detected_testfiles_refactored_renamemethod" ---- Renamed Methods in refactored unit test files 
"annotation_grammar" ---- The data that was provided to the annotators
"annotation_grammar_results" ---- The finalized results of the annotation
"annotation_grammar_results_prefix2" ---- The first two part-of-speech tags of finalized annotation 
"annotation_grammar_results_prefix3" ---- The first three part-of-speech tags of finalized annotation
"annotation_grammar_results_prefix4" ---- The first four part-of-speech tags of finalized annotation
"annotation_grammar_results_prefix5" ---- The first five part-of-speech tags of finalized annotation
"annotation_grammar_results_semantic" ---- The semantic relationship between the old and new names of the annotation results
---------------------------------------------------------------------------------------------------------------------------
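The table listing above can be checked against a downloaded copy of the database. The following is a minimal sketch using Python's standard sqlite3 module. The helper names list_tables and count_rows are hypothetical, and the column layout of each table is not documented here, so the sketch only reads table names and row counts.

```python
import sqlite3
from contextlib import closing


def list_tables(db_path):
    """Return the sorted names of all tables in the SQLite file at db_path."""
    with closing(sqlite3.connect(db_path)) as conn:
        rows = conn.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        ).fetchall()
    return [name for (name,) in rows]


def count_rows(db_path, table):
    """Return the number of rows in one table.

    The table name is quoted as an identifier, so names containing
    unusual characters are still handled.
    """
    quoted = '"' + table.replace('"', '""') + '"'
    with closing(sqlite3.connect(db_path)) as conn:
        (count,) = conn.execute(f"SELECT COUNT(*) FROM {quoted}").fetchone()
    return count
```

Assuming the database was saved locally as ICPC2021-Public.sqlite, calling list_tables("ICPC2021-Public.sqlite") should return the table names listed above, and count_rows can then be called on any of them. Note that sqlite3.connect creates an empty file if the path does not exist, so an empty result usually means the path is wrong.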

Notes

This study is part of the work conducted by the Source Code Analysis And Natural Language Laboratory. For more information about what we do and to download the preprint of this study, visit: https://scanl.org/

Files

ICPC2021-PublicReadMe.txt

Files (1.8 GB)

Size ---- Checksum
1.8 GB ---- md5:2e9bac1ea789564a19aeaa7afedb86b0
195.3 kB ---- md5:e1541db8a4c82e864f467b5575524158
1.6 kB ---- md5:f1502ff102b1d2f5eeeb73edd15448b4
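
The md5 values listed above can be used to confirm that a download completed without corruption. The following is a minimal sketch using Python's standard hashlib module. The helper names md5_of_file and matches_checksum are hypothetical, and because the listing does not state which checksum belongs to which file name, the caller supplies the pairing.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks.

    Chunked reading keeps memory use flat even for multi-gigabyte files.
    """
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file's MD5 digest with an expected value.

    The expected value may carry the 'md5:' prefix used in the listing.
    """
    expected_hex = expected.strip().lower()
    if expected_hex.startswith("md5:"):
        expected_hex = expected_hex[len("md5:"):]
    return md5_of_file(path) == expected_hex
```

For example, matches_checksum(path, "md5:2e9bac1ea789564a19aeaa7afedb86b0") returns True only if the file at path (an assumed local download location) hashes to that value.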