ISSTA 2023 Artifact for "ROME: Testing Image Captioning Systems via Recursive Object Melting"
- 1. The Chinese University of Hong Kong, Shenzhen
- 2. Microsoft Research
Description
Artifact of ROME
This artifact is the package for ROME along with the video tutorials for reproducing the experiments, which includes the following content:
Content Description
- Video tutorials
- In Tutorial_1, we demonstrate how to perform object selection, image mutation, and fine-tuning.
- In Tutorial_2, we demonstrate how to do error detection with ROME.
- In Tutorial_3, we demonstrate how to analyze our results of the user study, how to find labeling errors in the MS COCO Caption dataset, how to generate test cases, and how to detect captioning errors with MetaIC.
- In Tutorial_4, we demonstrate how to use the images that triggered bugs in the IC systems under test for testing the real-world IC systems, i.e., Microsoft Powerpoint and Facebook Automatic Alternative Text.
- object_selection
This repository contains the codes used for the first step in ROME, which is object selection.
For more details, please refer to object_selection/README.md.
- image_mutation
This repository contains the codes used to melt objects in images and generate new images.
For more details, please refer to image_mutation/README.md.
- inference_scripts
This folder provides scripts for caption generation using different captioning systems.
For more details, please refer to inference_scripts/README.md.
- error_detection
After obtaining captions for synthetic images, this folder provides the codes and materials for error detection based on two metamorphic relations.
For more details, please refer to error_detection/README.md.
- finetuning
This folder contains the re-labeled annotations for 1,000 synthesized images as well as scripts for finetuning and checkpoint file.
For more details, please refer to finetuning/README.md.
- naturalness
This folder contains codes and results of the user study on image naturalness.
For more details, please refer to naturalness/README.md.
- gt_err
This folder contains codes, suspicious issues, and results for finding labeling errors in the MS COCO Caption dataset.
For more details, please refer to gt_err/README.md.
- metaic_insertion
This folder contains codes for synthesizing test cases with MetaIC.
For more details, please refer to metaic_insertion/README.md.
- metaic_error_detection
This folder contains codes for detecting captioning errors with MetaIC.
For more details, please refer to metaic_error_detection/README.md.
- ms&face_Exp
This folder contains image test cases for testing real-world IC systems ms&face_Exp/README.md.
Files
rome_artifact.zip
Files
(4.3 GB)
Name | Size | Download all |
---|---|---|
md5:52d45fa1dfee5aa03c7ffdf7184c4efe
|
4.3 GB | Preview Download |