Published March 9, 2021 | Version 1.0
Conference paper Open

Combining Query Reduction and Expansion for Text-Retrieval-Based Bug Localization

  • 1. The University of Texas at Dallas
  • 2. College of William & Mary
  • 3. The University of Adelaide

Description

ABSTRACT

Automated text-retrieval-based bug localization (TRBL) techniques normally use the full text of a bug report to formulate a query and retrieve parts of the code that are buggy. Previous research has shown that reducing the size of the query increases the effectiveness of TRBL. On the other hand, researchers also found improvements when expanding the query (i.e., adding more terms).
In this paper, we bring these two views together to reformulate queries for TRBL. Specifically, we improve discourse-based query reduction strategies, by adopting a combinatorial approach and using task phrases from bug reports, and combine them with a state-of-the-art query expansion technique, resulting in 970 query reformulation strategies. We investigate the benefits of these strategies for localizing buggy code elements and define a new approach, called QREX , based on the most effective strategy.
We evaluated the reformulation strategies, including QREX , on 1,217 queries from different software systems to retrieve buggy code artifacts at three code granularities, using five state-of-the-art automated TRBL approaches. The results indicate that QREX increases TRBL effectiveness by 4% - 12.6%, compared to applying query reduction and expansion in isolation, and by 32.1%, compared to the no-reformulation baseline.

Files

blizzard_replication.csv

Files (69.8 MB)

Name Size Download all
md5:06dc1fb9c1dfc9641432b05148d69bab
428 Bytes Preview Download
md5:6bce67b847c266be2f583c03a199b533
300 Bytes Preview Download
md5:5f4c7bfa221a7919fc7269e0abe71ef2
4.0 kB Preview Download
md5:2af61918782c94cb21da191dcc5cc1dd
41.4 MB Preview Download
md5:977d12dea2ab1fa7f4754273fae6d901
15.6 MB Preview Download
md5:dff16573fe40d48bf29af772effe828f
121.1 kB Preview Download
md5:4d977877768f78dcaf57af653a253713
122.5 kB Preview Download
md5:ffed2cc605add6eff8801278fed9eccd
122.2 kB Preview Download
md5:cace48d07f0b5a40ff2b85920bb0cbc2
41.1 kB Preview Download
md5:75ed116db732de38cbf118a7ed468b63
41.3 kB Preview Download
md5:ec077ce3463c94ca5561770459715843
41.2 kB Preview Download
md5:9f6d097cb37349e944ead2f481846376
166.2 kB Preview Download
md5:93bc6e1dec7262511b1e51fa21d7ab8f
165.5 kB Preview Download
md5:ade9081fbeddd6230a42687cd3296ba2
164.8 kB Preview Download
md5:e84bdf642557e16655be2927249d71c7
127.9 kB Preview Download
md5:16dc826ab0e07444c883d4be8bfef584
128.2 kB Preview Download
md5:2b7bcdf7d48e38664b29551b9e33c202
127.6 kB Preview Download
md5:af28c9e091b90bef4f56bdf9e3c547f8
137.8 kB Preview Download
md5:dd2567d2bd217cf3ee101055a5014c74
137.6 kB Preview Download
md5:b4f85e8a74b0dce0bbfd9d6bf6637052
11.0 MB Preview Download