Published January 19, 2024 | Version v1
Conference paper Open

From Cluster Hypothesis to Semi-Supervised Clustering: Exploiting LLMs to Solve the Next Release Problem

Creators

Description

A: Datasets

            A1. Zoom

            A2. WebEx

            A3. Microsoft 365

            A4. Discord

 

A dataset file contains multiple NRP (next release problem) instances: each instance per sheet. Regarding the labels, “1” means the correct NRP solutions, “0” means the candidate requirements that are not part of the next release, and “-1” means the already implemented requirements.

B: Python Code

            B1. Cluster_hypothesis.ipynb

            B2. LLM.ipynb

            B3. PCKMeans.ipynb

 

C: Cluster vs LLM Graphs

            C1. Average F-measure results of clustering and LLM

 

           

Files

Folder A.zip

Files (287.0 kB)

Name Size Download all
md5:80ce00a0d63a275c0dc92561b0b78d72
6.6 kB Download
md5:2f8e659a3095982102f9e2f67c1c72c8
220.6 kB Preview Download
md5:b1a3879cea11f2baa40a1db625627bbc
53.2 kB Preview Download
md5:66e804aae4d549a61c8c04b19ec28f1b
6.7 kB Download