Published September 6, 2022
| Version v1
Journal article
Open
Replication Package for the Paper: Warnings: Violation Symptoms Indicating Architecture Erosion
Creators
Description
This is the replication package for the paper: "Warnings: Violation Symptoms Indicating Architecture Erosion".
Scripts.zip includes the Python scripts used to collect data and refine keywords.
- get_comments.py is used to collect code review comments from OpenStack and Qt community.
- keywords_search.py is used to search the review comments that contain at least one keyword.
- keywords.txt contains keywords related to violation symptoms to code review comments from four popular OSS projects (i.e., Nova, Neutron, Qt Base, and Qt Creator).
- Similar_words.py is used to query similar keywords through a pre-trained word embedding model in the software engineering domain.
- SO_vectors_200.bin is the pre-trained word embedding model base on Stack Overflow posts.
- Preprocessing.py and Cooccurrence.py are used to check the missing potential co-occurred and associated words.
Violation symptoms.xlsx (i.e., the dataset of this study) contains the identified violation symptoms from code review comments of the four OSS projects (i.e., Nova, Neutron, Qt Base, and Qt Creator) in this study.
Files
Scripts.zip
Files
(1.4 GB)
Name | Size | Download all |
---|---|---|
md5:1610a34c373c29fbcd712e6bfa1dc86e
|
1.4 GB | Preview Download |
md5:f2a74278e80b91b5ae2eaa2ab390d8b1
|
188.7 kB | Download |