Published September 6, 2022 | Version v1
Journal article Open

Replication Package for the Paper: Warnings: Violation Symptoms Indicating Architecture Erosion

Description

This is the replication package for the paper: "Warnings: Violation Symptoms Indicating Architecture Erosion".

Scripts.zip includes the Python scripts used to collect data and refine keywords.

  • get_comments.py is used to collect code review comments from the OpenStack and Qt communities.
  • keywords_search.py is used to search for review comments that contain at least one keyword.
  • keywords.txt contains the keywords related to violation symptoms that are matched against code review comments from four popular OSS projects (i.e., Nova, Neutron, Qt Base, and Qt Creator).
  • Similar_words.py is used to query similar keywords through a pre-trained word embedding model in the software engineering domain.
  • SO_vectors_200.bin is the pre-trained word embedding model based on Stack Overflow posts.
  • Preprocessing.py and Cooccurrence.py are used to check for potentially missing co-occurring and associated words.
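
The keyword search and co-occurrence check described above could look roughly like the following. This is a minimal sketch, not the code shipped in Scripts.zip: the function names (build_pattern, search_comments, cooccurring_words), the whole-word matching rule, and the sample keywords and comments are all assumptions made for illustration.

```python
import re
from collections import Counter


def build_pattern(keywords):
    """Compile one case-insensitive regex matching any keyword as a whole word.

    Returns None when no usable keyword is given.
    """
    cleaned = [k.strip() for k in keywords if k.strip()]
    if not cleaned:
        return None
    # Longer keywords first so multi-word phrases win over their prefixes.
    escaped = sorted((re.escape(k) for k in cleaned), key=len, reverse=True)
    return re.compile(r"\b(?:" + "|".join(escaped) + r")\b", re.IGNORECASE)


def search_comments(comments, keywords):
    """Return (comment, matched_keywords) for comments with at least one keyword."""
    pattern = build_pattern(keywords)
    if pattern is None:
        return []
    hits = []
    for comment in comments:
        matched = sorted({m.group(0).lower() for m in pattern.finditer(comment)})
        if matched:
            hits.append((comment, matched))
    return hits


def cooccurring_words(comments, keywords, top_n=5):
    """Count words that appear in the same comment as any keyword.

    Each word is counted at most once per comment; the keywords themselves
    are excluded so that only candidate new words remain.
    """
    pattern = build_pattern(keywords)
    if pattern is None:
        return []
    known = {k.strip().lower() for k in keywords}
    counts = Counter()
    for comment in comments:
        if pattern.search(comment):
            tokens = set(re.findall(r"[a-z]+", comment.lower()))
            counts.update(tokens - known)
    return counts.most_common(top_n)


if __name__ == "__main__":
    # Hypothetical keywords and comments, not taken from keywords.txt or the dataset.
    sample_keywords = ["violation", "inconsistent", "layering"]
    sample_comments = [
        "This is a layering violation.",
        "LGTM",
        "Inconsistent naming in this module",
    ]
    for comment, matched in search_comments(sample_comments, sample_keywords):
        print(matched, "<-", comment)
```

Whole-word matching means that inflected forms such as "violations" are not matched unless they are listed explicitly; whether the original scripts stem or lemmatize the comments first is not stated in this description.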

Violation symptoms.xlsx (i.e., the dataset of this study) contains the violation symptoms identified in code review comments from the four OSS projects (i.e., Nova, Neutron, Qt Base, and Qt Creator).

Files

Scripts.zip

Files (1.4 GB)

Checksum                              Size
md5:1610a34c373c29fbcd712e6bfa1dc86e  1.4 GB
md5:f2a74278e80b91b5ae2eaa2ab390d8b1  188.7 kB
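
A downloaded file can be checked against the MD5 checksums listed above. The following is a minimal sketch using Python's standard hashlib module; the helper names (md5_of_file, matches_checksum) are hypothetical, and pairing the first checksum with Scripts.zip is an assumption based on the 1.4 GB size, since the listing does not name the files.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file against a checksum given as 'md5:<hex>' or plain '<hex>'."""
    expected_hex = expected.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected_hex


if __name__ == "__main__":
    import sys

    # Usage (path and checksum pairing are assumptions):
    #   python verify.py Scripts.zip md5:1610a34c373c29fbcd712e6bfa1dc86e
    if len(sys.argv) == 3:
        print("OK" if matches_checksum(sys.argv[1], sys.argv[2]) else "MISMATCH")
```

Reading in chunks matters here because the larger file is about 1.4 GB and should not be loaded into memory at once.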