Replication package for the paper "Multi-language Software Development in the LLM Era: Insights from Practitioners' Conversations with ChatGPT"

Aguiar, Lucas; Paixao, Matheus; Carmo, Rafael; Freitas, Matheus; Gama, Eliakim; Leal, Antonio; Soares, Edson

doi:10.5281/zenodo.13710992

Published September 2024 | Version v1

Dataset Open

Replication package for the paper "Multi-language Software Development in the LLM Era: Insights from Practitioners' Conversations with ChatGPT"

1. Universidade Estadual do Ceará
2. Universidade Federal do Ceará

This is the replication package for the paper “Multi-language Software Development in the LLM Era: Insights from Practitioners’ Conversations with ChatGPT”.

This research was done using the 2023-10-12 release of DevGPT; for an overall explanation of the dataset, please consult the original work at: DevGPT/snapshot_20231012 at main · NAIST-SE/DevGPT (github.com).

After downloading the snapshot, extract the archives into the same folder of the notebooks.

The "analysis_single_source.ipynb" script collects all metrics for every file individually. The variable "fileNumber" selects which JSON to use. For overall metrics, the "analysis_all_sources.ipynb" file was used.

The normalization process is inside both notebooks and can be found below the 'Normalization' tag.

The folder "open_card_sorting" contains the 3 participants' Opencard results and analysis.

The folder "results" contains all the CSV files produced by the scripts.

The folder "normalized" contains the individual datasets already preprocessed through the scripts.

Files

MLSD-in-the-LLM-Era.zip

Files (134.2 MB)

Name	Size	Download all
MLSD-in-the-LLM-Era.zip md5:a3510c2d509903e97cff83f043b34882	134.2 MB	Preview Download

116

Views

Downloads

Show more details

	All versions	This version
Views	116	116
Downloads	32	32
Data volume	4.4 GB	4.4 GB

More info on how stats are collected....

DOI

Resource type

Dataset

Publisher

Zenodo

Conference

18th ACM / IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM'24) , Barcelona, Spain, October 24--25, 2024

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: September 6, 2024
Modified: September 6, 2024

Replication package for the paper "Multi-language Software Development in the LLM Era: Insights from Practitioners' Conversations with ChatGPT"

Authors/Creators

Description

Files

MLSD-in-the-LLM-Era.zip

Files (134.2 MB)