Published November 9, 2023 | Version v9
Dataset Open

DevGPT: Studying Developer-ChatGPT Conversations

  • 1. Nara Institute of Science and Technology
  • 2. The University of Melbourne
  • 3. Shinshu University

Description

DevGPT is a curated dataset which encompasses 17,913 prompts and ChatGPT's responses including 11,751 code snippets, coupled with the corresponding software development artifacts—ranging from source code, commits, issues, pull requests, to discussions and Hacker News threads—to enable the analysis of the context and implications of these developer interactions with ChatGPT.

Important
Version 9 (2023-11-09) resolves the empty list of conversations attribute: https://github.com/NAIST-SE/DevGPT/issues/8

Files

DevGPT.zip

Files (681.6 MB)

Name Size Download all
md5:8aef14da58427ddc990d05450d98e4a2
681.6 MB Preview Download