Published July 24, 2025 | Version v10
Dataset Open

DevGPT: Studying Developer-ChatGPT Conversations

  • 1. Nara Institute of Science and Technology
  • 2. The University of Melbourne
  • 3. Shinshu University

Description

DevGPT is a curated dataset which encompasses 17,913 prompts and ChatGPT's responses including 11,751 code snippets, coupled with the corresponding software development artifacts—ranging from source code, commits, issues, pull requests, to discussions and Hacker News threads—to enable the analysis of the context and implications of these developer interactions with ChatGPT.

Important
Version 9 (2023-11-09) resolves the empty list of conversations attribute: https://github.com/NAIST-SE/DevGPT/issues/8

Files

DevGPT.zip

Files (926.8 MB)

Name Size Download all
md5:01e730f3c4f77f785f293d177268a798
926.8 MB Preview Download