Published March 1, 2025 | Version 1.0.1
Dataset Open

Developer Troubleshooting Experience Study - grounded theory coded interview data

  • 1. ROR icon University of Victoria

Description

This repository contains a dataset from a research study on developer troubleshooting experiences conducted by researchers at University of Victoria, used to construct a Theory of Troubleshooting as the developer's cognitive experience of overcoming confusion.  As a central research question, we asked “What is the developer thinking, feeling, and striving for during the experience of troubleshooting?" We define troubleshooting as the cognitive problem-solving process of identifying, understanding, and constructing a mental model of the cause of an unexpected system behavior, and consider troubleshooting (cognitive process) to be an integral part of the activity of debugging.

The study included 27 semi-structured interviews asking software developers to reflect on their experiences of troubleshooting, talking through both specific experiences and general impressions, both individually and collaboratively.    We used a Constructivist Grounded Theory (CGT) approach to the analysis, reviewing the interview transcripts line by line, interpreting what is happening in the developer's experience, creating initial grounded codes that are low-level and interpretive, then sorting and grouping to raise the abstraction level with higher-level focus codes and connecting to theoretical categories.  After a broader analysis of the data, we narrowed our focus to eight theoretical categories centered around the developer's experience of overcoming confusion:

  • Confusion Experience
  • Trouble in the Creation Process
  • Trying to Gain Clarity
  • Poking and Seeing
  • Elucidating the Problem
  • Frustration vs Confidence
  • Experiential Intuition
  • Figuring It Out

The dataset includes:

  • Eight theoretical category reports (prefixed "category_report_") that include 681 initial grounded codes and corresponding participant numbers across all 27 interviews, that we used to construct our theoretical models.  For example, the category_report_confusion_experience.csv includes 117 examples of experiences related to confusion.
  • 16 emerging question reports (prefixed "rq_") that includes a broad set of 1032 initial grounded codes sorted by emerging question with corresponding participant numbers across the first 12 interviews before we reached theoretical saturation and narrowed our theoretical focus.
  • A summary of emerging questions and Miro board links by emerging question which includes the 1032 initial grounded codes sorted and grouped into higher level focus codes (Miro_boards_per_emerging_question.pdf)
  • The developer interview protocol that generated the dataset (dev_interview_protocol.pdf)
  • The developer follow up interview protocol that we used to validate and refine the models and test for resonance, showing an early version of the model diagrams prior to refinement (dev_interview_protocol_followup.pdf)
  • Demographics data by participant (demographics.csv), with gender summarized for anonymity (the participants include 8 women, 1 non-binary, and 18 men)

 

Files

demographics.csv

Files (2.0 MB)

Name Size Download all
md5:6747cf945492fe0229d44482d2c36126
16.8 kB Preview Download
md5:f3ee378f12964f2b5bc5afe8baa670d5
6.8 kB Preview Download
md5:ec3b41dd01f78d02c6076d81b51d42b8
15.3 kB Preview Download
md5:0ad0aa69869bb640a9e80967ca78b45b
4.0 kB Preview Download
md5:42e54046b8558776bb62d3af8f1b89ba
13.2 kB Preview Download
md5:427dc5a0c4d2a6b77c4902f422d5feb2
17.2 kB Preview Download
md5:a21a7133cb4c16ac44dae13f0920e4d9
18.7 kB Preview Download
md5:bfddad9d0f91bce6174fd672ccbf5155
2.6 kB Preview Download
md5:a271c3756175bde82241ca6c10de17db
1.6 kB Preview Download
md5:ec3963b1ff99f72bb220887b1ab014a0
127.9 kB Preview Download
md5:86a2e9c88b0592720c99f214de267924
1.4 MB Preview Download
md5:504fcf42659956aabe9fd6ecd34bc2dc
94.3 kB Preview Download
md5:d2d803548eeda4e2d307b80fc183a074
4.2 kB Preview Download
md5:b1024660d27ed40e06ef28163c82f599
4.5 kB Preview Download
md5:c3d55f83c6f35a182f481dad2337fe86
5.5 kB Preview Download
md5:7ebe0568f6b1d8ff081e552463db6dba
3.8 kB Preview Download
md5:e79822b7bc6698b07a612969cd2b9c0c
5.3 kB Preview Download
md5:85773ef804e93ca465119c2bd7c94f48
13.7 kB Preview Download
md5:1abc19ed0a35597fb443c7a81937da5a
2.4 kB Preview Download
md5:4513862cbe4f73353eab4a7da1360bc3
10.7 kB Preview Download
md5:2bcbb2b3361f127de99b6a0493cc385a
18.4 kB Preview Download
md5:17d55822812022881074d169fd96c213
12.0 kB Preview Download
md5:e30d03481b13fa0131664de6644ed909
22.8 kB Preview Download
md5:5faeac317a12e12eb7e9ce5ca67aa33d
22.9 kB Preview Download
md5:e38853a2fe422fd86a95b2b969244d8a
27.4 kB Preview Download
md5:913439dee8bc420b494e07b370d1c80a
10.9 kB Preview Download
md5:8caa0d6d6830aa31fbf30bbe9a69fb2b
34.7 kB Preview Download
md5:5f49d504cf0d4c5d706e0b9ad6384177
45.0 kB Preview Download

Additional details

Dates

Collected
2023
Interview data collected
Available
2025-03-01
Dataset first made available
Updated
2025-08-28
Dataset augmented and made available