File | Description
--- | ---
commit_categorizations.csv
| Categorizations for the commits in our dataset.
commits.csv
| Information for the commits in our dataset.
datasets.csv
| Dataset's name, study information and description.
issue_categorizations.csv
| Categorizations for the chosen issues from our dataset.
issues.csv
| Information for the issues in our dataset.
pipeline_stages.csv
| DL pipeline stages and their respective descriptions.
problem_categories.csv
| Problem category names with their respective descriptions.
problem_causes.csv
| Problem cause category names with their respective descriptions.
problem_fixes.csv
| Problem fixes category names with their respective descriptions.
problem_symptoms.csv
| Problem symptoms category names with their respective descriptions.
studied_subjects_commits.csv
| Project data for commits.
studied_subjects_issues.csv
| Project data for issues.
studies.csv
| Contains the different studies with their respective descriptions.
commit_categorizations.csv
Column | Description
--- | ---
tf.function related fix?
| TRUE
when a bug fix related to tf.function
was found and FALSE
otherwise. If FALSE
, subsequent column values will be blank.
stage
| DL pipeline stage where the problem fix was found.
issue_categorizations.csv
Column | Description
--- | ---
tf.function related problem?
| TRUE
when a bug related to tf.function
was found and FALSE
otherwise. If FALSE
, subsequent column values will be blank.
stage
| DL pipeline stage where the problem was found.
GH_id
| GitHub issue unique identifier
issues.csv
Column | Description
--- | ---
GH_id
| GitHub issue unique identifier
datasets.csv
Column | Description
--- | ---
study
| Study for which the dataset was used.
description
| Describes the parameters/source from where the information was obtained.
studies.csv
Column | Description
--- | ---
study
| Name of the study
description
| Describes the what was studied.