Published March 13, 2026 | Version v1.0.0
Software
Open
Deep4ge: Mutation-Induced Training Dynamics Dataset for DNN Fault Detection and Diagnosis
Authors/Creators
Description
Deep4ge v1.0.0
A dataset of 14,227 mutation-induced per-epoch training logs generated from 60 real-world DNN programs, engineered for fault detection and diagnosis research in deep neural networks.
Dataset contents
- 9,845 faulty training logs across 7 fault categories
- 4,382 correct baseline logs
- 60 StackOverflow seed programs (FNN / CNN / RNN)
- 31 dynamic features per epoch (gradients, activations, weights, hardware)
- Manifest index, data dictionary, and validation scripts
Licenses
- Dataset: CC BY 4.0
- Mutation framework: MIT
Citation
If you use Deep4ge, please cite this release and refer to the CITATION.cff file in the repository.
Submitted to ICST 2026 Testing Tools and Data Showcase Track.
Notes
Files
| Name | Size | Checksum |
|---|---|---|
| SigmaJahan/deep4ge-dataset-v1.0.0.zip | 68.8 MB | md5:4015afaaccf2713ef24094617229a20c |
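The MD5 checksum listed for the archive can be used to confirm that a download is intact. The following is a minimal sketch rather than part of the release: the local filename `deep4ge-dataset-v1.0.0.zip` and the helper names `md5_of_file` and `verify_archive` are assumptions made for illustration.

```python
import hashlib
from pathlib import Path

# Checksum published on this record for the v1.0.0 archive.
EXPECTED_MD5 = "4015afaaccf2713ef24094617229a20c"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so the whole archive never sits in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed local filename; adjust to wherever the archive was saved.
    archive = Path("deep4ge-dataset-v1.0.0.zip")
    if archive.exists():
        print("checksum OK" if verify_archive(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

A mismatch usually indicates a truncated or corrupted download, in which case fetching the archive again is the simplest remedy.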
Additional details
Related works
- Is supplement to
- Software: https://github.com/SigmaJahan/deep4ge-dataset/tree/v1.0.0 (URL)
Software
- Repository URL
- https://github.com/SigmaJahan/deep4ge-dataset