A Synthetic, Clinically Inspired Neonatal Dataset for Multi-Class Disease Prediction and Trust-Aware Machine Learning
Authors/Creators
- 1. Research Scholar, Indus University, Gujarat, India
- 2. Research Guide, Head of Department (Computer Engineering), Indus University, Gujarat, India
Description
This dataset provides a synthetic, clinically inspired collection of neonatal health records designed to support methodological research in machine learning. It focuses on multi-class neonatal disease prediction across nine clinically motivated outcome categories and is intended for studies in trust-aware modeling, uncertainty estimation, confidence-aware decision logic, and calibration analysis.
The dataset is entirely synthetic and does not contain any real patient data. It is designed as a research-enabling resource and is not clinically validated. It must not be used for diagnosis, treatment planning, or real-world medical decision-making.
All features are provided in a single consolidated dataset file. An accompanying metadata file documents clinically inspired feature groupings corresponding to conceptual stages of maternal, birth-related, and postnatal information. These stage definitions are provided for research guidance only and do not impose any constraints on how the dataset must be used.
The dataset has been developed as part of doctoral research focused on advancing trustworthy and transparent machine learning frameworks for neonatal health applications.
Files
Files
(6.2 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:c8fbce3ec746c7f4a5bd66f0a18ba12c
|
6.2 MB | Download |