Published February 2, 2026 | Version v1
Dataset Open

SteelBench v1.0: A Benchmark Dataset for Steel Mechanical Property Prediction with Grade-Shift Evaluation

  • 1. ROR icon National University of Science and Technology
  • 2. ROR icon Lomonosov Moscow State University
  • 3. Interdata

Description

SteelBench is an open benchmark for steel mechanical property prediction that links heat-level chemistry, heat-treatment parameters, and tensile properties with grade identity and data-origin labels. The dataset contains 1,636 samples across 594 steel grades and 17 steel families, aggregated from public and semi-public sources: MMPDS, NIMS, EMK, Kaggle, and laboratory measurements.

Two variants are included:

  • steelbench_core.csv — strict publication variant (original reported values only)
  • steelbench_full.csv — filled training variant (missing heat-treatment parameters imputed using documented assumptions)

Key features:

  • 11 input features: C, Mn, Si, Cr, Ni, Mo, V, Cu, Al, austenitize_T, temper_T
  • Targets: tensile_strength, yield_strength, elongation
  • Grade-shift evaluation protocols: RandomKFold, GKF-grade, LOFO (Leave-One-Family-Out), LOSO (Leave-One Source-Out)
  • Data-origin (provenance) labels for each filled field

Associated paper: "SteelBench: A Physics-Aware Benchmark for Steel Mechanical Property Prediction" (under review at KDD 2026).

Reference code: https://github.com/cornada/steelbench 

Files

croissant.json

Files (206.7 kB)

Name Size Download all
md5:9af1198cdc7ddab21d139f0a41b23264
3.6 kB Preview Download
md5:ab298c45d9557d2a6546c709aa197336
4.4 kB Preview Download
md5:9e9aa90818565e748d1393ee17908853
1.4 kB Download
md5:ca0e58deadd2900ab7a3f96a25d5b0d2
197.2 kB Preview Download

Additional details

Software

Repository URL
https://github.com/cornada/steelbench
Programming language
Python