Published May 26, 2021
| Version 0.0.3
Dataset
Open
Characterizing Distributed Machine Learning Workloads on Apache Spark
Authors/Creators
Contributors
Data collector (2):
Data manager:
Description
This dataset was used for our submission at Middleware'22 titled: "Characterizing Distributed ML Workloads"
It will contains the description and the raw data, its format, as well as a detailed description of the cluster deployments used by these experiments.
The full paper is available here:
https://dl.acm.org/doi/10.1145/3590140.3629112
Files
MLlib.zip
Additional details
Related works
- Is described by
- Publication: 10.1145/3590140.362911 (DOI)
Software
- Development Status
- Inactive