Published November 18, 2025 | Version v1
Conference proceeding Open

Streamlining ML Training in Kubernetes: An MLOps Architecture with Kubeflow

  • 1. ROR icon Harokopio University of Athens
  • 2. ROR icon National Technical University of Athens
  • 3. ORAMAVR S.A.
  • 4. ROR icon Foundation for Research and Technology Hellas

Description

Machine Learning Operations (MLOps) is essential for automating the deployment, monitoring, and management of ML models. By integrating MLOps with DevOps practices, developers can create automated training pipelines. This paper explores using Kubeflow as an MLOps platform and GitHub Actions as a CI/CD pipeline for training and deploying ML models. Kubeflow provides a scalable framework for orchestrating ML workflows in containers, with Kubernetes enabling efficient resource management. Containerization ensures consistency, portability, and reproducibility across environments, while GitHub Actions automates testing, version control, and deployment. A real-world case study demonstrates this architecture and discusses challenges and best practices for modern MLOps workflows.

Files

Streamlining_ML_Training_in_Kubernetes__An_MLOps_Architecture_with_Kubeflow (pre-print).pdf

Additional details

Funding

European Commission
PANDORA - A Comprehensive Framework enabling the Delivery of Trustworthy Datasets for Efficient AIoT Operation 101135775
European Commission
SOPRANO - Socially-Acceptable and Trustworthy Human-Robot Teaming for Agile Industries 101120990