Published January 1, 2024 | Version v1
Journal article Open

Managing Distributed Machine Learning Lifecycle for Healthcare Data in the Cloud

  • 1. Centre Tecnològic de Telecomunicacions de Catalunya, Castelldefels, Barcelona, 08860, Spain
  • 2. Bogaziçi University, Department of Computer Engineering, Istanbul, 34342, Turkey; Massachusetts Institute of Technology, Department of Brain and Cognitive Sciences, Cambridge, 02139, MA, United States
  • 3. University College Dublin, School of Computer Science, Dublin, D04 V1W8, Ireland

Description

The main objective of this paper is to highlight the research directions and explain the main roles of current Artificial Intelligence (AI)/Machine Learning (ML) frameworks and available cloud infrastructures in building end-to-end ML lifecycle management for healthcare systems and sensitive biomedical data. We identify and explore the versatility of many genuine techniques from distributed computing and current state-of-the-art ML research, such as building cognition-inspired learning pipelines and federated learning (FL) ecosystem. Additionally, we outline the advantages and highlight the main obstacles of our methodology utilizing contemporary distributed secure ML techniques, such as FL, and tools designed for managing data throughout its lifecycle. For a robust system design, we present key architectural decisions essential for optimal healthcare data management, focusing on security, privacy and interoperability. Finally, we discuss ongoing efforts and future research directions to overcome existing challenges and improve the effectiveness of AI/ML applications in the healthcare domain. © 2013 IEEE.

Notes

This work has been partially supported by the Spanish Ministry of Economy and Competitiveness (MINECO)\u2014Program UNICO I+D under Grant TSI-063000-2021-54, Grant TSI-063000-2021-55, \u201CERDF A way of making Europe\u201D project funded by MCIN/AEI/ 10.13039/501100011033 under grant PID2021-126431OB-I00 and Generalitat de Catalunya grant 2021 SGR 00770 .

Files

Managing Distributed Machine Learning Lifecycle for Healthcare Data in the Cloud.pdf

Additional details

Funding

Ministerio de Asuntos Económicos y Transformación Digital
6GDAWN-RESILIENT-Decentralized AI and Architectures for Massive Wireless Network Slicing Scalability and Sustainability in 6G-RESILIENT TSI-063000-2021-55
Ministerio de Asuntos Económicos y Transformación Digital
6GDAWN-ELASTIC-Decentralized AI and Architectures for Massive Wireless Network Slicing Scalability and Sustainability in 6G-ELASTIC TSI-063000-2021-54
Ministerio de Asuntos Económicos y Transformación Digital
ANEMONE-Scalable and decentralized management of open 6G networks PID2021-126431OB-I00