Published September 28, 2021 | Version v1
Report Open

Towards Integrated Hardware/Software Ecosystems for the Edge-Cloud-HPC Continuum

  • 1. INRIA
  • 2. scapos AG
  • 3. Fraunhofer ITWM

Description

Modern use cases such as autonomous vehicles, digital twins, smart buildings and precision agriculture, greatly increase the complexity of application workflows. They typically combine physics-based simulations, analysis of large data volumes and machine learning and require a hybrid execution infrastructure: edge devices create streams of input data, which are processed by data analytics and machine learning applications in the Cloud, and simulations on large, specialised HPC systems provide insights into and prediction of future system state. From these results, additional steps create and communicate output data across the infrastructure levels, and for some use cases, control devices or cyber-physical systems in the real world are controlled (as in the case of smart factories). All of these steps pose different requirements for the best suited execution platforms, and they need to be connected in an efficient and secure way. This assembly is called the Computing Continuum (CC) (1). It raises challenges at multiple levels: at the application level, innovative algorithms are needed to bridge simulations, machine learning  and data-driven analytics; at the middleware level,  adequate tools must enable efficient deployment, scheduling and orchestration of the workflow components across the whole distributed infrastructure;  and, finally, a capable resource management system must allocate a suitable set of components of the infrastructure to run the application workflow, preferably in a dynamic and adaptive way, taking into account the specific capabilities of each component of the underlying heterogeneous infrastructure.

To address the challenges, we foresee an increasing need for integrated software ecosystems which combine current “island” solutions and bridge the gaps between them. These ecosystems must facilitate the full lifecycle of CC use cases, including initial modelling, programming, deployment, execution, optimisation, as well as monitoring and control. It will be important to ensure adequate reproducibility of workflow results and to find ways for creating and managing trust when sharing systems, software and data. All of these will in turn require novel or improved hardware capabilities. This white paper provides an initial discussion of the gaps. Our objective is to accelerate progress in both hardware and software infrastructures to build CC use cases, with the ultimate goals of accelerating scientific discovery, improving timeliness, quality and sustainability of engineering artefacts, and supporting decisions in complex and potentially urgent situations

Notes

The authors would like to thank Rafael Mayo-García from CIEMAT and Marion Carrier from CybeleTech for their help in describing relevant use cases for the computing continuum.

Files

ETP4HPC_WP_Computing-Continuum_FINAL.pdf

Files (2.4 MB)

Name Size Download all
md5:9b121a9e3bf78d4335c0742a01ad4c4c
2.4 MB Preview Download

Additional details

Funding

European Commission
ACROSS - HPC BIG DATA ARTIFICIAL INTELLIGENCE CROSS STACK PLATFORM TOWARDS EXASCALE 955648

References