"Implementation Notes for the Storage and Data Access Infrastructure", ExaNeSt project Deliverable D4.3
In this deliverable of WP4, D4.3 “Implementation Notes for the Storage and Data Access Infrastructure”, we describe the implementation of each software component and tool that was specified in deliverable D4.2:
- Extensions and enhancements to the common I/O path in Linux with focus on two key areas: (i) supporting protected access to storage devices from user space, i.e. direct access to storage with minimal kernel related overheads; and (ii) enhancements to the access path for memory-mapped file access by, amongst others, making use of NVM devices. [FORTH]
- Extensions to the BeeGFS parallel file system: Metadata replication mechanisms to handle consistency management and resilience to failures, as well as incorporating those mechanisms in a caching extension. [FHG]
- Acceleration mechanisms for Host-to-VM and VM-to-VM interactions that take into account properties of the hardware platform and the unified interconnect. Two main technologies for hardware-assisted virtualisation are considered: (i) use of RDMA capabilities for accessing remote memory from within a virtual machine, and (ii) HPC API remoting to improve the performance of HPC APIs (specifically MPI) for applications that execute as an ensemble of virtual machines. [VOSYS]
- The design of a new replication mechanism for analytical databases, and its implementation in MonetDB. This replication scheme has been designed in such a way that it will not only help improving the availability, reliability, performance and scalability of a database server in general, but also help exploiting the ExaNeSt storage infrastructure and the ExaNeSt platform in particular. [MDBS]
- Several monitoring and testing tools: the ExaNeSt storage administration and monitoring [INFN]; experiment automation, stress-load and fault injection tools [FORTH]; tests simulating HPC application I/O behaviours [INFN]; and a database profiler [MDBS].
In the storage infrastructure, resilience is mainly addressed by replication mechanisms in the file system. The system’s resilience will be stressed by the stress-load and fault injection tools, while its health is monitored by the monitoring tools. In addition, the system’s resilience level will be evaluated by the HPC and DBMS applications. To achieve the objectives of WP4, the storage and data access infrastructure proposed by ExaNeSt is built upon the following main components: (i) the distributed file system BeeGFS, (ii) Linux data access technologies, (iii) KVM based virtualisation, and (iv) storage system monitoring and administration tools. Moreover, we will use the state-of-the-art HPC applications and analytical databases to guide the design of the envisioned infrastructure and to evaluate and showcase the result.