Info: Zenodo’s user support line is staffed on regular business days between Dec 23 and Jan 5. Response times may be slightly longer than normal.

Published November 11, 2018 | Version v1
Conference paper Open

Next-Generation Cluster Management Architecture and Software

  • 1. Los Alamos National Laboratory

Description

Over the last six decades, Los Alamos National Laboratory (LANL) has acquired, accepted, and integrated over 100 new HPC systems, from MANIAC in 1952 to Trinity in 2017. These systems range from small clusters to large supercomputers. The high performance computing (HPC) system architecture has progressively changed over this time as well; from single system images to complex, interdependent service infrastructures within a large HPC system. The authors are proposing a redesign of the current HPC system architecture to help reduce downtime and provide a more resilient architectural design.

Files

ws_hpcsysp109.pdf

Files (128.1 kB)

Name Size Download all
md5:4a7df970f12bd5c4bf92f20e075ed79c
128.1 kB Preview Download