Published January 11, 2018 | Version 1.0.1
Software Open

ICPE 2018 Artifact - Measuring Network Latency Variation Impacts to High Performance Computing Application Performance

  • 1. Clemson University

Description

Artifact submission for the published ICPE 2018 paper. Abstract:

In this paper, we study the impacts of latency variation versus latency mean on application runtime, library performance, and packet delivery. Our contributions include the design and implementation of a network latency injector that is suitable for most QLogic and Mellanox InfiniBand cards. We fit statistical distributions of latency mean and variation to varying levels of network contention for a range of parallel application workloads. We use the statistical distributions to characterize the latency variation impacts to application degradation. The level of application degradation caused by variation in network latency depends on application characteristics, and can be significant. Observed degradation varies from no degradation for applications without communicating processes to 3.5 times slower for communication-intensive parallel applications. We support our results with statistical analysis of our experimental observations. For communication-intensive high performance computing applications, we show statistically significant evidence that changes in performance are more highly correlated with changes of variation in network latency than with changes of mean network latency alone. 

Notes

1.0.0 - Initial release.
1.0.1 - Adds a missing ssh_config template required for running the Ansible script.

Files

ICPE_2018_Artifact.zip (26.3 MB)
md5:0308ddd2c8764581f85fd3cfb8384225
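
The MD5 value listed for ICPE_2018_Artifact.zip can be used to check that a download is intact. The following is a minimal sketch using only the Python standard library. The function names (md5_of_file, matches_expected) are illustrative and not part of the artifact, and the script assumes the archive was saved under its listed name in the current directory unless a path is given as the first argument.

```python
import hashlib
import os
import sys

# Values copied from the file listing of this record.
ARCHIVE_NAME = "ICPE_2018_Artifact.zip"
EXPECTED_MD5 = "0308ddd2c8764581f85fd3cfb8384225"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of the file at `path`, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so the whole archive is never held in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 equals `expected` (case-insensitive)."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Hypothetical usage: python verify_md5.py [path/to/archive]
    path = sys.argv[1] if len(sys.argv) > 1 else ARCHIVE_NAME
    if not os.path.isfile(path):
        sys.exit(f"{path}: file not found")
    ok = matches_expected(path)
    print("checksum OK" if ok else "checksum MISMATCH")
    sys.exit(0 if ok else 1)
```

A mismatch usually indicates an incomplete or corrupted download, in which case fetching the archive again is the simplest remedy. MD5 detects accidental corruption only and should not be treated as protection against deliberate tampering.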