Published February 3, 2014 | Version v1
Working paper Open

GROMACS on Hybrid CPU-GPU and CPU-MIC Clusters: Preliminary Porting Experiences, Results and Next Steps

Creators

  • 1. Swiss National Supercomputing Centre, Lugano, Switzerland

Contributors

  • 1. Swiss National Supercomputing Centre, Lugano, Switzerland

Description

This report introduces hybrid implementation of the Gromacs application, and provides instructions on building and executing on PRACE prototype platforms with Grahpical Processing Units (GPU) and Many Intergrated Cores (MIC) accelerator technologies. GROMACS currently employs message-passing MPI parallelism, multi-threading using OpenMP and contains kernels for non-bonded interactions that are accelerated using the CUDA programming language. As a result, the execution model is multi-faceted where end users can tune the application execution according to the underlying platforms. We present results that have been collected on the PRACE prototype systems as well as on other GPU and MIC accelerated platforms with similar configurations. We also report on the preliminary porting effort that involves a fully portable implementation of GROMACS using OpenCL programming language instead of CUDA, which is only available on NVIDIA GPU devices.

Files

WP120.pdf

Files (444.0 kB)

Name Size Download all
md5:256c1cf2c72098a4e3f0ce6b90215ba7
444.0 kB Preview Download

Additional details

Funding

PRACE-2IP – PRACE - Second Implementation Phase Project 283493
European Commission