GROMACS on Hybrid CPU-GPU and CPU-MIC Clusters: Preliminary Porting Experiences, Results and Next Steps

doi:10.5281/zenodo.822572

Published February 3, 2014 | Version v1

Working paper Open

GROMACS on Hybrid CPU-GPU and CPU-MIC Clusters: Preliminary Porting Experiences, Results and Next Steps

Sadaf Alam¹

1. Swiss National Supercomputing Centre, Lugano, Switzerland

Other:

Ugo Varetto¹

1. Swiss National Supercomputing Centre, Lugano, Switzerland

This report introduces hybrid implementation of the Gromacs application, and provides instructions on building and executing on PRACE prototype platforms with Grahpical Processing Units (GPU) and Many Intergrated Cores (MIC) accelerator technologies. GROMACS currently employs message-passing MPI parallelism, multi-threading using OpenMP and contains kernels for non-bonded interactions that are accelerated using the CUDA programming language. As a result, the execution model is multi-faceted where end users can tune the application execution according to the underlying platforms. We present results that have been collected on the PRACE prototype systems as well as on other GPU and MIC accelerated platforms with similar configurations. We also report on the preliminary porting effort that involves a fully portable implementation of GROMACS using OpenCL programming language instead of CUDA, which is only available on NVIDIA GPU devices.

Files

WP120.pdf

Files (444.0 kB)

Name	Size	Download all
WP120.pdf md5:256c1cf2c72098a4e3f0ce6b90215ba7	444.0 kB	Preview Download

Additional details

PRACE-2IP – PRACE - Second Implementation Phase Project 283493: European Commission

	All versions	This version
Views	69	68
Downloads	48	48
Data volume	23.5 MB	23.5 MB

GROMACS on Hybrid CPU-GPU and CPU-MIC Clusters: Preliminary Porting Experiences, Results and Next Steps

Creators

Contributors

Other:

Description

Files

WP120.pdf

Files (444.0 kB)

Additional details

Funding