Selection of Task Implementations in the Nanos++ Runtime
Creators
- 1. Barcelona Supercomputing Center, Barcelona, Spain; Universitat Politècnica de Catalunya, Barcelona, Spain
Contributors
- 1. Barcelona Supercomputing Center, Barcelona, Spain; Universitat Politècnica de Catalunya, Barcelona, Spain; Artificial Intelligence Research Institute (IIIA), Spanish National Research Council (CSIC), Madrid, Spain
- 2. Barcelona Supercomputing Center, Barcelona, Spain; Universitat Politècnica de Catalunya, Barcelona, Spain
Description
New heterogeneous systems and hardware accelerators can give higher levels of computational power to high-performance computers. However, this does not come for free: the more heterogeneous the system is, the more complex the programming task becomes in terms of resource utilization.
OmpSs is a task-based programming model and framework focused on the automatic parallelization of sequential applications. We present a set of extensions to this framework: we show how the application programmer can expose different specialized versions of a task (i.e. pieces of code targeted at and optimized for a particular architecture) and how the framework chooses between these versions at runtime to obtain the best performance achievable for the given application. Our results, obtained on a multi-GPU system, show that this approach makes the application's source code more flexible and can potentially increase the application's performance.
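The idea of choosing between task versions at runtime can be illustrated with a small sketch. The code below is not the Nanos++ implementation and does not use the OmpSs API; the names `VersionSelector`, `simulate`, `warmup_runs`, and the version labels `"smp"` and `"cuda"` are hypothetical, and the selection policy (run each version a few times, then prefer the lowest mean execution time) is an assumption made only for illustration.

```python
from collections import defaultdict


class VersionSelector:
    """Toy selector (hypothetical): try every version of a task a few
    times, then always pick the one with the lowest mean recorded time."""

    def __init__(self, versions, warmup_runs=2):
        self.versions = list(versions)
        self.warmup_runs = warmup_runs
        self.total_time = defaultdict(float)  # accumulated time per version
        self.runs = defaultdict(int)          # number of recorded runs per version

    def choose(self):
        # Learning phase: any version with too few samples is run first.
        for version in self.versions:
            if self.runs[version] < self.warmup_runs:
                return version
        # Steady phase: pick the version with the lowest mean time so far.
        return min(self.versions,
                   key=lambda v: self.total_time[v] / self.runs[v])

    def record(self, version, elapsed):
        # Store the measured execution time of one finished task instance.
        self.total_time[version] += elapsed
        self.runs[version] += 1


def simulate(costs, n_tasks, warmup_runs=2):
    """Run n_tasks task instances with fixed, made-up costs per version
    and return the sequence of versions that were selected."""
    selector = VersionSelector(costs, warmup_runs)
    picks = []
    for _ in range(n_tasks):
        version = selector.choose()
        selector.record(version, costs[version])
        picks.append(version)
    return picks


if __name__ == "__main__":
    # Made-up costs: the "cuda" version is assumed to be 4x faster.
    print(simulate({"smp": 4.0, "cuda": 1.0}, n_tasks=6))
```

With these made-up costs, the selector samples each version twice and then settles on the faster one. A real runtime would also have to account for resource availability and data transfers, which this sketch ignores.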
Files
Selection of Task Implementations in the Nanox Runtime.pdf
Size: 644.6 kB
md5:43fa10ea84d341f5a9ab2698e9c88acd