Published July 30, 2013 | Version v1
Working paper Open

Performance Analysis of BLAS Libraries in SuperLU_DIST for SuperLU_MCDT (Multi Core Distributed) Development

Creators

  • 1. Istanbul Technical University, Department of Mathematics, Istanbul 34469, Turkey; Istanbul Technical University, National Center for High Performance Computing of Turkey (UHeM), Istanbul 34469, Turkey
  • 1. Istanbul Technical University, National Center for High Performance Computing of Turkey (UHeM), Istanbul 34469, Turkey

Description

SuperLU_DIST is a distributed-memory parallel solver for sparse linear systems. The solver makes several calls to BLAS library routines in its numerical factorization phase. Because the required BLAS operations are typically computationally intensive, the performance of the BLAS library can significantly affect the overall performance of the solver. We therefore examine how the overall performance of SuperLU_DIST can be improved by employing optimized BLAS libraries, specifically the Intel Math Kernel Library (MKL) and the Parallel Linear Algebra Software for Multicore Architectures (PLASMA) library. Using MKL provides a performance improvement of approximately 50%, and using PLASMA improves performance by around 10% for the best tile size. Based on these findings, we have improved the SuperLU_MCDT solver.
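
To make the role of the BLAS calls and of the tile size more concrete, the following is a minimal sketch in plain Python. It assumes that the dominant dense kernel is a GEMM-style update C <- C - A·B, which is typical of LU factorization but is an assumption here, not something the description states. The function names and the `tile` parameter are hypothetical; this is not the SuperLU_DIST, MKL, or PLASMA API, only an illustration of how the same update can be traversed tile by tile.

```python
def gemm_update_naive(C, A, B):
    """In-place C <- C - A*B on lists of lists (reference triple loop)."""
    n, k, m = len(A), len(B), len(B[0])
    for i in range(n):
        for j in range(m):
            s = 0.0
            for p in range(k):
                s += A[i][p] * B[p][j]
            C[i][j] -= s
    return C


def gemm_update_tiled(C, A, B, tile=2):
    """Same update, visited tile by tile.

    `tile` is a hypothetical block edge length; optimized libraries tune
    a comparable parameter so that each block fits in cache.
    """
    n, k, m = len(A), len(B), len(B[0])
    for i0 in range(0, n, tile):
        for j0 in range(0, m, tile):
            for p0 in range(0, k, tile):
                # Update one tile of C with one tile of A times one tile of B.
                for i in range(i0, min(i0 + tile, n)):
                    for j in range(j0, min(j0 + tile, m)):
                        s = 0.0
                        for p in range(p0, min(p0 + tile, k)):
                            s += A[i][p] * B[p][j]
                        C[i][j] -= s
    return C


if __name__ == "__main__":
    A = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]
    B = [[1.0, 0.0], [0.0, 1.0], [2.0, 2.0]]
    print(gemm_update_tiled([[10.0, 10.0], [10.0, 10.0]], A, B, tile=2))
```

Both functions compute the same result; only the traversal order differs. In an optimized library the tile size is chosen to match the cache hierarchy, which is one plausible reason why the measured improvement depends on it.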

Files

WP83.pdf

Files (234.4 kB)

md5:0a38a62586b47aa8ab45157c7f3184bf

Additional details

Funding

PRACE-2IP – PRACE - Second Implementation Phase Project 283493
European Commission