Software Open Access

# The GPU version of LICOM3 under HIP framework and its large-scale application (updated)

Hailong Liu; Pengfei Wang; Jinrong Jiang; Pengfei Lin

### Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
<dc:creator>Hailong Liu</dc:creator>
<dc:creator>Pengfei Wang</dc:creator>
<dc:creator>Jinrong Jiang</dc:creator>
<dc:creator>Pengfei Lin</dc:creator>
<dc:date>2020-12-03</dc:date>
<dc:description>A high-resolution (1/20°) global ocean general circulation model with graphics processing unit (GPU) code implementations is developed based on the LASG/IAP Climate system Ocean Model version 3 (LICOM3) under the Heterogeneous-compute Interface for Portability (HIP) framework. The dynamic core and physics package of LICOM3 are both ported to the GPU, and three-dimensional parallelization is applied. The HIP version of LICOM3 (LICOM3-HIP) is 42 times faster than the same number of CPU cores when 384 AMD GPUs and 384 CPU cores are used. LICOM3-HIP has excellent scalability; it still obtains a speedup of more than four on 9216 GPUs compared with 384 GPUs. In this phase, we successfully performed a test of the 1/20° LICOM3-HIP using 6550 nodes and 26200 GPUs, and at this scale the model’s speed still increases, reaching about 2.72 simulated years per day (SYPD). The high performance results from placing almost all of the computation inside the GPUs, which greatly reduces the time cost of data transfer between CPUs and GPUs. At the same time, a 14-year spin-up integration following the surface forcing protocol of phase 2 of the Ocean Model Intercomparison Project (OMIP-2) has been conducted, and the preliminary results have been evaluated. We found that the model results differ little from those of the CPU version. Further comparison with observations and lower-resolution LICOM3 results suggests that the 1/20° LICOM3-HIP can not only reproduce the observations but also produce much smaller-scale activity, such as submesoscale eddies and frontal-scale structures.</dc:description>
<dc:identifier>https://zenodo.org/record/4302813</dc:identifier>
<dc:identifier>10.5281/zenodo.4302813</dc:identifier>
<dc:identifier>oai:zenodo.org:4302813</dc:identifier>
<dc:language>eng</dc:language>
<dc:relation>doi:10.5281/zenodo.4302812</dc:relation>
<dc:rights>info:eu-repo/semantics/openAccess</dc:rights>
<dc:subject>LASG/IAP Climate system Ocean Model version 3 (LICOM3)</dc:subject>
<dc:subject>Heterogeneous-compute Interface for Portability (HIP)</dc:subject>
<dc:subject>Graphics processing units (GPUs)</dc:subject>
<dc:title>The GPU version of LICOM3 under HIP framework and its large-scale application (updated)</dc:title>
<dc:type>info:eu-repo/semantics/other</dc:type>
<dc:type>software</dc:type>
</oai_dc:dc>
