ndzip-gpu: efficient lossless compression of scientific floating-point data on GPUs

Fabian Knorr; Peter Thoman; Thomas Fahringer

doi:10.5281/zenodo.7437646

Published November 13, 2021 | Version v1

Preprint Open

ndzip-gpu: efficient lossless compression of scientific floating-point data on GPUs

1. University of Innsbruck, Austria

Lossless data compression is a promising software approach for reducing the bandwidth requirements of scientific applications on accelerator clusters without introducing approximation errors. Suitable compressors must be able to effectively compact floating-point data while saturating the system interconnect to avoid introducing unnecessary latencies.

We present ndzip-gpu, a novel, highly-efficient GPU parallelization scheme for the block compressor ndzip, which has recently set a new milestone in CPU floating-point compression speeds.

Through the combination of intra-block parallelism and efficient memory access patterns, ndzip-gpu achieves high resource utilization in decorrelating multi-dimensional data via the Integer Lorenzo Transform. We further introduce a novel, efficient warp-cooperative primitive for vertical bit packing, providing a high-throughput data reduction and expansion step.

Using a representative set of scientific data, we compare the performance of ndzip-gpu against five other, existing GPU compressors. While observing that effectiveness of any compressor strongly depends on characteristics of the dataset, we demonstrate that ndzip-gpu offers the best average compression ratio for the examined data. On Nvidia Turing, Volta and Ampere hardware, it achieves the highest single-precision throughput by a significant margin while maintaining a favorable trade-off between data reduction and throughput in the double-precision case.

Files

2021-ndzip-gpu-efficient-lossless-compression-of-scientific-floating-point-data-on-gpus.pdf

Files (605.8 kB)

Name	Size	Download all
2021-ndzip-gpu-efficient-lossless-compression-of-scientific-floating-point-data-on-gpus.pdf md5:a03f52be022dfe5f120745fac2eb6e01	605.8 kB	Preview Download

Additional details

European Commission
LIGATE - LIgand Generator and portable drug discovery platform AT Exascale 956137

	All versions	This version
Views	181	181
Downloads	128	128
Data volume	77.5 MB	77.5 MB

ndzip-gpu: efficient lossless compression of scientific floating-point data on GPUs

Authors/Creators

Description

Files

2021-ndzip-gpu-efficient-lossless-compression-of-scientific-floating-point-data-on-gpus.pdf

Files (605.8 kB)

Additional details

Funding