Published 2026 | Version v1

Chrominance Aware Progressive Rate-Distortion Optimization for Learned Image Compressio

Description

We present an end-to-end learned image compression system trained under a unified rate-distortion objective, comprising a convolutional encoder, a U-Net-based entropy model, and a convolutional decoder operating in the YCbCr colourspace. The entropy model predicts per-symbol Gaussian distribution parameters over the quantized latent representation, providing a differentiable bitrate surrogate during training, while inferencetime compression is performed using zlib on the serialized latent. We introduce a progressive rate-distortion training strategy in which multiple model variants are obtained through iterative finetuning across successive stages, each adapting the rate-distortion balance without independent retraining. Additionally, we propose a chrominance-aware distortion curriculum that progressively reallocates channel weighting from luminance-dominant to balanced YCbCr emphasis, enabling stable structural reconstruction prior to enhanced colour fidelity. The proposed system achieves compression ratios of approximately 32×, 17×, and 3.2× relative to uncompressed PNG, with PSNR values of up to 32.78 dB on BSD100, while maintaining total encode-to-decode latency under 35 ms. Evaluation across BSD100, Set14, and Urban100 demonstrates consistent performance across diverse image content.

Files

3-CRD3576.pdf

Files (4.8 MB)

Name Size Download all
md5:18ec39a0bb3da056253e23af7eacca42
4.8 MB Preview Download