Chrominance Aware Progressive Rate-Distortion Optimization for Learned Image Compressio

Mr. Abhay S. Dubey; Mrs. Pradnya K. Bachhav; Mr. Sandeep G. Shukla; Mr. Akshay R. Jain; Mr. Pramod C. Patil

doi:10.5281/zenodo.20571759

Published 2026 | Version v1

Journal article Open

Chrominance Aware Progressive Rate-Distortion Optimization for Learned Image Compressio

We present an end-to-end learned image compression system trained under a unified rate-distortion objective, comprising a convolutional encoder, a U-Net-based entropy model, and a convolutional decoder operating in the YCbCr colourspace. The entropy model predicts per-symbol Gaussian distribution parameters over the quantized latent representation, providing a differentiable bitrate surrogate during training, while inferencetime compression is performed using zlib on the serialized latent. We introduce a progressive rate-distortion training strategy in which multiple model variants are obtained through iterative finetuning across successive stages, each adapting the rate-distortion balance without independent retraining. Additionally, we propose a chrominance-aware distortion curriculum that progressively reallocates channel weighting from luminance-dominant to balanced YCbCr emphasis, enabling stable structural reconstruction prior to enhanced colour fidelity. The proposed system achieves compression ratios of approximately 32×, 17×, and 3.2× relative to uncompressed PNG, with PSNR values of up to 32.78 dB on BSD100, while maintaining total encode-to-decode latency under 35 ms. Evaluation across BSD100, Set14, and Urban100 demonstrates consistent performance across diverse image content.

Files

3-CRD3576.pdf

Files (4.8 MB)

Name	Size	Download all
3-CRD3576.pdf md5:18ec39a0bb3da056253e23af7eacca42	4.8 MB	Preview Download

	All versions	This version
Views	15	15
Downloads	10	10
Data volume	53.0 MB	53.0 MB

Chrominance Aware Progressive Rate-Distortion Optimization for Learned Image Compressio

Authors/Creators

Description

Files

3-CRD3576.pdf

Files (4.8 MB)