Skip to content

Supplementary material for the paper "PERCEPTUAL LEARNED IMAGE COMPRESSION VIA END-TO-END JND-BASED OPTIMIZATION", IEEE ICIP 2024

License

Notifications You must be signed in to change notification settings

sanaznami/JND-LC

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 

Repository files navigation

PERCEPTUAL LEARNED IMAGE COMPRESSION VIA END-TO-END JND-BASED OPTIMIZATION

Introduction

This is the implementation of PERCEPTUAL LEARNED IMAGE COMPRESSION VIA END-TO-END JND-BASED OPTIMIZATION paper in Pytorch.

Acknowledgement

The framework is based on CompressAI. We modify examples/train.py, compressai/datasets/image.py and compressai/losses/rate_distortion.py.

Abstract

Emerging Learned image Compression (LC) achieves significant improvements in coding efficiency by end-to-end training of neural networks for compression. An important benefit of this approach over traditional codecs is that any optimization criteria can be directly applied to the encoder-decoder networks during training. Perceptual optimization of LC to comply with the Human Visual System (HVS) is among such criteria, which has not been fully explored yet. This paper addresses this gap by proposing a novel framework to integrate Just Noticeable Distortion (JND) principles into LC. Leveraging existing JND datasets, three perceptual optimization methods are proposed to integrate JND into the LC training process: (1) Pixel-Wise JND Loss (PWL) prioritizes pixel-by-pixel fidelity in reproducing JND characteristics, (2) Image-Wise JND Loss (IWL) emphasizes on overall imperceptible degradation levels, and (3) Feature-Wise JND Loss (FWL) aligns the reconstructed image features with perceptually significant features. Experimental evaluations demonstrate the effectiveness of JND integration, highlighting improvements in rate-distortion performance and visual quality, compared to baseline methods. The proposed methods add no extra complexity after training.

The proposed framework

image

The proposed learned image compression framework and JND-based perceptual loss functions. (a) overall framework. (b) Pixel-wise Loss (PWL). (c) Image-wise Loss (IWL). (d) Feature-wise Loss (FWL).

Pre-trained Models

Our pre-trained models can be downloaded using this link.

Dataset Structure

The dataset used for training and testing should have the following structure. Original images are stored in the "Ref" subfolder, while JND-quality images are located in the "JND1" subfolder.

- rootdir/
     - train/
         - Ref/
              - img#1
              - ...             
         - JND1/
              - img#1
              - ...

     - test/
         - Ref/
              - img#1
              - ...             
         - JND1/
              - img#1
              - ...

Evaluation

If you want evaluate with our pretrained model, please download them using this link and put them in the directory "./PretrainedModels/".

python3 -m compressai.utils.eval_model checkpoint /path/to/images/folder/ -a cheng2020-anchor -p ./PretrainedModels/name/of/model --output_directory /path/to/output/folder --output-file name-of-output-file

Train

To train with the proposed methods on your own dataset, please download the modified codes from the ./examples/ directory and put them in the ./examples/ directory of CompressAI repository.

To train based on MSE-based loss function, use the following command:

python3 examples/[trainPWL.py or trainIWL.py or trainFWL.py] -d /path/to/my/image/dataset/ -e epoch_num -m cheng2020-anchor --quality [1,...,6] --lambda lambda -lr lr --batch-size batch_size --cuda --save --chkpntname name

To train based on MSSSIM-based loss function, use the following command:

python3 examples/[trainPWLMS.py or trainIWLMS.py or trainFWLMS.py] -d /path/to/my/image/dataset/ -e epoch_num -m cheng2020-anchor --quality [1,...,6] --lambda lambda -lr lr --batch-size batch_size --cuda --save --chkpntname name

Citation

If our work is useful for your research, please cite our paper:

@inproceedings{pakdaman2024,
	title={PERCEPTUAL LEARNED IMAGE COMPRESSION VIA END-TO-END JND-BASED OPTIMIZATION},
author={Pakdaman, Farhad and Nami, Sanaz and Gabbouj, Moncef},
booktitle={Proceedings of the IEEE International Conference on Image Processing (ICIP)},
year={2024}
}

Contact

If you have any question, leave a message here or contact Sanaz Nami (snami@ut.ac.ir, sanaz.nami@tuni.fi).