We optimize the implementation code of three representative GFVC works, i.e., FOMM, CFTE and FV2V, and further provide unified code for the encoder and decoder processes.
- Download `CFTE-checkpoint.pth.tar`, `FOMM-checkpoint.pth.tar`, and `FV2V-checkpoint.pth.tar` to `./GFVC/CFTE/checkpoint/`, `./GFVC/FOMM/checkpoint/`, and `./GFVC/FV2V/checkpoint/`, respectively. The unified checkpoint directory is available at this link.
- The overall testing dataset is available at this link.
Specifically, the hyperparameters used by the GFVC software tools are defined as follows:
- `--original_seq`: the path to the input test sequence
- `--encoding_frames`: the number of frames to be encoded
- `--seq_width`: the width of the encoding frames
- `--seq_height`: the height of the encoding frames
- `--quantization_factor`: the quantization factor for the type conversion (i.e., from float-type to int-type) of the feature parameter residual
- `--Iframe_QP`: the quantization parameter for encoding the base picture
- `--Iframe_format`: the coded color format for the base picture, e.g., YUV 420 or RGB 444
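
As a rough illustration of how these options fit together, the sketch below declares them with `argparse` and shows one plausible reading of the float-to-int conversion controlled by `--quantization_factor` (scale, then round). The default values, the `YUV420`/`RGB444` spellings, and the helper names `build_parser`, `quantize_residual` and `dequantize_residual` are assumptions made for this example; the actual scripts may parse and apply these options differently.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the documented options (defaults are placeholders)."""
    p = argparse.ArgumentParser(description="GFVC option sketch (illustrative only)")
    p.add_argument("--original_seq", type=str, required=True,
                   help="path to the input test sequence")
    p.add_argument("--encoding_frames", type=int, default=250,
                   help="number of frames to be encoded")
    p.add_argument("--seq_width", type=int, default=256,
                   help="width of the encoding frames")
    p.add_argument("--seq_height", type=int, default=256,
                   help="height of the encoding frames")
    p.add_argument("--quantization_factor", type=int, default=64,
                   help="scale used when converting float residuals to integers")
    p.add_argument("--Iframe_QP", type=int, default=42,
                   help="quantization parameter for the base picture")
    p.add_argument("--Iframe_format", type=str, default="YUV420",
                   choices=["YUV420", "RGB444"],  # assumed spellings
                   help="coded color format of the base picture")
    return p


def quantize_residual(residual, factor):
    """Assumed conversion: scale each float residual and round to the nearest integer."""
    return [int(round(r * factor)) for r in residual]


def dequantize_residual(symbols, factor):
    """Inverse of the assumed conversion: map integers back to floats."""
    return [s / factor for s in symbols]


args = build_parser().parse_args(
    ["--original_seq", "demo_256x256.rgb", "--quantization_factor", "64"]
)
symbols = quantize_residual([0.031, -0.012, 0.5], args.quantization_factor)
restored = dequantize_residual(symbols, args.quantization_factor)
print(symbols)   # integer symbols that would be passed to the entropy coder
print(restored)  # approximate residuals recovered on the decoder side
```

Under this reading, a larger factor preserves more precision in the feature residual at the cost of larger integer symbols to code.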
The platform is organized as follows:
- The pretrained analysis/synthesis models and code of the three representative GFVC algorithms are encapsulated in the `GFVC` folder.
- The corresponding interface functions for the encoder and decoder are defined in `CFTE_Encoder.py`, `CFTE_Decoder.py`, `FOMM_Encoder.py`, `FOMM_Decoder.py`, `FV2V_Encoder.py` and `FV2V_Decoder.py`.
- The `arthmetic` and `vtm` folders include the packaged tools for the context-adaptive arithmetic coder (used for feature parameter encoding) and the latest VVC software VTM 22.2 (used for base picture encoding).
- The shell file (i.e., `RUN.sh`) and the batch execution code (i.e., `RUN.py`) are provided to complete the encoding and decoding processes.
Usage is as follows:
- Modify the corresponding hyperparameters in `RUN.py`. The specific details are documented in `RUN.py` itself.
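
To give a feel for what a batch driver of this kind does, here is a minimal sketch that builds one encoder and one decoder command line per sequence and QP. It is not the content of `RUN.py`: the sequence names, QP list, and common settings are placeholders, and it assumes that the encoder and decoder scripts accept the options listed above on the command line.

```python
import itertools
import shlex

# Placeholder settings for illustration; these are not the values shipped in RUN.py.
SEQUENCES = ["001_256x256.rgb", "002_256x256.rgb"]
QP_LIST = [32, 42]
COMMON = {
    "encoding_frames": 250,
    "seq_width": 256,
    "seq_height": 256,
    "quantization_factor": 64,
    "Iframe_format": "YUV420",
}


def build_command(script, seq, qp):
    """Assemble the argument list for one run of an encoder or decoder script."""
    cmd = ["python", script, "--original_seq", seq, "--Iframe_QP", str(qp)]
    for name, value in COMMON.items():
        cmd += [f"--{name}", str(value)]
    return cmd


# One encode and one decode command for every (sequence, QP) pair.
commands = [
    build_command(script, seq, qp)
    for seq, qp in itertools.product(SEQUENCES, QP_LIST)
    for script in ("CFTE_Encoder.py", "CFTE_Decoder.py")
]

for cmd in commands:
    # Dry run: print only. Replace with subprocess.run(cmd, check=True) to execute.
    print(shlex.join(cmd))
```

Printing the commands first makes it easy to check the parameter sweep before launching long encoding jobs.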
In the `evaluate` folder, we further provide code to calculate the rate and distortion:
- `multiMetric.py`: a unified script covering DISTS/LPIPS/PSNR/SSIM
- `getbits.py`: calculates the coding bits of the base picture (VVC bitstream) and the feature parameters (feature bitstream)
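
The rate side of this evaluation can be sketched as follows: sum the sizes of the base picture bitstream and the feature bitstream, then convert to kbps. This is only an illustration and does not reproduce `getbits.py`; the file names, the frame rate of 25 fps, and the helper names are assumptions.

```python
import os
import tempfile


def total_bits(paths):
    """Sum the sizes of the given bitstream files, in bits."""
    return sum(os.path.getsize(p) * 8 for p in paths)


def bitrate_kbps(bits, num_frames, fps):
    """Average bitrate in kbit/s for num_frames frames played at fps frames per second."""
    duration_s = num_frames / fps
    return bits / duration_s / 1000.0


# Create two dummy bitstreams so the example runs on its own.
with tempfile.TemporaryDirectory() as tmp:
    base = os.path.join(tmp, "base_picture.bin")  # stands in for the VVC bitstream
    feat = os.path.join(tmp, "features.bin")      # stands in for the feature bitstream
    with open(base, "wb") as f:
        f.write(bytes(1000))
    with open(feat, "wb") as f:
        f.write(bytes(250))
    bits = total_bits([base, feat])

rate = bitrate_kbps(bits, num_frames=250, fps=25)
print(bits, rate)
```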
VTM 22.2 is used, and the test is conducted under the low-delay B (LDB) configuration. We provide batch execution code to encode these sequences and obtain the corresponding rate-distortion performance. The code is in the `VVC_anchor` folder; you can execute `RUN_Encode.py` and `RUN_Decode.py`.
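
For orientation, the sketch below assembles a typical VTM encoder and decoder invocation for one sequence under the low-delay B configuration. It is not taken from `RUN_Encode.py` or `RUN_Decode.py`: the binary locations, the configuration file path, the frame rate, and the output naming are all assumptions, so check them against your own VTM 22.2 build before use.

```python
import os


def vtm_encode_command(yuv_path, width, height, frames, qp, fps=25,
                       encoder="./EncoderApp",
                       cfg="encoder_lowdelay_vtm.cfg"):
    """Build an argument list for the VTM encoder (paths and naming are assumed)."""
    stem = os.path.splitext(yuv_path)[0] + f"_qp{qp}"
    return [
        encoder,
        "-c", cfg,            # low-delay B configuration file
        "-i", yuv_path,       # input YUV sequence
        "-wdt", str(width),   # source width
        "-hgt", str(height),  # source height
        "-fr", str(fps),      # frame rate
        "-f", str(frames),    # number of frames to encode
        "-q", str(qp),        # quantization parameter
        "-b", stem + ".bin",  # output bitstream
        "-o", stem + "_rec.yuv",  # encoder-side reconstruction
    ]


def vtm_decode_command(bitstream, decoder="./DecoderApp"):
    """Build an argument list for the VTM decoder."""
    out = os.path.splitext(bitstream)[0] + "_dec.yuv"
    return [decoder, "-b", bitstream, "-o", out]


enc = vtm_encode_command("seq_256x256.yuv", 256, 256, 250, 37)
dec = vtm_decode_command(enc[enc.index("-b") + 1])
print(" ".join(enc))
print(" ".join(dec))
```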
We sincerely thank all the authors for making their code publicly available.
- The GFVC software package includes FOMM, CFTE and FV2V.
- The testing dataset is sourced and preprocessed from VoxCeleb and CFVQA.
- The quality assessment metrics include DISTS and LPIPS.
@inproceedings{Chen_JVET_AG0042,
title={AHG 16: Proposed Common Software Tools and Testing Conditions for Generative Face Video Compression},
author={Bolin, Chen and Jie, Chen and Ru-Ling, Liao and Yan, Ye and Shiqi, Wang},
journal={The Joint Video Experts Team of ITU-T SG 16 WP 3 and ISO/IEC JTC 1/SC 29, doc. no. JVET-AG0042},
year={2024}
}
If you have any questions or would like to collaborate (for research or commercial purposes), please email bolinchen3-c@my.cityu.edu.hk or chenbolin.chenboli@alibaba-inc.com.