Open-source machine learning powered image codec.
View on GitHubYou don't need to manually compile any part of the code. You will need a PyTorch (>=1.3) environment to run the code.
Follow the installation instructions on pytorch.org to install the library.
Encode an image with filename INPUT_IMAGE to bit-stream named CODED_BIN with the following command.
You need to specify the model index in the command line to run different models.
$ python inference.py –-encode –i INPUT_IMAGE -o CODED_BIN -m MODEL_INDEX
Decode from a binary bit-stream CODED_BIN to reconstruct an image named REC_IMAGE with the following command.
$ python inference.py –-decode -i CODEC_BIN -o REC_IMAGE
High Bitrate | Low Bitrate | ||||||||
---|---|---|---|---|---|---|---|---|---|
Methods | Classes | RGB PSNR | RGB MS-SSIM | YUV PSNR | Y MS-SSIM | RGB PSNR | RGB MS-SSIM | YUV PSNR | Y MS-SSIM |
VVC | ClassA_6K | -15.1% | -16.0% | -15.9% | -13.1% | -23.6% | -23.1% | -25.1% | -19.0% |
ClassB_4K | -15.3% | -17.2% | -15.9% | -13.6% | -23.7% | -24.7% | -25.3% | -18.8% | |
ClassC_2K | -22.4% | -24.1% | -22.6% | -18.9% | -28.8% | -28.2% | -29.6% | -22.8% | |
ClassD_Kodak | -19.0% | -18.3% | -20.4% | -14.6% | -23.5% | -20.7% | -27.0% | -16.0% | |
overall | -17.9% | -18.9% | -18.7% | -15.1% | -24.9% | -24.2% | -26.8% | -19.1% | |
NIC-0.1 (MSE) | ClassA_6K | -18.7% | -26.0% | -16.0% | -16.9% | -19.6% | -26.7% | -17.2% | -18.7% |
ClassB_4K | -17.3% | -26.4% | -14.5% | -17.7% | -18.9% | -28.8% | -18.7% | -19.2% | |
ClassC_2K | -19.2% | -31.2% | -12.5% | -21.0% | -20.5% | -32.2% | -16.3% | -23.2% | |
ClassD_Kodak | -4.2% | -21.5% | -2.6% | -14.0% | -7.3% | -27.7% | -9.8% | -21.8% | |
overall | -14.9% | -26.3% | -11.4% | -17.4% | -16.6% | -28.9% | -15.5% | -20.7% | |
NIC-0.1 (MS-SSIM) |
ClassA_6K | 32.4% | -53.7% | 33.1% | -46.9% | 27.9% | -50.1% | 33.5% | -45.6% |
ClassB_4K | 37.1% | -54.2% | 37.3% | -47.6% | 37.2% | -51.4% | 40.8% | -45.3% | |
ClassC_2K | 38.2% | -58.3% | 52.7% | -48.7% | 25.5% | -55.2% | 31.7% | -46.8% | |
ClassD_Kodak | 80.6% | -52.4% | 68.3% | -44.4% | 69.0% | -50.1% | 52.0% | -45.9% | |
overall | 47.1% | -54.7% | 47.8% | -46.9% | 39.9% | -51.7% | 39.5% | -45.9% |
The contributors of this project are listed as follows.
CHEN. Tong, Nanjing University, tong@smail.nju.edu.cn
JIA. Chuanmin, Peking University, cmjia@pku.edu.cn
LIN. Jianping, University of Science and Technology of China, ljp105@mail.ustc.edu.cn
LIU. Feiyang, Wuhan University, liufeiyang@whu.edu.cn
LIU. Haojie, Nanjing University, haojie@smail.nju.edu.cn
MA. Haichuan, University of Science and Technology of China, hcma@mail.ustc.edu.cn
WANG. Yefei, University of Science and Technology of China, wyfei@mail.ustc.edu.cn
Website maintainer
HU. Yueyu, Peking University, huyy@pku.edu.cn
[1] Neural Image Compression via Non-Local Attention Optimization and Improved Context Modeling, Tong Chen, Haojie Liu, Zhan Ma, Qiu Shen, Xun Cao, Yao Wang, arXiv preprint arXiv:1910.06244
[2] An End-to-End Joint Learning Scheme of Image Compression and Quality Enhancement with Improved Entropy Minimization, Jooyoung Lee, Seunghyun Cho, Munchurl Kim, arXiv preprint arXiv:1912.12817