Awesome 🎉Deep Learning Based Video Compression

Contents (This part is updated from June 2024)

Generative compression
VCM
Rate Control
INNs
Low Complexity
Motion
Feature Coding

Group by time （This section stops updating from June 2024）

2024
2023
2022
2021
2020
2019
2018
2017

Generative compression

Title	Pub. & Date
CodingHomo: Bootstrapping Deep Homography with Video Coding	TCSVT 2024
I2VC: A Unified Framework for Intra- & Inter-frame Video Compression	Arixv 2024
PredToken: Predicting Unknown Tokens and Beyond with Coarse-to-Fine Iterative Decoding	Arxiv 2024
SMC++: Masked Learning of Unsupervised Video Semantic Compression	Arxiv 2024

Video Coding for Machine

Title	Pub. & Date
On Annotation-free Optimization of Video Coding for Machines	arXiv 2024
Competitive Learning for Achieving Content-specific Filters in Video Coding for Machines	arXiv 2024

Rate Control

Title	Pub. & Date
Deep Video Codec Control for Vision Models	CVPR 2024

INNs

Title	Pub. & Date
QS-NeRV: Real-Time Quality-Scalable Decoding with Neural Representation for Videos	ACM MM 2024
Temporal Enhanced Hybrid Neural Representation for Video Compression	PCS 2024
Combining Frame and GOP Embeddings for Neural Video Representation	CVPR 2024

Low Complexity

Title	Pub. & Date
Accelerating Learned Video Compression via Low-Resolution Representation Learning	arXiv 2024
Standard compliant video coding using low complexity, switchable neural wrappers	arXiv 2024

Motion Related

Title	Pub. & Date
Spatial Neighbor Information Assisted Motion Compensated Temporal Filter for Video Coding	PCS 2024

Feature Coding

Title	Pub. & Date
Deep Video Compression with Conditional Feature Coding	PCS 2024

✔2024 «🎯Back To Top»

(CVPR 2024) Deep Video Codec Control for Vision Models Reich C, Debnath B, Patel D, et al. paper
(ToMM 2024) Learned Video Compression with Adaptive Temporal Prior and Decoded Motion-aided Quality Enhancement Yang, Jiayu and Yang, Chunhui and Xiong, Fei and Zhai, Yongqi and Wang, Ronggangpaper
(Trans Broadcasting 2024) Depth Video Inter Coding Based on Deep Frame GenerationlLi, Ge and Lei, Jianjun and Pan, Zhaoqing and Peng, Bo and Ling, Nampaper
(ICASSP 2024) Rate-Quality Based Rate Control Model for Neural Video CompressionLiao, Shuhong and Jia, Chuanmin and Fan, Hongfei and Yan, Jingwen and Ma, Siweipaper
(ICASSP 2024) Learned Video Compression with Spatial-Temporal Optimization Wang, Yiming and Huang, Qian and Tang, Bin and Liu, Wenting and Shan, Wenchao and Xu, Qianpaper
(ICASSP 2024) Region-Adaptive Video Sharpening Via Rate-Perception Optimization Pang, Yingxue and Zhao, Shijie and Guo, Mengxi and Li, Junlin and Zhang, Li paper
(ICASSP 2024) Leveraging Redundancy in Feature for Efficient Learned Image Compression Qin, Peng and Bao, Youneng and Meng, Fanyang and Tan, Wen and Li, Chao and Wang, Genhong and Liang, Yongsheng paper
(ICASSP 2024) A Tri-Dynamic Preprocessing Framework for UGC Video Compression Zhao, Fei and Guo, Mengxi and Zhao, Shijie and Li, Junlin and Zhang, Li and Xie, Xiaodong paper
(ICASSP 2024) Improving Learned Video Compression by Exploring Spatial Redundancy Yang, Jiayu and Yang, Chunhui and Zhai, Yongqi and Wang, Qi and Pan, Xinghao and Wang, Ronggang paper
(ICASSP 2024) Learned Video Compression with Spatial-Temporal Optimization Wang, Yiming and Huang, Qian and Tang, Bin and Liu, Wenting and Shan, Wenchao and Xu, Qian paper
(WCACV 2024) MobileNVC: Real-time 1080p Neural Video Compression on a Mobile Device van Rozendaal, Ties and others paper
(TPAMI 2024) VNVC: A Versatile Neural Video Coding Framework for Efficient Human-Machine Vision Sheng, Xihua and Li, Li and Liu, Dong and Li, Houqiang paper
(TPAMI 2024) A Coding Framework and Benchmark towards Low-Bitrate Video Understanding Tian, Yuan and Lu, Guo and Yan, Yichao and Zhai, Guangtao and Chen, Li and Gao, Zhiyong paper
(TIP 2024) Cross-Component Prediction Boosted With Local and Non-Local Information in Video Coding Zhang, Kai and Deng, Zhipin and Zhang, Li paper
(TCSVT 2024) Exploiting Bidirectional Quality Impulse for Reference Picture Resampled Gaming Video Coding Fang, Xiaohan and Chen, Peilin and Wang, Meng and Xie, Xi and Wang, Shiqi and Wang, Shanshe and Ma, Siwei paper
(TCSVT 2024) Spatial Decomposition and Temporal Fusion based Inter Prediction for Learned Video Compression Becking, Daniel and M{"u}ller, Karsten and Haase, Paul and Kirchhoffer, Heiner and Tech, Gerhard and Samek, Wojciech and Schwarz, Heiko and Marpe, Detlev and Wiegand, Thomas paper
(Arxiv 2024) Efficient Learned Wavelet Image and Video CodingMeyer, Anna and Prativadibhayankaram, Srivatsa and Kaup, Andrepaper
(Arxiv 2024) Group-aware Parameter-efficient Updating for Content-Adaptive Neural Video Compression Chen, Zhenghao and Zhou, Luping and Hu, Zhihao and Xu, Dongpaper
(Arxiv 2024) Parameter-Efficient Instance-Adaptive Neural Video Compression Yang, Hyunmo and Oh, Seungjun and Park, Eunbyungpaper
(Arxiv 2024) Task-Aware Encoder Control for Deep Video CompressionGe, Xingtong and Luo, Jixiang and Zhang, Xinjie and Xu, Tongda and Lu, Guo and He, Dailan and Geng, Jing and Wang, Yan and Zhang, Jun and Qin, Hongweipaper
(Arxiv 2024) Image and Video Compression using Generative Sparse Representation with Fidelity ControlsJiang, Wei and Wang, Weipaper
(Arxiv 2024) A Perspective on Deep Vision Performance with Standard Image and Video CodecsReich, Christoph and Hahn, Oliver and Cremers, Daniel and Roth, Stefan and Debnath, Biplobpaper
(Arxiv 2024) Group-aware Parameter-efficient Updating for Content-Adaptive Neural Video Compression Chen, Zhenghao and Zhou, Luping and Hu, Zhihao and Xu, Dongpaper
(Arxiv 2024) CGVC-T: Contextual Generative Video Compression with Transformers Du, Pengli and Liu, Ying and Ling, Nampaper
(Arxiv 2024) Low-Latency Neural Stereo Streaming Hou, Qiqi and Farhadzadeh, Farzad and Said, Amir and Sautiere, Guillaume and Le, Hoangpaper
(Arxiv 2024) Analysis of Neural Video Compression Networks for 360-Degree Video Coding Regensky, Andy and Brand, Fabian and Kaup, Andr{'e}paper
(Arxiv 2024) Extreme Video Compression with Pre-trained Diffusion Models Li, Bohan and Liu, Yiming and Niu, Xueyan and Bai, Bo and Deng, Lei and G{"u}nd{"u}z, Deniz paper
(Arxiv 2024) Boosting Neural Representations for Videos with a Conditional Decoder Zhang, Xinjie and Yang, Ren and He, Dailan and Ge, Xingtong and Xu, Tongda and Wang, Yan and Qin, Hongwei and Zhang, Jun paper
(Arxiv 2024) Optimal Quality and Efficiency in Adaptive Live Streaming with JND-Aware Low latency Encoding Menon, Vignesh V and Zhu, Jingwen and Rajendran, Prajit T and Afzal, Samira and Schoeffmann, Klaus and Callet, Patrick Le and Timmerer, Christian paper
(Arxiv 2024) VQ-NeRV: A Vector Quantized Neural Representation for Videos Xu, Yunjie and Feng, Xiang and Qin, Feiwei and Ge, Ruiquan and Peng, Yong and Wang, Changmiaopaper
(Arxiv 2024) Low-Rate, Low-Distortion Compression with Wasserstein Distortion Qiu, Yang and Wagner, Aaron B paper
(Arxiv 2024) LVC-LGMC: Joint Local and Global Motion Compensation for Learned Video Compression Jiang, Wei and Li, Junru and Zhang, Kai and Zhang, Lipaper
(Arxiv 2024) Immersive Video Compression using Implicit Neural Representations Kwan, Ho Man and Zhang, Fan and Gower, Andrew and Bull, Davidpaper
(Arxiv 2024) Cool-chic video: Learned video coding with 800 parameters Leguay, Thomas and Ladune, Th{'e}o and Philippe, Pierrick and D{'e}forges, Olivierpaper
(Arxiv 2024) A Neural-network Enhanced Video Coding Framework beyond ECM Zhao, Yanchen and He, Wenxuan and Jia, Chuanmin and Wang, Qizhe and Li, Junru and Li, Yue and Lin, Chaoyi and Zhang, Kai and Zhang, Li and Ma, Siwei paper
(Arxiv 2024) Motion-Adaptive Inference for Flexible Learned B-Frame Compression Akin Yilmaz, M and Ugur Ulas, O and Bilican, Ahmet and Murat Tekalp, A paper
(Arxiv 2024) Analysis of Neural Video Compression Networks for 360-Degree Video Coding Regensky, Andy and Brand, Fabian and Kaup, Andr{'e} paper
(VICP 2024) High-Fidelity Free-View Talking Head Synthesis for Low-Bandwidth Video Conference Zhang, Zhiyu and Tang, Anni and Zhu, Chen and Lu, Guo and Xie, Rong and Song, Li paper
(MMM 2024) Hierarchical Bi-directional Temporal Context Mining for Improved Video Compression Lin, Zijian and Luo, Jianping paper

✔2023 «🎯Back To Top»

(NeurIPS 2023) HiNeRV: Video Compression with Hierarchical Encoding based Neural Representation Kwan, Ho Man and Gao, Ge and Zhang, Fan and Gower, Andrew and Bull, David paper code
(TPAMI 2023) Compressed-SDR to HDR Video Reconstruction Wang, Hu and Ye, Mao and Zhu, Xiatian and Li, Shuai and Li, Xue and Zhu, Ce paper
(TIP 2023) Sur-driven video coding rate control for jointly optimizing perceptual quality and buffer control Yang, Zetao and Gao, Wei and Li, Ge and Yan, Yiqiang paper
(Trans BROADCASTING 2023) Virtual-Competitors-Based Rate Control for 360-Degree Video Coding Lin, Jielian and Lin, Hongbin and Xu, Yiwen and Kang, Yuanxun and Zhao, Tiesong paper
(Neurocomputing 2023) Multiple Hypotheses Based Motion Compensation for Learned Video Compression Lin, Rongqun and Wang, Meng and Zhang, Pingping and Wang, Shiqi and Kwong, Sam paper
(ACMMM 2023) High Visual-Fidelity Learned Video Compression Li, Meng and Shi, Yibo and Wang, Jing and Huang, Yunqi paper
(ACMMM 2023) DeepSVC: Deep Scalable Video Coding for Both Machine and Human Vision Li, Meng and Shi, Yibo and Wang, Jing and Huang, Yunqi paper
(ACMMM 2023) Neural Video Compression with Spatio-Temporal Cross-Covariance Transformers Chen, Zhenghao and Relic, Lucas and Azevedo, Roberto and Zhang, Yang and Gross, Markus and Xu, Dong and Zhou, Luping and Schroers, Christopher paper
(ACMMM 2023) Peering into The Sketch: Ultra-Low Bitrate Face Compression for Joint Human and Machine Perception Mao, Yudong and Chen, Peilin and Wang, Shurun and Wang, Shiqi and Wu, Dapeng paper
(TMM 2023) End-to-End Distortion Modeling for Error-Resilient Screen Content Video Coding Tang, Tong and Yin, Zhiyang and Li, Jie and Wang, Honggang and Wu, Dapeng and Wang, Ruyan paper
(TMM 2023) Learning to Predict Object-Wise Just Recognizable Distortion for Image and Video Compression Zhang, Yun and Lin, Haoqin and Sun, Jing and Zhu, Linwei and Kwong, Sam paper
(TMM 2023) Enhanced Context Mining and Filtering for Learned Video Compression Guo, Haifeng and Kwong, Sam and Ye, Dongjie and Wang, Shiqi paper
(TMM 2023) Content-adaptive Rate-Distortion Modeling for Frame-level Rate Control in Versatile Video Coding Liao, Junqi and Li, Li and Liu, Dong and Li, Houqiang paper
(TOMM 2023) Principal Component Approximation Network for Image Compression Zhang, Shupei and Zhao, Chenqiu and Basu, Anup paper
(ICCV 2023) Non-Semantics Suppressed Mask Learning for Unsupervised Video Semantic Compression Abdulmotaleb El{-}Saddik and Tao Mei and Rita Cucchiara and Marco Bertini and Diana Patricia Tobon Vallejo and Pradeep K. Atrey and M. Shamim Hossain paper
(ICIP 2023) FGC-VC: Flow-Guided Context Video Compression Wang, Yiming and Huang, Qian and Tang, Bin and Sun, Huashan and Guo, Xiaotong paper
(ICIP 2023) Block-Based Motion Estimation for Deep-Learned Video Coding S. Pientka, M. Schäfer, J. Pfaff, H. Schwarz, D. Marpe and T. Wiegand paper
(ICIP 2023) Learned Image Compression with Large Capacity and Low Redundancy of Latent Representation Meng, Xiandong and Zhu, Shuyuan and Ma, Siwei and Zeng, Bing paper
(ICIP 2023) Multi-scale deformable alignment and content-adaptive inference for flexible-rate bi-directional video compression Y{\i}lmaz, M Ak{\i}n and Ulas, O Ugur and Tekalp, A Murat paper
(ICIP 2023) Machine-Attention-based Video Coding for Machines Lee, Yegi and Kim, Shin and Yoon, Kyoungro and Lim, Hanshin and Kwak, Sangwoon and Choo, Hyon-Gon paper
(ICIP 2023) Predictive Coding for Animation-Based Video Compression Konuko, Goluck and Lathuili{`e}re, St{'e}phane and Valenzise, Giuseppe paper
(ICIP 2023) Blurry Video Compression: A Trade-Off Between Visual Enhancement and Data Compression Argaw, Dawit Mureja and Kim, Junsik and Kweon, In So paper
(TCSVT 2023) End-to-end learnable multi-scale feature compression for vcm Kim, Yeongwoong and Jeong, Hyewon and Yu, Janghyun and Kim, Younhee and Lee, Jooyoung and Jeong, Se Yoon and Kim, Hui Yong paper
(TCSVT 2023) Camera Pose-Based Background Modeling for Video Coding in Moving Cameras Fang, Zheng and Zheng, Mingkui and Chen, Pingping and Chen, Zhifeng and Wu, Dapeng Oliver paper
(TCSVT 2023) Sparse-to-Dense: High Efficiency Rate Control for End-to-end Scale-Adaptive Video Coding Chen, Jiancong and Wang, Meng and Zhang, Pingping and Wang, Shurun and Wang, Shiqi paper
(TCSVT 2023) MPAI-EEV: Standardization Efforts of Artificial Intelligence based End-to-End Video Coding Jia, Chuanmin and Ye, Feng and Dong, Fanke and Lin, Kai and Chiariglione, Leonardo and Ma, Siwei and Sun, Huifang and Gao, Wen paper
(TCSVT 2023) DBVC: An End-to-End 3-D Deep Biomedical Video Coding Framework Xue, Dongmei and Ma, Haichuan and Li, Li and Liu, Dong and Xiong, Zhiwei and Li, Houqiang paper
(CVPR 2023) Towards Scalable Neural Representation for Diverse Videos He, Bo and Yang, Xitong and Wang, Hanyu and Wu, Zuxuan and Chen, Hao and Huang, Shuaiyi and Ren, Yixuan and Lim, Ser-Nam and Shrivastava, Abhinav paper
(CVPR 2023) DNeRV: Modeling Inherent Dynamics via Difference Neural Representation for Videos Zhao, Qi and Asif, M Salman and Ma, Zhan paper
(CVPR 2023) HNeRV: A Hybrid Neural Representation for Videos Chen, Hao and Gwilliam, Matt and Lim, Ser-Nam and Shrivastava, Abhinav paper
(CVPR 2023) Motion Information Propagation for Neural Video Compression Qi, Linfeng and Li, Jiahao and Li, Bin and Li, Houqiang and Lu, Yan paper
(ICASSP 2023) LCCM-VC: LEARNED CONDITIONAL CODING MODES FOR VIDEO CODING Hadi Hadizadeh and Ivan V. Bajic paper
(Arxiv 2023) Implicit-explicit Integrated Representations for Multi-view Video Compression Zhu, Chen and Lu, Guo and He, Bing and Xie, Rong and Song, Lipaper
(Arxiv 2023) Accelerating Learnt Video Codecs with Gradient Decay and Layer-wise Distillation Peng, Tianhao and Gao, Ge and Sun, Heming and Zhang, Fan and Bull, Davidpaper
(Arxiv 2023) Hierarchical Autoencoder-based Lossy Compression for Large-scale High-resolution Scientific Data Le, Hieu and Santos, Hernan and Tao, Jianpaper
(Arxiv 2023) Offline and Online Optical Flow Enhancement for Deep Video Compression Tang, Chuanbo and Sheng, Xihua and Li, Zhuoyuan and Zhang, Haotian and Li, Li and Liu, Dongpaper
(Arxiv 2023) CANF-VC++: Enhancing Conditional Augmented Normalizing Flows for Video Compression with Advanced Techniques Chen, Peng-Yu and Peng, Wen-Hsiao paper
(Arxiv 2023) Implicit-explicit Integrated Representations for Multi-view Video Compression Zhu, Chen and Lu, Guo and He, Bing and Xie, Rong and Song, Li paper
(Arxiv 2023) C3: High-performance and low-complexity neural compression from a single image or video Kim, Hyunjik and Bauer, Matthias and Theis, Lucas and Schwarz, Jonathan Richard and Dupont, Emilien paper
(Arxiv 2023) Interactive Face Video Coding: A Generative Compression Framework Chen, Bolin and Wang, Zhao and Li, Binzhe and Wang, Shurun and Wang, Shiqi and Ye, Yan paper
(Arxiv 2023) MaskCRT: Masked Conditional Residual Transformer for Learned Video Compression Chen, Yi-Hsin and Xie, Hong-Sheng and Chen, Cheng-Wei and Gao, Zong-Lin and Peng, Wen-Hsiao and Benjak, Martin and Ostermann, J{"o}rnpaper
(Arxiv 2023) Interactive Face Video Coding: A Generative Compression Framework Chen, Bolin and Wang, Zhao and Li, Binzhe and Wang, Shurun and Wang, Shiqi and Ye, Yan paper
(Arxiv 2023) Butterfly: Multiple Reference Frames Feature Propagation Mechanism for Neural Video Compression Wang, Feng and Ruan, Haihang and Xiong, Fei and Yang, Jiayu and Li, Litian and Wang, Ronggang paper
(Arxiv 2023) IBVC: Interpolation-driven B-frame Video Compression Liu, Meiqin and Xu, Chenming and Yao, Chao and Lin, Weisi and Zhao, Yao paper
(Arxiv 2023) Multiscale Motion-Aware and Spatial-Temporal-Channel Contextual Coding Network for Learned Video Compression Wang, Yiming and Huang, Qian and Tang, Bin and Sun, Huashan and Li, Xing paper
(Arxiv 2023) Effortless Cross-Platform Video Codec: A Codebook-Based Method Tian, Kuan and Guan, Yonghang and Xiang, Jinxi and Zhang, Jun and Han, Xiao and Yang, Wei paper
(Arxiv 2023) Generative Face Video Coding Techniques and Standardization Efforts: A Review Chen, Bolin and Chen, Jie and Wang, Shiqi and Ye, Yan paper
(Arxiv 2023) Bitstream Organization for Parallel Entropy Coding on Neural Network-based Video Codecs Said, Amir and Le, Hoang and Farhadzadeh, Farzad paper
(Arxiv 2023) Hyperspectral Image Compression Using Sampling and Implicit Neural Representations Rezasoltani, Shima and Qureshi, Faisal Z paper
(Arxiv 2023) Deep Hierarchical Video Compression Lu, Ming and Duan, Zhihao and Zhu, Fengqing and Ma, Zhan paper
(Arxiv 2023) VCD: A Video Conferencing Dataset for Video Compression Naderi, Babak and Cutler, Ross and Khongbantabam, Nabakumar Singh and Hosseinkashi, Yasaman paper

✔2022 «🎯Back To Top»

（Arxiv 2022） VCT: A Video Compression Transformer Mentzer, Fabian and Toderici, George and Minnen, David and Hwang, Sung-Jin and Caelles, Sergi and Lucic, Mario and Agustsson, Eirikur paper
(ECCV 2022) Neural Video Compression Using GANs for Detail Synthesis and Propagation Mentzer, Fabian and Agustsson, Eirikur and Ball{'e}, Johannes and Minnen, David and Johnston, Nick and Toderici, George paper
(ECCV 2022) Canf-vc: Conditional augmented normalizing flows for video compression Ho, Yung-Han and Chang, Chih-Peng and Chen, Peng-Yu and Gnutti, Alessandro and Peng, Wen-Hsiao paper
(ECCV 2022) AlphaVC: High-Performance and Efficient Learned Video Compression Shi, Yibo and Ge, Yunying and Wang, Jing and Mao, Jue paper
(ECCV 2022) E-nerv: Expedite neural video representation with disentangled spatial-temporal context Li, Zizhang and Wang, Mengmeng and Pi, Huaijin and Xu, Kechun and Mei, Jianbiao and Liu, Yong paper
(ACM MM 2022) Hybrid Spatial-Temporal Entropy Modelling for Neural Video Compression Li, Jiahao and Li, Bin and Lu, Yan. paper
(TMM 2022) Temporal Context Mining for Learned Video Compression Sheng, Xihua and Li, Jiahao and Li, Bin and Li, Li and Liu, Dong and Lu, Yan paper
(TCSVT 2022) HMFVC: A Human-Machine Friendly Video Compression Scheme Huang, Zhimeng and Jia, Chuanmin and Wang, Shanshe and Ma, Siwei paper
(arXiv preprint 2022) CONTENT-ADAPTIVE MOTION RATE ADAPTION FOR LEARNED VIDEO COMPRESSION Chen, Chih-Hsuan Lin Yi-Hsin and Peng, Wen-Hsiao [paper]
(CVPRW 2022) Learned Low Bitrate Video Compression with Space-Time Super-Resolution Yang, Jiayu and Yang, Chunhui and Xiong, Fei and Wang, Feng and Wang, Ronggang [paper]
(CVPRW 2022) Learned Low Bitrate Video Compression With Space-Time Super-Resolution Yang, Jiayu and Yang, Chunhui and Xiong, Fei and Wang, Feng and Wang, Ronggang [paper]
(CVPR 2022) Coarse-to-fine Deep Video Coding with Hyperprior-guided Mode Prediction Zhihao Hu, Guo Lu, Jinyang Guo, Shan Liu, Wei Jiang, Dong Xu [paper]
(CVPR 2022) Learning Based Multi-Modality Image and Video Compression, Lu, Guo and Zhong, Tianxiong and Geng, Jing and Hu, Qiang and Xu, Dong [paper]
(CVPR 2022) LSVC: A Learning-based Stereo Video Compression Framework, Chen, Zhenghao and Lu, Guo and Hu, Zhihao and Liu, Shan and Jiang, Wei and Xu, Dong [paper]
(TPAMI 2022) Multi-modality deep restoration of extremely compressed face videos, Zhang, Xi and Wu, Xiaolin. [paper]
(arXiv preprint 2022) A Coding Framework and Benchmark towards Compressed Video Understanding, Yuan Tian, Guo Lu, Yichao Yan, Guangtao Zhai, Li Chen, Zhiyong Gao. [paper]
(Under review ICLR 2022) Learning Perceptual Compression of Facial Video, Shukor, Mustafa and Xu, YAO and Damodaran, Bharath Bhushan and Hellier, Pierre. [paper]
(Under review ICLR 2022) Uncertainty-Aware Deep Video Compression with Ensembles, Ma, Wufei and Li, Jiahao and Li, Bin and Lu, Yan. [paper]
(Signal Processing: Image Communication 2022) Learning to compress videos without computing motion, Chen, Meixu and Goodall, Todd and Patney, Anjul and Bovik, Alan C. [paper]
(arXiv preprint 2022) Multi-View Video Coding with GAN Latent Learning, Lan, Chengdong and Luo, Cheng and Yan, Hao and Zhao, Tiesong and Kwong, Sam. [paper]
(ICASSP 2022) Rate Control for Learned Video Compression, Li, Yanghao and Chen, Xinyao and Li, Jisheng and Wen, Jiangtao and Han, Yuxing and Liu, Shan and Xu, Xiaozhong. [paper]
(TCSVT 2022) Edge-Based Video Compression Texture Synthesis using Generative Adversarial Network, Zhu, Chen and Xu, Jun and Feng, Donghui and Xie, Rong and Song, Li. [paper]

✔2021 «🎯Back To Top»

(NeurIPS 2021) Nerv: Neural representations for videosChen, Hao and He, Bo and Wang, Hanyu and Ren, Yixuan and Lim, Ser Nam and Shrivastava, Abhinav [paper]
(ICLR 2021) Hierarchical autoregressive modeling for neural video compression, Yang, Ruihan and Yang, Yibo and Marino, Joseph and Mandt, Stephan. [paper]
(TPAMI 2021) An end-to-end learning framework for video compression, Lu, Guo and Zhang, Xiaoyun and Ouyang, Wanli and Chen, Li and Gao, Zhiyong and Xu, Dong. [paper]
(TIP 2021) End-to-End Rate-Distortion Optimized Learned Hierarchical Bi-Directional Video Compression, Y{\i}lmaz, M Ak{\i}n and Tekalp, A Murat. [paper]
(CVPR 21) Online-trained Upsampler for Deep Low Complexity Video Compression, Klopp, Jan P and Liu, Keng-Chi and Chien, Shao-Yi and Chen, Liang-Gee. [paper]
(NIPS 21) Deep Contextual Video Compression, Li, Jiahao and Li, Bin and Lu, Yan. [paper]
(CVPR 21) ELF-VC: Efficient Learned Flexible-Rate Video Coding, Rippel, Oren and Anderson, Alexander G and Tatwawadi, Kedar and Nair, Sanjay and Lytle, Craig and Bourdev, Lubomir. [paper]
(CVPR 21) FVC: A New Framework towards Deep Video Compression in Feature Space, Hu, Zhihao and Lu, Guo and Xu, Dong. [paper]
(CVPR 21) Deep Perceptual Preprocessing for Video Coding, Aaron Chadha, Yiannis Andreopoulos. [paper]
(CVPR 21) Deep learning in latent space for video prediction and compression, Liu, Bowen and Chen, Yu and Liu, Shiyu and Kim, Hun-Seok. [paper]
(ICIP 21) Variable-Rate Video Compression[C]//2021 IEEE International Conference on Image Processing, Lin, Jianping and Liu, Dong and Liang, Jie and Li, Houqiang and Wu, Feng. [paper] VR
(VCIP 21) DVC-P: Deep Video Compression with Perceptual Optimizations, Zhang, Saiping and Mrak, Marta and Herranz, Luis and Blanch, Marc G{'o}rriz and Wan, Shuai and Yang, Fuzheng. [paper]
(MTICTI 2021) Review and Evaluation of End-to-End Video Compression with Deep-Learning, Yasin, Hajar Maseeh and Ameen, Siddeeq Yosef. [paper]
(arXiv preprint 2021) Deep Video Coding with Dual-Path Generative Adversarial Network, Zhao, Tiesong and Feng, Weize and Zeng, Hongji and Niu, Yuzhen and Liu, Jiaying. [paper]
(arXiv preprint 2021) Versatile Learned Video Compression, Feng, Runsen and Guo, Zongyu and Zhang, Zhizheng and Chen, Zhibo. [paper]
(arXiv preprint 2021) A. Generalized Difference Coder: A Novel Conditional Autoencoder Structure for Video Compression, Brand, Fabian and Seiler, J{"u}rgen and Kaup, Andr{'e}. [paper]
(arXiv preprint 2021) Implicit Neural Video Compression, Zhang, Yunfan and van Rozendaal, Ties and Brehmer, Johann and Nagel, Markus and Cohen, Taco. [paper]
(arXiv preprint 2021) Self-Supervised Learning of Perceptually Optimized Block Motion Estimates for Video Compression, Guo, Zongyu and Feng, Runsen and Zhang, Zhizheng and Jin, Xin and Chen, Zhibo. [paper] MV
(arXiv preprint 2021) Learning Cross-Scale Prediction for Efficient Neural Video Compression, Paul, Somdyuti and Norkin, Andrey and Bovik, Alan C. [paper] MV
(arXiv preprint 2021) Neural Video Compression using GANs for Detail Synthesis and Propagation, Mentzer, Fabian and Agustsson, Eirikur and Ball{'e}, Johannes and Minnen, David and Johnston, Nick and Toderici, George. [paper]
(arXiv preprint 2021) Neural weight step video compression, Czerkawski, Mikolaj and Cardona, Javier and Atkinson, Robert and Michie, Craig and Andonovic, Ivan and Clemente, Carmine and Tachtatzis, Christos. [paper]
(arXiv preprint 2021) Perceptual Learned Video Compression with Recurrent Conditional GAN, Yang, Ren and Van Gool, Luc and Timofte, Radu. [paper]

✔2020 «🎯Back To Top»

(AAAI 20) Learned video compression via joint spatial-temporal correlation exploration, Yang, Ren and Mentzer, Fabian and Gool, Luc Van and Timofte, Radu. [paper]
(CVPR 20) Learning for video compression with hierarchical quality and recurrent enhancement, Liu, Haojie and Shen, Han and Huang, Lichao and Lu, Ming and Chen, Tong and Ma, Zhan. [paper]
(CVPR 20) M-LVC: Multiple frames prediction for learned video compression, Lin, Jianping and Liu, Dong and Li, Houqiang and Wu, Feng. [paper]
(CVPR 20) Learned video compression with feature-level residuals, Feng R, Wu Y, Guo Z, et al. [paper]
(ACCV 20) Feedback recurrent autoencoder for video compression, Lin, Golinski, Adam and Pourreza, Reza and Yang, Yang and Sautiere, Guillaume and Cohen, Taco S. [paper]
(CSUR 20) Deep learning-based video coding: A review and a case study, Liu, Dong and Li, Yue and Lin, Jianping and Li, Houqiang and Wu, Feng. [paper]

✔2019 «🎯Back To Top»

(ICCV 19) Dvc: An end-to-end deep video compression framework, Lu, Guo and Ouyang, Wanli and Xu, Dong and Zhang, Xiaoyun and Cai, Chunlei and Gao, Zhiyong. [paper]
(ICCV 19) Learned video compression, Rippel, Oren and Nair, Sanjay and Lew, Carissa and Branson, Steve and Anderson, Alexander G and Bourdev, Lubomir. [paper]
(NIPS 19) Deep generative video compression, Lombardo, Salvator and Han, Jun and Schroers, Christopher and Mandt, Stephan. [paper]
(TCSVT 19) Image and video compression with neural networks: A review, Ma, Siwei and Zhang, Xinfeng and Jia, Chuanmin and Zhao, Zhenghui and Wang, Shiqi and Wang, Shanshe. [paper]

✔2018 «🎯Back To Top»

(ECCV 18) Video compression through image interpolation, Wu, Chao-Yuan and Singhal, Nayan and Krahenbuhl, Philipp. [paper]

✔2017 «🎯Back To Top»

(VCIP 17) Video compression based on spatio-temporal resolution adaptation, Afonso, Mariana and Zhang, Fan and Bull, David R. [paper]

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
README.md		README.md
motion		motion

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Awesome 🎉Deep Learning Based Video Compression

Contents (This part is updated from June 2024)

Group by time （This section stops updating from June 2024）

Generative compression

Video Coding for Machine

Rate Control

INNs

Low Complexity

Motion Related

Feature Coding

✔2024 «🎯Back To Top»

✔2023 «🎯Back To Top»

✔2022 «🎯Back To Top»

✔2021 «🎯Back To Top»

✔2020 «🎯Back To Top»

✔2019 «🎯Back To Top»

✔2018 «🎯Back To Top»

✔2017 «🎯Back To Top»

About

Releases

Packages

ppingzhang/Awesome-Deep-Learning-Based-Video-Compression

Folders and files

Latest commit

History

Repository files navigation

Awesome 🎉Deep Learning Based Video Compression

Contents (This part is updated from June 2024)

Group by time （This section stops updating from June 2024）

Generative compression

Video Coding for Machine

Rate Control

INNs

Low Complexity

Motion Related

Feature Coding

✔2024 «🎯Back To Top»

✔2023 «🎯Back To Top»

✔2022 «🎯Back To Top»

✔2021 «🎯Back To Top»

✔2020 «🎯Back To Top»

✔2019 «🎯Back To Top»

✔2018 «🎯Back To Top»

✔2017 «🎯Back To Top»

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages