视频编码的技术基础及发展方向

周建同; 杨海涛; 刘东; 马祥; 王田

doi:10.11959/j.issn.1000-0801.2017248

您当前的位置：

首页 >

文章列表页 >

视频编码的技术基础及发展方向

专题：视频技术的理论与实践 | 更新时间：2024-06-05

- 视频编码的技术基础及发展方向
- Trends and technologies of video coding
- 电信科学 2017年33卷第8期页码：16-25
- 作者机构：
  
  1. 华为技术有限公司，广东深圳 518129
  2. 中国科学技术大学，安徽合肥 230026
- 作者简介：
  
  [ "周建同（1980-），男，华为技术有限公司主任工程师，主要研究方向为多媒体应用系统和视频通信。" ]
  [ "杨海涛（1983-），男，华为技术有限公司主任工程师，主要研究方向为图像视频处理、压缩和通信。" ]
  [ "刘东（1983-），男，中国科学技术大学副教授，主要研究方向为图像视频压缩和多媒体数据挖掘。" ]
  [ "马祥（1987-），男，华为技术有限公司工程师，主要研究方向为视频压缩。" ]
  [ "王田（1967-），男，华为技术有限公司媒体技术实验室主任，主要研究方向为多媒体通信系统、虚拟/增强现实和计算机视觉。" ]
- 基金信息：
- DOI：10.11959/j.issn.1000-0801.2017248
  中图分类号： TP393
- 网络出版日期：2017-08，
  
  纸质出版日期：2017-08-15
- 稿件说明：
移动端阅览
周建同, 杨海涛, 刘东, 等. 视频编码的技术基础及发展方向[J]. 电信科学, 2017,33(8):16-25.

Jiantong ZHOU, Haitao YANG, Dong LIU, et al. Trends and technologies of video coding[J]. Telecommunications science, 2017, 33(8): 16-25.
周建同, 杨海涛, 刘东, 等. 视频编码的技术基础及发展方向[J]. 电信科学, 2017,33(8):16-25. DOI： 10.11959/j.issn.1000-0801.2017248.

Jiantong ZHOU, Haitao YANG, Dong LIU, et al. Trends and technologies of video coding[J]. Telecommunications science, 2017, 33(8): 16-25. DOI： 10.11959/j.issn.1000-0801.2017248.

摘要

现有视频编码采用基于块的混合编码架构，利用预测、变换、量化和熵编码技术实现对视频信号的高效压缩。在现有架构基础上进一步优化，提供针对视频图像信号局部特性的更加灵活的处理和编码。基于机器学习的视频编码技术有望部分或全面地改变现有的混合编码框架，给视频编码带来新的研究思路。未来视频除了现有的二维平面视频，还需要编码面向AR/VR应用的球面视频数据和体视频数据，这些新的视频源数据格式也给视频编码技术研究带来新的机会和挑战。

Abstract

The current video coding uses block based hybrid architecture

which uses predictive

transform

quantization and entropy coding techniques to efficiently compress video signals.Further optimizations on current architectures provide more flexible processing and coding for local characteristics of video image signals.Video coding based on machine learning was expected to change the existing hybrid coding framework partially or comprehensively

and bring new research ideas to video coding.In addition to existing 2D video signal

the future of video also needs to spherical video coding and volumetric video coding for AR/VR applications

the new video source data format of the video encoding technology has brought new opportunities and challenges.

关键词

Keywords

references

施唯佳 , 蒋力 , 贾立鼎 . OTT TV和IPTV的技术比较分析 [J ] . 电信科学 , 2014 , 30 ( 5 ): 15 - 19 ,26.

SHI W J , JIANG L , JIA L D . Technique comparative analysis of OTT TV and IPTV [J ] . Telecommunications Science , 2014 , 30 ( 5 ): 15 - 19 ,26.

魏峥 , 施唯佳 , 祝谷乔 . 互联网视频中多屏互动技术的应用 [J ] . 电信科学 , 2014 , 30 ( 5 ): 27 - 32 ,39.

WEI Z , SHI W J , ZHU G Q . Multi-screen interaction technologies on internet streaming video [J ] . Telecommunications Science , 2014 , 30 ( 5 ): 27 - 32 ,39.

张敏 , 宋杰 , 刘晓峰 . 电信运营商面对 OTT 的战略选择 [J ] . 电信科学 , 2014 , 30 ( 2 ): 142 - 146 ,151.

ZHANG M , SONG J , LIU X F . Strategic selection of telecom operators to counter OTT [J ] . Telecommunications Science , 2014 , 30 ( 2 ): 142 - 146 ,151.

MPEG . Presentations of the brainstorming session of the future of video coding standardization:MPEG-w15050 [S ] . 2014 .

MPEG . Steps towards a future video compression standard:MPEG-w15272 [S ] . 2015 .

MPEG . Requirements for a future video coding standard:MPEG-w15090 [S ] . 2015 .

MPEG . Request for contributions on future video compression technology:MPEG-w15273 [S ] . 2015 .

JVET . Joint call for evidence on video compression with capability beyond HEVC:JVET-F1002 [S ] . 2017 .

MPEG . Joint group on future video coding technology exploration (JVET):MPEG-w15897 [S ] . 2015 .

ITU . Coding tools investigation for next generation video coding:ITU-T SG16-C806 [S ] . 2015 .

JVET . JVET common test conditions and software reference configurations:JVET-B1010 [S ] . 2016 .

JVET . Algorithm description of joint exploration test model 6:JVET-F1001 [S ] . 2017 .

YUAN Y , KIM I K , ZHENG X , et al . Quadtree based nonsquare block structure for inter frame coding in high efficiency video coding [J ] . IEEE Transactions on Circuits and Systems for Video Technology , 2012 , 22 ( 12 ): 1707 - 1719 .

AN J , CHEN Y W , ZHANG K , et al . Block partitioning structure for next generation video coding:COM 16–C966 [S ] . 2015 .

JVET . Multi-type-tree:JVET-D0117 [S ] . 2016 .

YANG H , FU J , LIN S , et al . Description of video coding technology proposal by Huawei Technologies ＆ Hisilicon Technologies [C ] // ISO/IEC JTC1/SC29/WG11,JCTVC-A111,April 15-23,2010,Dresden,Germany.[S.1.:s.n] . 2010 .

KAMP S , WIEN M . Description of video coding technology proposal by RWTH Aachen University [C ] // JVT on Video Coding of ITU-T VCEG and ISO/IEC MPEG 1st Meeting,JCTVC,JCTVC-A112,April 15-23,2010,Dresden,Germany.[S.1.:s.n] . 2010 .

KAMP S , WIEN M . Decoder-side motion vector derivation for block-based video coding [J ] . IEEE Transactions on Circuits and Systems for Video Technology , 2012 , 22 ( 12 ): 1732 - 1745 .

CHIU Y , XU L , ZHANG W , et al . Description of video coding technology proposal:self derivation of motion estimation and adaptive (Wiener) loop filtering [C ] // JCT-VC 1st Meeting,JCTVC-A106,April 15-23,2010,Dresden,Germany.[S.1.:s.n] . 2010 .

CHEN J , CHIEN W J , KARCZEWICZ M , et al . Further improvements to HMKTA-1.0 [J ] . Doc VECG-AZO7 , 2015 .

LIN S , CHEN H , ZHANG H , et al . Affine transform prediction for next generation video coding [J ] . ITU-T SG16 Doc COM16-C1016 , 2015 .

CHEN H , LIANG F , LIN S . Affine SKIP and MERGE modes for video coding [C ] // 2015 IEEE 17th International Workshop on Multimedia Signal Processing (MMSP),Oct 19-21,2015,Xiamen,China . New Jersey:IEEE Press , 2015 : 1 - 5 .

LI L , LI H , LIU D , et al . An efficient four-parameter affine motion model for video coding [J ] . IEEE Transactions on Circuits and Systems for Video Technology , 2017 .

WITTMANN S , WEDI T . Transmission of post-filter hints for video coding schemes [C ] // 2007 IEEE International Conference on Image Processing,Sept 16-Oct 19,San Antonio,TX,USA . New Jersey:IEEE Press , 2007 : 81 - 84 .

ITU . Adaptive (Wiener) filter for video compression:ITU-T SG16 Contribution C,VCEG-C437 [S ] . 2008 .

ITU . Adaptive loop filter for improving coding efficiency:ITU-T SG16 Contribution C,VCEG-C402 [S ] . 2008 .

TSAI C Y , CHEN C Y , YAMAKAGE T , et al . Adaptive loop filtering for video coding [J ] . IEEE Journal of Selected Topics in Signal Processing , 2013 , 7 ( 6 ): 934 - 945 .

JIANG J . Image compression with neural networks–a survey [J ] . Signal Processing:Image Communication , 1999 , 14 ( 9 ): 737 - 760 .

TODERICI G , O'MALLEY S M , HWANG S J , et al . Variable rate image compression with recurrent neural networks [J ] . arXiv preprint arXiv:1511.06085 , 2015 .

TODERICI G , VINCENT D , JOHNSTON N , et al . Full resolution image compression with recurrent neural networks [J ] . arXiv preprint arXiv:1608.05148 , 2016 .

DUMAS T , ROUMY A , GUILLEMOT C . Image compression with stochastic winner-take-all auto-encoder [C ] // 2017 IEEE International Conference on Acoustics (ICASSP 2017),March 5-9,2017,New Orleans,USA . New Jersey:IEEE Press , 2017 : 1512 - 1516 .

PRAKASH A , MORAN N , GARBER S , et al . Semantic perceptual image compression using deep convolution networks [J ] . arXiv preprint arXiv:1612.08712 , 2016 .

BALLÉ J , LAPARRA V , SIMONCELLI E P . End-to-end optimization of nonlinear transform codes for perceptual quality [J ] . arXiv preprint arXiv:1607.05006 , 2016 .

BALLÉ J , LAPARRA V , SIMONCELLI E P . End-to-end optimized image compression [J ] . arXiv preprint arXiv:1611.01704 , 2016 .

DONG C , DENG Y , CHANGE Loy C , et al . Compression artifacts reduction by a deep convolutional network [C ] // 2017 IEEE International Conference on Computer Vision (ICCV 2015),Dec 7-13,2015,Santiago,Chile . New Jersey:IEEE Press , 2017 : 576 - 584 .

PARK W S , KIM M . CNN-based in-loop filtering for coding efficiency improvement [C ] // 2016 IEEE Image,Video,and Multi dimensional Signal Processing Workshop (IVMSP),July 11-12,2016,Bordeaux,France . New Jersey:IEEE Press , 2016 : 1 - 5 .

DAI Y , LIU D , WU F . A convolutional neural network approach for post-processing in HEVC intra coding [C ] // 2017 International Conference on Multimedia Modeling (MMM 2017),January 4-6,2017,Reykjavik,Iceland . Heidelberg:Springer , 2017 : 28 - 39 .

LIU Z , YU X , CHEN S , et al . CNN oriented fast HEVC intra CU mode decision [C ] // 2016 IEEE International Symposium on Circuits and Systems (ISCAS 2016),May 22-25,2016,Montreal,Canada . New Jersey:IEEE Press , 2016 : 2270 - 2273 .

LAFRUIT G , QUACKENBUSH S , FOESSEL S , et al . Technical report of the joint ad hoc group for digital representations of light/sound fields for immersive media applications [R ] . 2016 .

TULVAN C , MEKURIA R , LI Z , et al . Use cases for point cloud compression [R ] . 2016 .

MEKURIA R , LI Z , TULVAN C . Call for proposals for point cloud compression [R ] . 2017 .

PALOMO C M . Interactive image-based rendering for virtual view synthesis from depth image [D ] . Rio de Janeiro:Pontífícia Universidade Católica do Rio de Janeiro , 2009 .

浏览量

1632

下载量

CSCD

文章被引用时，请邮件提醒。

提交

工具集

关联资源

基于可解释机器学习模型的电信行业客户流失预测研究

基于流量特征重构与映射的物联网DDoS攻击单流检测方法

基于5G语音质差自适应算法研究及应用

基于区块链与深度学习的空间分集协作频谱感知系统

基于柔性太阳能电池和超薄水凝胶薄膜的手势识别