结合随机森林的FVC帧内编码单元快速划分

任妍; 彭宗举; 崔鑫; 陈芬; 陈华

doi:10.11834/jig.180490

图像处理和编码 | 浏览量 : 0 下载量: 70 CSCD: 0

PDF
导出
分享
收藏
专辑

结合随机森林的FVC帧内编码单元快速划分
Random forest-based fast intra coding unit partition algorithm for FVC
2019年24卷第5期页码：724-733
收稿：2018-08-27，

修回：2018-10-17，

纸质出版：2019-05-16
DOI： 10.11834/jig.180490
稿件说明：

移动端阅览

任妍, 彭宗举, 崔鑫, 陈芬, 陈华. 结合随机森林的FVC帧内编码单元快速划分[J]. 中国图象图形学报, 2019,24(5):724-733. DOI： 10.11834/jig.180490.

Yan Ren, Zongju Peng, Xin Cui, Fen Chen, Hua Chen. Random forest-based fast intra coding unit partition algorithm for FVC[J]. Journal of Image and Graphics, 2019, 24(5): 724-733. DOI： 10.11834/jig.180490.

摘要

目的

未来视频编码（FVC）是在高效视频编码标准（HEVC）的基础上提出的新一代编码技术，复杂度极高。针对现有的基于HEVC的快速编码方法不适用于FVC中的四叉树加二叉树编码结构或节省时间有限的问题，提出了一种结合随机森林的FVC帧内编码单元（CU）快速划分算法。

方法

针对FVC中的四叉树加二叉树结构进行优化。首先，提取视频编码过程中的各CU的图像纹理特征和划分结果；然后，分别使用各划分深度下的纹理特征和划分结果进行在线训练，建立多个随机森林模型，不同深度的CU对应不同的模型；最后，使用模型对视频其余帧的CU进行划分结果预测，从而减少了划分模式遍历和率失真代价计算的次数，节省了编码时间。

结果

实验结果表明，与原始平台算法相比，本文算法能够节省44.1%的时间，在相同峰值信噪比的情况下，比特率仅上升2.6%；与当前先进的方法相比，能进一步节省20%以上的时间。

结论

通过提取图像的纹理特征，建立随机森林模型，对CU划分结果进行预测，在保证编码率失真性能的前提下，有效地降低了FVC的帧内CU划分复杂度。

Abstract

Objective

Given the development of digital video technology

especially the emergence of ultra-high definition (UHD) video technology

video compression faces enormous challenges. To solve the problem of voluminous data and to address the high-speed transmission requirements of UHD videos

the Joint Video Experts Team (JVET) is exploring future video coding (FVC) based on the high-efficiency video coding (HEVC) standard. FVC uses the hybrid coding framework of HEVC with new techniques. The compression efficiency of FVC is higher than that of HEVC; however

its coding complexity is extremely high. Therefore

reducing the complexity of FVC is of great significance. Among all the new techniques in FVC

the most effective but extremely time consuming one is the quad tree plus binary tree (QTBT) coding structure

which includes four partition modes

namely

quad tree split

vertical split

horizontal split

and no-split. The final split of coding units (CUs) is decided after trying all the partition modes and calculating the rate distortion cost. Thus

the complexity of the QTBT is extremely high. The existing HEVC-based fast coding method is no longer suitable for FVC because the QTBT coding structure and the recent work about low-complexity encoding methods are insufficient for FVC applications. To reduce the high complexity of FVC

the complexity of the QTBT structure should be considered. The traversal process of CU partition modes exhibits redundancy

and unnecessary attempts to achieve mode partition should be avoided. To optimize CUs' split process

we propose a random forest-based fast intra coding unit partition algorithm for FVC.

Method

The proposed algorithm is designed to optimize the QTBT structure in FVC. Compared with traditional statistical-based methods

the machine learning-based approach is more applicable because of the elaborate split modes of the QTBT structure. Among the methods of machine learning

random forest offers unique advantages. Random forest can handle the classification problem of multi-dimensional data and is strongly resistant to over-fitting and estimation. Furthermore

the approach performs well on classification issues and is suitable for CU splitting. Therefore

a fast algorithm based on random forest is proposed. The problem of distinguishing different split results of CUs is considered a classification problem

and random forest is used as the classifier. The image texture features and split results of the CU in the first frame of video sequences are first extracted. Image texture features have a strong correlation with split results and can thus be selected as the training data of the model. Various image texture features are used in the algorithm to achieve superior performance

and they are carefully selected by the calculation of feature importance. Specifically

the features finally used in the proposed algorithm are the width and height of the CU

Haar wavelet coefficients

angular second moment

entropy

contrast

inverse differential moment

and standard deviation. After the data collection process

four random forest models are established for different depths of CUs. CU depth can be represented as the joint depth of the quad tree and the binary tree

and this representative method is used to collect data in the algorithm. Then

the texture features and split results are set as multidimensional data

and they are separately trained online for each model. The training time is included in the entire encoding time and is relatively shorter than the encoding time. Finally

the trained models are used to predict the split results of the CUs of the remaining frames of the video sequences

thereby reducing the traversal of the partition modes and the time of rate distortion cost calculation. To ensure the algorithm's effectiveness

we test the accuracy of the models online by using different video sequences. The algorithm is implemented on the recently released JEM5.0 platform. A total of 22 test sequences of different contents and resolutions from class A1 to class E are tested under the common test condition

which is a full I-frame configuration mode with quantization parameters 22

and 37. The encoding performance of the algorithm is evaluated using the Bjontegaard delta bitrate (BDBR) and average amount of time saved between the proposed algorithm and the original platform.

Result

Experimental results show that compared with the original platform's algorithm

the proposed algorithm can decrease the average encoding time by 44.1% with negligible coding performance loss

and the BDBR only increases by 2.6%. The approach can also save more than 20% of encoding time relative to state-of-the-art methods

with BDBR slightly increasing. This algorithm is suitable for various classes of video sequences with different resolutions and textures. Among all the sequences

the sequences with high resolution save more encoding time than other sequences do because of the online training time consumption. Furthermore

the coding performance of the proposed algorithm is stable

thereby proving the effectiveness of the models.

Conclusion

A random forest-based fast intra CU partition algorithm for FVC is proposed to reduce the complexity of the QTBT structure in FVC. By extracting the texture features of images

the algorithm establishes random forest models to predict the CU partitioning result while avoiding the unnecessary traversal of split modes to save encoding time. The proposed intra prediction coding algorithm can effectively reduce the complexity of FVC and maintain the encoding performance. The proposed algorithm is more suitable for video sequences with high resolution. Furthermore

the proposed algorithm should be optimized in the future to enhance time reduction and reduce coding performance loss. The possibilities of machine learning in FVC inter-prediction will also be explored in the future.

关键词

Keywords

references

Bross B, Han W J, Ohm J R, et al. High efficiency video coding (HEVC) text specification draft 10[C]//Proceedings of the 12th Joint Collaborative Team on Video Coding. Geneva: JCT-VC, 2013.

Chen J, Alshina E, Sullivan G J, et al. Algorithm description of joint exploration test model 1[C]//Proceedings of the Joint Video Exploration Team. Geneva: JVET, 2015.

Karczewicz M, Alshina E. JVET AHG report: tool evaluation (AHG1)[C]//Proceedings of the Joint Video Exploration Team 8th Meeting. Macau: JVET, 2017.

Lee D, Jeong J. Fast intra coding unit decision for high efficiency video coding based on statistical information[J]. Signal Processing:Image Communication, 2017, 55:121-129.[DOI:10.1016/j.image.2017.03.019]

Sun X B, Chen X D, Xu Y, et al. Fast CU partition strategy for HEVC based on Haar wavelet[J]. IET Image Processing, 2017, 11(9):717-723.[DOI:10.1049/iet-ipr.2016.1082]

Du B C, Siu W C, Yang X F. Fast CU partition strategy for HEVC intra-frame coding using learning approach via random forests[C ] //Proceedings of 2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference. Hong Kong, China: IEEE, 2016: 1085-1090.[ DOI: 10.1109/APSIPA.2015.7415439 http://dx.doi.org/10.1109/APSIPA.2015.7415439 ]

Ruiz D, Fernández-Escribano G, Adzic V, et al. Fast CU partitioning algorithm for HEVC intra coding using data mining[J]. Multimedia Tools and Applications, 2017, 76(1):861-894.[DOI:10.1007/s11042-015-3014-6]

Shen X L, Yu L. CU splitting early termination based on weighted SVM[J]. EURASIP Journal on Image and Video Processing, 2013, 2013(1):4.[DOI:10.1186/1687-5281-2013-4]

Wang Z, Wang S Q, Zhang J, et al. Probabilistic decision based block partitioning for future video coding[J]. IEEE Transactions on Image Processing, 2018, 27(3):1475-1486.[DOI:10.1109/TIP.2017.2778564]

Ryu S, Kang J. Machine learning-based fast angular prediction mode decision technique in video coding[J]. IEEE Transactions on Image Processing, 2018, 27(11):5525-5538.[DOI:10.1109/TIP.2018.2857404]

Lin T L, Jiang H Y, Huang J Y, et al. Fast intra coding unit partition decision in H.266/FVC based on spatial features[J]Journal of Real-Time Image Processing, 2018.[DOI:10.1007/s11554-018-0794-8]

Wang Z, Wang S Q, Zhang J, et al. Effective quadtree plus binary tree block partition decision for future video coding[C ] //Proceedings of 2017 Data Compression Conference. Snowbird, UT, USA: IEEE, 2017.[ DOI: 10.1109/DCC.2017.70 http://dx.doi.org/10.1109/DCC.2017.70 ]

Breiman L. Random forests[J]. Machine Learning, 2001, 45(1):5-32.[DOI:10.1023/A:1010933404324]

Segall A, Francois E, Rusanovskyy D, et al. JVET common test conditions and evaluation procedures for HDR/WCG video[C]//Proceedings of the Joint Video Exploration Team 5th Meeting. Geneva: JVET, 2017.

Huang H, Liu S, Huang Y W, et al. AHG5: Speed-up for JEM-3.1[C]//Proceedings of the Joint Video Exploration Team 4th Meeting. Chengdu, China: JVET, 2016.

文章被引用时，请邮件提醒。

提交