Random forest-based fast intra coding unit partition algorithm for FVC
- 2019, Vol. 24, No. 5, pp. 724-733
Received: 2018-08-27
Revised: 2018-10-17
Published in print: 2019-05-16
DOI: 10.11834/jig.180490
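Objective
Future video coding (FVC) is a new-generation coding technology proposed on the basis of the high-efficiency video coding (HEVC) standard, and its complexity is extremely high. Existing HEVC-based fast coding methods either do not apply to the quad tree plus binary tree coding structure in FVC or save only limited time. To address this problem, a random forest-based fast intra coding unit (CU) partition algorithm for FVC is proposed.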
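Method
The algorithm optimizes the quad tree plus binary tree structure in FVC. First, the image texture features and partition results of each CU are extracted during video encoding. Then, the texture features and partition results at each partition depth are used for online training to build multiple random forest models, with CUs at different depths corresponding to different models. Finally, the models are used to predict the partition results of the CUs in the remaining frames of the video, which reduces the number of partition mode traversals and rate distortion cost calculations and saves encoding time.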
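The per-depth workflow described here (collect samples, train one model per partition depth, then predict) can be sketched roughly as follows. This is only an illustration and not the authors' implementation: it assumes scikit-learn's `RandomForestClassifier`, uses four depth levels (the count given in the English abstract), and replaces real encoder data with a hypothetical generator, `synthetic_samples`, whose two features and labeling rule are invented for the example.

```python
import random
from sklearn.ensemble import RandomForestClassifier

MODES = ["no_split", "quad_split", "horizontal_split", "vertical_split"]
NUM_DEPTHS = 4  # assumption: four depth levels, one model each


def synthetic_samples(n, seed):
    """Generate placeholder (features, label) pairs; NOT real encoder data."""
    rng = random.Random(seed)
    features, labels = [], []
    for _ in range(n):
        horiz = rng.random()  # hypothetical horizontal texture activity
        vert = rng.random()   # hypothetical vertical texture activity
        if max(horiz, vert) < 0.3:
            label = 0  # smooth block: keep it whole
        elif min(horiz, vert) > 0.6:
            label = 1  # busy in both directions: quad split
        elif horiz > vert:
            label = 2  # invented rule: horizontal split
        else:
            label = 3  # invented rule: vertical split
        features.append([horiz, vert])
        labels.append(label)
    return features, labels


# "Online training": one model per depth, fitted on samples from that depth.
models = {}
for depth in range(NUM_DEPTHS):
    X, y = synthetic_samples(400, seed=depth)
    models[depth] = RandomForestClassifier(n_estimators=20, random_state=0).fit(X, y)


def predict_mode(depth, cu_features):
    """Predict the partition mode of one CU with the model for its depth."""
    label = models[depth].predict([cu_features])[0]
    return MODES[int(label)]


print(predict_mode(0, [0.05, 0.05]))
print(predict_mode(2, [0.90, 0.90]))
```

In an encoder, the predicted mode would replace the exhaustive trial of every partition mode for that CU; how mispredictions are handled is not specified in the abstract and is left out of the sketch.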
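Result
Experimental results show that, compared with the original platform algorithm, the proposed algorithm saves 44.1% of the encoding time, while the bit rate rises by only 2.6% at the same peak signal-to-noise ratio. Compared with current state-of-the-art methods, it saves a further 20% or more of the time.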
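Conclusion
By extracting image texture features and building random forest models to predict CU partition results, the algorithm effectively reduces the intra CU partition complexity of FVC while preserving rate distortion performance.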
Objective
With the development of digital video technology, especially the emergence of ultra-high definition (UHD) video technology, video compression faces enormous challenges. To handle the large data volume and the high-speed transmission requirements of UHD videos, the Joint Video Experts Team (JVET) is exploring future video coding (FVC) based on the high-efficiency video coding (HEVC) standard. FVC uses the hybrid coding framework of HEVC with new techniques. The compression efficiency of FVC is higher than that of HEVC; however, its coding complexity is extremely high. Therefore, reducing the complexity of FVC is of great significance. Among all the new techniques in FVC, the most effective but extremely time-consuming one is the quad tree plus binary tree (QTBT) coding structure, which includes four partition modes, namely, quad tree split, vertical split, horizontal split, and no-split. The final split of a coding unit (CU) is decided only after trying all the partition modes and calculating the rate distortion cost of each. Thus, the complexity of the QTBT is extremely high. Existing HEVC-based fast coding methods are no longer suitable for FVC because of the QTBT coding structure, and recent work on low-complexity encoding methods is insufficient for FVC applications. To reduce the high complexity of FVC, the complexity of the QTBT structure should be considered. The traversal of CU partition modes exhibits redundancy, and unnecessary partition mode attempts should be avoided. To optimize the CU split process, we propose a random forest-based fast intra coding unit partition algorithm for FVC.
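The cost of the exhaustive decision described above can be seen in a small single-level sketch. It is a simplification under stated assumptions: a real encoder evaluates partitions recursively, whereas here `rd_cost` is a hypothetical callback and `toy_costs` holds made-up numbers for one CU. The point is only that the full search evaluates four costs, while a predicted candidate list evaluates fewer.

```python
from typing import Callable, Iterable, Optional, Tuple

# The four QTBT partition modes named in the text.
MODES = ("no_split", "quad_split", "horizontal_split", "vertical_split")


def choose_partition(rd_cost: Callable[[str], float],
                     candidates: Optional[Iterable[str]] = None) -> Tuple[str, int]:
    """Return (best_mode, number_of_rd_cost_evaluations) for one CU."""
    tried = list(MODES if candidates is None else candidates)
    costs = {mode: rd_cost(mode) for mode in tried}  # one evaluation per mode
    best = min(costs, key=costs.get)
    return best, len(costs)


# Hypothetical rate distortion costs for a single CU (illustrative only).
toy_costs = {"no_split": 120.0, "quad_split": 95.5,
             "horizontal_split": 101.2, "vertical_split": 98.7}

# Exhaustive search: every mode is tried.
full_mode, full_evals = choose_partition(toy_costs.__getitem__)

# Fast path: a classifier (not shown) has narrowed the candidates to one mode.
fast_mode, fast_evals = choose_partition(toy_costs.__getitem__,
                                         candidates=["quad_split"])

print(full_mode, full_evals)
print(fast_mode, fast_evals)
```

When the predicted candidate matches the exhaustive winner, the decision is unchanged and three cost evaluations are skipped; when it does not, the encoder accepts a costlier partition, which is where the bitrate increase comes from.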
Method
The proposed algorithm is designed to optimize the QTBT structure in FVC. Because the split modes of the QTBT structure are elaborate, a machine learning-based approach is more applicable than traditional statistics-based methods. Among machine learning methods, random forest offers distinct advantages: it can handle the classification of multi-dimensional data, is strongly resistant to over-fitting, and performs well on classification problems, which makes it suitable for CU splitting. Therefore, a fast algorithm based on random forest is proposed. Distinguishing the different split results of CUs is treated as a classification problem, and random forest is used as the classifier. First, the image texture features and split results of the CUs in the first frame of each video sequence are extracted. Image texture features correlate strongly with split results and are therefore selected as the training data of the model. Various image texture features are used to achieve superior performance, and they are carefully selected by calculating feature importance. Specifically, the features finally used in the proposed algorithm are the width and height of the CU, Haar wavelet coefficients, angular second moment, entropy, contrast, inverse difference moment, and standard deviation. After data collection, four random forest models are established for different CU depths. CU depth is represented as the joint depth of the quad tree and the binary tree, and this representation is used to collect data in the algorithm. Then, the texture features and split results are treated as multidimensional data, and each model is trained online on its own data. The training time is included in the total encoding time and is short relative to it. Finally, the trained models are used to predict the split results of the CUs in the remaining frames of the video sequences, thereby reducing the traversal of partition modes and the number of rate distortion cost calculations. To ensure the algorithm's effectiveness, we test the accuracy of the models online on different video sequences. The algorithm is implemented on the recently released JEM5.0 platform. A total of 22 test sequences with different contents and resolutions, from class A1 to class E, are tested under the common test condition, which is a full I-frame configuration with quantization parameters 22, 27, 32, and 37. The encoding performance of the algorithm is evaluated using the Bjøntegaard delta bitrate (BDBR) and the average time saved by the proposed algorithm relative to the original platform.
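Several of the listed texture features (angular second moment, entropy, contrast, and inverse difference moment) are conventionally computed from a gray-level co-occurrence matrix (GLCM). The sketch below shows one common way to do so, together with the standard deviation. The abstract does not state the GLCM offset or the number of gray levels, so the horizontal neighbor offset and the 8-level quantization are assumptions; the Haar wavelet coefficients and the CU width and height are omitted for brevity.

```python
import math
from collections import Counter


def glcm_features(block, levels=8, max_value=255):
    """Texture features of a 2-D list of luma samples via a horizontal GLCM."""
    # Quantize samples to a small number of gray levels (assumed: 8).
    q = [[min(v * levels // (max_value + 1), levels - 1) for v in row]
         for row in block]
    # Count co-occurrences of each sample with its right-hand neighbor.
    pairs = Counter()
    for row in q:
        for a, b in zip(row, row[1:]):
            pairs[(a, b)] += 1
    total = sum(pairs.values())
    asm = entropy = contrast = idm = 0.0
    for (i, j), count in pairs.items():
        p = count / total                  # normalized co-occurrence probability
        asm += p * p                       # angular second moment
        entropy -= p * math.log2(p)        # entropy in bits
        contrast += (i - j) ** 2 * p       # contrast
        idm += p / (1 + (i - j) ** 2)      # inverse difference moment
    return {"asm": asm, "entropy": entropy, "contrast": contrast, "idm": idm}


def std_dev(block):
    """Population standard deviation of all samples in the block."""
    samples = [v for row in block for v in row]
    mean = sum(samples) / len(samples)
    return math.sqrt(sum((v - mean) ** 2 for v in samples) / len(samples))


flat = [[128] * 4 for _ in range(4)]          # smooth block
stripes = [[0, 255, 0, 255] for _ in range(4)]  # strong vertical stripes

print(glcm_features(flat), std_dev(flat))
print(glcm_features(stripes), std_dev(stripes))
```

A flat block concentrates all co-occurrence mass in one cell (high angular second moment, zero contrast), while the striped block spreads it across distant gray levels (high contrast, low inverse difference moment), which is the kind of separation a split classifier can exploit.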
Result
Experimental results show that, compared with the original platform's algorithm, the proposed algorithm decreases the average encoding time by 44.1% with negligible coding performance loss: the BDBR increases by only 2.6%. The approach also saves more than 20% of encoding time relative to state-of-the-art methods, with a slight increase in BDBR. The algorithm is suitable for various classes of video sequences with different resolutions and textures. Among all the sequences, those with high resolution save more encoding time than the others because the online training time takes up a smaller share of their encoding time. Furthermore, the coding performance of the proposed algorithm is stable, which supports the effectiveness of the models.
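For readers unfamiliar with the two metrics, the sketch below shows the usual way they are computed: BDBR by fitting a cubic polynomial to log-rate as a function of PSNR for each encoder and averaging the gap over the shared PSNR range, and time saving as the relative reduction in encoding time. It assumes NumPy; the rate, PSNR, and timing numbers are hypothetical placeholders, not data from the paper, chosen so that the test encoder uses 2.6% more bits at every PSNR point.

```python
import numpy as np


def bd_rate(rate_anchor, psnr_anchor, rate_test, psnr_test):
    """Bjontegaard delta bitrate in percent (cubic fit of log10 rate vs PSNR)."""
    poly_a = np.polyfit(psnr_anchor, np.log10(rate_anchor), 3)
    poly_t = np.polyfit(psnr_test, np.log10(rate_test), 3)
    # Integrate both fitted curves over the overlapping PSNR interval.
    lo = max(min(psnr_anchor), min(psnr_test))
    hi = min(max(psnr_anchor), max(psnr_test))
    int_a = np.polyval(np.polyint(poly_a), hi) - np.polyval(np.polyint(poly_a), lo)
    int_t = np.polyval(np.polyint(poly_t), hi) - np.polyval(np.polyint(poly_t), lo)
    avg_log_diff = (int_t - int_a) / (hi - lo)
    return (10 ** avg_log_diff - 1) * 100


def time_saving(t_anchor, t_test):
    """Relative encoding time reduction in percent."""
    return (t_anchor - t_test) / t_anchor * 100


# Hypothetical four-point curves, one point per quantization parameter.
psnr = [32.0, 35.0, 38.0, 41.0]                 # dB
rate_anchor = [1000.0, 2000.0, 4000.0, 8000.0]  # kbps
rate_test = [r * 1.026 for r in rate_anchor]    # 2.6% more bits at equal PSNR

print(round(bd_rate(rate_anchor, psnr, rate_test, psnr), 3))
print(round(time_saving(100.0, 55.9), 3))
```

A positive BDBR means the tested encoder needs more bits for the same quality, so the trade-off reported here is a small bitrate penalty in exchange for a large reduction in encoding time.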
Conclusion
A random forest-based fast intra CU partition algorithm for FVC is proposed to reduce the complexity of the QTBT structure in FVC. By extracting the texture features of images, the algorithm establishes random forest models to predict the CU partitioning result, avoiding the unnecessary traversal of split modes and saving encoding time. The proposed intra prediction coding algorithm effectively reduces the complexity of FVC while maintaining encoding performance, and it is more suitable for video sequences with high resolution. In future work, the algorithm should be optimized to save more time and reduce coding performance loss, and the possibilities of machine learning in FVC inter-prediction will also be explored.