3D-HEVC深度图帧内预测快速算法

王驰; 冯桂; 蔡春亭; 韩雪

doi:10.11834/jig.170176

图像处理和编码 | 浏览量 : 0 下载量: 4 CSCD: 1

PDF
导出
分享
收藏
专辑

3D-HEVC深度图帧内预测快速算法
Fast intra prediction algorithm for depth maps in 3D-HEVC
2018年23卷第1期页码：18-27
收稿：2017-05-08，

修回：2017-10-2，

纸质出版：2018-01-16
DOI： 10.11834/jig.170176
稿件说明：

移动端阅览

王驰, 冯桂, 蔡春亭, 韩雪. 3D-HEVC深度图帧内预测快速算法[J]. 中国图象图形学报, 2018,23(1):18-27. DOI： 10.11834/jig.170176.

Chi Wang, Gui Feng, Chunting Cai, Xue Han. Fast intra prediction algorithm for depth maps in 3D-HEVC[J]. Journal of Image and Graphics, 2018, 23(1): 18-27. DOI： 10.11834/jig.170176.

摘要

目的

多视点纹理加深度视频（MVD）格式逐渐成为立体视频的主流表现形式之一。新一代高效率立体视频编码（3D-HEVC）继承了HEVC的编码结构并引入一些新的编码技术，导致深度图帧内编码过程具有较高的计算复杂度。针对这一问题，提出了一种深度图帧内编码快速算法。

方法

本文算法利用深度图的特征分别对CU分割过程和粗略模式选择（RMD）过程进行优化。首先在四叉树编码结构上，利用基于纹理元的图像分析方法计算编码单元的梯度矩阵，若梯度矩阵中的梯度值之和小于给定的阈值，则终止该CU的分割进程。同时，对大尺寸的PU和小尺寸的PU分别利用纹理特征与粗略模式选择过程中Planar和DC进行低复杂度率失真计算后的最小率失真代价，跳过RMD中角度模式的检查过程。

结果

实验结果表明，与原始算法相比，本文算法平均节省40.64%的深度图编码时间，而合成视点的平均比特率仅仅增加了0.17%。本文算法不仅能对平坦的CU跳过不必要的深度决策过程，而且有效地减少了RMD中需要遍历的模式数目，提高了编码器的效率。

结论

该算法对CU分割进程和粗略模式选择过程都进行优化，在合成视点的视频质量几乎不变的前提下，有效降低了深度图的帧内编码复杂度。

Abstract

Objective

The multi-view video plus depth (MVD) format is gradually becoming one of the main representations for 3D videos. The 3D high-efficiency video coding (3D-HEVC) is the latest coding standard for compressing the MVD format. The 3D-HEVC inherits the coding structure of HEVC. Consequently

the splitting process of coding units (CUs) and the intra mode search process in depth map intra coding have great computational complexity. New techniques for depth map intra coding

such as depth modeling mode and simplified depth coding

have been introduced in recent years to preserve the sharp edges of depth maps. These techniques play an important role in the coding of depth maps. However

the adoption of these techniques further increases the computational complexity in 3D-HEVC encoder. A fast algorithm for depth map intra coding is proposed in this study to reduce the computational complexity of depth map intra coding.

Method

There are a lot of large smooth regions which are separated by sharp edges in depth maps. The CU splitting process and the rough mode decision (RMD) process are improved by the proposed algorithm by using the unique characteristics of depth maps. An algorithm based on the concept of texture primitive is proposed for the hierarchical quad-tree coding structure to early terminate the CU splitting process. First

the gradient matrix of the current CU can be calculated by using the texture analysis algorithm based on texture primitive. On the basis of statistical analysis

a strong correlation between the optimal size and sum of gradient values in gradient matrix is considered for each CU. If the sum of gradient values in gradient matrix is small

then the optimal size of current CU will be large. By contrast

if the sum of gradient values in gradient matrix is large

then the optimal size of current CU will be small. Therefore

if the sum of gradient values in gradient matrix is smaller than a given threshold

then the CU splitting process should be terminated. For the RMD process

the texture features and the smallest

$$LCRD_{\rm cost}$$

of Planar and DC are used to skip the search of angular modes in RMD for prediction units (PUs) of large size and PUs of small size

respectively. Planar and DC are two intra prediction modes that are highly suited to code smooth PUs. If the texture of current PU is flat

then Planar or DC is likely to be selected as the optimal mode. Hence

for PUs of large size

if the sum of the gradient values in the gradient matrix is zero

then only Planar and DC are added to the full-RD search list

and the RMD process is skipped. When the size of PUs is small

if the smallest

$$LCRD_{\rm cost}$$

of Planar and DC is smaller than a given threshold

then the RMD process is terminated immediately and the search of angular modes in RMD is skipped.

Results

In the proposed approach

the unnecessary depth levels of smooth CUs can be skipped

and the number of intra mode candidates for RMD is effectively reduced. The reference software HTM 13.0 of the 3D-HEVC standard is used to verify the coding performance of the proposed algorithm. Eight JCT-3V specified test sequences with two resolutions of 1 024×768 and 1 920×1088 are tested. The quantization parameter (QP) values for texture are 25

and 40

and the QP values for depth maps are 34

and 45. Experimental results show that compared with HTM 13.0

the proposed algorithm achieves an average depth map coding time reduction of 40.64% with a small bitrate loss of 0.17% for synthesized views under all intra scenario. For the eight test sequences

the coding time reduction of depth maps ranges from 34.77% to 51.42%

which indicates that the proposed algorithm can effectively improve encoder efficiency and has a general validity. In particular

the time saving of Poznan_Hall2 is over 50%

which is considerably larger than that of other sequences. This result is due to the fact that the depth maps of Poznan_Hall2 contain lesser edges and have a larger proportion of flat regions. The proposed algorithm also has advantages compared with other existing algorithms. A subjective quality comparison of synthesized views for Balloons (1 024×768) sequence and Poznan_Hall2 (1 920×1 088) sequence is presented to further evaluate the performance of the proposed algorithm. The results indicate that the quality of decoded synthesized views generated by the proposed algorithm is almost the same as the quality of those generated by the original HTM-13.0. The proposed algorithm can preferably preserve the edge information of depth maps.

Conclusion

The proposed algorithm not only accelerates quad-tree decision but also optimizes the RMD process. The algorithm periodically updates thresholds on the basis of temporal correlation of video sequences to ensure a good video quality of synthesized views. Subjective and objective evaluations show that the proposed algorithm can significantly reduce the computational complexity of depth map intra coding without decreasing the quality of synthesized views. The proposed algorithm also has practical values and can be applied to actual situations. Nonetheless

the proposed algorithm can be further improved. The algorithm optimizes the recursive splitting process of smooth CUs; however

the splitting process of CUs with a complex texture still has high computational complexity. Therefore

effective and efficient ways for reducing the depth levels of CUs with a complex texture will be studied in future research.

关键词

Keywords

references

Zhang Q W, An P, Zhang Y, et al. Depth map intra coding based on virtual view rendering distortionestimation[J]. Journal of Electronics and Information Technology, 2011, 33(11):2541-2546.

张秋闻, 安平, 张艳, 等.基于虚拟视点绘制失真估计的深度图帧内编码[J].电子与信息学报, 2011, 33(11):2541-2546.][DOI:10.3724/SP.J.1146.2011. 00218

Sullivan G J, Ohm J, Han W J, et al. Overview of the high efficiency video coding (HEVC) standard[J]. IEEE Transactions on circuits and systems for video technology, 2012, 22(12):1649-1668.[DOI:10.1109/TCSVT.2012. 2221191]

Sullivan G J, Boyce J M, Chen Y, et al. Standardized extensions of high efficiency video coding (HEVC)[J]. IEEE Journal of selected topics in Signal Processing, 2013, 7(6):1001-1016.[DOI:10.1109/JSTSP.2013. 2283657]

Kauff P, Atzpadin N, Fehn C, et al. Depth map creation and image-based rendering for advanced 3DTV services providing interoperability and scalability[J]. Signal Processing:Image Communication, 2007, 22(2):217-234.[DOI:10.1016/j.image.2006.11.013]

Fehn C. Depth-image-based rendering(DIBR), compression, and transmission for a new approach on 3 D-TV[C]//Proceedings of SPIE. San Jose, CA, USA:SPIE, 2004, 5291(2):93-104.[ DOI:10.1117/12. 524762 http://dx.doi.org/10.1117/12.524762 ]

Zhang Y, Zhu C, Lin Y, et al. Simplified reference pixel selection for constant partition value coding in 3D-HEVC[C]//Signal and Information Processing (ChinaSIP), 2015 IEEE China Summit and International Conference on. Chengdu, China:IEEE, 2015:244-248.[ DOI:10.1109/ChinaSIP.2015.7230400 http://dx.doi.org/10.1109/ChinaSIP.2015.7230400 ]

Merkle P, Müller K, Wiegand T. Coding of depth signals for 3D video using wedgelet block segmentation with residual adaptation[C]//Multimedia and Expo (ICME), 2013 IEEE International Conference on. San Jose, CA, USA:IEEE, 2013:1-6.[ DOI:10.1109/ICME.2013.6607429 http://dx.doi.org/10.1109/ICME.2013.6607429 ]

Zhang H B, Fu C H, Su W M, et al. Fast coding unit decision algorithm for depth intra coding in 3D-HEVC[J]. Journal of electronics and information technology, 2016, 38(10):2523-2530.

张洪彬, 伏长虹, 苏卫民, 等. 3D-HEVC深度图像帧内编码单元划分快速算法[J].电子与信息学报, 2016, 38(10):2523-2530.][DOI:10.11999/JEIT151426

Kim M, Ling N, Song L. Fast single depth intra mode decision for depth map coding in 3D-HEVC[C]//Multimedia & Expo Workshops (ICMEW), 2015 IEEE International Conference on. Turin, Italy:IEEE, 2015:1-6.[ DOI:10.1109/ICMEW.2015.7169769 http://dx.doi.org/10.1109/ICMEW.2015.7169769 ]

Park C S. Edge-based intra mode selection for depth-map coding in 3D-HEVC[J]. IEEE Transcation on Image Processing, 2015, 24(1):155-162.[DOI:10.1109/TIP.2014. 2375653]

Zhang H B, Tsang S H, Chan Y L, et al. Early determination of intra mode and segment-wise DC coding for depth map based on hierarchical coding structure in 3D-HEVC[C]//Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2015 Asia-Pacific. Hong Kong, China:IEEE, 2015:374-378.[ DOI:10.1109/APSIPA.2015.7415297 http://dx.doi.org/10.1109/APSIPA.2015.7415297 ]

Marpe D, Schwarz H, Bosse S, et al. Video compression using nested quadtree structures, leaf merging, and improved techniques for motion representation and entropy coding[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2010, 20(12):1676-1687.[DOI:10.1109/TCSVT.2010.2092615]

Wang C R, Zhang T. A fast image texture analysis algorithm[J]. Opto-Electronic Engineering, 2005, 32(1):74-76.

王成儒, 张涛.一种快速图像纹理分析算法[J].光电工程, 2005, 32(1):74-76.][DOI:10.3969/j.issn. 1003-501X.2005.01.020

Lainema J, Bossen F, Han W J, et al. Intra coding of the HEVC standard[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2012, 22(12):1792-1801.[DOI:10.1109/TCSVT.2012.2221525]

MPEG 3D-HTM Software V13.0.[CP].[2015-09-08] http://hevc.kw.bbc.co.uk/git/w/jctvc-3de.git http://hevc.kw.bbc.co.uk/git/w/jctvc-3de.git