面向纹理平滑的方向性滤波尺度预测模型

林俊彦; 刘春晓; 章金凯; 李泓易

doi:10.11834/jig.210176

图像理解和计算机视觉 | 浏览量 : 0 下载量: 168 CSCD: 1

PDF
导出
分享
收藏
专辑

面向纹理平滑的方向性滤波尺度预测模型
Texture-smoothing-oriented directional filtering scales-predicting model
2022年27卷第8期页码：2506-2515
收稿：2021-03-17，

修回：2021-5-31，

录用：2021-6-7，

纸质出版：2022-08-16
DOI： 10.11834/jig.210176
稿件说明：

移动端阅览

林俊彦, 刘春晓, 章金凯, 李泓易. 面向纹理平滑的方向性滤波尺度预测模型[J]. 中国图象图形学报, 2022,27(8):2506-2515. DOI： 10.11834/jig.210176.

Junyan Lin, Chunxiao Liu, Jinkai Zhang, Hongyi Li. Texture-smoothing-oriented directional filtering scales-predicting model[J]. Journal of Image and Graphics, 2022, 27(8): 2506-2515. DOI： 10.11834/jig.210176.

摘要

目的

传统图像处理的纹理滤波方法难以区分梯度较强的纹理与物体的结构，而深度学习方法使用的训练集生成方式不够合理，且模型表示方式比较粗糙，为此本文设计了一种面向纹理平滑的方向性滤波尺度预测模型，并生成了含有标签的新的纹理滤波数据集。

方法

在现有结构图像中逐连通区域填充多种纹理图，生成有利于模型训练的纹理滤波数据集。设计了方向性滤波尺度预测模型，该模型包含尺度感知子网络和图像平滑子网络。前者预测得到的滤波尺度图不但体现了该像素与周围像素是否为同一纹理，而且还隐含了该像素是否为结构像素的信息。后者以滤波尺度图和原图的堆叠作为输入，凭借少量的卷积层快速得出纹理滤波的结果。

结果

在本文的纹理滤波数据集上与7个算法进行比较，峰值信噪比(peak signal to noise ratio，PSNR)与结构相似度(structural similarity，SSIM)分别高于第2名2.79 dB、0.0133，均方误差(mean squared error，MSE)低于第2名6.863 8，运算速度快于第2名0.002 s。在其他数据集上的实验对比也显示出本文算法更好地保持结构与平滑纹理。通过比较不同数据集上训练的同一网络模型，证实了本文的纹理滤波数据集有助于增强模型对于强梯度纹理与物体结构的区分能力。

结论

本文制作的纹理滤波数据集使模型更好地区分强梯度纹理与物体结构并增强模型的泛化能力。本文设计的方向性滤波尺度预测模型在性能上超越了已有的大多数纹理平滑方法，尤其在强梯度纹理的抑制和弱梯度结构的保持两个方面表现优异。

Abstract

Objective

Texture filtering is a low-level task in image processing and computer vision

which aims to filter the image through the essential image structure preservation and other texture smoothing details. Current texture filtering algorithms are mainly divided into two categories like local-based and global-based orientation. Traditional methods are challenged to distinguish image structure and strong gradient textures in common. Due to the lack of reliable training set

recent deep learning algorithms often use the results of existing traditional methods as ground truth

so they are unable to refill the gaps of the existing traditional algorithms. For example

texture and structure aware filtering network (TSAFN) performs data synthesis by filling the whole image with the same texture

but textures should be object-dependent. Such a synthesis way will lead to a large gap between synthetic images and real-world images. In order to solve these problems

our novel dataset is generated for texture filtering training

and the image smoothing algorithm is proposed based on directional filtering scales-predicting model.

Method

First

a texture filtering dataset for deep learning is generated by filling texture images per object structure based on the existing structure images. At the same time

we processed the image structure via smoothing and compression. Hence

the dataset we generated can not only enhance the ability of the algorithm to distinguish strong gradient texture and structure

but also reduce the domain gap between synthetic images and real images. Then

the image smoothing algorithm based on directional filtering scales-predicting model is designed

which includes a scale-aware sub-network and an image smoothing sub-network. The scale-aware sub-network is used to predict directional texture filtering scales map. It not only reflects whether a pixel and its surrounding pixels are in the same texture

but also implies information about whether the pixel is a structural pixel or not. The image smoothing sub-network takes the stack of scales map predicted by the scale-aware sub-network and original image as input

and gets the filtered image through a small amount of convolution layer. It can complete the smoothing and correct the imperfection of the result of scale-aware sub-network quickly. In edge-aware sub-network

we applied the classic U-Net because of its excellent ability to easy use low-level features straightforward

and we change its input and output dimensions. The input of the scale-aware sub-network is the stack of RGB image and gradient map

the output of the scale-aware sub-network is a six-dimensional scales map. The image smoothing sub-network consists of seven convolutional layers

the first six layers are followed by ReLU and batch normalization

while the last layer is followed by sigmoid for preventing the pixel value out of bounds

the input of the image smoothing sub-network is the stack of an image and six-dimensional scales map

the output of the image smoothing sub-network is the filtered image. The number of images related to our training set

test set and verification set are 10 000

1 500

1 000

respectively

they were selected from our dataset randomly

and they did not overlap. Our network is implemented in Pytorch toolbox. The input images and ground truth images are clipped to 224×224 pixels for training

the momentum parameter is set to 0.9

the learning rate is set to 1E-2

and the weight decay is 0.000 2. We use an adaptive method

that is

if the loss does not decrease by more than 0.003 for 5 epochs

then the learning rate will be halved. The stochastic gradient descent(SGD) learning procedure is accelerated using a NVIDIA RTX 2080 GPU device.

Result

We compared our algorithm to the five traditional algorithms and two deep learning algorithms on our dataset and other real-world image datasets. The quantitative evaluation metrics used in our dataset contain the peak signal to noise ratio (PSNR)

the structural similarity (SSIM)

the mean square error (MSE) and the running time. In comparison with the results of different filtering algorithms from our dataset

our PSNR is 2.79 higher (higher is better) than the second-best

our SSIM is 0.013 3 higher (higher is better) than the second-best

our MSE is 6.863 8 lower (less is better) than the second-best

the running time of our method is 0.002 s faster than the second-best. All deep learning algorithms have been re-trained from our dataset

it is sorted out that our algorithm keeps the leading effect in the discrimination of structure and strong gradient texture based on the comparative results of the texture filtering results of real-world images. The results are trained by different datasets are compared in terms of same model

and it is proved that our dataset can make the model have better generalization ability and stronger ability of distinguishing the strong gradient texture and structure.

Conclusion

Our dataset contains a variety of textures and structures

which can develop the model to distinguish strong gradient texture and object structure better. Our data synthesis method can make the model have better generalization potential ability. Additionally

the designed image smoothing algorithm surpasses the existing methods in performance and speed based on directional filtering scales-predicting model.

关键词

Keywords

references

Cheng M M, Mitra N J, Huang X L and Hu S M. 2014. Salientshape: group saliency in image collections. The Visual Computer, 30(4): 443-453 [DOI: 10.1007/s00371-013-0867-4]

Cimpoi M, Maji S, Kokkinos I, Mohamed S and Vedaldi A. 2014. Describing textures in the wild//Proceedings of 2014 IEEE Conference on Computer Vision and Pattern Recognition. Columbus, USA: IEEE: 3606-3613 [ DOI: 10.1109/cvpr.2014.461 http://dx.doi.org/10.1109/cvpr.2014.461 ]

Fan Q N, Yang J L, Hua G, Chen B Q and Wipf D. 2017. A generic deep architecture for single image reflection removal and image smoothing//Proceedings of 2017 IEEE International Conference on Computer Vision. Venice, Italy: IEEE: 3258-3267 [ DOI: 10.1109/iccv.2017.351 http://dx.doi.org/10.1109/iccv.2017.351 ]

Fan Q N, Yang J L, Wipf D, Chen B Q and Tong X. 2018. Image smoothing via unsupervised learning. ACM Transactions on Graphics (TOG), 37(6): 1-14 [DOI: 10.1145/3272127.3275081]

Ham B, Cho M and Ponce J. 2018. Robust guided image filtering using nonconvex potentials. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40(1): 192-207 [DOI: 10.1109/tpami.2017.2669034]

He K M, Sun J and Tang X O. 2010. Guided image filtering//Proceedings of the 11th European Conference on Computer Vision. Crete, Greece: Springer: 1-14 [ DOI: 10.1007/978-3-642-15549-9_1 http://dx.doi.org/10.1007/978-3-642-15549-9_1 ]

Liu C X, Shao H, Chen Y J and Zhou Y G. 2018. Scale adaptive texture filtering based on semi-Gaussian gradient operator. Journal of Computer-Aided Design and Computer Graphics, 30(5): 878-885

刘春晓, 邵欢, 陈艳杰, 周杨钢. 2018. 基于半高斯梯度算子的尺度自适应纹理滤波. 计算机辅助设计与图形学学报, 30(5): 878-885 [DOI: 10.3724/SP.J.1089.2018.16610]

Lu K Y, You S D and Barnes N. 2018. Deep texture and structure aware filtering network for image smoothing//Proceedings of the 15th European Conference on Computer Vision. Munich, Germany: Springer: 229-245 [ DOI: 10.1007/978-3-030-01225-0_14 http://dx.doi.org/10.1007/978-3-030-01225-0_14 ]

Movahedi V and Elder J H. 2010. Design and perceptual validation of performance measures for salient object segmentation//Proceedings of 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition-Workshops. San Francisco, USA: IEEE: 49-56 [ DOI: 10.1109/cvprw.2010.5543739 http://dx.doi.org/10.1109/cvprw.2010.5543739 ]

Ronneberger O, Fischer P and Brox T. 2015. U-Net: convolutional networks for biomedical image segmentation//Proceedings of the 18th International Conference on Medical Image Computing and Computer-Assisted Intervention. Munich, Germany: Springer: 234-241 [ DOI: 10.1007/978-3-319-24574-4_28 http://dx.doi.org/10.1007/978-3-319-24574-4_28 ]

Shao H and Liu C X. 2018. Texture filtering by using texture gradient suppression and L 0 gradient minimization. Journal of Image and Graphics, 23(11): 1666-1675.

邵欢, 刘春晓. 2018. 结合纹理梯度抑制与 L 0 梯度最小化的纹理滤波. 中国图象图形学报, 23(11): 1666-1675 [DOI: 10.11834/jig.180280 ] .

Wang L J, Lu H C, Wang Y F, Feng M Y, Wang D, Yin B C and Ruan X. 2017. Learning to detect salient objects with image-level supervision//Proceedings of 2017 IEEE Conference on Computer Vis ion and Pattern Recognition. Honolulu, USA: IEEE: 3796-3805 [ DOI: 10.1109/CVPR.2017.404 http://dx.doi.org/10.1109/CVPR.2017.404 ]

Xu L, Lu C W, Xu Y and Jia J Y. 2011. Image smoothing via L 0 gradient minimization. ACM Transactions on Graphics, 30(6): 1-12 [DOI: 10.1145/2070781.2024208 ] .

Xu L, Ren J S J, Yan Q, Liao R J and Jia J Y. 2015. Deep edge-aware filters//Proceedings of the 32nd International Conference on Machine Learning. Lille, France: JMLR. org: 1669-1678

Xu L, Yan Q, Xia Y and Jia J Y. 2012. Structure extraction from texture via relative total variation. ACM Transactions on Graphics (TOG), 31(6): 1-10 [DOI: 10.1145/2366145.2366158]

ZhangF H, Dai L Q, Xiang S M and Zhang X P. 2015. Segment graph based image filtering: fast structure-preserving smoothing//Proceedings of 2015 IEEE International Conference on Computer Vision. Santiago, Chile: IEEE: 361-369 [ DOI: 10.1109/iccv.2015.49 http://dx.doi.org/10.1109/iccv.2015.49 ]

Zhang Q, Shen X Y, Xu L and Jia J Y. 2014. Rolling guidance filter//Proceedings of the 13th European Conference on Computer Vision. Zurich, Switzerland: Springer: 815-830 [ DOI: 10.1007/978-3-319-10578-9_53 http://dx.doi.org/10.1007/978-3-319-10578-9_53 ]

Zhu F D, Liang Z T, Jia X X, Zhang L and Yu Y Z. 2019. A benchmark for edge-preserving image smoothing. IEEE Transactions on Image Processing, 28(7): 3556-3570 [DOI: 10.1109/tip.2019.2908778]