自适应卷积特征选择的实时跟踪算法

熊昌镇; 车满强; 王润玲

doi:10.11834/jig.180252

NCIG 2018会议专栏 | 浏览量 : 0 下载量: 0 CSCD: 1

导出
分享
收藏
专辑

自适应卷积特征选择的实时跟踪算法
Adaptive convolutional feature selection for real-time visual tracking
2018年23卷第11期页码：1742-1750
收稿：2018-04-12，

修回：2018-6-9，

纸质出版：2018-11-16
DOI： 10.11834/jig.180252
稿件说明：

移动端阅览

熊昌镇, 车满强, 王润玲. 自适应卷积特征选择的实时跟踪算法[J]. 中国图象图形学报, 2018,23(11):1742-1750. DOI： 10.11834/jig.180252.

Changzhen Xiong, Manqiang Che, Runling Wang. Adaptive convolutional feature selection for real-time visual tracking[J]. Journal of Image and Graphics, 2018, 23(11): 1742-1750. DOI： 10.11834/jig.180252.

摘要

目的

针对深度卷积特征相关滤波跟踪算法因特征维度多造成的跟踪速度慢及其在目标发生形变、遮挡等情况时存在跟踪失败的问题，提出了一种自适应卷积特征选择的实时跟踪算法。

方法

该算法先分析结合深度卷积特征的相关滤波跟踪算法定位目标的特性，然后提出使用目标区域和搜索区域的特征均值比来评估卷积操作，选取满足均值比大于阈值的特征通道数最多的卷积层，减少卷积特征的层数及维度，并提取该卷积层的有效卷积特征来训练相关滤波分类器，最后采用稀疏的模型更新策略提高跟踪速度。

结果

在OTB-100标准数据集上进行算法测试，本文算法的平均距离精度值达86.4%，平均跟踪速度达29.9帧/s，比分层卷积相关滤波跟踪算法平均距离精度值提高了2.7个百分点，速度快将近3倍。实验结果表明，本文自适应特征选择的方式在保证跟踪精度的同时有效地提升了跟踪的速度，且优于当前使用主成分分析降维的方式；与现有前沿跟踪算法对比，本文算法的整体性能优于实验中对比的9种算法。

结论

该算法采用自适应卷积通道和卷积层选择的方式有效地减少了卷积层数和特征维度，降低了模型的复杂度，提升了跟踪速度，利用稀疏模型更新策略进一步提升了跟踪的速度，减少了模型漂移现象，当目标发生快速运动、遇到遮挡、光照变化等复杂场景时，仍可实时跟踪到目标，具有较强的鲁棒性和适应性。

Abstract

Objective

In the field of object tracking

the most serious difficulty is that the object may have a motion in different degrees in each video frame. Different types of movements will cause complex scenes of the object's own non-rigid deformation

background clusters

occlusion

fast motion and so on

thereby making object tracking more difficult. The balance between high speed and high accuracy remains a challenging task

although considerable progress in enhancing the accuracy and speed of tracking has been achieved. Recently

discriminative correlation filter methods have been successfully and widely applied to the visual tracking field. The standard correlation filter method can obtain numerous training samples through a cyclic shift and can train the filters through fast Fourier transform algorithm

which can ensure real-time favorable performance and robustness. However

the tracking accuracy of the correlation filter tracking algorithms based on traditional manual features must be improved given the limitations of traditional manual features. Therefore

correlation filter tracking algorithms based on convolutional features have been proposed and developed. The correlation filter tracking algorithms based on deep convolutional features can lead to a low tracking speed considering multiple feature dimensions and tracking failure problems when the object is subjected to deformation or occlusion despite a high accuracy of such algorithms. Thus

a real-time tracking algorithm based on adaptive convolutional feature selection is proposed to solve these problems.

Method

First

the proposed method analyzes the characteristics of convolution features extracted from the convolutional network model trained on the classification data set and selects the multilayer convolution features suitable for object tracking. The method also analyzes the characteristics of localization prediction of correlation filter trackers based on deep convolutional features. Analysis results show that a large average feature ratio between object and search regions indicates an improved convolution operator. Thus

this study proposed the average feature ratio between object and search regions to evaluate the convolution operator of each channel of every convolution layer. Then

the feature selection strategy is applied to select the convolution layer with the most convolution channels whose feature mean ratio is larger than the threshold for each preselected convolution layer. This strategy can effectively reduce the number of layers with convolution features. Simultaneously

the strategy can reduce the dimensions of the selected convolution layer by removing the convolution features that are not larger than the threshold. Then

the correlation filter classifier is trained by extracting the remaining effective convolutional features from the selected layer. The trained classifier is used to predict the position of the object. Finally

a sparse model updating strategy is adopted to prevent overfitting of the correlation filter classifier and improve the tracking speed.

Result

The proposed approach is evaluated on 100 sequences of Object Tracker Benchmark (OTB-100)

which mainly contains 11 challenges (e.g.

variation

background blusters

low resolution and so on) that may be encountered in object tracking

and compared with 9 other state-of-the-art tracking methods. The selected benchmark

namely

center location error

distance precision

overlap precision

and one-pass evaluation is applied to evaluate the tracking algorithm. The experiments are divided into two parts. The first part analyzes the tracking results of the different pre-selected convolutional layers. This part includes the results of no dimension reduction method

dimension reduction using principal component analysis

and our adaptive feature selection method using the feature mean ratio. The average distance accuracy of our adaptive feature selection method is 86.4%

which is higher than that of other methods. Experimental results show that the method can effectively improve the tracking speed and that it is better than the current trackers which use principal component analysis in reducing feature dimensions. The second part presents the comparison of our method and the existing mainstream object tracking method. These algorithms include the original hierarchical convolutional filter tracking algorithm and other correlation filter tracking algorithms that use convolutional features or traditional manual features. The average distance accuracy of our algorithm is 86.4%

which is 2.7 percent points higher than the original hierarchical convolutional features for visual tracking algorithm. The average success rate in the proposed approach is 68.4%

which is 2.9 percent points higher than the original hierarchical convolutional filter tracking algorithm. The average tracking speed is 29.9 frame/s

which is approximately three times faster than the previous performance. The experimental results show that the adaptive feature selection method can effectively improve the tracking speed while ensuring the tracking accuracy. The overall performance is superior to the nine other state-of-the-art tracking methods in the experiment.

Conclusion

The feature mean ratio of the object and search regions is used to evaluate the convolution operator. The convolutional layer with the largest number of convolutional channels that satisfy the feature mean ratio threshold is selected

and the convolutional effective features of the selected convolutional layer are extracted to train the correlation filter classifier. The method not only effectively reduces the number of convolutional layers and the dimensions of the feature but also reduces the complexity of the model to improve the tracking speed by adaptively selecting convolutional channels and layers. In addition

a sparse model update strategy is utilized to further enhance the tracking speed and prevent model drifting. The proposed algorithm has excellent robustness and adaptability under complex scenes

such as occlusion

illumination change

and fast motion.

关键词

Keywords

references

Bolme D S, Beveridge J R, Draper B A, et al. Visual object tracking using adaptive correlation filters[C]//Proceedings of 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. San Francisco: IEEE, 2010: 2544-2550.[ DOI:10.1109/CVPR.2010.5539960 http://dx.doi.org/10.1109/CVPR.2010.5539960 ]

Henriques J F, Caseiro R, Martins P, et al. Exploiting the circulant structure of tracking-by-detection with kernels[C]//Proceedings of the 12th European Conference on Computer Vision. Florence, Italy: Springer, 2012: 702-715.[ DOI:10.1007/978-3-642-33765-9_50 http://dx.doi.org/10.1007/978-3-642-33765-9_50 ]

Henriques J F, Caseiro R, Martins P, et al. High-speed tracking with kernelized correlation filters[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(3):583-596.[DOI:10.1109/TPAMI.2014.2345390]

Danelljan M, Khan F S, Felsberg M, et al. Adaptive color attributes for real-time visual tracking[C]//Proceedings of 2014 IEEE Conference on Computer Vision and Pattern Recognition. Columbus, Ohio, USA: IEEE, 2014: 1090-1097.[ DOI:10.1109/CVPR.2014.143 http://dx.doi.org/10.1109/CVPR.2014.143 ]

Danelljan M, H? ger G, Khan F S, et al. Learning spatially regularized correlation filters for visual tracking[C]//Proceedings of the 2015 IEEE International Conference on Computer Vision. Santiago, Chile: IEEE, 2015: 4310-4318.[ DOI:10.1109/ICCV.2015.490 http://dx.doi.org/10.1109/ICCV.2015.490 ]

Danelljan M, Hager G, Khan F S, et al. Convolutional features for correlation filter based visual tracking[C]//Proceedings of IEEE International Conference on Computer Vision Workshop. Santiago, Chile: IEEE, 2016: 621-629.[ DOI:10.1109/ICCVW.2015.84 http://dx.doi.org/10.1109/ICCVW.2015.84 ]

Ma C, Huang J B, Yang X K, et al. Hierarchical convolutional features for visual tracking[C]//Proceedings of 2015 IEEE International Conference on Computer Vision. Santiago, Chile: IEEE, 2015: 3074-3082.[ DOI:10.1109/ICCV.2015.352 http://dx.doi.org/10.1109/ICCV.2015.352 ]

Wang X Y, Li H X, Li Y, et al. Robust and real-time deep tracking via multi-scale domain adaptation[C]//Proceedings of 2017 IEEE International Conference on Multimedia and Expo. Hong Kong, China: IEEE, 2017: 1338-1343.[ DOI:10.1109/ICME.2017.8019450 http://dx.doi.org/10.1109/ICME.2017.8019450 ]

Huang C, Lucey S, Ramanan D. Learning policies for adaptive tracking with deep feature cascades[C]//Proceedings of 2017 IEEE International Conference on Computer Vision. Venice, Italy: IEEE, 2017: 105-114.[ DOI:10.1109/ICCV.2017.21 http://dx.doi.org/10.1109/ICCV.2017.21 ]

Song Y B, Ma C, Gong L J, et al. CREST: convolutional residual learning for visual tracking[C]//Proceedings of 2017 IEEE International Conference on Computer Vision. Venice, Italy: IEEE, 2017: 2574-2583.[ DOI:10.1109/ICCV.2017.279 http://dx.doi.org/10.1109/ICCV.2017.279 ]

Lukezic A, Vojir T, Zajc L C, et al. Discriminative correlation filter with channel and spatial reliability[C]//Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition. Honolulu, Hawaii: IEEE, 2017: 4847-4856.[ DOI:10.1109/CVPR.2017.515 http://dx.doi.org/10.1109/CVPR.2017.515 ]

Bertinetto L, Valmadre J, GolodetzS, et al. Staple: complementary learners for real-time tracking[C]//Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas, NV, USA: IEEE, 2016: 1401-1409.[ DOI:10.1109/CVPR.2016.156 http://dx.doi.org/10.1109/CVPR.2016.156 ]

Xiong C Z, Zhao L L, Guo F H. Kernelized correlation filters tracking based on adaptive feature fusion[J]. Journal of Computer-Aided Design&Computer Graphics, 2017, 29(6):1068-1074.

熊昌镇, 赵璐璐, 郭芬红.自适应特征融合的核相关滤波跟踪算法[J].计算机辅助设计与图形学学报, 2017, 29(6):1068-1074. [DOI:10.3969/j.issn.1003-9775.2017.06.012]

Danelljan M, Robinson A, Khan F S, et al. Beyond correlation filters: learning continuous convolution operators for visual tracking[C]//Proceedings of the 14th European Conference on Computer Vision. Amsterdam, The Netherlands: Springer, 2016: 472-488.[ DOI:10.1007/978-3-319-46454-1_29 http://dx.doi.org/10.1007/978-3-319-46454-1_29 ]

Danelljan M, Bhat G, Khan F S, et al. ECO: efficient convolution operators for tracking[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Honolulu, Hawaii: IEEE, 2017: 6931-6939.[ DOI:10.1109/CVPR.2017.733 http://dx.doi.org/10.1109/CVPR.2017.733 ]

Rifkin R M, Yeo G, Poggio T. Regularized least-squares classification[C]//Advances in Learning Theory: Methods, Model and Applications. NATO Science Series: Ⅲ: Computer and Systems Sciences. Amsterdam: IOS Press, 2003, 190: 131-154.

Qi Y K, Zhang S P, Qin L, et al. Hedged deep tracking[C]//Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas, NV, USA, 2016: 4303-4311.[ DOI:10.1109/CVPR.2016.466 http://dx.doi.org/10.1109/CVPR.2016.466 ]

Ning J F, Yang J M, JiangS J, et al. Object tracking via dual linear structured SVM and explicit feature map[C]//Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas, NV, USA: IEEE, 2016: 4266-4274.[ DOI:10.1109/CVPR.2016.462 http://dx.doi.org/10.1109/CVPR.2016.462 ]

文章被引用时，请邮件提醒。

提交

视觉目标跟踪方法研究综述

基于视觉的液晶屏/OLED屏缺陷检测方法综述

无人机航拍图像中电力线检测方法研究进展

近年目标跟踪算法短评——相关滤波与深度学习

基于增强注意力的点云语义实例联合分割