-
导出
分享
-
收藏
-
专辑
自适应卷积特征选择的实时跟踪算法
Adaptive convolutional feature selection for real-time visual tracking
- 2018年23卷第11期 页码:1742-1750
收稿:2018-04-12,
修回:2018-6-9,
纸质出版:2018-11-16
DOI: 10.11834/jig.180252
移动端阅览

浏览全部资源
扫码关注微信
导出
分享
收藏
专辑
收稿:2018-04-12,
修回:2018-6-9,
纸质出版:2018-11-16
移动端阅览
目的
2
针对深度卷积特征相关滤波跟踪算法因特征维度多造成的跟踪速度慢及其在目标发生形变、遮挡等情况时存在跟踪失败的问题,提出了一种自适应卷积特征选择的实时跟踪算法。
方法
2
该算法先分析结合深度卷积特征的相关滤波跟踪算法定位目标的特性,然后提出使用目标区域和搜索区域的特征均值比来评估卷积操作,选取满足均值比大于阈值的特征通道数最多的卷积层,减少卷积特征的层数及维度,并提取该卷积层的有效卷积特征来训练相关滤波分类器,最后采用稀疏的模型更新策略提高跟踪速度。
结果
2
在OTB-100标准数据集上进行算法测试,本文算法的平均距离精度值达86.4%,平均跟踪速度达29.9帧/s,比分层卷积相关滤波跟踪算法平均距离精度值提高了2.7个百分点,速度快将近3倍。实验结果表明,本文自适应特征选择的方式在保证跟踪精度的同时有效地提升了跟踪的速度,且优于当前使用主成分分析降维的方式;与现有前沿跟踪算法对比,本文算法的整体性能优于实验中对比的9种算法。
结论
2
该算法采用自适应卷积通道和卷积层选择的方式有效地减少了卷积层数和特征维度,降低了模型的复杂度,提升了跟踪速度,利用稀疏模型更新策略进一步提升了跟踪的速度,减少了模型漂移现象,当目标发生快速运动、遇到遮挡、光照变化等复杂场景时,仍可实时跟踪到目标,具有较强的鲁棒性和适应性。
Objective
2
In the field of object tracking
the most serious difficulty is that the object may have a motion in different degrees in each video frame. Different types of movements will cause complex scenes of the object's own non-rigid deformation
background clusters
occlusion
fast motion and so on
thereby making object tracking more difficult. The balance between high speed and high accuracy remains a challenging task
although considerable progress in enhancing the accuracy and speed of tracking has been achieved. Recently
discriminative correlation filter methods have been successfully and widely applied to the visual tracking field. The standard correlation filter method can obtain numerous training samples through a cyclic shift and can train the filters through fast Fourier transform algorithm
which can ensure real-time favorable performance and robustness. However
the tracking accuracy of the correlation filter tracking algorithms based on traditional manual features must be improved given the limitations of traditional manual features. Therefore
correlation filter tracking algorithms based on convolutional features have been proposed and developed. The correlation filter tracking algorithms based on deep convolutional features can lead to a low tracking speed considering multiple feature dimensions and tracking failure problems when the object is subjected to deformation or occlusion despite a high accuracy of such algorithms. Thus
a real-time tracking algorithm based on adaptive convolutional feature selection is proposed to solve these problems.
Method
2
First
the proposed method analyzes the characteristics of convolution features extracted from the convolutional network model trained on the classification data set and selects the multilayer convolution features suitable for object tracking. The method also analyzes the characteristics of localization prediction of correlation filter trackers based on deep convolutional features. Analysis results show that a large average feature ratio between object and search regions indicates an improved convolution operator. Thus
this study proposed the average feature ratio between object and search regions to evaluate the convolution operator of each channel of every convolution layer. Then
the feature selection strategy is applied to select the convolution layer with the most convolution channels whose feature mean ratio is larger than the threshold for each preselected convolution layer. This strategy can effectively reduce the number of layers with convolution features. Simultaneously
the strategy can reduce the dimensions of the selected convolution layer by removing the convolution features that are not larger than the threshold. Then
the correlation filter classifier is trained by extracting the remaining effective convolutional features from the selected layer. The trained classifier is used to predict the position of the object. Finally
a sparse model updating strategy is adopted to prevent overfitting of the correlation filter classifier and improve the tracking speed.
Result
2
The proposed approach is evaluated on 100 sequences of Object Tracker Benchmark (OTB-100)
which mainly contains 11 challenges (e.g.
variation
background blusters
low resolution and so on) that may be encountered in object tracking
and compared with 9 other state-of-the-art tracking methods. The selected benchmark
namely
center location error
distance precision
overlap precision
and one-pass evaluation is applied to evaluate the tracking algorithm. The experiments are divided into two parts. The first part analyzes the tracking results of the different pre-selected convolutional layers. This part includes the results of no dimension reduction method
dimension reduction using principal component analysis
and our adaptive feature selection method using the feature mean ratio. The average distance accuracy of our adaptive feature selection method is 86.4%
which is higher than that of other methods. Experimental results show that the method can effectively improve the tracking speed and that it is better than the current trackers which use principal component analysis in reducing feature dimensions. The second part presents the comparison of our method and the existing mainstream object tracking method. These algorithms include the original hierarchical convolutional filter tracking algorithm and other correlation filter tracking algorithms that use convolutional features or traditional manual features. The average distance accuracy of our algorithm is 86.4%
which is 2.7 percent points higher than the original hierarchical convolutional features for visual tracking algorithm. The average success rate in the proposed approach is 68.4%
which is 2.9 percent points higher than the original hierarchical convolutional filter tracking algorithm. The average tracking speed is 29.9 frame/s
which is approximately three times faster than the previous performance. The experimental results show that the adaptive feature selection method can effectively improve the tracking speed while ensuring the tracking accuracy. The overall performance is superior to the nine other state-of-the-art tracking methods in the experiment.
Conclusion
2
The feature mean ratio of the object and search regions is used to evaluate the convolution operator. The convolutional layer with the largest number of convolutional channels that satisfy the feature mean ratio threshold is selected
and the convolutional effective features of the selected convolutional layer are extracted to train the correlation filter classifier. The method not only effectively reduces the number of convolutional layers and the dimensions of the feature but also reduces the complexity of the model to improve the tracking speed by adaptively selecting convolutional channels and layers. In addition
a sparse model update strategy is utilized to further enhance the tracking speed and prevent model drifting. The proposed algorithm has excellent robustness and adaptability under complex scenes
such as occlusion
illumination change
and fast motion.
Bolme D S, Beveridge J R, Draper B A, et al. Visual object tracking using adaptive correlation filters[C]//Proceedings of 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. San Francisco: IEEE, 2010: 2544-2550.[ DOI:10.1109/CVPR.2010.5539960 http://dx.doi.org/10.1109/CVPR.2010.5539960 ]
Henriques J F, Caseiro R, Martins P, et al. Exploiting the circulant structure of tracking-by-detection with kernels[C]//Proceedings of the 12th European Conference on Computer Vision. Florence, Italy: Springer, 2012: 702-715.[ DOI:10.1007/978-3-642-33765-9_50 http://dx.doi.org/10.1007/978-3-642-33765-9_50 ]
Henriques J F, Caseiro R, Martins P, et al. High-speed tracking with kernelized correlation filters[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(3):583-596.[DOI:10.1109/TPAMI.2014.2345390]
Danelljan M, Khan F S, Felsberg M, et al. Adaptive color attributes for real-time visual tracking[C]//Proceedings of 2014 IEEE Conference on Computer Vision and Pattern Recognition. Columbus, Ohio, USA: IEEE, 2014: 1090-1097.[ DOI:10.1109/CVPR.2014.143 http://dx.doi.org/10.1109/CVPR.2014.143 ]
Danelljan M, H? ger G, Khan F S, et al. Learning spatially regularized correlation filters for visual tracking[C]//Proceedings of the 2015 IEEE International Conference on Computer Vision. Santiago, Chile: IEEE, 2015: 4310-4318.[ DOI:10.1109/ICCV.2015.490 http://dx.doi.org/10.1109/ICCV.2015.490 ]
Danelljan M, Hager G, Khan F S, et al. Convolutional features for correlation filter based visual tracking[C]//Proceedings of IEEE International Conference on Computer Vision Workshop. Santiago, Chile: IEEE, 2016: 621-629.[ DOI:10.1109/ICCVW.2015.84 http://dx.doi.org/10.1109/ICCVW.2015.84 ]
Ma C, Huang J B, Yang X K, et al. Hierarchical convolutional features for visual tracking[C]//Proceedings of 2015 IEEE International Conference on Computer Vision. Santiago, Chile: IEEE, 2015: 3074-3082.[ DOI:10.1109/ICCV.2015.352 http://dx.doi.org/10.1109/ICCV.2015.352 ]
Wang X Y, Li H X, Li Y, et al. Robust and real-time deep tracking via multi-scale domain adaptation[C]//Proceedings of 2017 IEEE International Conference on Multimedia and Expo. Hong Kong, China: IEEE, 2017: 1338-1343.[ DOI:10.1109/ICME.2017.8019450 http://dx.doi.org/10.1109/ICME.2017.8019450 ]
Huang C, Lucey S, Ramanan D. Learning policies for adaptive tracking with deep feature cascades[C]//Proceedings of 2017 IEEE International Conference on Computer Vision. Venice, Italy: IEEE, 2017: 105-114.[ DOI:10.1109/ICCV.2017.21 http://dx.doi.org/10.1109/ICCV.2017.21 ]
Song Y B, Ma C, Gong L J, et al. CREST: convolutional residual learning for visual tracking[C]//Proceedings of 2017 IEEE International Conference on Computer Vision. Venice, Italy: IEEE, 2017: 2574-2583.[ DOI:10.1109/ICCV.2017.279 http://dx.doi.org/10.1109/ICCV.2017.279 ]
Lukezic A, Vojir T, Zajc L C, et al. Discriminative correlation filter with channel and spatial reliability[C]//Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition. Honolulu, Hawaii: IEEE, 2017: 4847-4856.[ DOI:10.1109/CVPR.2017.515 http://dx.doi.org/10.1109/CVPR.2017.515 ]
Bertinetto L, Valmadre J, GolodetzS, et al. Staple: complementary learners for real-time tracking[C]//Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas, NV, USA: IEEE, 2016: 1401-1409.[ DOI:10.1109/CVPR.2016.156 http://dx.doi.org/10.1109/CVPR.2016.156 ]
Xiong C Z, Zhao L L, Guo F H. Kernelized correlation filters tracking based on adaptive feature fusion[J]. Journal of Computer-Aided Design&Computer Graphics, 2017, 29(6):1068-1074.
熊昌镇, 赵璐璐, 郭芬红.自适应特征融合的核相关滤波跟踪算法[J].计算机辅助设计与图形学学报, 2017, 29(6):1068-1074. [DOI:10.3969/j.issn.1003-9775.2017.06.012]
Danelljan M, Robinson A, Khan F S, et al. Beyond correlation filters: learning continuous convolution operators for visual tracking[C]//Proceedings of the 14th European Conference on Computer Vision. Amsterdam, The Netherlands: Springer, 2016: 472-488.[ DOI:10.1007/978-3-319-46454-1_29 http://dx.doi.org/10.1007/978-3-319-46454-1_29 ]
Danelljan M, Bhat G, Khan F S, et al. ECO: efficient convolution operators for tracking[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Honolulu, Hawaii: IEEE, 2017: 6931-6939.[ DOI:10.1109/CVPR.2017.733 http://dx.doi.org/10.1109/CVPR.2017.733 ]
Rifkin R M, Yeo G, Poggio T. Regularized least-squares classification[C]//Advances in Learning Theory: Methods, Model and Applications. NATO Science Series: Ⅲ: Computer and Systems Sciences. Amsterdam: IOS Press, 2003, 190: 131-154.
Qi Y K, Zhang S P, Qin L, et al. Hedged deep tracking[C]//Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas, NV, USA, 2016: 4303-4311.[ DOI:10.1109/CVPR.2016.466 http://dx.doi.org/10.1109/CVPR.2016.466 ]
Ning J F, Yang J M, JiangS J, et al. Object tracking via dual linear structured SVM and explicit feature map[C]//Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas, NV, USA: IEEE, 2016: 4266-4274.[ DOI:10.1109/CVPR.2016.462 http://dx.doi.org/10.1109/CVPR.2016.462 ]
相关作者
相关机构
京公网安备11010802024621