Fast TLD visual tracking algorithm with kernel correlation filter
2018, Vol. 23, No. 11, Pages 1686-1696
Received: 2018-12-22; Revised: 2018-4-18; Published in print: 2018-11-16
DOI: 10.11834/jig.170655
Objective
How to track a target quickly and robustly has long been one of the important research directions in computer vision, and the TLD (tracking-learning-detection) algorithm provides an effective solution to this problem. To further improve the tracking performance of the TLD algorithm, this paper improves it in two respects.
Method
First, in the tracking module, a scale-adaptive kernelized correlation filter (KCF) is adopted as the tracker. Considering that the tracking and detection modules are independent of each other, the proposed algorithm uses the detection module to judge the accuracy of the tracking module's result and selectively updates the KCF filter template according to that judgment, as sketched below. Then, in the detection module, the optical flow method is used to make a preliminary prediction of the target position; the target detection area is dynamically adjusted according to the prediction, after which a classifier is used to locate the target precisely.
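The selective template update can be outlined in a few lines. This is only an illustrative sketch under assumed names and values (`kcf.train`, `kcf.alpha`, `kcf.template`, an agreement threshold of 0.5 IoU, and a learning rate of 0.02); these details are not specified in the abstract.

```python
def box_iou(a, b):
    """Intersection-over-union of two boxes given as (x, y, w, h)."""
    x1, y1 = max(a[0], b[0]), max(a[1], b[1])
    x2 = min(a[0] + a[2], b[0] + b[2])
    y2 = min(a[1] + a[3], b[1] + b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    return inter / (a[2] * a[3] + b[2] * b[3] - inter)


def selective_update(kcf, frame, tracker_box, detector_box,
                     agree_iou=0.5, lr=0.02):
    """Update the KCF template only when the detector confirms the tracker.

    `kcf.train(frame, box)` is assumed to return a new (alpha, template)
    pair for the given box; blending follows the usual KCF linear
    interpolation with learning rate `lr`.
    """
    if detector_box is None or box_iou(tracker_box, detector_box) < agree_iou:
        return  # tracker result not confirmed (possible occlusion/drift): freeze the model
    new_alpha, new_tmpl = kcf.train(frame, tracker_box)
    kcf.alpha = (1.0 - lr) * kcf.alpha + lr * new_alpha
    kcf.template = (1.0 - lr) * kcf.template + lr * new_tmpl
```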
Result
To verify the superiority of the proposed algorithm, two groups of experiments were conducted. In the first experiment, the tracking performance of the proposed algorithm was tested on the OTB2013 and Temple Color 128 benchmarks. The results show that the tracking precision and success rate of the proposed algorithm are 0.761 and 0.559 on OTB2013 and 0.678 and 0.481 on Temple Color 128, respectively, and the average tracking speed over all test videos reaches 27.92 frames/s. In the second experiment, the proposed algorithm was tested and compared with three other improved algorithms on eight randomly selected video sequences. The results show that the proposed algorithm achieves the smallest center location error of 14.01, the largest overlap rate of 72.2%, and the fastest tracking speed of 26.23 frames/s, demonstrating good tracking performance.
Conclusion
The proposed algorithm uses the KCF tracker, which improves its adaptability to scenes with occlusion, illumination change, and motion blur, and uses the optical flow method to narrow the detection area, which improves the tracking speed. Experimental results show that the proposed algorithm achieves better tracking performance than the reference algorithms in most cases and exhibits good robustness when tracking a target over long periods.
Objective
Visual tracking is widely applied in fields such as video surveillance, human-computer interaction, and intelligent transportation. In recent years, domestic and foreign researchers have proposed numerous tracking algorithms for this purpose. In practical applications, these algorithms are required to track a target over long periods. However, continuously tracking a target is difficult for most algorithms given the complexity of the tracking scenario. Therefore, rapid and robust tracking of a target is a key issue that must be solved before visual target tracking technology can be put to practical use. The TLD (tracking-learning-detection) algorithm provides an effective solution to this issue. This study improves two aspects of the TLD algorithm to enhance its tracking performance.
Method
First, a scale-adaptive kernelized correlation filter (KCF) is used as the tracker in the tracking module. The KCF algorithm cannot adapt to scale changes of the target because the size of the filter template is fixed. However, the detection module of the TLD algorithm has a certain scale adaptability. Therefore, the proposed algorithm utilizes the scale-adaptive capability of the detection module to estimate the scale of the region of interest of the KCF tracker, and the resulting scale adjustments enable the KCF tracker to achieve improved tracking precision. Because the tracking and detection modules are independent of each other, the algorithm uses the detection module to assess the accuracy of the tracking module's result and selectively updates the KCF filter template in accordance with the assessment. Second, the optical flow method is used in the detection module to make a preliminary prediction of the target position. The optical flow method estimates the target motion between two adjacent frames without any prior knowledge. The target detection area is set in accordance with the predicted position, and its size is proportional to the target size. After the detection area is dynamically adjusted, a three-layer cascade classifier is used to locate the target accurately. Because target motion information is introduced, the anti-interference capability of the algorithm against similar objects in the scene is enhanced.
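A minimal sketch of this kind of optical-flow-based prediction of the detection region is given below, assuming OpenCV's pyramidal Lucas-Kanade routine; the sampling grid density and the window scale factor of 2 are illustrative assumptions, not values from the paper.

```python
import cv2
import numpy as np


def predict_detection_region(prev_gray, curr_gray, prev_box, scale=2.0):
    """Predict the detector's search window in the current frame.

    Pyramidal Lucas-Kanade optical flow estimates the median displacement
    of points sampled inside the previous target box; the detection region
    is a window centred on the shifted box whose size is `scale` times the
    target size.
    """
    x, y, w, h = prev_box
    # Sample a sparse grid of points inside the previous target box.
    xs, ys = np.meshgrid(np.linspace(x, x + w, 10), np.linspace(y, y + h, 10))
    pts = np.float32(np.stack([xs.ravel(), ys.ravel()], axis=1)).reshape(-1, 1, 2)

    nxt, status, _ = cv2.calcOpticalFlowPyrLK(prev_gray, curr_gray, pts, None)
    good = status.ravel() == 1
    if not good.any():
        return prev_box  # flow failed: fall back to the previous region

    # Median displacement is robust to a few badly tracked points.
    dx, dy = np.median((nxt[good] - pts[good]).reshape(-1, 2), axis=0)

    cx, cy = x + w / 2 + dx, y + h / 2 + dy   # predicted target centre
    rw, rh = scale * w, scale * h             # region proportional to target size
    return (int(cx - rw / 2), int(cy - rh / 2), int(rw), int(rh))
```

The classifier then only scans this predicted window instead of the whole frame, which is what reduces the detection cost.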
Result
Two sets of experiments are conducted to verify the superiority of the proposed algorithm. The first set of experiments is conducted on the OTB2013 and Temple Color 128 benchmarks; OTB2013 contains 50 video sequences, and Temple Color 128 contains 128 video sequences. Results show that the tracking precision and success rate of the proposed algorithm are 0.761 and 0.559 on OTB2013 and 0.678 and 0.481 on Temple Color 128, respectively. The proposed algorithm is compared with six state-of-the-art algorithms, namely, DSST, KCF, CNT, Struck, TLD, and DLT, and exhibits the best performance among them on both benchmarks. Moreover, the average tracking speed over all test videos reaches 27.92 frames/s, indicating favorable real-time performance. In the second set of experiments, the proposed algorithm and three other improved algorithms are tested and compared on eight randomly selected video sequences. The experimental results show that the proposed algorithm has the smallest center location error of 14.01, the largest overlap rate of 72.2%, and the fastest tracking speed of 26.23 frames/s, denoting that it achieves the best tracking performance among all of the improved algorithms.
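The precision and success scores quoted above follow the standard OTB evaluation protocol: precision is the fraction of frames whose center location error is within 20 pixels, and the success score is the area under the success-rate curve over overlap (IoU) thresholds. A compact, illustrative sketch of that computation (not code from the paper):

```python
import numpy as np


def precision_and_success(center_errors, overlaps, error_threshold=20.0):
    """OTB-style precision and success scores from per-frame statistics.

    `center_errors`: per-frame distance (pixels) between predicted and
    ground-truth box centres; `overlaps`: per-frame IoU values.
    """
    center_errors = np.asarray(center_errors, dtype=float)
    overlaps = np.asarray(overlaps, dtype=float)

    precision = float(np.mean(center_errors <= error_threshold))

    # Success plot: fraction of frames above each IoU threshold in [0, 1];
    # the reported score is the area under this curve (approximated by the mean).
    thresholds = np.linspace(0.0, 1.0, 21)
    success_curve = [np.mean(overlaps >= t) for t in thresholds]
    success_auc = float(np.mean(success_curve))
    return precision, success_auc
```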
Conclusion
The proposed algorithm uses the KCF tracker to improve its adaptability to challenging scenes, such as occlusion, illumination change, and motion blur. Furthermore, it uses the optical flow method to narrow the detection area, which improves the tracking speed. The experimental results show that the proposed algorithm exhibits better tracking performance than the reference algorithms in most cases and achieves favorable robustness in long-term tracking.
References
Smeulders A W M, Chu D M, Cucchiara R, et al. Visual tracking: an experimental survey[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2014, 36(7): 1442-1468. [DOI: 10.1109/TPAMI.2013.230]
Kalal Z, Mikolajczyk K, Matas J. Tracking-learning-detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2012, 34(7):1409-1422.[DOI:10.1109/TPAMI.2011.239]
Qin F, Wang R G, Liang Q X, et al. Improved TLD target tracking algorithm based on key feature points[J]. Computer Engineering and Applications, 2016, 52(4): 181-187. [DOI: 10.3778/j.issn.1002-8331.1402-0365]
Zhou X, Qian Q M, Ye Y Q, et al. Improved TLD visual target tracking algorithm[J]. Journal of Image and Graphics, 2013, 18(9): 1115-1123. [DOI: 10.11834/jig.20130908]
Sun C J, Zhu S H, Liu J W. Fusing Kalman filter with TLD algorithm for target tracking[C]//Proceedings of the 34th Chinese Control Conference. Hangzhou, China: IEEE, 2015: 3736-3741. [DOI: 10.1109/ChiCC.2015.7260218]
Lü N P, Cai X Y, Dong L, et al. Context object tracking algorithm based on TLD framework[J]. Video Engineering, 2015, 39(9): 6-9, 43. [DOI: 10.16280/j.videoe.2015.09.002]
Henriques J F, Caseiro R, Martins P, et al. High-speed tracking with Kernelized correlation filters[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(3):583-596.[DOI:10.1109/TPAMI.2014.2345390]
Wu Y, Lim J, Yang M H. Online object tracking: a benchmark[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Portland, USA: IEEE, 2013: 2411-2418. [DOI: 10.1109/CVPR.2013.312]
Liang P P, Blasch E, Ling H B. Encoding color information for visual tracking:algorithms and benchmark[J]. IEEE Transactions on Image Processing, 2015, 24(12):5630-5644.[DOI:10.1109/TIP.2015.2482905]
Zhang K H, Liu Q S, Wu Y, et al. Robust visual tracking via convolutional networks without training[J]. IEEE Transactions on Image Processing, 2016, 25(4):1779-1792.[DOI:10.1109/TIP.2016.2531283]
Wang N Y, Yeung D Y. Learning a deep compact image representation for visual tracking[C]//Proceedings of the 26th International Conference on Neural Information Processing Systems. Lake Tahoe, Nevada: Curran Associates Inc., 2013: 809-817.
Zhang B H, Lu H C, Xiao Z Y, et al. Visual tracking via discriminative sparse similarity map[J]. IEEE Transactions on Image Processing, 2014, 23(4):1872-1881.[DOI:10.1109/TIP.2014.2308414]
Hare S, Golodetz S, Saffari A, et al. Struck:structured output tracking with kernels[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016, 38(10):2096-2109.[DOI:10.1109/TPAMI.2015.2509974]
Yang T Y, Peng G H. Fast algorithm for image matching based on NCC[J]. Modern Electronics Technique, 2012, 33(22): 107-109. [DOI: 10.3969/j.issn.1004-373X.2010.22.033]
Kalal Z, Matas J, Mikolajczyk K. P-N learning: bootstrapping binary classifiers by structural constraints[C]//Proceedings of 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. San Francisco, USA: IEEE, 2010: 49-56. [DOI: 10.1109/CVPR.2010.5540231]
Tu D W, Jiang J L. Improved algorithm for motion image analysis based on optical flow and its application[J]. Optics and Precision Engineering, 2011, 19(5): 1159-1164. [DOI: 10.3788/OPE.20111905.1159]
Yang M H, Tao J H, Ye J T, et al. Robust outlier rejection from optical flow tracking points[J]. Journal of Computer-Aided Design & Computer Graphics, 2012, 24(1): 76-82. [DOI: 10.3969/j.issn.1003-9775.2012.01.013]
Zhang B, Makram-Ebeid S, Prevost R, et al. Fast solver for some computational imaging problems:a regularized weighted least-squares approach[J]. Digital Signal Processing, 2014, 27:107-118.[DOI:10.1016/j.dsp.2014.01.007]