改进特征与GPU加速的行人检测
Pedestrian detection based on improved feature and GPU acceleration
- 2018年23卷第8期 页码:1171-1180
收稿:2017-09-22,
修回:2018-3-2,
纸质出版:2018-08-16
DOI: 10.11834/jig.170517
移动端阅览

浏览全部资源
扫码关注微信
收稿:2017-09-22,
修回:2018-3-2,
纸质出版:2018-08-16
移动端阅览
目的
2
目前行人检测存在特征维度高、检测耗时的问题,行人图像易受到光照、背景、遮挡等影响,给实际行人检测造成了一定困难。为了提高检测准确性,减少检测耗时,针对以上问题,提出一种改进特征与GPU(graphic processing unit)加速的行人检测算法。
方法
2
首先,采用多尺度无缩放思想,通过canny算子对所有样本进行预处理,减少背景干扰与统一归格化的形变影响。然后,针对实际视频中的遮挡问题,把图像分成头部、左臂、上身、右臂、左腿、右腿6个区域。接着选取比LBP(local binary patterns)特征鲁棒性更好的SILTP(scale invariant local ternary pattern)特征作为纹理特征,在GPU空间中并行提取;同时,分别提取6个区域的HOG(histogram of oriented gradient)特征值,结合行人轮廓在6个区域上的梯度方向分布特性,对其进行加权。最后,将提取的全部特征输出到CPU(central processing unit),利用支持向量机(SVM)分类器实现行人检测。
结果
2
在INRIA、NICTA数据集上进行实验,INRIA数据集上检测率达到99.80%,NICTA数据集上检测率达到99.91%,并且INRIA数据集上检测时间加速比达到12.19,NICTA数据集上达到13.49,相对传统HOG、LBP算法,检测率、时间比实现提高。
结论
2
提出的改进HOG-SILTP特征与GPU加速的行人检测算法,能够有效表达行人信息,改善传统特征提取方式带来的耗时与形变影响,对环境变化、遮挡具有较强的鲁棒性。该算法在检测率、检测时间方面均有提高,能够实现有效、快速的行人检测,具有实际意义。
Objective
2
With the growing attention paid to public safety and security
video surveillance systems become increasingly important. Pedestrian detection is the first step in video surveillance systems when analyzing behaviors of pedestrians
so it plays a critical role in computer vision research area. Although pedestrian detection has realized several achievements in recent years
speed and accuracy continue to show rooms for improvement. On the one hand
pedestrian detection is time consuming as it requires extensive calculation and high dimension of features. On the other hand
the detection results are easily influenced by different environmental factors
such as the changes of illumination and background
different postures of pedestrians
and occlusion among pedestrians. As detection results depend on the performances of the feature set and classifier
the features should be adequately representative to distinguish a pedestrian from other objects and background. This study proposes an algorithm for pedestrian detection based on improved feature and graphic processing unit (GPU) acceleration to reduce detection time and improve detection rate.
Method
2
First
Canny operator is used to process the original images and obtain images with enhanced edge information. Second
the images are processed in three scales to reduce the interference of background and deformation effect of unified standardized. Third
the images are divided into six regions to address the occlusion problem among pedestrians. These regions are head
left arm
upper body
right arm
left leg
and right leg
which are divided according to the characteristics of pedestrian action. Thereafter
the scale invariant local ternary pattern (SILTP) feature is used instead of the local binary pattern (LBP) feature to improve the pixel of low resolution and varied illuminations of images. The SILTP feature in parallel is extracted as the texture feature in GPU space to reduce the time of calculation. At the same time
the gradient and amplitude information of the six regions is calculated in the GPU space
and the value of the gradient is weighed with the value of the distribution characteristics. Therefore
the improved histogram of oriented gradient (HOG) features with 180 dimensions are obtained. The dimensions are much lower compared with traditional HOG feature and the calculation time is reduced. Finally
the features extracted in the three scales is concatenated
including HOG and SILTP feature. All features are outputted to central processing unit (CPU) space from GPU space
achieving pedestrian detection by the linear support vector machine (SVM) classifier.
Result
2
The proposed algorithm is demonstrated on two datasets
namely
INRIA and NICTA. The INRIA dataset is presently the most widely used static pedestrian dataset. The backgrounds
pedestrians' postures
and occlusions among pedestrians in this dataset are complex. By contrast
the NICTA dataset contains a large number of pedestrian images in different sizes. Therefore
the detection results of INRIA and NICTA datasets are representative. The INRIA dataset contains 2 416 positive training and 1 218 negative training samples and 1 126 positive detecting and 453 negative detecting samples. The NICTA dataset contains 142 598 positive training and 90 605 negative training samples and 34 416 positive detecting and 42800 negative detecting samples. The proposed method achieves a detection rate of 99.80% and 99.91% on INRIA dataset and NICTA dataset
respectively. On the INRIA dataset
the acceleration ratio is 12.19 compared with the algorithm based on traditional HOG and LBP
and the acceleration ratio of feature extraction is more than 8.19. On the NICTA dataset
the acceleration ratio is 13.49 compared with the algorithm based on traditional HOG and LBP. Therefore
the proposed algorithm based on improved feature and GPU acceleration enhances the detection rate and reduces the detection time.
Conclusion
2
Experiment results show that the proposed algorithm based on improved feature and GPU acceleration perform better than other algorithms in terms of accuracy and speed. The improved feature exhibits strong robustness to the changes of illumination and environment and performs well despite the occlusion among pedestrians. Apart from its lower dimension which improves the speed of pedestrian detection
the algorithm provides accurate information on pedestrians. The proposed algorithm is suitable for most situations involving pedestrian detection
especially for images or videos in an environment with different illuminations and occlusions. Its speed also performs well
especially with large amount of calculation and high repeatability. The proposed algorithm based on improved feature and GPU acceleration can achieve effective and fast pedestrian detection and has practical value.
Luo R C, Chen O. Wireless and pyroelectric sensory fusion system for indoor human/robot localization and monitoring[J]. IEEE/ASME Transactions on mechatronics, 2013, 18(3):845-853.[DOI:10.1109/TMECH.2012.2188300]
Geronimo D, López A M, Sappa A D, et al. Survey of pedestrian detection for advanced driver assistance systems[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2010, 32(7):1239-1258.[DOI:10.1109/TPAMI.2009.122]
Marín J, Vázquez D, López A M, et al. Random forests of local experts for pedestrian detection[C ] //Proceedings of 2013 IEEE International Conference on Computer Vision. Sydney, Australia: IEEE, 2013: 2592-2599. [ DOI: 10.1109/ICCV.2013.322 http://dx.doi.org/10.1109/ICCV.2013.322 ]
Dalal N, Triggs B. Histograms of oriented gradients for human detection[C ] //Proceedings of 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. San Diego, CA, USA: IEEE, 2005, 1: 886-893. [ DOI: 10.1109/CVPR.2005.177 http://dx.doi.org/10.1109/CVPR.2005.177 ]
Hu Y X, Zeng Z H, Yin L J, et al. Multi-view facial expression recognition[C ] //Proceedings of the 8th IEEE International Conference on Automatic Face & Gesture Recognition. Amsterdam, Netherlands: IEEE, 2008, 1-6. [ DOI: 10.1109/AFGR.2008.4813445 http://dx.doi.org/10.1109/AFGR.2008.4813445 ]
Mu Y D, Yan S C, Liu Y, et al. Discriminative local binary patterns for human detection in personal album[C ] //Proceedings of 2008 IEEE Conference on Computer Vision and Pattern Recognition. Anchorage, Alaska, USA: IEEE, 2008: 1-8. [ DOI: 10.1109/CVPR.2008.4587800 http://dx.doi.org/10.1109/CVPR.2008.4587800 ]
Liu Y C, Huang S S, Lu C H, et al. Thermal pedestrian d etection using block LBP with multi-level classifier[C ] //Proceedings of 2017 International Conference on Applied System Innovation. Sapporo, Japan: IEEE, 2017: 602-605. [ DOI: 10.1109/ICASI.2017.7988495 http://dx.doi.org/10.1109/ICASI.2017.7988495 ]
Liao S C, Zhao G Y, Kellokumpu V, et al. Modeling pixel process with scale invariant local patterns for background subtraction in complex scenes[C ] //Proceedings of 2010 IEEE Conference on Computer Vision and Pattern Recognition. San Francisco, USA: IEEE, 2010: 1301-1306. [ DOI: 10.1109/CVPR.2010.5539817 http://dx.doi.org/10.1109/CVPR.2010.5539817 ]
Cosmo D L,Salles E O T, Ciarelli P M. Pedestrian detection utilizing gradient orientation histograms and color self similarities descriptors[J]. IEEE Latin America Transactions, 2015, 13(7):2416-2422.[DOI:10.1109/TLA.2015.7273807]
Liu Y P, Li Y B, Ma X, et al. Facial expression recognition with fusion features extracted from salient facial areas[J]. Sensors, 2017, 17(4):712.[DOI:10.3390/s17040712]
Wang X Y, Han T X, Yan S C. An HOG-LBP human detector with partial occlusion handling[C ] //Proceedings of the 12th IEEE International Conference on Computer Vision. Kyoto, Japan: IEEE, 2009: 32-39. [ DOI: 10.1109/ICCV.2009.5459207 http://dx.doi.org/10.1109/ICCV.2009.5459207 ]
Zheng C H, Pei W J, Yan Q, et al. Pedestrian detection based on gradient and texture feature integration[J]. Neurocomputing, 2017, 228:71-78.[DOI:10.1016/j.neucom.2016.09.085]
Zeng C B, Ma H D. Robust head-shoulder detection by PCA-based multilevel HOG-LBP detector for people counting[C ] //Proceedings of 20th International Conference on Pattern Recognition. Istanbul, Turkey: IEEE, 2010: 2069-2072. [ DOI: 10.1109/ICPR.2010.509 http://dx.doi.org/10.1109/ICPR.2010.509 ]
Wei H H, Cao G, JinT, et al. Improved pedestrian detection based on modified star-cascade DPM model[J]. Journal of Image and Graphics, 2017, 22(2):170-178.
韦皓瀚, 曹国, 金挺, 等.改进星型级联可形变部件模型的行人检测[J].中国图象图形学报, 2017, 22(2):170-178. [DOI:10.11834/jig.20170204]
Qian Y Q, Yang M, Wang C X, et al. Self-adapting part-based pedestrian detection using a fish-eye camera[C ] //2017 IEEE Conference on Intelligent Vehicles Symposium. Los Angeles, CA, USA: IEEE, 2017: 33-38. [ DOI: 10.1109/IVS.2017.7995695 http://dx.doi.org/10.1109/IVS.2017.7995695 ]
Campmany V, Silva S, Espinosa A, et al. GPU-based pedestrian detection for autonomous driving[J]. Procedia Computer Science, 2016, 80:2377-2381.[DOI:10.1016/j.procs.2016.05.455]
Lai C Q, Teoh S S. An efficient method of hog feature extraction using selective histogram bin and PCA feature reduction[J]. Advances in Electrical and Computer Engineering, 2016, 16(4):101-108.[DOI:10.4316/AECE.2016.04016]
Benenson R, Mathias M, Timofte R, et al. Pedestrian detection at 100 frames per second[C ] //Proceedings of 2012 IEEE Conference on Computer Vision and Pattern Recognition. Providence, RI, USA: IEEE, 2012: 2903-2910. [ DOI: 10.1109/CVPR.2012.6248017 http://dx.doi.org/10.1109/CVPR.2012.6248017 ]
Wu B, Nevatia R. Detection of multiple, partially occluded humans in a single image by Bayesian combination of edgelet part detectors[C ] //Proceedings of the Tenth IEEE International Conference on Computer Vision. Beijing, China: IEEE, 2005: 90-97. [ DOI: 10.1109/ICCV.2005.74 http://dx.doi.org/10.1109/ICCV.2005.74 ]
Gan L, Zou K Z, Liu X. Pedestrian detection based on PCA dimension reduction of multi-feature cascade[J]. Computer Science, 2016, 43(6):308-311.
甘玲, 邹宽中, 刘肖.基于PCA降维的多特征级联的行人检测[J].计算机科学, 2016, 43(6):308-311. [DOI:10.11896/j.issn.1002-137X.2016.6.061]
相关作者
相关机构
京公网安备11010802024621