融合多层特征的多尺度行人检测
Multi-scale pedestrian detection algorithm with multi-layer features
- 2019年24卷第10期 页码:1683-1691
收稿:2019-01-14,
修回:2019-5-3,
录用:2019-5-10,
纸质出版:2019-10-16
DOI: 10.11834/jig.190009
移动端阅览

浏览全部资源
扫码关注微信
收稿:2019-01-14,
修回:2019-5-3,
录用:2019-5-10,
纸质出版:2019-10-16
移动端阅览
目的
2
行人检测在自动驾驶、视频监控领域中有着广泛应用,是一个热门的研究话题。针对当前基于深度学习的行人检测算法在分辨率较低、行人尺度较小的情况下存在误检和漏检问题,提出一种融合多层特征的多尺度的行人检测算法。
方法
2
首先,针对行人检测问题,删除了深度残差网络的一部分,仅采用深度残差网络的3个区域提取特征图,然后采用最邻近上采样法将最后一层提取的特征图放大两倍后再用相加法,将高层语义信息丰富的特征和低层细节信息丰富的特征进行融合;最后将融合后的3层特征分别输入区域候选网络中,经过softmax分类,得到带有行人的候选框,从而实现行人检测的目的。
结果
2
实验结果表明,在Caltech行人检测数据集上,在每幅图像虚警率(FPPI)为10%的条件下,本文算法丢失率仅为57.88%,比最好的模型之一——多尺度卷积神经网络模型(MS-CNN)丢失率(60.95%)降低3.07%。
结论
2
深层的特征具有高语义信息且感受野较大的特点,而浅层的特征具有位置信息且感受野较小的特点,融合两者特征可以达到增强深层特征的效果,让深层的特征具有较为丰富的目标位置信息。融合后的多层特征图具有不同程度的细节和语义信息,对检测不同尺度的行人有较好的效果。所以利用融合后的特征进行行人检测,能够提高行人检测性能。
Objective
2
Humans fully understand a picture
often classify different images
and understand all the information in each image
including the location and concept of the object. This task is called object detection and is one of the basic research areas in computer vision. Object detection consists of different subtasks
such as pedestrian detection and skeleton detection. Pedestrian detection is a key link in object detection and one of the difficult tasks. This study mainly investigates pedestrian detection in traffic scenes
which is one of the most valuable topics in the field of pedestrian detection. Pedestrian detection in traffic scenes has always been a key technology for intelligent video surveillance technology
unmanned technology
intelligent transportation
and other issues. In recent years
this topic has been the research focus in academic and industrial circles. With the upsurge of artificial intelligence technology development
a large number of computer vision technologies are widely used. Multi-scale pedestrian detection has great research value because the development and application of pedestrian detection has complex real scenes and different pedestrian scales. Pedestrian detection is widely used in the field of automatic driving and video surveillance and is a hot research topic. Current pedestrian detection algorithms based on deep learning have false detection and miss detection problems in the case of low resolution and small pedestrian scale. A multi-scale pedestrian detection algorithm based on multi-layer features is proposed. The proposed convolutional neural network exhibits improved accuracy of pedestrian detection by a level
and there has been no small progress in practical applications. The academic enthusiasm brought about by deep learning has enabled scholars to make great progress and breakthroughs in pedestrian detection in complex scenes. Deep learning in the future will be a major boost for pedestrian detection.
Method
2
The deep residual network is mainly used in the multi-objective classification field. After analyzing the network
only the feature maps of the three stages are used and the residual unit and the full connection layer of the last stage are deleted. The deep residual network is mainly used to extract the feature maps of the three stages. The feature map extracted by the last layer is doubled using the characteristics of the three feature maps and then added by the nearest neighbor sampling method. The features with rich high-level semantic information and the features with rich low-level detail information are combined to improve the detection effect. The merged three-layer features are encoded into the region proposal network
and the proposal frames with pedestrians are obtained through Softmax classification for pedestrian detection. In this work
four experiments are designed
three of which are used to verify the validity of the proposed method. Results are compared with the mainstream algorithm results. Comparative experiments indicate that simple stratification does not improve the effect and the effect of multi-layer fusion is unsatisfactory. Therefore
the method of adjacent layer fusion is selected
and the result of multi-scale pedestrian detection is directly compared with that of the deepest network. The effect of adjacent layer fusion is better than the result. All experimental results are compared
and the fusion results of the adjacent layers are the best. The rate of missed detection is lower than that of the mainstream algorithm. The network is fully convolved and consists end-to-end training through random downsampling and backpropagation. Each image contains a number of candidate boxes for positive and negative samples. However
directly taking the optimized sample will easily lead to loss bias to the negative sample because the number of negative samples is larger than that of positive samples. This study takes an image to select 256 anchors and calculates its loss. The ratio of the positive and negative samples is 1:1. This article will randomly initialize all new layers in the network
and the standard of initialization is from the zero mean standard deviation. The value set is 0.01
and the weight is taken from Gaussian distribution. The other layers are initialized by classifying the pre-trained model
and the entire training process iterates through two epochs.
Result
2
On the Caltech pedestrian detection dataset and under the condition that each image false alarm rate (FPPI) is 10%
the loss rate of the proposed algorithm is only 57.88%
which is decreased by 3.07% compared with the loss of one of the best models
namely
MS-CNN (multi-scale convolutional neural network) (60.95%). This work also adopts comparative experiment. The overall loss rate of Ped-RPN is 64.55%
which is worse than that of the proposed algorithm. The loss rate of the layered and then detected method (Ped-muti-RPN) is 77.15%
which is better than that of Ped-RPN method. Ped-fused-RPN is a detection algorithm that combines multiple layers. The result is 61.32%
and the effect is better than the proposed algorithm.
Conclusion
2
Small-scale pedestrians have the disadvantage of blurred images
which make the detection effect extremely poor and affect the overall multi-scale detection. In order to solve the problem of the sharp decline of small-scale pedestrian detection
this paper proposes a method of integrating deep semantic information and shallow detail features so the features of all scales have rich semantic information. The deep features have high semantic information
and the receptive field is small. The shallow features have positional information
and the receptive field is more fused. The two features can enhance the deep features
which have rich target position information. The merged feature map has different levels of detail and semantic information and has a good effect on detecting pedestrians of different scales.
Su S Z, Li S Z, Chen S Y, et al. A survey on pedestrian detection[J]. Acta Electronica Sinica, 2012, 40(4):814-820.
苏松志, 李绍滋, 陈淑媛, 等.行人检测技术综述[J].电子学报, 2012, 40(4):814-820. [DOI:10.3969/j.issn.0372-2112.2012.04.031]
Dollár P, Wojek C, Schiele B, et al. Pedestrian detection:an evaluation of the state of the art[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2012, 34(4):743-761.[DOI:10.1109/TPAMI.2011.155]
Benenson R, Omran M, Hosang J, et al. Ten years of pedestrian detection, what have we learned?[C]//Proceedings of European Conference on Computer Vision. Zurich, Switzerland: Springer, 2014: 613-627.[ DOI: 10.1007/978-3-319-16181-5_47 http://dx.doi.org/10.1007/978-3-319-16181-5_47 ]
Dalal N, Triggs B. Histograms of oriented gradients for human detection[C]//Proceedings of 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. San Diego, USA: IEEE, 2005: 886-893.[ DOI: 10.1109/CVPR.2005.177 http://dx.doi.org/10.1109/CVPR.2005.177 ]
Tian X X, Bao H, Xu C. Improved HOG algorithm of pedestrian detection[J]. Computer Science, 2014, 41(9):320-324.
田仙仙, 鲍泓, 徐成.一种改进HOG特征的行人检测算法[J].计算机科学, 2014, 41(9):320-324. [DOI:10.11896/j.issn.1002-137X.2014.09.062]
Che Z F, Miao Z J, Wang M S. Investigation and application of pedestrian detection in metro video monitoring system[J]. Modern Urban Transit, 2010, (2):31-33, 36.
车志富, 苗振江, 王梦思.地铁视频监控系统中的行人检测研究与应用[J].现代城市轨道交通, 2010, (2):31-33, 36. [DOI:10.3969/j.issn.1672-7533.2010.02.011]
Cao J L, Pang Y W, Li X L. Pedestrian detection inspired by appearance constancy and shape symmetry[J]. IEEE Transactions on Image Processing, 2016, 25(12):5538-5551.[DOI:10.1109/TIP.2016.2609807]
Dollár P, Belongie S, Perona P. The fastest pedestrian detector in the west[C]. The British Machine Vision Conference, 2010.
Nam W, Dollár P, Han J H. Local decorrelation for improved pedestrian detection[C]//Proceedings of the 27th International Conference on Neural Information Processing Systems. Montreal, Canada: MIT Press, 2014: 424-432.
Zeng J X, Cheng X. Pedestrian detection Co mbined with single and couple pedestrian DPM models in traffic scene[J]. Acta Electronica Sinica, 2016, 44(11):2668-2675.
曾接贤, 程潇.结合单双行人DPM模型的交通场景行人检测[J].电子学报, 2016, 44(11):2668-2675. [DOI:10.3969/j.issn.0372-2112.2016.11.015]
Wu Z. Occluded pedestrian detection that combines multiple classifiers in street scene[D]. Nanchang: Nanchang Hangkong University, 2016. http://xuewen.cnki.net/CMFD-1017711737.nh.html .
吴喆.多分类器级联的街道场景遮挡行人检测[D].南昌: 南昌航空大学, 2016.]
Krizhevsky A, Sutskever I, Hinton G E. ImageNet classification with deep convolutional neural networks[C]//Proceedings of the 25th International Conference on Neural Information Processing Systems. Lake Tahoe, Nevada, USA: Curran Associates Inc., 2012: 1097-1105.
Ma Z G, Gao P P. Research on the cascade pedestrian detection model based on LDCF and CNN[C]//Proceedings of 2018 IEEE International Conference on Big Data and Smart Computing. Shanghai, China: IEEE, 2018: 314-320.[ DOI: 10.1109/BigComp.2018.00053 http://dx.doi.org/10.1109/BigComp.2018.00053 ]
Wen Z, Du Y H, Yoshida T, et al. DRI-RCNN:an approach to deceptive review identification using recurrent convolutional neural network[J]. Information Processing&Management, 2018, 54(4):576-592.[DOI:10.1016/j.ipm.2018.03.007]
Zhao X T, Li W, Zhang Y F, et al. A faster RCNN-based pedestrian detection system[C]//Proceedings of 2016 IEEE 84th Vehicular Technology Conference. Montreal, QC, Canada: IEEE, 2016: 1-5.[ DOI: 10.1109/VTCFall.2016.7880852 http://dx.doi.org/10.1109/VTCFall.2016.7880852 ]
Luo J, Zeng J X, Leng L, et al. Pedestrian detection based on improved region proposal network[J]. Journal of Nanchang Hangkong University:Natural Sciences, 2018, 32(2):1-7, 43.
罗杰, 曾接贤, 冷璐, 等.基于改进的区域候选网络的行人检测[J].南昌航空大学学报:自然科学版, 2018, 32(2):1-7, 43. [DOI:10.3969/j.issn.1001-4926.2018.02.001]
Xiang X Z, Lv N, Guo X L, et al. Engineering vehicles detection based on modified faster R-CNN for power grid surveillance[J]. Sensors, 2018, 18(7):2258.[DOI:10.3390/s18072258]
Zhang X G, Chen G Y, Saruta K, et al. Deep convolutional neural networks for all-day pedestrian detection[M]//Kim K, Joukov N. Information Science and Applications 2017. Singapore: Springer, 2017: 171-178.[ DOI: 10.1007/978-981-10-4154-9_21 http://dx.doi.org/10.1007/978-981-10-4154-9_21 ]
He K, Zhang X, Ren S, et al. Deep Residual Learning for Image Recognition[C]//Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas, USA: IEEE 2016: 770-778.[ DOI: 10.1109/CVPR.2016.90 http://dx.doi.org/10.1109/CVPR.2016.90 ]
Ren S, He K, Girshick R, et al. Faster R-CNN:towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis&Machine Intelligence, 2015, 39(6):1137-1149.[DOI:10.1109/TPAMI.2016.2577031]
Wojek C, Dollar P, Schiele B. Pedestrian detection:an evaluation of the state of the art[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2012, 34(4):743-761.[DOI:10.1109/TPAMI.2011.155]
Li S H, Lin J Z, Li G Q, et al. Vehicle type detection based on deep learning in traffic scene[J]. Procedia Computer Science, 2018, 131:564-572.[DOI:10.1016/j.procs.2018.04.281]
相关作者
相关机构
京公网安备11010802024621