低分辨率行人重识别数据集及其基准方法

杨露露; 蓝龙; 孙冬婷; 滕霄; 贲晛烨; 沈肖波

发布时间： 2023-05-16
摘要点击次数： 1123
全文下载次数： 1033
DOI: 10.11834/jig.221082
2023 | Volume 28 | Number 5

低分辨率行人重识别数据集及其基准方法

杨露露^1,2, 蓝龙^1,2, 孙冬婷^1,2, 滕霄^1,2, 贲晛烨³, 沈肖波⁴(1.国防科技大学计算机学院, 长沙 410073;2.国防科技大学量子信息研究所兼高性能计算国家重点实验室, 长沙 410073;3.山东大学信息科学与工程学院, 青岛 266237;4.南京理工大学计算机科学与工程学院, 南京 210094)

摘要

目的行人重识别旨在解决多个非重叠摄像头下行人的查询和识别问题。在很多实际的应用场景中,监控摄像头获取的是低分辨率行人图像,而现有的许多行人重识别方法很少关注真实场景中低分辨率行人相互匹配的问题。为研究该问题,本文收集并标注了一个新的基于枪球摄像头的行人重识别数据集,并基于此设计了一种低分辨率行人重识别模型来提升低分辨率行人匹配性能。方法该数据集由部署在3个不同位置的枪机摄像头和球机摄像头收集裁剪得到,最终形成包含200个有身份标签的行人和320个无身份标签的行人重识别数据集。与同类其他数据集不同,该数据集为每个行人同时提供高分辨率和低分辨率图像。针对低分辨率下的行人匹配难题,本文提出的基准模型考虑了图像超分、行人特征学习以及判别3个方面因素,并设计了相应的超分模块、特征学习模块和特征判别器模块,分别完成低分辨率图像超分、行人特征学习以及行人特征判断。结果提出的基准模型在枪球行人重识别数据集上的实验表明,对比于经典的行人重识别模型,新基准模型在平均精度均值(mean average precision,mAP)和Rank-1指标上分别提高了3.1%和6.1%。结论本文构建了典型的低分辨率行人重识别数据集,为研究低分辨率行人重识别问题提供了重要的数据来源,并基于该数据集研究了低分辨率下行人重识别基础方法。研究表明,提出的基准方法能够有效地解决低分辨行人匹配问题。

关键词

行人重识别基准数据集低分辨率（LR）超分辨率（SR）判别器

Newly low-resolution pedestrian re-identification-relevant dataset and its benched method

Yang Lulu^1,2, Lan Long^1,2, Sun Dongting^1,2, Teng Xiao^1,2, Ben Xianye³, Shen Xiaobo⁴(1.College of Computer Science, National University of Defense Technology, Changsha 410073, China;2.Institute for Quantum Information & State Key Laboratory of High Performance Computing, National University of Defense Technology, Changsha 410073, China;3.College of Information Science and Engineering, Shandong University, Qingdao 266237, China;4.College of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China)

Abstract

Objective Pedestrian re-identification is focused on multiview non-overlapping-derived problem of querying and identifying the same identity pedestrian. However，such real-world application scenarios are challenged to some camera relevant factors like 1）hardware，2）shooting distance，3）angle of view，4）background clutters，and 5）occlusions. Current surveillance camera-based images can be captured and it is still challenged for its low resolution（LR）as well. In real scenes，pedestrian re-identification（re-id）methods are required to resolve multiple pedestrians-oriented heterogeneous problem for low resolution image further. To deal with the mismatch problem between high resolution（HR）images and LR images，conventional re-id methods are mainly concerned of the cross-resolution matching problem. It is essential to richer mutual-benefited ability between low-resolution gallery images and query images. To improve the low-resolution pedestrian matching performance，we develop a novel of gun-ball camera-based pedestrian re-identification benchmark dataset and a low-resolution pedestrian re-identification benchmark model is designed as well. Method This data collection is acquired by the gun and ball system，which is deployed at three intersections. To capture LR images，two of three cameras are placed at each intersection，and the gun camera has a fixed direction and focal length. To obtain high-resolution images more，the other ball camera can be used to tune the focal length and the direction of view according to the target pedestrian position. And，a pedestrian re-identification dataset can be built and sampled by 200 pedestrians-identified categories（the same pedestrian is captured and identified at different locations），and a sample of 320 pedestrian-unidentified categories（pedestrians can be captured under a certain camera only）. Each of these pedestrians-related images are all in related to the two aspects of high resolution and low resolution. A pedestrian-identified image can be captured by at least 2 different gun-ball cameras from different places，and a pedestrian-unidentified image can be captured by one gun-ball camera only and it is required to be searched and matched across cameras further. Pedestrian-unidentified images are in relevance to both of LR & HR as well. Some optimal factors are illustrated as mentioned below：1）a richer and more diverse pedestrian dataset：the gun-ball camera-based pedestrian re-identification dataset can be used to acquire various pedestrian images from intersections. 2）The pedestrian dynamics：each pedestrian image has its temporal information because the gun-ball camera-based pedestrian dataset is captured and cropped from the video stream. Such temporal-based pedestrian images can be used for video-related pedestrian re-identification as well. 3）Other potentials：some pedestrians-unidentified images can be focused on，which can be used to study pedestrian re-identification algorithms in semi-supervised or unsupervised domains，as well as the ground truth of identification systems. That is，given an unknown identity of a pedestrian，its identification system can automatically detect the similar one in the surveillance screen or database. To strengthen the matching problem of LR images of pedestrians，we consider 1）image super-resolution，2）pedestrian-related feature learning，and 3）discrimination as three key factors in our baseline. Specifically，to optimize each aspect of resolution，pedestrian feature-related learning and discrimination，the baseline is involved of a super-resolution module，a pedestrian re-identification module，and a pedestrian feature discriminator. Therefore，we design a baseline pedestrian re-identification model and it is benched on the 5 following aspects：generator G，image discriminator D_s，gradient discriminator D_g，pedestrian feature extractor F，and pedestrian feature discriminator D_f. To resolve the problem of low-resolution pedestrian re-identification in real scenes，our proposed model can be used to optimize both of the resolution of pedestrian image and pedestrian discrimination features. Result The low-resolution pedestrian re-identification baseline model is demonstrated and experimentally validated on the gun-ball pedestrian re-identification dataset. The mean average precision（mAP）and Rank-1 metrics are improved by 3. 1% and 6. 1%. Conclusion LR-related pedestrian recognition in natural scenes can be facilitated，and its pixel misalignment-derived problem of low quality of generated super-resolved images can be resolved to a certain extent. The dataset and benchmark model are proposed and its potential is in related to pedestrian re-identification and image superresolution. It provides a data source for the field of low-resolution pedestrian re-identification. Also，the proposed baseline model can be predicted to tackle the low-resolution pedestrian matching problem further.

Keywords

pedestrian re-identification benchmark dataset low-resolution （LR） super-resolution （SR） discriminator