Newly low-resolution pedestrian re-identification-relevant dataset and its benched method

Yang Lulu; Lan Long; Sun Dongting; Teng Xiao; Ben Xianye; Shen Xiaobo

doi:10.11834/jig.221082


Vol. 28, Issue 5, Pages: 1346-1359 (2023)
Published: 16 May 2023

Yang Lulu, Lan Long, Sun Dongting, Teng Xiao, Ben Xianye, Shen Xiaobo. 2023. Newly low-resolution pedestrian re-identification-relevant dataset and its benched method. Journal of Image and Graphics, 28(05): 1346-1359. DOI: 10.11834/jig.221082.

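Summary

Objective

Pedestrian re-identification aims to query and identify pedestrians across multiple non-overlapping cameras. In many practical applications, surveillance cameras capture low-resolution pedestrian images, yet few existing re-identification methods address the matching of low-resolution pedestrians against one another in real scenes. To study this problem, we collect and annotate a new pedestrian re-identification dataset based on gun-ball cameras, and design a low-resolution re-identification model on it to improve low-resolution pedestrian matching.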

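Method

The dataset is collected and cropped from gun cameras and ball cameras deployed at three different locations, yielding a re-identification dataset of 200 pedestrians with identity labels and 320 pedestrians without identity labels. Unlike comparable datasets, it provides both high-resolution and low-resolution images for every pedestrian. To address the difficulty of matching pedestrians at low resolution, the proposed baseline model considers three factors: image super-resolution, pedestrian feature learning, and discrimination. It accordingly comprises a super-resolution module, a feature learning module, and a feature discriminator module, which perform low-resolution image super-resolution, pedestrian feature learning, and pedestrian feature discrimination, respectively.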

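Result

Experiments on the gun-ball pedestrian re-identification dataset show that, compared with a classical pedestrian re-identification model, the new baseline improves mean average precision (mAP) and Rank-1 by 3.1% and 6.1%, respectively.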

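Conclusion

We construct a representative low-resolution pedestrian re-identification dataset, which provides an important data source for studying low-resolution re-identification, and use it to study baseline methods for re-identification at low resolution. The study shows that the proposed baseline method effectively addresses low-resolution pedestrian matching.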

Abstract

Objective

Pedestrian re-identification (re-id) addresses the problem of querying and identifying the same pedestrian across multiple non-overlapping camera views. Real-world applications are challenged by camera-related factors such as 1) hardware, 2) shooting distance, 3) angle of view, 4) background clutter, and 5) occlusion. In addition, the images captured by current surveillance cameras are often of low resolution (LR). In real scenes, re-id methods therefore need to match LR pedestrian images against one another. Conventional re-id methods mainly address the cross-resolution problem, that is, the mismatch between high-resolution (HR) and LR images, whereas matching in which both gallery and query images are low-resolution has received much less attention. To improve low-resolution pedestrian matching, we build a new pedestrian re-identification benchmark dataset based on gun-ball cameras and design a low-resolution pedestrian re-identification baseline model on it.

Method

The data are collected with gun-ball camera systems deployed at three intersections. Two cameras are placed at each intersection: a gun camera with a fixed direction and focal length captures LR images, while a ball camera, whose focal length and viewing direction can be adjusted according to the position of the target pedestrian, captures HR images. The resulting pedestrian re-identification dataset contains 200 pedestrians with identity labels (the same pedestrian is captured and identified at different locations) and 320 pedestrians without identity labels (captured under a single camera only). Every pedestrian has both HR and LR images. A labelled pedestrian is captured by at least 2 gun-ball cameras at different places, whereas an unlabelled pedestrian is captured by only one gun-ball camera and still needs to be searched for and matched across cameras; unlabelled pedestrians also have both LR and HR images. The dataset has three advantages: 1) A richer and more diverse pedestrian collection: the gun-ball cameras acquire varied pedestrian images from several intersections. 2) Pedestrian dynamics: because the images are captured and cropped from video streams, each one carries temporal information and can also be used for video-based pedestrian re-identification. 3) Further potential: the unlabelled images can be used to study semi-supervised or unsupervised re-identification algorithms, as well as identification systems that, given a pedestrian of unknown identity, automatically find similar pedestrians in the surveillance footage or database.

To improve the matching of LR pedestrian images, our baseline considers three key factors: 1) image super-resolution, 2) pedestrian feature learning, and 3) discrimination. Accordingly, it comprises a super-resolution module, a pedestrian re-identification module, and a pedestrian feature discriminator. The baseline model consists of five components: a generator, an image discriminator, a gradient discriminator, a pedestrian feature extractor, and a pedestrian feature discriminator. By optimizing both the resolution of pedestrian images and the discriminability of pedestrian features, the proposed model addresses low-resolution pedestrian re-identification in real scenes.
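The labelling rules of the dataset described above can be sketched as a small consistency check. This is only an illustration: the record layout, the field names (`person`, `labelled`, `location`, `resolution`, `frame`) and the function `violations` are hypothetical and are not taken from the released dataset.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, List, Set


@dataclass(frozen=True)
class Crop:
    """One cropped pedestrian image (all field names are hypothetical)."""
    person: str      # dataset-internal key for one pedestrian
    labelled: bool   # True if the pedestrian has a cross-camera identity label
    location: int    # index of the intersection (0, 1 or 2)
    resolution: str  # "LR" or "HR"
    frame: int       # frame index in the source video stream


def violations(crops: List[Crop]) -> List[str]:
    """List every pedestrian that breaks the rules stated in the abstract."""
    locations: Dict[str, Set[int]] = defaultdict(set)
    resolutions: Dict[str, Set[str]] = defaultdict(set)
    labelled: Dict[str, bool] = {}
    for crop in crops:
        locations[crop.person].add(crop.location)
        resolutions[crop.person].add(crop.resolution)
        labelled[crop.person] = crop.labelled

    problems: List[str] = []
    for person in sorted(locations):
        # Every pedestrian should have both LR and HR images.
        if resolutions[person] != {"LR", "HR"}:
            problems.append(f"{person}: needs both LR and HR images")
        seen_at = len(locations[person])
        # Labelled pedestrians are captured at two or more places.
        if labelled[person] and seen_at < 2:
            problems.append(f"{person}: labelled but seen at {seen_at} location(s)")
        # Unlabelled pedestrians are captured at exactly one place.
        if not labelled[person] and seen_at != 1:
            problems.append(f"{person}: unlabelled but seen at {seen_at} location(s)")
    return problems
```

Keeping the frame index in each record reflects the point that the crops come from video streams, so the same records could also be grouped into tracklets for video-based re-identification.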

Result

The low-resolution pedestrian re-identification baseline model is validated experimentally on the gun-ball pedestrian re-identification dataset. Compared with a classical pedestrian re-identification model, it improves mean average precision (mAP) and Rank-1 by 3.1% and 6.1%, respectively.
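For readers unfamiliar with the two metrics, the following minimal sketch shows how Rank-1 and mAP are commonly computed from a query-by-gallery distance matrix. It assumes the standard definitions and omits the same-camera filtering used by many re-id protocols; the abstract does not state the exact evaluation protocol, and the function name `rank1_and_map` is hypothetical.

```python
from typing import List, Sequence, Tuple


def rank1_and_map(
    dist: Sequence[Sequence[float]],
    query_ids: Sequence[int],
    gallery_ids: Sequence[int],
) -> Tuple[float, float]:
    """Return (Rank-1, mAP) for a query-by-gallery distance matrix."""
    rank1_hits = 0
    average_precisions: List[float] = []
    for q, row in enumerate(dist):
        # Gallery indices ordered from smallest to largest distance.
        order = sorted(range(len(row)), key=lambda g: row[g])
        matches = [gallery_ids[g] == query_ids[q] for g in order]
        if not any(matches):
            continue  # a query without any true match cannot be scored
        rank1_hits += int(matches[0])
        hits = 0
        precision_sum = 0.0
        for rank, is_match in enumerate(matches, start=1):
            if is_match:
                hits += 1
                precision_sum += hits / rank  # precision at this correct hit
        average_precisions.append(precision_sum / hits)
    if not average_precisions:
        raise ValueError("no query has a matching gallery identity")
    scored = len(average_precisions)
    return rank1_hits / scored, sum(average_precisions) / scored


# Two queries against a gallery of three images (toy numbers, not paper data).
rank1, mean_ap = rank1_and_map(
    dist=[[0.5, 0.1, 0.3], [0.8, 0.2, 0.3]],
    query_ids=[1, 2],
    gallery_ids=[1, 2, 1],
)
```

In this toy case the first query ranks a wrong identity first, so Rank-1 is 0.5, while mAP also credits the correct matches found at ranks 2 and 3.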

Conclusion

The proposed dataset and baseline model facilitate low-resolution pedestrian re-identification in natural scenes and, to a certain extent, alleviate the low quality of generated super-resolved images caused by pixel misalignment. Both are relevant to research on pedestrian re-identification and image super-resolution. The dataset provides a data source for the field of low-resolution pedestrian re-identification, and the proposed baseline model can effectively address the low-resolution pedestrian matching problem.

Keywords

pedestrian re-identification; benchmark dataset; low-resolution (LR); super-resolution (SR); discriminator
