Multi-strategy Optimization for Cross-modal Pedestrian Re-identification Based on Deep Q-Network Reinforcement Learning
Abstract
Cross-modal pedestrian re-identification (C-ReID) is a crucial task in computer vision, aiming to match pedestrian identities across different modalities of data. This paper proposes a reinforcement learningbased framework, RLCMPRF, to tackle the challenges of modality variability, data diversity, annotation difficulties, and optimal strategy selection. RLCMPRF uses deep Q-network (DQN) reinforcement learning to dynamically select the best feature extraction and matching strategies, ensuring robustness against these challenges. We introduce a dual-stream network to process multimodal images, followed by a feature fusion layer for integration. The DQN-based strategy learning is complemented by a reward function designed to optimize matching accuracy, speed, and robustness. Experimental results demonstrate that RLCMPRF outperforms state-of-the-art methods based on deep learning, attention mechanisms, meta-learning, and generative adversarial networks. RLCMPRF achieves a success rate of 82% and an average cumulative reward of 150, showing improvements in convergence speed and generalization ability across multiple datasets.References
References
. Bouzid A, Sierra-Sosa D, Elmaghraby A. A Robust pedestrian re-identification and out-of-distribution detection framework. Drones. 2023; 7(6).
. Liu SL, Zhang SM, Diao ZJ, Fang ZB, Jiao ZY, Zhong ZY. Pedestrian re-identification based on attention mechanism and Multi- scale feature fusion. Mathematical Biosciences and Engineering. 2023; 20(9):16913-38.
. Zheng Y, Zhou Y, Zhao JQ, Jian M, Yao R, Liu B, Chen Y. A siamese pedestrian alignment network for person re-identification. Multimedia Tools and Applications. 2021; 80(25):33951-70.
. Organisciak D, Sakkos D, Ho ESL, Aslam N, Shum HPH. Unifying person and vehicle re-identification. IEEE Access. 2020; 8:115673-84.
. Wang YY, Li X, Jiang MX, Zhang HY, Tang E. Cross-view pedestrian clustering via graph convolution network for unsupervised person re-identification. Journal of Intelligent & Fuzzy Systems. 2020; 39(3):4453-62.
. Cheng KY, Tao F, Zhan YZ, Li MZ, Li KL. Hierarchical attributes learning for pedestrian re-identification via parallel stochastic gradient descent combined with momentum correction and adaptive learning rate. Neural Computing & Applications. 2020; 32(10):5695-712.
. Zhao YB, Zhu SH. Occluded pedestrian re-identification via Res-ViT double-branch hybrid network. Multimedia Systems. 2024; 30(1).
. Xue C, Deng ZL, Wang S, Hu EW, Zhang Y, Yang WW, Wang YM. GLSFF: Global-local specific feature fusion for cross-modality pedestrian re-identification. Computer Communications. 2024; 215:157-68.
. Ke X, Lin XR, Qin LY. Lightweight convolutional neural network-based pedestrian detection and re-identification in multiple scenarios. Machine Vision and Applications. 2021; 32(2).
. Zhang XQ, Wang XX, Gu CH. Online multi-object tracking with pedestrian re-identification and occlusion processing. Visual Computer. 2021; 37(5):1089 -99.
. Yang XF, Zhou ZH, Wang QS, Wang ZW, Li X, Li HF. Cross-domain unsupervised pedestrian re-identification based on multi-view decomposition. Multimedia Tools and Applications. 2022; 81(27):39387-408.
. Wei D, Hu XQ, Wang ZY, Shen JL, Ren HJ. Pose-Guided multi-scale structural relationship learning for video-based pedestrian re-identification. IEEE Access. 2021; 9:34845-58.
. Chen H, Zhao Y, Wang SG. Person re-identification based on contour information embedding. Sensors. 2023; 23(2).
. Chen MZY, Banitaan S, Maleki M. Pedestrian group re-identification and trajectory prediction through zone-based clustering. IEEE Access. 2024; 12: 101549-62.
. Liu YJ, Shao WB, Sun XR. Learn robust pedestrian representation within minimal modality discrepancy for visible-infrared person re-identification. Journal of Computer Science and Technology. 2022; 37(3):641-51.
. Manzoor S, An YC, In GG, Zhang YY, Kim S, Kuc TY. SPT: Single pedestrian tracking framework with re-identification-based learning using the Siamese model. Sensors. 2023; 23(10).
. Han H, Zhou MC, Shang XW, Cao W, Abusorrah A. KISS plus for rapid and accurate pedestrian re-identification. IEEE Transactions on Intelligent Transportation Systems. 2021; 22(1):394-403.
. Hong F, Lu CH, Tao W, Jiang WW. OMNet: Object-Perception multi-branch network for pedestrian re-identification. Big Data Research. 2022; 27.
. Yun X, Ge M, Sun YJ, Dong KW, Hou XF. Margin CosReid network for pedestrian re-identification. Applied Sciences-Basel. 2021; 11(4).
. Grigorev A, Tian ZH, Rho S, Xiong JX, Liu SH, Jiang F. Deep person re-identification in UAV images. Eurasip Journal on Advances in Signal Processing. 2019; (1).
. Cheng KY, Xu FJ, Tao F, Qi M, Li MZ. Data-driven pedestrian re-identification based on hierarchical semantic representation. Concurrency and Computation-Practice & Experience. 2018; 30(23).
. Yu ZY, Li LS, Xie JL, Wang CS, Li WJ, Ning X. Pedestrian 3D Shape understanding for person re-identification via multi-view learning. IEEE Transactions on Circuits and Systems for Video Technology. 2024; 34(7):5589-602.
. Shan YX, Yu G, Gao YH. Pedestrian re-identification based on gait analysis. IEEE Access. 2023; 11:106013-23.
. Zhao BT, Wang YJ, Su KK, Ren H, Han XY. Semi-supervised pedestrian re-identification via a teacher-student model with similarity-preserving generative adversarial networks. Applied Intelligence. 2023; 53(2):1605-18.
. Lei MW, Song YC, Zhao JD, Wang X, Lyu J, Xu JD, Yan WQ. End-to-End network for pedestrian detection, tracking and re-identification in real-time surveillance system. Sensors. 2022; 22(22).
. Sun ZJ, Wang X, Zhang YL, Song YC, Zhao JD, Xu JD, et al. A comprehensive review of pedestrian re-identification based on deep learning. Complex & Intelligent Systems. 2024; 10(2):1733-68.
. Bouzid A, Sierra-Sosa D, Elmaghraby A. Directional statistics-based deep metric learning for pedestrian tracking and re-identification. Drones. 2022; 6(11).
. Geng SZ, Yu M, Liu Y, Yu Y, Bai J. Re-ranking pedestrian re-identification with multiple Metrics. Multimedia Tools and Applications. 2019; 78(9):11631- 53.
. Deng X, Liao KY, Zheng YL, Lin GF, Lei H. A deep multi-feature distance metric learning method for pedestrian re-identification. Multimedia Tools and Applications. 2021; 80(15):23113-31.
. Yang XF, Wang QS, Li WK, Zhou ZH, Li HF. Unsupervised domain adaptation pedestrian re-identification based on an improved dissimilarity space. Image and Vision Computing. 2022; 118.
DOI:
https://doi.org/10.31449/inf.v49i11.7247Downloads
Published
How to Cite
Issue
Section
License
I assign to Informatica, An International Journal of Computing and Informatics ("Journal") the copyright in the manuscript identified above and any additional material (figures, tables, illustrations, software or other information intended for publication) submitted as part of or as a supplement to the manuscript ("Paper") in all forms and media throughout the world, in all languages, for the full term of copyright, effective when and if the article is accepted for publication. This transfer includes the right to reproduce and/or to distribute the Paper to other journals or digital libraries in electronic and online forms and systems.
I understand that I retain the rights to use the pre-prints, off-prints, accepted manuscript and published journal Paper for personal use, scholarly purposes and internal institutional use.
In certain cases, I can ask for retaining the publishing rights of the Paper. The Journal can permit or deny the request for publishing rights, to which I fully agree.
I declare that the submitted Paper is original, has been written by the stated authors and has not been published elsewhere nor is currently being considered for publication by any other journal and will not be submitted for such review while under review by this Journal. The Paper contains no material that violates proprietary rights of any other person or entity. I have obtained written permission from copyright owners for any excerpts from copyrighted works that are included and have credited the sources in my article. I have informed the co-author(s) of the terms of this publishing agreement.
Copyright © Slovenian Society Informatika







