Multi-strategy Optimization for Cross-modal Pedestrian Re-identification Based on Deep Q-Network Reinforcement Learning
Abstract
Cross-modal pedestrian re-identification (C-ReID) is a crucial task in computer vision, aiming to match pedestrian identities across different modalities of data. This paper proposes a reinforcement learningbased framework, RLCMPRF, to tackle the challenges of modality variability, data diversity, annotation difficulties, and optimal strategy selection. RLCMPRF uses deep Q-network (DQN) reinforcement learning to dynamically select the best feature extraction and matching strategies, ensuring robustness against these challenges. We introduce a dual-stream network to process multimodal images, followed by a feature fusion layer for integration. The DQN-based strategy learning is complemented by a reward function designed to optimize matching accuracy, speed, and robustness. Experimental results demonstrate that RLCMPRF outperforms state-of-the-art methods based on deep learning, attention mechanisms, meta-learning, and generative adversarial networks. RLCMPRF achieves a success rate of 82% and an average cumulative reward of 150, showing improvements in convergence speed and generalization ability across multiple datasets.References
References
. Bouzid A, Sierra-Sosa D, Elmaghraby A. A Robust pedestrian re-identification and out-of-distribution detection framework. Drones. 2023; 7(6).
. Liu SL, Zhang SM, Diao ZJ, Fang ZB, Jiao ZY, Zhong ZY. Pedestrian re-identification based on attention mechanism and Multi- scale feature fusion. Mathematical Biosciences and Engineering. 2023; 20(9):16913-38.
. Zheng Y, Zhou Y, Zhao JQ, Jian M, Yao R, Liu B, Chen Y. A siamese pedestrian alignment network for person re-identification. Multimedia Tools and Applications. 2021; 80(25):33951-70.
. Organisciak D, Sakkos D, Ho ESL, Aslam N, Shum HPH. Unifying person and vehicle re-identification. IEEE Access. 2020; 8:115673-84.
. Wang YY, Li X, Jiang MX, Zhang HY, Tang E. Cross-view pedestrian clustering via graph convolution network for unsupervised person re-identification. Journal of Intelligent & Fuzzy Systems. 2020; 39(3):4453-62.
. Cheng KY, Tao F, Zhan YZ, Li MZ, Li KL. Hierarchical attributes learning for pedestrian re-identification via parallel stochastic gradient descent combined with momentum correction and adaptive learning rate. Neural Computing & Applications. 2020; 32(10):5695-712.
. Zhao YB, Zhu SH. Occluded pedestrian re-identification via Res-ViT double-branch hybrid network. Multimedia Systems. 2024; 30(1).
. Xue C, Deng ZL, Wang S, Hu EW, Zhang Y, Yang WW, Wang YM. GLSFF: Global-local specific feature fusion for cross-modality pedestrian re-identification. Computer Communications. 2024; 215:157-68.
. Ke X, Lin XR, Qin LY. Lightweight convolutional neural network-based pedestrian detection and re-identification in multiple scenarios. Machine Vision and Applications. 2021; 32(2).
. Zhang XQ, Wang XX, Gu CH. Online multi-object tracking with pedestrian re-identification and occlusion processing. Visual Computer. 2021; 37(5):1089 -99.
. Yang XF, Zhou ZH, Wang QS, Wang ZW, Li X, Li HF. Cross-domain unsupervised pedestrian re-identification based on multi-view decomposition. Multimedia Tools and Applications. 2022; 81(27):39387-408.
. Wei D, Hu XQ, Wang ZY, Shen JL, Ren HJ. Pose-Guided multi-scale structural relationship learning for video-based pedestrian re-identification. IEEE Access. 2021; 9:34845-58.
. Chen H, Zhao Y, Wang SG. Person re-identification based on contour information embedding. Sensors. 2023; 23(2).
. Chen MZY, Banitaan S, Maleki M. Pedestrian group re-identification and trajectory prediction through zone-based clustering. IEEE Access. 2024; 12: 101549-62.
. Liu YJ, Shao WB, Sun XR. Learn robust pedestrian representation within minimal modality discrepancy for visible-infrared person re-identification. Journal of Computer Science and Technology. 2022; 37(3):641-51.
. Manzoor S, An YC, In GG, Zhang YY, Kim S, Kuc TY. SPT: Single pedestrian tracking framework with re-identification-based learning using the Siamese model. Sensors. 2023; 23(10).
. Han H, Zhou MC, Shang XW, Cao W, Abusorrah A. KISS plus for rapid and accurate pedestrian re-identification. IEEE Transactions on Intelligent Transportation Systems. 2021; 22(1):394-403.
. Hong F, Lu CH, Tao W, Jiang WW. OMNet: Object-Perception multi-branch network for pedestrian re-identification. Big Data Research. 2022; 27.
. Yun X, Ge M, Sun YJ, Dong KW, Hou XF. Margin CosReid network for pedestrian re-identification. Applied Sciences-Basel. 2021; 11(4).
. Grigorev A, Tian ZH, Rho S, Xiong JX, Liu SH, Jiang F. Deep person re-identification in UAV images. Eurasip Journal on Advances in Signal Processing. 2019; (1).
. Cheng KY, Xu FJ, Tao F, Qi M, Li MZ. Data-driven pedestrian re-identification based on hierarchical semantic representation. Concurrency and Computation-Practice & Experience. 2018; 30(23).
. Yu ZY, Li LS, Xie JL, Wang CS, Li WJ, Ning X. Pedestrian 3D Shape understanding for person re-identification via multi-view learning. IEEE Transactions on Circuits and Systems for Video Technology. 2024; 34(7):5589-602.
. Shan YX, Yu G, Gao YH. Pedestrian re-identification based on gait analysis. IEEE Access. 2023; 11:106013-23.
. Zhao BT, Wang YJ, Su KK, Ren H, Han XY. Semi-supervised pedestrian re-identification via a teacher-student model with similarity-preserving generative adversarial networks. Applied Intelligence. 2023; 53(2):1605-18.
. Lei MW, Song YC, Zhao JD, Wang X, Lyu J, Xu JD, Yan WQ. End-to-End network for pedestrian detection, tracking and re-identification in real-time surveillance system. Sensors. 2022; 22(22).
. Sun ZJ, Wang X, Zhang YL, Song YC, Zhao JD, Xu JD, et al. A comprehensive review of pedestrian re-identification based on deep learning. Complex & Intelligent Systems. 2024; 10(2):1733-68.
. Bouzid A, Sierra-Sosa D, Elmaghraby A. Directional statistics-based deep metric learning for pedestrian tracking and re-identification. Drones. 2022; 6(11).
. Geng SZ, Yu M, Liu Y, Yu Y, Bai J. Re-ranking pedestrian re-identification with multiple Metrics. Multimedia Tools and Applications. 2019; 78(9):11631- 53.
. Deng X, Liao KY, Zheng YL, Lin GF, Lei H. A deep multi-feature distance metric learning method for pedestrian re-identification. Multimedia Tools and Applications. 2021; 80(15):23113-31.
. Yang XF, Wang QS, Li WK, Zhou ZH, Li HF. Unsupervised domain adaptation pedestrian re-identification based on an improved dissimilarity space. Image and Vision Computing. 2022; 118.
DOI:
https://doi.org/10.31449/inf.v49i11.7247Downloads
Published
How to Cite
Issue
Section
License
Authors retain copyright in their work. By submitting to and publishing with Informatica, authors grant the publisher (Slovene Society Informatika) the non-exclusive right to publish, reproduce, and distribute the article and to identify itself as the original publisher.
All articles are published under the Creative Commons Attribution license CC BY 3.0. Under this license, others may share and adapt the work for any purpose, provided appropriate credit is given and changes (if any) are indicated.
Authors may deposit and share the submitted version, accepted manuscript, and published version, provided the original publication in Informatica is properly cited.







