
ReMamba: a Hybrid CNN-Mamba Aggregation Network for Visible-infrared Person Re-identification

Overview
Journal Sci Rep
Specialty Science
Date 2024 Nov 26
PMID 39592691
Abstract

Visible-Infrared Person Re-identification (VI-ReID) is persistently challenged by large intra-class variations and by cross-modality differences between visible and infrared cameras. The key is therefore to extract discriminative modality-shared features. Existing VI-ReID methods based on Convolutional Neural Networks (CNN) fall short in capturing global features, while those based on Vision Transformers (ViT) fall short in controlling computational complexity. To tackle these challenges, we propose a hybrid network framework called ReMamba. Specifically, we first use a CNN as the backbone network to extract multi-level features. Then, we introduce the Visual State Space (VSS) model, which integrates the local features output by the CNN from lower to higher levels. These local features complement the global information, thereby sharpening the local detail of the global features. Considering the potential redundancy and semantic differences between local and global features, we design an adaptive feature aggregation module that automatically filters and aggregates both types of features, and we incorporate an auxiliary aggregation loss to optimize the aggregation process. Furthermore, to better constrain cross-modality and intra-modality features, we design a modal consistency identity constraint loss that alleviates cross-modality differences and extracts modality-shared information. Extensive experiments on the SYSU-MM01, RegDB, and LLCM datasets demonstrate that the proposed ReMamba outperforms state-of-the-art VI-ReID methods.
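
The abstract does not specify how the adaptive feature aggregation module filters and combines local and global features. As a rough illustration only, one common way to realize this kind of adaptive fusion is a per-channel sigmoid gate that decides how much of each feature to keep. The sketch below is not the authors' implementation: the function names `sigmoid` and `gated_aggregate` and the parameters `w_local`, `w_global`, and `bias` are hypothetical, and the gating form itself is an assumption.

```python
import math
from typing import List, Sequence


def sigmoid(x: float) -> float:
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def gated_aggregate(
    local_feat: Sequence[float],
    global_feat: Sequence[float],
    w_local: Sequence[float],
    w_global: Sequence[float],
    bias: Sequence[float],
) -> List[float]:
    """Fuse two feature vectors channel by channel (hypothetical sketch).

    For each channel, a gate in (0, 1) is computed from both inputs.
    The fused value is gate * local + (1 - gate) * global, so a gate
    near 1 keeps the local feature and a gate near 0 keeps the global one.
    """
    lengths = {len(local_feat), len(global_feat), len(w_local), len(w_global), len(bias)}
    if len(lengths) != 1:
        raise ValueError("all inputs must have the same number of channels")

    fused = []
    for l, g, wl, wg, b in zip(local_feat, global_feat, w_local, w_global, bias):
        gate = sigmoid(wl * l + wg * g + b)  # assumed gating form, not from the paper
        fused.append(gate * l + (1.0 - gate) * g)
    return fused


if __name__ == "__main__":
    # With zero weights and zero bias every gate is 0.5, so the result is a plain average.
    print(gated_aggregate([1.0, 2.0], [3.0, 6.0], [0.0, 0.0], [0.0, 0.0], [0.0, 0.0]))
```

In a trained network the gate parameters would be learned, and an auxiliary loss such as the one the abstract mentions could supervise the fused output; neither is modeled here.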
