https://github.com/924973292/mambapro
【AAAI2025】MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt
Science Score: 23.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
○DOI references
-
✓Academic publication links
Links to: arxiv.org, scholar.google -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (8.3%) to scientific vocabulary
Keywords
Repository
【AAAI2025】MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt
Basic Info
Statistics
- Stars: 51
- Watchers: 1
- Forks: 2
- Open Issues: 0
- Releases: 0
Topics
Metadata Files
README.md
MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt
Yuhao Wang · Xuehu Liu · Tianyu Yan · Yang Liu · Aihua Zheng · Pingping Zhang* · Huchuan Lu
MambaPro is a novel multi-modal object ReID framework that integrates CLIP's pre-trained capabilities with state-of-the-art multi-modal aggregation techniques. Using Parallel Feed-Forward Adapters (PFA), Synergistic Residual Prompts (SRP), and the innovative Mamba Aggregation (MA) mechanism, it achieves robust performance with reduced computational complexity. MambaPro sets new standards in handling long sequences and missing modalities.
News
- We released the MambaPro codebase and paper! 🚀 Paper
- Great news! Our paper has been accepted to AAAI 2025! 🎉
Table of Contents
Introduction
Multi-modal object ReID leverages complementary data from diverse modalities (e.g., RGB, NIR, TIR) to overcome challenges like poor lighting and occlusion. MambaPro advances this field by: 1. PFA: Transferring CLIP's pre-trained knowledge to ReID tasks via parallel adapters. 2. SRP: Integrating modality-specific prompts with synergistic transformations. 3. MA: Efficiently modeling intra- and inter-modality interactions with linear complexity.
Contributions
- Introduced MambaPro, the first CLIP-based framework for multi-modal object ReID.
- Developed SRP for synergistic learning across modalities with residual refinements.
- Proposed MA, achieving linear complexity for long-sequence multi-modal interactions.
- Validated effectiveness on RGBNT201, RGBNT100, and MSVR310 benchmarks.
Results
Multi-Modal Object ReID
Multi-Modal Person ReID [RGBNT201]
Multi-Modal Vehicle ReID [RGBNT100 & MSVR310]
Ablation Studies [RGBNT201]
Hyperparameter Analysis [RGBNT201]
Visualizations
Feature Distribution (t-SNE)
GradCam Visualization
Reproduction
Datasets
- RGBNT201: Google Drive
- RGBNT100: Baidu Pan (Code:
rjin) - MSVR310: Google Drive
Pretrained Models
- CLIP: Baidu Pan (Code:
52fu)
Configuration
- RGBNT201:
configs/RGBNT201/MambaPro.yml - RGBNT100:
configs/RGBNT100/MambaPro.yml - MSVR310:
configs/MSVR310/MambaPro.yml
Training
bash
conda create -n MambaPro python=3.10.13
conda activate MambaPro
pip install torch==2.1.1+cu118 torchvision==0.16.1+cu118 torchaudio==2.1.1+cu118 --index-url https://download.pytorch.org/whl/cu118
cd (your_path)
pip install -r requirements.txt
python train_net.py --config_file configs/RGBNT201/MambaPro.yml
Note
- If you want to use a CLIP-based framework for multi-modal object ReID, DeMo is a better choice, the prompt/adapter tuning configuration in MambaPro is retained for users, besides, we provide detailed visualizations code in DeMo.
- Thanks for your attention and support!
Star History
Citation
If you find MambaPro helpful in your research, please consider citing:
bibtex
@inproceedings{wang2025MambaPro,
title={MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt},
author={Wang, Yuhao and Liu, Xuehu and Yan, Tianyu and Liu, Yang and Zheng, Aihua and Zhang, Pingping and Lu, Huchuan},
booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
year={2025}
}
Owner
- Name: Yuhao Wang
- Login: 924973292
- Kind: user
- Location: Dalian
- Company: Dalian University of Technology
- Repositories: 7
- Profile: https://github.com/924973292
生如芥子,心藏须弥
GitHub Events
Total
- Issues event: 10
- Watch event: 72
- Issue comment event: 20
- Push event: 23
- Fork event: 1
- Create event: 6
Last Year
- Issues event: 10
- Watch event: 72
- Issue comment event: 20
- Push event: 23
- Fork event: 1
- Create event: 6
Issues and Pull Requests
Last synced: 6 months ago
All Time
- Total issues: 2
- Total pull requests: 0
- Average time to close issues: 28 days
- Average time to close pull requests: N/A
- Total issue authors: 2
- Total pull request authors: 0
- Average comments per issue: 3.5
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 2
- Pull requests: 0
- Average time to close issues: 28 days
- Average time to close pull requests: N/A
- Issue authors: 2
- Pull request authors: 0
- Average comments per issue: 3.5
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- eatgreenjiao (2)
- xihuai23 (1)
- memory541 (1)
Pull Request Authors
- alexlugovsk (1)