https://github.com/924973292/mambapro

【AAAI2025】MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt

https://github.com/924973292/mambapro

Science Score: 23.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: arxiv.org, scholar.google
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (8.3%) to scientific vocabulary

Keywords

adapter clip finetuning fusion mamba multi-modal prompt reid
Last synced: 6 months ago · JSON representation

Repository

【AAAI2025】MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt

Basic Info
  • Host: GitHub
  • Owner: 924973292
  • License: mit
  • Language: Python
  • Default Branch: master
  • Homepage:
  • Size: 24.2 MB
Statistics
  • Stars: 51
  • Watchers: 1
  • Forks: 2
  • Open Issues: 0
  • Releases: 0
Topics
adapter clip finetuning fusion mamba multi-modal prompt reid
Created about 1 year ago · Last pushed 12 months ago
Metadata Files
Readme License

README.md

MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt

Description of the image

Yuhao Wang · Xuehu Liu · Tianyu Yan · Yang Liu · Aihua Zheng · Pingping Zhang* · Huchuan Lu

AAAI 2025 Paper

Overall Framework

MambaPro is a novel multi-modal object ReID framework that integrates CLIP's pre-trained capabilities with state-of-the-art multi-modal aggregation techniques. Using Parallel Feed-Forward Adapters (PFA), Synergistic Residual Prompts (SRP), and the innovative Mamba Aggregation (MA) mechanism, it achieves robust performance with reduced computational complexity. MambaPro sets new standards in handling long sequences and missing modalities.


News

  • We released the MambaPro codebase and paper! 🚀 Paper
  • Great news! Our paper has been accepted to AAAI 2025! 🎉

Table of Contents


Introduction

Multi-modal object ReID leverages complementary data from diverse modalities (e.g., RGB, NIR, TIR) to overcome challenges like poor lighting and occlusion. MambaPro advances this field by: 1. PFA: Transferring CLIP's pre-trained knowledge to ReID tasks via parallel adapters. 2. SRP: Integrating modality-specific prompts with synergistic transformations. 3. MA: Efficiently modeling intra- and inter-modality interactions with linear complexity.


Contributions

  • Introduced MambaPro, the first CLIP-based framework for multi-modal object ReID.
  • Developed SRP for synergistic learning across modalities with residual refinements.
  • Proposed MA, achieving linear complexity for long-sequence multi-modal interactions.
  • Validated effectiveness on RGBNT201, RGBNT100, and MSVR310 benchmarks.

Results

Multi-Modal Object ReID

Multi-Modal Person ReID [RGBNT201]

RGBNT201 Results

Multi-Modal Vehicle ReID [RGBNT100 & MSVR310]

Vehicle Results

Ablation Studies [RGBNT201]

Ablation Results

Hyperparameter Analysis [RGBNT201]

Hyperparameter Analysis


Visualizations

Feature Distribution (t-SNE)

t-SNE Distribution

GradCam Visualization

Prompt Comparisons

Prompt Comparisons


Reproduction

Datasets

Pretrained Models

Configuration

  • RGBNT201: configs/RGBNT201/MambaPro.yml
  • RGBNT100: configs/RGBNT100/MambaPro.yml
  • MSVR310: configs/MSVR310/MambaPro.yml

Training

bash conda create -n MambaPro python=3.10.13 conda activate MambaPro pip install torch==2.1.1+cu118 torchvision==0.16.1+cu118 torchaudio==2.1.1+cu118 --index-url https://download.pytorch.org/whl/cu118 cd (your_path) pip install -r requirements.txt python train_net.py --config_file configs/RGBNT201/MambaPro.yml

Note

  • If you want to use a CLIP-based framework for multi-modal object ReID, DeMo is a better choice, the prompt/adapter tuning configuration in MambaPro is retained for users, besides, we provide detailed visualizations code in DeMo.
  • Thanks for your attention and support!

Star History

Star History Chart


Citation

If you find MambaPro helpful in your research, please consider citing: bibtex @inproceedings{wang2025MambaPro, title={MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt}, author={Wang, Yuhao and Liu, Xuehu and Yan, Tianyu and Liu, Yang and Zheng, Aihua and Zhang, Pingping and Lu, Huchuan}, booktitle={Proceedings of the AAAI Conference on Artificial Intelligence}, year={2025} }

Owner

  • Name: Yuhao Wang
  • Login: 924973292
  • Kind: user
  • Location: Dalian
  • Company: Dalian University of Technology

生如芥子,心藏须弥

GitHub Events

Total
  • Issues event: 10
  • Watch event: 72
  • Issue comment event: 20
  • Push event: 23
  • Fork event: 1
  • Create event: 6
Last Year
  • Issues event: 10
  • Watch event: 72
  • Issue comment event: 20
  • Push event: 23
  • Fork event: 1
  • Create event: 6

Issues and Pull Requests

Last synced: 6 months ago

All Time
  • Total issues: 2
  • Total pull requests: 0
  • Average time to close issues: 28 days
  • Average time to close pull requests: N/A
  • Total issue authors: 2
  • Total pull request authors: 0
  • Average comments per issue: 3.5
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Past Year
  • Issues: 2
  • Pull requests: 0
  • Average time to close issues: 28 days
  • Average time to close pull requests: N/A
  • Issue authors: 2
  • Pull request authors: 0
  • Average comments per issue: 3.5
  • Average comments per pull request: 0
  • Merged pull requests: 0
  • Bot issues: 0
  • Bot pull requests: 0
Top Authors
Issue Authors
  • eatgreenjiao (2)
  • xihuai23 (1)
  • memory541 (1)
Pull Request Authors
  • alexlugovsk (1)
Top Labels
Issue Labels
Pull Request Labels