377-multi-class-token-transformer-for-weakly-supervised-semantic-segmentation

https://github.com/szu-advtech-2023/377-multi-class-token-transformer-for-weakly-supervised-semantic-segmentation

Science Score: 28.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
○
codemeta.json file
○
.zenodo.json file
○
DOI references
✓
Academic publication links
Links to: arxiv.org
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (8.5%) to scientific vocabulary

Last synced: 6 months ago · JSON representation ·

Repository

Basic Info

Host: GitHub
Owner: SZU-AdvTech-2023
Language: Python
Default Branch: main
Size: 4.59 MB

Statistics

Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Releases: 0

Created about 2 years ago · Last pushed about 2 years ago

Metadata Files

Citation

https://github.com/SZU-AdvTech-2023/377-Multi-Class-Token-Transformer-for-Weakly-Supervised-Semantic-Segmentation/blob/main/

# MCTformer
The pytorch code for our CVPR2022 paper [Multi-class Token Transformer for Weakly Supervised Semantic Segmentation](https://arxiv.org/abs/2203.02891).

[[Paper]](https://arxiv.org/abs/2203.02891) [[Project Page]](https://xulianuwa.github.io/MCTformer-project-page/)


  


Fig.1 - Overview of MCTformer




## Prerequisite
- Ubuntu 18.04, with Python 3.6 and the following python dependencies.
```
pip install -r prerequisite.txt
```
- Download [the PASCAL VOC 2012 development kit](http://host.robots.ox.ac.uk/pascal/VOC/voc2012).

## Usage
Step 1: Run the run.sh script for training MCTformer, visualizing and evaluating the generated class-specific localization maps. 
```
bash runs/pascal_v2_train.sh
bash runs/pascal_v2_generate.sh
bash runs/pascal_v2_evaluate_generate.sh
```
### PASCAL VOC 2012 dataset

| Model        | Backbone   | Google drive |
|--------------|------------|--------------|
| MCTformer-V1 | DeiT-small | [Weights](https://drive.google.com/file/d/1jLnSbR2DDtjli5EwRYSDi3Xa6xxFIAi0/view?usp=sharing)  |
| MCTformer-V2 | DeiT-small | [Weights](https://drive.google.com/file/d/1w5LDoS_CHtDRXgFSqFtPvIiCajk4ZtMB/view?usp=sharing)  |

Step 2: Run the run_psa.sh script for using [PSA](https://github.com/jiwoon-ahn/psa) to post-process the seeds (i.e., class-specific localization maps) to generate pseudo ground-truth segmentation masks. To train PSA, the pre-trained classification [weights](https://drive.google.com/file/d/1xESB7017zlZHqxEWuh1Rb89UhjTGIKOA/view?usp=sharing) were used for initialization.
```
bash runs/pascal_v2_psa_train.sh
bash runs/pascal_v2_psa_infer.sh
bash runs/pascal_v2_evaluate.sh
```


Step 3: For the segmentation part, run the run_seg.sh script for training and testing the segmentation model. When training on VOC, the model was initialized with the pre-trained classification [weights](https://drive.google.com/file/d/1xESB7017zlZHqxEWuh1Rb89UhjTGIKOA/view?usp=sharing) on VOC.
```
bash runs/pascal_v2_seg_train.sh
bash runs/pascal_v2_seg_infer.sh
```

Owner

Name: SZU-AdvTech-2023
Login: SZU-AdvTech-2023
Kind: organization

Repositories: 1
Profile: https://github.com/SZU-AdvTech-2023

Citation (citation.txt)

@article{REPO377,
    author = "Xu, Lian and Ouyang, Wanli and Bennamoun and Boussaid, Farid and Xu, Dan",
    journal = "2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)",
    pages = "4300-4309",
    title = "{Multi-class Token Transformer for Weakly Supervised Semantic Segmentation}",
    year = "2022"
}

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science