377-multi-class-token-transformer-for-weakly-supervised-semantic-segmentation
Science Score: 28.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
○codemeta.json file
-
○.zenodo.json file
-
○DOI references
-
✓Academic publication links
Links to: arxiv.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (8.5%) to scientific vocabulary
Last synced: 6 months ago
·
JSON representation
·
Repository
Basic Info
- Host: GitHub
- Owner: SZU-AdvTech-2023
- Language: Python
- Default Branch: main
- Size: 4.59 MB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Created about 2 years ago
· Last pushed about 2 years ago
Metadata Files
Citation
https://github.com/SZU-AdvTech-2023/377-Multi-Class-Token-Transformer-for-Weakly-Supervised-Semantic-Segmentation/blob/main/
# MCTformer The pytorch code for our CVPR2022 paper [Multi-class Token Transformer for Weakly Supervised Semantic Segmentation](https://arxiv.org/abs/2203.02891). [[Paper]](https://arxiv.org/abs/2203.02891) [[Project Page]](https://xulianuwa.github.io/MCTformer-project-page/)
![]()
Fig.1 - Overview of MCTformer
## Prerequisite - Ubuntu 18.04, with Python 3.6 and the following python dependencies. ``` pip install -r prerequisite.txt ``` - Download [the PASCAL VOC 2012 development kit](http://host.robots.ox.ac.uk/pascal/VOC/voc2012). ## Usage Step 1: Run the run.sh script for training MCTformer, visualizing and evaluating the generated class-specific localization maps. ``` bash runs/pascal_v2_train.sh bash runs/pascal_v2_generate.sh bash runs/pascal_v2_evaluate_generate.sh ``` ### PASCAL VOC 2012 dataset | Model | Backbone | Google drive | |--------------|------------|--------------| | MCTformer-V1 | DeiT-small | [Weights](https://drive.google.com/file/d/1jLnSbR2DDtjli5EwRYSDi3Xa6xxFIAi0/view?usp=sharing) | | MCTformer-V2 | DeiT-small | [Weights](https://drive.google.com/file/d/1w5LDoS_CHtDRXgFSqFtPvIiCajk4ZtMB/view?usp=sharing) | Step 2: Run the run_psa.sh script for using [PSA](https://github.com/jiwoon-ahn/psa) to post-process the seeds (i.e., class-specific localization maps) to generate pseudo ground-truth segmentation masks. To train PSA, the pre-trained classification [weights](https://drive.google.com/file/d/1xESB7017zlZHqxEWuh1Rb89UhjTGIKOA/view?usp=sharing) were used for initialization. ``` bash runs/pascal_v2_psa_train.sh bash runs/pascal_v2_psa_infer.sh bash runs/pascal_v2_evaluate.sh ``` Step 3: For the segmentation part, run the run_seg.sh script for training and testing the segmentation model. When training on VOC, the model was initialized with the pre-trained classification [weights](https://drive.google.com/file/d/1xESB7017zlZHqxEWuh1Rb89UhjTGIKOA/view?usp=sharing) on VOC. ``` bash runs/pascal_v2_seg_train.sh bash runs/pascal_v2_seg_infer.sh ```
Owner
- Name: SZU-AdvTech-2023
- Login: SZU-AdvTech-2023
- Kind: organization
- Repositories: 1
- Profile: https://github.com/SZU-AdvTech-2023
Citation (citation.txt)
@article{REPO377,
author = "Xu, Lian and Ouyang, Wanli and Bennamoun and Boussaid, Farid and Xu, Dan",
journal = "2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)",
pages = "4300-4309",
title = "{Multi-class Token Transformer for Weakly Supervised Semantic Segmentation}",
year = "2022"
}