Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (11.6%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: hanqiu-hq
- License: other
- Language: Python
- Default Branch: main
- Size: 3.07 MB
Statistics
- Stars: 13
- Watchers: 3
- Forks: 0
- Open Issues: 2
- Releases: 0
Metadata Files
README.md
Masked AutoDecoder
This is the official implementation of the paper "Masked AutoDecoder is Effective Multi-Task Vision Generalist"
Authors: Han Qiu, Jiaxing Huang, Peng Gao, Lewei Lu, Xiaoqin Zhang, Shijian Lu
In this work, we design Masked AutoDecoder(MAD), an effective multi-task vision generalist. MAD consists of two core designs. First, we develop a parallel decoding framework that introduces bi-directional attention to capture contextual dependencies comprehensively and decode vision task sequences in parallel. Second, we design a masked sequence modeling approach that learns rich task contexts by masking and reconstructing task sequences. In this way, MAD handles all the tasks by a single network branch and a simple cross-entropy loss with minimal task-specific designs.
Installation
First Install Detectron2.
Then,
bash
cd MAD
pip install -e .
Please refer to Installation Instructions of Detrex for details of installation.
Data Preparation
```bash
First prepare COCO dataset at "./datasets" as following:
- datasets
- coco
- annotation
- captions_train2017.json
- captions_val2017.json
- instances_train2017.json
- instances_val2017.json
- personkeypointstrain2017.json
- personkeypointsval2017.json
- train2017
- val2017
merge the keypoint anno and coco instance anno
python ./project/mad/data/merge_annotations.py ```
Training
bash
python tools/train_net.py --num-gpus 8 --dist-url auto --config-file ./project/mad/model/config.py
Evaluation
bash
python tools/train_net.py --num-gpus 1 --dist-url auto --config-file ./project/mad/model/config.py --eval-only
Acknowledgement
We build MAD based on detrex.
Citation
If you find our work helpful please cite:
BibTex
@InProceedings{Qiu_2024_CVPR,
author = {Qiu, Han and Huang, Jiaxing and Gao, Peng and Lu, Lewei and Zhang, Xiaoqin and Lu, Shijian},
title = {Masked AutoDecoder is Effective Multi-Task Vision Generalist},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2024},
pages = {14152-14161}
}
Owner
- Name: Han Qiu
- Login: hanqiu-hq
- Kind: user
- Repositories: 1
- Profile: https://github.com/hanqiu-hq
Citation (CITATION.cff)
cff-version: 1.2.0 message: "If you use this software, please cite it as below." authors: - name: "detrex Contributors" title: "IDEA-CVR Detection-Transformer Toolbox and Benchmark" date-released: 2022-09-21 url: "https://github.com/IDEA-Research/detrex" license: Apache-2.0
GitHub Events
Total
- Issues event: 1
- Watch event: 4
Last Year
- Issues event: 1
- Watch event: 4