https://github.com/aim-uofa/adelaidet

AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.

Science Score: 46.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
○
.zenodo.json file
✓
DOI references
Found 1 DOI reference(s) in README
✓
Academic publication links
Links to: arxiv.org
✓
Committers with academic emails
1 of 31 committers (3.2%) from academic institutions
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (7.9%) to scientific vocabulary

Keywords

abcnet adelaidet blendmask boxinst condinst densecl fcos instance-segmentation meinst object-detection ocr solo solov2 text-detection text-recognition

Keywords from Contributors

autograd vision-transformers tensor transformer projection interactive sequences distributed archival pose-estimation

Last synced: 6 months ago · JSON representation

Repository

AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.

Basic Info

Host: GitHub
Owner: aim-uofa
License: other
Language: Python
Default Branch: master
Homepage: https://git.io/AdelaiDet
Size: 663 KB

Statistics

Stars: 3,434
Watchers: 83
Forks: 651
Open Issues: 306
Releases: 0

Topics

abcnet adelaidet blendmask boxinst condinst densecl fcos instance-segmentation meinst object-detection ocr solo solov2 text-detection text-recognition

Created about 6 years ago · Last pushed over 1 year ago

Metadata Files

Readme License

AdelaiDet

As of Jan. 2024, the CloudStor server is dead. Model files are hosted on huggingface:

https://huggingface.co/ZjuCv/AdelaiDet/tree/main
https://huggingface.co/tianzhi/AdelaiDet-FCOS/tree/main
https://huggingface.co/tianzhi/AdelaiDet-CondInst/tree/main
https://huggingface.co/tianzhi/AdelaiDet-BoxInst/tree/main

AdelaiDet is an open source toolbox for multiple instance-level recognition tasks on top of Detectron2. All instance-level recognition works from our group are open-sourced here.

To date, AdelaiDet implements the following algorithms:

Models

COCO Object Detecton Baselines with FCOS

Name | inf. time | box AP | download --- |:---:|:---:|:--- FCOSR50_1x | 16 FPS | 38.7 | model FCOSMSR1012x | 12 FPS | 43.1 | model FCOSMSX10132x8d_2x | 6.6 FPS | 43.9 | model FCOSMSX10132x8ddcnv22x | 4.6 FPS | 46.6 | model FCOSRTMSDLA344xshtw | 52 FPS | 39.1 | model

More models can be found in FCOS README.md.

COCO Instance Segmentation Baselines with BlendMask

Model | Name |inf. time | box AP | mask AP | download --- |:---:|:---:|:---:|:---:|:---: Mask R-CNN | R1013x | 10 FPS | 42.9 | 38.6 | BlendMask | R1013x | 11 FPS | 44.8 | 39.5 | model BlendMask | R101dcni3_5x | 10 FPS | 46.8 | 41.1 | model

For more models and information, please refer to BlendMask README.md.

COCO Instance Segmentation Baselines with MEInst

Name | inf. time | box AP | mask AP | download --- |:---:|:---:|:---:|:---: MEInstR50_3x | 12 FPS | 43.6 | 34.5 | model

For more models and information, please refer to MEInst README.md.

Total_Text results with ABCNet

Name | inf. time | e2e-hmean | det-hmean | download --- |:---------:|:---------:|:---------:|:---: v1-totaltext | 11 FPS | 67.1 | 86.0 | model v2-totaltext | 7.7 FPS | 71.8 | 87.2 | model

For more models and information, please refer to ABCNet README.md.

COCO Instance Segmentation Baselines with CondInst

Name | inf. time | box AP | mask AP | download --- |:---:|:---:|:---:|:---: CondInstMSR501x | 14 FPS | 39.7 | 35.7 | model CondInstMSR50BiFPN3xsem | 13 FPS | 44.7 | 39.4 | model CondInstMSR1013x | 11 FPS | 43.3 | 38.6 | model CondInstMSR101BiFPN3xsem | 10 FPS | 45.7 | 40.2 | model

For more models and information, please refer to CondInst README.md.

Note that: - Inference time for all projects is measured on a NVIDIA 1080Ti with batch size 1. - APs are evaluated on COCO2017 val split unless specified.

Installation

First install Detectron2 following the official guide: INSTALL.md.

Please use Detectron2 with commit id 9eb4831 if you have any issues related to Detectron2.

Then build AdelaiDet with:

git clone https://github.com/aim-uofa/AdelaiDet.git cd AdelaiDet python setup.py build develop

If you are using docker, a pre-built image can be pulled with:

docker pull tianzhi0549/adet:latest

Some projects may require special setup, please follow their own README.md in configs.

Quick Start

Inference with Pre-trained Models

Pick a model and its config file, for example, fcos_R_50_1x.yaml.
Download the model wget https://huggingface.co/tianzhi/AdelaiDet-FCOS/resolve/main/FCOS_R_50_1x.pth?download=true -O fcos_R_50_1x.pth
Run the demo with python demo/demo.py \ --config-file configs/FCOS-Detection/R_50_1x.yaml \ --input input1.jpg input2.jpg \ --opts MODEL.WEIGHTS fcos_R_50_1x.pth

Train Your Own Models

To train a model with "train_net.py", first setup the corresponding datasets following datasets/README.md, then run:

OMP_NUM_THREADS=1 python tools/train_net.py \ --config-file configs/FCOS-Detection/R_50_1x.yaml \ --num-gpus 8 \ OUTPUT_DIR training_dir/fcos_R_50_1x To evaluate the model after training, run:

OMP_NUM_THREADS=1 python tools/train_net.py \ --config-file configs/FCOS-Detection/R_50_1x.yaml \ --eval-only \ --num-gpus 8 \ OUTPUT_DIR training_dir/fcos_R_50_1x \ MODEL.WEIGHTS training_dir/fcos_R_50_1x/model_final.pth Note that: - The configs are made for 8-GPU training. To train on another number of GPUs, change the --num-gpus. - If you want to measure the inference time, please change --num-gpus to 1. - We set OMP_NUM_THREADS=1 by default, which achieves the best speed on our machines, please change it as needed. - This quick start is made for FCOS. If you are using other projects, please check the projects' own README.md in configs.

Acknowledgements

The authors are grateful to Nvidia, Huawei Noah's Ark Lab, ByteDance, Adobe who generously donated GPU computing in the past a few years.

Citing AdelaiDet

If you use this toolbox in your research or wish to refer to the baseline results published here, please use the following BibTeX entries:

```BibTeX

@misc{tian2019adelaidet, author = {Tian, Zhi and Chen, Hao and Wang, Xinlong and Liu, Yuliang and Shen, Chunhua}, title = {{AdelaiDet}: A Toolbox for Instance-level Recognition Tasks}, howpublished = {\url{https://git.io/adelaidet}}, year = {2019} } and relevant publications:BibTeX

@inproceedings{tian2019fcos, title = {{FCOS}: Fully Convolutional One-Stage Object Detection}, author = {Tian, Zhi and Shen, Chunhua and Chen, Hao and He, Tong}, booktitle = {Proc. Int. Conf. Computer Vision (ICCV)}, year = {2019} }

@article{tian2021fcos, title = {{FCOS}: A Simple and Strong Anchor-free Object Detector}, author = {Tian, Zhi and Shen, Chunhua and Chen, Hao and He, Tong}, journal = {IEEE T. Pattern Analysis and Machine Intelligence (TPAMI)}, year = {2021} }

@inproceedings{chen2020blendmask, title = {{BlendMask}: Top-Down Meets Bottom-Up for Instance Segmentation}, author = {Chen, Hao and Sun, Kunyang and Tian, Zhi and Shen, Chunhua and Huang, Yongming and Yan, Youliang}, booktitle = {Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR)}, year = {2020} }

@inproceedings{zhang2020MEInst, title = {Mask Encoding for Single Shot Instance Segmentation}, author = {Zhang, Rufeng and Tian, Zhi and Shen, Chunhua and You, Mingyu and Yan, Youliang}, booktitle = {Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR)}, year = {2020} }

@inproceedings{liu2020abcnet, title = {{ABCNet}: Real-time Scene Text Spotting with Adaptive {B}ezier-Curve Network}, author = {Liu, Yuliang and Chen, Hao and Shen, Chunhua and He, Tong and Jin, Lianwen and Wang, Liangwei}, booktitle = {Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR)}, year = {2020} }

@ARTICLE{9525302, author={Liu, Yuliang and Shen, Chunhua and Jin, Lianwen and He, Tong and Chen, Peng and Liu, Chongyu and Chen, Hao}, journal={IEEE Transactions on Pattern Analysis and Machine Intelligence}, title={ABCNet v2: Adaptive Bezier-Curve Network for Real-time End-to-end Text Spotting}, year={2021}, volume={}, number={}, pages={1-1}, doi={10.1109/TPAMI.2021.3107437} }

@inproceedings{wang2020solo, title = {{SOLO}: Segmenting Objects by Locations}, author = {Wang, Xinlong and Kong, Tao and Shen, Chunhua and Jiang, Yuning and Li, Lei}, booktitle = {Proc. Eur. Conf. Computer Vision (ECCV)}, year = {2020} }

@inproceedings{wang2020solov2, title = {{SOLOv2}: Dynamic and Fast Instance Segmentation}, author = {Wang, Xinlong and Zhang, Rufeng and Kong, Tao and Li, Lei and Shen, Chunhua}, booktitle = {Proc. Advances in Neural Information Processing Systems (NeurIPS)}, year = {2020} }

@article{wang2021solo, title = {{SOLO}: A Simple Framework for Instance Segmentation}, author = {Wang, Xinlong and Zhang, Rufeng and Shen, Chunhua and Kong, Tao and Li, Lei}, journal = {IEEE T. Pattern Analysis and Machine Intelligence (TPAMI)}, year = {2021} }

@article{tian2019directpose, title = {{DirectPose}: Direct End-to-End Multi-Person Pose Estimation}, author = {Tian, Zhi and Chen, Hao and Shen, Chunhua}, journal = {arXiv preprint arXiv:1911.07451}, year = {2019} }

@inproceedings{tian2020conditional, title = {Conditional Convolutions for Instance Segmentation}, author = {Tian, Zhi and Shen, Chunhua and Chen, Hao}, booktitle = {Proc. Eur. Conf. Computer Vision (ECCV)}, year = {2020} }

@article{CondInst2022Tian, title = {Instance and Panoptic Segmentation Using Conditional Convolutions}, author = {Tian, Zhi and Zhang, Bowen and Chen, Hao and Shen, Chunhua}, journal = {IEEE T. Pattern Analysis and Machine Intelligence (TPAMI)}, year = {2022} }

@inproceedings{tian2021boxinst, title = {{BoxInst}: High-Performance Instance Segmentation with Box Annotations}, author = {Tian, Zhi and Shen, Chunhua and Wang, Xinlong and Chen, Hao}, booktitle = {Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR)}, year = {2021} }

@inproceedings{wang2021densecl, title = {Dense Contrastive Learning for Self-Supervised Visual Pre-Training}, author = {Wang, Xinlong and Zhang, Rufeng and Shen, Chunhua and Kong, Tao and Li, Lei}, booktitle = {Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR)}, year = {2021} }

@inproceedings{Mao2021pose, title = {{FCPose}: Fully Convolutional Multi-Person Pose Estimation With Dynamic Instance-Aware Convolutions}, author = {Mao, Weian and Tian, Zhi and Wang, Xinlong and Shen, Chunhua}, booktitle = {Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR)}, year = {2021} } ```

License

For academic use, this project is licensed under the 2-clause BSD License - see the LICENSE file for details. For commercial use, please contact Chunhua Shen.

Owner

Name: Advanced Intelligent Machines (AIM)
Login: aim-uofa
Kind: organization
Location: China

Repositories: 23
Profile: https://github.com/aim-uofa

A research team at Zhejiang University, focusing on Computer Vision and broad AI research ...

GitHub Events

Total

Issues event: 2
Watch event: 90
Issue comment event: 7
Fork event: 14

Last Year

Issues event: 2
Watch event: 90
Issue comment event: 7
Fork event: 14

Committers

Last synced: 9 months ago

All Time

Total Commits: 354
Total Committers: 31
Avg Commits per committer: 11.419
Development Distribution Score (DDS): 0.636

Past Year

Commits: 1
Committers: 1
Avg Commits per committer: 1.0
Development Distribution Score (DDS): 0.0

Top Committers

Name	Email	Commits
Zhi Tian	t**9@1**m	129
Yuliang Liu	3****u	52
Hao Chen	s**u@g**m	45
chhshen	c**n@y**m	30
chenp	c**p@o**n	24
zzzzzz0407	c**g@q**m	15
Xinlong Wang	w**n@g**m	9
Chen Yongfan	7****n	8
Hao Chen	s**s@1**m	5
zhangrufeng.0407	z**7@b**m	4
blueardour	b**r@g**m	4
Johnqczhang	J**g@g**m	3
Yuliang-Liu	l**g@m**n	2
lbin	l**n@o**m	2
Venquieu	4****u	2
YunqiuXu	2****u	2
C Shen	1****n	2
youngwanLEE	y**8@g**m	2
dependabot[bot]	4****]	2
weianmao	1**9@q**m	1
lee1409	l**9@g**m	1
Zhuo Zhang	i**o@f**m	1
Yuxin Wu	p****x	1
LDOUBLEV	l**3@o**m	1
Hu Ye	x**c@g**m	1
Gu Wang	w****2	1
Feiyang Chen	f**8@g**m	1
local	l**l@m**m	1
Derrick Knox	7****x	1
Dennis Park	d**k@t**l	1
and 1 more...

Committer Domains (Top 20 + Academic)

qq.com: 2 163.com: 2 tri.global: 1 mv.com: 1 foxmail.com: 1 mail.scut.edu.cn: 1 bytedance.com: 1

Issues and Pull Requests

Last synced: 6 months ago

All Time

Total issues: 82
Total pull requests: 29
Average time to close issues: 15 days
Average time to close pull requests: 17 days
Total issue authors: 58
Total pull request authors: 9
Average comments per issue: 4.87
Average comments per pull request: 0.72
Merged pull requests: 27
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 4
Pull requests: 0
Average time to close issues: about 11 hours
Average time to close pull requests: N/A
Issue authors: 4
Pull request authors: 0
Average comments per issue: 0.25
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

View more stats

Top Authors

Issue Authors

lucasjinreal (6)
Yuxin-CV (6)
tengerye (3)
zhepherd (3)
jiangzz1628 (3)
AIFANLIN (2)
youngwanLEE (2)
Anikily (2)
stan-haochen (2)
lkevinzc (2)
innat (2)
seecswajid (2)
chenyangMl (2)
haoran1062 (1)
MxxM-max (1)

Pull Request Authors

tianzhi0549 (8)
stan-haochen (7)
blueardour (6)
Yuliang-Liu (2)
stanstarks (2)
lbin (1)
cclauss (1)
youngwanLEE (1)
zzzzzz0407 (1)

Top Labels

Issue Labels

enhancement (2) help wanted (2) duplicate (1) good first issue (1)

Pull Request Labels

Dependencies

docker/Dockerfile docker

nvidia/cuda 10.2-devel-ubuntu18.04 build

docs/requirements.txt pypi

Pillow ==8.1.1
Sphinx >=1.7
cloudpickle *
docutils >=0.14
future *
matplotlib *
mock *
numpy *
recommonmark ==0.4.0
requests *
six *
sphinx_rtd_theme *
tabulate *
termcolor *
tqdm *
yacs *

setup.py pypi

Pillow >=6.0
Polygon3 *
cloudpickle *
editdistance *
matplotlib *
rapidfuzz *
scikit-image *
shapely *
tabulate *
tensorboard *
termcolor >=1.1
tqdm >4.29.0
yacs >=0.1.6