pix2seq-mmdetection
Unofficial implement of "Pix2seq: A Language Modeling Framework for Object Detection" on mmdetection
Science Score: 44.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (8.1%) to scientific vocabulary
Repository
Unofficial implement of "Pix2seq: A Language Modeling Framework for Object Detection" on mmdetection
Basic Info
Statistics
- Stars: 33
- Watchers: 3
- Forks: 4
- Open Issues: 1
- Releases: 1
Metadata Files
README.md
Introduction
This is an unofficial replication of "Pix2seq: A Language Modeling Framework for Object Detection" with pretrained model on mmdetection.
License
This project is released under the Apache 2.0 license.
Installation
Please refer to get_started.md for installation.
Train & Evaluation
Train by running (about 10 days with 8*V100 32GB)
bash
python -m torch.distributed.launch --nproc_per_node=8 --master_port=5003 \
tools/train.py configs/pix2seq/pix2seq_r50_8x4_50e_coco.py --work-dir pix2seq-output --gpus 8 --launcher pytorch
or
Download pretrained pix2seq weights.
Evaluate with single gpu:
bash
python tools/test.py configs/pix2seq/pix2seq_r50_8x4_300_coco.py \
weights/checkpoints.pth --work-dir pix2seq-output --eval bbox --show-dir pix2seq-vis
Evaluate with 8 gpus:
bash
python -m torch.distributed.launch --nproc_per_node=8 --master_port=5003 \
tools/test.py configs/pix2seq/pix2seq_r50_8x4_300_coco.py weights/checkpoints.pth \
--work-dir pix2seq-output --eval bbox --launcher pytorch
| Method | backbone | Epoch | Batch Size | AP | AP50 | AP75 | Weights | | :-----: | :------: | :----:| :---------:| :---:| :---: | :---: | :---: | | Ours | R50 | 300 | 32 | 36.4 | 52.8 | 38.5 | model | | Paper | R50 | 300 | 128 | 43.0 | 61.0 | 45.6 | - |
Visualization






TO-DO
- [x] random shuffle targets
- [x] training from scratch
- [x] drop class token
- [x] stochastic depth
- [x] large scale jittering
- [ ] support for custom dataset
- [x] two independent augmentations for each image
- [x] FrozenBatchNorm2d in backbones
- [x] auto-argument
- [x] nucleus sampling
Acknowledgement
Owner
- Login: Sharpiless
- Kind: user
- Location: Beijing, China
- Company: Beijing Institute of Technology
- Website: https://blog.csdn.net/weixin_44936889
- Repositories: 25
- Profile: https://github.com/Sharpiless
PhD student | 3D Vision | NERF | High Level Vision
Citation (CITATION.cff)
cff-version: 1.2.0 message: "If you use this software, please cite it as below." authors: - name: "MMDetection Contributors" title: "OpenMMLab Detection Toolbox and Benchmark" date-released: 2018-08-22 url: "https://github.com/open-mmlab/mmdetection" license: Apache-2.0
GitHub Events
Total
- Watch event: 2
Last Year
- Watch event: 2
Issues and Pull Requests
Last synced: about 1 year ago
All Time
- Total issues: 1
- Total pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Total issue authors: 1
- Total pull request authors: 0
- Average comments per issue: 0.0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Past Year
- Issues: 0
- Pull requests: 0
- Average time to close issues: N/A
- Average time to close pull requests: N/A
- Issue authors: 0
- Pull request authors: 0
- Average comments per issue: 0
- Average comments per pull request: 0
- Merged pull requests: 0
- Bot issues: 0
- Bot pull requests: 0
Top Authors
Issue Authors
- vansin (1)
Pull Request Authors
Top Labels
Issue Labels
Pull Request Labels
Dependencies
- cython *
- numpy *
- docutils ==0.16.0
- recommonmark *
- sphinx ==4.0.2
- sphinx-copybutton *
- sphinx_markdown_tables *
- sphinx_rtd_theme ==0.5.2
- mmcv-full >=1.3.17
- cityscapesscripts *
- imagecorruptions *
- scipy *
- sklearn *
- mmcv *
- torch *
- torchvision *
- matplotlib *
- numpy *
- pycocotools *
- pycocotools-windows *
- six *
- terminaltables *
- asynctest * test
- codecov * test
- flake8 * test
- interrogate * test
- isort ==4.3.21 test
- kwarray * test
- mmtrack * test
- onnx ==1.7.0 test
- onnxruntime >=1.8.0 test
- pytest * test
- ubelt * test
- xdoctest >=0.10.0 test
- yapf * test