PETDet

[TGRS2023] Official implement for PETDet.

https://github.com/AIRVIC-T/PETDet

Science Score: 54.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
✓
Academic publication links
Links to: arxiv.org
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (10.6%) to scientific vocabulary

Last synced: 10 months ago · JSON representation ·

Repository

[TGRS2023] Official implement for PETDet.

Basic Info

Host: GitHub
Owner: AIRVIC-T
License: apache-2.0
Language: Python
Default Branch: dev
Homepage:
Size: 21.6 MB

Statistics

Stars: 50
Watchers: 2
Forks: 7
Open Issues: 3
Releases: 0

Created over 4 years ago · Last pushed over 1 year ago

Metadata Files

Readme Contributing License Citation

PETDet: Proposal Enhancement for Two-Stage Fine-Grained Object Detection (TGRS 2023)

[📖 Paper]Official implement for PETDet.

The second place winning solution (2/220) in the track of Fine-grained Object Recognition in High-Resolution Optical Images, 2021 Gaofen Challenge on Automated High-Resolution Earth Observation Image Interpretation.

Updates

[08/25/2024] We have fixed the bug in BCFN (#7) and released new model weights. The performance does not change significantly on the three datasets due to this bug.
[11/23/2023] PETDet is accepted to TGRS 2023.

Introduction

petdet_arch

Fine-grained object detection (FGOD) extends object detection with the capability of fine-grained recognition. In recent two-stage FGOD methods, the region proposal serves as a crucial link between detection and fine-grained recognition. However, current methods overlook that some proposal-related procedures inherited from general detection are not equally suitable for FGOD, limiting the multi-task learning from generation, representation, to utilization. In this paper, we present PETDet (Proposal Enhancement for Two-stage fine-grained object detection) to properly handle the sub-tasks in two-stage FGOD methods. Firstly, an anchor-free Quality Oriented Proposal Network (QOPN) is proposed with dynamic label assignment and attention-based decomposition to generate high-quality oriented proposals. Additionally, we present a Bilinear Channel Fusion Network (BCFN) to extract independent and discriminative features from the proposals. Furthermore, we design a novel Adaptive Recognition Loss (ARL) which offers guidance for the R-CNN head to focus on high-quality proposals. Extensive experiments validate the effectiveness of PETDet. Quantitative analysis reveals that PETDet with ResNet50 reaches state-of-the-art performance on various FGOD datasets, including FAIR1M-v1.0 (42.96 AP), FAIR1M-v2.0 (48.81 AP), MAR20 (85.91 AP) and ShipRSImageNet (74.90 AP). The proposed method also achieves superior compatibility between accuracy and inference speed. Our code and models will be released at https://github.com/canoe-Z/PETDet.

Results and Models

FAIR1M-v2.0

| Method | Backbone | Angle | lr
schd | Aug | Batch
Size | AP₅₀ | Download | | :-------------------------------------------------: | :---------------------------: | :---: | :--------: | :---: | :-----------: | :-------------: | :---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------: | | Faster R-CNN | ResNet50
(1024,1024,200) | le90 | 1x | - | 2*4 | 41.64 | model | log | submission | | RoI Transformer | ResNet50
(1024,1024,200) | le90 | 1x | - | 2*4 | 44.03 | model | log | submission | | Oriented R-CNN | ResNet50
(1024,1024,200) | le90 | 1x | - | 2*4 | 43.90 | model | log | submission | | ReDet | ReResNet50
(1024,1024,200) | le90 | 1x | - | 2*4 | 46.03 | model | log | submission | | PETDet | ResNet50
(1024,1024,200) | le90 | 1x | - | 2*4 | 48.73 | model | log | submission |

MAR20

| Method | Backbone | Angle | lr
schd | Aug | Batch
Size | AP₅₀ | mAP | Download | | :-------------------------------------------------: | :-------------------: | :---: | :--------: | :---: | :-----------: | :-------------: | :---: | :--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------: | | Faster R-CNN | ResNet50
(800,800) | le90 | 3x | - | 2*4 | 75.01 | 47.57 | model | log | | RoI Transformer | ResNet50
(800,800) | le90 | 3x | - | 2*4 | 82.46 | 56.43 | model | log | | Oriented R-CNN | ResNet50
(800,800) | le90 | 3x | - | 2*4 | 82.71 | 58.14 | model | log | | PETDet | ResNet50
(800,800) | le90 | 3x | - | 2*4 | 85.89 | 61.46 | model | log |

ShipRSImageNet

| Method | Backbone | Angle | lr
schd | Aug | Batch
Size | AP₅₀ | mAP | Download | | :-------------------------------------------------: | :---------------------: | :---: | :--------: | :---: | :-----------: | :-------------: | :---: | :--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------: | | Faster R-CNN | ResNet50
(1024,1024) | le90 | 3x | - | 2*4 | 54.75 | 27.60 | model | log | | RoI Transformer | ResNet50
(1024,1024) | le90 | 3x | - | 2*4 | 60.98 | 33.56 | model | log | | Oriented R-CNN | ResNet50
(1024,1024) | le90 | 3x | - | 2*4 | 71.76 | 51.90 | model | log | | PETDet | ResNet50
(1024,1024) | le90 | 3x | - | 2*4 | 75.39 | 55.65 | model | log |

Installation

This repo is based on mmrotate 0.x and OBBDetection.

Step 1. Create a conda environment and activate it.

shell conda create --name petdet python=3.10 -y conda activate petdet

Step 2. Install PyTorch following official instructions. Pytorch 1.13.1 is recommend.

shell conda install pytorch==1.13.1 torchvision==0.14.1 torchaudio==0.13.1 pytorch-cuda=11.7 -c pytorch -c nvidia

Step 3. Install MMCV 1.x and MMDetection 2.x using MIM.

shell pip install -U openmim mim install mmcv-full==1.7.1 mim install mmdet==2.28.2

Step 4. Install PETDet from source. shell git clone https://github.com/canoe-Z/PETDet.git cd PETDet pip install -v -e .

Data Preparation

Download datassts: - FAIR1M Dataset - MAR20 Dataset - ShipRSImageNet Dataset

For FAIR1M, Please crop the original images into 1024×1024 patches with an overlap of 200 by run the split tool.

The data structure is as follows:

none PETDet ├── mmrotate ├── tools ├── configs ├── data | ├── FAIR1M1_0 │ │ ├── train │ │ ├── test │ ├── FAIR1M2_0 │ │ ├── train │ │ ├── val │ │ ├── test │ ├── MAR20 │ │ ├── Annotations │ │ ├── ImageSets │ │ ├── JPEGImages │ ├── ShipRSImageNet │ │ ├── COCO_Format │ │ ├── VOC_Format

Inference

Assuming you have put the splited FAIR1M dataset into data/split_ss_fair1m2_0/ and have downloaded the models into the weights/, you can now evaluate the models on the FAIR1M_V2.0 test split:

./tools/dist_test.sh configs/petdet/ \ petdet_r50_fpn_1x_fair1m_le90.py \ weights/petdet_r50_fpn_1x_fair1m_le90.pth 4 --format-only \ --eval-options submission_dir=work_dirs/FAIR1M_2.0_results

Then, you can upload work_dirs/FAIR1M_2.0_results/submission_zip/test.zip to ISPRS Benchmark.

Training

The following command line will train petdet_r50_fpn_1x_fair1m_le90 on 4 GPUs:

./dist_train.sh configs/petdet/petdet_r50_fpn_1x_fair1m_le90.py 4

Notes: - The models will be saved into work_dirs/petdet_r50_fpn_1x_fair1m_le90. - If you use a different mini-batch size, please change the learning rate according to the Linear Scaling Rule. - We use 4 RTX3090 GPUs for the training of these models with a mini-batch size of 8 images (2 images per GPU). However, we found that training with a smaller batchsize may yield slightly better results on the FGOD tasks.

Citations

Please cite our paper in your publications if you find this repository useful.

latex @article{li2023petdet, title={PETDet: Proposal enhancement for two-stage fine-grained object detection}, author={Li, Wentao and Zhao, Danpei and Yuan, Bo and Gao, Yue and Shi, Zhenwei}, journal={IEEE Transactions on Geoscience and Remote Sensing}, year={2023}, publisher={IEEE} }

Owner

Name: airvic-t
Login: AIRVIC-T
Kind: user
Location: China

Repositories: 7
Profile: https://github.com/AIRVIC-T

working on oriented object detection.

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - name: "MMRotate Contributors"
title: "OpenMMLab rotated object detection toolbox and benchmark"
date-released: 2022-02-18
url: "https://github.com/open-mmlab/mmrotate"
license: Apache-2.0

GitHub Events

Total

Last Year

Issues and Pull Requests

Last synced: 10 months ago

All Time

Total issues: 0
Total pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Total issue authors: 0
Total pull request authors: 0
Average comments per issue: 0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 0
Pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Issue authors: 0
Pull request authors: 0
Average comments per issue: 0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

View more stats

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels

Dependencies

.github/workflows/build.yml actions

actions/checkout v2 composite
actions/setup-python v2 composite
codecov/codecov-action v2 composite

.github/workflows/lint.yml actions

actions/checkout v2 composite
actions/setup-python v2 composite

.github/workflows/publish-to-pypi.yml actions

actions/checkout v2 composite
actions/setup-python v2 composite

.github/workflows/test_mim.yml actions

actions/checkout v2 composite
actions/setup-python v2 composite

docker/Dockerfile docker

pytorch/pytorch ${PYTORCH}-cuda${CUDA}-cudnn${CUDNN}-devel build

docker/serve/Dockerfile docker

pytorch/pytorch ${PYTORCH}-cuda${CUDA}-cudnn${CUDNN}-devel build

requirements/build.txt pypi

cython *
numpy *

requirements/docs.txt pypi

docutils ==0.16.0
myst-parser *
sphinx ==4.0.2
sphinx-copybutton *
sphinx_markdown_tables *
sphinx_rtd_theme ==0.5.2

requirements/mminstall.txt pypi

mmcv-full >=1.5.0

requirements/optional.txt pypi

imagecorruptions *
scipy *
sklearn *

requirements/readthedocs.txt pypi

e2cnn *
mmcv *
mmdet *
torch *
torchvision *

requirements/runtime.txt pypi

e2cnn *
matplotlib *
mmcv-full *
mmdet *
numpy *
pycocotools *
six *
terminaltables *
torch *

requirements/tests.txt pypi

asynctest * test
codecov * test
coverage * test
cython * test
e2cnn * test
flake8 * test
interrogate * test
isort ==4.3.21 test
kwarray * test
matplotlib * test
pytest * test
sklearn * test
ubelt * test
wheel * test
xdoctest >=0.10.0 test
yapf * test

requirements.txt pypi

setup.py pypi