orthogonal-fgod

https://github.com/zhuhaoraneis/orthogonal-fgod

Science Score: 54.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
✓
Academic publication links
Links to: arxiv.org
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (12.2%) to scientific vocabulary

Last synced: 10 months ago · JSON representation ·

Repository

Basic Info

Host: GitHub
Owner: ZhuHaoranEIS
License: mit
Language: Python
Default Branch: main
Size: 13.4 MB

Statistics

Stars: 11
Watchers: 1
Forks: 0
Open Issues: 3
Releases: 0

Created about 2 years ago · Last pushed over 1 year ago

Metadata Files

Readme Contributing License Citation

Enhancing Fine-grained Object Detection in Aerial Images via Orthogonal Mapping

This is the official implementation of the paper "Enhancing Fine-grained Object Detection in Aerial Images via Orthogonal Mapping".

:whitecheckmark: Updates

June. 9th, 2024: Update: Important! we release the FCOS w/ OM, RetinaNet w/ OM, Faster R-CNN w/ OM, and PETDet w/ OM models.

Introduction

Orthogonal Mapping (OM) is a simple yet effective method that can be deployed on both one-stage and two-stage networks to mitigate semantic confusion.

Abstract: Fine-Grained Object Detection (FGOD) is a critical task in high-resolution aerial image analysis. This letter introduces Orthogonal Mapping (OM), a simple yet effective method aimed at addressing the challenge of semantic confusion inherent in FGOD. OM introduces orthogonal constraints in the feature space by decoupling features from the last layer of the classification branch with a class-wise orthogonal vector basis. This effectively mitigates semantic confusion and enhances classification accuracy. Moreover, OM can be seamlessly integrated into mainstream object detectors. Extensive experiments conducted on three FGOD datasets (FAIR1M, ShipRSImageNet, and MAR20) demonstrate the effectiveness and superiority of the proposed approach. Notably, with just one line of code, OM achieves a 4.08\% improvement in mean Average Precision (mAP) over FCOS on the ShipRSImageNet dataset.

Methodology

Step 1: Construct the orthogonal class prototype feature

shell script A = np.random.rand(self.feat_channels, self.feat_channels) orthogonal_basis = orth(A) remaining_basis_vectors = orthogonal_basis[:self.cls_out_channels,:] self.prototype = torch.Tensor(remaining_basis_vectors) self.prototype = self.prototype / self.prototype.norm(dim=-1, keepdim=True)

Step 2: Replace convolutional or fully connected layers with orthogonal prototypes

```shell script imagefeats = featsori / featsori.norm(dim=1, keepdim=True) orthfeats = self.prototype.to(featsori.dtype).to(featsori.device)

ori: clsscore = convcls(image_feats)

clsscore = 100 * torch.einsum('bcwh, nc->bnwh', imagefeats, orth_feats) ```

Installation and Get Started

Required environments: * Linux * Python 3.10+ * PyTorch 1.13+ * CUDA 9.2+ * GCC 5+ * MMCV * PETDet

Install:

Note that this repository is based on the MMrotate and PETDet. Assume that your environment has satisfied the above requirements, please follow the following steps for installation.

shell script git clone https://github.com/ZhuHaoranEIS/Orthogonal-FGOD.git cd Orthogonal-FGOD pip install -r requirements/build.txt python setup.py develop

Data Preparation

Download datassts: - FAIR1M Dataset - MAR20 Dataset - ShipRSImageNet Dataset

For FAIR1M, Please crop the original images into 1024×1024 patches with an overlap of 200 by run the split tool.

The data structure is as follows:

none Orthogonal-FGOD ├── mmrotate ├── tools ├── configs ├── data | ├── FAIR1M1_0 │ │ ├── train │ │ ├── test │ ├── FAIR1M2_0 │ │ ├── train │ │ ├── val │ │ ├── test │ ├── MAR20 │ │ ├── Annotations │ │ ├── ImageSets │ │ ├── JPEGImages │ ├── ShipRSImageNet │ │ ├── COCO_Format │ │ ├── VOC_Format

Training

All models of OM are trained with a total batch size of 8 (can be adjusted following MMrotate).

To train FCOS w/ OM and PETDet w/ OM on FAIR1M-v1.0, run

shell script python tools/train.py fari1mv1/fcos_w_om.py python tools/train.py fari1mv1/petdet_w_om.py Please refer to fari1mv1/fcoswom.py, fari1mv1/petdetwom.py,
for model configuration

Inference

Assuming you have put the splited FAIR1M dataset into data/split_ss_fair1m2_0/ and have downloaded the models into the weights/, you can now evaluate the models on the FAIR1M_v1.0 test split:

python tools/test.py configs/.py work_dirs/.pth --format-only --eval-options submission_dir=submit/

Then, you can upload work_dirs/FAIR1M_1.0_results/submission_zip/test.zip to ISPRS Benchmark.

Main results

Table 1. Comparison results on FAIR1M-v1.0 online validation. All the models are with ResNet-50 as the backbone. Moreover, FRCNN, ORCNN, and RoI Trans denote Faster R-CNN, Oriented R-CNN, and RoI Transformer respectively. demo image

Table 2. Comparison results between our OM and other SOTA orthogonal losses on the ShipRSImageNet. demo image

Figure 1. Confusion matrices of detection results (\%) obtained from FCOS w/o OM (top) and FCOS w/ OM (bottom). The horizontal and vertical coordinates represent the ground truth labels and the prediction labels. (a) Airplane. (b) Ship. (c) Vehicle. demo image

Figure 2. The three-dimensional distribution of three main easily confused classes in FAIR1M-v1.0 obtained from FCOS w/o OM (top) and FCOS w/ OM (bottom). (a) Airplane. (b) Ship. (c) Vehicle. demo image

Citation

If you find this work helpful, please consider citing: bibtex @misc{zhu2024enhancingfinegrainedobjectdetection, title={Enhancing Fine-grained Object Detection in Aerial Images via Orthogonal Mapping}, author={Haoran Zhu and Yifan Zhou and Chang Xu and Ruixiang Zhang and Wen Yang}, year={2024}, eprint={2407.17738}, archivePrefix={arXiv}, primaryClass={cs.CV}, url={https://arxiv.org/abs/2407.17738}, }

Owner

Login: ZhuHaoranEIS
Kind: user

Repositories: 1
Profile: https://github.com/ZhuHaoranEIS

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - name: "MMRotate Contributors"
title: "OpenMMLab rotated object detection toolbox and benchmark"
date-released: 2022-02-18
url: "https://github.com/open-mmlab/mmrotate"
license: Apache-2.0

GitHub Events

Total

Watch event: 4
Issue comment event: 3
Push event: 1

Last Year

Watch event: 4
Issue comment event: 3
Push event: 1

Dependencies

.github/workflows/build.yml actions

actions/checkout v2 composite
actions/setup-python v2 composite
codecov/codecov-action v2 composite

.github/workflows/lint.yml actions

actions/checkout v2 composite
actions/setup-python v2 composite

.github/workflows/publish-to-pypi.yml actions

actions/checkout v2 composite
actions/setup-python v2 composite

.github/workflows/test_mim.yml actions

actions/checkout v2 composite
actions/setup-python v2 composite

docker/Dockerfile docker

pytorch/pytorch ${PYTORCH}-cuda${CUDA}-cudnn${CUDNN}-devel build

docker/serve/Dockerfile docker

pytorch/pytorch ${PYTORCH}-cuda${CUDA}-cudnn${CUDNN}-devel build

requirements/build.txt pypi

cython *
ninja *
numpy *

requirements/docs.txt pypi

docutils ==0.16.0
myst-parser *
sphinx ==4.0.2
sphinx-copybutton *
sphinx_markdown_tables *
sphinx_rtd_theme ==0.5.2

requirements/mminstall.txt pypi

mmcv-full >=1.5.0

requirements/optional.txt pypi

imagecorruptions *
scipy *
sklearn *

requirements/readthedocs.txt pypi

e2cnn *
mmcv *
mmdet *
torch *
torchvision *

requirements/runtime.txt pypi

e2cnn *
matplotlib *
mmcv-full *
mmdet *
numpy *
pycocotools *
six *
terminaltables *
torch *

requirements/tests.txt pypi

asynctest * test
codecov * test
coverage * test
cython * test
e2cnn * test
flake8 * test
interrogate * test
isort ==4.3.21 test
kwarray * test
matplotlib * test
pytest * test
sklearn * test
ubelt * test
wheel * test
xdoctest >=0.10.0 test
yapf * test

requirements.txt pypi

setup.py pypi

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science