learning-heavily-degraded-prior

This repo is the official implementation of 'Learning Heavily-Degraded Prior for Underwater Object Detection'.

https://github.com/xiaodetection/learning-heavily-degraded-prior

Last synced: 11 months ago · JSON representation ·

Repository

This repo is the official implementation of 'Learning Heavily-Degraded Prior for Underwater Object Detection'.

Basic Info

Host: GitHub
Owner: xiaoDetection
License: apache-2.0
Language: Python
Default Branch: main
Homepage: https://ieeexplore.ieee.org/document/10113328
Size: 7.74 MB

Statistics

Stars: 20
Watchers: 2
Forks: 3
Open Issues: 0
Releases: 0

Created over 3 years ago · Last pushed almost 2 years ago

Metadata Files

Readme License Citation

Learning Heavily Degraded Piror

This repo is the official implementation of Learning Heavily-Degraded Prior for Underwater Object Detection. It is based on mmdetection.

Introduction

we also provide an introduction in Chinese here.

We propose a residual feature transference module (RFTM) to learn a mapping between deep representations of the heavily degraded patches of DFUI and underwater images, and make the mapping as a heavily degraded prior (HDP) for underwater detection. Since the statistical properties are independent to image content, HDP can be learned without the supervision of semantic labels and plugged into popular CNN-based feature extraction networks to improve their performance on underwater object detection. Without bells and whistles, evaluations on URPC2020 and UODD show that our methods outperform CNN-based detectors by a large margin. Our method with higher speeds and less parameters still performs better than transformer-based detectors.

Fig2

Dataset and Training strategy

The DFUI dataset

For the training set of URPC2020 and URPC2021, we use Cascade RCNN to pick up images with $AP \geq 60$ to constrauct DFUI datasets. The DFUI dataset is used for pretraining and unsupervised trainsference training phase in this work.

Notes:

The DFUI dataset can be downloaded from: Github or Google Drive
The URPC2020 dataset can be downloaded from: Github or Google Drive

HD and LD subsets

The patches with the transmission value $t$ less than a threshold $T$ ($T$ = 0.5 as a example) from the DFUI and underwater dataset produce the heavily-degraded($HD$) subsets; those having higher transmission values constitute the respective lightly degraded($LD$) subsets. $t$ value can be easily estimated by common UDCP methods.

We only use $HD$ subsets for training. $HDu$ and $HDf$ represent subsets of underwater datasets and the DFUI, respectively.

Two-stage training strategy

For efficiently plugging RFTM into a detector, we propose a two-stage learning scheme from the perspective of the unsupervised and finetune learning strategy. The first stage is training RFTM in an unsupervised manner on $HDf$ and $HDu$ subsets without semantic labels. The second stage is fixed RFTM to finetune some components of a detector on underwater dataset.

Please refer to our paper for more details.

Results and Models

URPC2020

| Methods | Backbone | Pretrain | $AP$ | $AP{50}$ | $AP{75}$ | $APS$ | $APM$ | $AP_L$ | #params | config | model | | --------- | ----------- | ------------------------------------------------------------------------------------------------------------------------------------------------ | ------ | ----------- | ----------- | -------- | -------- | -------- | ------- | ---------------------------------------------------------------------------------------------------------- | -------------------------------------------------------------------------------------------------------------------------------- | | RFTM-50 | ResNet50 | cascadercnnr50_dfui | 48.2 | 80.7 | 50.0 | 19.5 | 41.6 | 53.1 | 75.5M | config | rftm50urpc | | RFTM-x101 | ResNetXT101 | cascadercnnx101_dfui | 50.9 | 84.7 | 55.2 | 25.5 | 45.1 | 56.9 | 133.4M | config | rftmx101urpc |

UODD

| Methods | Backbone | Pretrain | $AP$ | $AP{50}$ | $AP{75}$ | $APS$ | $APM$ | $AP_L$ | #parames | config | model | | --------- | ----------- | ------------------------------------------------------------------------------------------------------------------------------------------------ | ------ | ----------- | ----------- | -------- | -------- | -------- | -------- | ---------------------------------------------------------------------------------------------------------- | -------------------------------------------------------------------------------------------------------------------------------- | | RFTM-50 | ResNet50 | cascadercnnr50_dfui | 50.8 | 89.0 | 53.6 | 33.6 | 50.9 | 62.8 | 75.5M | config | rftm50uodd | | RFTM-x101 | ResNetXT101 | cascadercnnx101_dfui | 52.7 | 90.8 | 50.0 | 47.7 | 52.4 | 63.5 | 133.4M | config | rftmx101uodd |

Usage

Installation

To install pytorch, run:

conda install pytorch==1.11.0 torchvision==0.12.0 torchaudio==0.11.0 cudatoolkit=11.3 -c pytorch

To install mmdetection, run:

```

install mmcv-full

pip install mmcv-full==1.4.8 -f https://download.openmmlab.com/mmcv/dist/cu113/torch1.11/index.html

install mmdet

pip install -r requirements/build.txt python setup.py develop ```

To install guided-filter-pytorch, run:

pip install guided-filter-pytorch

Inference

python tools/test.py <config_file> <checkpoint_file> --eval bbox

Training

To train RFTM-50, run:

python tools/train.py configs/rftm/rftm_50.py --work-dir <work_dir>

To train RFTM-X101, run:

python tools/train.py configs/rftm/rftm_x101.py --work-dir <work_dir>

Training with custom datasets

Perpare your data

Please convert your labels into COCO format and place your data into data/YOUR_DATASET/.

The directory structure should be like this: Learing-Heavily-Degraed-Prior ├── data │ ├── YOUR_DATASET │ │ ├── annotations │ │ ├── images

Prepare a config

Follow the template below to create a new config file configs/rftm/YOUER_CONFIG.py. ```python base = './rftm_50.py'

numclasses = YOURNUM_CLASSES # number of classes of your dataset

pass 'num_classes' to model settings

model = dict( roihead=dict( bboxhead=[ dict( type='Shared2FCBBoxHead', inchannels=256, fcoutchannels=1024, roifeatsize=7, numclasses=numclasses, bboxcoder=dict( type='DeltaXYWHBBoxCoder', targetmeans=[0., 0., 0., 0.], targetstds=[0.1, 0.1, 0.2, 0.2]), regclassagnostic=True, losscls=dict( type='CrossEntropyLoss', usesigmoid=False, lossweight=1.0), lossbbox=dict(type='SmoothL1Loss', beta=1.0, lossweight=1.0)), dict( type='Shared2FCBBoxHead', inchannels=256, fcoutchannels=1024, roifeatsize=7, numclasses=numclasses, bboxcoder=dict( type='DeltaXYWHBBoxCoder', targetmeans=[0., 0., 0., 0.], targetstds=[0.05, 0.05, 0.1, 0.1]), regclassagnostic=True, losscls=dict( type='CrossEntropyLoss', usesigmoid=False, lossweight=1.0), lossbbox=dict(type='SmoothL1Loss', beta=1.0, lossweight=1.0)), dict( type='Shared2FCBBoxHead', inchannels=256, fcoutchannels=1024, roifeatsize=7, numclasses=numclasses, bboxcoder=dict( type='DeltaXYWHBBoxCoder', targetmeans=[0., 0., 0., 0.], targetstds=[0.033, 0.033, 0.067, 0.067]), regclassagnostic=True, losscls=dict( type='CrossEntropyLoss', usesigmoid=False, lossweight=1.0), lossbbox=dict(type='SmoothL1Loss', beta=1.0, loss_weight=1.0)) ] ) )

customhooks = [ dict(type="NumClassCheckHook"), dict( type='OurHook', cfg='configs/rftm/YOURCONFIG.py', # config file path cp='checkpoints/cascadercnnr50_dfui.pth', # pre-trained weights priority=30) ]

imgprefixdfui='./data/dfui/images/' # DFUI image folder imgprefixurpctrain='YOURTRAINIMGPREIFIX' # training image folder imgprefixurpctest ='YOURTESTIMGPREFIX' # testing image folder

anntraindfui='./data/dfui/annotations/instancestrainval2017.json' # DFUI annotation anntrainurpc='YOURTRAINANNOTATION' # training annotation anntesturpc='YOURTEST_ANNOTATION' # testing annotation

classes = ('CLASSNAME1', 'CLASSNAME2') # tuple of class names

pass the above variables to data settings

data = dict( train=dict( dataset=dict( classes=classes, annfile=anntrainurpc, imgprefix=imgprefixurpctrain ), imgdfuiprefix=imgprefixdfui ), val=dict( dataset=dict( classes=classes, annfile=anntesturpc, imgprefix=imgprefixurpctest ) ), test= dataset=dict( classes=classes, annfile=anntesturpc, imgprefix=imgprefixurpc_test ) ) ```

Training

Run command: shell python tools/train.py configs/rftm/YOUR_CONFIG.py --work-dir <work_dir>

Notes: - Please refer to MMDetection's documentation for more information.

Citing

@article{Fu2022, title = {{Learning Heavily-Degraded Prior for Underwater Object Detection}}, author = {{Fu, Chenping and Fan, Xin and Xiao, Jiewen and Yuan, Wanqi and Liu, Risheng and Luo, Zhongxuan}}, journal = {{IEEE TCSVT}} }

Owner

Login: xiaoDetection
Kind: user

Repositories: 1
Profile: https://github.com/xiaoDetection

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - name: "MMDetection Contributors"
title: "OpenMMLab Detection Toolbox and Benchmark"
date-released: 2018-08-22
url: "https://github.com/open-mmlab/mmdetection"
license: Apache-2.0

GitHub Events

Total

Issues event: 7
Watch event: 10
Issue comment event: 4
Fork event: 1

Last Year

Issues event: 7
Watch event: 10
Issue comment event: 4
Fork event: 1

Issues and Pull Requests

Last synced: 11 months ago

All Time

Total issues: 2
Total pull requests: 0
Average time to close issues: about 1 month
Average time to close pull requests: N/A
Total issue authors: 2
Total pull request authors: 0
Average comments per issue: 1.0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 2
Pull requests: 0
Average time to close issues: about 1 month
Average time to close pull requests: N/A
Issue authors: 2
Pull request authors: 0
Average comments per issue: 1.0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

learning-heavily-degraded-prior

Science Score: 54.0%

Repository

Basic Info

Statistics

Metadata Files

README.md

Learning Heavily Degraded Piror

Introduction

Dataset and Training strategy

The DFUI dataset

HD and LD subsets

Two-stage training strategy

Results and Models

URPC2020

UODD

Usage

Installation

install mmcv-full

install mmdet

Inference

Training

Training with custom datasets

Perpare your data

Prepare a config

pass 'num_classes' to model settings

pass the above variables to data settings

Training

Citing

Owner

Citation (CITATION.cff)

GitHub Events

Total

Last Year

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels