dabnet

[TNNLS] Official implementation of 'Real-time Semantic Segmentation via Densely Aggregated Bilateral Network', in Pytorch.

https://github.com/isyangshu/dabnet

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

✓
CITATION.cff file
Found CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (15.4%) to scientific vocabulary

Last synced: 6 months ago · JSON representation ·

Repository

[TNNLS] Official implementation of 'Real-time Semantic Segmentation via Densely Aggregated Bilateral Network', in Pytorch.

Basic Info

Host: GitHub
Owner: isyangshu
License: apache-2.0
Language: Python
Default Branch: main
Homepage:
Size: 7.79 MB

Statistics

Stars: 7
Watchers: 1
Forks: 3
Open Issues: 1
Releases: 0

Created about 4 years ago · Last pushed over 2 years ago

Metadata Files

Readme License Citation

Real-time Semantic Segmentation via Densely Aggregated Bilateral Network (DABNet) (Accepted by TNNLS)

GitHub License GitHub last commit GitHub issues

Note： I have executed the relevant instructions on docker myself to ensure that the code can be run correctly. I gave the training_log, with ResNet-18 as backbone and Patch size 2. (2022-03-02) I gave the training_log, with ResNet-18 as backbone and Patch size 8. (2022-03-10)

Installation

Our project is developed based on MMsegmentation. MMSegmentation is an open source semantic segmentation toolbox based on PyTorch. Please refer to get_started.md for installation and dataset_prepare.md for dataset preparation.

Documentation: https://mmsegmentation.readthedocs.io/

English | 简体中文

text Due to my unfamiliarity with github, I may make mistakes (so you can use MMsegmentation and just copy my code). At the same time, I usually use "BiseNetV1_res" as my network name during the experiment (because DABNet is indeed improved based on BiseNetV1). Although I made changes before uploading, there may still be some naming conflicts (eg BiseNetV1_Res vs DABNet, HFA vs PCE). I used the naming scheme "BiseNetV1_Res" in my public logs, hope it doesn't confuse you.

Enviroment

Benchmark

```shell conda create -n open-mmlab python=3.7 -y conda activate open-mmlab

conda install pytorch=1.6.0 torchvision cudatoolkit=10.1 -c pytorch -y pip install mmcv-full -f https://download.openmmlab.com/mmcv/dist/cu101/torch1.6.0/index.html git clone https://github.com/isyangshu/DABNet.git cd DABNet pip install -e . # or "python setup.py develop" pip install -e .[all]

mkdir data ```

Additional Dependencies

shell pip install einops # used for transpose pip install timm==0.4.12 # used for Efficientnet_B1

In order to change the BatchNorm in the Efficientnet to SydncBatchNorm, we need to modify the relevant code in timm/models/efficientnet.py:

norm_layer=kwargs.pop('norm_layer', None) or partial(nn.BatchNorm2d, **resolve_bn_args(kwargs)),

For more details, please refer to EfficientNet.

TensorRT

TensorRT needs a version that strictly matches Cuda and Cudnn. Refer to TensorRT to deploy the appropriate Cuda, Cudnn, TensorRT versions, and make sure their versions match MMCV, MMsegmentation. For the specific installation method and version selection, please refer to MMSegmentation.

We simulate inference and measure inference speed (FPS) on NVIDIA GTX 1080Ti GPU with CUDA 10.2, CUDNN 8.0.4, and TensorRT 7.2.1.6. （Just need to configure according to the above method on the machine with GTX1080ti, pay attention to modify the pytorch and mmcv corresponding to the cuda version.）

Datasets

Cityscapes

The data could be found hear after registration.

You can also test Cityscapes results in the same website.

By convention, **labelTrainIds.png are used for cityscapes training. We provided a scripts based on cityscapesscripts to generate **labelTrainIds.png.

```shell

--nproc means 8 process for conversion, which could be omitted as well.

python tools/convert_datasets/cityscapes.py data/cityscapes --nproc 8

```

COCO-10k

```shell

download

mkdir cocostuff10k && cd cocostuff10k wget http://calvin.inf.ed.ac.uk/wp-content/uploads/data/cocostuffdataset/cocostuff-10k-v1.1.zip

unzip

unzip cocostuff-10k-v1.1.zip

--nproc means 8 process for conversion, which could be omitted as well.

python tools/convertdatasets/cocostuff10k.py /path/to/coco_stuff10k --nproc 8 ```

COCO-164k

```shell

download

mkdir cocostuff164k && cd cocostuff164k wget http://images.cocodataset.org/zips/train2017.zip wget http://images.cocodataset.org/zips/val2017.zip wget http://calvin.inf.ed.ac.uk/wp-content/uploads/data/cocostuffdataset/stuffthingmaps_trainval2017.zip

unzip

unzip train2017.zip -d images/ unzip val2017.zip -d images/ unzip stuffthingmaps_trainval2017.zip -d annotations/

--nproc means 8 process for conversion, which could be omitted as well.

python tools/convertdatasets/cocostuff164k.py /path/to/coco_stuff164k --nproc 8 ```

CamVid

The images have a resolution of 960 × 720 and 32 semantic categories, in which the subset of 11 classes are used for segmentation experiments. The original data could be found hear.

The annotations should be label index for mmsegmentation. If each mask is a specific class label for camvid, you should fuse all the masks of an image to generate the label index annotations. Use label index to represent the categories in the annotations.

I provide a script /tools/convert_datasets/camvid.py to convert the Camvid dataset, note that you need to change the path (train/val/test).

This code runs a little slow, you can get the data directly from CamVid-baidu (1snk). TrainID indicates available annotations.

Training and Testing

Please see train.md and inference.md for the basic usage of MMSegmentation. There are also tutorials for customizing dataset, designing data pipeline, customizing modules, and customizing runtime. MMSegmentation also provides many training tricks for better training and useful tools for deployment.

Train

```shell ./tools/disttrain.sh ${configs} ${GPU Nums} nohup ./tools/disttrain.sh ${configs} ${GPU Nums} 2>&1 &

For example, train a DABNet-Resnet18 on Cityscapes dataset with 4 GPUs

nohup ./tools/disttrain.sh configs/dabnet/dabnetr18-d32in1k-pre4x81024x102480k_cityscapes.py 4 2>&1 & ```

More detail, please refer to Train Doc.

Test

Cityscapes val set ```shell

Test mIoU for Cityscapes

python tools/test.py ${configs} ${checkpoints} --eval mIoU ```
Cityscapes test set
- Test DABNet on cityscapes test split with 4 GPUs, and generate the png files to be submit to the official evaluation server.
- First, add following to config file ${configs},
data = dict( test=dict( imgdir='leftImg8bit/test', anndir='gtFine/test')) * Then run test. shell ./tools/dist_test.sh ${configs} ${checkpoints} 4 --format-only --eval-options "imgfile_prefix=./test_results" * zip and submit test_results

More detail, please refer to Test Doc.

Latency

Different from the code of MMsegmentation, we refer to FasterSeg and STDC to implement tools for testing speed.

If you have successfully installed TensorRT, you will automatically use TensorRT for the following latency tests (see function here).
Otherwise you will be switched to use Pytorch for the latency tests (see function here).
I fine-tune related code to fit the MMsegmentation to measure latency. For more details, see tools/print_latency.py.

shell CUDA_VISIBLE_DEVICES=0 python ./tools/print_latency.py ${configs} ${checkpoints}

text In practice, using `torch.backends.cudnn.benchmark = True` may cause out of memory at some resolutions. Meanwhile, using `torch.backends.cudnn.benchmark = False` does not affect the measurement.

Results

I need more time to train and test.

Please wait.

Cityscapes

| Method | Crop Size | Inference Size | Batch size | iteration | set | val mIoU | test mIoU | FPS | model | config | | --------------- | --------- | ---------------- | ---------- | --------- | ---- | ----- | ----- |----- |------------------------------------------------------------ | ------------------------------------------------------------ | | DABNetRP1 | 1024x1024 | 2048x1024 | 8 * 4 | 80k | val | 77.4 | - | 33.5 | - | config | | DABNetRP2 | 1024x1024 | 2048x1024 | 8 * 4 | 80k | val | 77.8 | 76.1 | 34.9 | baidu (2t0o) | config | | DABNetRP4 | 1024x1024 | 2048x1024 | 8 * 4 | 80k | val | 78.1 | - | 34.7 | - | config | | DABNetRP8 | 1024x1024 | 2048x1024 | 8 * 4 | 80k | val | 78.8 | 77.3 | 31.1 | - | config | | DABNetRP16 | 1024x1024 | 2048x1024 | 8 * 4 | 80k | val | 78.4 | - | 21.0 | - | config | | DABNetE | 1024x1024 | 2048x1024 | 4 * 4 | 80k | val | 78.7 | 77.5 | 20.3 | baidu (2t0o) | config | | DABNetRP1 | 1024x1024 | 2048x1024 | 8 * 4 | 80k | train+val | - | - | - | - | - | | DABNetRP2 | 1024x1024 | 2048x1024 | 8 * 4 | 80k | train+val | - | - | - | - | - | | DABNetRP4 | 1024x1024 | 2048x1024 | 8 * 4 | 80k | train+val | - | - | - | - | - | | DABNetRP8 | 1024x1024 | 2048x1024 | 8 * 4 | 80k | trian+val | - | - | - | - | - | | DABNetRP16 | 1024x1024 | 2048x1024 | 8 * 4 | 80k | train+val | - | - | - | - | - | | DABNetE | 1024x1024 | 2048x1024 | 4 * 4 | 80k | trian+val | - | - | - | - | - |

COCO-10k

| Method | Crop Size | Inference Size | Batch size | iteration | mIoU | pixACC | FPS | model | config | | --------------- | --------- | ---------------- | ---------- | --------- | ---- | ----- | ----- |------------------------------------------------------------ | ------------------------------------------------------------ | | DABNetRP2 | 512x512 | 640x640 | 8 * 4 | 80k | 29.5 | 63.1 | 104.6 | baidu (2t0o) | config | | DABNet_E | 512x512 | 640x640 | 8 * 4 | 80k | 32.4 | 65.5 | 70.8 | baidu (2t0o) | config |

CamVid

| Method | Crop Size | Inference Size | Batch size | iteration | set | mIoU | FPS | config | | --------------- | --------- | ---------------- | ---------- | --------- | ---- | ----- |------------------------------------------------------------ | ------------------------------------------------------------ | | DABNetRP2 | 960x768 | 960x768 | 2 * 4 | 10k | train+val | 74.3 | 92.2 | config | | DABNet_E | 960x768 | 960x768 | 2 * 4 | 10k | train+val | 76.5 | 56.5 | config |

Pretrained weights

You can get the pretrained weights directly from baidu (2t0o).

Cityscapes test results

You can get the pretrained weights directly from baidu (vq1n).

Owner

Name: yangshua
Login: isyangshu
Kind: user

Repositories: 1
Profile: https://github.com/isyangshu

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - name: "MMSegmentation Contributors"
title: "OpenMMLab Semantic Segmentation Toolbox and Benchmark"
date-released: 2020-07-10
url: "https://github.com/open-mmlab/mmsegmentation"
license: Apache-2.0

GitHub Events

Total

Watch event: 3

Last Year

Watch event: 3

Dependencies

docker/Dockerfile docker

pytorch/pytorch ${PYTORCH}-cuda${CUDA}-cudnn${CUDNN}-devel build

docker/serve/Dockerfile docker

pytorch/pytorch ${PYTORCH}-cuda${CUDA}-cudnn${CUDNN}-devel build

requirements/docs.txt pypi

docutils ==0.16.0
myst-parser *
sphinx ==4.0.2
sphinx_copybutton *
sphinx_markdown_tables *

requirements/mminstall.txt pypi

mmcv-full >=1.3.1,<=1.4.0

requirements/optional.txt pypi

cityscapesscripts *

requirements/readthedocs.txt pypi

mmcv *
prettytable *
torch *
torchvision *

requirements/runtime.txt pypi

matplotlib *
numpy *
packaging *
prettytable *

requirements/tests.txt pypi

codecov * test
flake8 * test
interrogate * test
isort ==4.3.21 test
pytest * test
xdoctest >=0.10.0 test
yapf * test

requirements.txt pypi

setup.py pypi

dabnet

Science Score: 44.0%

Repository

Basic Info

Statistics

Metadata Files

README.md

Real-time Semantic Segmentation via Densely Aggregated Bilateral Network (DABNet) (Accepted by TNNLS)

Installation

Enviroment

Datasets

Cityscapes

--nproc means 8 process for conversion, which could be omitted as well.

COCO-10k

download

unzip

--nproc means 8 process for conversion, which could be omitted as well.

COCO-164k

download

unzip

--nproc means 8 process for conversion, which could be omitted as well.

CamVid

Training and Testing

Train

For example, train a DABNet-Resnet18 on Cityscapes dataset with 4 GPUs

Test

Test mIoU for Cityscapes

Latency

Results

Cityscapes

COCO-10k

CamVid

Pretrained weights

Cityscapes test results

Owner

Citation (CITATION.cff)

GitHub Events

Total

Last Year

Dependencies