Science Score: 54.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: arxiv.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (5.4%) to scientific vocabulary
Last synced: 6 months ago

Repository

Basic Info
  • Host: GitHub
  • Owner: YinminZhang
  • License: apache-2.0
  • Language: Python
  • Default Branch: dev2.0
  • Size: 15.9 MB
Statistics
  • Stars: 9
  • Watchers: 1
  • Forks: 3
  • Open Issues: 1
  • Releases: 0
Created about 3 years ago · Last pushed almost 3 years ago
Metadata Files
Readme · Contributing · License · Code of conduct · Citation

README.md

BEVDet

Illustrating the performance of the proposed BEVDet on the nuScenes val set

News

  • 2023.01.12 Support TensorRT-INT8.
  • 2022.11.24 A new branch of bevdet codebase, dubbed dev2.0, is released. dev2.0 includes the following features:
  1. support BEVPoolv2, whose inference speed is up to 15.1 times that of the previous fastest implementation of the Lift-Splat-Shoot view transformer, while consuming far less memory.
  2. use the origin of the ego coordinate system, instead of the LiDAR's, as the center of the receptive field.
  3. support conversion of BEVDet from PyTorch to TensorRT.
  4. use long-term temporal fusion, as in SOLOFusion.
  5. train models without CBGS by default.
  6. use key frames for temporal fusion.
  7. Technical report on BEVPoolv2 in English and a blog post in Chinese.
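The BEVPoolv2 speedup above concerns the view-transformer pooling step, which sums frustum features into BEV grid cells. A minimal NumPy sketch of that pooling idea (illustrative only: `bev_pool` and the precomputed `bev_idx` are assumptions here, not BEVDet's API, and the real BEVPoolv2 is a fused CUDA kernel that avoids materializing the frustum feature tensor):

```python
import numpy as np

def bev_pool(feats, bev_idx, num_cells):
    """Scatter-sum frustum features into flattened BEV cells.

    feats:    (N, C) features of N frustum points
    bev_idx:  (N,) flattened BEV cell index for each point
              (precomputed from camera geometry, as in BEVPoolv2)
    """
    out = np.zeros((num_cells, feats.shape[1]), dtype=feats.dtype)
    np.add.at(out, bev_idx, feats)  # unbuffered accumulate per cell
    return out

# toy example: 4 frustum points, 2 channels, 3 BEV cells
feats = np.array([[1., 0.], [2., 1.], [3., 1.], [4., 0.]])
bev_idx = np.array([0, 0, 2, 2])
pooled = bev_pool(feats, bev_idx, 3)  # cells 0 and 2 each sum two points
```

Precomputing `bev_idx` once per camera rig is what lets the pooling reduce to a single scatter-add at inference time.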

Main Results

| Config | mAP | NDS | Latency (ms) | FPS | Model | Log |
| ------------------------ | ---------- | ---------- | -------------- | ---- | ------ | ------ |
| BEVDet-R50 | 27.8 | 32.2 | 37.0/15.9/52.9 | 18.9 | google | google |
| BEVDet-R50-CBGS | 30.7 | 38.2 | 37.0/15.0/52.0 | 19.2 | google | google |
| BEVDet-R50-4D-CBGS | 34.9/35.4# | 48.4/48.6# | 35.8/15.2/51.0 | 19.6 | google | google |
| BEVDet-R50-4D-Depth-CBGS | 40.2/40.6# | 52.3/52.6# | 46.0/14.2/60.2 | 16.6 | google | google |

# aligns the previous-frame BEV feature during the view transformation.

Latency is reported as network / post-processing / total, in ms.
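The alignment marked with # can be pictured as warping the previous frame's BEV feature map by the ego-motion transform before temporal fusion: each current-frame BEV cell center is mapped into the previous frame's coordinates and sampled there. A hedged NumPy sketch under simplifying assumptions (2D rigid motion, nearest-neighbour sampling, grid origin at a corner; `align_prev_bev` and `curr2prev` are illustrative names, not BEVDet's API):

```python
import numpy as np

def align_prev_bev(prev_bev, curr2prev, cell_size):
    """Warp a previous-frame BEV feature map into the current ego frame.

    prev_bev:  (H, W, C) previous BEV feature map
    curr2prev: (3, 3) homogeneous 2D transform mapping current-frame
               BEV coordinates into the previous ego frame
    cell_size: metres per BEV cell
    """
    H, W, C = prev_bev.shape
    ys, xs = np.meshgrid(np.arange(H), np.arange(W), indexing="ij")
    # current-frame cell centers in metric coordinates (homogeneous)
    pts = np.stack([xs * cell_size, ys * cell_size,
                    np.ones_like(xs, dtype=float)], axis=-1)
    prev = pts @ curr2prev.T                       # map into previous frame
    px = np.round(prev[..., 0] / cell_size).astype(int)
    py = np.round(prev[..., 1] / cell_size).astype(int)
    valid = (px >= 0) & (px < W) & (py >= 0) & (py < H)
    out = np.zeros_like(prev_bev)
    out[valid] = prev_bev[py[valid], px[valid]]    # nearest-neighbour sample
    return out

# toy example: ego moved one cell forward in x between frames
T = np.array([[1., 0., 1.],   # shift x by one cell (cell_size = 1.0)
              [0., 1., 0.],
              [0., 0., 1.]])
prev_bev = np.arange(8.0).reshape(2, 2, 2)
aligned = align_prev_bev(prev_bev, T, cell_size=1.0)
```

Cells that fall outside the previous grid are zero-filled; a real implementation would use bilinear sampling and the full SE(2)/SE(3) ego pose.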

Inference speed with different backends

| Backend | 256x704 | 384x1056 | 512x1408 | 640x1760 |
| ------------- | ------- | -------- | -------- | -------- |
| PyTorch | 37.9 | 64.7 | 105.7 | 154.2 |
| TensorRT | 18.4 | 25.9 | 40.0 | 58.3 |
| TensorRT-FP16 | 7.2 | 10.6 | 15.3 | 21.2 |
| TensorRT-INT8 | 4.4 | 5.8 | 8.2 | 11.0 |

  • Evaluated with BEVDet-R50 on an RTX 3090 GPU (latency in ms). Post-processing, which takes about 14.3 ms with the PyTorch backend, is omitted.

Get Started

Installation and Data Preparation

  1. Please refer to getting_started.md for installing BEVDet as mmdetection3d. Docker is recommended for environment preparation.
  2. Prepare nuScenes dataset as introduced in nuscenes_det.md and create the pkl for BEVDet by running:

```shell
python tools/create_data_bevdet.py
```

Estimate the inference speed of BEVDet

```shell
# with pre-computation acceleration
python tools/analysis_tools/benchmark.py $config $checkpoint --fuse-conv-bn

# 4D with pre-computation acceleration
python tools/analysis_tools/benchmark_sequential.py $config $checkpoint --fuse-conv-bn

# view transformer only
python tools/analysis_tools/benchmark_view_transformer.py $config $checkpoint
```

Estimate the flops of BEVDet

```shell
python tools/analysis_tools/get_flops.py configs/bevdet/bevdet-r50.py --shape 256 704
```

Visualize the predicted result.

  • Private implementation (visualization can be run remotely or locally):

```shell
python tools/test.py $config $checkpoint --format-only --eval-options jsonfile_prefix=$savepath
python tools/analysis_tools/vis.py $savepath/pts_bbox/results_nusc.json
```

Convert to TensorRT and test inference speed.

  1. Install mmdeploy from https://github.com/HuangJunJie2017/mmdeploy
  2. Convert to TensorRT:

```shell
python tools/convert_bevdet_to_TRT.py $config $checkpoint $work_dir --fuse-conv-bn --fp16 --int8
```

  3. Test inference speed:

```shell
python tools/analysis_tools/benchmark_trt.py $config $engine
```

Acknowledgement

This project would not be possible without several great open-source codebases; we list some notable examples below.

Besides, some other attractive works extend the boundary of BEVDet.

Bibtex

If this work is helpful for your research, please consider citing the following BibTeX entries.

```
@article{huang2022bevpoolv2,
  title={BEVPoolv2: A Cutting-edge Implementation of BEVDet Toward Deployment},
  author={Huang, Junjie and Huang, Guan},
  journal={arXiv preprint arXiv:2211.17111},
  year={2022}
}

@article{huang2022bevdet4d,
  title={BEVDet4D: Exploit Temporal Cues in Multi-camera 3D Object Detection},
  author={Huang, Junjie and Huang, Guan},
  journal={arXiv preprint arXiv:2203.17054},
  year={2022}
}

@article{huang2021bevdet,
  title={BEVDet: High-performance Multi-camera 3D Object Detection in Bird-Eye-View},
  author={Huang, Junjie and Huang, Guan and Zhu, Zheng and Yun, Ye and Du, Dalong},
  journal={arXiv preprint arXiv:2112.11790},
  year={2021}
}
```

Owner

  • Name: Yinmin.Zhang
  • Login: YinminZhang
  • Kind: user
  • Location: Dalian University of Technology, Dalian, China

student

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - name: "MMDetection3D Contributors"
title: "OpenMMLab's Next-generation Platform for General 3D Object Detection"
date-released: 2020-07-23
url: "https://github.com/open-mmlab/mmdetection3d"
license: Apache-2.0

GitHub Events

Total
  • Watch event: 2
Last Year
  • Watch event: 2