hop
[ICCV 2023] Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
Science Score: 54.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
✓Academic publication links
Links to: arxiv.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (8.4%) to scientific vocabulary
Repository
[ICCV 2023] Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
Basic Info
- Host: GitHub
- Owner: Sense-X
- License: apache-2.0
- Language: Python
- Default Branch: main
- Size: 3.82 MB
Statistics
- Stars: 186
- Watchers: 7
- Forks: 12
- Open Issues: 10
- Releases: 1
Metadata Files
README.md
Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
This repo is the official implementation of "Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction" by Zhuofan Zong, Dongzhi Jiang, Guanglu Song, Zeyue Xue, Jingyong Su, Hongsheng Li, and Yu Liu.
News
- [07/25/2023] Code for HoP on BEVDet is released!
- [07/14/2023] HoP is accepted to ICCV 2023!
- [04/05/2023] HoP achieves new SOTA performance on nuScenes 3D detection leaderboard with 68.5 NDS and 62.4 mAP.
Model Zoo
Result on BEVDet4D-Depth
| model | backbone | pretrain | img size | Epoch | NDS | mAP | config | ckpt | log | | :----------------------: | :------: | :----------: | :------: | :---: | :----: | :----: | :----------------------------------------------------------: | :----------------------------------------------------------: | :----------------------------------------------------------: | | BEVDet4D-Depth(Baseline) | Res50 | ImageNet | 256x704 | 24 | 0.4930 | 0.3848 | cfg | ckpt | log | | HoP_BEVDet4D-Depth | Res50 | ImageNet | 256x704 | 24 | 0.5099 | 0.3990 | cfg | ckpt | log |
Get Started
Install
We train our models under the following environment:
python=3.6.9
pytorch=1.8.1
torchvision=0.9.1
cuda=11.2
Other versions may possibly be imcompatible.
We use MMDetection3D V1.0.0rc4, MMDetection V2.24.0 and MMCV V1.5.0. The source code of MMDetection3D has been included in this repo.
You can take the following steps to install packages above:
Build MMCV following official instructions.
Install MMDetection by
bash
pip install mmdet==2.24.0
- Copy HoP repo and install MMDetection3D.
bash
git clone git@github.com:Sense-X/HoP.git
cd HoP
pip install -e .
Data Preparation
Follow the steps to prepare nuScenes Dataset introduced in nuscenes_det.md and create the pkl by running:
bash
python tools/create_data_bevdet.py
Train HoP
```bash
single gpu
python tools/train.py configs/hopbevdet/hopbevdet4d-r50-depth.py
multiple gpu
./tools/disttrain.sh configs/hopbevdet/hopbevdet4d-r50-depth.py $numgpu ```
Eval HoP
```bash
single gpu
python tools/test.py configs/hopbevdet/hopbevdet4d-r50-depth.py $checkpoint --eval bbox
multiple gpu
./tools/disttest.sh configs/hopbevdet/hopbevdet4d-r50-depth.py $checkpoint $numgpu --eval bbox ```
Method

TODO
- [ ] Release code for HoP on BEVFormer.
Cite HoP
If you find this repository useful, please use the following BibTeX entry for citation.
latex
@misc{hop2023,
title={Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction},
author={Zhuofan Zong and Dongzhi Jiang and Guanglu Song and Zeyue Xue and Jingyong Su and Hongsheng Li and Yu Liu},
year={2023},
eprint={2304.00967},
archivePrefix={arXiv},
primaryClass={cs.CV}
}
License
This project is released under the MIT license. Please see the LICENSE file for more information.
Owner
- Name: SenseTime X-Lab
- Login: Sense-X
- Kind: user
- Location: Hong Kong
- Company: SenseTime Group Limited
- Website: sensetime.com
- Repositories: 9
- Profile: https://github.com/Sense-X
Powered by X-Lab, SenseTime Research
Citation (CITATION.cff)
cff-version: 1.2.0 message: "If you use this software, please cite it as below." authors: - name: "MMDetection3D Contributors" title: "OpenMMLab's Next-generation Platform for General 3D Object Detection" date-released: 2020-07-23 url: "https://github.com/open-mmlab/mmdetection3d" license: Apache-2.0
GitHub Events
Total
- Watch event: 12
- Fork event: 2
Last Year
- Watch event: 12
- Fork event: 2
Dependencies
- actions/checkout v2 composite
- actions/setup-python v2 composite
- codecov/codecov-action v1.0.10 composite
- codecov/codecov-action v2 composite
- actions/checkout v2 composite
- actions/setup-python v2 composite
- actions/checkout v2 composite
- actions/setup-python v1 composite
- actions/checkout v2 composite
- actions/setup-python v2 composite
- nvcr.io/nvidia/tensorrt 22.07-py3 build
- pytorch/pytorch ${PYTORCH}-cuda${CUDA}-cudnn${CUDNN}-devel build
- docutils ==0.16.0
- m2r *
- mistune ==0.8.4
- myst-parser *
- sphinx ==4.0.2
- sphinx-copybutton *
- sphinx_markdown_tables *
- mmcv-full >=1.4.8,<=1.6.0
- mmdet >=2.24.0,<=3.0.0
- mmsegmentation >=0.20.0,<=1.0.0
- open3d *
- spconv *
- waymo-open-dataset-tf-2-1-0 ==1.2.0
- mmcv >=1.4.8
- mmdet >=2.24.0
- mmsegmentation >=0.20.1
- torch *
- torchvision *
- lyft_dataset_sdk *
- networkx >=2.2,<2.3
- numba ==0.53.0
- numpy *
- nuscenes-devkit *
- plyfile *
- scikit-image *
- tensorboard *
- trimesh >=2.35.39,<2.35.40
- asynctest * test
- codecov * test
- flake8 * test
- interrogate * test
- isort * test
- kwarray * test
- pytest * test
- pytest-cov * test
- pytest-runner * test
- ubelt * test
- xdoctest >=0.10.0 test
- yapf * test