https://github.com/1btu/nanodet
Science Score: 10.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
○codemeta.json file
-
○.zenodo.json file
-
○DOI references
-
✓Academic publication links
Links to: arxiv.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (11.0%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: 1BTU
- License: apache-2.0
- Language: C++
- Default Branch: main
- Size: 139 MB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
# NanoDet-Plus
**Super fast and high accuracy lightweight anchor-free object detection model. Real-time on mobile devices.**
[](https://img.shields.io/github/checks-status/RangiLyu/nanodet/main?label=CI&style=flat)

[](https://github.com/RangiLyu/nanodet/blob/main/LICENSE)
[](https://img.shields.io/github/downloads/RangiLyu/nanodet/total?color=yellow&label=Downloads&logo=github&logoColor=lightgrey&style=flat)
[](https://img.shields.io/github/v/release/RangiLyu/nanodet?style=flat)
- Super lightweight: Model file is only 980KB(INT8) or 1.8MB(FP16).
- Super fast: 97fps(10.23ms) on mobile ARM CPU.
- High accuracy: Up to 34.3 mAPval@0.5:0.95 and still realtime on CPU.
- Training friendly: Much lower GPU memory cost than other models. Batch-size=80 is available on GTX1060 6G.
- Easy to deploy: Support various backends including ncnn, MNN and OpenVINO. Also provide Android demo based on ncnn inference framework.
Introduction

NanoDet is a FCOS-style one-stage anchor-free object detection model which using Generalized Focal Loss as classification and regression loss.
In NanoDet-Plus, we propose a novel label assignment strategy with a simple assign guidance module (AGM) and a dynamic soft label assigner (DSLA) to solve the optimal label assignment problem in lightweight model training. We also introduce a light feature pyramid called Ghost-PAN to enhance multi-layer feature fusion. These improvements boost previous NanoDet's detection accuracy by 7 mAP on COCO dataset.
QQ908606542 ()
Benchmarks
Model |Resolution| mAPval
0.5:0.95 |CPU Latency
(i7-8700) |ARM Latency
(4xA76) | FLOPS | Params | Model Size
:-------------:|:--------:|:-------:|:--------------------:|:--------------------:|:----------:|:---------:|:-------:
NanoDet-m | 320320 | 20.6 | *4.98ms** | 10.23ms | 0.72G | 0.95M | 1.8MB(FP16) | 980KB(INT8)
NanoDet-Plus-m | 320320 | *27.0** | 5.25ms | 11.97ms | 0.9G | 1.17M | 2.3MB(FP16) | 1.2MB(INT8)
NanoDet-Plus-m | 416416 | *30.4** | 8.32ms | 19.77ms | 1.52G | 1.17M | 2.3MB(FP16) | 1.2MB(INT8)
NanoDet-Plus-m-1.5x | 320320 | *29.9** | 7.21ms | 15.90ms | 1.75G | 2.44M | 4.7MB(FP16) | 2.3MB(INT8)
NanoDet-Plus-m-1.5x | 416416 | *34.1** | 11.50ms | 25.49ms | 2.97G | 2.44M | 4.7MB(FP16) | 2.3MB(INT8)
YOLOv3-Tiny | 416416 | 16.6 | - | 37.6ms | 5.62G | 8.86M | 33.7MB
YOLOv4-Tiny | 416416 | 21.7 | - | 32.81ms | 6.96G | 6.06M | 23.0MB
YOLOX-Nano | 416416 | 25.8 | - | 23.08ms | 1.08G | 0.91M | 1.8MB(FP16)
YOLOv5-n | 640640 | 28.4 | - | 44.39ms | 4.5G | 1.9M | 3.8MB(FP16)
FBNetV5 | 320640 | 30.4 | - | - | 1.8G | - | -
MobileDet | 320320 | 25.6 | - | - | 0.9G | - | -
Download pre-trained models and find more models in Model Zoo or in Release Files
Notes (click to expand)
* ARM Performance is measured on Kirin 980(4xA76+4xA55) ARM CPU based on ncnn. You can test latency on your phone with [ncnn_android_benchmark](https://github.com/nihui/ncnn-android-benchmark). * Intel CPU Performance is measured Intel Core-i7-8700 based on OpenVINO. * NanoDet mAP(0.5:0.95) is validated on COCO val2017 dataset with no testing time augmentation. * YOLOv3&YOLOv4 mAP refers from [Scaled-YOLOv4: Scaling Cross Stage Partial Network](https://arxiv.org/abs/2011.08036).NEWS!!!
[2023.01.20] Upgrade to pytorch-lightning-1.9. The minimum PyTorch version is upgraded to 1.10. Support FP16 training(Thanks @crisp-snakey). Support ignore label(Thanks @zero0kiriyu).
[2022.08.26] Upgrade to pytorch-lightning-1.7. The minimum PyTorch version is upgraded to 1.9. To use previous version of PyTorch, please install NanoDet <= v1.0.0-alpha-1
[2021.12.25] NanoDet-Plus release! Adding AGM(Assign Guidance Module) & DSLA(Dynamic Soft Label Assigner) to improve 7 mAP with only a little cost.
Find more update notes in Update notes.
Demo
Android demo

Android demo project is in demoandroidncnn folder. Please refer to Android demo guide.
Here is a better implementation ncnn-android-nanodet
NCNN C++ demo
C++ demo based on ncnn is in demo_ncnn folder. Please refer to Cpp demo guide.
MNN demo
Inference using Alibaba's MNN framework is in demo_mnn folder. Please refer to MNN demo guide.
OpenVINO demo
Inference using OpenVINO is in demo_openvino folder. Please refer to OpenVINO demo guide.
Web browser demo
https://nihui.github.io/ncnn-webassembly-nanodet/
Pytorch demo
First, install requirements and setup NanoDet following installation guide. Then download COCO pretrain weight from here
The pre-trained weight was trained by the config config/nanodet-plus-m_416.yml.
- Inference images
bash
python demo/demo.py image --config CONFIG_PATH --model MODEL_PATH --path IMAGE_PATH
- Inference video
bash
python demo/demo.py video --config CONFIG_PATH --model MODEL_PATH --path VIDEO_PATH
- Inference webcam
bash
python demo/demo.py webcam --config CONFIG_PATH --model MODEL_PATH --camid YOUR_CAMERA_ID
Besides, We provide a notebook here to demonstrate how to make it work with PyTorch.
Install
Requirements
- Linux or MacOS
- CUDA >= 10.2
- Python >= 3.7
- Pytorch >= 1.10.0, <2.0.0
Step
- Create a conda virtual environment and then activate it.
shell script
conda create -n nanodet python=3.8 -y
conda activate nanodet
- Install pytorch
shell script
conda install pytorch torchvision cudatoolkit=11.1 -c pytorch -c conda-forge
- Clone this repository
shell script
git clone https://github.com/RangiLyu/nanodet.git
cd nanodet
- Install requirements
shell script
pip install -r requirements.txt
- Setup NanoDet
shell script python setup.py develop
Model Zoo
NanoDet supports variety of backbones. Go to the config folder to see the sample training config files.
Model | Backbone |Resolution|COCO mAP| FLOPS |Params | Pre-train weight | :--------------------:|:------------------:|:--------:|:------:|:-----:|:-----:|:-----:| NanoDet-m | ShuffleNetV2 1.0x | 320320 | 20.6 | 0.72G | 0.95M | Download | NanoDet-Plus-m-320 (NEW) | ShuffleNetV2 1.0x | 320320 | 27.0 | 0.9G | 1.17M | Weight | Checkpoint NanoDet-Plus-m-416 (NEW) | ShuffleNetV2 1.0x | 416416 | 30.4 | 1.52G | 1.17M | Weight | Checkpoint NanoDet-Plus-m-1.5x-320 (NEW)| ShuffleNetV2 1.5x | 320320 | 29.9 | 1.75G | 2.44M | Weight | Checkpoint NanoDet-Plus-m-1.5x-416 (NEW)| ShuffleNetV2 1.5x | 416*416 | 34.1 | 2.97G | 2.44M | Weight | Checkpoint
Notice: The difference between Weight and Checkpoint is the weight only provide params in inference time, but the checkpoint contains training time params.
Legacy Model Zoo
Model | Backbone |Resolution|COCO mAP| FLOPS |Params | Pre-train weight | :--------------------:|:------------------:|:--------:|:------:|:-----:|:-----:|:-----:| NanoDet-m-416 | ShuffleNetV2 1.0x | 416416 | 23.5 | 1.2G | 0.95M | Download| NanoDet-m-1.5x | ShuffleNetV2 1.5x | 320320 | 23.5 | 1.44G | 2.08M | Download | NanoDet-m-1.5x-416 | ShuffleNetV2 1.5x | 416416 | 26.8 | 2.42G | 2.08M | Download| NanoDet-m-0.5x | ShuffleNetV2 0.5x | 320320 | 13.5 | 0.3G | 0.28M | Download | NanoDet-t | ShuffleNetV2 1.0x | 320320 | 21.7 | 0.96G | 1.36M | Download | NanoDet-g | Custom CSP Net | 416416 | 22.9 | 4.2G | 3.81M | Download| NanoDet-EfficientLite | EfficientNet-Lite0 | 320320 | 24.7 | 1.72G | 3.11M | Download| NanoDet-EfficientLite | EfficientNet-Lite1 | 416416 | 30.3 | 4.06G | 4.01M | Download | NanoDet-EfficientLite | EfficientNet-Lite2 | 512512 | 32.6 | 7.12G | 4.71M | Download | NanoDet-RepVGG | RepVGG-A0 | 416416 | 27.8 | 11.3G | 6.75M | Download |
How to Train
Prepare dataset
If your dataset annotations are pascal voc xml format, refer to config/nanodetcustomxml_dataset.yml
Otherwise, if your dataset annotations are YOLO format (Darknet TXT), refer to config/nanodet-plus-m_416-yolo.yml
Or convert your dataset annotations to MS COCO format(COCO annotation format details).
Prepare config file
Copy and modify an example yml config file in config/ folder.
Change save_dir to where you want to save model.
Change num_classes in model->arch->head.
Change image path and annotation path in both data->train and data->val.
Set gpu ids, num workers and batch size in device to fit your device.
Set total_epochs, lr and lr_schedule according to your dataset and batchsize.
If you want to modify network, data augmentation or other things, please refer to Config File Detail
Start training
NanoDet is now using pytorch lightning for training.
For both single-GPU or multiple-GPUs, run:
shell script
python tools/train.py CONFIG_FILE_PATH
Visualize Logs
TensorBoard logs are saved in
save_dirwhich you set in config file.To visualize tensorboard logs, run:
shell script cd <YOUR_SAVE_DIR> tensorboard --logdir ./
How to Deploy
NanoDet provide multi-backend C++ demo including ncnn, OpenVINO and MNN. There is also an Android demo based on ncnn library.
Export model to ONNX
To convert NanoDet pytorch model to ncnn, you can choose this way: pytorch->onnx->ncnn
To export onnx model, run tools/export_onnx.py.
shell script
python tools/export_onnx.py --cfg_path ${CONFIG_PATH} --model_path ${PYTORCH_MODEL_PATH}
Run NanoDet in C++ with inference libraries
ncnn
Please refer to demo_ncnn.
OpenVINO
Please refer to demo_openvino.
MNN
Please refer to demo_mnn.
Run NanoDet on Android
Please refer to android_demo.
Citation
If you find this project useful in your research, please consider cite:
BibTeX
@misc{=nanodet,
title={NanoDet-Plus: Super fast and high accuracy lightweight anchor-free object detection model.},
author={RangiLyu},
howpublished = {\url{https://github.com/RangiLyu/nanodet}},
year={2021}
}
Thanks
https://github.com/Tencent/ncnn
https://github.com/open-mmlab/mmdetection
https://github.com/implus/GFocal
https://github.com/cmdbug/YOLOv5_NCNN
https://github.com/rbgirshick/yacs
Owner
- Name: leonming
- Login: 1BTU
- Kind: user
- Repositories: 1
- Profile: https://github.com/1BTU
GitHub Events
Total
Last Year
Dependencies
- actions/checkout v2 composite
- actions/setup-python v2 composite
- codecov/codecov-action v2 composite
- androidx.appcompat:appcompat 1.1.0 implementation
- androidx.camera:camera-camera2 ${camerax_version} implementation
- androidx.camera:camera-core ${camerax_version} implementation
- androidx.constraintlayout:constraintlayout 1.1.3 implementation
- com.android.support:multidex 1.0.3 implementation
- com.bm.photoview:library 1.4.1 implementation
- com.github.chrisbanes:PhotoView 2.3.0 implementation
- com.zxy.android:recovery 1.0.0 implementation
- junit:junit 4.12 testImplementation