cfpt
Cross-Layer Feature Pyramid Transformer for Small Object Detection in Aerial Images
Science Score: 54.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
✓Academic publication links
Links to: arxiv.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (9.8%) to scientific vocabulary
Repository
Cross-Layer Feature Pyramid Transformer for Small Object Detection in Aerial Images
Basic Info
Statistics
- Stars: 59
- Watchers: 5
- Forks: 1
- Open Issues: 3
- Releases: 0
Metadata Files
README.md
Cross-Layer Feature Pyramid Transformer for Small Object Detection in Aerial Images
This repository provides the official PyTorch implementation of CFPT.
In this paper, we propose the cross-layer feature pyramid transformer designed for small object detection in aerial images.
Below is the performance comparison with other feature pyramid networks based on RetinaNet on the VisDrone-2019 DET dataset.
The architecture of CFPT is as described below.
Weights
Due to the accidental deletion of the model weights prepared for this paper, we retrained the entire network, resulting in slight differences in performance metrics compared to the original study. Model | AP | Log | Link1 | Link2 | --- |:---:|:---:|:---:|:---: retinanetr18cfpt | 20.0 | Log | BaiduNetDisk | GoogleDrive retinanetr50cfpt | 22.4 | Log | BaiduNetDisk | GoogleDrive retinanetr101cfpt | 22.6 | Log | BaiduNetDisk | GoogleDrive
Installation
Our experiments are based on torch 1.10+cu113, mmdet 2.24.1 and mmcv-full 1.6.0.
Please see get_started.md for the basic usage of MMDetection.
Install PyTorch.
Install mmcv-full and MMDetection toolbox.
bash pip install openmim mim install mmcv-full==1.6.0Install albumentations and other packages.
bash pip install einops pip install timm pip install yapf==0.40.1 pip install albumentations==1.1.0Clone and install this repository.
bash git clone https://github.com/duzw9311/CFPT.git cd ./CFPT pip install -e .
Usage
Data Preparation
Download the VisDrone2019-DET dataset converted to COCO annotation format. You can download it from this link.
Training
bash
python tools/train.py configs/CFPT/retinanet_r18_cfpt_1x_visdrone.py
Testing
bash
python tools/test.py configs/CFPT/retinanet_r18_cfpt_1x_visdrone.py work_dirs/retinanet_r18_cfpt_1x_visdrone/latest.pth --eval bbox
Acknowledgement
This repository is built upon the MMDetection library. Thanks to the authors of CEASC and other researchers in the field of object detection for their open-source code.
Citation
If you find this paper helpful for your project, we'd appreciate it if you could cite it.
@article{du2024cross,
title={Cross-Layer Feature Pyramid Transformer for Small Object Detection in Aerial Images},
author={Du, Zewen and Hu, Zhenjiang and Zhao, Guiyu and Jin, Ying and Ma, Hongbin},
journal={arXiv preprint arXiv:2407.19696},
year={2024}
}
Owner
- Name: duzw1
- Login: duzw9311
- Kind: user
- Repositories: 1
- Profile: https://github.com/duzw9311
Citation (CITATION.cff)
cff-version: 1.2.0 message: "If you use this software, please cite it as below." authors: - name: "MMDetection Contributors" title: "OpenMMLab Detection Toolbox and Benchmark" date-released: 2018-08-22 url: "https://github.com/open-mmlab/mmdetection" license: Apache-2.0
GitHub Events
Total
- Issues event: 3
- Watch event: 23
- Issue comment event: 3
Last Year
- Issues event: 3
- Watch event: 23
- Issue comment event: 3