urbancardetector

Comparison of Deep Learning Models for Urban Car Detection

https://github.com/dangeospatial/urbancardetector

Science Score: 44.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
    Found .zenodo.json file
  • DOI references
  • Academic publication links
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (9.3%) to scientific vocabulary

Keywords

arcgis-enterprise arcpy deep-learning jupyter-notebook object-detection

Repository

Basic Info
  • Host: GitHub
  • Owner: DanGeospatial
  • License: gpl-3.0
  • Language: Jupyter Notebook
  • Default Branch: main
  • Size: 5.25 MB
Statistics
  • Stars: 1
  • Watchers: 1
  • Forks: 0
  • Open Issues: 0
  • Releases: 1
Created about 2 years ago · Last pushed almost 2 years ago
Metadata Files
Readme License Citation

README.md

UrbanCarDetector

Comparison of Deep Learning Models for Urban Car Detection


The goal of the project was to compare the performance of six object detection models for detecting cars in a complex urban environment. A total of 1,593 image tiles were created from RGB drone imagery at a spatial resolution of 3.65 cm. Some vehicles are clustered together while others are isolated, and some are partially covered by trees and shadows. The models tested all share a standard set of hyperparameters and the same backbone, except for YOLOv3, which uses darknet53, and MMDetection, which uses cascade_rcnn. The ArcGIS.Learn module from the ArcGIS Developer library was used for coding, with the PyTorch and fastai backends. Training object detection models is available within ArcGIS Pro, but it was necessary to use the Python libraries to ensure that the same training/testing split was used for evaluating each model. In addition to the model comparison, the resnet50 backbone was selected based on a grid search of the available resnet family of models (18-152) using FasterRCNN.
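The need for an identical training/testing split across models can be illustrated with a minimal sketch. It is not the notebook's actual code: the function name `split_tiles`, the 20% validation fraction, the seed and the tile file names are all assumptions made for illustration. The idea is simply that a seeded shuffle over a sorted list of tile names gives every model the same two sets.

```python
import random


def split_tiles(tile_ids, val_fraction=0.2, seed=42):
    """Return (train, val) lists that are identical for the same seed.

    val_fraction and seed are illustrative defaults, not project values.
    """
    ordered = sorted(tile_ids)      # sort first so input order does not matter
    rng = random.Random(seed)       # private RNG, global random state untouched
    rng.shuffle(ordered)
    n_val = round(len(ordered) * val_fraction)
    return ordered[n_val:], ordered[:n_val]


# 1,593 tiles as in the project; the file names are hypothetical.
tiles = [f"tile_{i:04d}.tif" for i in range(1593)]
train, val = split_tiles(tiles)
```

Calling `split_tiles` again with the same seed, even with the tiles listed in a different order, returns the same two lists, so each model is evaluated on exactly the same held-out tiles.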


Tile sizes of [256, 416, 512], strides of [half, quarter, 0] and a minimum polygon overlap ratio of 0.5 were compared when optimizing the tiles. A tile size of 416 provided approximately 98% of the average precision of 512 at only about 60% of the training time. For this reason, a tile size of 416 and a stride of 224 were chosen after comparing options using FasterRCNN. Larger tile sizes can provide more context about surrounding objects but require more GPU memory. The minimum polygon overlap ratio was used to try to eliminate tiles that contained only very small portions of vehicles or none at all.
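As a rough illustration of how tile size, stride and the minimum overlap ratio interact, the sketch below lays tiles along one image axis and drops labels that fall mostly outside a tile. It is a simplified stand-in, not the ArcGIS export tool: the helper names, the 1000-pixel image extent and the example boxes are hypothetical, stride is assumed to mean the step between tile origins, and labels are treated as axis-aligned boxes `(xmin, ymin, xmax, ymax)` rather than polygons.

```python
def tile_origins(extent, tile, stride):
    """Offsets of tile origins along one axis, stepping by `stride` pixels."""
    if stride <= 0:
        raise ValueError("stride must be positive in this sketch")
    return list(range(0, max(extent - tile, 0) + 1, stride))


def overlap_ratio(box, tile_box):
    """Fraction of the label box's area that lies inside the tile."""
    ix = min(box[2], tile_box[2]) - max(box[0], tile_box[0])
    iy = min(box[3], tile_box[3]) - max(box[1], tile_box[1])
    if ix <= 0 or iy <= 0:
        return 0.0                  # no intersection at all
    area = (box[2] - box[0]) * (box[3] - box[1])
    return (ix * iy) / area


def labels_for_tile(boxes, tile_box, min_ratio=0.5):
    """Keep only labels with at least `min_ratio` of their area in the tile."""
    return [b for b in boxes if overlap_ratio(b, tile_box) >= min_ratio]


# Tile size 416 and stride 224 come from the text; the extent is made up.
origins = tile_origins(extent=1000, tile=416, stride=224)

first_tile = (0, 0, 416, 416)
cars = [(380, 100, 420, 120),       # 90% inside the tile, kept
        (400, 100, 440, 120)]       # 40% inside the tile, dropped
kept = labels_for_tile(cars, first_tile)
```

A tile whose `labels_for_tile` result is empty contains no usable vehicle and could be skipped when building the training set.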

Overall, the DETReg model had the highest average precision score (0.955) of all the models compared. YOLOv3 (0.915), MMDetection (0.932) and FasterRCNN (0.9348) followed with very similar results. RetinaNet and SingleShotDetector produced substantially worse results than the other models.
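For readers unfamiliar with the metric, here is a minimal sketch of one common way to compute average precision for a single image. It is an assumption about the general technique, not a reproduction of how ArcGIS.Learn calculates its score: detections are ranked by confidence, each is greedily matched to the best unmatched ground-truth box at an IoU threshold of 0.5 (an assumed value), and precision is averaged over the recall steps. The boxes and scores are invented.

```python
def iou(a, b):
    """Intersection over union of two (xmin, ymin, xmax, ymax) boxes."""
    ix = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    iy = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = ix * iy
    union = ((a[2] - a[0]) * (a[3] - a[1])
             + (b[2] - b[0]) * (b[3] - b[1]) - inter)
    return inter / union if union > 0 else 0.0


def average_precision(detections, truths, iou_thresh=0.5):
    """detections: list of (score, box); truths: list of boxes."""
    matched = set()                 # indices of truths already claimed
    true_positives = 0
    ap_sum = 0.0
    ranked = sorted(detections, key=lambda d: d[0], reverse=True)
    for rank, (_, box) in enumerate(ranked, start=1):
        best_j, best_iou = None, iou_thresh
        for j, truth in enumerate(truths):
            if j in matched:
                continue
            value = iou(box, truth)
            if value >= best_iou:
                best_j, best_iou = j, value
        if best_j is not None:      # true positive: recall increases here
            matched.add(best_j)
            true_positives += 1
            ap_sum += true_positives / rank
    return ap_sum / len(truths) if truths else 0.0


# Invented example: two cars, three detections, one of them a false alarm.
truths = [(0, 0, 10, 10), (20, 20, 30, 30)]
detections = [(0.9, (0, 0, 10, 10)),
              (0.8, (50, 50, 60, 60)),
              (0.7, (21, 21, 31, 31))]
ap = average_precision(detections, truths)
```

In this example the false alarm ranked second lowers the precision at the second recall step to 2/3, so the score is (1 + 2/3) / 2.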


Next, when we look at the total run time to train each model, we get a very different picture. While DETReg had the highest average precision score, it also took the longest to train. SingleShotDetector performed poorly overall: it took more than 80 minutes to train and still produced a low average precision score. YOLOv3 and FasterRCNN both took less than 50 minutes to train on this dataset while producing average precision scores among the highest.


DETReg produced the highest average precision score but required the largest number of epochs, whereas FasterRCNN provided a balance between accuracy and processing time. Backbones deeper than resnet50 did not improve the model.

Issues:

Shadows/ghosting from moving cars were not patched out and could be influencing the precision. This is expected to be a minor effect because most cars are stationary in this setting.

Owner

  • Login: DanGeospatial
  • Kind: user
  • Location: Canada

Citation (CITATION.cff)

cff-version: 1.2.0
message: "If you use this software, please cite it as below."
authors:
  - family-names: "Nelson"
    given-names: "Daniel"
    orcid: "https://orcid.org/0009-0005-5200-6652"
title: "UrbanCarDetector (v. 1.0.0)"
version: 1.0.0
date-released: 2024-04-02
url: "https://github.com/DanGeospatial/UrbanCarDetector"
