AddaxAI
AddaxAI: A no-code platform to train and deploy custom YOLOv5 object detection models - Published in JOSS (2023)
OfflineMOT
OfflineMOT: A Python package for multiple objects detection and tracking from bird view stationary drone videos - Published in JOSS (2022)
Logodetect
Logodetect: One-shot detection of logos in image and video data - Published in JOSS (2022)
sahi
Framework agnostic sliced/tiled inference + interactive ui + error analysis plots
OmniTrax
OmniTrax: A deep learning-driven multi-animal tracking and pose-estimation add-on for Blender - Published in JOSS (2024)
albumentations
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
rf-detr
RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO and designed for fine-tuning.
inference
Turn any computer or edge device into a command center for your computer vision projects.
catalyst
Accelerated deep learning R&D
rastervision
An open source library and framework for deep learning on satellite and aerial imagery.
torchdistill
A coding-free framework built on PyTorch for reproducible deep learning studies. PyTorch Ecosystem. 🏆26 knowledge distillation methods presented at CVPR, ICLR, ECCV, NeurIPS, ICCV, etc. are implemented so far. 🎁 Trained models, training logs and configurations are available for ensuring reproducibility and benchmarking.
https://github.com/cheind/py-motmetrics
📊 Benchmark multiple object trackers (MOT) in Python
mxnet.sharp
.NET Standard bindings for Apache MXNet with Imperative, Symbolic and Gluon interfaces for developing, training and deploying machine learning models in C#. https://mxnet.tech-quantum.com/
grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
geo-trax
🚀 Geo-trax is a comprehensive pipeline for extracting and analyzing high-accuracy georeferenced vehicle trajectories from quasi-stationary, bird’s-eye view drone footage. Using advanced computer vision and deep learning, it enables detailed urban traffic analysis and supports scalable, precise studies of vehicle dynamics.
sc2-benchmark
[TMLR] "SC2 Benchmark: Supervised Compression for Split Computing"
pylocron
PyTorch implementations of recent Computer Vision tricks (ReXNet, RepVGG, Unet3p, YOLOv4, CIoU loss, AdaBelief, PolyLoss, MobileOne). Other additions: AdEMAMix
autodistill
Images to inference with no labeling (use foundation models to train supervised models).
cvat
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
x-anylabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
vitdet
Unofficial implementation for [ECCV'22] "Exploring Plain Vision Transformer Backbones for Object Detection"
rf100-vl
Code from the paper "Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models"
https://github.com/aim-uofa/adelaidet
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
rasberry_perception
General purpose ROS package for using deep learning/object detection frameworks on robots
skellytracker
A pose estimation and object detection/tracking backend for freemocap.
https://github.com/albumentations-team/albumentationsx
Next-generation Albumentations: dual-licensed for open-source and commercial use
hugsvision
HugsVision is an easy-to-use Hugging Face wrapper for state-of-the-art computer vision
targetran
Python library for data augmentation in object detection or image classification model training
cppe5
Code for our paper CPPE - 5 (Medical Personal Protective Equipment), a new challenging object detection dataset
seglight
Super fast, real-time semantic segmentation (CPU only); can be used on a single-core CPU
lrv-instruction
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
kso
Notebooks to upload/download marine footage, connect to a citizen science project, train machine learning models and publish marine biological observations.
mish
Official Repository for "Mish: A Self Regularized Non-Monotonic Neural Activation Function" [BMVC 2020]
orca
ORCA: Oceanic Recognition & Classification Application for sea-life analysis systems.
peoplesanspeople
Unity's privacy-preserving human-centric synthetic data generator
promptdet
PromptDet: Towards Open-vocabulary Detection using Uncurated Images, ECCV2022
detect-waste
AI to Combat Environmental Pollution - detecting plastic waste in the environment to combat environmental pollution and promote circular economy (Deep Learning, PyTorch)
catnet
🛰️ Learning to Aggregate Multi-Scale Context for Instance Segmentation in Remote Sensing Images (TNNLS 2024)
quickai
QuickAI is a Python library that makes it extremely easy to experiment with state-of-the-art Machine Learning models.
https://github.com/rybakov-ks/particleanalyzer
A Computer Vision-based tool for automatic segmentation and size analysis of particles in Scanning Electron Microscope (SEM) images.
https://github.com/brainglobe/cellfinder-napari
Efficient cell detection in large images using cellfinder in napari
https://github.com/amazon-science/bigdetection
BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
https://github.com/bencardoen/specht.jl
SPECHT is a Julia implementation of a contrastive weakly supervised object detection and identification method for fluorescence microscopy.
https://github.com/alan-turing-institute/grace
Graph Representation Analysis for Connected Embeddings
https://github.com/braun-steven/dafne
Code for our paper "DAFNe: A One-Stage Anchor-Free Deep Model for Oriented Object Detection".
https://github.com/ammarlodhi255/crack-informed-yolov9-for-multi-locale-bone-fracture-detection
This repository contains the implementation code of the paper "Small Data, Big Impact: A Multi-Locale Bone Fracture Detection on an Extremely Limited Dataset Via Crack-Informed YOLOv9 Variants"
https://github.com/broadinstitute/keras-rcnn
Keras package for region-based convolutional neural networks (RCNNs)
https://github.com/autodistill/autodistill-owlv2
OWLv2 base model for use with Autodistill.
pyro-vision
Computer vision library for wildfire detection 🌲 Deep learning models in PyTorch & ONNX for inference on edge devices (e.g. Raspberry Pi)
https://github.com/autodistill/autodistill-transformers
Use object detection models in Hugging Face Transformers to automatically label data to train a fine-tuned model.
Photovoltaic_Fault_Detector
Photovoltaic fault detector based on the YOLOv3 detection model. This repository contains four detector models with their weights and an explanation of how to use them.
https://github.com/bencevans/megadetector-contained
Docker image for running the MegaDetector v4 camera-trap object detection model.
https://github.com/nsembleai/nsvision
nsvision is a library for image data pre- and post-processing and data augmentation. It provides utilities for working with image data.
udmdet_tii
[TII 2024] An official implementation of "A Novel Underwater Detection Method for Ambiguous Object Finding via Distraction Mining"
rl_drone_object_search
UAV-based path planning for efficient localization of non-uniformly distributed weeds using prior knowledge: A reinforcement-learning approach
astrobeecd
[IAC 2023, AA 2024] This repository contains the code used in our paper, "AstrobeeCD: Change Detection in Microgravity with Free-Flying Robots." This method is useful for detecting 3D scene changes given a 3D model, a sequence of images, and a sequence of camera poses.
https://github.com/ai-forever/aggme
Aggregation framework for annotating datasets in computer vision tasks (detection, segmentation, video captioning etc.)
strip-r-cnn
Official implementation of "Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection"
detrex
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
https://github.com/bryceag11/quan
Quaternion Approximate Networks for Enhanced Image Classification and Object Detection
paperclipinspection
Analyzing Paper Clips Using Deep Learning and Computer Vision Techniques 📎
orientedformer
(TGRS 2024) OrientedFormer: An End-to-End Transformer-Based Oriented Object Detector in Remote Sensing Images
https://github.com/autodistill/autodistill-kosmos-2
Kosmos-2 base model for use with Autodistill.
https://github.com/autodistill/autodistill-vlpart
VLPart model for use with Autodistill.
https://github.com/autodistill/autodistill-gpt-4v
GPT-4V(ision) module for use with Autodistill.
efficient-yolo-rs-airplane-detection
This repository supports the article "Exploring YOLOv8 and YOLOv9 for Efficient Airplane Detection in VHR Remote Sensing Imagery", providing weights and evaluation metrics for YOLO architectures applied to high-resolution satellite imagery using the HRPlanes and CORS-ADD datasets.
small-object-detection-benchmark
ICIP 2022 paper: SAHI benchmark on the VisDrone and xView datasets using FCOS, VFNet and TOOD detectors
Lighter
Lighter: Configuration-Driven Deep Learning - Published in JOSS (2025)
gpvit
[ICLR 2023 Spotlight] GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation
https://github.com/acai66/yolov5_rotation
Rotated bounding-box detection, inspired by https://github.com/hukaixuan19970627/YOLOv5_DOTA_OBB; thanks to hukaixuan19970627.
motherboard-dataset
[Kaggle Dataset] Motherboard production defect dataset for object detection. Currently available for YOLOv5, YOLOv7, YOLOv8 & CLIP. Also available on Kaggle.
air
A deep learning object detector framework written in Python for supporting Land Search and Rescue Missions.
yolov4_wrong_side_driving_detection
A Computer Vision (YOLOv4) based project to autonomously detect and penalize vehicles driving on the wrong side of the road.
https://github.com/ammarlodhi255/yolov10-fracture-detection
This repository contains the official code for the paper "Pediatric Wrist Fracture Detection in X-rays via YOLOv10 Algorithm and Dual Label Assignment System"
https://github.com/arbit3rr/jetsonyolo
Simple process for camera installation, software and hardware setup, and object detection using YOLOv5 and OpenCV on NVIDIA Jetson Nano.