AddaxAI
AddaxAI: A no-code platform to train and deploy custom YOLOv5 object detection models - Published in JOSS (2023)
OfflineMOT
OfflineMOT: A Python package for multiple objects detection and tracking from bird view stationary drone videos - Published in JOSS (2022)
Logodetect
Logodetect: One-shot detection of logos in image and video data - Published in JOSS (2022)
sahi
Framework agnostic sliced/tiled inference + interactive ui + error analysis plots
OmniTrax
OmniTrax: A deep learning-driven multi-animal tracking and pose-estimation add-on for Blender - Published in JOSS (2024)
albumentations
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
rf-detr
RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO and designed for fine-tuning.
inference
Turn any computer or edge device into a command center for your computer vision projects.
catalyst
Accelerated deep learning R&D
rastervision
An open source library and framework for deep learning on satellite and aerial imagery.
torchdistill
A coding-free framework built on PyTorch for reproducible deep learning studies. PyTorch Ecosystem. 🏆26 knowledge distillation methods presented at CVPR, ICLR, ECCV, NeurIPS, ICCV, etc are implemented so far. 🎁 Trained models, training logs and configurations are available for ensuring the reproducibiliy and benchmark.
https://github.com/cheind/py-motmetrics
:bar_chart: Benchmark multiple object trackers (MOT) in Python
mxnet.sharp
.NET Standard bindings for Apache MxNet with Imperative, Symbolic and Gluon Interface for developing, training and deploying Machine Learning models in C#. https://mxnet.tech-quantum.com/
grad-cam
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
geo-trax
🚀 Geo-trax is a comprehensive pipeline for extracting and analyzing high-accuracy georeferenced vehicle trajectories from quasi-stationary, bird’s-eye view drone footage. Using advanced computer vision and deep learning, it enables detailed urban traffic analysis and supports scalable, precise studies of vehicle dynamics.
sc2-benchmark
[TMLR] "SC2 Benchmark: Supervised Compression for Split Computing"
pylocron
PyTorch implementations of recent Computer Vision tricks (ReXNet, RepVGG, Unet3p, YOLOv4, CIoU loss, AdaBelief, PolyLoss, MobileOne). Other additions: AdEMAMix
autodistill
Images to inference with no labeling (use foundation models to train supervised models).
cvat
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
x-anylabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
vitdet
Unofficial implementation for [ECCV'22] "Exploring Plain Vision Transformer Backbones for Object Detection"
rf100-vl
Code from the paper "Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models"
https://github.com/aim-uofa/adelaidet
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
rasberry_perception
General purpose ROS package for using deep learning/object detection frameworks on robots
skellytracker
A pose estimation and object detection/tracking backend for freemocap.
https://github.com/albumentations-team/albumentationsx
Next-generation Albumentations: dual-licensed for open-source and commercial use
hugsvision
HugsVision is a easy to use huggingface wrapper for state-of-the-art computer vision
targetran
Python library for data augmentation in object detection or image classification model training
cppe5
Code for our paper CPPE - 5 (Medical Personal Protective Equipment), a new challenging object detection dataset
seglight
Super fast and real-time semantic segmentation (cpu only) can be use for 1 core cpu
lrv-instruction
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
kso
Notebooks to upload/download marine footage, connect to a citizen science project, train machine learning models and publish marine biological observations.
mish
Official Repository for "Mish: A Self Regularized Non-Monotonic Neural Activation Function" [BMVC 2020]
orca
ORCA: Oceanic Recognition & Classification Application for sea-life analysis systems.
peoplesanspeople
Unity's privacy-preserving human-centric synthetic data generator
promptdet
PromptDet: Towards Open-vocabulary Detection using Uncurated Images, ECCV2022
detect-waste
AI to Combat Environmental Pollution - detecting plastic waste in the environment to combat environmental pollution and promote circular economy (Deep Learning, PyTorch)
catnet
🛰️ Learning to Aggregate Multi-Scale Context for Instance Segmentation in Remote Sensing Images (TNNLS 2024)
quickai
QuickAI is a Python library that makes it extremely easy to experiment with state-of-the-art Machine Learning models.
https://github.com/rybakov-ks/particleanalyzer
A Computer Vision-based tool for automatic segmentation and size analysis of particles in Scanning Electron Microscope (SEM) images.
https://github.com/brainglobe/cellfinder-napari
Efficient cell detection in large images using cellfinder in napari
https://github.com/amazon-science/bigdetection
BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
https://github.com/bencardoen/specht.jl
SPECHT is a Julia implementation of a contrastive weakly supervised object detection and identification method for fluorescence microscopy.
https://github.com/alan-turing-institute/grace
Graph Representation Analysis for Connected Embeddings
https://github.com/braun-steven/dafne
Code for our paper "DAFNe: A One-Stage Anchor-Free Deep Model for Oriented Object Detection".
https://github.com/ammarlodhi255/crack-informed-yolov9-for-multi-locale-bone-fracture-detection
This repository contains the implementation code of the paper "Small Data, Big Impact: A Multi-Locale Bone Fracture Detection on an Extremely Limited Dataset Via Crack-Informed YOLOv9 Variants"
https://github.com/broadinstitute/keras-rcnn
Keras package for region-based convolutional neural networks (RCNNs)
https://github.com/autodistill/autodistill-owlv2
OWLv2 base model for use with Autodistill.
pyro-vision
Computer vision library for wildfire detection 🌲 Deep learning models in PyTorch & ONNX for inference on edge devices (e.g. Raspberry Pi)
https://github.com/autodistill/autodistill-transformers
Use object detection models in Hugging Face Transformers to automatically label data to train a fine-tuned model.
Photovoltaic_Fault_Detector
Model Photovoltaic Fault Detector based in model detector YOLOv.3, this repository contains four detector model with their weights and the explanation of how to use these models.
https://github.com/bencevans/megadetector-contained
Docker image for running the MegaDetector v4 camera-trap object detection model.
https://github.com/nsembleai/nsvision
nsvision is the image data pre and post processing and data augmentation library. It provides utilities for working with image data.
tardal
CVPR 2022 | Target-aware Dual Adversarial Learning and a Multi-scenario Multi-Modality Benchmark to Fuse Infrared and Visible for Object Detection.
gaze-fixation-and-object-saliency
This repository is related to estimating the driver's attention to the outside scene view as a point of gaze (PoG) w.r.t the gaze angles extracted from the driver facing view. We also explore analysis of gaze saliecny in form of heatmaps and yarbus plots.
ultralytics_ros
ROS/ROS 2 package for Ultralytics YOLOv8 real-time object detection and segmentation. https://github.com/ultralytics/ultralytics
igarss-eq-object-detection-yolo-kahramanmaras
The study uses YOLO models on satellite imagery to detect collapsed buildings in Antakya after the 2023 Türkiye earthquake, achieving good results with YOLOv7 but highlighting challenges.
https://github.com/autodistill/autodistill-kosmos-2
Kosmos-2 base model for use with Autodistill.
https://github.com/autodistill/autodistill-vlpart
VLPart model for use with Autodistill.
mfprn_tgrs
[TGRS 2023] An official implementation of Multitype Feature Perception and Refined Network for Spaceborne Infrared Ship Detection
yolov8-sheep-detection-counting
YOLOv8 Aerial Sheep Detection and Counting. Simulated on Gazebo.
strip-r-cnn
Offical implementation of "Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection"
yolov5-parts
Training of VOC2007(or T-LESS) dataset using YOLOv5 (including improved version)
cfinet
The official implementation for ICCV'23 paper "Small Object Detection via Coarse-to-fine Proposal Generation and Imitation Learning"
https://github.com/astrazeneca/perfcam
PoC Code for PerfCam: Digital Twinning for Production Lines Using 3D Gaussian Splatting and Vision Models
gpvit
[ICLR 2023 Spotlight] GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation
https://github.com/acai66/yolov5_rotation
rotated bbox detection. inspired by https://github.com/hukaixuan19970627/YOLOv5_DOTA_OBB, thanks hukaixuan19970627.
pediatric-wrist-abnormality-detection-system
A repository for an automated system using object detection and machine learning techniques to diagnose wrist abnormalities in children, adolescents, and young adults, providing high accuracy and precision in diagnosis through a user-friendly mobile application.
air
A deep learning object detector framework written in Python for supporting Land Search and Rescue Missions.
yolov4_wrong_side_driving_detection
A Computer Vision (YOLOv4) based project to autonomously detect and penalize vehicles driving on the wrong side of the road.
https://github.com/alphonsg/epic-bbox-cell-tracking
Harness deep learning and bounding boxes to perform object detection, segmentation, tracking and more.
https://github.com/amirzenoozi/yolo-tf2-object-detection
Object Detection on Image and Video Based on YOLO Model
https://github.com/bryceag11/quan
Quaternion Approximate Networks for Enhanced Image Classification and Object Detection
https://github.com/arbit3rr/jetsonyolo
Simple process for camera installation, software and hardware setup, and object detection using Yolov5 and openCV on NVIDIA Jetson Nano.
detrex
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
astrobeecd
[IAC 2023, AA 2024] This repository contains the code used in our paper, "AstrobeeCD: Change Detection in Microgravity with Free-Flying Robots." This method is useful for detecting 3D scene changes given a 3D model, a sequence of images, and a sequence of camera poses.
paperclipinspection
Analyzing Paper Clips Using Deep Learning and Computer Vision Techniques 📎