Updated 6 months ago
optimum
🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools
Updated 6 months ago
https://github.com/ami-iit/onnx-cpp-benchmark
Simple tool to profile onnx inference with C++ APIs.
Updated 6 months ago
https://github.com/altunenes/calcarine
Desktop VLM: Real-time FastVLM analysis of video & textures with live compute shaders
Updated 6 months ago
https://github.com/adalkiran/distributed-inference
A project to demonstrate an approach to designing cross-language and distributed pipeline in deep learning/machine learning domain, using WebRTC and Redis Streams.
Updated 6 months ago
https://github.com/danielsarmiento04/yolov10cpp
Implementation of yolo v10 in c++ std 17 over opencv and onnxruntime
Updated 6 months ago
anira
an architecture for neural network inference in real-time audio applications