Projects | Open Source Science

Updated 10 months ago

boxmot • Rank 22.4 • Science 77%

BoxMOT: Pluggable SOTA multi-object tracking modules for segmentation, object detection and pose estimation models

boosttrack botsort bytetrack clip deep-learning deepocsort improvedassociation machine-learning mot mots multi-object-tracking multi-object-tracking-segmentation ocsort oriented-bounding-box-tracking osnet segmentation strongsort tensorrt tracking-by-detection yolo

Artificial Intelligence and Machine Learning (36%)

Updated 10 months ago

mmpretrain • Rank 23.1 • Science 64%

OpenMMLab Pre-training Toolbox and Benchmark

beit clip constrastive-learning convnext deep-learning image-classification mae masked-image-modeling mobilenet moco multimodal pretrained-models pytorch resnet self-supervised-learning swin-transformer vision-transformer

Engineering (40%)

Updated 10 months ago

clipseq • Rank 6.4 • Science 77%

CLIP sequencing analysis pipeline for QC, pre-mapping, genome mapping, UMI deduplication, and multiple peak-calling options.

clip clip-seq nextflow nf-core peak-calling pipeline rna-rbp-interactions workflow

Updated 10 months ago

uform • Rank 15.6 • Science 64%

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

bert clip clustering contrastive-learning cross-attention huggingface-transformers image-search language-vision llava multi-lingual multimodal neural-network openai openclip pretrained-models pytorch representation-learning semantic-search transformer vector-search

Updated 9 months ago

biotrove • Rank 5.8 • Science 59%

NeurIPS 2024 Track on Datasets and Benchmarks (Spotlight)

animals clip image-classification multimodal rare-species species taxonomy zero-shot-classification

Updated 10 months ago

panoptic • Rank 10.3 • Science 54%

Explore and analyze large datasets of images

clip image-classification images python research-tool ui

Updated 10 months ago

cambrian • Rank 9.0 • Science 54%

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

chatbot clip computer-vision dino instruction-tuning large-language-models llms mllm multimodal-large-language-models representation-learning

Updated 10 months ago

x-anylabeling • Rank 16.5 • Science 44%

Effortless data labeling with AI support from Segment Anything and other awesome models.

annotation-tool classification clip deep-learning deeplearning depth-estimation grounding-dino image-segmentation labeling-tool llm matting object-detection onnx paddle pose-estimation pytorch resnet sam vlm yolo

Updated 10 months ago

ppdiffusers • Rank 19.3 • Science 36%

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrained models and a diffusion model toolbox. Equipped with high performance and flexibility.

aigc clip controlnet deepseek-vl dit eva-clip got-ocr20 image-to-text internvl2 llava minicpm-v multimodal ppdiffusers qwen2-vl sd-xl sora stable-diffusion stablevideodiffusion text-to-image text-to-video

Updated 10 months ago

marqo-fashionclip • Rank 6.4 • Science 44%

State-of-the-art CLIP/SigLIP embedding models finetuned for the fashion domain. +57% increase in evaluation metrics vs FashionCLIP 2.0.

clip embeddings fashion-classifier fashionclip informationretrieval multimodal recomendations search transformers vectorsearch vision-transformer

Updated 10 months ago

@stdlib/math-base-special-clamp • Rank 3.3 • Science 44%

Restrict a double-precision floating-point number to a specified range.

clamp clip dbl double double-precision interval javascript math mathematics node node-js nodejs range restrict stdlib

Updated 10 months ago

@stdlib/math-base-special-clampf • Rank 3.3 • Science 44%

Restrict a single-precision floating-point number to a specified range.

clampf clip float interval javascript math mathematics node node-js nodejs range restrict single single-precision stdlib

Updated 10 months ago

promptdet • Rank 7.3 • Science 36%

PromptDet: Towards Open-vocabulary Detection using Uncurated Images, ECCV2022

clip computer-vision eccv2022 novel-categories object-detection prompt-learning pseudo-labeling regional-prompt self-training vocabulary web-image zero-shot-learning

Updated 9 months ago

https://github.com/ai-forever/ru-clip • Rank 7.5 • Science 33%

CLIP implementation for Russian language

clip computer-vision nlp

Updated 9 months ago

https://github.com/bentoml/clip-api-service • Rank 9.2 • Science 13%

CLIP as a service - Embed images and sentences, object recognition, visual reasoning, image classification and reverse image search

ai-applications clip cloud-native mlops model-inference model-inference-service model-serving openai-clip

Updated 9 months ago

https://github.com/autodistill/autodistill-clip • Rank 7.4 • Science 13%

CLIP module for use with Autodistill.

autodistill clip image-classification

Updated 9 months ago

https://github.com/capjamesg/sam-clip • Rank 3.4 • Science 13%

Use Grounding DINO, Segment Anything, and CLIP to label objects in images.

clip computer-vision segment-anything zero-shot-object-detection

Updated 9 months ago

https://github.com/cuc-zihang-liu/audioevaluation • Science 26%

Reproductions of multiple audio evaluation methods

audio clap clip evaluation lpips rewas

Updated 10 months ago

b-cosification • Science 54%

[NeurIPS 2024] Code for the paper: B-cosification: Transforming Deep Neural Networks to be Inherently Interpretable.

b-cos-networks clip explainable-artificial-intelligence inherent-interpretability neurips-2024

Updated 10 months ago

geospatial-rag • Science 26%

AI Framework for Remote Sensing Image Analysis using RAG - 88%+ accuracy, multi-modal queries, ChatGPT-like interface

academic-research clip computer-vision earth-observation embeddings geospatial langchain machine-learning multimodal-ai pytorch rag remote-sensing

Updated 10 months ago

detect-clip-backdoor-samples • Science 54%

[ICLR2025] Detecting Backdoor Samples in Contrastive Language Image Pretraining

backdoor backdoor-attack backdoor-attacks backdoors clip contrastive-language-image-pretraining contrastive-learning deep deep-learning detection iclr iclr2025 outlier-detection poisoning-attack poisoning-defenses pytorch safety

Updated 10 months ago

bayesvlm • Science 54%

Code for Post-hoc Probabilistic Vision-Language Models

active-learning bayesian-deep-learning clip siglip vision-language-models zero-shot-learning

Updated 9 months ago

https://github.com/924973292/mambapro • Science 23%

[AAAI2025] MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt

adapter clip finetuning fusion mamba multi-modal prompt reid

Updated 10 months ago

spn4cir • Science 54%

[ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives

acmmm2024 blip blip2 clip composed-image-retrieval cross-modal-retrieval data-generation image-retrieval llama llava memory-bank multi-modal-retrieval multimodal-learning transformer

Updated 10 months ago

motherboard-dataset • Science 31%

[Kaggle Dataset] Motherboard production defect dataset for object detection. Currently available for YOLOv5, YOLOv7, YOLOv8 & CLIP. Also available on Kaggle.