chatAI4R: Interactive Artificial Intelligence toolkit for Data Science in R
chatAI4R: Interactive Artificial Intelligence toolkit for Data Science in R - Published in JOSS (2026)
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
mmagic
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
https://github.com/khoj-ai/khoj
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.
hugsvision
HugsVision is an easy-to-use Hugging Face wrapper for state-of-the-art computer vision
evolutionary-diffusion
Applying Evolutionary Computing to Embeddings of Diffusion Models
dilightnet
Official Code Release for [SIGGRAPH 2024] DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation
matlab-gan
MATLAB implementations of Generative Adversarial Networks -- from GAN to Pixel2Pixel, CycleGAN
https://github.com/ai-forever/kandinsky-2
Kandinsky 2 — multilingual text2image latent diffusion model
https://github.com/cptanalatriste/celebrity-generator
A deep convolutional generative adversarial network (DCGAN) for generating faces.
modular-diffusion
Python library for designing and training your own Diffusion Models with PyTorch
https://github.com/bytedance/uno
[ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning
https://github.com/christophreich1996/semantic_pyramid_for_image_generation
PyTorch reimplementation of the paper: "Semantic Pyramid for Image Generation" [CVPR 2020].
https://github.com/bytedance/infiniteyou
🔥 [ICCV 2025 Highlight] InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity
https://github.com/bytedance/comfyui_infiniteyou
🔥 [ICCV 2025 Highlight] Official ComfyUI native node supporting InfiniteYou with FLUX
https://github.com/bytedance/id-patch
Official implementation of CVPR 2025 paper "ID-Patch: Robust ID Association for Group Photo Personalization". This work proposes ID-Patch, a fast and robust method that links identity features to 2D positions via visual patches and embeddings.
text2earth
[IEEE GRSM 2025 🔥] "Text2Earth: Unlocking Text-driven Remote Sensing Image Generation with a Global-Scale Dataset and a Foundation Model"
kg-mm-survey
Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey
https://github.com/anapnoe/stable-diffusion-webui-ux
Stable Diffusion web UI UX
https://github.com/atharvapathak/computer_vision_linear_regression_projects
This repository consists of some basic Computer Vision & Linear Regression Projects.
mergevq
[CVPR] MergeVQ: A Unified Framework for Visual Generation and Representation with Token Merging and Quantization