controlnet
Self-implemented Control-Net for final proj of DATA303 2025 spring
Science Score: 54.0%
This score indicates how likely this project is to be science-related based on various indicators:
- ✓ CITATION.cff file: Found CITATION.cff file
- ✓ codemeta.json file: Found codemeta.json file
- ✓ .zenodo.json file: Found .zenodo.json file
- ○ DOI references
- ✓ Academic publication links: Links to arxiv.org
- ○ Academic email domains
- ○ Institutional organization owner
- ○ JOSS paper metadata
- ○ Scientific vocabulary similarity: Low similarity (11.1%) to scientific vocabulary
Repository
Self-implemented Control-Net for final proj of DATA303 2025 spring
Basic Info
- Host: GitHub
- Owner: kyne0127
- License: apache-2.0
- Language: Python
- Default Branch: main
- Size: 56.7 MB
Statistics
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
Customized ControlNet with 🤗 Diffusers
This repository provides examples and instructions for fine-tuning ControlNet on the circle-filling dataset using 🤗 Diffusers. You can train with PyTorch or Flax/JAX, leverage Accelerate for distributed/multi-GPU training, and apply various memory optimizations for GPUs of different sizes.
📦 Installation
1. Clone and install Diffusers
```bash
pip install -e .
```
2. Install example requirements
```bash
cd examples/controlnet
pip install -r requirements.txt
```
3. Initialize Accelerate
Interactive setup:
```bash
accelerate config
```
Default configuration:
```bash
accelerate config default
```
Non-interactive (e.g., notebook):
```python
from accelerate.utils import write_basic_config

write_basic_config()
```
📂 Dataset
We use the circle-filling dataset (re-hosted for 🤗 Datasets compatibility) under fusing/fill50k. Dataloading is handled inside the training script.
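To inspect the data outside the training script, a minimal sketch along these lines may help. It assumes the `datasets` library is installed and that each record holds image-like values plus a text caption; the helper names `describe_example` and `peek_first_example` are hypothetical and not part of this repository. Depending on your `datasets` version, the dataset's loader may need extra options such as `trust_remote_code=True`.

```python
from typing import Any, Dict


def describe_example(example: Dict[str, Any]) -> Dict[str, str]:
    """Summarize one record: image-like values become 'WxH', others become text."""
    summary = {}
    for key, value in example.items():
        # PIL images expose .size as a (width, height) tuple.
        size = getattr(value, "size", None)
        if isinstance(size, tuple) and len(size) == 2:
            summary[key] = f"{size[0]}x{size[1]}"
        else:
            summary[key] = str(value)
    return summary


def peek_first_example(dataset_name: str = "fusing/fill50k") -> Dict[str, str]:
    """Load the training split and summarize its first record (needs network access)."""
    # Imported lazily so describe_example works without `datasets` installed.
    from datasets import load_dataset

    dataset = load_dataset(dataset_name, split="train")
    return describe_example(dataset[0])
```

Calling `peek_first_example()` downloads the dataset on first use and returns a small dictionary mapping each column name to either an image size or its text value.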
🖼️ Download Conditioning Images
```bash
wget https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/controlnet_training/conditioning_image_1.png
wget https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/controlnet_training/conditioning_image_2.png
```
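Where `wget` is not available (for example in a notebook on Windows), the same two files can be fetched with the Python standard library. The sketch below reuses the URLs from the commands above; the helper names `image_urls` and `download_all` are hypothetical.

```python
from pathlib import Path
from urllib.parse import urlsplit
from urllib.request import urlretrieve

BASE_URL = (
    "https://huggingface.co/datasets/huggingface/documentation-images/"
    "resolve/main/diffusers/controlnet_training"
)
IMAGE_NAMES = ["conditioning_image_1.png", "conditioning_image_2.png"]


def image_urls(base_url: str = BASE_URL, names=IMAGE_NAMES) -> list:
    """Build the full download URL for each conditioning image."""
    return [f"{base_url}/{name}" for name in names]


def download_all(target_dir: str = ".") -> list:
    """Download every image into target_dir, skipping files that already exist."""
    target = Path(target_dir)
    target.mkdir(parents=True, exist_ok=True)
    saved = []
    for url in image_urls():
        # Name the local file after the last path component of the URL.
        destination = target / Path(urlsplit(url).path).name
        if not destination.exists():
            urlretrieve(url, str(destination))
        saved.append(destination)
    return saved
```

Running `download_all()` from the directory where you launch training places both PNG files next to the training script, which matches the relative paths passed as validation images.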
🚀 Training (PyTorch)
```bash
export MODEL_DIR="stable-diffusion-v1-5/stable-diffusion-v1-5"
export OUTPUT_DIR="output"

# Launch training through Accelerate; the validation images are the two files downloaded above.
accelerate launch mytraincontrolnet.py \
  --pretrained_model_name_or_path=$MODEL_DIR \
  --output_dir=$OUTPUT_DIR \
  --dataset_name=fusing/fill50k \
  --resolution=512 \
  --learning_rate=1e-5 \
  --validation_image "./conditioning_image_1.png" "./conditioning_image_2.png" \
  --validation_prompt "red circle with blue background" "cyan circle with brown floral background" \
  --train_batch_size=4 \
  --report_to=wandb
```
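When launching from a notebook or a Python driver script, it can be convenient to assemble the same command programmatically. The following is a sketch only: the flag spellings follow the upstream Diffusers `train_controlnet.py` example and are an assumption for this repository's script, and the helpers `build_launch_command` and `run_training` are hypothetical names.

```python
import shlex
import subprocess
from typing import List, Sequence


def build_launch_command(
    script: str,
    model_dir: str,
    output_dir: str,
    validation_images: Sequence[str],
    validation_prompts: Sequence[str],
    dataset_name: str = "fusing/fill50k",
    resolution: int = 512,
    learning_rate: float = 1e-5,
    train_batch_size: int = 4,
    report_to: str = "wandb",
) -> List[str]:
    """Assemble the `accelerate launch` argument list for ControlNet training."""
    # Simplifying assumption of this sketch: pair prompts and images one-to-one.
    if len(validation_images) != len(validation_prompts):
        raise ValueError("Provide one validation prompt per validation image.")
    return [
        "accelerate", "launch", script,
        f"--pretrained_model_name_or_path={model_dir}",
        f"--output_dir={output_dir}",
        f"--dataset_name={dataset_name}",
        f"--resolution={resolution}",
        f"--learning_rate={learning_rate}",
        "--validation_image", *validation_images,
        "--validation_prompt", *validation_prompts,
        f"--train_batch_size={train_batch_size}",
        f"--report_to={report_to}",
    ]


def run_training(command: List[str]) -> int:
    """Print the shell-quoted command, run it, and return its exit code."""
    print(shlex.join(command))
    return subprocess.run(command, check=False).returncode
```

Passing the argument list directly to `subprocess.run` avoids shell quoting problems with prompts that contain spaces. Supply the script name exactly as it appears in the repository when calling `build_launch_command`.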
📖 References
Owner
- Login: kyne0127
- Kind: user
- Repositories: 1
- Profile: https://github.com/kyne0127
Citation (CITATION.cff)
cff-version: 1.2.0
title: 'Diffusers: State-of-the-art diffusion models'
message: >-
  If you use this software, please cite it using the
  metadata from this file.
type: software
authors:
  - given-names: Patrick
    family-names: von Platen
  - given-names: Suraj
    family-names: Patil
  - given-names: Anton
    family-names: Lozhkov
  - given-names: Pedro
    family-names: Cuenca
  - given-names: Nathan
    family-names: Lambert
  - given-names: Kashif
    family-names: Rasul
  - given-names: Mishig
    family-names: Davaadorj
  - given-names: Dhruv
    family-names: Nair
  - given-names: Sayak
    family-names: Paul
  - given-names: Steven
    family-names: Liu
  - given-names: William
    family-names: Berman
  - given-names: Yiyi
    family-names: Xu
  - given-names: Thomas
    family-names: Wolf
repository-code: 'https://github.com/huggingface/diffusers'
abstract: >-
  Diffusers provides pretrained diffusion models across
  multiple modalities, such as vision and audio, and serves
  as a modular toolbox for inference and training of
  diffusion models.
keywords:
  - deep-learning
  - pytorch
  - image-generation
  - hacktoberfest
  - diffusion
  - text2image
  - image2image
  - score-based-generative-modeling
  - stable-diffusion
  - stable-diffusion-diffusers
license: Apache-2.0
version: 0.12.1
GitHub Events
Total
- Member event: 2
- Push event: 10
- Create event: 1
Last Year
- Member event: 2
- Push event: 10
- Create event: 1
Dependencies
- actions/cache v2 composite
- actions/checkout v3 composite
- actions/upload-artifact v4 composite
- actions/checkout v3 composite
- docker/build-push-action v3 composite
- docker/login-action v2 composite
- docker/setup-buildx-action v1 composite
- huggingface/hf-workflows/.github/actions/post-slack main composite
- jitterbit/get-changed-files v1 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- actions/checkout v3 composite
- actions/download-artifact v4 composite
- actions/upload-artifact v4 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- actions/checkout v3 composite
- actions/upload-artifact v3 composite
- actions/upload-artifact v4 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- actions/upload-artifact v4 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- actions/upload-artifact v4 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- actions/checkout v3 composite
- actions/upload-artifact v4 composite
- actions/checkout v3 composite
- actions/upload-artifact v4 composite
- ./.github/actions/setup-miniconda * composite
- actions/checkout v3 composite
- actions/upload-artifact v4 composite
- actions/checkout v3 composite
- actions/setup-python v4 composite
- actions/checkout v3 composite
- actions/upload-artifact v4 composite
- actions/checkout v4 composite
- actions/checkout v3 composite
- huggingface/tailscale-action main composite
- actions/checkout v3 composite
- huggingface/tailscale-action main composite
- actions/checkout v2 composite
- actions/setup-python v1 composite
- actions/checkout v4 composite
- trufflesecurity/trufflehog main composite
- actions/checkout v3 composite
- crate-ci/typos v1.12.4 composite
- actions/checkout v3 composite
- ubuntu 20.04 build
- ubuntu 20.04 build
- ubuntu 20.04 build
- ubuntu 20.04 build
- nvidia/cuda 12.1.0-runtime-ubuntu20.04 build
- ubuntu 20.04 build
- nvidia/cuda 12.1.0-runtime-ubuntu20.04 build
- nvidia/cuda 12.1.0-runtime-ubuntu20.04 build
- nvidia/cuda 12.1.0-runtime-ubuntu20.04 build
- Jinja2 *
- accelerate >=0.31.0
- ftfy *
- peft >=0.11.1
- sentencepiece *
- tensorboard *
- torchvision *
- transformers >=4.41.2
- Jinja2 *
- accelerate >=0.31.0
- ftfy *
- peft >=0.11.1
- sentencepiece *
- tensorboard *
- torchvision *
- transformers >=4.41.2
- Jinja2 *
- accelerate >=0.31.0
- decord >=0.6.0
- ftfy *
- imageio-ffmpeg *
- peft >=0.11.1
- sentencepiece *
- tensorboard *
- torchvision *
- transformers >=4.41.2
- accelerate ==1.2.0
- peft >=0.14.0
- torch *
- torchvision *
- transformers ==4.47.0
- wandb *
- Jinja2 *
- accelerate >=0.16.0
- ftfy *
- tensorboard *
- torchvision *
- transformers >=4.25.1
- webdataset *
- accelerate >=0.16.0
- datasets *
- ftfy *
- tensorboard *
- torchvision *
- transformers >=4.25.1
- Jinja2 *
- datasets *
- flax *
- ftfy *
- optax *
- tensorboard *
- torch *
- torchvision *
- transformers >=4.25.1
- Jinja2 *
- SentencePiece *
- accelerate >=0.16.0
- datasets *
- ftfy *
- tensorboard *
- torchvision *
- transformers >=4.25.1
- wandb *
- Jinja2 *
- accelerate >=0.16.0
- datasets *
- ftfy *
- tensorboard *
- torchvision *
- transformers >=4.25.1
- wandb *
- Jinja2 *
- accelerate >=0.16.0
- datasets *
- ftfy *
- tensorboard *
- torchvision *
- transformers >=4.25.1
- wandb *
- Jinja2 *
- accelerate *
- ftfy *
- tensorboard *
- torchvision *
- transformers >=4.25.1
- Jinja2 *
- accelerate >=0.16.0
- ftfy *
- peft ==0.7.0
- tensorboard *
- torchvision *
- transformers >=4.25.1
- Jinja2 *
- flax *
- ftfy *
- optax *
- tensorboard *
- torch *
- torchvision *
- transformers >=4.25.1
- Jinja2 *
- accelerate >=0.31.0
- ftfy *
- peft >=0.11.1
- sentencepiece *
- tensorboard *
- torchvision *
- transformers >=4.41.2
- Jinja2 *
- accelerate >=1.4.0
- ftfy *
- peft >=0.14.0
- sentencepiece *
- tensorboard *
- torchvision *
- transformers >=4.50.0
- Jinja2 *
- accelerate >=1.0.0
- ftfy *
- peft >=0.14.0
- sentencepiece *
- tensorboard *
- torchvision *
- transformers >=4.47.0
- Jinja2 *
- accelerate >=0.31.0
- ftfy *
- peft ==0.11.1
- sentencepiece *
- tensorboard *
- torchvision *
- transformers >=4.41.2
- Jinja2 *
- accelerate >=0.16.0
- ftfy *
- peft ==0.7.0
- tensorboard *
- torchvision *
- transformers >=4.25.1
- accelerate ==1.2.0
- peft >=0.14.0
- torch *
- torchvision *
- transformers ==4.47.0
- wandb *
- accelerate >=0.16.0
- datasets *
- ftfy *
- tensorboard *
- torchvision *
- transformers >=4.25.1
- Jinja2 *
- accelerate >=0.16.0
- datasets *
- ftfy *
- tensorboard *
- torchvision *
- transformers >=4.25.1
- huggingface-hub >=0.26.2
- Pillow *
- accelerate >=0.16.0
- bitsandbytes *
- datasets *
- huggingface_hub *
- lpips *
- numpy *
- packaging *
- taming_transformers *
- torch *
- torchvision *
- tqdm *
- transformers *
- wandb *
- xformers *
- Jinja2 *
- diffusers *
- ftfy *
- tensorboard *
- torch *
- torchvision *
- transformers *
- Jinja2 *
- accelerate >=0.16.0
- ftfy *
- tensorboard *
- torchvision *
- transformers >=4.25.1
- Jinja2 *
- accelerate >=0.16.0
- ftfy *
- peft *
- tensorboard *
- torchvision *
- transformers >=4.25.1
- wandb *
- accelerate *
- datasets *
- peft *
- torchvision *
- transformers *
- wandb *
- webdataset *
- Jinja2 *
- accelerate >=0.16.0
- diffusers ==0.9.0
- ftfy *
- tensorboard *
- torchvision *
- transformers >=4.21.0
- Jinja2 *
- accelerate >=0.16.0
- diffusers *
- fairscale *
- ftfy *
- scipy *
- tensorboard *
- timm *
- torchvision *
- transformers >=4.25.1
- wandb *
- Jinja2 *
- accelerate >=0.16.0
- ftfy *
- intel_extension_for_pytorch >=1.13
- tensorboard *
- torchvision *
- transformers >=4.21.0
- accelerate *
- ftfy *
- modelcards *
- neural-compressor *
- tensorboard *
- torchvision *
- transformers >=4.25.0
- accelerate *
- ip_adapter *
- torchvision *
- transformers >=4.25.1
- Jinja2 *
- accelerate >=0.16.0
- datasets *
- ftfy *
- tensorboard *
- torchvision *
- transformers >=4.25.1
- Jinja2 *
- accelerate >=0.16.0
- ftfy *
- tensorboard *
- torchvision *
- transformers >=4.25.1
- Jinja2 *
- accelerate >=0.16.0
- datasets >=2.16.0
- ftfy *
- tensorboard *
- torchvision *
- transformers >=4.25.1
- wandb >=0.16.1
- Jinja2 *
- accelerate >=0.16.0
- ftfy *
- tensorboard *
- torchvision *
- transformers >=4.25.1
- Jinja2 *
- flax *
- ftfy *
- optax *
- tensorboard *
- torch *
- torchvision *
- transformers >=4.25.1
- accelerate >=0.16.0
- datasets *
- ftfy *
- modelcards *
- tensorboard *
- torchvision *
- transformers >=4.25.1
- accelerate >=0.16.0
- ftfy *
- modelcards *
- tensorboard *
- torchvision *
- transformers >=4.25.1
- accelerate >=0.16.0
- datasets *
- tensorboard *
- torchvision *
- SentencePiece *
- controlnet-aux *
- datasets *
- torchvision *
- transformers *
- Jinja2 *
- accelerate >=0.16.0
- datasets >=2.19.1
- ftfy *
- peft ==0.7.0
- tensorboard *
- torchvision *
- transformers >=4.25.1
- Jinja2 ==3.1.6
- accelerate ==0.23.0
- diffusers ==0.20.1
- ftfy ==6.1.1
- peft ==0.5.0
- tensorboard ==2.14.0
- torch ==2.2.0
- torchvision >=0.16
- transformers ==4.38.0
- accelerate >=0.16.0
- bitsandbytes *
- deepspeed *
- peft >=0.6.0
- torchvision *
- transformers >=4.25.1
- wandb *
- aiohttp *
- fastapi *
- prometheus-fastapi-instrumentator >=7.0.0
- prometheus_client >=0.18.0
- py-consul *
- sentencepiece *
- torch *
- transformers ==4.46.1
- uvicorn *
- aiohappyeyeballs ==2.4.3
- aiohttp ==3.10.10
- aiosignal ==1.3.1
- annotated-types ==0.7.0
- anyio ==4.6.2.post1
- attrs ==24.2.0
- certifi ==2024.8.30
- charset-normalizer ==3.4.0
- click ==8.1.7
- fastapi ==0.115.3
- filelock ==3.16.1
- frozenlist ==1.5.0
- fsspec ==2024.10.0
- h11 ==0.14.0
- huggingface-hub ==0.26.1
- idna ==3.10
- jinja2 ==3.1.4
- markupsafe ==3.0.2
- mpmath ==1.3.0
- multidict ==6.1.0
- networkx ==3.4.2
- numpy ==2.1.2
- packaging ==24.1
- prometheus-client ==0.21.0
- prometheus-fastapi-instrumentator ==7.0.0
- propcache ==0.2.0
- py-consul ==1.5.3
- pydantic ==2.9.2
- pydantic-core ==2.23.4
- pyyaml ==6.0.2
- regex ==2024.9.11
- requests ==2.32.3
- safetensors ==0.4.5
- sentencepiece ==0.2.0
- sniffio ==1.3.1
- starlette ==0.41.0
- sympy ==1.13.3
- tokenizers ==0.20.1
- torch ==2.4.1
- tqdm ==4.66.5
- transformers ==4.46.1
- typing-extensions ==4.12.2
- urllib3 ==2.2.3
- uvicorn ==0.32.0
- yarl ==1.16.0
- accelerate >=0.16.0
- datasets *
- ftfy *
- safetensors *
- tensorboard *
- torchvision *
- transformers >=4.25.1
- wandb *
- Jinja2 *
- accelerate >=0.16.0
- datasets >=2.19.1
- ftfy *
- peft ==0.7.0
- tensorboard *
- torchvision *
- transformers >=4.25.1
- Jinja2 *
- datasets *
- flax *
- ftfy *
- optax *
- tensorboard *
- torch *
- torchvision *
- transformers >=4.25.1
- Jinja2 *
- accelerate >=0.22.0
- datasets *
- ftfy *
- peft ==0.7.0
- tensorboard *
- torchvision *
- transformers >=4.25.1
- Jinja2 *
- accelerate >=0.16.0
- ftfy *
- tensorboard *
- torchvision *
- transformers >=4.25.1
- Jinja2 *
- flax *
- ftfy *
- optax *
- tensorboard *
- torch *
- torchvision *
- transformers >=4.25.1
- accelerate >=0.16.0
- datasets *
- torchvision *
- accelerate >=0.16.0
- datasets *
- numpy *
- tensorboard *
- timm *
- torchvision *
- tqdm *
- transformers >=4.25.1
- deps *