https://github.com/bitstrawber/ddpm

Science Score: 49.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
✓
DOI references
Found 3 DOI reference(s) in README
✓
Academic publication links
Links to: arxiv.org
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (9.8%) to scientific vocabulary

Last synced: 10 months ago · JSON representation

Repository

Basic Info

Host: GitHub
Owner: BitStrawber
License: mit
Language: Python
Default Branch: main
Size: 2.49 MB

Statistics

Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Releases: 0

Created about 1 year ago · Last pushed about 1 year ago

Metadata Files

Readme License

Denoising Diffusion Probabilistic Model, in Pytorch

Implementation of Denoising Diffusion Probabilistic Model in Pytorch. It is a new approach to generative modeling that may have the potential to rival GANs. It uses denoising score matching to estimate the gradient of the data distribution, followed by Langevin sampling to sample from the true distribution.

This implementation was inspired by the official Tensorflow version here

Youtube AI Educators - Yannic Kilcher | AI Coffeebreak with Letitia | Outlier

Flax implementation from YiYi Xu

Annotated code by Research Scientists / Engineers from 🤗 Huggingface

Update: Turns out none of the technicalities really matters at all | "Cold Diffusion" paper | Muse

Install

bash $ pip install denoising_diffusion_pytorch

Usage

```python import torch from denoisingdiffusionpytorch import Unet, GaussianDiffusion

model = Unet( dim = 64, dimmults = (1, 2, 4, 8), flashattn = True )

diffusion = GaussianDiffusion( model, image_size = 128, timesteps = 1000 # number of steps )

trainingimages = torch.rand(8, 3, 128, 128) # images are normalized from 0 to 1 loss = diffusion(trainingimages) loss.backward()

after a lot of training

sampledimages = diffusion.sample(batchsize = 4) sampled_images.shape # (4, 3, 128, 128) ```

Or, if you simply want to pass in a folder name and the desired image dimensions, you can use the Trainer class to easily train a model.

```python from denoisingdiffusionpytorch import Unet, GaussianDiffusion, Trainer

model = Unet( dim = 64, dimmults = (1, 2, 4, 8), flashattn = True )

diffusion = GaussianDiffusion( model, imagesize = 128, timesteps = 1000, # number of steps samplingtimesteps = 250 # number of sampling timesteps (using ddim for faster inference [see citation for ddim paper]) )

trainer = Trainer( diffusion, 'path/to/your/images', trainbatchsize = 32, trainlr = 8e-5, trainnumsteps = 700000, # total training steps gradientaccumulateevery = 2, # gradient accumulation steps emadecay = 0.995, # exponential moving average decay amp = True, # turn on mixed precision calculate_fid = True # whether to calculate fid during training )

trainer.train() ```

Samples and model checkpoints will be logged to ./results periodically

Multi-GPU Training

The Trainer class is now equipped with 🤗 Accelerator. You can easily do multi-gpu training in two steps using their accelerate CLI

At the project root directory, where the training script is, run

python $ accelerate config

Then, in the same directory

python $ accelerate launch train.py

Miscellaneous

1D Sequence

By popular request, a 1D Unet + Gaussian Diffusion implementation.

```python import torch from denoisingdiffusionpytorch import Unet1D, GaussianDiffusion1D, Trainer1D, Dataset1D

model = Unet1D( dim = 64, dim_mults = (1, 2, 4, 8), channels = 32 )

diffusion = GaussianDiffusion1D( model, seqlength = 128, timesteps = 1000, objective = 'predv' )

training_seq = torch.rand(64, 32, 128) # features are normalized from 0 to 1

loss = diffusion(training_seq) loss.backward()

Or using trainer

dataset = Dataset1D(training_seq) # this is just an example, but you can formulate your own Dataset and pass it into the Trainer1D below

trainer = Trainer1D( diffusion, dataset = dataset, trainbatchsize = 32, trainlr = 8e-5, trainnumsteps = 700000, # total training steps gradientaccumulateevery = 2, # gradient accumulation steps emadecay = 0.995, # exponential moving average decay amp = True, # turn on mixed precision ) trainer.train()

after a lot of training

sampledseq = diffusion.sample(batchsize = 4) sampled_seq.shape # (4, 32, 128)

```

Trainer1D does not evaluate the generated samples in any way since the type of data is not known.

You could consider adding a suitable metric to the training loop yourself after doing an editable install of this package pip install -e ..

Citations

bibtex @inproceedings{NEURIPS2020_4c5bcfec, author = {Ho, Jonathan and Jain, Ajay and Abbeel, Pieter}, booktitle = {Advances in Neural Information Processing Systems}, editor = {H. Larochelle and M. Ranzato and R. Hadsell and M.F. Balcan and H. Lin}, pages = {6840--6851}, publisher = {Curran Associates, Inc.}, title = {Denoising Diffusion Probabilistic Models}, url = {https://proceedings.neurips.cc/paper/2020/file/4c5bcfec8584af0d967f1ab10179ca4b-Paper.pdf}, volume = {33}, year = {2020} }

bibtex @InProceedings{pmlr-v139-nichol21a, title = {Improved Denoising Diffusion Probabilistic Models}, author = {Nichol, Alexander Quinn and Dhariwal, Prafulla}, booktitle = {Proceedings of the 38th International Conference on Machine Learning}, pages = {8162--8171}, year = {2021}, editor = {Meila, Marina and Zhang, Tong}, volume = {139}, series = {Proceedings of Machine Learning Research}, month = {18--24 Jul}, publisher = {PMLR}, pdf = {http://proceedings.mlr.press/v139/nichol21a/nichol21a.pdf}, url = {https://proceedings.mlr.press/v139/nichol21a.html}, }

bibtex @inproceedings{kingma2021on, title = {On Density Estimation with Diffusion Models}, author = {Diederik P Kingma and Tim Salimans and Ben Poole and Jonathan Ho}, booktitle = {Advances in Neural Information Processing Systems}, editor = {A. Beygelzimer and Y. Dauphin and P. Liang and J. Wortman Vaughan}, year = {2021}, url = {https://openreview.net/forum?id=2LdBqxc1Yv} }

bibtex @article{Karras2022ElucidatingTD, title = {Elucidating the Design Space of Diffusion-Based Generative Models}, author = {Tero Karras and Miika Aittala and Timo Aila and Samuli Laine}, journal = {ArXiv}, year = {2022}, volume = {abs/2206.00364} }

bibtex @article{Song2021DenoisingDI, title = {Denoising Diffusion Implicit Models}, author = {Jiaming Song and Chenlin Meng and Stefano Ermon}, journal = {ArXiv}, year = {2021}, volume = {abs/2010.02502} }

bibtex @misc{chen2022analog, title = {Analog Bits: Generating Discrete Data using Diffusion Models with Self-Conditioning}, author = {Ting Chen and Ruixiang Zhang and Geoffrey Hinton}, year = {2022}, eprint = {2208.04202}, archivePrefix = {arXiv}, primaryClass = {cs.CV} }

bibtex @article{Salimans2022ProgressiveDF, title = {Progressive Distillation for Fast Sampling of Diffusion Models}, author = {Tim Salimans and Jonathan Ho}, journal = {ArXiv}, year = {2022}, volume = {abs/2202.00512} }

bibtex @article{Ho2022ClassifierFreeDG, title = {Classifier-Free Diffusion Guidance}, author = {Jonathan Ho}, journal = {ArXiv}, year = {2022}, volume = {abs/2207.12598} }

bibtex @article{Sunkara2022NoMS, title = {No More Strided Convolutions or Pooling: A New CNN Building Block for Low-Resolution Images and Small Objects}, author = {Raja Sunkara and Tie Luo}, journal = {ArXiv}, year = {2022}, volume = {abs/2208.03641} }

bibtex @inproceedings{Jabri2022ScalableAC, title = {Scalable Adaptive Computation for Iterative Generation}, author = {A. Jabri and David J. Fleet and Ting Chen}, year = {2022} }

bibtex @article{Cheng2022DPMSolverPlusPlus, title = {DPM-Solver++: Fast Solver for Guided Sampling of Diffusion Probabilistic Models}, author = {Cheng Lu and Yuhao Zhou and Fan Bao and Jianfei Chen and Chongxuan Li and Jun Zhu}, journal = {NeuRips 2022 Oral}, year = {2022}, volume = {abs/2211.01095} }

bibtex @inproceedings{Hoogeboom2023simpleDE, title = {simple diffusion: End-to-end diffusion for high resolution images}, author = {Emiel Hoogeboom and Jonathan Heek and Tim Salimans}, year = {2023} }

bibtex @misc{https://doi.org/10.48550/arxiv.2302.01327, doi = {10.48550/ARXIV.2302.01327}, url = {https://arxiv.org/abs/2302.01327}, author = {Kumar, Manoj and Dehghani, Mostafa and Houlsby, Neil}, title = {Dual PatchNorm}, publisher = {arXiv}, year = {2023}, copyright = {Creative Commons Attribution 4.0 International} }

bibtex @inproceedings{Hang2023EfficientDT, title = {Efficient Diffusion Training via Min-SNR Weighting Strategy}, author = {Tiankai Hang and Shuyang Gu and Chen Li and Jianmin Bao and Dong Chen and Han Hu and Xin Geng and Baining Guo}, year = {2023} }

bibtex @misc{Guttenberg2023, author = {Nicholas Guttenberg}, url = {https://www.crosslabs.org/blog/diffusion-with-offset-noise} }

bibtex @inproceedings{Lin2023CommonDN, title = {Common Diffusion Noise Schedules and Sample Steps are Flawed}, author = {Shanchuan Lin and Bingchen Liu and Jiashi Li and Xiao Yang}, year = {2023} }

bibtex @inproceedings{dao2022flashattention, title = {Flash{A}ttention: Fast and Memory-Efficient Exact Attention with {IO}-Awareness}, author = {Dao, Tri and Fu, Daniel Y. and Ermon, Stefano and Rudra, Atri and R{\'e}, Christopher}, booktitle = {Advances in Neural Information Processing Systems}, year = {2022} }

bibtex @article{Bondarenko2023QuantizableTR, title = {Quantizable Transformers: Removing Outliers by Helping Attention Heads Do Nothing}, author = {Yelysei Bondarenko and Markus Nagel and Tijmen Blankevoort}, journal = {ArXiv}, year = {2023}, volume = {abs/2306.12929}, url = {https://api.semanticscholar.org/CorpusID:259224568} }

bibtex @article{Karras2023AnalyzingAI, title = {Analyzing and Improving the Training Dynamics of Diffusion Models}, author = {Tero Karras and Miika Aittala and Jaakko Lehtinen and Janne Hellsten and Timo Aila and Samuli Laine}, journal = {ArXiv}, year = {2023}, volume = {abs/2312.02696}, url = {https://api.semanticscholar.org/CorpusID:265659032} }

bibtex @article{Li2024ImmiscibleDA, title = {Immiscible Diffusion: Accelerating Diffusion Training with Noise Assignment}, author = {Yiheng Li and Heyang Jiang and Akio Kodaira and Masayoshi Tomizuka and Kurt Keutzer and Chenfeng Xu}, journal = {ArXiv}, year = {2024}, volume = {abs/2406.12303}, url = {https://api.semanticscholar.org/CorpusID:270562607} }

bibtex @article{Chung2024CFGMC, title = {CFG++: Manifold-constrained Classifier Free Guidance for Diffusion Models}, author = {Hyungjin Chung and Jeongsol Kim and Geon Yeong Park and Hyelin Nam and Jong Chul Ye}, journal = {ArXiv}, year = {2024}, volume = {abs/2406.08070}, url = {https://api.semanticscholar.org/CorpusID:270391454} }

bibtex @inproceedings{Sadat2024EliminatingOA, title = {Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models}, author = {Seyedmorteza Sadat and Otmar Hilliges and Romann M. Weber}, year = {2024}, url = {https://api.semanticscholar.org/CorpusID:273098845} }

Owner

Login: BitStrawber
Kind: user

Repositories: 1
Profile: https://github.com/BitStrawber

GitHub Events

Total

Push event: 1
Create event: 2

Last Year

Push event: 1
Create event: 2

Dependencies

.github/workflows/python-publish.yml actions

actions/checkout v2 composite
actions/setup-python v2 composite
pypa/gh-action-pypi-publish 27b31702a0e7fc50959f5ad993c78deac1bdfc29 composite

setup.py pypi

accelerate *
einops *
ema-pytorch >=0.4.2
numpy *
pillow *
pytorch-fid *
scipy *
torch >=2.0
torchvision *
tqdm *

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science

https://github.com/bitstrawber/ddpm

Science Score: 49.0%

Repository

Basic Info

Statistics

Metadata Files

README.md

Denoising Diffusion Probabilistic Model, in Pytorch

Install

Usage

after a lot of training

Multi-GPU Training

Miscellaneous

1D Sequence

Or using trainer

after a lot of training

Citations

Owner

GitHub Events

Total

Last Year

Dependencies