diffsketcher

[NIPS 2023] Official implementation for "DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models" https://arxiv.org/abs/2306.14685

https://github.com/ximinng/diffsketcher

Keywords

sketch sketch-generation text-to-sketch text-to-svg

Last synced: 9 months ago · JSON representation ·

Repository

[NIPS 2023] Official implementation for "DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models" https://arxiv.org/abs/2306.14685

Basic Info

Host: GitHub
Owner: ximinng
License: mit
Language: Python
Default Branch: main
Homepage: https://ximinng.github.io/DiffSketcher-project/
Size: 9.19 MB

Statistics

Stars: 202
Watchers: 7
Forks: 19
Open Issues: 2
Releases: 0

Topics

sketch sketch-generation text-to-sketch text-to-svg

Created almost 3 years ago · Last pushed about 2 years ago

Metadata Files

Readme License Citation

DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models

This repository contains our official implementation of the NeurIPS 2023 paper: DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models, which can generate high-quality vector sketches based on text prompts.

teaser1 teaser2

DiffSketcher Rendering Process:

| | | | |-------------------------------------------------------------------------|-----------------------------------------------------------------------------|--------------------------------------------------------------| | Prompt: Macaw full color, ultra detailed, realistic, insanely beautiful | Prompt: Very detailed masterpiece painting of baby yoda hoding a lightsaber | Prompt: Sailboat sailing in the sea on a clear day |

:new: Update

[01/2024] 🔥 We released the SVGDreamer. SVGDreamer is a novel text-guided vector graphics synthesis method. This method considers both the editing of vector graphics and the quality of the synthesis.
[12/2023] 🔥 We released the PyTorch-SVGRender. Pytorch-SVGRender is the go-to library for state-of-the-art differentiable rendering methods for image vectorization.
[11/2023] We thank @camenduru for implementing the DiffSketcher-colab.
[10/2023] We released the DiffSketcher code.
[10/2023] We released the VectorFusion code.

📌 Installation Guide

To quickly get started with DiffSketcher, follow the steps below.
These instructions will help you run quick inference locally.

🚀 Option 1: Standard Installation

Run the following command in the top-level directory:

shell chmod +x script/install.sh bash script/install.sh

🐳 Option 2: Using Docker

shell chmod +x script/run_docker.sh sudo bash script/run_docker.sh

🔥 Quickstart

Case: Sydney Opera House

Preview:

| Attention Map | Control Points Init | Strokes Initialization | 100 step | 500 step | |:------------------------------------------------------------------------------------:|:---------------------------------------------------------------------------------------:|:-------------------------------------------------------------------------------------:|:---------------------------------------------------------------------------------------:|:-------------------------------------------------------------------------------------------:| | | | | | |

From the abstract to the concrete:

| 16 Paths | 36 Paths | 48 Paths | 96 Paths | 128 Paths | |:------------------------------------------------------:|:------------------------------------------------------:|:------------------------------------------------------:|:------------------------------------------------------:|:-------------------------------------------------------:| | | | | | |

Script:

shell python run_painterly_render.py \ -c diffsketcher.yaml \ -eval_step 10 -save_step 10 \ -update "token_ind=4 num_paths=96 num_iter=800" \ -pt "a photo of Sydney opera house" \ -respath ./workdir/sydney_opera_house \ -d 8019 \ --download

-c a.k.a --config: configuration file, saving in DiffSketcher/config/.
-eval_step: the step size used to eval the method (too frequent calls will result in longer times).
-save_step: the step size used to save the result (too frequent calls will result in longer times).
-update: a tool for editing the hyper-params of the configuration file, so you don't need to create a new yaml.
-pt a.k.a --prompt: text prompt.
-respath a.k.a --results_path: the folder to save results.
-d a.k.a --seed: random seed.
--download: download models from huggingface automatically when you first run them.

crucial:

-update "token_ind=4" indicates the index of cross-attn maps to init strokes.
-update "num_paths=96" indicates the number of strokes.

optional:

-npt, a.k.a --negative_prompt: negative text prompt.
-mv, a.k.a --make_video: make a video of the rendering process (it will take much longer).
-frame_freq, a.k.a --video_frame_freq: the interval of the number of steps to save the image.
-framerate, a.k.a --video_frame_rate: control the playback speed of the output video.
Note: Download U2Net model and place in checkpoint/ dir if xdog_intersec=True
add enable_xformers=True in -update to enable xformers for speeding up.
add gradient_checkpoint=True in -update to use gradient checkpoint for low VRAM.

Case: Sydney Opera House in ink painting style

Preview:

| | | | | |------------------------------------------------------|--------------------------------------------------------|--------------------------------------------------------|--------------------------------------------------------| | Strokes Initialization | 100 step | 200 step | 990 step |

Script:

shell python run_painterly_render.py \ -c diffsketcher-width.yaml \ -eval_step 10 -save_step 10 \ -update "token_ind=4 num_paths=48 num_iter=800" \ -pt "a photo of Sydney opera house" \ -respath ./workdir/sydney_opera_house_ink \ -d 8019 \ --download

Oil Painting

Preview:

| | | | |----------------------------------------------------|------------------------------------------------------|------------------------------------------------------| | Strokes Initialization | 100 step | 570 step |

Script:

shell python run_painterly_render.py \ -c diffsketcher-color.yaml \ -eval_step 10 -save_step 10 \ -update "token_ind=5 num_paths=1000 num_iter=1000 guidance_scale=7.5" \ -pt "portrait of latin woman having a spiritual awaking, eyes closed, slight smile, illuminating lights, oil painting, by Van Gogh" \ -npt "text, signature, title, heading, watermark, ugly, duplicate, morbid, mutilated, out of frame, extra fingers, mutated hands, poorly drawn hands, poorly drawn face, mutation, deformed, blurry, bad anatomy, bad proportions, extra limbs, cloned face, disfigured, out of frame, ugly, extra limbs, gross proportions, malformed limbs, missing arms, missing legs, extra arms, extra legs, mutated hands, fused fingers, too many fingers, long neck" \ -respath ./workdir/latin_woman_portrait -d 58548

Preview:

| | | | |------------------------------------------------|--------------------------------------------------|--------------------------------------------------| | Strokes Initialization | 100 step | 570 step |

Script:

shell python run_painterly_render.py \ -c diffsketcher-color.yaml \ -eval_step 10 -save_step 10 \ -update "token_ind=5 num_paths=1000 num_iter=1000 guidance_scale=7.5" \ -pt "a painting of a woman with a crown on her head, art station front page, dynamic portrait style, many colors in the background, olpntng style, oil painting, forbidden beauty" \ -npt "2 heads, 2 faces, cropped image, out of frame, draft, deformed hands, twisted fingers, double image, malformed hands, multiple heads, extra limb, ugly, poorly drawn hands, missing limb, disfigured, cut-off, ugly, grain, low-res, Deformed, blurry, bad anatomy, disfigured, poorly drawn face, mutation, mutated, floating limbs, disconnected limbs, disgusting, poorly drawn, mutilated, mangled, extra fingers, duplicate artifacts, morbid, gross proportions, missing arms, mutated hands, mutilated hands, cloned face, malformed, blur haze" \ -respath ./workdir/woman_with_crown -d 178351

Preview:

| | | | |-----------------------------------------------------------|-------------------------------------------------------------|-------------------------------------------------------------| | Strokes Initialization | 100 step | 420 step |

Script:

shell python run_painterly_render.py \ -c diffsketcher-color.yaml \ -eval_step 10 -save_step 10 \ -update "token_ind=5 num_paths=1000 num_iter=1000 guidance_scale=7.5" \ -pt "a painting of a woman with a crown on her head, art station front page, dynamic portrait style, many colors in the background, olpntng style, oil painting, forbidden beauty" \ -npt "2 heads, 2 faces, cropped image, out of frame, draft, deformed hands, twisted fingers, double image, malformed hands, multiple heads, extra limb, ugly, poorly drawn hands, missing limb, disfigured, cut-off, ugly, grain, low-res, Deformed, blurry, bad anatomy, disfigured, poorly drawn face, mutation, mutated, floating limbs, disconnected limbs, disgusting, poorly drawn, mutilated, mangled, extra fingers, duplicate artifacts, morbid, gross proportions, missing arms, mutated hands, mutilated hands, cloned face, malformed, blur haze" \ -respath ./workdir/woman_with_crown -d 178351

Colorful Results

Preview:

| | | | |---------------------------------------------|-----------------------------------------------|-----------------------------------------------| | Strokes Initialization | 100 step | 340 step |

Script:

shell python run_painterly_render.py \ -c diffsketcher-color.yaml \ -eval_step 10 -save_step 10 \ -update "token_ind=5 num_paths=1000 num_iter=800 guidance_scale=7" \ -pt "a beautiful snow-covered castle, a stunning masterpiece, trees, rays of the sun, Leonid Afremov" \ -npt "poorly drawn hands, poorly drawn feet, poorly drawn face, out of frame, extra limbs, disfigured, deformed, body out of frame, bad anatomy, watermark, signature, cut off, low contrast, underexposed, overexposed, bad art, beginner, amateur, distorted face" \ -respath ./workdir/castle -d 370880

Preview:

| | | | |-----------------------------------------------|-------------------------------------------------|-------------------------------------------------| | Strokes Initialization | 100 step | 850 step |

Script:

shell python run_painterly_render.py \ -c diffsketcher-color.yaml \ -eval_step 10 -save_step 10 \ -update "token_ind=5 num_paths=1000 num_iter=800 guidance_scale=7" \ -pt "a beautiful snow-covered castle, a stunning masterpiece, trees, rays of the sun, Leonid Afremov" \ -npt "poorly drawn hands, poorly drawn feet, poorly drawn face, out of frame, extra limbs, disfigured, deformed, body out of frame, bad anatomy, watermark, signature, cut off, low contrast, underexposed, overexposed, bad art, beginner, amateur, distorted face" \ -respath ./workdir/castle -d 478376

DiffSketcher + Style Transfer

Preview:

| | | | |------------------------------------------------------------------------------------------|-------------------------------------------------------------------|------------------------------------------------------------------------------------------| | Generated sample | Style Image | Result |

Script:

shell python run_painterly_render.py \ -tk style-diffsketcher -c diffsketcher-style.yaml \ -eval_step 10 -save_step 10 \ -update "token_ind=4 num_paths=2000 style_warmup=0 style_strength=1 softmax_temp=0.4 sds.grad_scale=0 lr_scheduler=True num_iter=2000" \ -pt "The French Revolution, highly detailed, 8k, ornate, intricate, cinematic, dehazed, atmospheric, oil painting, by Van Gogh" \ -style ./img/starry.jpg \ -respath ./workdir/style_transfer \ -d 876809

-style: the path of style img place.
style_warmup: add style loss after style_warmup step.
style_strength: How strong the style should be. 100 (max) is a lot. 0 (min) is no style.

More Sketch Results

check the Examples.md for more cases.

TODO

[x] Add a webUI demo.
[x] Add support for colorful results and oil painting.

:books: Acknowledgement

The project is built based on the following repository:

We gratefully thank the authors for their wonderful works.

:paperclip: Citation

If you use this code for your research, please cite the following work:

@inproceedings{xing2023diffsketcher, title={DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models}, author={XiMing Xing and Chuang Wang and Haitao Zhou and Jing Zhang and Qian Yu and Dong Xu}, booktitle={Thirty-seventh Conference on Neural Information Processing Systems}, year={2023}, url={https://openreview.net/forum?id=CY1xatvEQj} }

:copyright: Licence

This work is licensed under a MIT License.

Owner

Name: XiMing Xing
Login: ximinng
Kind: user
Location: Beijing, China

Website: https://ximinng.github.io/
Repositories: 25
Profile: https://github.com/ximinng

Ph.D. student at BUAA. Interested in the Deep Generation and Vector Art.

Citation (CITATION.bib)

@inproceedings{xing2023diffsketcher,
    title={DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models},
    author={XiMing Xing and Chuang Wang and Haitao Zhou and Jing Zhang and Qian Yu and Dong Xu},
    booktitle={Thirty-seventh Conference on Neural Information Processing Systems},
    year={2023},
    url={https://openreview.net/forum?id=CY1xatvEQj}
}

GitHub Events

Total

Issues event: 3
Watch event: 50
Issue comment event: 4
Push event: 3
Fork event: 6

Last Year

Issues event: 3
Watch event: 50
Issue comment event: 4
Push event: 3
Fork event: 6

Issues and Pull Requests

Last synced: 10 months ago

All Time

Total issues: 13
Total pull requests: 1
Average time to close issues: 3 months
Average time to close pull requests: N/A
Total issue authors: 10
Total pull request authors: 1
Average comments per issue: 2.31
Average comments per pull request: 3.0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 0
Pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Issue authors: 0
Pull request authors: 0
Average comments per issue: 0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

View more stats

Top Authors

Issue Authors

hoshizhang (3)
mattdesl (2)
Yhc-777 (1)
LordLiang (1)
ysm2000 (1)
Dingmeihai (1)
BZBY (1)
darkzbaron (1)
Gitterman69 (1)
lll-zy (1)

diffsketcher

Science Score: 41.0%

Keywords

Repository

Basic Info

Statistics

Topics

Metadata Files

README.md

DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models

:new: Update

📌 Installation Guide

🚀 Option 1: Standard Installation

🐳 Option 2: Using Docker

🔥 Quickstart

Case: Sydney Opera House

Case: Sydney Opera House in ink painting style

Oil Painting

Colorful Results

DiffSketcher + Style Transfer

More Sketch Results

TODO

:books: Acknowledgement

:paperclip: Citation

:copyright: Licence

Owner

Citation (CITATION.bib)

GitHub Events

Total

Last Year

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels

Dependencies