310-interactdiffusion-interaction-control-in-text-to-image-diffusion-models
Science Score: 41.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
✓CITATION.cff file
Found CITATION.cff file -
✓codemeta.json file
Found codemeta.json file -
○.zenodo.json file
-
○DOI references
-
✓Academic publication links
Links to: arxiv.org, scholar.google -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (6.3%) to scientific vocabulary
Last synced: 10 months ago
·
JSON representation
·
Repository
Basic Info
- Host: GitHub
- Owner: SZU-AdvTech-2024
- Default Branch: main
- Size: 0 Bytes
Statistics
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
- Releases: 0
Created over 1 year ago
· Last pushed over 1 year ago
Metadata Files
Citation
https://github.com/SZU-AdvTech-2024/310-InteractDiffusion-Interaction-Control-in-Text-to-Image-Diffusion-Models/blob/main/
# InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model [Jiun Tian Hoe](https://jiuntian.com/), [Xudong Jiang](https://personal.ntu.edu.sg/exdjiang/), [Chee Seng Chan](http://cs-chan.com), [Yap Peng Tan](https://personal.ntu.edu.sg/eyptan/), [Weipeng Hu](https://scholar.google.com/citations?user=zo6ni_gAAAAJ) [Project Page](https://jiuntian.github.io/interactdiffusion) | [paper](https://openaccess.thecvf.com/content/CVPR2024/html/Hoe_InteractDiffusion_Interaction_Control_in_Text-to-Image_Diffusion_Models_CVPR_2024_paper.html) | [arXiv](https://arxiv.org/abs/2312.05849) | [WebUI](https://github.com/jiuntian/sd-webui-interactdiffusion) | [Demo](https://huggingface.co/spaces/interactdiffusion/interactdiffusion) | [Video](https://www.youtube.com/watch?v=Uunzufq8m6Y) | [Diffuser](https://huggingface.co/interactdiffusion/diffusers-v1-2) | [Colab](https://colab.research.google.com/drive/1Bh9PjfTylxI2rbME5mQJtFqNTGvaghJq?usp=sharing) [](https://arxiv.org/abs/2312.05849) [](https://badges.toozhao.com/stats/01HH1JE53YX5TDDDDCG6PXY8WQ "Get your own page views count badge on badges.toozhao.com") [](https://huggingface.co/spaces/interactdiffusion/interactdiffusion) [](https://colab.research.google.com/drive/1Bh9PjfTylxI2rbME5mQJtFqNTGvaghJq?usp=sharing)  - Existing methods lack ability to control the interactions between objects in the generated content. - We propose a pluggable interaction control model, called InteractDiffusion that extends existing pre-trained T2I diffusion models to enable them being better conditioned on interactions. ## News - **[2024.3.13]** Diffusers code is available at [here](https://huggingface.co/interactdiffusion/diffusers-v1-2). - **[2024.3.8]** Demo is available at [Huggingface Spaces](https://huggingface.co/spaces/interactdiffusion/interactdiffusion). - **[2024.3.6]** Code is released. - **[2024.2.27]** InteractionDiffusion paper is accepted at CVPR 2024. - **[2023.12.12]** InteractionDiffusion paper is released. WebUI of InteractDiffusion is available as *alpha* version. ## Results
| Model | Interaction Controllability | FID | KID | |
|---|---|---|---|---|
| Tiny | Large | |||
| v1.0 | 29.53 | 31.56 | 18.69 | 0.00676 |
| v1.1 | 30.20 | 31.96 | 17.90 | 0.00635 |
| v1.2 | 30.73 | 33.10 | 17.32 | 0.00585 |
| XLv1.0 | -- | 35.60 | 16.65 | 0.00560 |
Owner
- Name: SZU-AdvTech-2024
- Login: SZU-AdvTech-2024
- Kind: organization
- Repositories: 1
- Profile: https://github.com/SZU-AdvTech-2024
Citation (citation.txt)
@inproceedings{REPO310,
author = "Hoe, Jiun Tian and Jiang, Xudong and Chan, Chee Seng and Tan, Yap-Peng and Hu, Weipeng",
booktitle = "Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)",
month = "June",
pages = "6180-6189",
title = "{InteractDiffusion: Interaction Control in Text-to-Image Diffusion Models}",
year = "2024"
}
GitHub Events
Total
- Push event: 2
- Create event: 3
Last Year
- Push event: 2
- Create event: 3