simpleimagegenerator

https://github.com/riyakanani/simpleimagegenerator

Science Score: 26.0%

This score indicates how likely this project is to be science-related based on various indicators:

○
CITATION.cff file
✓
codemeta.json file
Found codemeta.json file
✓
.zenodo.json file
Found .zenodo.json file
○
DOI references
○
Academic publication links
○
Academic email domains
○
Institutional organization owner
○
JOSS paper metadata
○
Scientific vocabulary similarity
Low similarity (12.8%) to scientific vocabulary

Last synced: 9 months ago · JSON representation

Repository

Basic Info

Host: GitHub
Owner: riyakanani
License: agpl-3.0
Language: Python
Default Branch: main
Size: 119 MB

Statistics

Stars: 0
Watchers: 2
Forks: 0
Open Issues: 0
Releases: 0

Created about 2 years ago · Last pushed about 2 years ago

Metadata Files

Readme Changelog License Citation Codeowners

Final Report

Project description:

The project we chose to implement was the  Stable Diffusion text-to-image generator. Our project includes an interactive UI in which the user can type into a text box and the image loads onto the side. The UI includes options to adjust the image generation process, such as dimension modifications, negative prompt options, batch size modifications, and more.
We encountered some challenges while setting up this project, mainly due to the original repository requiring the availability of NVIDIA GPUs. This prompted us to search for a tutorial that would guide us on how to bypass this requirement. Because of this, our current project does not make use of the NVIDIA GPUs, and instead works only on MacOS systems.

Deliverables:

Presentation Slides Interactive UI GitHub Repository (includes model and code for UI)

How we divided work:

Although for the most part, we worked pretty collaboratively, there were certain parts that we split into groups for. We collectively did research on which tutorials and models would work best on all of our different laptops and OS. Daniyah and Divya and Riya were able to find out how to create an app that worked on all of our laptops. Tanvi and Shraddha looked through different AI models to find the one that would work best, then chose the hugging face stable-diffusion 2.13 GB model as it was open source and used significantly less processing power compared to other text-to-image models. Riya worked on combining the components of the model and application to work together and created the GitHub Repository to push all of the progress.

Models used and Results:

We used the Mo-di-Diffusion model to generate the images for this project. We decided to use this model as it was readily available in the tutorial and required less storage space than a typical Stable diffusion checkpoint file. This model was also the most compatible one that we found with macOS.
In regards to the results of the project, we were able to successfully create an image generator with a simple UI. The UI we created has a text box for the user to input their prompt for the image they want to create. There is then a separate area where the image is generated. The tutorial and model we used enables the creation of images based on the users input. For example, if a user enters a prompt of a beach during sunset, our generator will easily develop a clear image of this. The user can even input complex prompts and the generator will be able to achieve the goal.

Owner

Login: riyakanani
Kind: user

Repositories: 1
Profile: https://github.com/riyakanani

GitHub Events

Total

Last Year

Dependencies

package.json npm

eslint ^8.40.0 development

pyproject.toml pypi

requirements-test.txt pypi

pytest * test
pytest-base-url * test
pytest-cov * test

requirements.txt pypi

GitPython *
Pillow *
accelerate *
blendmodes *
clean-fid *
einops *
facexlib *
fastapi >=0.90.1
gradio ==3.41.2
inflection *
jsonmerge *
kornia *
lark *
numpy *
omegaconf *
open-clip-torch *
piexif *
psutil *
pytorch_lightning *
requests *
resize-right *
safetensors *
scikit-image >=0.19
tomesd *
torch *
torchdiffeq *
torchsde *
transformers ==4.30.2

requirements_npu.txt pypi

cloudpickle *
decorator *
synr ==0.5.0
tornado *

requirements_versions.txt pypi

GitPython ==3.1.32
Pillow ==9.5.0
accelerate ==0.21.0
blendmodes ==2022
clean-fid ==0.1.35
einops ==0.4.1
facexlib ==0.3.0
fastapi ==0.94.0
gradio ==3.41.2
httpcore ==0.15
httpx ==0.24.1
inflection ==0.5.1
jsonmerge ==1.8.0
kornia ==0.6.7
lark ==1.1.2
numpy ==1.26.2
omegaconf ==2.2.3
open-clip-torch ==2.20.0
piexif ==1.1.3
psutil ==5.9.5
pytorch_lightning ==1.9.4
resize-right ==0.0.2
safetensors ==0.4.2
scikit-image ==0.21.0
spandrel ==0.1.6
tomesd ==0.1.3
torch *
torchdiffeq ==0.2.3
torchsde ==0.2.6
transformers ==4.30.2

venv/lib/python3.10/site-packages/clip-1.0-py3.10.egg-info/requires.txt pypi

ftfy *
pytest *
regex *
torch *
torchvision *
tqdm *

venv/lib/python3.10/site-packages/jsonmerge-1.8.0-py3.10.egg-info/requires.txt pypi

jsonschema *

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Open Source Science