simpleimagegenerator
Science Score: 26.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
✓codemeta.json file
Found codemeta.json file -
✓.zenodo.json file
Found .zenodo.json file -
○DOI references
-
○Academic publication links
-
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (12.8%) to scientific vocabulary
Repository
Basic Info
- Host: GitHub
- Owner: riyakanani
- License: agpl-3.0
- Language: Python
- Default Branch: main
- Size: 119 MB
Statistics
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
- Releases: 0
Metadata Files
README.md
Final Report
Project description:
The project we chose to implement was the Stable Diffusion text-to-image generator. Our project includes an interactive UI in which the user can type into a text box and the image loads onto the side. The UI includes options to adjust the image generation process, such as dimension modifications, negative prompt options, batch size modifications, and more.
We encountered some challenges while setting up this project, mainly due to the original repository requiring the availability of NVIDIA GPUs. This prompted us to search for a tutorial that would guide us on how to bypass this requirement. Because of this, our current project does not make use of the NVIDIA GPUs, and instead works only on MacOS systems.
Deliverables:
Presentation Slides Interactive UI GitHub Repository (includes model and code for UI)
How we divided work:
Although for the most part, we worked pretty collaboratively, there were certain parts that we split into groups for. We collectively did research on which tutorials and models would work best on all of our different laptops and OS. Daniyah and Divya and Riya were able to find out how to create an app that worked on all of our laptops. Tanvi and Shraddha looked through different AI models to find the one that would work best, then chose the hugging face stable-diffusion 2.13 GB model as it was open source and used significantly less processing power compared to other text-to-image models. Riya worked on combining the components of the model and application to work together and created the GitHub Repository to push all of the progress.
Models used and Results:
We used the Mo-di-Diffusion model to generate the images for this project. We decided to use this model as it was readily available in the tutorial and required less storage space than a typical Stable diffusion checkpoint file. This model was also the most compatible one that we found with macOS.
In regards to the results of the project, we were able to successfully create an image generator with a simple UI. The UI we created has a text box for the user to input their prompt for the image they want to create. There is then a separate area where the image is generated. The tutorial and model we used enables the creation of images based on the users input. For example, if a user enters a prompt of a beach during sunset, our generator will easily develop a clear image of this. The user can even input complex prompts and the generator will be able to achieve the goal.
Owner
- Login: riyakanani
- Kind: user
- Repositories: 1
- Profile: https://github.com/riyakanani
GitHub Events
Total
Last Year
Dependencies
- eslint ^8.40.0 development
- pytest * test
- pytest-base-url * test
- pytest-cov * test
- GitPython *
- Pillow *
- accelerate *
- blendmodes *
- clean-fid *
- einops *
- facexlib *
- fastapi >=0.90.1
- gradio ==3.41.2
- inflection *
- jsonmerge *
- kornia *
- lark *
- numpy *
- omegaconf *
- open-clip-torch *
- piexif *
- psutil *
- pytorch_lightning *
- requests *
- resize-right *
- safetensors *
- scikit-image >=0.19
- tomesd *
- torch *
- torchdiffeq *
- torchsde *
- transformers ==4.30.2
- cloudpickle *
- decorator *
- synr ==0.5.0
- tornado *
- GitPython ==3.1.32
- Pillow ==9.5.0
- accelerate ==0.21.0
- blendmodes ==2022
- clean-fid ==0.1.35
- einops ==0.4.1
- facexlib ==0.3.0
- fastapi ==0.94.0
- gradio ==3.41.2
- httpcore ==0.15
- httpx ==0.24.1
- inflection ==0.5.1
- jsonmerge ==1.8.0
- kornia ==0.6.7
- lark ==1.1.2
- numpy ==1.26.2
- omegaconf ==2.2.3
- open-clip-torch ==2.20.0
- piexif ==1.1.3
- psutil ==5.9.5
- pytorch_lightning ==1.9.4
- resize-right ==0.0.2
- safetensors ==0.4.2
- scikit-image ==0.21.0
- spandrel ==0.1.6
- tomesd ==0.1.3
- torch *
- torchdiffeq ==0.2.3
- torchsde ==0.2.6
- transformers ==4.30.2
- ftfy *
- pytest *
- regex *
- torch *
- torchvision *
- tqdm *
- jsonschema *