This is a learning project for generating images with AI models from the Stable Diffusion and OpenAI families. The current feature set is limited, and more is planned.
- Convert text prompts into high-quality images.
- Support for various styles and resolutions.
- Easy-to-use interface for generating images.
- Clone the repository.
- Navigate to the project directory.
- Create a virtual environment and activate it.
- Install dependencies.
- Add a .env file with API keys for OpenAI and Hugging Face.
- Run the application by executing `app.py`.
- Open your web browser and navigate to http://localhost:8000 to access the application interface.
- Enter a text prompt in the input field provided.
- Select the desired model for the generated image.
- Click the "Generate" button to create an image based on your text prompt.
- Once the image is generated, you can view it in the images folder.
- Refer to the `logs/` directory for detailed logs when errors occur or for debugging.
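
The .env step above does not say which variable names the application expects. As a minimal sketch, assuming the hypothetical names `OPENAI_API_KEY` and `HF_TOKEN` and a plain `KEY=VALUE` file format, the keys could be loaded and checked with the standard library alone before launching the app. The project itself may use a different loader or different names.

```python
import os
from pathlib import Path
from typing import Dict, List, Mapping, Optional

# Hypothetical variable names (an assumption, not taken from the project);
# replace them with whatever names app.py actually reads.
REQUIRED_KEYS = ("OPENAI_API_KEY", "HF_TOKEN")


def parse_env_text(text: str) -> Dict[str, str]:
    """Parse KEY=VALUE lines, skipping blank lines, # comments, and lines without '='."""
    values: Dict[str, str] = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        value = value.strip()
        # Drop one pair of matching surrounding quotes, if present.
        if len(value) >= 2 and value[0] == value[-1] and value[0] in "\"'":
            value = value[1:-1]
        values[key.strip()] = value
    return values


def load_env_file(path: str = ".env") -> Dict[str, str]:
    """Copy variables from a .env file into os.environ without overwriting existing ones."""
    env_path = Path(path)
    if not env_path.is_file():
        return {}
    values = parse_env_text(env_path.read_text(encoding="utf-8"))
    for key, value in values.items():
        os.environ.setdefault(key, value)
    return values


def missing_keys(environ: Optional[Mapping[str, str]] = None) -> List[str]:
    """Return the required keys that are unset or empty."""
    environ = os.environ if environ is None else environ
    return [key for key in REQUIRED_KEYS if not environ.get(key)]


if __name__ == "__main__":
    load_env_file()
    missing = missing_keys()
    if missing:
        print("Missing API keys:", ", ".join(missing))
    else:
        print("All API keys found.")
```

Running this script from the project directory before starting the application reports any key that is absent, which is usually easier to diagnose than a failed API call later on.
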
- Python 3.8 or higher
- Required libraries listed in `requirements.txt`
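
The Python version requirement above can be verified programmatically. The following is a small sketch, not part of the project; the function name `python_is_supported` is hypothetical, and the minimum version is taken from the requirement stated above.

```python
import sys
from typing import Optional, Sequence

# Minimum version taken from the requirement listed above.
MIN_PYTHON = (3, 8)


def python_is_supported(version: Optional[Sequence[int]] = None) -> bool:
    """Return True if the given (major, minor) version meets the minimum.

    When no version is passed, the running interpreter is checked.
    """
    if version is None:
        version = sys.version_info[:2]
    return tuple(version) >= MIN_PYTHON


if __name__ == "__main__":
    if not python_is_supported():
        raise SystemExit(
            "Python %d.%d or higher is required." % MIN_PYTHON
        )
    print("Python version OK.")
```

A check like this could run at the top of a start-up script so that an unsupported interpreter fails with a clear message instead of an import error.
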
- v0.5.0: Initial release with basic text-to-image generation functionality.
- v0.8.0: Added Gradio UI and an inpainting skeleton.
- Get inpainting fully working
- Add Flux model(s) and Janus for generation
- Further optimize pipelines
- Introduce advanced style customization options
- Add better error checking and type inference