
Toy Story Creator Stable Diffusion

The main notebook in this project is CLIP_img2img_instruct_workflow.ipynb. It is meant to be run with a GPU runtime in Google Colab. It reads input images from the images directory (which must be uploaded to Colab) and uses the Hugging Face Diffusers pipelines to perform image-to-image transformation with Stable Diffusion. There is also support for using a CLIP Interrogator to generate image captions before applying the image-to-image pipeline.
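
As a rough sketch of what that workflow looks like (the exact checkpoints and parameters in the notebook may differ; the `runwayml/stable-diffusion-v1-5` checkpoint, the `clip-interrogator` package, and the file paths below are assumptions for illustration):

```python
import torch
from PIL import Image
from diffusers import StableDiffusionImg2ImgPipeline
from clip_interrogator import Config, Interrogator

# Load an input image from the images directory (path is illustrative)
init_image = Image.open("images/toy.jpg").convert("RGB").resize((512, 512))

# Optionally caption the image with a CLIP Interrogator to seed the prompt
ci = Interrogator(Config(clip_model_name="ViT-L-14/openai"))
caption = ci.interrogate(init_image)
prompt = f"{caption}, pixar style, 3d animated movie still"

# Image-to-image pipeline: the init image anchors the composition, the prompt steers the style
pipe = StableDiffusionImg2ImgPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

result = pipe(
    prompt=prompt,
    image=init_image,
    strength=0.6,        # how far to move away from the original image (0-1)
    guidance_scale=7.5,  # how strongly to follow the prompt
).images[0]
result.save("toy_animated.png")
```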

The notebook also experiments with InstructPix2Pix, with mixed results.
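
A similarly hedged sketch of the InstructPix2Pix variant, assuming the `timbrooks/instruct-pix2pix` checkpoint; instead of describing the target image, the prompt is an edit instruction:

```python
import torch
from PIL import Image
from diffusers import StableDiffusionInstructPix2PixPipeline

pipe = StableDiffusionInstructPix2PixPipeline.from_pretrained(
    "timbrooks/instruct-pix2pix", torch_dtype=torch.float16
).to("cuda")

image = Image.open("images/toy.jpg").convert("RGB").resize((512, 512))

# InstructPix2Pix takes an edit instruction rather than a full scene description
edited = pipe(
    "turn this toy into a Pixar-style cartoon character",
    image=image,
    num_inference_steps=30,
    image_guidance_scale=1.5,  # how closely to stay to the input image
    guidance_scale=7.5,
).images[0]
edited.save("toy_pix2pix.png")
```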

The goal is to generate animated (cartoon-style) versions of the input toy images.

Next Steps

  1. Explore background transformation - for some images the transformation is poor. If we could generate backgrounds via text-to-image and place the transformed toys onto them, it would give a stronger impression of the toys being cartoonised. One issue to address is that the background must fit naturally within the image aspect ratio; switching a background without considering where the character model sits gives unrealistic output.

  2. Explore inpainting and InstructPix2Pix for background editing; Playground AI has a good example, illustrated here. See the inpainting sketch after this list.

  3. Build an API, either with Flask or FastAPI. We need a service that takes an input image and returns a range of images transformed by Stable Diffusion. The design needs thought: do we send the image as a POST request? If so, we need to implement task queues and streaming body responses. Or do we upload input images to a COS bucket under a namespace and save the model outputs to a different directory in the same bucket? Either way the API will require GPU support; Hugging Face Inference Endpoints may be worth exploring. See the API sketch after this list.

  4. Build a Gradio app on Hugging Face Spaces as an alternative to an API?

  5. Fine-tune our own Stable Diffusion model on Disney/Pixar-style images.

  6. Try other Stable Diffusion models on Hugging Face - new models are released frequently, and the models tried in this notebook may already be outdated.

  7. Explore other neural style transfer methods as alternatives to Stable Diffusion - ideally these should be as easy to use as the Hugging Face pipelines.
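
One possible shape for the background editing in steps 1 and 2 is Diffusers' inpainting pipeline. The sketch below assumes the `runwayml/stable-diffusion-inpainting` checkpoint and a pre-made mask in which white pixels mark the background to repaint; the paths and the mask are illustrative only:

```python
import torch
from PIL import Image
from diffusers import StableDiffusionInpaintPipeline

pipe = StableDiffusionInpaintPipeline.from_pretrained(
    "runwayml/stable-diffusion-inpainting", torch_dtype=torch.float16
).to("cuda")

init_image = Image.open("images/toy_animated.png").convert("RGB").resize((512, 512))
# White pixels are regenerated, black pixels are kept; here the mask covers the background
mask_image = Image.open("images/toy_background_mask.png").convert("RGB").resize((512, 512))

result = pipe(
    prompt="a colourful cartoon bedroom, pixar movie background",
    image=init_image,
    mask_image=mask_image,
    guidance_scale=7.5,
).images[0]
result.save("toy_new_background.png")
```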
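
For step 3, a minimal sketch of the POST-request option, assuming FastAPI and a synchronous response; a real service would still need task queues, request batching, and GPU-backed hosting as noted above:

```python
import io

import torch
from fastapi import FastAPI, File, UploadFile
from fastapi.responses import StreamingResponse
from PIL import Image
from diffusers import StableDiffusionImg2ImgPipeline

app = FastAPI()

# Load the pipeline once at startup; the checkpoint is an assumption, not a final choice
pipe = StableDiffusionImg2ImgPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

@app.post("/transform")
async def transform(file: UploadFile = File(...)):
    # Read the uploaded image out of the request body
    init_image = Image.open(io.BytesIO(await file.read())).convert("RGB").resize((512, 512))

    result = pipe(
        prompt="pixar style, 3d animated movie still",
        image=init_image,
        strength=0.6,
    ).images[0]

    # Stream the generated image back as a PNG body
    buffer = io.BytesIO()
    result.save(buffer, format="PNG")
    buffer.seek(0)
    return StreamingResponse(buffer, media_type="image/png")
```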
