Skip to content

v0.426

Compare
Choose a tag to compare
@soten355 soten355 released this 10 May 22:27
· 64 commits to master since this release
8a15188

v0.426 - The Alien Update!

After a few months of debugging, puzzle solving, and continuous testing, MetalDiffusion now fully supports ControlNet!

ControlNet

This is the biggest update to MetalDiffusion. ControlNet required an entire reworking of the sampler and UNET model. The program supports all ControlNet's that are in .safetensors and .pth format.

Pre-Processing

The program is limited currently on the built in pre-processing functions. I wanted to keep the frontend of the program agnostic of a specific module (like PyTorch or TensorFlow), so the pre-processing options are limited to:

  • Canny Edge Detection
  • Soft Edge Detection (also known as HED)

That being said, you can always choose to bypass the pre-processing by simply unchecking Pre-Process Image? in the ControlNet tab.

ControlNet Cache

If the input image for the ControlNet didn't change, then why make ControlNet process it? ControlNet Cache is an option you can choose to safe the results from the ControlNet section temporarily and speed up the image creation process. Of course, the cache will update itself and run ControlNet again if any of these change:

  • Image Size
  • ControlNet Weight
  • Input Image (both for Stable Diffusion and ControlNet)
  • Steps

Video

ControlNet is fully integrated into the Video tab, but is still limited in terms of what it can do frame to frame. The video tab, in general, will need the ability to read frames from a folder as the input image(s) instead of its own frames. BUT, this means that ControlNet will eventually read those frames as well because it's fully integrated. Needless to say, ControlNet will be ready.

Quality of Life

There are a whole host of quality of life enhancements too, including:

  • Quicker Weight loading
  • Better UI
  • Small Increase in Speed
  • Better Folder Structure
  • Audio Notification (Beep) when image generation finishes
  • Utilizing the KerasCV Model Structure, an improved version from Divum Gupta's