
@vbuterin vbuterin commented Jun 14, 2024

I needed to do this to be able to run the model on a 4070 (with 8 GB VRAM).
@ksasso1028

@vbuterin a better option is to run the model in half precision (fp16). Using the --model-half parameter I can run the model on a 3070 Ti (8 GB); the model takes less than 4 GB according to nvidia-smi and is faster than CPU decoding.
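
For context on why half precision helps here: fp32 stores each weight in 4 bytes and fp16 in 2, so casting the weights roughly halves their memory footprint. A minimal sketch of that arithmetic follows. The parameter count is a hypothetical placeholder, not the real size of this model, and the estimate covers weights only; activations and CUDA overhead come on top.

```python
# Rough estimate of weight memory at different precisions.
# ASSUMPTION: num_params below is a made-up placeholder, not this model's size.

BYTES_PER_PARAM = {"fp32": 4, "fp16": 2}


def weights_gib(num_params: int, precision: str) -> float:
    """Return the memory taken by the weights alone, in GiB."""
    return num_params * BYTES_PER_PARAM[precision] / 2**30


if __name__ == "__main__":
    num_params = 1_000_000_000  # hypothetical parameter count
    for precision in ("fp32", "fp16"):
        print(f"{precision}: {weights_gib(num_params, precision):.2f} GiB")
```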


ghost commented Jun 25, 2024

@ksasso1028 --model-half seems to break inpainting though, last I checked.

@ksasso1028

@cherrerajobs what's happening is that the input audio condition (for inpainting or audio init) is not being converted to half precision before running the VAE.
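
The kind of fix described above can be sketched in PyTorch as casting the conditioning audio to the dtype and device of the module that will consume it. This is only an illustration under stated assumptions: `match_module_dtype` is a hypothetical helper, and the `Conv1d` layer stands in for the VAE encoder; neither is taken from this repository's code.

```python
import torch


def match_module_dtype(audio: torch.Tensor, module: torch.nn.Module) -> torch.Tensor:
    """Cast `audio` to the dtype and device of `module`'s parameters.

    Hypothetical helper: the idea is to call it on the inpainting / init
    audio just before encoding, so an fp16 model never sees fp32 input.
    """
    param = next(module.parameters())
    return audio.to(device=param.device, dtype=param.dtype)


if __name__ == "__main__":
    # Stand-in for a VAE encoder whose weights were cast with .half()
    vae = torch.nn.Conv1d(in_channels=2, out_channels=4, kernel_size=3).half()
    audio = torch.randn(1, 2, 16)  # float32, as loaded audio usually is
    audio = match_module_dtype(audio, vae)
    print(audio.dtype)  # torch.float16
```

Reading the dtype from the module rather than hard-coding `.half()` keeps the same call correct when the model runs in full precision.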

@ksasso1028

I can put in a PR for this and will link it back in this comment.
