
Conversation

ppadjinTT

…sors

What does this PR do?

This PR addresses the problem discussed in #12501, where using upscale_dtype = next(iter(self.up_blocks.parameters())).dtype to infer the dtype in the forward pass of the vae.decoder causes a graph break when compiling the model with torch.compile.

The issue is that next(iter(...)) forces the lazy tensors in the initial compiled model pass to materialize, resulting in a graph break, which decreases performance.

This PR proposes a simple fix by inferring the dtype as:

upscale_dtype = self.conv_out.weight.dtype
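The idea can be shown with a minimal standalone sketch (the module names mirror the decoder's but the shapes and structure here are illustrative, not the actual diffusers implementation): reading the dtype from a directly addressed parameter avoids iterating over a parameter generator inside the compiled forward pass.

```python
import torch
import torch.nn as nn

class TinyDecoder(nn.Module):
    """Toy stand-in for a VAE decoder, illustrating the dtype-inference change."""

    def __init__(self):
        super().__init__()
        self.up_blocks = nn.ModuleList([nn.Conv2d(4, 4, 3, padding=1)])
        self.conv_out = nn.Conv2d(4, 3, 3, padding=1)

    def forward(self, x):
        # Old approach: iterate over a parameter generator just to read a dtype;
        # under torch.compile this can force materialization and break the graph.
        # upscale_dtype = next(iter(self.up_blocks.parameters())).dtype

        # Proposed approach: read the dtype from a directly named parameter.
        upscale_dtype = self.conv_out.weight.dtype

        for block in self.up_blocks:
            x = block(x)
        return self.conv_out(x.to(upscale_dtype))

decoder = TinyDecoder()
out = decoder(torch.randn(1, 4, 8, 8))
print(out.shape)  # torch.Size([1, 3, 8, 8])
```

Both variants read the same information in the common case; the difference only matters for how torch.compile traces the forward pass.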

Fixes #12501

Who can review?

@sayakpaul

@sayakpaul
Member

@DN6 WDYT?

@sayakpaul sayakpaul requested a review from DN6 October 20, 2025 16:29
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@ppadjinTT
Author

ppadjinTT commented Oct 23, 2025

I made sure all autoencoder tests pass locally. I would be very thankful if you could take a look, @DN6.

sample = self.conv_in(sample)

- upscale_dtype = next(iter(self.up_blocks.parameters())).dtype
+ upscale_dtype = self.up_blocks[0].resnets[0].norm1.weight.dtype
Collaborator


I think the currently failing tests in the CI are due to the fact that not every decoder block has a norm1 with a weight. Hence the use of the generator here, to avoid such cases.

@ppadjinTT I noticed you initially used self.conv_out.weight here. What was the issue you ran into with that?

Author


Okay, I will change that too, thanks! I initially changed away from self.conv_out.weight because there are some tests that check what happens when conv_out and the upscale blocks have different dtypes.

Collaborator


Could you point me to those tests? Setting it from conv_out seems more robust.

Author


Yup, these are the tests: pytest -svvv tests/models/autoencoders/test_models_autoencoder_kl.py

This is one of the tests from this set that fails: tests/models/autoencoders/test_models_autoencoder_kl.py::AutoencoderKLTests::test_layerwise_casting_inference
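A toy illustration of why a layerwise-casting test can distinguish the two approaches (module names and the manual cast here are illustrative, not the actual test's mechanism): once the up blocks are cast to a lower precision while conv_out stays in float32, the two dtype sources disagree.

```python
import torch
import torch.nn as nn

# Stand-ins for the decoder's up blocks and output convolution.
up_blocks = nn.ModuleList([nn.Conv2d(4, 4, 3, padding=1)])
conv_out = nn.Conv2d(4, 3, 3, padding=1)

# Layerwise casting may put the up blocks in lower precision
# while conv_out remains in float32.
up_blocks.to(torch.float16)

# The two ways of inferring upscale_dtype now disagree.
up_dtype = next(iter(up_blocks.parameters())).dtype
out_dtype = conv_out.weight.dtype
print(up_dtype, out_dtype)  # torch.float16 torch.float32
```

This is the kind of mismatch that makes inferring the dtype from conv_out give a different answer than inferring it from the up blocks themselves.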

Author


I added better logic for inferring the dtype, to cover the case where the simple approach doesn't work.

Collaborator


Hmm, I think we can remove upscale_dtype entirely here. I think all tests should still pass without it.

Author


Okay, let's try that. I'm pushing the change.



Development

Successfully merging this pull request may close these issues.

VAE Decoder next(iter(..)) causes graph break
