[WAN] Adds VACE conditioning to WAN 2.1 #304
base: main
Conversation
img_height, img_width = image.shape[-2:]
scale = min(image_size[0] / img_height, image_size[1] / img_width)
new_height, new_width = int(img_height * scale), int(img_width * scale)
# TODO: should we use jax/TF-based resizing here?
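The snippet above computes an aspect-preserving target size before resizing. A minimal standalone sketch of that computation (`fit_within` is a hypothetical helper name, not part of the PR):

```python
def fit_within(img_height, img_width, image_size):
    """Return (new_height, new_width) scaled to fit inside image_size
    while preserving the aspect ratio (same formula as the snippet above)."""
    scale = min(image_size[0] / img_height, image_size[1] / img_width)
    return int(img_height * scale), int(img_width * scale)

# A 720x1280 frame fitted into a 480x832 target keeps its aspect ratio:
print(fit_within(720, 1280, (480, 832)))  # (468, 832)
```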
Let me know what you think about this.
I don't think it is necessary right now. Wouldn't it require casting to numpy for running the torch function below?
It has worked fine so far; video_processor.preprocess already returns a Torch tensor, but I will keep an eye on it just in case.
Force-pushed from e9086b5 to db2a559
Co-authored-by: ninatu <[email protected]>
blocks.append(block)
self.blocks = blocks

if scan_layers:
I haven't looked too deeply at the VACE architecture, but why is it that scan cannot be used?
I am not sure how to do it, because the nnx.vmap decorator does not differentiate between the separate layers: it simply creates a tensor with an extra axis, so passing a per-layer argument like apply_input_projection=vace_block_id == 0 is, to my knowledge, not feasible. The nnx.scan function could probably be used later in this context if we keep an extra variable that acts as a counter to identify the current iteration, but I was not able to work around the limitation at initialization time (and this parameter cannot be passed later, because it conditions how the layer is initialized). I would like to support this, though; if you have any ideas I would appreciate them!
I can also try to have the Wan layers vmap-initialized and skip it for the Vace ones.
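To illustrate the limitation being discussed (a plain-Python sketch, not flax/nnx code): stacked or scanned layers must share a single construction signature, so a static flag such as apply_input_projection=(block_id == 0) cannot vary per layer under vmap-style initialization, whereas an explicit loop can construct each block differently. The Block class and its body below are hypothetical stand-ins.

```python
class Block:
    def __init__(self, apply_input_projection):
        # Static choice made at construction time: it changes which
        # parameters the layer even owns, so it cannot be supplied
        # at call time the way a runtime input could.
        self.apply_input_projection = apply_input_projection

    def __call__(self, x, cond):
        if self.apply_input_projection:
            x = x + cond  # stand-in for the input projection
        return x * 2      # stand-in for the rest of the block body

# Explicit loop over blocks: only block 0 gets the input projection,
# which is exactly the per-layer asymmetry a uniform stacked init forbids.
blocks = [Block(apply_input_projection=(i == 0)) for i in range(3)]

x = 1
for block in blocks:
    x = block(x, cond=4)
print(x)  # 40: (1 + 4) * 2 * 2 * 2
```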
OK, sounds good; we can add it later.
This brings in the VACE model, taken from diffusers, trying to comply as much as possible with the upstream conventions.