[core] LTX Video 0.9.1 #10330

Merged

a-r-r-o-w merged 19 commits into main from ltxv-0.9.1-integration on Dec 23, 2024

Conversation

a-r-r-o-w
Member

To run conversion:

 python3 scripts/convert_ltx_to_diffusers.py --transformer_ckpt_path /raid/aryan/ltx-new/ltx-video-2b-v0.9.1.safetensors --vae_ckpt_path /raid/aryan/ltx-new/ltx-video-2b-v0.9.1.safetensors --output_path /raid/aryan/ltx-diffusers --dtype bf16 --version 0.9.1 --text_encoder_cache_dir /raid/.cache/huggingface/ --save_pipeline

(I've verified that the conversion for v0.9.0 still works after the current modifications to the script)

Inference after conversion:

import torch
from diffusers import LTXPipeline
from diffusers.utils import export_to_video

pipe = LTXPipeline.from_pretrained("/raid/aryan/ltx-diffusers", torch_dtype=torch.bfloat16)
pipe.to("cuda")

prompt = "A woman with long brown hair and light skin smiles at another woman with long blonde hair. The woman with brown hair wears a black jacket and has a small, barely noticeable mole on her right cheek. The camera angle is a close-up, focused on the woman with brown hair's face. The lighting is warm and natural, likely from the setting sun, casting a soft glow on the scene. The scene appears to be real-life footage"
negative_prompt = "worst quality, inconsistent motion, blurry, jittery, distorted"

video = pipe(
    prompt=prompt,
    negative_prompt=negative_prompt,
    width=704,
    height=480,
    num_frames=161,
    num_inference_steps=50,
    decode_timestep=0.05,  # the 0.9.1 VAE decoder is timestep-conditioned
    generator=torch.Generator(device="cuda").manual_seed(0),
).frames[0]
export_to_video(video, "output.mp4", fps=24)

Output on prompts from the model page:

ltxv-091-output-downscaled.mp4

Will open a weights PR to the official repository soon.

cc @yoavhacohen @SapirW

a-r-r-o-w requested a review from yiyixuxu on December 21, 2024 at 04:25
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

a-r-r-o-w added the roadmap (Add to current release roadmap) label on Dec 21, 2024
Collaborator

yiyixuxu left a comment

that's fast! thanks!

@tin2tin

tin2tin commented Dec 22, 2024

Are the 0.9.1 diffusers weights up on HuggingFace?

@a-r-r-o-w
Member Author

@tin2tin We're still working with their team on how to host the weights. It might take some time :(

Until then, the model is available here as well: https://huggingface.co/a-r-r-o-w/LTX-Video-0.9.1-diffusers, mostly because I need this weight format for finetuning. Once we have it hosted officially, those can be used instead

@nitinmukesh

Until then, the model is available here as well: https://huggingface.co/a-r-r-o-w/LTX-Video-0.9.1-diffusers, mostly because I need this weight format for finetuning. Once we have it hosted officially, those can be used instead

Thank you for sharing.

@a-r-r-o-w
Member Author

cc @DN6 for single-file related support
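
For context, single-file loading of the original checkpoint would presumably look something like the sketch below. This is a hypothetical example rather than a confirmed 0.9.1 API: the from_single_file calls on the LTX model classes, and reusing the converted repo for the remaining components, are assumptions.

import torch
from diffusers import AutoencoderKLLTXVideo, LTXPipeline, LTXVideoTransformer3DModel

# Original single-file checkpoint (path taken from the conversion command above)
ckpt_path = "/raid/aryan/ltx-new/ltx-video-2b-v0.9.1.safetensors"

# Assumption: these classes expose from_single_file once single-file support for 0.9.1 lands
transformer = LTXVideoTransformer3DModel.from_single_file(ckpt_path, torch_dtype=torch.bfloat16)
vae = AutoencoderKLLTXVideo.from_single_file(ckpt_path, torch_dtype=torch.bfloat16)

# Text encoder, tokenizer and scheduler still come from a converted diffusers-format repo
pipe = LTXPipeline.from_pretrained(
    "a-r-r-o-w/LTX-Video-0.9.1-diffusers",
    transformer=transformer,
    vae=vae,
    torch_dtype=torch.bfloat16,
)
pipe.to("cuda")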

a-r-r-o-w requested a review from DN6 on December 22, 2024 at 12:14
@tin2tin

tin2tin commented Dec 23, 2024

Wow, this 0.9.1 (with prompt input) delivers very good quality video, and very fast!
First try:

-932937092_A_woman_with_long_brown_hair_and_light_.mp4

I guess img2vid is not implemented yet - I'm getting this error (sorry if it's premature to test this):

Python311\site-packages\diffusers\models\autoencoders\autoencoder_kl_ltx.py", line 431, in forward
    timestep=temb.flatten(),
             ^^^^^^^^^^^^
AttributeError: 'NoneType' object has no attribute 'flatten'

@a-r-r-o-w
Member Author

Oh, taking a look... This PR should work for I2V as well

@a-r-r-o-w
Member Author

@tin2tin Could you try again? Thanks for catching this!
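
For anyone who wants to exercise the image-to-video path after this fix, a minimal sketch along the lines below should do it. Treat it as a sketch: it uses the interim a-r-r-o-w/LTX-Video-0.9.1-diffusers repo mentioned above, and the conditioning image path is a placeholder.

import torch
from diffusers import LTXImageToVideoPipeline
from diffusers.utils import export_to_video, load_image

pipe = LTXImageToVideoPipeline.from_pretrained(
    "a-r-r-o-w/LTX-Video-0.9.1-diffusers", torch_dtype=torch.bfloat16
)
pipe.to("cuda")

# Placeholder conditioning image; replace with your own first frame
image = load_image("path/to/first_frame.png")
prompt = "A woman with long brown hair and light skin smiles at another woman with long blonde hair."
negative_prompt = "worst quality, inconsistent motion, blurry, jittery, distorted"

# Mirrors the T2V example above, plus the conditioning image
video = pipe(
    image=image,
    prompt=prompt,
    negative_prompt=negative_prompt,
    width=704,
    height=480,
    num_frames=161,
    num_inference_steps=50,
    decode_timestep=0.05,
    generator=torch.Generator(device="cuda").manual_seed(0),
).frames[0]
export_to_video(video, "output_i2v.mp4", fps=24)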

@tin2tin

tin2tin commented Dec 23, 2024

@a-r-r-o-w That was quick! Yes, it is fixed now! Thank you!

1017385988_Photo_of_Photo_of_A_woman_with_long_bro.mp4

a-r-r-o-w requested a review from DN6 on December 23, 2024 at 13:30
a-r-r-o-w merged commit 4b55713 into main on Dec 23, 2024
15 checks passed
a-r-r-o-w deleted the ltxv-0.9.1-integration branch on December 23, 2024 at 14:21
@scarbain

@tin2tin We're still working with their team on how to host the weights. It might take some time :(

Until then, the model is available here as well: https://huggingface.co/a-r-r-o-w/LTX-Video-0.9.1-diffusers, mostly because I need this weight format for finetuning. Once we have it hosted officially, those can be used instead

Hi @a-r-r-o-w, thanks for getting this PR merged! Is there a script for finetuning or LoRA I2V available somewhere? :)

@a-r-r-o-w
Member Author

@scarbain There's one for T2V here: https://github.com/a-r-r-o-w/finetrainers. I2V will be supported soon!
