Initial DiT and VAE changes for LTX inference #783

aartilalwani · 2025-09-05T02:20:05Z

DiT and VAE changes for LTX inference pipeline, will add more optimizations in upcoming PRs including full pipeline

SolitaryThinker · 2025-11-14T20:12:14Z

Could you run pre-commit on this PR?

pre-commit install --hook-type pre-commit --hook-type commit-msg

# You can manually run pre-commit with
pre-commit run --all-files

SolitaryThinker · 2025-11-14T20:13:18Z

examples/inference/basic/basic_ltx.py

+        )
+
+    generator = VideoGenerator.from_pretrained(
+        model_path="data/Lightricks/LTX-Video",


VideoGenerator.from_pretrained() should download the model for you, meaning directly passing Lightricks/LTX-Video as the model_path should work. Could you test and simply this example accordingly? thanks

SolitaryThinker · 2025-11-14T20:14:16Z

fastvideo/configs/pipelines/ltx.py

+    # TODO: fix all of the configs so it's exact match
+
+    # # Text encoder configuration
+    # text_encoder_configs: tuple[EncoderConfig, ...] = field(
+    #     # todo: set max length later
+    #     #def ltx_t5_config():
+    #     #     config = T5Config()
+    #     #     config.tokenizer_kwargs["max_length"] = 128
+    #     #     return config
+
+    #     # @dataclass
+    #     # class LTXConfig(PipelineConfig):
+    #     #     text_encoder_configs: tuple[EncoderConfig, ...] = field(
+    #     #         default_factory=lambda: (ltx_t5_config(), ))
+    #     default_factory=lambda: (T5Config(), ))


Are these not needed? if so please remove

SolitaryThinker · 2025-11-14T20:14:30Z

fastvideo/configs/pipelines/ltx.py

+        # TODO: load differently for each config
+        # Text-to-Video: Only needs the decoder (to decode latents to video)
+        # Image-to-Video: Needs both encoder (to encode input image) and decoder
+        # @dataclass
+        # class LTXT2VConfig(LTXConfig):
+        #     def __post_init__(self):
+        #         super().__post_init__()
+        #         self.vae_config.load_encoder = False
+        #         self.vae_config.load_decoder = True
+
+        # @dataclass
+        # class LTXI2VConfig(LTXConfig):
+        #     def __post_init__(self):
+        #         super().__post_init__()
+        #         self.vae_config.load_encoder = True
+        #         self.vae_config.load_decoder = True
+


Also here, please clean up comments

SolitaryThinker · 2025-11-14T20:14:57Z

fastvideo/configs/sample/ltx.py

+class LTXSamplingParam(SamplingParam):
+    # Video parameters
+    height: int = 512
+    width: int = 704
+
+    # Most defaults set in pipeline config
+    num_inference_steps: int = 50


What's the default number of frames that the official repo generates? Could you add it here as well?

SolitaryThinker · 2025-11-14T20:15:18Z

fastvideo/models/dits/ltxvideo.py

+from diffusers.utils.torch_utils import maybe_allow_in_graph
+# from ..attention import FeedForward
+from fastvideo.attention import DistributedAttention, LocalAttention
+#from diffusers.attention_processor import Attention


clean up comments please

SolitaryThinker · 2025-11-14T20:16:47Z

fastvideo/pipelines/basic/ltxvideo/ltx_pipeline.py

+        # Add ImageVAEEncodingStage for I2V (conditional based on input)
+        # Before LatentPreparation for I2V
+        # if fastvideo_args.pipeline_config.ltx_i2v_mode:
+        #     self.add_stage(
+        #         stage_name="image_vae_encoding_stage",
+        #         stage=LTXImageVAEEncodingStage(vae=self.get_module("vae")))


remove if not needed

initial changes for dit and vae without full optimizations

bc0c1bd

SolitaryThinker added the go Trigger Buildkite CI label Sep 5, 2025

aartilalwani and others added 6 commits September 8, 2025 15:59

remove unnecessary func to fix build error

f659ed9

fix build

34cef9e

pre commit formatting

bae8bf2

add the vae code

b045560

Merge branch 'main' into aartil/addltxinference

05f3188

Merge branch 'hao-ai-lab:main' into aartil/addltxinference

7909ddd

aartilalwani marked this pull request as ready for review September 30, 2025 02:56

aartilalwani marked this pull request as draft September 30, 2025 02:58

aartilalwani added 2 commits September 30, 2025 14:53

initial pipeline logic

d02725f

updates for pr and some of the minimal pipeline changes

e31c7ad

aartilalwani marked this pull request as ready for review October 31, 2025 02:55

Merge branch 'main' into aartil/addltxinference

61323c8

SolitaryThinker self-requested a review November 14, 2025 20:12

SolitaryThinker reviewed Nov 14, 2025

View reviewed changes

SolitaryThinker mentioned this pull request Nov 24, 2025

[Feature] Development Roadmap 2025 Q4/2026 Q1 #899

Open

17 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Initial DiT and VAE changes for LTX inference #783

Initial DiT and VAE changes for LTX inference #783

Uh oh!

aartilalwani commented Sep 5, 2025 •

edited

Loading

Uh oh!

SolitaryThinker commented Nov 14, 2025

Uh oh!

SolitaryThinker Nov 14, 2025

Uh oh!

SolitaryThinker Nov 14, 2025

Uh oh!

SolitaryThinker Nov 14, 2025

Uh oh!

SolitaryThinker Nov 14, 2025

Uh oh!

SolitaryThinker Nov 14, 2025

Uh oh!

SolitaryThinker Nov 14, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Initial DiT and VAE changes for LTX inference #783

Are you sure you want to change the base?

Initial DiT and VAE changes for LTX inference #783

Uh oh!

Conversation

aartilalwani commented Sep 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

SolitaryThinker commented Nov 14, 2025

Uh oh!

SolitaryThinker Nov 14, 2025

Choose a reason for hiding this comment

Uh oh!

SolitaryThinker Nov 14, 2025

Choose a reason for hiding this comment

Uh oh!

SolitaryThinker Nov 14, 2025

Choose a reason for hiding this comment

Uh oh!

SolitaryThinker Nov 14, 2025

Choose a reason for hiding this comment

Uh oh!

SolitaryThinker Nov 14, 2025

Choose a reason for hiding this comment

Uh oh!

SolitaryThinker Nov 14, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

aartilalwani commented Sep 5, 2025 •

edited

Loading