Merge changes #211

Skquark · 2025-05-12T12:27:48Z

No description provided.

…for resizing (#11421) * Set LANCZOS as default interpolation mode for resizing * [train_dreambooth_lora.py] Set LANCZOS as default interpolation mode for resizing

…s in pipelines during torch.compile() (#11085) * test for better torch.compile stuff. * fixes * recompilation and graph break. * clear compilation cache. * change to modeling level test. * allow running compilation tests during nightlies.

* enable group_offload cases and quanto cases on XPU Signed-off-by: YAO Matrix <[email protected]> * use backend APIs Signed-off-by: Yao Matrix <[email protected]> * fix style Signed-off-by: Yao Matrix <[email protected]> --------- Signed-off-by: YAO Matrix <[email protected]> Signed-off-by: Yao Matrix <[email protected]>

* enable test_layerwise_casting_memory cases on XPU Signed-off-by: Yao Matrix <[email protected]> * fix style Signed-off-by: Yao Matrix <[email protected]> --------- Signed-off-by: Yao Matrix <[email protected]>

fix import.

…follow up (#11426) * Update train_text_to_image.py * update

…ipts follow up (#11427) * Update train_text_to_image_lora.py * update_train_text_to_image_lora

* enable gguf test cases on XPU Signed-off-by: YAO Matrix <[email protected]> * make SD35LargeGGUFSingleFileTests::test_pipeline_inference pas Signed-off-by: root <[email protected]> * make FluxControlLoRAGGUFTests::test_lora_loading pass Signed-off-by: Yao Matrix <[email protected]> * polish code Signed-off-by: Yao Matrix <[email protected]> * Apply style fixes --------- Signed-off-by: YAO Matrix <[email protected]> Signed-off-by: root <[email protected]> Signed-off-by: Yao Matrix <[email protected]> Co-authored-by: root <[email protected]> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

remove unnecessary pipeline moving to cpu in validation Co-authored-by: Sayak Paul <[email protected]>

* Fixing missing provider options argument * Adding if else for provider options * Apply suggestions from code review Co-authored-by: YiYi Xu <[email protected]> * Apply style fixes * Update src/diffusers/pipelines/onnx_utils.py Co-authored-by: YiYi Xu <[email protected]> * Update src/diffusers/pipelines/onnx_utils.py Co-authored-by: YiYi Xu <[email protected]> --------- Co-authored-by: Uros Petkovic <[email protected]> Co-authored-by: Sayak Paul <[email protected]> Co-authored-by: YiYi Xu <[email protected]> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

…lNet training (#11449) Set LANCZOS as the default interpolation for image resizing

) raise warning instead of error

Signed-off-by: Yao Matrix <[email protected]>

udpate

Signed-off-by: Yao Matrix <[email protected]>

* enable unidiffuser cases on XPU Signed-off-by: Yao Matrix <[email protected]> * fix a typo Signed-off-by: Yao Matrix <[email protected]> * fix style Signed-off-by: Yao Matrix <[email protected]> --------- Signed-off-by: Yao Matrix <[email protected]>

* Add generic support for Intel Gaudi accelerator (hpu device) Signed-off-by: Daniel Socek <[email protected]> Co-authored-by: Libin Tang <[email protected]> * Add loggers for generic HPU support Signed-off-by: Daniel Socek <[email protected]> * Refactor hpu support with is_hpu_available() logic Signed-off-by: Daniel Socek <[email protected]> * Fix style for hpu support update Signed-off-by: Daniel Socek <[email protected]> * Decouple soft HPU check from hard device validation to support HPU migration Signed-off-by: Daniel Socek <[email protected]> --------- Signed-off-by: Daniel Socek <[email protected]> Co-authored-by: Libin Tang <[email protected]> Co-authored-by: Sayak Paul <[email protected]>

* upload StableDiffusion3InstructPix2PixPipeline * Move to community * Add readme * Fix images * remove images * Change image url * fix * Apply style fixes

* make safe diffusion test cases pass on XPU and A100 Signed-off-by: Yao Matrix <[email protected]> * calibrate A100 expected values Signed-off-by: YAO Matrix <[email protected]> --------- Signed-off-by: Yao Matrix <[email protected]> Signed-off-by: YAO Matrix <[email protected]>

…for impactful models (#11431) * Update test_models_transformer_hunyuan_video.py * update --------- Co-authored-by: Sayak Paul <[email protected]>

* Add LANCZOS as default interplotation mode. * LANCZOS as default interplotation * LANCZOS as default interplotation mode * Added LANCZOS as default interplotation mode

…pass on xpu (#11461) * make autoencoders. controlnet_flux and wan_transformer3d_single_file pass on XPU Signed-off-by: Yao Matrix <[email protected]> * Apply style fixes --------- Signed-off-by: Yao Matrix <[email protected]> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Aryan <[email protected]>

* [tests] Add torch.compile() test for WanTransformer3DModel * fix wan recompilation issues. * style --------- Co-authored-by: tongyu0924 <[email protected]>

* Fix typos in docs and comments * Apply style fixes --------- Co-authored-by: Sayak Paul <[email protected]> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

xfail recent pipeline tests for specific methods.

* cache packages_distributions * remove unused exception reference * make style Signed-off-by: Vladimir Mandic <[email protected]> * change name to _package_map --------- Signed-off-by: Vladimir Mandic <[email protected]> Co-authored-by: DN6 <[email protected]>

* reformat * initial * fin * review * inference * feedback * feedback * feedback

* refactor adapter docs * ip-adapter * ip adapter * fix toctree * fix toctree * lora * images * controlnet * feedback * controlnet * t2i * fix typo * feedback --------- Co-authored-by: Sayak Paul <[email protected]>

…rpolation mode for image resizing (#11471)

…rpolation mode for image resizing (#11472) * [train_controlnet_sdxl] Add LANCZOS as the default interpolation mode for image resizing * [train_dreambooth_lora_flux_advanced] Add LANCZOS as the default interpolation mode for image resizing

…11492) * Set LANCZOS as the default interpolation method for image resizing. * style: run make style and quality checks

…terpolation. (#11496) * Update training script for txt to img sdxl with lora supp with new interpolation. * ran make style and make quality.

update

* update dep table. * fix

* use removeprefix to preserve sanity. * f-string.

* add transformer * add pipeline * fixes * make fix-copies * update * add flux mu shift * update example snippet * debug * cleanup * batch_size=1 optimization * add pipeline test * fix for model cpu offloading' * add last_image support; credits: lllyasviel/FramePack#167 * update example with flf2v * update penguin url * fix test * address review comment: #11428 (comment) * address review comment: #11428 (comment) * Update src/diffusers/pipelines/hunyuan_video/pipeline_hunyuan_video_framepack.py --------- Co-authored-by: Linoy Tsaban <[email protected]>

* enable lora cases on XPU Signed-off-by: Yao Matrix <[email protected]> * remove hunyuanvideo xpu expectation Signed-off-by: Yao Matrix <[email protected]> --------- Signed-off-by: Yao Matrix <[email protected]>

…ORA conversion utility (#11441) (#11487) * [lora_conversion] Enhance key handling for OneTrainer components in LORA conversion utility (#11441) * Update src/diffusers/loaders/lora_conversion_utils.py Co-authored-by: Sayak Paul <[email protected]> --------- Co-authored-by: Sayak Paul <[email protected]>

* minor updates to bitsandbytes docs. * Apply suggestions from code review

* begin transformer conversion * refactor * refactor * refactor * refactor * refactor * refactor * update * add conversion script * add pipeline * make fix-copies * remove einops * update docs * gradient checkpointing * add transformer test * update * debug * remove prints * match sigmas * add vae pt. 1 * finish CV* vae * update * update * update * update * update * update * make fix-copies * update * make fix-copies * fix * update * update * make fix-copies * update * update tests * handle device and dtype for safety checker; required in latest diffusers * remove enable_gqa and use repeat_interleave instead * enforce safety checker; use dummy checker in fast tests * add review suggestion for ONNX export Co-Authored-By: Asfiya Baig <[email protected]> * fix safety_checker issues when not passed explicitly We could either do what's done in this commit, or update the Cosmos examples to explicitly pass the safety checker * use cosmos guardrail package * auto format docs * update conversion script to support 14B models * update name CosmosPipeline -> CosmosTextToWorldPipeline * update docs * fix docs * fix group offload test failing for vae --------- Co-authored-by: Asfiya Baig <[email protected]>

up

This reverts commit 87e508f.

* add lora_alpha and lora_dropout * Apply style fixes * add lora_alpha and lora_dropout * Apply style fixes * revert lora_alpha until #11324 is merged * Apply style fixes * empty commit --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

* test permission * Add cross attention type for Sana-Sprint. * Add Sana-Sprint training script in diffusers. * make style && make quality; * modify the attention processor with `set_attn_processor` and change `SanaAttnProcessor3_0` to `SanaVanillaAttnProcessor` * Add import for SanaVanillaAttnProcessor * Add README file. * Apply suggestions from code review * style * Update examples/research_projects/sana/README.md --------- Co-authored-by: lawrence-cj <[email protected]> Co-authored-by: Sayak Paul <[email protected]>

fix

fix audioldm2 for transformers main.

* feat: pipeline-level quant config. Co-authored-by: SunMarc <[email protected]> condition better. support mapping. improvements. [Quantization] Add Quanto backend (#10756) * update * updaet * update * update * update * update * update * update * update * update * update * update * Update docs/source/en/quantization/quanto.md Co-authored-by: Sayak Paul <[email protected]> * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * Update src/diffusers/quantizers/quanto/utils.py Co-authored-by: Sayak Paul <[email protected]> * update * update --------- Co-authored-by: Sayak Paul <[email protected]> [Single File] Add single file loading for SANA Transformer (#10947) * added support for from_single_file * added diffusers mapping script * added testcase * bug fix * updated tests * corrected code quality * corrected code quality --------- Co-authored-by: Dhruv Nair <[email protected]> [LoRA] Improve warning messages when LoRA loading becomes a no-op (#10187) * updates * updates * updates * updates * notebooks revert * fix-copies. * seeing * fix * revert * fixes * fixes * fixes * remove print * fix * conflicts ii. * updates * fixes * better filtering of prefix. --------- Co-authored-by: hlky <[email protected]> [LoRA] CogView4 (#10981) * update * make fix-copies * update [Tests] improve quantization tests by additionally measuring the inference memory savings (#11021) * memory usage tests * fixes * gguf [`Research Project`] Add AnyText: Multilingual Visual Text Generation And Editing (#8998) * Add initial template * Second template * feat: Add TextEmbeddingModule to AnyTextPipeline * feat: Add AuxiliaryLatentModule template to AnyTextPipeline * Add bert tokenizer from the anytext repo for now * feat: Update AnyTextPipeline's modify_prompt method This commit adds improvements to the modify_prompt method in the AnyTextPipeline class. The method now handles special characters and replaces selected string prompts with a placeholder. Additionally, it includes a check for Chinese text and translation using the trans_pipe. * Fill in the `forward` pass of `AuxiliaryLatentModule` * `make style && make quality` * `chore: Update bert_tokenizer.py with a TODO comment suggesting the use of the transformers library` * Update error handling to raise and logging * Add `create_glyph_lines` function into `TextEmbeddingModule` * make style * Up * Up * Up * Up * Remove several comments * refactor: Remove ControlNetConditioningEmbedding and update code accordingly * Up * Up * up * refactor: Update AnyTextPipeline to include new optional parameters * up * feat: Add OCR model and its components * chore: Update `TextEmbeddingModule` to include OCR model components and dependencies * chore: Update `AuxiliaryLatentModule` to include VAE model and its dependencies for masked image in the editing task * `make style` * refactor: Update `AnyTextPipeline`'s docstring * Update `AuxiliaryLatentModule` to include info dictionary so that text processing is done once * simplify * `make style` * Converting `TextEmbeddingModule` to ordinary `encode_prompt()` function * Simplify for now * `make style` * Up * feat: Add scripts to convert AnyText controlnet to diffusers * `make style` * Fix: Move glyph rendering to `TextEmbeddingModule` from `AuxiliaryLatentModule` * make style * Up * Simplify * Up * feat: Add safetensors module for loading model file * Fix device issues * Up * Up * refactor: Simplify * refactor: Simplify code for loading models and handling data types * `make style` * refactor: Update to() method in FrozenCLIPEmbedderT3 and TextEmbeddingModule * refactor: Update dtype in embedding_manager.py to match proj.weight * Up * Add attribution and adaptation information to pipeline_anytext.py * Update usage example * Will refactor `controlnet_cond_embedding` initialization * Add `AnyTextControlNetConditioningEmbedding` template * Refactor organization * style * style * Move custom blocks from `AuxiliaryLatentModule` to `AnyTextControlNetConditioningEmbedding` * Follow one-file policy * style * [Docs] Update README and pipeline_anytext.py to use AnyTextControlNetModel * [Docs] Update import statement for AnyTextControlNetModel in pipeline_anytext.py * [Fix] Update import path for ControlNetModel, ControlNetOutput in anytext_controlnet.py * Refactor AnyTextControlNet to use configurable conditioning embedding channels * Complete control net conditioning embedding in AnyTextControlNetModel * up * [FIX] Ensure embeddings use correct device in AnyTextControlNetModel * up * up * style * [UPDATE] Revise README and example code for AnyTextPipeline integration with DiffusionPipeline * [UPDATE] Update example code in anytext.py to use correct font file and improve clarity * down * [UPDATE] Refactor BasicTokenizer usage to a new Checker class for text processing * update pillow * [UPDATE] Remove commented-out code and unnecessary docstring in anytext.py and anytext_controlnet.py for improved clarity * [REMOVE] Delete frozen_clip_embedder_t3.py as it is in the anytext.py file * [UPDATE] Replace edict with dict for configuration in anytext.py and RecModel.py for consistency * 🆙 * style * [UPDATE] Revise README.md for clarity, remove unused imports in anytext.py, and add author credits in anytext_controlnet.py * style * Update examples/research_projects/anytext/README.md Co-authored-by: Aryan <[email protected]> * Remove commented-out image preparation code in AnyTextPipeline * Remove unnecessary blank line in README.md [Quantization] Allow loading TorchAO serialized Tensor objects with torch>=2.6 (#11018) * update * update * update * update * update * update * update * update * update fix: mixture tiling sdxl pipeline - adjust gerating time_ids & embeddings (#11012) small fix on generating time_ids & embeddings [LoRA] support wan i2v loras from the world. (#11025) * support wan i2v loras from the world. * remove copied from. * upates * add lora. Fix SD3 IPAdapter feature extractor (#11027) chore: fix help messages in advanced diffusion examples (#10923) Fix missing **kwargs in lora_pipeline.py (#11011) * Update lora_pipeline.py * Apply style fixes * fix-copies --------- Co-authored-by: hlky <[email protected]> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Fix for multi-GPU WAN inference (#10997) Ensure that hidden_state and shift/scale are on the same device when running with multiple GPUs Co-authored-by: Jimmy <39@🇺🇸.com> [Refactor] Clean up import utils boilerplate (#11026) * update * update * update Use `output_size` in `repeat_interleave` (#11030) [hybrid inference 🍯🐝] Add VAE encode (#11017) * [hybrid inference 🍯🐝] Add VAE encode * _toctree: add vae encode * Add endpoints, tests * vae_encode docs * vae encode benchmarks * api reference * changelog * Update docs/source/en/hybrid_inference/overview.md Co-authored-by: Sayak Paul <[email protected]> * update --------- Co-authored-by: Sayak Paul <[email protected]> Wan Pipeline scaling fix, type hint warning, multi generator fix (#11007) * Wan Pipeline scaling fix, type hint warning, multi generator fix * Apply suggestions from code review [LoRA] change to warning from info when notifying the users about a LoRA no-op (#11044) * move to warning. * test related changes. Rename Lumina(2)Text2ImgPipeline -> Lumina(2)Pipeline (#10827) * Rename Lumina(2)Text2ImgPipeline -> Lumina(2)Pipeline --------- Co-authored-by: YiYi Xu <[email protected]> making ```formatted_images``` initialization compact (#10801) compact writing Co-authored-by: Sayak Paul <[email protected]> Co-authored-by: YiYi Xu <[email protected]> Fix aclnnRepeatInterleaveIntWithDim error on NPU for get_1d_rotary_pos_embed (#10820) * get_1d_rotary_pos_embed support npu * Update src/diffusers/models/embeddings.py --------- Co-authored-by: Kai zheng <[email protected]> Co-authored-by: hlky <[email protected]> Co-authored-by: YiYi Xu <[email protected]> [Tests] restrict memory tests for quanto for certain schemes. (#11052) * restrict memory tests for quanto for certain schemes. * Apply suggestions from code review Co-authored-by: Dhruv Nair <[email protected]> * fixes * style --------- Co-authored-by: Dhruv Nair <[email protected]> [LoRA] feat: support non-diffusers wan t2v loras. (#11059) feat: support non-diffusers wan t2v loras. [examples/controlnet/train_controlnet_sd3.py] Fixes #11050 - Cast prompt_embeds and pooled_prompt_embeds to weight_dtype to prevent dtype mismatch (#11051) Fix: dtype mismatch of prompt embeddings in sd3 controlnet training Co-authored-by: Andreas Jörg <[email protected]> Co-authored-by: Sayak Paul <[email protected]> reverts accidental change that removes attn_mask in attn. Improves fl… (#11065) reverts accidental change that removes attn_mask in attn. Improves flux ptxla by using flash block sizes. Moves encoding outside the for loop. Co-authored-by: Juan Acevedo <[email protected]> Fix deterministic issue when getting pipeline dtype and device (#10696) Co-authored-by: Dhruv Nair <[email protected]> [Tests] add requires peft decorator. (#11037) * add requires peft decorator. * install peft conditionally. * conditional deps. Co-authored-by: DN6 <[email protected]> --------- Co-authored-by: DN6 <[email protected]> CogView4 Control Block (#10809) * cogview4 control training --------- Co-authored-by: OleehyO <[email protected]> Co-authored-by: yiyixuxu <[email protected]> [CI] pin transformers version for benchmarking. (#11067) pin transformers version for benchmarking. updates Fix Wan I2V Quality (#11087) * fix_wan_i2v_quality * Update src/diffusers/pipelines/wan/pipeline_wan_i2v.py Co-authored-by: YiYi Xu <[email protected]> * Update src/diffusers/pipelines/wan/pipeline_wan_i2v.py Co-authored-by: YiYi Xu <[email protected]> * Update src/diffusers/pipelines/wan/pipeline_wan_i2v.py Co-authored-by: YiYi Xu <[email protected]> * Update pipeline_wan_i2v.py --------- Co-authored-by: YiYi Xu <[email protected]> Co-authored-by: hlky <[email protected]> LTX 0.9.5 (#10968) * update --------- Co-authored-by: YiYi Xu <[email protected]> Co-authored-by: hlky <[email protected]> make PR GPU tests conditioned on styling. (#11099) Group offloading improvements (#11094) update Fix pipeline_flux_controlnet.py (#11095) * Fix pipeline_flux_controlnet.py * Fix style update readme instructions. (#11096) Co-authored-by: Juan Acevedo <[email protected]> Resolve stride mismatch in UNet's ResNet to support Torch DDP (#11098) Modify UNet's ResNet implementation to resolve stride mismatch in Torch's DDP Fix Group offloading behaviour when using streams (#11097) * update * update Quality options in `export_to_video` (#11090) * Quality options in `export_to_video` * make style improve more. add placeholders for docstrings. formatting. smol fix. solidify validation and annotation * Revert "feat: pipeline-level quant config." This reverts commit 316ff46. * feat: implement pipeline-level quantization config Co-authored-by: SunMarc <[email protected]> * update * fixes * fix validation. * add tests and other improvements. * add tests * import quality * remove prints. * add docs. * fixes to docs. * doc fixes. * doc fixes. * add validation to the input quantization_config. * clarify recommendations. * docs * add to ci. * todo. --------- Co-authored-by: SunMarc <[email protected]>

…otswapping (#11322) * refactor hotswap tester. * fix seeds.. * add to nightly ci. * move comment. * move to nightly

* support non-diffusers hidream loras * make fix-copies

* enable 7 cases on XPU Signed-off-by: Yao Matrix <[email protected]> * calibrate A100 expectations Signed-off-by: YAO Matrix <[email protected]> --------- Signed-off-by: Yao Matrix <[email protected]> Signed-off-by: YAO Matrix <[email protected]>

fix: update latents dtype to match vae

* enable dit integration test on XPU Signed-off-by: Yao Matrix <[email protected]> * fix style Signed-off-by: Yao Matrix <[email protected]> --------- Signed-off-by: Yao Matrix <[email protected]>

* detect xpu in print_env Signed-off-by: YAO Matrix <[email protected]> * enhance code, test passed on XPU Signed-off-by: Yao Matrix <[email protected]> --------- Signed-off-by: YAO Matrix <[email protected]> Signed-off-by: Yao Matrix <[email protected]>

update

* start. * add tests for framepack transformer model. * merge conflicts. * make to square. * fixes

* support framepack f1 * update docs * update toctree * remove typo

* enable kandinsky2_2 integration test cases on XPU Signed-off-by: Yao Matrix <[email protected]> * fix style Signed-off-by: Yao Matrix <[email protected]> * enable latent_diffusion, dance_diffusion, musicldm, shap_e integration uts on xpu Signed-off-by: Yao Matrix <[email protected]> * fix style Signed-off-by: Yao Matrix <[email protected]> --------- Signed-off-by: Yao Matrix <[email protected]> Co-authored-by: Aryan <[email protected]>

merterbak and others added 30 commits April 26, 2025 01:58

[train_dreambooth_lora.py] Set LANCZOS as default interpolation mode …

bd96a08

…for resizing (#11421) * Set LANCZOS as default interpolation mode for resizing * [train_dreambooth_lora.py] Set LANCZOS as default interpolation mode for resizing

enable test_layerwise_casting_memory cases on XPU (#11406)

a7e9f85

* enable test_layerwise_casting_memory cases on XPU Signed-off-by: Yao Matrix <[email protected]> * fix style Signed-off-by: Yao Matrix <[email protected]> --------- Signed-off-by: Yao Matrix <[email protected]>

[tests] fix import. (#11434)

0e3f271

fix import.

[train_text_to_image] Better image interpolation in training scripts …

b3b04fe

…follow up (#11426) * Update train_text_to_image.py * update

[train_text_to_image_lora] Better image interpolation in training scr…

3da98e7

…ipts follow up (#11427) * Update train_text_to_image_lora.py * update_train_text_to_image_lora

[Hi-Dream LoRA] fix bug in validation (#11439)

0ac1d5b

remove unnecessary pipeline moving to cpu in validation Co-authored-by: Sayak Paul <[email protected]>

Set LANCZOS as the default interpolation for image resizing in Contro…

58431f1

…lNet training (#11449) Set LANCZOS as the default interpolation for image resizing

Raise warning instead of error for block offloading with streams (#11425

8fe5a14

) raise warning instead of error

enable marigold_intrinsics cases on XPU (#11445)

60892c5

Signed-off-by: Yao Matrix <[email protected]>

torch.compile fullgraph compatibility for Hunyuan Video (#11457)

c865115

udpate

enable consistency test cases on XPU, all passed (#11446)

fbe2fe5

Signed-off-by: Yao Matrix <[email protected]>

Add StableDiffusion3InstructPix2PixPipeline (#11378)

8cd7426

* upload StableDiffusion3InstructPix2PixPipeline * Move to community * Add readme * Fix images * remove images * Change image url * fix * Apply style fixes

[test_models_transformer_hunyuan_video] help us test torch.compile() …

38ced7e

…for impactful models (#11431) * Update test_models_transformer_hunyuan_video.py * update --------- Co-authored-by: Sayak Paul <[email protected]>

Add LANCZOS as default interplotation mode. (#11463)

daf0a23

* Add LANCZOS as default interplotation mode. * LANCZOS as default interplotation * LANCZOS as default interplotation mode * Added LANCZOS as default interplotation mode

[WAN] fix recompilation issues (#11475)

d70f8ee

* [tests] Add torch.compile() test for WanTransformer3DModel * fix wan recompilation issues. * style --------- Co-authored-by: tongyu0924 <[email protected]>

Fix typos in docs and comments (#11416)

86294d3

* Fix typos in docs and comments * Apply style fixes --------- Co-authored-by: Sayak Paul <[email protected]> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

[tests] xfail recent pipeline tests for specific methods. (#11469)

5dcdf4a

xfail recent pipeline tests for specific methods.

[docs] Memory optims (#11385)

b848d47

* reformat * initial * fin * review * inference * feedback * feedback * feedback

[docs] Adapters (#11331)

e23705e

* refactor adapter docs * ip-adapter * ip adapter * fix toctree * fix toctree * lora * images * controlnet * feedback * controlnet * t2i * fix typo * feedback --------- Co-authored-by: Sayak Paul <[email protected]>

[train_dreambooth_lora_sdxl_advanced] Add LANCZOS as the default inte…

ed6cf52

…rpolation mode for image resizing (#11471)

yijun-lee and others added 29 commits May 5, 2025 12:18

Set LANCZOS as the default interpolation method for image resizing. (#…

9c29e93

…11492) * Set LANCZOS as the default interpolation method for image resizing. * style: run make style and quality checks

Update training script for txt to img sdxl with lora supp with new in…

ed4efbd

…terpolation. (#11496) * Update training script for txt to img sdxl with lora supp with new interpolation. * ran make style and make quality.

Fix torchao docs typo for fp8 granular quantization (#11473)

1fa5639

update

Update setup.py to pin min version of peft (#11502)

53f1043

update dep table. (#11504)

d88ae1f

* update dep table. * fix

[LoRA] use removeprefix to preserve sanity. (#11493)

10bee52

* use removeprefix to preserve sanity. * f-string.

enable lora cases on XPU (#11506)

8c661ea

* enable lora cases on XPU Signed-off-by: Yao Matrix <[email protected]> * remove hunyuanvideo xpu expectation Signed-off-by: Yao Matrix <[email protected]> --------- Signed-off-by: Yao Matrix <[email protected]>

[docs] minor updates to bitsandbytes docs. (#11509)

fb29132

* minor updates to bitsandbytes docs. * Apply suggestions from code review

clean up the __Init__ for stable_diffusion (#11500)

53bd367

up

fix audioldm

87e508f

Revert "fix audioldm"

c5c34a4

This reverts commit 87e508f.

Conditionally import torchvision in Cosmos transformer (#11524)

6674a51

fix

[tests] fix audioldm2 for transformers main. (#11522)

393aefc

fix audioldm2 for transformers main.

[Tests] Enable more general testing for torch.compile() with LoRA h…

7acf834

…otswapping (#11322) * refactor hotswap tester. * fix seeds.. * add to nightly ci. * move comment. * move to nightly

[LoRA] support non-diffusers hidream loras (#11532)

0c47c95

* support non-diffusers hidream loras * make fix-copies

enable 7 cases on XPU (#11503)

2d38089

* enable 7 cases on XPU Signed-off-by: Yao Matrix <[email protected]> * calibrate A100 expectations Signed-off-by: YAO Matrix <[email protected]> --------- Signed-off-by: Yao Matrix <[email protected]> Signed-off-by: YAO Matrix <[email protected]>

[LTXPipeline] Update latents dtype to match VAE dtype (#11533)

3c0a012

fix: update latents dtype to match vae

enable dit integration cases on xpu (#11523)

d6bf268

* enable dit integration test on XPU Signed-off-by: Yao Matrix <[email protected]> * fix style Signed-off-by: Yao Matrix <[email protected]> --------- Signed-off-by: Yao Matrix <[email protected]>

Change Framepack transformer layer initialization order (#11535)

92fe689

update

[tests] add tests for framepack transformer model. (#11520)

01abfc8

* start. * add tests for framepack transformer model. * merge conflicts. * make to square. * fixes

Hunyuan Video Framepack F1 (#11534)

e48f6ae

* support framepack f1 * update docs * update toctree * remove typo

Skquark merged commit 51b3ffd into Skquark:main May 12, 2025
4 of 6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Merge changes #211

Merge changes #211

Skquark commented May 12, 2025

Merge changes #211

Merge changes #211

Conversation

Skquark commented May 12, 2025