Master #5

stvoler · 2025-07-24T10:07:20Z

No description provided.

* mmdit-x * add support for sd3.5 medium * add skip layer guidance support (mmdit only) * ignore slg if slg_scale is zero (optimization) * init out_skip once * slg support for flux (expermiental) * warn if version doesn't support slg * refactor slg cli args * set default slg_scale to 0 (oops) * format code --------- Co-authored-by: leejet <[email protected]>

* Flux Lite (Freepik) support * format code --------- Co-authored-by: leejet <[email protected]>

* first attempt at updating to photomaker v2 * continue adding photomaker v2 modules * finishing the last few pieces for photomaker v2; id_embeds need to be done by a manual step and pass as an input file * added a name converter for Photomaker V2; build ok * more debugging underway * failing at cuda mat_mul * updated chunk_half to be more efficient; redo feedforward * fixed a bug: carefully using ggml_view_4d to get chunks of a tensor; strides need to be recalculated or set properly; still failing at soft_max cuda op * redo weight calculation and weight*v * fixed a bug now Photomaker V2 kinds of working * add python script for face detection (Photomaker V2 needs) * updated readme for photomaker * fixed a bug causing PMV1 crashing; both V1 and V2 work * fixed clean_input_ids for PMV2 * fixed a double counting bug in tokenize_with_trigger_token * updated photomaker readme * removed some commented code * improved reconstructing class word free prompt * changed reading id_embed to raw binary using existing load tensor function; this is more efficient than using model load and also makes it easier to work with sd server * minor clean up --------- Co-authored-by: bssrdf <[email protected]>

* repair flash attention in _ext this does not fix the currently broken fa behind the define, which is only used by VAE Co-authored-by: FSSRepo <[email protected]> * make flash attention in the diffusion model a runtime flag no support for sd3 or video * remove old flash attention option and switch vae over to attn_ext * update docs * format code --------- Co-authored-by: FSSRepo <[email protected]> Co-authored-by: leejet <[email protected]>

…#490) * Refactor: wtype per tensor * Fix default args * refactor: fix flux * Refactor photmaker v2 support * unet: refactor the refactoring * Refactor: fix controlnet and tae * refactor: upscaler * Refactor: fix runtime type override * upscaler: use fp16 again * Refactor: Flexible sd3 arch * Refactor: Flexible Flux arch * format code --------- Co-authored-by: leejet <[email protected]>

Signed-off-by: Xiaodong Ye <[email protected]>

With absolute paths, there's no way to change base url.

stduhpf and others added 30 commits November 23, 2024 11:15

fix: improve clip text_projection support (leejet#397)

9b1d90b

feat: add flux 1 lite 8B (freepik) support (leejet#474)

6ea8122

* Flux Lite (Freepik) support * format code --------- Co-authored-by: leejet <[email protected]>

docs: update readme (leejet#462)

0758544

feat: add support for loading F8_E5M2 weights (leejet#460)

8f94efa

fix: typo in clip-g encoder arg (leejet#472)

8c7719f

docs: update README.md (leejet#452)

b99cbfe

docs: update readme, add python bindings (leejet#423)

ea9b647

refactor: add some sd vesion helper functions

b5f4932

sync: update ggml

c3eeb66

fix: remove default variables in c headers (leejet#478)

53b415f

fix: use ggml_nn_attention in vae

4570715

feat: remove type restrictions (leejet#489)

9148b98

chore: remove rocm5.5 build temporarily

9578fdc

feat: support more LoRA models (leejet#520)

cc92a6a

feat: support Inpaint models (leejet#511)

8f4ab9a

fix: fix metal build (leejet#513)

0d9d665

feat: support Moore Threads GPU (leejet#529)

5cc74d1

Signed-off-by: Xiaodong Ye <[email protected]>

fix: fix typo for skip layers parameters (leejet#492)

b5cc142

feat: support 16 channel tae (taesd/taef1) (leejet#527)

d50473d

feat: use pretty-progress for tensor loading (leejet#516)

348a54e

chore: change SD_CUBLAS/SD_USE_CUBLAS to SD_CUDA/SD_USE_CUDA

dcf91f9

chore: fix CI windows release artifacts (leejet#532)

27edb76

chore: fix amd rocm build (leejet#571)

b70aaa6

chore: fix CUDA on GitHub Action (leejet#567)

4fe83d5

fix: avoid sd2((non inpaint) crash on v-pred check (leejet#537)

587a37b

feat: add sdxl v-pred suppport (leejet#536)

d9b5942

stduhpf and others added 8 commits July 17, 2025 17:30

Use progress_callback

a1e82ec

update progress bars (+fixes)

4a1fb94

frontend.cpp: use relative paths in fetch() (stduhpf#4) (cherry-picked)

226caf0

With absolute paths, there's no way to change base url.

rebaseon master and apply api changes

bcba77c

server: update api

febe7b5

server: use new naming convention

47e1744

server: change API

2ac5e31

server: redirect base path to ui

0934b51

stvoler force-pushed the master branch from f14e2d8 to 0934b51 Compare July 24, 2025 12:26

stvoler added 21 commits July 24, 2025 16:41

build-windows.yml

5794254

build-mac

1db8b0f

build-mac.yml

d53d94d

sd-mac-build-1

522653c

s1

c68b40a

metal

08cbb34

amd1

431715b

build-linux.yml

ddf0c97

build-linux-1

bd65a26

linux 2

ec0fb17

linux 3

aaf05c2

off

52b0059

build-linux.yml

b1617d7

build linux

9ad5bbb

linux

ed54dfe

build-linux.yml

57501e7

linux 1

e8667b8

linux 2

fa3cf22

cuda 1

a71deff

cuda 2

bc3bf49

cuda 3

ec44f4b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Master #5

Master #5

Uh oh!

stvoler commented Jul 24, 2025

Uh oh!

Uh oh!

Master #5

Are you sure you want to change the base?

Master #5

Uh oh!

Conversation

stvoler commented Jul 24, 2025

Uh oh!

Uh oh!