Releases · leejet/stable-diffusion.cpp

23 Nov 06:27

b5f4932

master-b5f4932 Latest

Latest

refactor: add some sd vesion helper functions

Assets 12

23 Nov 04:42

github-actions

master-9b1d90b

9b1d90b

master-9b1d90b

fix: improve clip text_projection support (#397)

Assets 12

23 Nov 05:07

github-actions

master-8f94efa

8f94efa

master-8f94efa

feat: add support for loading F8_E5M2 weights (#460)

Assets 12

23 Nov 05:07

github-actions

master-8c7719f

8c7719f

master-8c7719f

fix: typo in clip-g encoder arg (#472)

Assets 12

23 Nov 04:58

github-actions

master-6ea8122

6ea8122

master-6ea8122

feat: add flux 1 lite 8B (freepik) support (#474)

* Flux Lite (Freepik) support

* format code

---------

Co-authored-by: leejet <[email protected]>

Assets 12

23 Nov 04:42

github-actions

master-65fa646

65fa646

master-65fa646

feat: add sd3.5 medium and skip layer guidance support (#451)

* mmdit-x

* add support for sd3.5 medium

* add skip layer guidance support (mmdit only)

* ignore slg if slg_scale is zero (optimization)

* init out_skip once

* slg support for flux (expermiental)

* warn if version doesn't support slg

* refactor slg cli args

* set default slg_scale to 0 (oops)

* format code

---------

Co-authored-by: leejet <[email protected]>

Assets 12

23 Nov 05:15

github-actions

master-2b1bc06

2b1bc06

master-2b1bc06

feat: add PhotoMaker Version 2 support (#358)

* first attempt at updating to photomaker v2

* continue adding photomaker v2 modules

* finishing the last few pieces for photomaker v2; id_embeds need to be done by a manual step and pass as an input file

* added a name converter for Photomaker V2; build ok

* more debugging underway

* failing at cuda mat_mul

* updated chunk_half to be more efficient; redo feedforward

* fixed a bug: carefully using ggml_view_4d to get chunks of a tensor; strides need to be recalculated or set properly; still failing at soft_max cuda op

* redo weight calculation and weight*v

* fixed a bug now Photomaker V2 kinds of working

* add python script for face detection (Photomaker V2 needs)

* updated readme for photomaker

* fixed a bug causing PMV1 crashing; both V1 and V2 work

* fixed clean_input_ids for PMV2

* fixed a double counting bug in tokenize_with_trigger_token

* updated photomaker readme

* removed some commented code

* improved reconstructing class word free prompt

* changed reading id_embed to raw binary using existing load tensor function; this is more efficient than using model load and also makes it easier to work with sd server

* minor clean up

---------

Co-authored-by: bssrdf <[email protected]>

Assets 12

23 Nov 06:04

github-actions

master-1c168d9

1c168d9

master-1c168d9

fix: repair flash attention support (#386)

* repair flash attention in _ext
this does not fix the currently broken fa behind the define, which is only used by VAE

Co-authored-by: FSSRepo <[email protected]>

* make flash attention in the diffusion model a runtime flag
no support for sd3 or video

* remove old flash attention option and switch vae over to attn_ext

* update docs

* format code

---------

Co-authored-by: FSSRepo <[email protected]>
Co-authored-by: leejet <[email protected]>

Assets 12

24 Oct 15:21

github-actions

master-ac54e00

ac54e00

master-ac54e00

feat: add sd3.5 support (#445)

Assets 12

02 Sep 15:56

github-actions

master-e410aeb

e410aeb

master-e410aeb

sync: update ggml to fix large image generation with SYCL backend (#380)

* turn off fast-math on host in SYCL backend

Signed-off-by: zhentaoyu <[email protected]>

* update ggml for sync some sycl ops

Signed-off-by: zhentaoyu <[email protected]>

* update sycl readme and ggml

Signed-off-by: zhentaoyu <[email protected]>

---------

Signed-off-by: zhentaoyu <[email protected]>

Assets 12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Releases: leejet/stable-diffusion.cpp

master-b5f4932

master-9b1d90b

master-8f94efa

master-8c7719f

master-6ea8122

master-65fa646

master-2b1bc06

master-1c168d9

master-ac54e00

master-e410aeb