ModelPatcher Overhaul and Hook Support #5583

Open · wants to merge 93 commits into master
Conversation

Kosinkadink (Collaborator) commented Nov 11, 2024

This PR merges all changes from the improved_memory branch and expands ModelPatcher + transformer_options so that different weights and properties can be applied to selected conditioning. This is done by introducing a hook design pattern: conditioning and CLIP can have hooks attached that change their behavior at sample time; previously, this was hardcoded for specific things like controlnet and gligen.

I did not find any memory or performance regression in my testing, but more testing would be good; I will try to get some folks to test out this branch alongside the corresponding rework_modelpatcher branches in AnimateDiff-Evolved and Advanced-ControlNet that make use of the new functionality.

Related PRs in those repos that will be merged when this does:
Kosinkadink/ComfyUI-AnimateDiff-Evolved#498
Kosinkadink/ComfyUI-Advanced-ControlNet#198

Remaining TODO:

  • Fix VRAM usage exceeding expectations when applying weight hooks
    • SOLVED: weights needed to be copied in place, and the weight backups should be copied to avoid being overridden
  • Figure out why flux LoRAs do not produce the same results when applied via Hook compared to with Load LoRA node (issue does not exist for other model loras as far as I can tell)
    • SOLVED: I accidentally left an extra .to call on the calculate_weight results from way back before I added the stochastic_rounding call, which screws up results when fp8 weights are used. Removing the .to fixed it (see the toy sketch after this list).
  • Make sure weights loaded in lowvram mode get modified appropriately by WeightHooks
    • I haven't found any glaring issues through testing; any problems here can be resolved as they are reported. Main thing to be revisited here is RAM usage when the weights are backed up for big models (flux) with very little VRAM.
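
For context on the fp8 item above, here is a toy sketch (my own illustration, not the actual ComfyUI code) of why the leftover .to mattered: a plain deterministic cast to a coarse dtype like fp8 discards small LoRA deltas outright, whereas stochastic rounding preserves them on average, so a deterministic cast on the calculate_weight result defeats the later stochastic_rounding call.

```python
import torch

# Round to a coarse grid deterministically (like a plain .to() cast) vs. stochastically.
def round_deterministic(x: torch.Tensor, step: float) -> torch.Tensor:
    return (x / step).round() * step

def round_stochastic(x: torch.Tensor, step: float) -> torch.Tensor:
    scaled = x / step
    return (scaled.floor() + (torch.rand_like(x) < scaled.frac()).float()) * step

delta = torch.full((100_000,), 0.2)  # small LoRA-like delta, below the grid step
step = 1.0                           # coarse quantization step (stand-in for fp8 spacing)

print(round_deterministic(delta, step).mean())  # ~0.0: the delta is lost entirely
print(round_stochastic(delta, step).mean())     # ~0.2: preserved on average
```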

Breaking Changes:

  • ControlNet's get_control function now takes transformer_options as a required parameter; if a custom node overwrites the built-in calc_cond_batch function with its own implementation, it will error at execution time until updated. It will be an easy fix for any affected nodes; the only one I can think of off the top of my head is TiledDiffusion. A hedged sketch of the new call shape is below.
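
The following is a minimal sketch of what an affected custom calc_cond_batch replacement needs to change; only the new required transformer_options argument is the point, and the surrounding variable names are approximations rather than a verbatim copy of the built-in function.

```python
# Sketch of a custom calc_cond_batch override after this PR (names approximate).
def custom_calc_cond_batch(model, conds, x_in, timestep, model_options):
    transformer_options = model_options.get("transformer_options", {})
    outputs = []
    for cond in conds:
        control = cond.get("control") if isinstance(cond, dict) else None
        if control is not None:
            # Before this PR: control.get_control(x_in, timestep, cond, batched_number)
            # After this PR, transformer_options is a required parameter:
            cond["control_out"] = control.get_control(x_in, timestep, cond, 1, transformer_options)
        # ... actual batching and model calls omitted in this sketch ...
        outputs.append(cond)
    return outputs
```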

Features:

  • Hooks
    • In the long term, this will be a way for individual conds to have different weights, wrappers, callbacks, patches, attachments, etc., which custom nodes currently cannot do without excessive hacks that break compatibility between nodes. Two types of hooks are fully implemented so far, and I will be expanding this system in a near-future PR:
      • Weight Hooks: Different lora and model-as-lora weights can be attached and scheduled on CLIP/conditioning.
      • Wrapper Hooks: Different wrapper functions can be applied.
  • ModelPatcher additions
    • Wrappers
      • Instead of requiring nodes to overwrite or hack into important functions to change their behavior, ModelPatcher and model_options support wrapper functions; an executor is automatically passed into each wrapper so that wrapping happens in a predictable manner (see the sketch after this list). Since there is no limitation on wrapper function names, some custom nodes could decide to expose their own functionality for extension by other nodes through the wrapper system. The cost of wrapping is imperceptibly low, so more wrapper support can be added upon need/request.
    • Callbacks
      • Similar to wrappers, but callbacks are used to extend ModelPatcher functions, avoiding the need for hacky ModelPatcher inheritance in cases where wrapping wouldn't make sense. As with wrappers, more callbacks can be added upon need/request.
    • Additional Models
      • A dictionary stores models that should be loaded alongside the main model; they will be cloned when the main ModelPatcher is cloned.
    • Attachments
      • A dictionary that stores objects which, unlike model_options, will not be deep copied. When the ModelPatcher is cloned, only objects that expose an on_model_patcher_clone() callable are cloned.
    • Injections
      • A dictionary storing a list of PatcherInjection objects, which allow for modifying ("injecting") anything on the ModelPatcher/model in a way that doesn't break the patching system. The biggest current user of this feature will be AnimateDiff, as it's implemented by injecting extra blocks into the base SD UNets.
  • transformer_options as an execution context
    • This PR begins to take transformer_options to its natural conclusion: a way to track the context of execution that can be modified by both native components and third-party extensions. At sampling time, it collects all patches, wrappers, callbacks, etc., and all exposed dicts and lists are copied each time it is passed down a layer and modified, to make sure there is no accidental poisoning of the ModelPatcher's model_options. For callbacks and wrappers in particular, helper functions were added in comfy.patcher_extension to allow for easy modification and classification via CallbacksMP and WrappersMP. In a future PR, patches should be exposed in a similar way.
    • ControlNet's get_control functions now take in transformer_options as an input, allowing them to add their own patches, wrappers, etc. as desired.
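
To make the wrapper mechanism above concrete, here is a minimal sketch of the executor pattern described for comfy.patcher_extension. The wrapper body follows the description in this PR (a wrapper receives an executor and calls it to continue the chain); the registration helper and WrappersMP constant in the comment are assumptions, not verified API names.

```python
# Executor-style wrapper: the first argument is an executor that runs the next
# wrapper in the chain (or the original function once the chain is exhausted).
def my_sampling_wrapper(executor, *args, **kwargs):
    # ... inspect or modify args/kwargs before the wrapped call ...
    result = executor(*args, **kwargs)
    # ... post-process the result if needed ...
    return result

# Registration sketch (helper name and wrapper-type constant assumed from this
# PR's description of comfy.patcher_extension / WrappersMP; check the actual code):
# model_patcher.add_wrapper_with_key(WrappersMP.OUTER_SAMPLE, "my_node", my_sampling_wrapper)
```

Because every wrapper only ever talks to the executor it is handed, wrappers registered by unrelated nodes compose without knowing about each other, which is the predictable wrapping behavior described above.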

asagi4 (Contributor) commented Nov 19, 2024

I'm trying to implement a basic version of prompt control on top of this mechanism and it seems encode_from_tokens_scheduled throws an NPE if there are no hooks set. It would be convenient for it to gracefully fall back to regular encoding in that case.

I'm also wondering how this is going to interact with providing different types of weight interpretations (like advanced clip encode), but adding a better mechanism for those might belong in another PR.

Kosinkadink (Collaborator, Author) commented:

@asagi4 Thanks for the feedback; I've made encode_from_tokens_scheduled work even if no hooks are present.

Assuming custom weight interpretations are implemented by modifying the cond_stage_model methods, everything should work. I'm not familiar with the inner workings of any custom nodes that do this, so I can't give any good feedback on that front at this time. Chances are, if it is currently excessively difficult for those custom nodes to do their thing without hacky solutions, there should be a better way to do so via a future PR.
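
For anyone writing a similar node, a defensive pattern like the sketch below keeps working on older builds where scheduled encoding still assumed hooks were present. has_scheduled_hooks() is a hypothetical placeholder (not a real ComfyUI call), and the scheduled return shape is assumed to already be conditioning-compatible.

```python
def encode_prompt(clip, tokens):
    # Hypothetical guard: fall back to plain encoding when no hooks are scheduled.
    # has_scheduled_hooks() is a placeholder for whatever check your node performs.
    if not has_scheduled_hooks(clip):
        cond, pooled = clip.encode_from_tokens(tokens, return_pooled=True)
        return [[cond, {"pooled_output": pooled}]]
    # With hooks attached, the scheduled variant handles per-step hook groups.
    # (Return shape assumed to be a conditioning-compatible list.)
    return clip.encode_from_tokens_scheduled(tokens)
```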

Kosinkadink (Collaborator, Author) commented:

All of my manual testing is complete; the PR can be merged at any time if everything looks fine to @comfyanonymous.

asagi4 (Contributor) commented Nov 26, 2024

@Kosinkadink If I create a workflow that sets a LoRA hook with no scheduling, it seems that it reloads the hook on every sampling run even if just the sampler seed changes. It causes pretty significant latency. I think it's because the patches get unloaded immediately after sampling even though there's no need to do so.

Is there a way to use the hook mechanism that avoids this at the moment? My quick testing shows the run time going from 13.8s to 16.5s when only changing the seed after warmup (sometimes even up to 18s). --highvram reduces the latency quite a bit but still doesn't remove it.

Kosinkadink (Collaborator, Author) commented:

A lot of my current optimization focused on the worst-case scenario of different hooks needing to be applied to different conditioning, so to prevent any memory issues from cached weights not being cleared, I currently have the model always purge newly registered (i.e., added at sample time) hook patches and clear cached hooked weight calculations.

In cases where there is only a single hook group to apply, I could make it not revert the model to its unhooked state at the end of sampling, so that if nothing changes with the hooks/ModelPatcher, it would not need to redo hooked weight application. However, that adds extra complexity that could introduce bugs I don't want to deal with currently; I've been working on this for 3 months, and in its current state it hasn't even been released to be tested by a wide variety of peeps. Once it gets merged and appears to be working fine in general, I'd be down to add an optimization for that edge case.

asagi4 (Contributor) commented Nov 26, 2024

@Kosinkadink Fair. This PR is definitely big enough already.
