add GGUF model router support across all model types & refactor the code #223

qmeng222 · 2024-11-07T03:57:49Z

No description provided.

zhiyuan8

Great job! Please fix one minor issue, and then this is ready to merge.

zhiyuan8 · 2024-11-08T08:00:24Z

nexa/gguf/streamlit/streamlit_image_chat.py

+        # get model specific defaults:
+        default_params = get_default_params(model_to_check)
+
+        # adjust step range based on model type:


You should read those parameters from the shared constants instead of redefining them here:
https://github.com/NexaAI/nexa-sdk/blob/main/nexa/constants.py#L311-L353

For example, use
DEFAULT_IMG_GEN_PARAMS_LCM.num_inference_steps

qmeng222 · 2024-11-08

@zhiyuan8 Let me clarify the get_default_params() approach.
Each computer vision model family needs its own default parameters. More specifically:

LCM models → DEFAULT_IMG_GEN_PARAMS_LCM (4 steps, 1.0 guidance)

SDXL-turbo → DEFAULT_IMG_GEN_PARAMS_TURBO (5 steps, 5.0 guidance)

Standard SD models → DEFAULT_IMG_GEN_PARAMS (20 steps, 7.5 guidance)

The get_default_params function at https://github.com/NexaAI/nexa-sdk/blob/qingying-model-router-streamlit/nexa/gguf/streamlit/streamlit_image_chat.py#L23-L30 picks the right parameter set based on the model name (type). Happy to add clearer documentation, or let me know if I misunderstood your point.
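
For readers following the thread, here is a minimal sketch of the selection logic described above. It is not the code from the PR: the step and guidance values are the ones quoted in the comment, but the dict layout, the key names (`num_inference_steps`, `guidance_scale`), and the substring matching rules are assumptions made for illustration. The model names in the usage note are likewise hypothetical.

```python
# Values are the ones quoted in the comment above. The dict layout and key
# names are assumptions for illustration, not the contents of nexa/constants.py.
DEFAULT_IMG_GEN_PARAMS = {"num_inference_steps": 20, "guidance_scale": 7.5}
DEFAULT_IMG_GEN_PARAMS_LCM = {"num_inference_steps": 4, "guidance_scale": 1.0}
DEFAULT_IMG_GEN_PARAMS_TURBO = {"num_inference_steps": 5, "guidance_scale": 5.0}


def get_default_params(model_name: str) -> dict:
    """Pick a default parameter set from the model name.

    The substring checks below are hypothetical matching rules; the real
    function may identify model types differently.
    """
    name = model_name.lower()
    if "lcm" in name:
        # Return a copy so callers cannot mutate the shared defaults.
        return dict(DEFAULT_IMG_GEN_PARAMS_LCM)
    if "turbo" in name:
        return dict(DEFAULT_IMG_GEN_PARAMS_TURBO)
    # Anything else falls back to the standard Stable Diffusion defaults.
    return dict(DEFAULT_IMG_GEN_PARAMS)
```

With this sketch, `get_default_params("sdxl-turbo")` would return the Turbo set, and an unrecognized name would fall back to the standard defaults. Keeping the values in one constants module and only selecting among them here appears to be what the review comment asks for.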

add GGUF model router support across all model types (NLP, Vision, Multimodal, and Audio) in Streamlit & refactor the code

a9f4fab

qmeng222 requested a review from zhiyuan8 November 7, 2024 04:11

zhiyuan8 requested changes Nov 8, 2024

resolve conflict in streamlit_image_chat.py

63cdd75

qmeng222 requested a review from zhiyuan8 November 8, 2024 23:12

zhiyuan8 approved these changes Nov 9, 2024

zhiyuan8 merged commit 568107f into main Nov 9, 2024
2 checks passed
