Quality comparison: SVDQuant vs GGUF Q4 ? #31

Open
marvin-0042 opened this issue Nov 21, 2024 · 0 comments

@marvin-0042

SVDQuant's quality is great! The paper compares its quality against NF4. However, GGUF Q4 (Q4_0, Q4_K_S) is another popular quantization format in the community and is widely used in ComfyUI with image generation models like FLUX.1 and video generation models like Mochi 1. The GGUF Q4 version of FLUX.1-schnell is also ~6.5GB with good quality, and it does not require major changes to the inference engine.

Could we also get a quality comparison between SVDQuant and GGUF Q4 on FLUX.1 (and, in the future, Mochi 1)?

Today, when a new image or video generation model like FLUX.1 or Mochi 1 comes out, the community first tries ComfyUI with a GGUF Q4 DiT to run it on consumer GPUs, especially those with little GPU memory. If SVDQuant can be shown to be better than GGUF Q4 on FLUX.1 and/or Mochi 1, that would give the community much stronger motivation to adopt or prioritize this novel method.
