[Bug]: Prefix caching doesn't work for LlavaOneVision #11371
Comments
Most multi-modal models don't support V1 yet. You can check the Supported Models page for more details (there is a V1 column for multi-modal models).
Thank you! Is there a link for that? I didn't find documents about the V1 Engine.
It is pinned in the list of issues.
cc @ywang96 perhaps we should add a link to the V1 column header?
I see #8779, but I didn't find any examples of its usage. It seems the V1 engine is not used in quite the same way as the old one.
It is still in development, which is why we don't have user-facing docs about it yet. For now, you can enable it by setting environment variable
@sleepwalker2017 V1 is only available for experimental use, and not all multimodal models are supported on V1 yet. You can check our latest documentation here https://docs.vllm.ai/en/latest/models/supported_models.html#id3 (the V1 column) to see which models are supported.
Thank you for the clear explanation!
Hi, I have another question: what is needed to add V1 engine support for a multi-modal model that is already supported by the old engine?
@sleepwalker2017 There are a few key changes you'll need to make:
Feel free to take a look at #10699 to see the changes needed. Also, for now we only support the image modality on V1.
Thank you! I'll check that!
Your current environment
The generated dummy input is a video, but the preprocessor tries to get an image from the dict, and then it crashes.
After I work around this, the code still fails to run.
It reports this error:
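To make the mismatch concrete, here is a minimal sketch of the failure pattern. It is not vLLM's actual code: the names `dummy_mm_data`, `get_image_strict`, and `get_image_tolerant` are hypothetical, and the `KeyError` is an assumption about how a missing `"image"` entry would surface.

```python
from typing import Any, Optional

# Hypothetical stand-in for the generated dummy multi-modal input:
# it carries only a "video" entry and no "image" entry.
dummy_mm_data: dict[str, Any] = {"video": ["frame0", "frame1"]}


def get_image_strict(mm_data: dict[str, Any]) -> Any:
    # Assumed failing pattern: expects an "image" entry to always be present.
    return mm_data["image"]


def get_image_tolerant(mm_data: dict[str, Any]) -> Optional[Any]:
    # Workaround pattern: return None when no image was supplied.
    return mm_data.get("image")


try:
    get_image_strict(dummy_mm_data)
    crashed = False
except KeyError:
    # The video-only dict has no "image" key, so the strict lookup fails.
    crashed = True

image = get_image_tolerant(dummy_mm_data)
```

A tolerant lookup like this only gets past the first crash; it does not make the rest of the pipeline handle video input.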
Model Input Dumps
The mmmu_bench.py comes from here:
#11187
🐛 Describe the bug
Before submitting a new issue...