[V1][VLM] Add V1-rearch image inference support for Qwen2-VL #10988

ywang96 · 2024-12-08T10:56:59Z

Separated out from #10699

Placeholder ranges
Embedding generation refactoring
M-RoPE implementation
Fix image embedding as input test
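For readers unfamiliar with the first item, the idea behind placeholder ranges can be sketched as follows: each run of image placeholder tokens in the prompt is recorded as an offset and a length, so image embeddings can later be matched to token positions. This is only an illustrative sketch, not the code in this PR; `Span`, `find_placeholder_ranges`, and the placeholder token ID `9` are hypothetical names and values chosen for the example.

```python
from typing import List, TypedDict


class Span(TypedDict):
    """Hypothetical record for one contiguous run of placeholder tokens."""
    offset: int  # index of the first placeholder token in the prompt
    length: int  # number of consecutive placeholder tokens


def find_placeholder_ranges(token_ids: List[int], placeholder_id: int) -> List[Span]:
    """Return one Span per contiguous run of `placeholder_id` in `token_ids`."""
    ranges: List[Span] = []
    start = None  # start index of the run currently being scanned, if any
    for i, tok in enumerate(token_ids):
        if tok == placeholder_id:
            if start is None:
                start = i
        elif start is not None:
            # The run ended at the previous token.
            ranges.append({"offset": start, "length": i - start})
            start = None
    if start is not None:
        # The prompt ended while still inside a run.
        ranges.append({"offset": start, "length": len(token_ids) - start})
    return ranges


# Assumed example: token ID 9 stands in for the image placeholder token.
example = find_placeholder_ranges([1, 9, 9, 9, 2, 9, 9], placeholder_id=9)
```

With two images in the prompt, the example yields two spans, one per image.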

Signed-off-by: Roger Wang <[email protected]>

github-actions · 2024-12-08T10:57:10Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small, essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

Add ready label to the PR
Enable auto-merge.

🚀

ywang96 · 2024-12-10T20:40:07Z

Hello @fyabc! We're currently going through a re-architecture of vLLM, and I'm wondering if it's possible to add M-RoPE into the model file instead of the model runner. Do you have any suggestions?
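As background for the question above, the position computation that M-RoPE needs can be sketched roughly like this. It follows the scheme described for Qwen2-VL as I understand it: text tokens share one index across the temporal, height, and width axes, image tokens keep a fixed temporal index while height and width follow the patch grid, and text after the image resumes from the largest index used so far plus one. This is a simplified sketch for a single image, not the implementation in this PR or in vLLM; `mrope_positions` and its parameters are hypothetical, and the grid is assumed to be the one seen by the language model (after any spatial merging).

```python
from typing import List, Tuple


def mrope_positions(
    num_text_before: int,
    grid_h: int,
    grid_w: int,
    num_text_after: int,
) -> Tuple[List[int], List[int], List[int]]:
    """Return (temporal, height, width) position lists for text + one image + text."""
    t: List[int] = []
    h: List[int] = []
    w: List[int] = []

    # Leading text: all three axes share the ordinary 1D position.
    for i in range(num_text_before):
        t.append(i)
        h.append(i)
        w.append(i)

    # Image tokens: temporal index is fixed, height/width follow the grid.
    start = num_text_before
    for row in range(grid_h):
        for col in range(grid_w):
            t.append(start)
            h.append(start + row)
            w.append(start + col)

    # Trailing text resumes after the largest position used on any axis.
    next_pos = max(t + h + w) + 1 if t else 0
    for i in range(num_text_after):
        t.append(next_pos + i)
        h.append(next_pos + i)
        w.append(next_pos + i)

    return t, h, w


# Assumed example: 2 text tokens, a 2x2 image grid, then 1 text token.
t_pos, h_pos, w_pos = mrope_positions(2, 2, 2, 1)
```

Because these positions depend on the image grid of each request and not only on the token index, they have to be computed per request from multimodal metadata, which is why their placement (model file versus model runner) is a design question.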

fyabc · 2024-12-23T09:53:36Z

Hi @ywang96, sorry for the late response. Is there any update on this PR?

ywang96 · 2024-12-23T09:59:15Z

> Hi @ywang96, sorry for the late response. Is there any update on this PR?

@fyabc Hello! I closed this PR by accident, but I haven't made much progress on Qwen2-VL (mostly because I'm not sure whether we can put M-RoPE inside the model file instead of the model runner). If it makes things easier, feel free to join slack.vllm.ai so we can discuss more offline!

initial

b91c128

Signed-off-by: Roger Wang <[email protected]>

This was referenced Dec 8, 2024

[V1] Initial support of multimodal models for V1 re-arch #10699

Merged

[RFC]: Multi-modality Support on vLLM #4194

Open

ywang96 closed this by deleting the head repository Dec 14, 2024
