Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Model] Whisper model implementation #11280
[Model] Whisper model implementation #11280
Changes from 25 commits
cfbd164
ced0141
248bafb
6c9ee61
7329b2d
77ad7ed
755086b
b38f5b7
ff70bce
3fbd067
9032aa1
ce3a87c
04a0ef4
fd4ed14
26cfede
34c5830
bf111b2
a21470b
b457c01
d81d217
17712a4
b573fa9
6d6cbd9
94a867b
787708a
e943905
606642e
fe8e245
b59fddb
d66cd42
6ba1afc
26fd92a
4566b10
a21334c
1fe41fc
1c16ad2
7282280
3442852
e0cc63e
770534c
d73e004
9672af2
127f46e
edfec27
ba30886
dbd21a4
ab674fa
e920f2d
File filter
Filter by extension
Conversations
Jump to
There are no files selected for viewing
Check failure on line 86 in tests/models/encoder_decoder/audio/test_whisper.py
GitHub Actions / ruff (3.12)
Ruff (B020)
Check failure on line 1385 in vllm/core/scheduler.py
GitHub Actions / mypy (3.9)
Check failure on line 1385 in vllm/core/scheduler.py
GitHub Actions / mypy (3.10)
Check failure on line 1385 in vllm/core/scheduler.py
GitHub Actions / mypy (3.11)
Check failure on line 1385 in vllm/core/scheduler.py
GitHub Actions / mypy (3.12)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is there a way to determine this without model type information?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I am not sure about generalizing this from a single example. In the long term it may be better to allow the model definition to specify exactly the mapping between input fields and where they go (e.g. encoder/decoder)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Agreed with @aurickq , long-term it is probably best to either (1) have the model definition specify whether to map the input text prompt to the encoder, or (2) add a default behavior only for multi-modal models with cross-attention, wherein the text prompt is always routed to the decoder & the non-text modality is always mapped to the encoder.
(I worked on adding encoder/decoder cross-attention support to v0)