[Model] Automatic conversion of classification and reward models #11469

DarkLight1337 · 2024-12-24T16:03:30Z

This is the last PR to finish off #10674.

Signed-off-by: DarkLight1337 <[email protected]>

github-actions · 2024-12-24T16:03:45Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

Add ready label to the PR
Enable auto-merge.

🚀

DarkLight1337 · 2024-12-24T16:06:24Z

cc @maxdebayser

Signed-off-by: DarkLight1337 <[email protected]>

Isotr0py

LGTM!

umie0128 · 2024-12-25T03:22:45Z

@DarkLight1337 hi 我看到vllm0.6.4/0.6.5 更新说有速度上的提升但是我对比了0.6.2 跑qwen2.5-7B 感觉速度没有变化。请问是为啥呢

DarkLight1337 · 2024-12-25T05:22:19Z

@DarkLight1337 hi 我看到vllm0.6.4/0.6.5 更新说有速度上的提升但是我对比了0.6.2 跑qwen2.5-7B 感觉速度没有变化。请问是为啥呢

Hi, this PR is about pooling models, not text generation models. Can you open a new issue for this?

…m-project#11469) Signed-off-by: DarkLight1337 <[email protected]>

DarkLight1337 added 2 commits December 24, 2024 16:01

Add pooling adapters for classification and reward models

9a67112

Signed-off-by: DarkLight1337 <[email protected]>

Update docs

6083426

Signed-off-by: DarkLight1337 <[email protected]>

DarkLight1337 added the ready ONLY add when PR is ready to merge/full CI is needed label Dec 24, 2024

DarkLight1337 requested a review from Isotr0py December 24, 2024 16:03

DarkLight1337 requested a review from ywang96 as a code owner December 24, 2024 16:03

DarkLight1337 mentioned this pull request Dec 24, 2024

[RFC]: Make any vLLM model a pooling model #10674

Closed

7 tasks

mergify bot added the documentation Improvements or additions to documentation label Dec 24, 2024

DarkLight1337 mentioned this pull request Dec 24, 2024

[Usage]: How to infer the reward model of nvidia/Llama-3.1-Nemotron-70B-Reward-HF? #11459

Closed

1 task

DarkLight1337 added 3 commits December 24, 2024 16:08

Fix

24b465b

Signed-off-by: DarkLight1337 <[email protected]>

Clean up

8bac125

Signed-off-by: DarkLight1337 <[email protected]>

Update tests

d2954a2

Signed-off-by: DarkLight1337 <[email protected]>

DarkLight1337 force-pushed the pooling-adapters branch from 6256310 to d2954a2 Compare December 24, 2024 16:24

DarkLight1337 added 2 commits December 24, 2024 16:25

Remove redundant model

a111d5a

Signed-off-by: DarkLight1337 <[email protected]>

Actually this should be a reward model

6f77e21

Signed-off-by: DarkLight1337 <[email protected]>

Isotr0py approved these changes Dec 24, 2024

View reviewed changes

DarkLight1337 enabled auto-merge (squash) December 24, 2024 17:05

DarkLight1337 merged commit 3f3e92e into vllm-project:main Dec 24, 2024
56 checks passed

DarkLight1337 deleted the pooling-adapters branch December 25, 2024 02:17

BKitor pushed a commit to BKitor/vllm that referenced this pull request Dec 30, 2024

[Model] Automatic conversion of classification and reward models (vll…

57c795b

…m-project#11469) Signed-off-by: DarkLight1337 <[email protected]>

DarkLight1337 mentioned this pull request Jan 9, 2025

Add LlamaForSequenceClassification model #8740

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Model] Automatic conversion of classification and reward models #11469

[Model] Automatic conversion of classification and reward models #11469

DarkLight1337 commented Dec 24, 2024 •

edited by github-actions bot

Loading

github-actions bot commented Dec 24, 2024

DarkLight1337 commented Dec 24, 2024

Isotr0py left a comment

umie0128 commented Dec 25, 2024

DarkLight1337 commented Dec 25, 2024

[Model] Automatic conversion of classification and reward models #11469

[Model] Automatic conversion of classification and reward models #11469

Conversation

DarkLight1337 commented Dec 24, 2024 • edited by github-actions bot Loading

github-actions bot commented Dec 24, 2024

DarkLight1337 commented Dec 24, 2024

Isotr0py left a comment

Choose a reason for hiding this comment

umie0128 commented Dec 25, 2024

DarkLight1337 commented Dec 25, 2024

DarkLight1337 commented Dec 24, 2024 •

edited by github-actions bot

Loading