[Misc] Enhance offline_inference to support user-configurable paramet… #10392

wchen61 · 2024-11-16T15:43:01Z

github-actions · 2024-11-16T15:43:15Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

Add ready label to the PR
Enable auto-merge.

🚀

Isotr0py

Thanks for this enhancement! However, since this is an example script used to introduce the usage of LLM entrypoint for offline inference, we should keep it clean and simple as much as possible.

Therefore, we should expose arguments that are in common use or highlights in vLLM (like tensor parallel, chunked prefill and prefix caching etc).

examples/offline_inference.py

…ers (vllm-project#10391) Signed-off-by: wchen61 <[email protected]>

Signed-off-by: wchen61 <[email protected]>

Isotr0py

Overall LGTM, just leave some nits.

examples/offline_inference.py

Signed-off-by: wchen61 <[email protected]>

Isotr0py

LGTM!

vllm-project#10392) Signed-off-by: wchen61 <[email protected]> Signed-off-by: Linkun Chen <[email protected]>

vllm-project#10392) Signed-off-by: wchen61 <[email protected]>

vllm-project#10392) Signed-off-by: wchen61 <[email protected]> Signed-off-by: Maxime Fournioux <[email protected]>

vllm-project#10392) Signed-off-by: wchen61 <[email protected]> Signed-off-by: rickyx <[email protected]>

vllm-project#10392) Signed-off-by: wchen61 <[email protected]> Signed-off-by: Tyler Michael Smith <[email protected]>

vllm-project#10392) Signed-off-by: wchen61 <[email protected]>

wchen61 force-pushed the enhance_offline_inference branch from 603f180 to 3c27e97 Compare November 16, 2024 16:01

Isotr0py reviewed Nov 16, 2024

View reviewed changes

examples/offline_inference.py Outdated Show resolved Hide resolved

examples/offline_inference.py Outdated Show resolved Hide resolved

examples/offline_inference.py Outdated Show resolved Hide resolved

examples/offline_inference.py Show resolved Hide resolved

Isotr0py reviewed Nov 16, 2024

View reviewed changes

examples/offline_inference.py Outdated Show resolved Hide resolved

wchen61 added 5 commits November 17, 2024 10:36

[Misc] Enhance offline_inference to support user-configurable paramet…

55088c1

…ers (vllm-project#10391) Signed-off-by: wchen61 <[email protected]>

Run yapf and ruff

d3d1875

Signed-off-by: wchen61 <[email protected]>

Fix length typo

ce9a8fc

Signed-off-by: wchen61 <[email protected]>

Fix isort problem

5305af9

Signed-off-by: wchen61 <[email protected]>

Delete input-len to keep example simple

28d6d60

Signed-off-by: wchen61 <[email protected]>

wchen61 force-pushed the enhance_offline_inference branch from eab5c7b to 28d6d60 Compare November 17, 2024 02:36

Isotr0py reviewed Nov 17, 2024

View reviewed changes

examples/offline_inference.py Outdated Show resolved Hide resolved

examples/offline_inference.py Outdated Show resolved Hide resolved

Group sampling parameters

fd2e6fe

Signed-off-by: wchen61 <[email protected]>

Isotr0py approved these changes Nov 17, 2024

View reviewed changes

Isotr0py enabled auto-merge (squash) November 17, 2024 10:45

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Nov 17, 2024

Isotr0py merged commit d1557e6 into vllm-project:main Nov 17, 2024
41 of 43 checks passed

lk-chen pushed a commit to lk-chen/vllm that referenced this pull request Nov 18, 2024

[Misc] Enhance offline_inference to support user-configurable paramet… (

242bb53

vllm-project#10392) Signed-off-by: wchen61 <[email protected]> Signed-off-by: Linkun Chen <[email protected]>

coolkp pushed a commit to coolkp/vllm that referenced this pull request Nov 20, 2024

[Misc] Enhance offline_inference to support user-configurable paramet… (

d4dc0fb

vllm-project#10392) Signed-off-by: wchen61 <[email protected]>

KuntaiDu pushed a commit to KuntaiDu/vllm that referenced this pull request Nov 20, 2024

[Misc] Enhance offline_inference to support user-configurable paramet… (

39f9cf5

vllm-project#10392) Signed-off-by: wchen61 <[email protected]>

mfournioux pushed a commit to mfournioux/vllm that referenced this pull request Nov 20, 2024

[Misc] Enhance offline_inference to support user-configurable paramet… (

dc8d45f

vllm-project#10392) Signed-off-by: wchen61 <[email protected]> Signed-off-by: Maxime Fournioux <[email protected]>

rickyyx pushed a commit to rickyyx/vllm that referenced this pull request Nov 20, 2024

[Misc] Enhance offline_inference to support user-configurable paramet… (

48443fb

vllm-project#10392) Signed-off-by: wchen61 <[email protected]> Signed-off-by: rickyx <[email protected]>

WoosukKwon mentioned this pull request Nov 21, 2024

[Minor] Revert change in offline inference example #10545

Merged

prashantgupta24 pushed a commit to opendatahub-io/vllm that referenced this pull request Dec 3, 2024

[Misc] Enhance offline_inference to support user-configurable paramet… (

d97d269

vllm-project#10392) Signed-off-by: wchen61 <[email protected]>

sleepwalker2017 pushed a commit to sleepwalker2017/vllm that referenced this pull request Dec 13, 2024

[Misc] Enhance offline_inference to support user-configurable paramet… (

fb62568

vllm-project#10392) Signed-off-by: wchen61 <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Misc] Enhance offline_inference to support user-configurable paramet… #10392

[Misc] Enhance offline_inference to support user-configurable paramet… #10392

wchen61 commented Nov 16, 2024 •

edited by github-actions bot

Loading

github-actions bot commented Nov 16, 2024

Isotr0py left a comment •

edited

Loading

Isotr0py left a comment

Isotr0py left a comment

[Misc] Enhance offline_inference to support user-configurable paramet… #10392

[Misc] Enhance offline_inference to support user-configurable paramet… #10392

Conversation

wchen61 commented Nov 16, 2024 • edited by github-actions bot Loading

github-actions bot commented Nov 16, 2024

Isotr0py left a comment • edited Loading

Choose a reason for hiding this comment

Isotr0py left a comment

Choose a reason for hiding this comment

Isotr0py left a comment

Choose a reason for hiding this comment

wchen61 commented Nov 16, 2024 •

edited by github-actions bot

Loading

Isotr0py left a comment •

edited

Loading