[torch.compile] remove compilation_context and simplify code #10838

youkaichao · 2024-12-02T21:06:08Z

Remove the confusing compilation context, and determine the cudagraph batch sizes during initialization of the config.

Signed-off-by: youkaichao <[email protected]>
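For readers unfamiliar with the idea, here is a minimal sketch of what "figure out the batch sizes during config initialization" could look like. It is not the code in this PR: the class name `SketchConfig`, the field names, the candidate size list, and the helper `pad_to_captured_size` are all hypothetical, chosen only to illustrate computing the sizes once in the config instead of passing them through a separate context object.

```python
from bisect import bisect_left
from dataclasses import dataclass, field
from typing import List


@dataclass
class SketchConfig:
    """Hypothetical stand-in for a config object (not vLLM's actual class)."""

    max_num_seqs: int = 256
    # Derived in __post_init__, so no separate global context is needed.
    cudagraph_batch_sizes: List[int] = field(default_factory=list, init=False)

    def __post_init__(self) -> None:
        # Assumed candidate list: a few small sizes, then multiples of 8.
        candidates = [1, 2, 4] + list(range(8, 513, 8))
        # Keep only the sizes that do not exceed the configured maximum.
        self.cudagraph_batch_sizes = [
            size for size in candidates if size <= self.max_num_seqs
        ]

    def pad_to_captured_size(self, batch_size: int) -> int:
        """Return the smallest listed size >= batch_size, or -1 if none."""
        sizes = self.cudagraph_batch_sizes  # already sorted ascending
        index = bisect_left(sizes, batch_size)
        return sizes[index] if index < len(sizes) else -1


config = SketchConfig(max_num_seqs=16)
print(config.cudagraph_batch_sizes)
print(config.pad_to_captured_size(3))
```

Because the list is computed once when the config is built, any consumer that holds the config can read it directly, which is the simplification the description is aiming at.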

github-actions · 2024-12-02T21:06:19Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

Add the ready label to the PR
Enable auto-merge

🚀

WoosukKwon

LGTM.

commit

b67c1f6

Signed-off-by: youkaichao <[email protected]>

youkaichao requested review from WoosukKwon, robertgshaw2-neuralmagic, njhill, ywang96, comaniac and alexm-neuralmagic as code owners December 2, 2024 21:06

youkaichao added 3 commits December 2, 2024 13:07

commit

08e5e51

Signed-off-by: youkaichao <[email protected]>

move func to vllm config

3ff3eb4

Signed-off-by: youkaichao <[email protected]>

enc dec

4dc2e7a

Signed-off-by: youkaichao <[email protected]>

youkaichao commented Dec 2, 2024

vllm/worker/model_runner.py (resolved)

youkaichao added 2 commits December 2, 2024 13:16

draft

a5b0296

Signed-off-by: youkaichao <[email protected]>

fix tests

8b145d3

Signed-off-by: youkaichao <[email protected]>

youkaichao requested a review from DarkLight1337 as a code owner December 2, 2024 22:35

WoosukKwon reviewed Dec 3, 2024

vllm/config.py (resolved)

youkaichao commented Dec 3, 2024

vllm/config.py (resolved)

update

af2e5f0

Signed-off-by: youkaichao <[email protected]>

WoosukKwon approved these changes Dec 3, 2024

youkaichao enabled auto-merge (squash) December 3, 2024 03:37

github-actions bot added the ready label (ONLY add when PR is ready to merge/full CI is needed) Dec 3, 2024

youkaichao merged commit dc5ce86 into vllm-project:main Dec 3, 2024
64 of 66 checks passed

youkaichao deleted the cudagraph_size_during_init branch December 3, 2024 07:46

sleepwalker2017 pushed a commit to sleepwalker2017/vllm that referenced this pull request Dec 13, 2024

[torch.compile] remove compilation_context and simplify code (vllm-pr…

3257258

…oject#10838) Signed-off-by: youkaichao <[email protected]>
