Replies: 1 comment
-
I'm also looking for this kind of profiler. Is there any update on this?
-
vLLM currently has three schemes for system monitoring and performance analysis: OTel, Prometheus, and the torch profiler.
How about providing a service profiler that records only the detailed events inside the vLLM framework (excluding torch)?
For example, the service profiler could record each request's arrival and finish time (not the life cycle of the vLLM service itself; the user would control when the service profiler starts and stops), along with the important events during the request's life cycle, such as queue switching, prefill forward, decode forward, token sampling, and block swap in/out, together with the relevant request id.
Just like the torch profiler, the service profiler would export performance data in trace event format; viewing it in perfetto/chrome trace would make the inference system no longer a black box to users and make it easier to analyze performance issues in the vLLM framework.
The service profiler should be a lightweight offline tool for performance analysis. An example:
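To make the idea concrete, here is a minimal sketch of what such a service profiler could look like. All names (`ServiceProfiler`, `record`, `dump`) are hypothetical, not existing vLLM APIs; the only fixed part is the Chrome trace event format ("X" complete events with microsecond `ts`/`dur`), which Perfetto and chrome://tracing load directly.

```python
import json
import time


class ServiceProfiler:
    """Hypothetical offline service profiler: collects request-level
    events and dumps them in Chrome trace event format."""

    def __init__(self):
        self._events = []
        self._t0 = time.perf_counter()

    def now_us(self):
        # Trace event timestamps are expressed in microseconds.
        return int((time.perf_counter() - self._t0) * 1e6)

    def record(self, name, request_id, start_us, end_us):
        # "ph": "X" is a complete event: a start timestamp plus a duration.
        self._events.append({
            "name": name,            # e.g. prefill_forward, decode_forward
            "ph": "X",
            "ts": start_us,
            "dur": end_us - start_us,
            "pid": 0,                # one "process" for the whole service
            "tid": request_id,       # one row per request in the viewer
            "args": {"request_id": request_id},
        })

    def dump(self, path):
        # The resulting JSON opens directly in Perfetto / chrome://tracing.
        with open(path, "w") as f:
            json.dump({"traceEvents": self._events}, f)


# Illustrative usage with fabricated timings for one request.
profiler = ServiceProfiler()
t = profiler.now_us()
profiler.record("prefill_forward", request_id=1, start_us=t, end_us=t + 1200)
profiler.record("decode_forward", request_id=1, start_us=t + 1200, end_us=t + 1500)
profiler.dump("service_trace.json")
```

Because the output is plain trace event JSON, no custom viewer is needed; the same tooling users already apply to torch profiler traces works here.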