Skip to content
View KuntaiDu's full-sized avatar

Block or report KuntaiDu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
KuntaiDu/README.md

Hi, I'm Kuntai Du ๐Ÿ‘‹

I'm a PhD student @ UChicago, graduating, working in Large Language Model Inference. Check my home page for more about me!

๐Ÿ”ง Experiences

  • ๐Ÿš€ Working on vLLM project as vLLM team member. My contributions:
    • Performance dashboard: perf.vllm.ai.
    • Performance comparison with other LLM inference engines: the end of the blog.
    • Features: Disaggregated prefilling and CPU offloading.
  • ๐Ÿ’พ Contributing to the LMCache project, exploring fun ideas in KV caches.

๐ŸŽฎ Hobbies and Interests

  • ๐ŸŽฎ Gaming: League of Legends, Stardew Valley, Go
  • ๐Ÿ’ƒ Street Dance: Locking main, but I also dance waacking.
  • ๐ŸŽค Singing: Loch Lomond and ไผ ๅฅ‡ Legend

๐Ÿ“ง Contact

Popular repositories Loading

  1. dds dds Public

    Server-driven Video Streaming for Deep Learning Inference

    Python 85 32

  2. AccMPEG AccMPEG Public

    Jupyter Notebook 28 6

  3. vllm vllm Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 11 3

  4. Video-Aalytic-Overview Video-Aalytic-Overview Public

    10 3

  5. Awesome-LLM-Inference Awesome-LLM-Inference Public

    Forked from DefTruth/Awesome-LLM-Inference

    ๐Ÿ“–A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

    2

  6. Asteroids-fun-ver Asteroids-fun-ver Public

    Racket 1