haichuan1221

Follow

Haichuan haichuan1221

Follow

2 followers · 3 following

Beijing

Achievements

Achievements

Popular repositories Loading

vllm vllm Public

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python
llama.cpp llama.cpp Public

Forked from ggerganov/llama.cpp

LLM inference in C/C++

C++
TensorRT-LLM TensorRT-LLM Public

Forked from NVIDIA/TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++
mem0 mem0 Public

Forked from mem0ai/mem0

The memory layer for Personalized AI

Python
sglang sglang Public

Forked from sgl-project/sglang

SGLang is yet another fast serving framework for large language models and vision language models.

Python
flashinfer flashinfer Public

Forked from flashinfer-ai/flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda