Replaced grouped_gemm
with vLLM's Fused MoE Kernel for Inference Optimization
#7
Loading
grouped_gemm
with vLLM's Fused MoE Kernel for Inference Optimization
#7