I like to fine-tune deep neural networks on small datasets.
Stars
LLM inference
2 repositories
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
A high-throughput and memory-efficient inference and serving engine for LLMs