bloat

refactor(core): embed llama.cpp's server binary directly for LLM inference #362

Job	Run time
cargo_bloat	6m 17s
	6m 17s

Provide feedback