Support vLLM as backend #75

Closed
oplatek opened this issue Sep 10, 2024 · 1 comment · Fixed by #133
Labels: enhancement (New feature or request), low priority (Tasks which can be postponed)
Milestone: Release-1.0.0

Comments

oplatek (Member) commented Sep 10, 2024

vLLM is fast and popular.

It exposes the same OpenAI-compatible API that we already support.
Maybe we can simply test the OpenAI client with a different connection URL pointing to the vLLM server?

However, for greater flexibility it may be ideal to support the vLLM server directly.
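A rough sketch of the "different connection URL" idea: vLLM's OpenAI-compatible server should work with the stock `openai` client just by changing the base URL. The model name, host, and port below are placeholders, assuming a locally started vLLM server.

```python
from openai import OpenAI

# Assumes a local vLLM server was started with something like:
#   vllm serve meta-llama/Llama-3.1-8B-Instruct --port 8000
# (model name and port are illustrative, not part of this issue)
client = OpenAI(
    base_url="http://localhost:8000/v1",  # vLLM's OpenAI-compatible endpoint
    api_key="EMPTY",  # vLLM does not require a real API key by default
)

response = client.chat.completions.create(
    model="meta-llama/Llama-3.1-8B-Instruct",
    messages=[{"role": "user", "content": "Hello from factgenie!"}],
)
print(response.choices[0].message.content)
```

If this works as expected, supporting vLLM might only require making the base URL (and API key) configurable for the existing OpenAI backend.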

oplatek added the enhancement (New feature or request) label on Sep 10, 2024
kasnerz added the low priority (Tasks which can be postponed) label on Oct 14, 2024
oplatek (Member, Author) commented Nov 13, 2024

Solved in #152

oplatek added this to the Release-1.0.0 milestone on Nov 13, 2024
kasnerz mentioned this issue on Nov 13, 2024