Plan to support latest vLLM? #76

Closed
thanhnguyentung95 opened this issue Nov 18, 2024 · 3 comments · Fixed by #77

Comments

@thanhnguyentung95 (Contributor)

Due to the following check in vLLM 0.6.2, we cannot serve LoRA adapters on a per-request basis, because vLLM does not support LoRA and multimodal models simultaneously:

        if self.lora_config:
            assert supports_lora(self.model), "Model does not support LoRA"
            assert not supports_multimodal(
                self.model
            ), "To be tested: Multi-modal model with LoRA settings."

Do you have a plan to upgrade the vLLM version for Aria?
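For reference, per-request LoRA serving with vLLM's offline API would look roughly like the sketch below. The adapter name, ID, and path are placeholders, and the extra Aria-specific arguments may differ; loading a multimodal model this way with enable_lora=True is exactly what the assertion above rejects in 0.6.2.

    from vllm import LLM, SamplingParams
    from vllm.lora.request import LoRARequest

    # Base model with LoRA enabled; on vLLM 0.6.2 a multimodal model such as Aria
    # trips the "To be tested: Multi-modal model with LoRA settings." assertion
    # during model loading.
    llm = LLM(model="rhymes-ai/Aria", enable_lora=True, trust_remote_code=True)

    sampling_params = SamplingParams(temperature=0.0, max_tokens=128)

    # Each request can reference a different adapter; real Aria requests would
    # also pass multimodal inputs (images), omitted here for brevity.
    outputs = llm.generate(
        ["Describe the attached image."],
        sampling_params,
        lora_request=LoRARequest("my_adapter", 1, "/path/to/lora_adapter"),
    )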

@xffxff (Collaborator) commented Nov 18, 2024

@thanhnguyentung95 Yes, we can upgrade vLLM. I'll take a look at this.

@xffxff (Collaborator) commented Nov 18, 2024

Hi @thanhnguyentung95
You can try upgrading vLLM in your local environment and continue development first. There are several indirect dependencies (like PyTorch, transformers), so I'll need some extra time to test the upgrade thoroughly to ensure it doesn't break any existing Aria functionality.
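As a quick local sanity check after upgrading, something like the following can show which versions of the indirect dependencies were actually resolved (a minimal sketch; the package names are the usual PyPI ones):

    from importlib.metadata import version, PackageNotFoundError

    # Report what a local upgrade of vLLM pulled in, so any torch / transformers
    # bumps are visible before re-running the existing Aria functionality tests.
    for pkg in ("vllm", "torch", "transformers"):
        try:
            print(f"{pkg}: {version(pkg)}")
        except PackageNotFoundError:
            print(f"{pkg}: not installed")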

@xffxff (Collaborator) commented Nov 20, 2024

@thanhnguyentung95 I've upgraded vLLM to the latest version in #77.
