ENH: bypass the sampling parameter skip_special_tokens to vLLM backend #2655

zjuyzj · 2024-12-11T09:59:59Z

Update api/restful_api.py and model/llm/vllm/core.py to pass the sampling parameter skip_special_tokens through to the vLLM backend.

Fixes #2656.

…I to vLLM backend, in order to support visual grounding task for MLLM like Qwen2-VL.
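For readers skimming the change, here is a minimal sketch of the pass-through idea, not the actual diff. The helper names (`build_generate_config`, `sanitize_for_backend`) and the allow-list are hypothetical and only illustrate the flow: the REST layer copies `skip_special_tokens` from the request body when the client set it, and the backend layer falls back to `True`, which matches the default in vLLM's `SamplingParams`.

```python
from typing import Any, Dict

# Hypothetical allow-list of sampling keys forwarded from the request body.
# The real list in xinference may differ; this is only an illustration.
PASS_THROUGH_KEYS = ("temperature", "top_p", "max_tokens", "skip_special_tokens")


def build_generate_config(request_body: Dict[str, Any]) -> Dict[str, Any]:
    """REST layer: copy allow-listed sampling keys, skipping unset ones."""
    return {
        key: request_body[key]
        for key in PASS_THROUGH_KEYS
        if request_body.get(key) is not None
    }


def sanitize_for_backend(generate_config: Dict[str, Any]) -> Dict[str, Any]:
    """Backend layer: fill defaults without overriding client choices."""
    sanitized = dict(generate_config)
    # Keep special tokens stripped unless the client explicitly opted out.
    sanitized.setdefault("skip_special_tokens", True)
    return sanitized


if __name__ == "__main__":
    body = {"messages": [], "skip_special_tokens": False}
    print(sanitize_for_backend(build_generate_config(body)))
```

With this shape, a client that needs the special tokens kept in the output (for example for visual grounding) sends `"skip_special_tokens": false` in the request body, and every other client keeps the previous behavior.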

xinference/api/restful_api.py

qinxuye

LGTM

ENH: bypass the sampling parameter skip_special_tokens via RESTful AP…

aef1612

…I to vLLM backend, in order to support visual grounding task for MLLM like Qwen2-VL.

XprobeBot added the enhancement New feature or request label Dec 11, 2024

XprobeBot added this to the v1.x milestone Dec 11, 2024

zjuyzj mentioned this pull request Dec 11, 2024

Support controlling whether special tokens are returned in chat completion requests #2656

Closed

qinxuye reviewed Dec 11, 2024


xinference/api/restful_api.py (outdated, resolved)

Update restful_api.py

f0f645f

qinxuye approved these changes Dec 11, 2024


qinxuye merged commit 0fd2001 into xorbitsai:main Dec 11, 2024
10 of 13 checks passed
