feat(model): Support siliconflow models #2157

fangyinc · 2024-11-26T16:25:30Z

Description

How Has This Been Tested?

Using siliconflow proxyllm in DB-GPT webserver

# .env
LLM_MODEL=silicon_flow_proxyllm
SILICON_FLOW_MODEL_VERSION=Qwen/Qwen2.5-Coder-32B-Instruct
SILICON_FLOW_API_BASE=https://api.siliconflow.cn/v1
SILICON_FLOW_API_KEY={your-siliconflow-api-key}

Using SiliconFlowLLMClient
Create python file test_proxyllm.py

import asyncio

from dbgpt.core import ModelRequest
from dbgpt.model.proxy import SiliconFlowLLMClient

client = SiliconFlowLLMClient(model_alias="Qwen/Qwen2.5-Coder-32B-Instruct")
print(
    asyncio.run(
        client.generate(
            ModelRequest._build("Qwen/Qwen2.5-Coder-32B-Instruct", "Hi!")
        )
    )
)

Run test_proxyllm.py

SILICON_FLOW_API_KEY={your-siliconflow-api-key} python test_proxyllm.py

The output like this:

ModelOutput(text='Hello! How can I assist you today?', error_code=0, incremental=False, model_context=None, finish_reason=None, usage={'completion_tokens': 9, 'prompt_tokens': 31, 'total_tokens': 40}, metrics=None)

How Has This Been Tested?

Please describe the tests that you ran to verify your changes. Provide instructions so we can reproduce. Please also list any relevant details for your test configuration

Snapshots:

Include snapshots for easier review.

Checklist:

My code follows the style guidelines of this project
I have already rebased the commits and make the commit message conform to the project standard.
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
Any dependent changes have been merged and published in downstream modules

fangyinc · 2024-11-26T23:27:40Z

Use siliconflow embedding model by DB-GPT SDK:

import os
from dbgpt.rag.embedding import OpenAPIEmbeddings

openai_embeddings = OpenAPIEmbeddings(
    api_url="https://api.siliconflow.cn/v1/embeddings",
    api_key=os.getenv("SILICON_FLOW_API_KEY"),
    model_name="BAAI/bge-large-zh-v1.5",
)

texts = ["Hello, world!", "How are you?"]
openai_embeddings.embed_documents(texts)

csunny

It's cool, and using the Jupyter Notebook makes it easy for code testing.

r+

Aries-ckt

r+

feat(model): Support siliconflow models

f49608e

github-actions bot added enhancement New feature or request model Module: model labels Nov 26, 2024

csunny approved these changes Nov 27, 2024

View reviewed changes

Aries-ckt approved these changes Nov 27, 2024

View reviewed changes

Aries-ckt merged commit e5ec471 into eosphoros-ai:main Nov 27, 2024
5 checks passed

fangyinc mentioned this pull request Dec 11, 2024

feat(model): Support siliconflow rerank models #2188

Merged

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(model): Support siliconflow models #2157

feat(model): Support siliconflow models #2157

fangyinc commented Nov 26, 2024

fangyinc commented Nov 26, 2024

csunny left a comment

Aries-ckt left a comment

feat(model): Support siliconflow models #2157

feat(model): Support siliconflow models #2157

Conversation

fangyinc commented Nov 26, 2024

Description

How Has This Been Tested?

How Has This Been Tested?

Snapshots:

Checklist:

fangyinc commented Nov 26, 2024

csunny left a comment

Choose a reason for hiding this comment

Aries-ckt left a comment

Choose a reason for hiding this comment