Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Embedding model is reloaded cyclic when run on CPU #2599

Open
1 of 3 tasks
liunux4odoo opened this issue Nov 28, 2024 · 0 comments
Open
1 of 3 tasks

Embedding model is reloaded cyclic when run on CPU #2599

liunux4odoo opened this issue Nov 28, 2024 · 0 comments
Labels
Milestone

Comments

@liunux4odoo
Copy link
Contributor

System Info / 系統信息

system: win10
python: 3.10
transformers: 4.46.2

Running Xinference with Docker? / 是否使用 Docker 运行 Xinfernece?

  • docker / docker
  • pip install / 通过 pip install 安装
  • installation from source / 从源码安装

Version info / 版本信息

0.16.3 & 1.0.0

The command used to start Xinference / 用以启动 xinference 的命令

xinference-local.exe

Reproduction / 复现过程

  1. start xinference-local.exe
  2. launch bge-large-zh-v1.5 model on CPU in webui
  3. do embedding request

The console logs model reload information repeated.
There is no problem if set device to GPU.

Expected behavior / 期待表现

serve embedding model normally on CPU device

@XprobeBot XprobeBot added the gpu label Nov 28, 2024
@XprobeBot XprobeBot added this to the v1.x milestone Nov 28, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants