Is the issue I mentioned a bug? How can it be resolved? #3656
Replies: 3 comments 1 reply
-
In the conversation orchestration page, you need to adjust the max token of the chosen Ollama model. This value is meant to be the length of the output tokens only; it is different from the total input tokens. Regarding the first question, please read this thread: ollama/ollama#2851
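For anyone unsure which knob does what, here is a minimal sketch that calls Ollama directly and shows the two separate limits. It assumes a local Ollama server on the default port 11434 and a `llama2` model tag (both placeholders; use whatever you have pulled). `num_predict` caps only the output tokens, which is roughly what a "max tokens" setting maps onto, while `num_ctx` is the whole context window shared by input and output.

```python
import requests

# Direct call to a local Ollama server (default port 11434; adjust if yours differs).
resp = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "llama2",           # placeholder model tag
        "prompt": "Why is the sky blue?",
        "stream": False,
        "options": {
            "num_ctx": 4096,         # whole context window (input + output tokens)
            "num_predict": 512,      # upper bound on OUTPUT tokens only
        },
    },
    timeout=300,
)
data = resp.json()
print(data["response"])
print("prompt tokens:", data.get("prompt_eval_count"))
print("output tokens:", data.get("eval_count"))
```

Comparing `prompt_eval_count` and `eval_count` in the response makes it easier to see which of the two limits you are actually hitting.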
-
Have you solved this problem? I'm running into the same situation, and when I adjust the model's max token as described in the documentation, it doesn't work. Please help!
-
1. Why is the response from calling large models in Ollama very quick through the command line, but ten to twenty times slower through Dify? Calling Ollama through open-webui is also more than ten times faster than through Dify. What is the reason for this discrepancy?
2. When uploading an image and calling Ollama's LLaVA-13B model via Dify, the following message appears: "Query or prefix prompt is too long, you can reduce the prefix prompt, or shrink the max token, or switch to an LLM with a larger token limit size." However, when calling Ollama through the local command line or open-webui, this issue does not occur. Do any parameters need to be adjusted?
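Not an answer to the speed question, but for point 2 one way to narrow things down is to reproduce the call directly against Ollama. Below is a minimal sketch, assuming a local Ollama server on the default port and the `llava:13b` tag; `photo.jpg` is a placeholder path. It sends the image with an enlarged `num_ctx`, since image tokens count toward the context window.

```python
import base64
import requests

# Reproduce the Dify request directly against Ollama to see whether the length
# limit comes from the model/server or from Dify's own token budgeting.
with open("photo.jpg", "rb") as f:           # placeholder image path
    image_b64 = base64.b64encode(f.read()).decode()

resp = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "llava:13b",
        "prompt": "Describe this image.",
        "images": [image_b64],               # base64-encoded images for multimodal models
        "stream": False,
        "options": {
            "num_ctx": 8192,                 # enlarged context window
            "num_predict": 512,              # modest output budget
        },
    },
    timeout=600,
)
print(resp.json().get("response"))
```

If this direct call succeeds while Dify still reports "Query or prefix prompt is too long", the limit is most likely the max token / context size values configured for the model inside Dify rather than Ollama itself, so revisiting those settings as suggested above would be the next step.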