-
Would be nice to use https://huggingface.co/Phind/Phind-CodeLlama-34B-v1
Replies: 9 comments
-
+1 I was looking to use something like https://github.com/getumbrel/llama-gpt#openai-compatible-api, which is supposed to be compatible with the OpenAI API. Maybe allowing the OpenAI URL to be configurable would be enough? EDIT: Looks like the URL is already configurable! rtfm
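For anyone poking at this: if the base URL is configurable, pointing the plugin at a local OpenAI-compatible server mostly comes down to swapping that URL, since the request shape stays the same. A minimal sketch (the host and port are placeholders for wherever your local server happens to listen, and `build_request` is just an illustrative helper, not part of the plugin):

```python
import json
import urllib.request

def chat_completions_url(base_url: str) -> str:
    """Build the OpenAI-compatible chat endpoint for a given base URL."""
    return base_url.rstrip("/") + "/v1/chat/completions"

def build_request(base_url: str, payload: dict) -> urllib.request.Request:
    """Prepare a POST request in the same shape the OpenAI API expects."""
    return urllib.request.Request(
        chat_completions_url(base_url),
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

# Placeholder host: wherever the local llama-gpt server is listening.
req = build_request("http://localhost:3001", {
    "model": "llama-2-7b-chat.bin",
    "messages": [{"role": "user", "content": "Hello"}],
})
print(req.full_url)  # http://localhost:3001/v1/chat/completions
```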
-
@shnee did this work out with a local model?
-
I haven't given it a try yet, but I plan to tinker with it soon. I promise to report back.
-
I gave it a try today with limited success. I had to change the protocol in … I was able to get responses from the local model, but they were all strange.

I copied the requests that this plugin is sending:

```sh
curl -X POST --silent --show-error --no-buffer \
  http://192.168.1.204:3001/v1/chat/completions \
  -H 'Content-Type: application/json' \
  -H 'Authorization: Bearer <snip>' \
  -d '{"model":"llama-2-7b-chat.bin",
      "messages":[{"role":"user","content":"What is the capitol of Ohio?"}],
      "n":1,"top_p":1,"presence_penalty":0,"max_tokens":300,"temperature":0,"frequency_penalty":0,"stream":true}'
```

and I was getting the same strange results. However, once I started to send a system "prompt" message:

```json
"messages":[{"role":"system","content":"You are a helpful assistant."},{"role":"user","content":"What is the capitol of France?"}]
```

then I started to get back normal results.
There also appears to be an issue with saving the chat responses: the plugin seems to be saving only the requests. I didn't have time to dig into that yet; if I find some more time I'll dig further.
-
https://github.com/GenAiWizards/ChatGPT.nvim is working so far, so good. The http/https scheme is now part of the OPENAI_API_HOST env var, or it falls back to localhost:5001. Prompts need to be adjusted, but I do that as I go.
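That fallback behavior is simple to picture: read OPENAI_API_HOST from the environment and default to localhost:5001 when it's unset. A rough sketch of the mechanism (the exact lookup in the fork may differ; `resolve_api_host` is an illustrative name):

```python
import os

def resolve_api_host(env=None):
    """Return the configured API host, falling back to a local default."""
    if env is None:
        env = os.environ
    return env.get("OPENAI_API_HOST", "http://localhost:5001")

print(resolve_api_host({"OPENAI_API_HOST": "https://api.example.com"}))
print(resolve_api_host({}))
```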
-
I wanted to try out a local model via https://github.com/getumbrel/llama-gpt#openai-compatible-api, so I set …
-
You might want to take a look at https://github.com/mudler/LocalAI while you're at it. I haven't tried it myself, but it's meant to be a drop-in replacement for OpenAI's API with support for many models, and I believe it will even support Phind-CodeLlama-34B-v2 via TheBloke's GGUF port.
-
I got this working locally by using the config option

```lua
api_host_cmd = 'echo -n http://localhost:5000'
```

while running https://github.com/oobabooga/text-generation-webui, but I'd imagine it working for any server supporting the OpenAI API.
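Presumably `api_host_cmd` just runs a shell command and uses its trimmed stdout as the API host, which is why the `echo -n` trick works. A rough Python equivalent of that mechanism, not the plugin's actual implementation:

```python
import subprocess

def host_from_cmd(cmd: str) -> str:
    """Run a shell command and treat its trimmed stdout as the API host."""
    result = subprocess.run(
        cmd, shell=True, capture_output=True, text=True, check=True
    )
    return result.stdout.strip()

print(host_from_cmd("echo -n http://localhost:5000"))
```

Because the output is stripped, the `-n` (suppress trailing newline) isn't strictly necessary in this sketch, but it mirrors what the plugin's config line does.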