Ollama, Mistral AI, etc. #7296

Merged
merged 45 commits into master from support-ollama-api on Mar 17, 2024

Conversation

@haraldschilly (Contributor) commented Feb 22, 2024

Description

  • selecting a model in the dialog is now a dropdown, plus some small changes
    • this dropdown is also used in account settings and elsewhere
  • add dynamic frontend configuration for arbitrary ollama models
    • with that, backend logic to deal with arbitrary ollama-[model] models
    • my main test for this ended up being Google's Gemma model
  • many enhancements, refactorings, and cleanups for LLM handling overall. This does not change the actual naming scheme of the models yet; this PR is already too complex.
  • Slight changes to the system prompt:
    • it now depends on the vendor/model (see the prompt-selection sketch after this list)
    • in the future it might even depend on the file type (path)
  • Many changes and fixes for the AI Formula generator
  • Adding GPT-4 Turbo as yet another paid option
    • Since the minimum commitment is >$5, I've also added an "8k" variant of it. That way, the limits are lower and it is less "dangerous" to use.
  • adding Mistral models; the "Large" one is a paid model
    • The big issue with Mistral is their badly coded client library. I had to fork it: https://github.com/sagemathinc/mistralai-client-js
    • and then figure out how to tell LangChain to use it. The package.json config looks odd; I have the feeling there is a bug related to using this in a workspace (see the package.json sketch after this list). In any case, once this is fixed upstream, this band-aid has to go away.
  • fixing #7341 ("disable some AI integration" instructor course setting seems to disable everything now)
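
As referenced above, here is a minimal sketch of what a vendor/model-dependent system prompt can look like. All names in it (`LLMSpec`, `getSystemPrompt`, the vendor strings) are illustrative assumptions for this sketch, not the actual code in this PR:

```typescript
// Purely illustrative: the names below are assumptions, not CoCalc's code.
type Vendor = "openai" | "google" | "mistralai" | "ollama";

interface LLMSpec {
  vendor: Vendor;
  model: string; // e.g. "gpt-4-turbo-8k" or "ollama-gemma"
}

// Shared base instructions, with vendor/model-specific additions.
function getSystemPrompt({ vendor, model }: LLMSpec): string {
  const base =
    "You are a helpful assistant embedded in a collaborative computing environment.";
  switch (vendor) {
    case "ollama":
      // Self-hosted models often need more explicit formatting instructions.
      return `${base} Format every answer in Markdown.`;
    case "mistralai":
      return model.includes("large")
        ? `${base} Give thorough, complete answers.`
        : `${base} Keep answers concise.`;
    default:
      return base;
  }
}
```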
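And regarding the forked Mistral client: a standard way to point a JS workspace at a fork is an `overrides` entry (npm) or `resolutions` (yarn) in the root package.json. This is only a sketch of the general technique, not necessarily what this PR ended up doing:

```json
{
  "overrides": {
    "@mistralai/mistralai": "github:sagemathinc/mistralai-client-js"
  }
}
```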

testing

chatting with GPT-4 Turbo (deliberately bad language on my side!) and testing reply continuation.

Screenshot from 2024-03-15 16-27-26

Screenshot from 2024-03-15 16-27-44

I can also select an LLM in the Slate editor in a chat input

course

  1. partially disabled: explain-cell and fix-error do work, and so do replies
  2. partially disabled: AI formula does say it is disabled
  3. fully disabled: no explain button, no fix-error button
  4. base case: no AI tools disabled: yes, students can do all of this

Ollama

This is the main part, before this PR escalated: there is now a config parameter containing a dict of {[model name]: {config params}, ... }, which is described in the help text of that admin field and is validated by some detailed checks. The main point is that you can chat with any model available via the Ollama API; you can have several Ollama instances, and each of them can serve several models.

configuration example
Screenshot from 2024-03-15 17-19-08
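
In text form, such a configuration might look roughly like the following; the exact field names are my guesses based on the description above, not the literal schema:

```json
{
  "gemma": {
    "baseUrl": "http://localhost:11434",
    "model": "gemma:7b",
    "display": "Gemma",
    "desc": "Google's Gemma, served by a local Ollama instance"
  },
  "mixtral": {
    "baseUrl": "http://ollama2.internal:11434",
    "model": "mixtral",
    "display": "Mixtral",
    "desc": "a second Ollama server with a different model"
  }
}
```

The idea is that each top-level key corresponds to one ollama-[name] model in the frontend, so several Ollama servers, each serving several models, can coexist.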

and with that, there is a @gemma chat with a display name and a description, and of course it streams answers like the others

Screenshot from 2024-03-15 17-20-40
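
For reference, streaming from Ollama boils down to reading newline-delimited JSON from its /api/chat endpoint. A self-contained sketch of that (the helper name and error handling are mine, not this PR's code):

```typescript
// Minimal sketch of streaming a chat answer from an Ollama server.
// Ollama's /api/chat streams one JSON object per line when stream: true.
async function streamOllamaChat(baseUrl: string, model: string, prompt: string) {
  const res = await fetch(`${baseUrl}/api/chat`, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({
      model,
      messages: [{ role: "user", content: prompt }],
      stream: true,
    }),
  });
  if (!res.ok || res.body == null) throw new Error(`HTTP ${res.status}`);
  const reader = res.body.getReader();
  const decoder = new TextDecoder();
  let buffer = "";
  for (;;) {
    const { done, value } = await reader.read();
    if (done) break;
    buffer += decoder.decode(value, { stream: true });
    // Each complete line is one JSON chunk with a partial message.
    let nl;
    while ((nl = buffer.indexOf("\n")) >= 0) {
      const line = buffer.slice(0, nl).trim();
      buffer = buffer.slice(nl + 1);
      if (!line) continue;
      const chunk = JSON.parse(line);
      if (chunk.message?.content) process.stdout.write(chunk.message.content);
      if (chunk.done) return;
    }
  }
}
```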

status

ok, the latest state of this PR: everything is tested and works, but there seems to be a problem with the mistral lib itself modifying what fetch does. This is a bit ugly.

This seems to be the root cause: mistralai/client-js#42
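
For context: the problematic pattern is a library that monkey-patches the global fetch, which leaks into unrelated code in the same process. A safer shape, sketched here purely for illustration (this is not the Mistral client's actual API, and the endpoint is a placeholder), is to accept a fetch implementation as a parameter:

```typescript
// Illustrative only: a client that takes a fetch implementation as an
// argument instead of replacing globalThis.fetch, so the rest of the
// process is unaffected.
type FetchLike = typeof globalThis.fetch;

class ChatClient {
  constructor(
    private apiKey: string,
    // Default to the ambient fetch, but never reassign the global one.
    private fetchImpl: FetchLike = globalThis.fetch.bind(globalThis),
  ) {}

  async chat(model: string, prompt: string): Promise<string> {
    const res = await this.fetchImpl("https://api.example.invalid/v1/chat", {
      method: "POST",
      headers: {
        Authorization: `Bearer ${this.apiKey}`,
        "Content-Type": "application/json",
      },
      body: JSON.stringify({
        model,
        messages: [{ role: "user", content: prompt }],
      }),
    });
    if (!res.ok) throw new Error(`HTTP ${res.status}`);
    const data = await res.json();
    return data?.choices?.[0]?.message?.content ?? "";
  }
}
```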

Checklist:

  • Testing instructions are provided, if not obvious
  • Release instructions are provided, if not obvious

@haraldschilly force-pushed the support-ollama-api branch 2 times, most recently from 932140d to fda426d (February 22, 2024 15:07)
@haraldschilly force-pushed the support-ollama-api branch 2 times, most recently from e06b06e to 5b78197 (February 23, 2024 14:11)
@haraldschilly force-pushed the support-ollama-api branch 2 times, most recently from 34824c9 to fbd281f (February 27, 2024 18:38)
…add ollama description, and fix bug with ai-formula (no project context)
@haraldschilly force-pushed the support-ollama-api branch 6 times, most recently from 37b0791 to bd8e6e1 (March 8, 2024 17:06)
@haraldschilly force-pushed the support-ollama-api branch 2 times, most recently from 69ca9a2 to 783ce97 (March 15, 2024 15:43)
@haraldschilly force-pushed the support-ollama-api branch 2 times, most recently from 45d995a to 8e45e47 (March 17, 2024 13:05)
@haraldschilly force-pushed the support-ollama-api branch 7 times, most recently from dab80a5 to 055cbb9 (March 17, 2024 19:00)
@haraldschilly marked this pull request as ready for review (March 17, 2024 19:21)
@haraldschilly (Contributor, Author) commented:

this got super long and I stumbled over many obstacles. However, I'm now at a point where I think it's fine to merge. I didn't do all the refactoring I wanted to, but at least a lot of things are now collected in one place. Further details are in the first message.

@williamstein williamstein merged commit 0b15762 into master Mar 17, 2024
2 checks passed