fix: [AG-178] AI Gateway bugs, 3.9.0 rollup #13932
base: master
Conversation
      ai_plugin_o11y.metrics_set("llm_completion_tokens_count", t.usage.completion_tokens)
    end
end
This is still needed when normalize-json-response is not enabled on a request (for example, when using ai-semantic-cache without ai-proxy).
@oowl What do you recommend?
Just duplicate the code for now ("QAD"), guarded with an `if not (namespace-ai-proxy) then run_the_code end`?
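A minimal sketch of that quick-and-dirty guard, assuming a plugin-context table and an injected `metrics_set` function; the names `ctx`, `namespace-ai-proxy`, and `maybe_set_metrics` are illustrative stand-ins, not Kong's actual API:

```lua
-- Hypothetical guard: only run the token-count extraction here when the
-- ai-proxy namespace has not already handled it via normalize-json-response.
local function maybe_set_metrics(ctx, usage, metrics_set)
  if not ctx["namespace-ai-proxy"] then
    metrics_set("llm_completion_tokens_count", usage.completion_tokens)
    return true   -- extraction ran in this filter
  end
  return false    -- ai-proxy's own filter is expected to handle it
end
```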
Because actually, metadata extraction itself should be moved to its own filter too...
It's fine to run this code twice; the later run will just update/overwrite the previous one, the same as we do for headers and body.
> metadata extraction itself should be moved to its own filter too...
Ideally yes, but we are not in the real filter pipeline yet; we are still in Kong's plugin iterator.
For now, moving this into a utility function in llm/shared that is called by both filters will be fine. Let's do that after 3.9.
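The suggested refactor, sketched as a shared helper that both filters could call; the module path (llm/shared), function name, and metric names other than the one quoted above are assumptions for illustration. A repeated call simply overwrites the metrics with the same values, matching the "run twice is fine" observation:

```lua
-- Hypothetical shared helper (e.g. somewhere under kong/llm/shared):
-- both parse-json-response and normalize-json-response could call this,
-- and a second invocation just overwrites the same metric values.
local shared = {}

function shared.extract_usage_metrics(response, metrics_set)
  local usage = response and response.usage
  if not usage then
    return false  -- nothing to extract (no usage block in the response)
  end
  if usage.prompt_tokens then
    metrics_set("llm_prompt_tokens_count", usage.prompt_tokens)
  end
  if usage.completion_tokens then
    metrics_set("llm_completion_tokens_count", usage.completion_tokens)
  end
  return true
end
```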
So ideally:
For ai-proxy:
parse-json-response | get metadata | normalize-json-response | get metadata
For ai-semantic-cache etc., without ai-proxy:
parse-json-response | get metadata
Even if we do this, in the real world the plugin iterator still executes all "filters" of one plugin before moving on to the next, so with ai-proxy + ai-semantic-cache it would actually be:
parse-json-response | get metadata | parse-json-response (skipped) | get metadata | normalize-json-response | get metadata
So I think it's fine to just keep the "get metadata" step as part of parse and normalize, so we would have:
parse-json-response | parse-json-response (skipped) | normalize-json-response
I'll add it back and just double check the tests!
Oh, you already did it! Okay, I am very confused now.
I've properly fixed the flaky tests.
Summary
AG-178
Bug fix rollup from 3.9.0.RC-1
Checklist
Issue reference
Fixes everything in AG-178.