
🎨Unlossy AI translation #479

Open · wants to merge 3 commits into main
Conversation

@AntoLC AntoLC commented Dec 5, 2024

Purpose

To translate with the AI we were using lossy functions (`blocksToMarkdownLossy` / `tryParseMarkdownToBlocks`), so we were losing indentation, colors, backgrounds, and a lot of other formatting. Errors were also popping up about tables losing their ids after the translation.

Proposal

  • We now keep the BlockNote JSON structure.
  • We associate an id with every node of type `text`.
  • We extract the texts, keyed by id, into a JSON object.
  • We send only this JSON to the AI:
{
    "tid-102": "Hello",
    "tid-103": "world",
    "tid-104": "Nice"
}
  • The AI sends back the same JSON with the translations:
{
    "tid-102": "Bonjour",
    "tid-103": "le monde",
    "tid-104": "Super",
}
  • We put each translated text back into the BlockNote JSON via its associated id.
  • We replace the previous blocks in the editor with the new ones, block by block, checking that each block still exists in the editor.
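The extract-and-reinsert steps above can be sketched as follows. This is an illustrative sketch only (the function names and the exact BlockNote JSON shape are assumptions; the real implementation is in the frontend TypeScript), but it shows why formatting survives: only the `text` values are rewritten, everything else in the tree is untouched.

```python
# Hypothetical sketch of the id-based translation flow described in this PR.
# Assumed shape: blocks have "content" (inline nodes) and "children" (nested blocks);
# text nodes look like {"type": "text", "id": "tid-...", "text": "...", "styles": {...}}.

def extract_texts(blocks, out=None):
    """Walk a BlockNote-like JSON tree and collect every text node's text by its id."""
    if out is None:
        out = {}
    for block in blocks:
        for node in block.get("content", []):
            if node.get("type") == "text" and "id" in node:
                out[node["id"]] = node["text"]
        extract_texts(block.get("children", []), out)
    return out

def apply_translations(blocks, translated):
    """Write the AI's id -> text mapping back into the same tree, keeping all formatting."""
    for block in blocks:
        for node in block.get("content", []):
            if node.get("type") == "text" and node.get("id") in translated:
                node["text"] = translated[node["id"]]
        apply_translations(block.get("children", []), translated)
    return blocks
```

Because styles, colors, and nesting live outside the `text` fields, replacing only those fields leaves the rest of the document intact.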

Demo

scrnli_ulUKPeGj2U7Hbw.webm

We change the text type of the ai-translate
endpoint from `text` to `json` to be able to send
a JSON object with each text associated with an
id. The AI will return the translated texts with
their ids. Thanks to that we can fully preserve
the BlockNote JSON and replace only the text.

We improved the block translations.
For translations only: we now fully preserve the
BlockNote JSON structure to avoid losing any
information. We no longer transform the blocks
into markdown; instead we add an "id" to the
text-type nodes to identify them, extract the
texts, and send them to the AI service for
translation. The AI preserves the ids, then we
replace the text blocks with the translated ones.
@AntoLC AntoLC added enhancement New feature or request frontend backend labels Dec 5, 2024
@AntoLC AntoLC requested a review from YousefED December 5, 2024 22:47
@AntoLC AntoLC self-assigned this Dec 5, 2024
@AntoLC AntoLC linked an issue Dec 5, 2024 that may be closed by this pull request
@YousefED YousefED left a comment


Nice solution! When we discussed on Matrix I missed that we were purely talking about translations (not other AI operations). So, basically you extract all text strings from the document and translate those via an LLM. That's quite nice and I think your approach should work!

@@ -280,31 +310,17 @@ const AIMenuItem = ({
const editor = useBlockNoteEditor();
const handleAIError = useHandleAIError();

const handleAIAction = async () => {
const handleAIAction = () => {
Collaborator:

Just curious; was the `async` removed intentionally?

@AntoLC (Collaborator Author), Dec 6, 2024:

I removed all the `await`s, so `async` was no longer needed; I have linters that warn me about these things.

Collaborator:

Got it, I was wondering because with the `then` below there's not much of a change.

@@ -0,0 +1,145 @@
import { Block as CoreBlock } from '@blocknote/core';

type Block = Omit<CoreBlock, 'type'> & {
Collaborator:

Why do we need this? I think you could use `Block` from BlockNote directly?

Collaborator Author:

I get some overlapping warnings because of the unions; changing the type to `string` helps to get past them.
Not sure it is the best approach, though.

image

Collaborator:

Got it. It's possible for sure, but it would require a bit more work on the typings. If it's a priority we can pair on this.

throw new Error('No response from AI');
}

const markdown = await editor.tryParseMarkdownToBlocks(responseAI.answer);
Collaborator:

Of course, non-translate AI actions like these will still be "lossy". I'll think of a blocknote-based solution for this (as part of BlockNote AI features)

@AntoLC (Collaborator Author), Dec 6, 2024:

Yes, this part is still lossy. I tried, but I didn't get good AI results for the other actions; the AI has difficulty handling the request when the text is split too finely:

export type AITransformActions =
| 'correct'
| 'prompt'
| 'rephrase'
| 'summarize';

Rephrase and summarize cannot really work with this system, I think; the AI would have to remove some ids.

I think it is less likely that users will select tables or complicated structures for these actions (something to monitor).

@@ -62,7 +66,7 @@ def call_ai_api(self, system_content, text):
response_format={"type": "json_object"},
messages=[
{"role": "system", "content": system_content},
{"role": "user", "content": json.dumps({"markdown_input": text})},
{"role": "user", "content": json.dumps({"answer": text})},
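For context, the message shape this hunk produces can be sketched in isolation. Only the system/user roles, the `json.dumps({"answer": text})` wrapping, and the `json_object` response format come from the diff; the helper name and surrounding structure are assumptions for illustration.

```python
import json

def build_messages(system_content, text):
    # Wrap the user payload exactly as in the diff: {"answer": <id -> text mapping>}.
    # The real code then sends these messages to the AI client with
    # response_format={"type": "json_object"}, forcing the model to reply in JSON.
    return [
        {"role": "system", "content": system_content},
        {"role": "user", "content": json.dumps({"answer": text})},
    ]
```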
Collaborator:

Why do you keep using the `answer` keyword, although in this context it is not the answer but the input?

Collaborator:

Also, it feels weird that when we already have structured JSON output in the translate case, we still pass it to the model inside the `answer` JSON.
We could get rid of `answer` altogether so that we don't have another indentation layer in the JSON.

@@ -426,11 +426,12 @@ class AITranslateSerializer(serializers.Serializer):
language = serializers.ChoiceField(
choices=tuple(enums.ALL_LANGUAGES.items()), required=True
)
text = serializers.CharField(required=True)
text = serializers.JSONField(required=True)
Collaborator:

Nit: I wouldn't call this `text` in this context.

@arnaud-robin (Collaborator)

arnaud-robin commented Dec 17, 2024

Hello Antho, nice solution! However, I think there might be issues when the order of the words differs between the two languages.

For example, I tried translating "Je te donne mon livre", which should translate to "I give you my book".
The payload is:

{
    "tid-52": "Je ",
    "tid-53": "te",
    "tid-54": " donne mon livre."
}

However, the answer from gpt-4o-mini is:

{
    "tid-52": "I ",
    "tid-53": "give ",
    "tid-54": "you my book."
}

Here the bold word has been changed to "give" instead of "you".

However, if I ask llama3.2, the answer becomes:

{
    "tid-52": "I ",
    "tid-53": "to",
    "tid-54": " give me my book."
}

where the sentence doesn't mean anything anymore.

@YousefED
Collaborator

@arnaud-robin you requested my review here, but I think you're right about:

there might be issues when the order of the words differ in the two languages

Good spot, this seems like a potential blocker and I missed it earlier. I'm doing an experiment to keep markdown formatting across LLM responses, which might be a viable alternative and would also make it possible to support operations other than "translate" (e.g. rewrite, simplify, etc.). It's very much WIP at the moment, though.

@AntoLC
Copy link
Collaborator Author

AntoLC commented Dec 19, 2024

Yes, I can see it is not the best, even with gpt-4o-mini...

I am still wondering if markdown is the right way to solve this problem.
Wouldn't it be easier to keep all the formatting with HTML?

Curious to see how you do it with markdown @YousefED

Successfully merging this pull request may close these issues.

⚗️AI return with complex data
3 participants