[WIP] Add language selection for streaming with whisper + Improve tests #48

AudranBert · 2024-11-22T15:34:43Z

Add language selection for streaming with whisper, by default it will take the language found in the env settings. But you can pass a language in the config when starting streaming.

The PR is also improving tests to add tests about languages. Also removing some useless ones in order to reduce testing duration.

It also adds the possibility to pass a language in the config in case of offline decoding. It will enable having a same model instance used for multiple languages instead of launching another Docker.

Signed-off-by: AudranBert <[email protected]>

Jeronymous · 2024-11-27T17:17:30Z

whisper/stt/processing/streaming.py

@@ -43,17 +44,20 @@ async def wssDecode(ws: WebSocketServerProtocol, model_and_alignementmodel):
    try:
        config = json.loads(res)["config"]
        sample_rate = config["sample_rate"]
+        language = config.get("language", None)


Don't we need to update the doc, specifying the language can be specified in the configuration of the streaming request?

Also, why not to support language="*" (as in the env variable for the language in offline decoding)
that would need to replace "*" by None when reading the config

Don't we need to update the doc, specifying the language can be specified in the configuration of the streaming request?

Yes I need to update the doc

Signed-off-by: AudranBert <[email protected]>

damienlaine · 2024-11-28T17:52:13Z

Could you clarify the list of supported languages? For example, does it include "en," "fr," etc.? On the LinTO side, we consistently use BCP-47 codes for language representation.
Parsers (env, API directives...) shall at least support BCP-47 codes as inputs.

AudranBert added 3 commits November 22, 2024 14:38

add: language option for streaming (can be in env or in config)

bbe04b2

Signed-off-by: AudranBert <[email protected]>

add language as option in test_streaming

f3b4680

Signed-off-by: AudranBert <[email protected]>

reduce amount of tests

51c49ee

Signed-off-by: AudranBert <[email protected]>

Jeronymous reviewed Nov 27, 2024

View reviewed changes

Jeronymous and others added 2 commits November 27, 2024 18:23

Generalize the function, to format languages in general

c56ad3a

add language option in transcription config offline

eafa601

Signed-off-by: AudranBert <[email protected]>

AudranBert linked an issue Nov 28, 2024 that may be closed by this pull request

Add language selection for offline transcription with whisper models #53

Open

AudranBert added 4 commits November 28, 2024 11:50

update doc

ab501f8

Signed-off-by: AudranBert <[email protected]>

add language through config for celery

3f80d46

Signed-off-by: AudranBert <[email protected]>

fix doc whisper

d2a19cd

Signed-off-by: AudranBert <[email protected]>

refactor tests + add tests for languages

fb68f11

Signed-off-by: AudranBert <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Add language selection for streaming with whisper + Improve tests #48

[WIP] Add language selection for streaming with whisper + Improve tests #48

AudranBert commented Nov 22, 2024 •

edited

Loading

Jeronymous Nov 27, 2024 •

edited

Loading

Jeronymous Nov 27, 2024

AudranBert Nov 28, 2024

damienlaine commented Nov 28, 2024

[WIP] Add language selection for streaming with whisper + Improve tests #48

Are you sure you want to change the base?

[WIP] Add language selection for streaming with whisper + Improve tests #48

Conversation

AudranBert commented Nov 22, 2024 • edited Loading

Jeronymous Nov 27, 2024 • edited Loading

Choose a reason for hiding this comment

Jeronymous Nov 27, 2024

Choose a reason for hiding this comment

AudranBert Nov 28, 2024

Choose a reason for hiding this comment

damienlaine commented Nov 28, 2024

AudranBert commented Nov 22, 2024 •

edited

Loading

Jeronymous Nov 27, 2024 •

edited

Loading