feat: implement VAD on realtime transcription #129

jhen0409 · 2023-09-21T05:20:53Z

Add a simple VAD (voice activity detection) as an option for starting transcription during recording; this greatly reduces wasted resources. It also helps with #89.

The current VAD implementation is taken from whisper.cpp/examples/common.cpp. We could switch to another library, such as libfvad, if we find it offers more benefits.
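For readers unfamiliar with this style of VAD, here is a minimal TypeScript sketch of an energy-based check in the same spirit: optionally high-pass filter the buffer, then compare the mean absolute amplitude of the last few milliseconds against that of the whole buffer. It is not the code from this PR or from whisper.cpp. The function names (`highPassFilter`, `isSpeechInTail`), the parameter order, and the textbook first-order filter are assumptions made for illustration; the parameters only loosely mirror the `vadMs`, `vadThold`, and `vadFreqThold` options.

```typescript
// Hypothetical sketch of an energy-based VAD; names and behavior are
// illustrative assumptions, not the implementation merged in this PR.

// Textbook first-order RC high-pass filter, applied in place to attenuate
// low-frequency noise below cutoffHz.
function highPassFilter(samples: Float32Array, cutoffHz: number, sampleRate: number): void {
  if (samples.length === 0) return;
  const rc = 1 / (2 * Math.PI * cutoffHz);
  const dt = 1 / sampleRate;
  const alpha = rc / (rc + dt);
  let prevIn = samples[0];
  let prevOut = samples[0];
  for (let i = 1; i < samples.length; i++) {
    const cur = samples[i];
    prevOut = alpha * (prevOut + cur - prevIn);
    prevIn = cur;
    samples[i] = prevOut;
  }
}

// Returns true when the last `lastMs` milliseconds are loud relative to the
// whole buffer, i.e. tail energy > vadThold * overall energy.
function isSpeechInTail(
  pcm: Float32Array,
  sampleRate: number,
  lastMs: number,
  vadThold: number,
  freqThold: number,
): boolean {
  const nSamples = pcm.length;
  const nLast = Math.floor((sampleRate * lastMs) / 1000);
  // Not enough audio to compare the tail against the rest of the buffer.
  if (nLast <= 0 || nLast >= nSamples) return false;

  // Work on a copy so the caller's buffer is left untouched.
  const data = Float32Array.from(pcm);
  if (freqThold > 0) highPassFilter(data, freqThold, sampleRate);

  let energyAll = 0;
  let energyLast = 0;
  for (let i = 0; i < nSamples; i++) {
    const a = Math.abs(data[i]);
    energyAll += a;
    if (i >= nSamples - nLast) energyLast += a;
  }
  energyAll /= nSamples;
  energyLast /= nLast;

  return energyLast > vadThold * energyAll;
}
```

A caller could run such a check on each recorded slice and skip transcription when it returns false, which is where the resource savings would come from. The threshold semantics here are one plausible reading; a real integration should follow the library's own definition.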

TODO:

Option: vadThold, vadFreqThold
Android
Use VAD to check last transcription

feat(ios): initial work for simple VAD

cdc7e44

jhen0409 force-pushed the simple-vad branch from 4d1385e to cdc7e44 September 21, 2023 05:23

jhen0409 added 5 commits September 22, 2023 09:38

feat(ios): skip vad if isTranscribing

47260a3

feat(ios): add vadMs / vadThold / vadFreqThold options

e38f7d6

feat(android): implement vad on realtime transcription

ef922c7

Merge branch 'main' into simple-vad

d5b6296

feat: use vad to check last transcription

655d9b3

jhen0409 marked this pull request as ready for review September 23, 2023 06:03

feat(example): do not use vad by default

7a9ff80

jhen0409 merged commit 965409d into main Sep 23, 2023

jhen0409 deleted the simple-vad branch September 23, 2023 06:29
