Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Remove the 30-second constraint from whisper. #471

Merged
merged 7 commits into from
Dec 7, 2023

Conversation

csukuangfj
Copy link
Collaborator

This PR removes the 30-second constraint for whisper models

In other words, you don't need to pad your audio to 30 seconds. Any duration T <= 30 can be used for whisper models.

You can save computation by removing paddings.

You can find the exported models at
https://github.com/k2-fsa/sherpa-onnx/releases/tag/asr-models

Screenshot 2023-12-07 at 14 40 42

@csukuangfj csukuangfj merged commit 3ae984f into k2-fsa:master Dec 7, 2023
15 of 16 checks passed
@csukuangfj csukuangfj deleted the whisper-audio-ctx branch December 7, 2023 09:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant