Whisper Fine-tuning Recipe on Aishell1 #1466

yuekaizhang · 2024-01-17T08:21:31Z

This PR supports fine-tuning whisper models using aishell1.

Fix code comments & lint errors

	test (before fine-tuning)	test (after fine-tuning)	comment
medium	7.23	3.27	--epoch 10 --avg 4, ddp
large-v2	6.56	2.47	--epoch 10 --avg 6, deepspeed zero stage1
large-v3	6.06	2.84	--epoch 5 --avg 3, deepspeed zero stage1

yuekaizhang · 2024-01-17T08:37:12Z

Nice work!

Would you mind sharing how much GPU memory and how long it takes to fine-tune the Whisper model?

On a 8xA100(80GB) machine, it takes about 15 mins per epoch for large-v2/3 models. Since I am always set the max_duration as large as possible, I think people could fine-tune it on 24GB cards with deepspeed and smaller batch size.

egs/aishell/ASR/whisper/train.py

egs/aishell/ASR/local/compute_fbank_aishell.py

JinZr

LGTM! Thank you!

egs/aishell/ASR/whisper/decode.py

egs/aishell/ASR/whisper/requirements.txt

egs/aishell/ASR/RESULTS.md

See also k2-fsa/icefall#1466

csukuangfj · 2024-01-30T09:04:20Z

    model = whisper.load_model(filename)
  File "/opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages/whisper/__init__.py", line 147, in load_model
    dims = ModelDimensions(**checkpoint["dims"])
KeyError: 'dims'

It throws the above error while using whisper.load_model() with the checkpoint

Could you make your checkpoint compatible with whisper? For instance, you need to save dims in
the resulting checkpoint.

@yuekaizhang

yuekaizhang · 2024-01-31T07:52:26Z

    model = whisper.load_model(filename)
  File "/opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages/whisper/__init__.py", line 147, in load_model
    dims = ModelDimensions(**checkpoint["dims"])
KeyError: 'dims'
It throws the above error while using whisper.load_model() with the checkpoint

Could you make your checkpoint compatible with whisper? For instance, you need to save dims in the resulting checkpoint.

@yuekaizhang

Sorry, I added this converion script https://huggingface.co/yuekai/icefall_asr_aishell_whisper/blob/main/convert.sh. I have uploaded a converted medium model. Would you mind trying it again? https://huggingface.co/yuekai/icefall_asr_aishell_whisper/blob/main/exp_medium/whisper-medium-aishell1-epoch-10-avg-4.pt

See also k2-fsa/icefall#1466

csukuangfj · 2024-01-31T09:24:09Z

@yuekaizhang

Thanks! It works perfectly. I have converted it to sherpa-onnx with k2-fsa/sherpa-onnx#565

You can find the converted model at
https://huggingface.co/csukuangfj/sherpa-onnx-whisper-medium-aishell/tree/main

You can also try it in the following huggingface space
https://huggingface.co/spaces/k2-fsa/automatic-speech-recognition-with-whisper

yuekaizhang added 28 commits January 15, 2024 19:49

add decode seamlessm4t

f99f4d7

update finetuning codes

363c3f1

add requirements

3a7ad27

add fairseq2 require

cbc3852

update fine-tuning lr

0d6d8f9

update decoding from checkpoint

e815457

load checkpoint to decode

5f399dc

add decoding with avg model

cc64324

fix typo

72e9a43

change vocab table

7e387dd

add token files

22ee287

add custom tokenizer

2a288fb

fix loading

d926585

rename train, train2, add support to fine-tune embedding table

bb1c446

support whisper ft

6c2cd5b

using audio with any length

5bf3a9c

update lhotse version

8b832f1

change scaleadam to adamw

07cefa8

remove padding to 30s, compute validation loss once

98d11ab

clean up codes

92895f7

support deepspeed to finetune large model

b6418ac

update deepspeed model loading

fa7ad4d

support large-v3

2ce0980

add model saving

ac53222

remove seamless for next PR

e883bb6

revert asr data module

eea4645

clean codes

557b35c

add whisper fine-tuning results

84e4af9

yuekaizhang changed the title ~~Whisper Fine-tuning Recipes on Aishell1~~ Whisper Fine-tuning Recipe on Aishell1 Jan 17, 2024

yuekaizhang changed the title ~~Whisper Fine-tuning Recipe on Aishell1~~ [WIP] Whisper Fine-tuning Recipe on Aishell1 Jan 17, 2024

xingchensong reviewed Jan 17, 2024

View reviewed changes

egs/aishell/ASR/whisper/train.py Show resolved Hide resolved

xingchensong reviewed Jan 17, 2024

View reviewed changes

egs/aishell/ASR/whisper/train.py Show resolved Hide resolved

egs/aishell/ASR/whisper/train.py Outdated Show resolved Hide resolved

using monkey patch to replace models

bda4829

xingchensong mentioned this pull request Jan 22, 2024

[examples] update whisper results on aishell-1 wenet-e2e/wenet#2313

Merged

yuekaizhang added 2 commits January 22, 2024 15:20

fix requirements

b623c3b

fix lint

8d9ab30

yuekaizhang changed the title ~~[WIP] Whisper Fine-tuning Recipe on Aishell1~~ Whisper Fine-tuning Recipe on Aishell1 Jan 22, 2024

yuekaizhang added 2 commits January 22, 2024 16:15

remove model file

ab08201

fix wrong order of token slice

46605ea

JinZr reviewed Jan 24, 2024

View reviewed changes

egs/aishell/ASR/local/compute_fbank_aishell.py Outdated Show resolved Hide resolved

add manifest dir option

fd4ebf3

JinZr approved these changes Jan 26, 2024

View reviewed changes

JinZr merged commit 1c30847 into k2-fsa:master Jan 26, 2024
54 checks passed

csukuangfj reviewed Jan 30, 2024

View reviewed changes

egs/aishell/ASR/whisper/decode.py Show resolved Hide resolved

csukuangfj reviewed Jan 30, 2024

View reviewed changes

egs/aishell/ASR/whisper/requirements.txt Show resolved Hide resolved

csukuangfj reviewed Jan 30, 2024

View reviewed changes

egs/aishell/ASR/RESULTS.md Show resolved Hide resolved

csukuangfj mentioned this pull request Jan 30, 2024

Could you please provide a pretrained.py for whisper? #1480

Closed

csukuangfj added a commit to csukuangfj/sherpa-onnx that referenced this pull request Jan 30, 2024

Add fine-tuned whisper on aishell.

2e6535a

See also k2-fsa/icefall#1466

csukuangfj added a commit to csukuangfj/sherpa-onnx that referenced this pull request Jan 30, 2024

Add fine-tuned whisper on aishell.

bb5c765

See also k2-fsa/icefall#1466

csukuangfj added a commit to csukuangfj/sherpa-onnx that referenced this pull request Jan 30, 2024

Add fine-tuned whisper on aishell.

e16ec50

See also k2-fsa/icefall#1466

csukuangfj added a commit to csukuangfj/sherpa-onnx that referenced this pull request Jan 30, 2024

Add fine-tuned whisper on aishell.

f710c3b

See also k2-fsa/icefall#1466

csukuangfj added a commit to csukuangfj/sherpa-onnx that referenced this pull request Jan 30, 2024

Add fine-tuned whisper on aishell.

db67e9e

See also k2-fsa/icefall#1466

csukuangfj mentioned this pull request Jan 31, 2024

Add fine-tuned whisper model on aishell k2-fsa/sherpa-onnx#565

Merged

csukuangfj added a commit to k2-fsa/sherpa-onnx that referenced this pull request Jan 31, 2024

Add fine-tuned whisper model on aishell (#565)

2e8b321

See also k2-fsa/icefall#1466

rezame mentioned this pull request Feb 4, 2024

Whisper external language model #1489

Closed

marcoyang1998 mentioned this pull request Mar 28, 2024

Finetune Whisper model on LibriSpeech #1571

Open

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Whisper Fine-tuning Recipe on Aishell1 #1466

Whisper Fine-tuning Recipe on Aishell1 #1466

yuekaizhang commented Jan 17, 2024

yuekaizhang commented Jan 17, 2024 •

edited

Loading

JinZr left a comment

csukuangfj commented Jan 30, 2024

yuekaizhang commented Jan 31, 2024

csukuangfj commented Jan 31, 2024

Whisper Fine-tuning Recipe on Aishell1 #1466

Whisper Fine-tuning Recipe on Aishell1 #1466

Conversation

yuekaizhang commented Jan 17, 2024

yuekaizhang commented Jan 17, 2024 • edited Loading

JinZr left a comment

Choose a reason for hiding this comment

csukuangfj commented Jan 30, 2024

yuekaizhang commented Jan 31, 2024

csukuangfj commented Jan 31, 2024

yuekaizhang commented Jan 17, 2024 •

edited

Loading