
Support CTC/AED option for Zipformer recipe #1389

Merged (9 commits) on Jul 5, 2024

Conversation

yaozengwei (Collaborator) commented Nov 22, 2023

This PR adds a CTC/AED system to the Zipformer recipe.

  • CTC/AED results on LibriSpeech (WER on test-clean / test-other), trained for 50 epochs with --ctc-loss-scale=0.1 and --attention-decoder-loss-scale=0.9; decoding method: sample 100-best paths from the CTC lattice and rescore them with the attention decoder
    • Zipformer-S, 46.3M, 2.46 / 6.04
    • Zipformer-M, 90.0M, 2.22 / 4.97
    • Zipformer-L, 174.3M, 2.09 / 4.59
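The rescoring step described above can be sketched as follows. This is a schematic illustration, not the recipe's actual code: the real pipeline samples paths from a CTC lattice with k2 and scores them with the Zipformer attention decoder, whereas here the hypotheses and log-probabilities are made up, and the 0.1 / 0.9 weights (borrowed from the training loss scales) stand in for whatever decoding weights the recipe actually uses.

```python
def rescore_nbest(hyps, ctc_scores, attn_scores, ctc_scale=0.1, attn_scale=0.9):
    """Pick the hypothesis maximizing a weighted sum of CTC and
    attention-decoder log-probabilities."""
    assert len(hyps) == len(ctc_scores) == len(attn_scores)
    totals = [ctc_scale * c + attn_scale * a
              for c, a in zip(ctc_scores, attn_scores)]
    best = max(range(len(hyps)), key=lambda i: totals[i])
    return hyps[best], totals[best]

# Toy 3-best list (stand-in for the 100 paths sampled from the CTC lattice):
hyps = ["the cat sat", "the cat sad", "a cat sat"]
ctc_scores = [-4.2, -4.0, -5.1]   # CTC path log-probs (made up)
attn_scores = [-3.1, -6.5, -4.8]  # attention-decoder log-probs (made up)

best_hyp, best_score = rescore_nbest(hyps, ctc_scores, attn_scores)
```

The point of the sketch is that the attention decoder can overrule the raw CTC ranking: the CTC scores above mildly prefer the second hypothesis, but the combined score selects the first.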

@zw76859420 zw76859420 mentioned this pull request May 6, 2024
@yaozengwei yaozengwei merged commit f76afff into k2-fsa:master Jul 5, 2024
7 of 8 checks passed
xingchensong commented:
Nice results! It seems that Zipformer-M CTC/AED (90.0M, 2.22 / 4.97) is comparable to Zipformer-RNNT (65.55M, 2.21 / 4.82), and Zipformer-L CTC/AED (174.3M, 2.09 / 4.59) surpasses all prior benchmarks.

We know that in an RNNT model the majority of the parameters sit in the encoder, while in a CTC/AED model the decoder accounts for a significant share. This makes the Zipformer CTC/AED model appear to have many more parameters than the Zipformer RNNT, even though the encoder parameter counts of the two models might actually be quite similar. I'm therefore particularly intrigued by how the parameters in Zipformer-L CTC/AED are split between the encoder and decoder components. Could you provide more details?

yaozengwei (Collaborator, Author) commented Jul 7, 2024

> Nice results! It seems that Zipformer-M CTC/AED (90.0M, 2.22 / 4.97) is comparable to Zipformer-RNNT (65.55M, 2.21 / 4.82), and Zipformer-L CTC/AED (174.3M, 2.09 / 4.59) surpasses all prior benchmarks.
>
> We know that in an RNNT model the majority of the parameters sit in the encoder, while in a CTC/AED model the decoder accounts for a significant share. This makes the Zipformer CTC/AED model appear to have many more parameters than the Zipformer RNNT, even though the encoder parameter counts of the two models might actually be quite similar. I'm therefore particularly intrigued by how the parameters in Zipformer-L CTC/AED are split between the encoder and decoder components. Could you provide more details?

For these Zipformer CTC/AED models, we keep the attention-decoder configuration almost the same across sizes. (The different encoder output dimensions of Zipformer-S/M/L lead to slightly different parameter counts in the attention decoder.) For example:

  • Zipformer-L CTC/AED: 174,319,650 parameters in total; 146,013,641 in the encoder; 27,309,556 in the attention decoder.
  • Zipformer-M CTC/AED: 89,987,295 parameters in total; 63,382,150 in the encoder; 25,736,692 in the attention decoder.
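Per-submodule counts like the ones above are typically obtained in PyTorch with something like `sum(p.numel() for p in model.encoder.parameters())`. The sketch below mimics that idiom with a plain dict mapping parameter names to shapes, so it runs without PyTorch; the submodule names follow the `encoder` / `attention_decoder` split quoted above, but the shapes are invented for illustration.

```python
from math import prod

def count_parameters(named_shapes, prefix=""):
    """Sum the parameter counts of all entries whose name starts with
    `prefix` (empty prefix counts the whole model)."""
    return sum(prod(shape) for name, shape in named_shapes.items()
               if name.startswith(prefix))

# Hypothetical (name -> shape) table, standing in for model.named_parameters():
named_shapes = {
    "encoder.layer0.weight": (512, 512),
    "encoder.layer0.bias": (512,),
    "attention_decoder.embed.weight": (500, 512),
    "attention_decoder.out.weight": (500, 512),
}

total = count_parameters(named_shapes)                      # whole model
enc = count_parameters(named_shapes, "encoder.")            # encoder only
dec = count_parameters(named_shapes, "attention_decoder.")  # decoder only
```

Grouping by name prefix in this way is also how one can see where any leftover parameters live (e.g. the CTC output head, which belongs to neither the encoder nor the attention decoder).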

yfyeung pushed a commit to yfyeung/icefall that referenced this pull request Aug 9, 2024
* add attention-decoder loss option for zipformer recipe

* add attention-decoder-rescoring

* update export.py and pretrained_ctc.py

* update RESULTS.md