FunASR/egs/librispeech_100h/rnnt
aky15 269c201429
Dev aky2 (#588)
* support resume model from pai

* add padding for streaming rnnt conv input

* fix large dataset training bug

* bug fix

* modify aishell rnnt egs to support wav input

* add libri_100 rnnt recipe

* bug fix

* add librispeech rnnt recipe

* add librispeech README

* update rnnt results

* bug fix

---------

Co-authored-by: aky15 <ankeyu.aky@11.17.44.249>
2023-06-06 09:31:20 +08:00
..
conf Dev aky2 (#559) 2023-05-30 16:39:22 +08:00
local Dev aky2 (#559) 2023-05-30 16:39:22 +08:00
path.sh Dev aky2 (#559) 2023-05-30 16:39:22 +08:00
README.md Dev aky2 (#588) 2023-06-06 09:31:20 +08:00
run.sh Dev aky2 (#559) 2023-05-30 16:39:22 +08:00
utils Dev aky2 (#559) 2023-05-30 16:39:22 +08:00

Conformer Transducer Result

Training Config

  • Feature info: using 80 dims fbank, global cmvn, speed perturb(0.9, 1.0, 1.1), specaugment
  • Train config: conf/train_conformer_rnnt.yaml
  • LM config: LM was not used
  • Model size: 30.54M

Results (CER)

  • Decode config: conf/decode_rnnt_conformer.yaml
testset WER(%)
test_clean 6.64
test_other 17.12