FunASR/egs/librispeech/rnnt
aky15 269c201429
Dev aky2 (#588)
* support resume model from pai

* add padding for streaming rnnt conv input

* fix large dataset training bug

* bug fix

* modify aishell rnnt egs to support wav input

* add libri_100 rnnt recipe

* bug fix

* add librispeech rnnt recipe

* add librispeech README

* update rnnt results

* bug fix

---------

Co-authored-by: aky15 <ankeyu.aky@11.17.44.249>
2023-06-06 09:31:20 +08:00
..
conf Dev aky2 (#588) 2023-06-06 09:31:20 +08:00
local Dev aky2 (#588) 2023-06-06 09:31:20 +08:00
path.sh Dev aky2 (#588) 2023-06-06 09:31:20 +08:00
README.md Dev aky2 (#588) 2023-06-06 09:31:20 +08:00
run.sh Dev aky2 (#588) 2023-06-06 09:31:20 +08:00
utils Dev aky2 (#588) 2023-06-06 09:31:20 +08:00

Streaming RNN-T Result

Training Config

  • 8 gpu(Tesla V100)
  • Feature info: using 80 dims fbank, global cmvn, speed perturb(0.9, 1.0, 1.1), specaugment
  • Train config: conf/train_conformer_rnnt_unified.yaml
  • chunk config: chunk size 16, 1 left chunk
  • LM config: LM was not used
  • Model size: 90M

Results (CER)

  • Decode config: conf/decode_rnnt_conformer_streaming.yaml
testset WER(%)
test_clean 3.58
test_other 9.27