Commit Graph

344 Commits

Author SHA1 Message Date
嘉渊
5feca0cc17 update repo 2023-06-14 23:39:48 +08:00
lingyunfly
3da40ad9fe
vad bugfix (#624)
Co-authored-by: 凌匀 <ailsa.zly@alibaba-inc.com>
2023-06-13 11:13:03 +08:00
lingyunfly
6bdae5723e
Optimize memory (#612)
Co-authored-by: 凌匀 <ailsa.zly@alibaba-inc.com>
2023-06-09 16:19:35 +08:00
hnluo
4ce2e2d76c
Merge pull request #571 from alibaba-damo-academy/dev_lhn
Dev lhn
2023-05-31 18:42:38 +08:00
haoneng.lhn
28cabff45f update 2023-05-31 15:42:43 +08:00
hnluo
51e80c1667
Update wav_frontend.py
update wav_frontend online
2023-05-31 13:56:53 +08:00
凌匀
7109589f0e support in_cache 2023-05-31 13:12:15 +08:00
haoneng.lhn
d9984c2062 update 2023-05-30 17:39:48 +08:00
lingyunfly
80f90dd076
support in_cache (#560) 2023-05-30 16:40:23 +08:00
aky15
556429d8a9
Dev aky2 (#559)
* support resume model from pai
* add padding for streaming rnnt conv input
* fix large dataset training bug
* bug fix
* modify aishell rnnt egs to support wav input
* add libri_100 rnnt recipe
Co-authored-by: aky15 <ankeyu.aky@11.17.44.249>
2023-05-30 16:39:22 +08:00
haoneng.lhn
84b4a01979 add paraformer online infer and finetune 2023-05-26 11:43:27 +08:00
haoneng.lhn
f630892863 update 2023-05-24 20:23:27 +08:00
嘉渊
2efd780568 update repo 2023-05-24 20:06:58 +08:00
haoneng.lhn
a9e2c603c0 update 2023-05-22 16:44:31 +08:00
haoneng.lhn
a2a70f776a add paraformer infer code 2023-05-22 15:39:09 +08:00
zhifu gao
97a689d65d
Merge pull request #526 from alibaba-damo-academy/dev_infer
Dev infer
2023-05-18 19:35:08 +08:00
yhliang
1e650fac78 fix bug 2023-05-18 11:27:24 +08:00
嘉渊
c3c945675b Merge branch 'dev_infer' of https://github.com/alibaba/FunASR into dev_infer 2023-05-17 19:56:07 +08:00
嘉渊
bd4bf9928d update repo 2023-05-17 19:55:59 +08:00
wucong.lyb
f83f3e5985 del lm 2023-05-17 19:54:34 +08:00
嘉渊
701022837a update repo 2023-05-17 19:53:00 +08:00
aky15
429995f4d0 Merge branch 'dev_infer' of https://github.com/alibaba-damo-academy/FunASR into dev_infer 2023-05-17 17:38:49 +08:00
aky15
9d01231fa6 rnnt inherits from ASRTask 2023-05-17 17:34:21 +08:00
haoneng.lhn
a7814a7bc3 fix paraformer online last chunk decoding strategy 2023-05-17 17:13:32 +08:00
嘉渊
6f0309a46d update repo 2023-05-16 16:20:45 +08:00
嘉渊
90d8e42e9e update repo 2023-05-16 16:18:35 +08:00
嘉渊
8629d30d3d update repo 2023-05-16 15:06:25 +08:00
嘉渊
85e7f2225c update repo 2023-05-15 14:02:13 +08:00
zhifu gao
15d5ba7882
Merge pull request #479 from alibaba-damo-academy/dev_aky
rnnt bug fix
2023-05-09 13:56:58 +08:00
aky15
77045e7bb7 rnnt bug fix 2023-05-09 11:16:07 +08:00
jmwang66
8dab6d184a
Merge pull request #473 from alibaba-damo-academy/dev_smohan
Add speaker-attributed ASR task for alimeeting (baseline for m2met2.0).
2023-05-09 10:58:33 +08:00
嘉渊
e9cafb55ce update repo 2023-05-08 19:12:28 +08:00
smohan-speech
49f13908de add speaker-attributed ASR task for alimeeting 2023-05-07 02:27:58 +08:00
smohan-speech
d76aea23d9 add speaker-attributed ASR task for alimeeting 2023-05-07 02:21:58 +08:00
smohan-speech
a73123bcfc add speaker-attributed ASR task for alimeeting 2023-05-06 16:17:48 +08:00
shixian.shi
e1d535e697 update neat contextual paraformer 2023-05-05 11:53:39 +08:00
shixian.shi
a6889a3170 update 2023-05-04 19:33:50 +08:00
shixian.shi
c238436e07 update 2023-05-04 19:31:19 +08:00
维石
f9eefa34ff update 2023-05-04 17:09:34 +08:00
shixian.shi
c91430542e update 2023-05-04 16:15:26 +08:00
haoneng.lhn
15e73d8baa update 2023-04-28 16:26:14 +08:00
嘉渊
675951eab8 update 2023-04-28 15:37:36 +08:00
嘉渊
f701679677 update 2023-04-28 15:34:48 +08:00
嘉渊
7f74ab462e update 2023-04-28 15:30:47 +08:00
嘉渊
e1549946bc update 2023-04-28 15:28:01 +08:00
嘉渊
9611e07e39 update 2023-04-28 15:20:32 +08:00
嘉渊
f97e0eb9ee update 2023-04-28 15:17:38 +08:00
嘉渊
607073619c update 2023-04-27 19:27:49 +08:00
嘉渊
6997763bf6 update 2023-04-27 17:51:13 +08:00
嘉渊
9539dec5c7 update 2023-04-27 17:31:54 +08:00
嘉渊
10e37a721f update 2023-04-27 17:24:47 +08:00
嘉渊
6ed27c64c9 update 2023-04-27 17:19:39 +08:00
shixian.shi
3c0a9fb7c1 fix name 2023-04-27 12:10:41 +08:00
shixian.shi
aa910b9860 update advanced clas, including model and dataset 2023-04-27 12:03:31 +08:00
hnluo
9ff5b683db
Update sanm_encoder.py 2023-04-27 01:49:01 +08:00
haoneng.lhn
7584bbd6f3 update paraformer streaming code 2023-04-27 00:21:20 +08:00
zhifu gao
97ed4fada4
Merge pull request #423 from alibaba-damo-academy/dev_aky
update error calculator for rnnt
2023-04-26 13:28:43 +08:00
aky15
bdb8a99da4 update error calculator for rnnt 2023-04-26 10:34:48 +08:00
嘉渊
e358063f03 update 2023-04-24 23:15:20 +08:00
嘉渊
ec383fba56 update 2023-04-24 23:11:02 +08:00
嘉渊
87da739304 update 2023-04-24 23:09:33 +08:00
嘉渊
94a1deb5fb update 2023-04-24 16:15:20 +08:00
嘉渊
bb2113434a update 2023-04-24 16:05:42 +08:00
嘉渊
0149983c23 update 2023-04-24 15:00:36 +08:00
zhifu gao
033ef80e4e
Merge pull request #405 from alibaba-damo-academy/dev_aky
Dev aky
2023-04-24 11:17:24 +08:00
aky15
490531ed51 rnnt bug fix 2023-04-23 19:27:17 +08:00
凌匀
7a7ead00bc vad bug fix 2023-04-21 21:40:11 +08:00
凌匀
49be65031b merge inference.py and memory optimization 2023-04-21 18:46:29 +08:00
zhifu gao
55d0d1fd63
Merge pull request #384 from alibaba-damo-academy/dev_aky
Create __init__.py
2023-04-19 17:14:19 +08:00
aky15
2360b191d6
Create __init__.py 2023-04-19 17:05:31 +08:00
aky15
931cac99c1 add init file 2023-04-19 16:45:49 +08:00
aky15
606141f5ba
Merge pull request #351 from alibaba-damo-academy/dev_aky
Dev aky
2023-04-18 14:04:43 +08:00
aky15
8672352ecd merge many functions 2023-04-17 16:09:23 +08:00
hnluo
24f73665e2
Merge pull request #367 from alibaba-damo-academy/dev_lhn2
Dev lhn2
2023-04-17 15:49:45 +08:00
haoneng.lhn
25590804ba update 2023-04-17 15:19:56 +08:00
aky15
b3b4c1bc5b rename some functions 2023-04-17 11:19:14 +08:00
shixian.shi
2d65e5e754 update bicifparaformer forward 2023-04-17 10:59:34 +08:00
zhifu gao
2cc6b50453
Merge pull request #352 from alibaba-damo-academy/dev_lhn2
support wav_file input
2023-04-14 15:52:43 +08:00
aky15
fa25b637b0 remove some functions 2023-04-14 15:44:50 +08:00
zhifu gao
511caf64ff
Merge pull request #355 from alibaba-damo-academy/dev_dzh
add authority
2023-04-14 15:43:22 +08:00
志浩
157e854015 add authority 2023-04-14 15:37:22 +08:00
haoneng.lhn
5589b4a617 support wav_file input 2023-04-14 11:47:28 +08:00
aky15
256035b6c1 rnnt reorg 2023-04-14 11:38:00 +08:00
游雁
f0fdc051fb Author 2023-04-14 10:24:13 +08:00
yufan
d268a4360f
Update e2e_asr_mfcca.py
add arxiv link
2023-04-14 09:58:08 +08:00
zhifu gao
4fffbae0c5
Merge pull request #349 from alibaba-damo-academy/dev_zly2
vad bug fix
2023-04-13 15:07:51 +08:00
lingyunfly
3f15e4268a
Update e2e_vad.py 2023-04-13 15:03:23 +08:00
zhifu gao
c54bc90d36
Merge pull request #347 from alibaba-damo-academy/dev_cmz
add organization; change class name
2023-04-13 14:22:55 +08:00
mengzhe.cmz
e21a6ed2d8 add 2023-04-13 13:38:22 +08:00
aky15
7d1efe158e rnnt reorg 2023-04-12 16:49:56 +08:00
haoneng.lhn
00d3f31915 update loading cmvn_file 2023-04-12 13:09:23 +08:00
speech_asr
dfa356a10c update 2023-04-11 00:27:54 +08:00
speech_asr
0a954637cb update 2023-04-11 00:26:03 +08:00
speech_asr
7161312271 update 2023-04-11 00:24:12 +08:00
speech_asr
23bc5dee4e update 2023-04-11 00:21:45 +08:00
speech_asr
df662541a8 update 2023-04-11 00:13:30 +08:00
speech_asr
d5a80d642a update 2023-04-11 00:09:29 +08:00
speech_asr
5756ed9165 update 2023-04-10 19:27:51 +08:00
zhifu gao
2e769fb36c
Merge branch 'main' into dev_cmz2 2023-04-07 15:54:09 +08:00
haoneng.lhn
6be782d9fd fix decoder cache 2023-04-03 19:57:26 +08:00
haoneng.lhn
0ca6876f58 update 2023-04-03 14:15:51 +08:00
游雁
5788a4ca17 export 2023-03-31 15:23:35 +08:00
游雁
d0cd484fdc export 2023-03-31 15:05:37 +08:00
shixian.shi
e3c094ed9d finetune entire bicif_paraformer 2023-03-29 17:20:27 +08:00
北念
775fcd1b14 fix ContextualBiasDecoder spell 2023-03-29 17:14:29 +08:00
北念
2b2653ae2b fix contextualparaformer bias_embed 2023-03-29 16:51:16 +08:00
游雁
1f8b46402c export 2023-03-29 15:57:22 +08:00
haoneng.lhn
bb5f0cfc3b update 2023-03-29 13:28:52 +08:00
haoneng.lhn
a65a408f28 update 2023-03-29 13:28:00 +08:00
haoneng.lhn
d0d8684b96 update 2023-03-29 11:49:06 +08:00
凌匀
946000a29a support max_end_sil 2023-03-24 11:11:41 +08:00
zhifu gao
e62e208a5c
Merge pull request #251 from alibaba-damo-academy/dev_lhn
Dev lhn
2023-03-16 19:50:52 +08:00
Zhihao Du
38de2af5bf
Merge branch 'main' into dev_dzh 2023-03-16 19:41:34 +08:00
志浩
2868fe3df4 Merge branch 'main' into dev_dzh 2023-03-16 19:24:21 +08:00
志浩
49ded3a686 modify diar pipeline 2023-03-16 19:18:03 +08:00
speech_asr
f33ebfd1c7 update 2023-03-15 16:11:44 +08:00
speech_asr
fbec0f003d update 2023-03-15 15:58:43 +08:00
speech_asr
f691014c8a update 2023-03-15 15:43:18 +08:00
speech_asr
e9f6703350 update 2023-03-15 15:29:31 +08:00
speech_asr
2f933cb101 update 2023-03-15 15:23:08 +08:00
仁迷
62f88ea941 fix decoder cache bug 2023-03-15 14:57:09 +08:00
speech_asr
26b81480a8 update 2023-03-15 11:35:18 +08:00
speech_asr
6fe0d840f7 update 2023-03-15 11:23:29 +08:00
speech_asr
6165c13918 update 2023-03-15 11:15:00 +08:00
speech_asr
4d2bf9fe3c update 2023-03-14 17:13:25 +08:00
Lizerui9926
a7ab8bd688
Merge pull request #230 from alibaba-damo-academy/dev_wjm
Dev wjm
2023-03-14 16:45:30 +08:00
speech_asr
ee1b0ec605 update 2023-03-14 16:37:36 +08:00
speech_asr
64b591eb6f update 2023-03-14 16:20:24 +08:00
speech_asr
141a4737f7 update 2023-03-14 15:54:28 +08:00
zhifu gao
e0bd877ac0
Merge pull request #226 from alibaba-damo-academy/dev_dzh
Dev dzh
2023-03-14 14:36:31 +08:00
志浩
f191f4c868 modify statistic pooling layer 2023-03-14 14:31:27 +08:00
仁迷
3762d21300 add streaming paraformer code 2023-03-13 22:02:54 +08:00
speech_asr
e27de5aa6b update ola 2023-03-13 18:45:27 +08:00
zhifu gao
0a729038cf
Merge pull request #218 from alibaba-damo-academy/dev_ts
update timestamp related codes and egs_modelscope
2023-03-13 17:47:56 +08:00
shixian.shi
f5aa97f7bf update params name 2023-03-13 17:39:18 +08:00
shixian.shi
a1fe3c635f update tp inference 2023-03-13 17:26:20 +08:00
凌匀
6da711a8d3 support vad_inference_online 2023-03-13 16:45:00 +08:00
speech_asr
229efa6250 update ola 2023-03-13 16:25:53 +08:00
speech_asr
b6126fd539 update ola 2023-03-13 16:16:57 +08:00
shixian.shi
76fd90d230 add class TimestampPredictor in e2e 2023-03-13 16:09:41 +08:00
speech_asr
3ff62dbb97 update ola 2023-03-13 16:04:27 +08:00
speech_asr
b7b65c844d update ola 2023-03-13 15:33:23 +08:00
speech_asr
8762d99735 update ola 2023-03-13 15:30:17 +08:00
zhifu gao
9be8a443d7
Merge pull request #207 from alibaba-damo-academy/dev_dzh
Dev dzh
2023-03-10 18:24:39 +08:00
志浩
773ab31780 modify unit test for speech_diarization_sond-en-us-callhome-8k-n16k4-pytorch 2023-03-10 14:33:21 +08:00
志浩
c1f5bc2e4f modify unit test for speech_diarization_sond-en-us-callhome-8k-n16k4-pytorch 2023-03-09 22:33:24 +08:00
志浩
bb40093a64 modify unit test for speech_diarization_sond-en-us-callhome-8k-n16k4-pytorch 2023-03-09 17:34:01 +08:00
志浩
b0abb2f4c0 modify unit test for speech_diarization_sond-en-us-callhome-8k-n16k4-pytorch 2023-03-09 17:28:10 +08:00
志浩
cf41c0aa3a modify unit test for speech_diarization_sond-en-us-callhome-8k-n16k4-pytorch 2023-03-09 17:24:41 +08:00
志浩
77cbfde968 modify unit test for speech_diarization_sond-en-us-callhome-8k-n16k4-pytorch 2023-03-09 17:17:28 +08:00
志浩
c3ca7d963e modify unit test for speech_diarization_sond-en-us-callhome-8k-n16k4-pytorch 2023-03-09 17:04:28 +08:00
志浩
6e5c6f33fd modify unit test for speech_diarization_sond-en-us-callhome-8k-n16k4-pytorch 2023-03-09 17:00:26 +08:00
志浩
faa8ad377a modify unit test for speech_diarization_sond-en-us-callhome-8k-n16k4-pytorch 2023-03-09 16:28:47 +08:00
志浩
777ae05adb add en sv model 2023-03-09 12:16:58 +08:00
shixian.shi
4fb44fb330 update timestamp pipeline 2023-03-08 16:29:33 +08:00
凌匀
3388361d3b update vad inference 2023-02-28 14:33:40 +08:00
凌匀
4c053ccc39 gpu bug fix 2023-02-27 19:10:15 +08:00
zhifu gao
8cc5bbf99a
Merge pull request #159 from alibaba-damo-academy/dev_dzh
Dev dzh
2023-02-27 17:01:48 +08:00
志浩
97f8201138 fixbug sond initial 2023-02-27 15:03:07 +08:00
志浩
88efde8799 fixbug sond initial 2023-02-27 15:00:41 +08:00
凌匀
31eed1834f in_cache & support soundfile read 2023-02-27 13:33:55 +08:00
志浩
0a6ff596c6 sond pipeline 2023-02-24 11:50:42 +08:00
志浩
04a7ce3205 sond pipeline 2023-02-23 17:53:04 +08:00
游雁
3f7f737587 fbank online 2023-02-22 20:11:31 +08:00
游雁
2d7bd18d0e fbank online 2023-02-22 20:10:20 +08:00
凌匀
91027ddab4 fix vad results bug 2023-02-16 22:11:18 +08:00
凌匀
d6fdd1c793 support vad streaming decoder 2023-02-16 14:56:32 +08:00
志浩
5da92c1fa9 add training related code for sond 2023-02-15 11:51:27 +08:00
yufan-aslp
a319bf6e5f dev_yufan egs_mfcca update 2023-02-14 14:54:04 +08:00
游雁
865ae89f0a export model 2023-02-13 17:43:01 +08:00
志浩
be3ade8748 add sond model 2023-02-10 19:07:15 +08:00
志浩
f6a1cdaf34 add sond model 2023-02-10 18:56:14 +08:00
北念
5231b54af8 add ContextualParaformer 2023-02-09 19:37:21 +08:00
北念
ad0039596c add BiCifParaformer 2023-02-09 19:11:16 +08:00
北念
16d4e00549 add BiCifParaformer 2023-02-09 17:53:04 +08:00
hnluo
579b998ded
Merge pull request #83 from alibaba-damo-academy/dev_lzr
remove useless vars and fix bug in predictor tail_process_fn
2023-02-09 15:26:57 +08:00
lzr265946
44fe2e811f fix bug in predictor tail_process_fn 2023-02-09 15:23:40 +08:00
lzr265946
983bb9382e fix bug in predictor tail_process_fn 2023-02-09 15:14:20 +08:00
仁迷
c5c5c55b90 fix uniasr training bug 2023-02-09 15:14:13 +08:00
zhifu gao
de0ecb446f
Merge pull request #79 from alibaba-damo-academy/dev_wjm
Dev wjm
2023-02-09 10:57:25 +08:00
zhifu gao
8871dcb93a
Merge pull request #73 from alibaba-damo-academy/main
fix bug, batch cif predictor tail
2023-02-08 13:21:12 +08:00
游雁
87bff7ae59 export model 2023-02-07 22:51:39 +08:00
lzr265946
a3fe16f871 fix bug in predictor tail_process_fn 2023-02-07 20:35:56 +08:00
jmwang66
7bea618623 update data2vec pretrain 2023-02-06 16:09:24 +08:00
游雁
2f3d36d689 Merge branch 'dev' of github.com:alibaba-damo-academy/FunASR into dev
add
2023-01-30 17:51:07 +08:00
游雁
cfb2fda87c fix bug, ys_pad_masked in sampler of paraformer 2023-01-30 17:50:36 +08:00
lingyunfly
60d3b060f1
Update e2e_vad.py
fix ComputeDecibel bug
2023-01-19 14:32:05 +08:00
jmwang66
12a7adfdf3 update version 0.1.6 2023-01-16 18:46:40 +08:00
游雁
0515095886 paraformer batch padding 2022-12-11 20:58:08 +08:00
jmwang66
0b8348376a update FunASR version==0.1.4 2022-12-09 22:16:23 +08:00
lzr265946
a9e857e452 update funasr 0.1.3 2022-12-03 16:39:38 +08:00
TeaPoly
1b9ac4f7a2 Fix some issue to make batch inference easy for predictor and decoder. 2022-12-02 12:00:22 +08:00
游雁
a7251d3ff3 update details 2022-11-29 12:57:21 +08:00
游雁
c087854f71 create 2022-11-26 21:56:51 +08:00