Commit Graph

141 Commits

Author SHA1 Message Date
speech_asr
831d00aec2 update 2023-04-17 16:26:40 +08:00
zhifu gao
33681507e1
Merge pull request #342 from alibaba-damo-academy/dev_cmz
fix task.py with no dest_sample_rate task; fix bug in train and infer
2023-04-13 10:03:42 +08:00
mengzhe.cmz
1ad439f96b fix task.py with no dest_sample_rate task; fix bug in train and infer 2023-04-12 19:25:29 +08:00
zhuzizyf
3bbe2824bc
Update dataset.py 2023-04-12 17:16:56 +08:00
zhifu gao
2e769fb36c
Merge branch 'main' into dev_cmz2 2023-04-07 15:54:09 +08:00
游雁
d0cd484fdc export 2023-03-31 15:05:37 +08:00
speech_asr
4e7a8283be update 2023-03-22 16:00:42 +08:00
speech_asr
8314c5f17e update 2023-03-21 16:28:22 +08:00
仁迷
60c8f036e0 update audio type check 2023-03-17 20:09:02 +08:00
speech_asr
7aee2a6a71 update 2023-03-17 15:14:18 +08:00
speech_asr
fab0da6ab7 update 2023-03-17 15:09:41 +08:00
志浩
2868fe3df4 Merge branch 'main' into dev_dzh 2023-03-16 19:24:21 +08:00
志浩
0ac06c029e fixbug path_name_type_list can [[any,str,str],[any,str,str]] 2023-03-16 19:24:15 +08:00
zhuzizyf
2cdb2d654f
Update dataset.py
fix dest_sample_rate bug
2023-03-11 14:33:14 +08:00
zhuyunfeng
4afdd97df4 Add finetune resampling function under small data type. 2023-03-11 13:23:34 +08:00
hnluo
1eacb8ae81
Update iterable_dataset.py 2023-03-10 20:48:41 +08:00
hnluo
fa1df90827
Update iterable_dataset.py 2023-03-10 20:09:47 +08:00
hnluo
2b9d6e819e
Update iterable_dataset.py 2023-03-10 18:33:13 +08:00
仁迷
62c592fac1 support mfcca infenence 2023-03-09 14:51:56 +08:00
仁迷
7984a37f8c update large dataset for sampling rate 2023-03-01 17:03:45 +08:00
九耳
ee06cb9c68 punctuation:add training code, support largedataset 2023-02-28 18:11:12 +08:00
hnluo
742f2e927d
Update iterable_dataset.py 2023-02-14 17:34:36 +08:00
zhifu gao
b3bfea34ad
Merge pull request #103 from alibaba-damo-academy/dev_lhn
fix persian text segment bug
2023-02-14 13:05:39 +08:00
仁迷
4aedebc3cd fix persian text segment bug 2023-02-14 11:30:42 +08:00
Zhihao Du
fc0fd54e94
Merge pull request #102 from alibaba-damo-academy/dev_lhn
update iterable dataset
2023-02-13 17:45:53 +08:00
hnluo
7656297365
Merge pull request #101 from alibaba-damo-academy/dev_lhn
Dev lhn
2023-02-13 17:28:06 +08:00
仁迷
0a38657206 update iterable dataset 2023-02-13 17:25:56 +08:00
仁迷
ff78a5ea80 update dataset audio load 2023-02-13 16:26:15 +08:00
仁迷
2c3836a882 update dataset audio load 2023-02-13 16:20:02 +08:00
wucong.lyb
9e8a52153d add language model infer pipeline 2023-02-10 10:54:27 +08:00
zhifu gao
de0ecb446f
Merge pull request #79 from alibaba-damo-academy/dev_wjm
Dev wjm
2023-02-09 10:57:25 +08:00
jmwang66
ded881802c update data2vec pretrain 2023-02-07 10:17:52 +08:00
hnluo
ade51b3a12
support pcm audio format 2023-02-06 17:03:38 +08:00
jmwang66
55b45487c7 update data2vec pretrain: dataset 2023-02-06 16:59:00 +08:00
jmwang66
9befa9e508 update data2vec pretrain: add clipping 2023-02-06 16:42:33 +08:00
hnluo
c14169f374
support audio uppersampling and downsampling 2023-02-05 12:12:03 +08:00
九耳
86d65112ab fix 2023-02-05 10:48:12 +08:00
仁迷
cdac560080 more audio formats support 2023-01-31 17:42:28 +08:00
jmwang66
12a7adfdf3 update version 0.1.6 2023-01-16 18:46:40 +08:00
lzr265946
a9e857e452 update funasr 0.1.3 2022-12-03 16:39:38 +08:00
游雁
c087854f71 create 2022-11-26 21:56:51 +08:00