Commit Graph

135 Commits

Author SHA1 Message Date
shixian.shi
71d466e745 update AverageShiftCalculator in utils 2023-03-13 19:47:42 +08:00
shixian.shi
f5aa97f7bf update params name 2023-03-13 17:39:18 +08:00
shixian.shi
5a7ee30783 update timestamp related codes and egs_modelscope 2023-03-13 15:21:13 +08:00
zhifu gao
0a09368a64
Merge pull request #180 from zhuzizyf/finetune_fix
Update wav_utils.py
2023-03-03 11:01:21 +08:00
zhuzizyf
1a39b6f981
Update wav_utils.py
Because there are no uppercase letters in the dictionary, when there are uppercase letters in the annotated text, the finetune result will be "unk", so uniformly converted to lowercase when read the annotated text.
2023-03-03 10:33:51 +08:00
shixian.shi
9dd4901aad rapid_paraformer.utils.timestamp_utils 2023-03-02 19:50:06 +08:00
zhifu gao
5d4b0c3994
Merge pull request #167 from alibaba-damo-academy/dev_lhn
fix text postprocess bug
2023-03-01 11:17:18 +08:00
仁迷
e9ea65679a fix text postprocess bug 2023-03-01 11:14:09 +08:00
shixian.shi
57f2a51f9a onnx supports tiny and bicif paraformer 2023-02-27 16:55:06 +08:00
仁迷
b6a1c6c1e6 fix data dir filter bug 2023-02-27 14:47:32 +08:00
dingbig
bea5d98423 Add sentence timestamp support
Added support for statement event timestamp, which is particularly useful for applications such as lyrics and subtitles.
2023-02-22 19:48:50 +08:00
shixian.shi
03250ae634 timestamp func bug fix 2023-02-21 10:10:17 +08:00
hnluo
4a796298cc
Update asr_utils.py 2023-02-20 15:01:50 +08:00
lzr265946
546262a0c6 remove useless code 2023-02-16 15:22:14 +08:00
hnluo
71766839fd
Merge pull request #106 from alibaba-damo-academy/dev
Dev
2023-02-14 15:15:15 +08:00
speech_asr
216dc0978c add wav/text mismatch process 2023-02-14 14:58:46 +08:00
speech_asr
66a8235fbf add wav/text mismatch process 2023-02-14 14:45:39 +08:00
speech_asr
e180abe4fd update docs 2023-02-14 14:39:45 +08:00
志浩
f6a1cdaf34 add sond model 2023-02-10 18:56:14 +08:00
lzr265946
7aa2e885f4 support for turning off timestamps 2023-02-10 13:46:01 +08:00
北念
ad0039596c add BiCifParaformer 2023-02-09 19:11:16 +08:00
北念
16d4e00549 add BiCifParaformer 2023-02-09 17:53:04 +08:00
hnluo
d7e43300fb
Merge pull request #81 from alibaba-damo-academy/dev
Create vad_inference_launch.py
2023-02-09 15:30:21 +08:00
lzr265946
03875965c8 remove global vars 2023-02-09 15:13:14 +08:00
speech_asr
cced441e5f add file flush 2023-02-09 15:12:14 +08:00
zhifu gao
08384ef9eb
Merge pull request #75 from alibaba-damo-academy/dev
update github.io page
2023-02-08 19:52:18 +08:00
jmwang66
6a41e13ba9 upload github.io 2023-02-08 17:30:19 +08:00
hnluo
02a3eefb35
fix audio_in type bug 2023-02-07 10:09:51 +08:00
hnluo
9c64377c98
support pcm audio format 2023-02-06 17:05:06 +08:00
仁迷
cdac560080 more audio formats support 2023-01-31 17:42:28 +08:00
jmwang66
12a7adfdf3 update version 0.1.6 2023-01-16 18:46:40 +08:00
仁迷
ad75a464d8 update text postprocess 2023-01-09 19:34:52 +08:00
jmwang66
d653df71cb update text postprocess 2022-12-23 10:40:23 +08:00
lzr265946
a9e857e452 update funasr 0.1.3 2022-12-03 16:39:38 +08:00
游雁
c087854f71 create 2022-11-26 21:56:51 +08:00