嘉渊
6e66a74ae6
update
2023-04-25 16:33:00 +08:00
嘉渊
7436acc5dd
update
2023-04-25 16:29:39 +08:00
嘉渊
70f9a8f890
update
2023-04-25 01:29:12 +08:00
嘉渊
e86b95e747
update
2023-04-24 22:57:04 +08:00
嘉渊
f2b9780b29
update
2023-04-24 22:47:00 +08:00
haoneng.lhn
a8e92e4fb4
update data filtering recipe
2023-04-23 15:03:56 +08:00
speech_asr
993fdd8ecf
update
2023-04-20 17:01:47 +08:00
speech_asr
eac9f111b5
update
2023-04-20 16:59:26 +08:00
speech_asr
3e77fd4430
update
2023-04-20 16:41:22 +08:00
speech_asr
d6cc6896e4
update
2023-04-20 16:33:30 +08:00
speech_asr
518465d089
update
2023-04-20 16:07:01 +08:00
speech_asr
a29166b9a0
update
2023-04-20 16:03:54 +08:00
speech_asr
200d1ede05
update
2023-04-20 15:56:25 +08:00
speech_asr
c452b2a3c7
update
2023-04-20 15:43:29 +08:00
speech_asr
68852c3072
update
2023-04-20 15:35:25 +08:00
speech_asr
43c30967b0
update
2023-04-20 11:48:19 +08:00
speech_asr
02f2a3c2ec
update
2023-04-20 11:38:20 +08:00
speech_asr
7522c59e74
update
2023-04-20 11:16:49 +08:00
speech_asr
680cdb55bb
update
2023-04-19 14:49:36 +08:00
speech_asr
58fb22cb2b
update
2023-04-19 10:09:51 +08:00
speech_asr
05d4176e88
update
2023-04-18 19:28:33 +08:00
speech_asr
831d00aec2
update
2023-04-17 16:26:40 +08:00
speech_asr
d9ad40bf6f
update
2023-04-17 11:45:41 +08:00
speech_asr
6659c37d81
update
2023-04-17 11:23:37 +08:00
speech_asr
bd7455ec7d
update
2023-04-12 10:43:01 +08:00
北念
cf843d144a
fix CER computation problems
2023-04-04 14:26:22 +08:00
hnluo
85e8e0ed0d
Update postprocess_utils.py
2023-03-27 17:15:10 +08:00
Xian Shi
92248eb07b
Merge pull request #274 from dingbig/fix-punc
...
fix crash when token_int is zero and support more punctuation marks in sentence timestamps
2023-03-21 17:52:18 +08:00
Yuanhang Zhang
3b3aebb124
Fix timestamp prediction on empty ASR outputs
...
_, timestamp = ts_prediction_lfr6_standard(us_alphas[i],
ValueError: not enough values to unpack (expected 2, got 0)
If char_list is empty, we should still return both the text (an empty string) and the timestamps, so that the return value matches the function signature.
2023-03-21 17:23:26 +08:00
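A minimal sketch of the fix described in the commit above, assuming a simplified function: `predict_timestamps`, its `frame_shift_ms` parameter, and the evenly spaced spans are hypothetical stand-ins, not the signature or logic of `ts_prediction_lfr6_standard`. It only shows why the empty case must still return two values so that a caller writing `_, timestamp = ...` can unpack the result.

```python
from typing import List, Tuple


def predict_timestamps(
    char_list: List[str], frame_shift_ms: int = 60
) -> Tuple[str, List[List[int]]]:
    """Hypothetical stand-in: assigns one evenly spaced span per token."""
    if not char_list:
        # Empty ASR output: still return (text, timestamps) so that
        # two-value unpacking at the call site does not raise ValueError.
        return "", []
    timestamps = [
        [i * frame_shift_ms, (i + 1) * frame_shift_ms]
        for i in range(len(char_list))
    ]
    return " ".join(char_list), timestamps


# The caller unpacks two values in both the empty and non-empty case.
_, empty_timestamps = predict_timestamps([])
text, timestamps = predict_timestamps(["a", "b"])
```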
dingbig
d80d046365
fix crash when token_int is zero and support more punctuation marks in sentence timestamps
...
1) If token_int is 0, subsequent processing crashes. This bug has been fixed: when token_int is 0, there is no need to continue processing.
2) Modify time_stamp_sentence to support more punctuation marks.
2023-03-21 17:11:49 +08:00
lzr265946
2ab8b7f473
modify abbr postprocess
2023-03-21 12:55:16 +08:00
仁迷
60c8f036e0
update audio type check
2023-03-17 20:09:02 +08:00
shixian.shi
f63a72c52e
update tools
2023-03-15 10:22:30 +08:00
shixian.shi
f59a72d24e
release timestamp-related tools
2023-03-15 10:21:32 +08:00
shixian.shi
0b06794fde
bug fix
2023-03-13 20:14:41 +08:00
shixian.shi
9c21bbb96b
bug fix
2023-03-13 19:55:47 +08:00
shixian.shi
4b16316d49
bug fix
2023-03-13 19:53:33 +08:00
shixian.shi
71d466e745
update AverageShiftCalculator in utils
2023-03-13 19:47:42 +08:00
shixian.shi
f5aa97f7bf
update params name
2023-03-13 17:39:18 +08:00
shixian.shi
5a7ee30783
update timestamp-related code and egs_modelscope
2023-03-13 15:21:13 +08:00
zhifu gao
0a09368a64
Merge pull request #180 from zhuzizyf/finetune_fix
...
Update wav_utils.py
2023-03-03 11:01:21 +08:00
zhuzizyf
1a39b6f981
Update wav_utils.py
...
Because the dictionary contains no uppercase letters, any uppercase letters in the annotated text cause the fine-tuning result to be "unk". The annotated text is therefore converted to lowercase when it is read.
2023-03-03 10:33:51 +08:00
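The lowercasing described in the commit above can be sketched as follows. This is an illustration under assumptions, not the code in wav_utils.py: `read_transcript_line` is a hypothetical helper, and the "utterance id, space, text" line layout is assumed.

```python
from typing import Tuple


def read_transcript_line(line: str) -> Tuple[str, str]:
    """Hypothetical helper: split 'utt_id text' and lowercase the text."""
    utt_id, _, text = line.strip().partition(" ")
    # Lowercase so Latin letters match a lowercase-only dictionary;
    # characters without case (e.g. Chinese) are left unchanged.
    return utt_id, text.lower()


utt_id, text = read_transcript_line("utt1 Hello WORLD\n")
```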
shixian.shi
9dd4901aad
rapid_paraformer.utils.timestamp_utils
2023-03-02 19:50:06 +08:00
zhifu gao
5d4b0c3994
Merge pull request #167 from alibaba-damo-academy/dev_lhn
...
fix text postprocess bug
2023-03-01 11:17:18 +08:00
仁迷
e9ea65679a
fix text postprocess bug
2023-03-01 11:14:09 +08:00
shixian.shi
57f2a51f9a
onnx supports tiny and bicif paraformer
2023-02-27 16:55:06 +08:00
仁迷
b6a1c6c1e6
fix data dir filter bug
2023-02-27 14:47:32 +08:00
dingbig
bea5d98423
Add sentence timestamp support
...
Added support for sentence-level timestamps, which are particularly useful for applications such as lyrics and subtitles.
2023-02-22 19:48:50 +08:00
shixian.shi
03250ae634
timestamp func bug fix
2023-02-21 10:10:17 +08:00
hnluo
4a796298cc
Update asr_utils.py
2023-02-20 15:01:50 +08:00
lzr265946
546262a0c6
remove useless code
2023-02-16 15:22:14 +08:00
hnluo
71766839fd
Merge pull request #106 from alibaba-damo-academy/dev
...
Dev
2023-02-14 15:15:15 +08:00
speech_asr
216dc0978c
add wav/text mismatch process
2023-02-14 14:58:46 +08:00
speech_asr
66a8235fbf
add wav/text mismatch process
2023-02-14 14:45:39 +08:00
speech_asr
e180abe4fd
update docs
2023-02-14 14:39:45 +08:00
志浩
f6a1cdaf34
add sond model
2023-02-10 18:56:14 +08:00
lzr265946
7aa2e885f4
support for turning off timestamps
2023-02-10 13:46:01 +08:00
北念
ad0039596c
add BiCifParaformer
2023-02-09 19:11:16 +08:00
北念
16d4e00549
add BiCifParaformer
2023-02-09 17:53:04 +08:00
hnluo
d7e43300fb
Merge pull request #81 from alibaba-damo-academy/dev
...
Create vad_inference_launch.py
2023-02-09 15:30:21 +08:00
lzr265946
03875965c8
remove global vars
2023-02-09 15:13:14 +08:00
speech_asr
cced441e5f
add file flush
2023-02-09 15:12:14 +08:00
zhifu gao
08384ef9eb
Merge pull request #75 from alibaba-damo-academy/dev
...
update github.io page
2023-02-08 19:52:18 +08:00
jmwang66
6a41e13ba9
upload github.io
2023-02-08 17:30:19 +08:00
hnluo
02a3eefb35
fix audio_in type bug
2023-02-07 10:09:51 +08:00
hnluo
9c64377c98
support pcm audio format
2023-02-06 17:05:06 +08:00
仁迷
cdac560080
support more audio formats
2023-01-31 17:42:28 +08:00
jmwang66
12a7adfdf3
update version 0.1.6
2023-01-16 18:46:40 +08:00
仁迷
ad75a464d8
update text postprocess
2023-01-09 19:34:52 +08:00
jmwang66
d653df71cb
update text postprocess
2022-12-23 10:40:23 +08:00
lzr265946
a9e857e452
update funasr 0.1.3
2022-12-03 16:39:38 +08:00
游雁
c087854f71
create
2022-11-26 21:56:51 +08:00