雾聪
2aa7d91822
Merge branch 'main' of https://github.com/alibaba-damo-academy/FunASR into main
2024-10-29 16:49:18 +08:00
雾聪
c540f7c831
update readme
2024-10-29 16:48:59 +08:00
游雁
7699a35d2c
v1.1.12
2024-10-29 15:13:14 +08:00
游雁
7edad6fba3
Merge branch 'main' of github.com:alibaba-damo-academy/FunASR
...
merge
2024-10-29 15:11:54 +08:00
游雁
17ed90966a
minmo-s2t
2024-10-29 15:11:33 +08:00
雾聪
4f87e0b8f8
update readme
2024-10-29 15:06:40 +08:00
雾聪
3a10179542
Merge branch 'main' of https://github.com/alibaba-damo-academy/FunASR into main
2024-10-29 11:40:27 +08:00
雾聪
1819303f5e
support SenseVoiceSmall in 2pass mode
2024-10-29 11:40:18 +08:00
Vignesh Skanda
c3e667b217
Update run_evaluate.py ( #2175 )
2024-10-28 21:22:39 +08:00
Truco
1a45b647a8
perf(models/FsmnVADStreaming): optimize GetFrameState and PopDataToOutputBuf ( #2177 )
...
- In GetFrameState(), pass a generator to sum() instead of building a list, ~10% gain on a 21s sample
- In GetFrameState(), cast `sum_score` (a tensor) to float to reduce calls into the tensor library,
~13% gain on a 23s sample
- In PopDataToOutputBuf(), remove the unused `out_pos` and its related calculation, ~10% gain on a 27s sample
2024-10-28 13:41:38 +08:00
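The first change in the commit above (handing sum() a generator rather than a list) can be illustrated with a minimal sketch. This is not the FunASR code: the function names `energy_list` and `energy_gen` and the sample data are hypothetical, chosen only to show that the two forms give the same result while the generator form skips the intermediate list.

```python
def energy_list(samples):
    # Builds a temporary list of squared samples, then sums it.
    return sum([s * s for s in samples])


def energy_gen(samples):
    # Streams squared samples into sum() one at a time; no temporary list.
    return sum(s * s for s in samples)


# Hypothetical sample values, for illustration only.
samples = [0.5, -1.0, 2.0]
result_list = energy_list(samples)
result_gen = energy_gen(samples)
print(result_list, result_gen)  # 5.25 5.25
```

Any speed difference depends on input size and interpreter version, so it would need to be measured on real data, as the commit author did.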
StevenH
1254e8aee1
optimize ComputeDecibel in fsmn-vad model by using numpy ( #2174 )
2024-10-26 12:19:07 +08:00
Vignesh Skanda
2a296ab511
Create Contribution.md ( #2167 )
2024-10-24 11:10:09 +08:00
Djraemon
a76f15c785
Fix audio format ( #2159 )
...
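* Added a patch that checks whether the audio file extension is .mp3; mp3 files are converted to wav
* Added a patch that detects whether the audio file is in mp3 format
* Improved the audio file suffix check: if the suffix is not .wav, the file is converted to wav
* Added an audio file suffix check; an error is raised when the audio file is invalid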
2024-10-21 13:30:45 +08:00
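The suffix check described in the commit above could look roughly like the following sketch. It is an assumption about the general shape, not the code from the pull request: the name `ensure_wav` and the `convert` callback are hypothetical, and the actual conversion (for example through an external tool) is left to the caller.

```python
from pathlib import Path
from typing import Callable


def ensure_wav(path: str, convert: Callable[[Path, Path], None]) -> Path:
    """Return a .wav path for `path`, converting the file if needed.

    `convert(src, dst)` is a caller-supplied (hypothetical) function that
    writes a wav version of `src` to `dst`.
    """
    src = Path(path)
    # An invalid (missing) audio file is reported as an error.
    if not src.is_file():
        raise FileNotFoundError(f"invalid audio file: {src}")
    # Files that already carry the .wav suffix are used as-is.
    if src.suffix.lower() == ".wav":
        return src
    # Any other suffix (.mp3, ...) is converted to a sibling .wav file.
    dst = src.with_suffix(".wav")
    convert(src, dst)
    return dst
```

Passing the converter in as a parameter keeps the check itself free of any dependency on a particular audio library.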
Wu Can
757d20b3e8
Fix typo ( #2158 )
...
* doc: Correct html5 download path error
* docs: fix typo
---------
Co-authored-by: wucan <awesomecancanz@gmail.com>
Co-authored-by: WuCan <wucan@haocang.com>
2024-10-21 13:30:04 +08:00
zhifu gao
ed143ec57c
Update README.md
2024-10-18 11:21:01 +08:00
游雁
98e2c546a0
funasr tables
2024-10-16 15:22:05 +08:00
游雁
6e6475cd2a
funasr tables
2024-10-16 14:35:56 +08:00
游雁
7900433640
funasr tables
2024-10-16 14:31:31 +08:00
Vignesh Skanda
e6fe606577
Update register.py ( #2145 )
2024-10-16 13:45:22 +08:00
Vignesh Skanda
9a70dac239
Update README.md ( #2146 )
2024-10-16 12:49:10 +08:00
Kun Lu
db308e7535
feat: add campplus merge_thr ( #2135 )
2024-10-15 17:52:10 +08:00
pointerhacker
70645e4807
Model training errors that data parallelism may cause ( #2139 )
...
* fix: fix an error that may occur during data-parallel training
* fix: fix an error that may occur during data-parallel training
* fix: fix data parallel ... not need tensor
---------
Co-authored-by: zhaochaojin <zhaochaojin@didiglobal.com>
2024-10-15 17:50:51 +08:00
游雁
5c28b4d612
whisper-large-v3-turbo
2024-10-14 00:21:24 +08:00
游雁
cd68458099
whisper-large-v3-turbo
2024-10-11 16:10:04 +08:00
游雁
7511595b94
Merge branch 'main' of github.com:alibaba-damo-academy/FunASR
...
merge
2024-10-11 14:39:34 +08:00
游雁
6d932da239
whisper-large-v3-turbo
2024-10-11 14:37:27 +08:00
雾聪
1480dcf5d5
add GetInputNames GetOutputNames
2024-10-10 17:45:45 +08:00
雾聪
bef2d3a391
Merge branch 'main' of https://github.com/alibaba-damo-academy/FunASR into main
2024-10-10 15:44:58 +08:00
雾聪
4d8e96f695
fix memory leak of GetInputOutputInfo
2024-10-10 15:44:44 +08:00
游雁
2330e58f5f
bugfix v1.1.11
2024-10-10 15:32:08 +08:00
游雁
5fc1d918aa
v1.1.10
2024-10-09 10:58:52 +08:00
Nixon
5dbe6898ca
fix list index out of range error ( #2122 )
...
Co-authored-by: nixonjin <nixonjin@tencent.com>
2024-10-09 10:40:58 +08:00
游雁
0bf8edca37
find_unused_parameters
2024-09-30 15:43:13 +08:00
游雁
5d35b3c70b
find_unused_parameters
2024-09-30 12:28:16 +08:00
sugarcase
a8f0aad81d
fsmn_kws_mt finetune and inference adapt to right modelscope hub ( #2113 )
...
Co-authored-by: pengteng.spt <pengteng.spt@alibaba-inc.com>
2024-09-27 14:16:28 +08:00
Yabin Li
5b53b8fb7b
Update websocket_protocol_zh.md
2024-09-26 15:27:20 +08:00
雾聪
cd4c6485ef
update readme
2024-09-26 11:28:03 +08:00
雾聪
df24be2892
Revert "fix onnxruntime memoryleak when load model ( #2108 )"
...
This reverts commit c494d8fd18.
2024-09-26 00:11:19 +08:00
雾聪
d62d237a76
add sensevoice in offline-stream
2024-09-25 23:46:47 +08:00
雾聪
aa72e0ca5f
add sensevoice-small
2024-09-25 23:44:42 +08:00
雾聪
3e44172c8b
update websocket for sensevoice & onnx models
2024-09-25 23:43:30 +08:00
lyblsgo
67239ea39b
update type_utils
2024-09-25 23:38:22 +08:00
locasxe
c494d8fd18
fix onnxruntime memoryleak when load model ( #2108 )
...
* onnxruntime memoryleak fix
* fix onnxruntime memoryleak
* fix onnxruntime memoryleak
2024-09-25 23:25:18 +08:00
sugarcase
8cd8e25f18
add demo.py for all kws models ( #2107 )
...
Co-authored-by: pengteng.spt <pengteng.spt@alibaba-inc.com>
2024-09-25 23:24:58 +08:00
lyblsgo
8745733b54
paraformer_bin bugfix
2024-09-25 20:24:42 +08:00
游雁
fc00476a25
Merge branch 'main' of github.com:alibaba-damo-academy/FunASR
...
merge
2024-09-25 16:34:07 +08:00
游雁
1a44f82ead
v1.1.8
2024-09-25 16:33:37 +08:00
游雁
c0e7b17f08
sensevoice bugfix
2024-09-25 16:32:58 +08:00
zhifu gao
4294d2166e
update README ( #2106 )
...
Co-authored-by: pengteng.spt <pengteng.spt@alibaba-inc.com>
2024-09-25 16:03:15 +08:00
游雁
8b0f876c0c
v1.1.7
2024-09-25 15:27:06 +08:00