Commit Graph

4687 Commits

Author SHA1 Message Date
zhifu gao
cc88b1b317
Update finetune.sh 2024-10-31 18:49:42 +08:00
游雁
0572a434e6 docs: 添加多语言切换链接至文档标题部分 2024-10-31 18:47:38 +08:00
游雁
d2fb3a8fad docs(tutorial): 更新表格配置文档 2024-10-31 18:44:58 +08:00
游雁
811c516932 docs: 更新教程文档链接 2024-10-31 18:39:34 +08:00
游雁
e6f58e7bc7 docs(tutorial): 添加新模型注册教程 2024-10-31 18:38:05 +08:00
游雁
949a95986c docs: 移除Paraformer模型示例代码 2024-10-31 16:29:40 +08:00
游雁
567bf98954 fix(model): 调整Codec子模型中的上采样逻辑并修正准确率计算偏移问题
此提交修复了Codec子模型中的上采样逻辑,并调整了准确率计算时的标签偏移问题。
2024-10-31 16:22:18 +08:00
雾聪
2aa7d91822 Merge branch 'main' of https://github.com/alibaba-damo-academy/FunASR into main 2024-10-29 16:49:18 +08:00
雾聪
c540f7c831 update readme 2024-10-29 16:48:59 +08:00
游雁
7699a35d2c v1.1.12 2024-10-29 15:13:14 +08:00
游雁
7edad6fba3 Merge branch 'main' of github.com:alibaba-damo-academy/FunASR
merge
2024-10-29 15:11:54 +08:00
游雁
17ed90966a minmo-s2t 2024-10-29 15:11:33 +08:00
雾聪
4f87e0b8f8 update readme 2024-10-29 15:06:40 +08:00
雾聪
3a10179542 Merge branch 'main' of https://github.com/alibaba-damo-academy/FunASR into main 2024-10-29 11:40:27 +08:00
雾聪
1819303f5e support SenseVoiceSmall in 2pass mode 2024-10-29 11:40:18 +08:00
Vignesh Skanda
c3e667b217
Update run_evaluate.py (#2175) 2024-10-28 21:22:39 +08:00
Truco
1a45b647a8
perf(models/FsmnVADStreaming): optimize GetFrameState and PopDataToOutputBuf (#2177)
- In GetFrameState(), pass generator to sum() instead of generating a list, ~10% gain in a 21s sample
- In GetFrameState(), cast `sum_score` (a tensor) to float to reduce calling to tensor lib,
  ~13% gain in a 23s example
- In PopDataToOutputBuf(), remove unused `out_pos` and related calculation, ~10% gain in a 27s sample
2024-10-28 13:41:38 +08:00
StevenH
1254e8aee1
optimize ComputeDecibel in fsmn-vad model by using numpy (#2174) 2024-10-26 12:19:07 +08:00
Vignesh Skanda
2a296ab511
Create Contribution.md (#2167) 2024-10-24 11:10:09 +08:00
Djraemon
a76f15c785
Fix audio format (#2159)
* 添加了对音频文件扩展名是否为.mp3的补丁,是mp3格式则转化为wav格式

* 增加检测音频文件是否为mp3格式的补丁

* 完善对音频文件后缀名的检查,若文件后缀不是.wav,则转化为wav

* 增加音频文件后缀名检查;音频文件无效时抛出错误
2024-10-21 13:30:45 +08:00
Wu Can
757d20b3e8
Fix typo (#2158)
* doc: Correct html5 download path error

* docs: fix typo

---------

Co-authored-by: wucan <awesomecancanz@gmail.com>
Co-authored-by: WuCan <wucan@haocang.com>
2024-10-21 13:30:04 +08:00
zhifu gao
ed143ec57c
Update README.md 2024-10-18 11:21:01 +08:00
游雁
98e2c546a0 funasr tables 2024-10-16 15:22:05 +08:00
游雁
6e6475cd2a funasr tables 2024-10-16 14:35:56 +08:00
游雁
7900433640 funasr tables 2024-10-16 14:31:31 +08:00
Vignesh Skanda
e6fe606577
Update register.py (#2145) 2024-10-16 13:45:22 +08:00
Vignesh Skanda
9a70dac239
Update README.md (#2146) 2024-10-16 12:49:10 +08:00
Kun Lu
db308e7535
feat: add campplus merge_thr (#2135) 2024-10-15 17:52:10 +08:00
pointerhacker
70645e4807
数据并行可能导致的模型训练报错 (#2139)
* fix: 修复数据并行训练中ä¼可能会出现的错误

* fix: 修复数据并行训练中ä¼可能会出现的错误

* fix: 修复数据并行ènot need tensor

---------

Co-authored-by: zhaochaojin <zhaochaojin@didiglobal.com>
2024-10-15 17:50:51 +08:00
游雁
5c28b4d612 whisper-large-v3-turbo 2024-10-14 00:21:24 +08:00
游雁
cd68458099 whisper-large-v3-turbo 2024-10-11 16:10:04 +08:00
游雁
7511595b94 Merge branch 'main' of github.com:alibaba-damo-academy/FunASR
merge
2024-10-11 14:39:34 +08:00
游雁
6d932da239 whisper-large-v3-turbo 2024-10-11 14:37:27 +08:00
雾聪
1480dcf5d5 add GetInputNames GetOutputNames 2024-10-10 17:45:45 +08:00
雾聪
bef2d3a391 Merge branch 'main' of https://github.com/alibaba-damo-academy/FunASR into main 2024-10-10 15:44:58 +08:00
雾聪
4d8e96f695 fix memmory leak of GetInputOutputInfo 2024-10-10 15:44:44 +08:00
游雁
2330e58f5f bugfix v1.1.11 2024-10-10 15:32:08 +08:00
游雁
5fc1d918aa v1.1.10 2024-10-09 10:58:52 +08:00
Nixon
5dbe6898ca
fix list index out of range error (#2122)
Co-authored-by: nixonjin <nixonjin@tencent.com>
2024-10-09 10:40:58 +08:00
游雁
0bf8edca37 find_unused_parameters 2024-09-30 15:43:13 +08:00
游雁
5d35b3c70b find_unused_parameters 2024-09-30 12:28:16 +08:00
sugarcase
a8f0aad81d
fsmn_kws_mt finetune and inference adapt to right modelscope hub (#2113)
Co-authored-by: pengteng.spt <pengteng.spt@alibaba-inc.com>
2024-09-27 14:16:28 +08:00
Yabin Li
5b53b8fb7b
Update websocket_protocol_zh.md 2024-09-26 15:27:20 +08:00
雾聪
cd4c6485ef update readme 2024-09-26 11:28:03 +08:00
雾聪
df24be2892 Revert "fix onnxruntime memoryleak when load model (#2108)"
This reverts commit c494d8fd18.
2024-09-26 00:11:19 +08:00
雾聪
d62d237a76 add sensevoice in offline-stream 2024-09-25 23:46:47 +08:00
雾聪
aa72e0ca5f add sensevoice-small 2024-09-25 23:44:42 +08:00
雾聪
3e44172c8b update wbsocket for sensevoice & onnx models 2024-09-25 23:43:30 +08:00
lyblsgo
67239ea39b update type_utils 2024-09-25 23:38:22 +08:00
locasxe
c494d8fd18
fix onnxruntime memoryleak when load model (#2108)
* onnxruntime memoryleak fix

* fix onnxruntime memoryleak

* fix onnxruntime memoryleak
2024-09-25 23:25:18 +08:00