游雁
58830eca40
License Agreement
2024-11-20 11:27:45 +08:00
游雁
6a19d111ed
License Agreement
2024-11-20 11:22:20 +08:00
Steve Li
8cd0d4aab7
Add bounds check for postprocess_utils.py abbr_dispose() ( #2209 )
...
"/Users/{USER}/.pyenv/versions/funasr_usage/lib/python3.12/site-packages/funasr/utils/postprocess_utils.py", line 127, in abbr_dispose
end = time_stamp[ts_nums[num]][1]
~~~~~~~~~~^^^^^^^^^^^^^^
IndexError: list index out of range
2024-11-15 11:03:31 +08:00
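The traceback above shows `time_stamp[ts_nums[num]][1]` failing with an `IndexError`. A minimal sketch of the kind of guard such a fix might add is shown below. The helper name `safe_end_time` and the choice to return `None` are assumptions for illustration, not the code merged in #2209.

```python
def safe_end_time(time_stamp, ts_nums, num):
    """Return the end time for entry `num`, or None if an index is out of range.

    Hypothetical helper: the names mirror the traceback, the behavior is assumed.
    """
    # Guard the outer lookup ts_nums[num].
    if not 0 <= num < len(ts_nums):
        return None
    idx = ts_nums[num]
    # Guard the inner lookup time_stamp[idx].
    if not 0 <= idx < len(time_stamp):
        return None
    return time_stamp[idx][1]


time_stamp = [[0, 100], [100, 250]]
ts_nums = [0, 1, 2]  # the last entry points past the end of time_stamp
print(safe_end_time(time_stamp, ts_nums, 1))
print(safe_end_time(time_stamp, ts_nums, 2))
```

Returning `None` leaves the decision to the caller; a real fix could equally clamp to the last timestamp or skip the token.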
Yuekai Zhang
ef1d7b3f12
Merge pull request #2206 from yijinsheng/triton_gpu
...
Fix paraformer_large_offline Triton runtime bug
2024-11-12 17:23:51 +08:00
yijinsheng
2b747626c8
Fix paraformer_large_offline Triton runtime bug
2024-11-11 23:52:48 +08:00
游雁
5e100f1244
ds stage0
2024-11-08 13:51:50 +08:00
游雁
fc515cbe2e
refactor(deepspeed_conf): remove old configuration files
2024-11-08 13:30:48 +08:00
游雁
1a0de67a08
SenseVoice docs
2024-11-07 13:58:20 +08:00
zhifu gao
5f25e809c5
Update version.txt
2024-11-05 16:33:27 +08:00
Djraemon
7e9696f156
Fix audio format 2.0 ( #2186 )
...
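* Add a patch that checks whether the audio file extension is .mp3 and, if so, converts the file to WAV
* Add a patch that detects whether the audio file is in MP3 format
* Improve the audio file extension check: if the extension is not .wav, convert the file to WAV
* Add an audio file extension check; raise an error when the audio file is invalid
* Add the audio file extension check to the paraformer and vad models, converting non-WAV formats to WAV
* Change the data type of wav_path so that the demo runs successfully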
2024-11-04 11:04:52 +08:00
游雁
6224003492
modelscope
2024-11-01 13:55:14 +08:00
游雁
53a06e3c1a
fix(register): change how duplicate registration keys are handled
...
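Update the registration system: when an attempt is made to register a key that already exists, print a notice and re-register instead of raising an exception.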
2024-11-01 09:41:16 +08:00
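The behavior described in the commit above (notify and re-register on a duplicate key instead of raising) can be sketched as follows. The `Registry` class, its method names, and the message text are hypothetical and are not taken from FunASR's `register.py`.

```python
class Registry:
    """Hypothetical name-to-class table, for illustration only."""

    def __init__(self):
        self._entries = {}

    def register(self, key):
        def decorator(cls):
            if key in self._entries:
                # Assumed new behavior: notify and overwrite rather than raise.
                print(f"Key {key!r} is already registered; re-registering.")
            self._entries[key] = cls
            return cls
        return decorator

    def get(self, key):
        return self._entries[key]


registry = Registry()


@registry.register("my_model")
class ModelA:
    pass


@registry.register("my_model")  # duplicate key: prints a notice, then overwrites
class ModelB:
    pass
```

With this design the most recent registration wins, which is convenient when a module is re-imported in an interactive session, at the cost of silently shadowing an earlier class if the notice is overlooked.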
游雁
9118496192
docs: update wording of the model registration tutorial
2024-10-31 18:51:55 +08:00
zhifu gao
cc88b1b317
Update finetune.sh
2024-10-31 18:49:42 +08:00
游雁
0572a434e6
docs: add language-switch links to the document title section
2024-10-31 18:47:38 +08:00
游雁
d2fb3a8fad
docs(tutorial): update tables configuration documentation
2024-10-31 18:44:58 +08:00
游雁
811c516932
docs: update tutorial documentation links
2024-10-31 18:39:34 +08:00
游雁
e6f58e7bc7
docs(tutorial): add tutorial on registering a new model
2024-10-31 18:38:05 +08:00
游雁
949a95986c
docs: remove Paraformer model example code
2024-10-31 16:29:40 +08:00
游雁
567bf98954
fix(model): adjust upsampling logic in the Codec submodel and fix the offset in accuracy calculation
...
This commit fixes the upsampling logic in the Codec submodel and adjusts the label offset used in the accuracy calculation.
2024-10-31 16:22:18 +08:00
雾聪
2aa7d91822
Merge branch 'main' of https://github.com/alibaba-damo-academy/FunASR into main
2024-10-29 16:49:18 +08:00
雾聪
c540f7c831
update readme
2024-10-29 16:48:59 +08:00
游雁
7699a35d2c
v1.1.12
2024-10-29 15:13:14 +08:00
游雁
7edad6fba3
Merge branch 'main' of github.com:alibaba-damo-academy/FunASR
...
merge
2024-10-29 15:11:54 +08:00
游雁
17ed90966a
minmo-s2t
2024-10-29 15:11:33 +08:00
雾聪
4f87e0b8f8
update readme
2024-10-29 15:06:40 +08:00
雾聪
3a10179542
Merge branch 'main' of https://github.com/alibaba-damo-academy/FunASR into main
2024-10-29 11:40:27 +08:00
雾聪
1819303f5e
support SenseVoiceSmall in 2pass mode
2024-10-29 11:40:18 +08:00
Vignesh Skanda
c3e667b217
Update run_evaluate.py ( #2175 )
2024-10-28 21:22:39 +08:00
Truco
1a45b647a8
perf(models/FsmnVADStreaming): optimize GetFrameState and PopDataToOutputBuf ( #2177 )
...
- In GetFrameState(), pass generator to sum() instead of generating a list, ~10% gain in a 21s sample
- In GetFrameState(), cast `sum_score` (a tensor) to float to reduce calls into the tensor library,
~13% gain in a 23s example
- In PopDataToOutputBuf(), remove unused `out_pos` and related calculation, ~10% gain in a 27s sample
2024-10-28 13:41:38 +08:00
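The first optimization in the commit above, passing a generator to `sum()` rather than building a list first, can be illustrated with a small standalone sketch. The function names and sample data are hypothetical and do not come from the FsmnVADStreaming code; the gains quoted in the commit are the author's measurements and are not reproduced here.

```python
def count_above_list(scores, threshold):
    # Builds a full temporary list before summing.
    return sum([1 for s in scores if s > threshold])


def count_above_gen(scores, threshold):
    # Feeds values to sum() one at a time; no temporary list is allocated.
    return sum(1 for s in scores if s > threshold)


scores = [0.1, 0.7, 0.4, 0.9, 0.95]
print(count_above_list(scores, 0.5), count_above_gen(scores, 0.5))
```

Both functions return the same count; the generator form only changes how the intermediate values are produced.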
StevenH
1254e8aee1
optimize ComputeDecibel in fsmn-vad model by using numpy ( #2174 )
2024-10-26 12:19:07 +08:00
Vignesh Skanda
2a296ab511
Create Contribution.md ( #2167 )
2024-10-24 11:10:09 +08:00
Djraemon
a76f15c785
Fix audio format ( #2159 )
...
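* Add a patch that checks whether the audio file extension is .mp3 and, if so, converts the file to WAV
* Add a patch that detects whether the audio file is in MP3 format
* Improve the audio file extension check: if the extension is not .wav, convert the file to WAV
* Add an audio file extension check; raise an error when the audio file is invalid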
2024-10-21 13:30:45 +08:00
Wu Can
757d20b3e8
Fix typo ( #2158 )
...
* doc: Correct html5 download path error
* docs: fix typo
---------
Co-authored-by: wucan <awesomecancanz@gmail.com>
Co-authored-by: WuCan <wucan@haocang.com>
2024-10-21 13:30:04 +08:00
Vignesh Skanda
f99e5fc706
Update README.md
2024-10-19 22:41:02 +05:30
zhifu gao
ed143ec57c
Update README.md
2024-10-18 11:21:01 +08:00
游雁
98e2c546a0
funasr tables
2024-10-16 15:22:05 +08:00
游雁
6e6475cd2a
funasr tables
2024-10-16 14:35:56 +08:00
游雁
7900433640
funasr tables
2024-10-16 14:31:31 +08:00
Vignesh Skanda
e6fe606577
Update register.py ( #2145 )
2024-10-16 13:45:22 +08:00
Vignesh Skanda
9a70dac239
Update README.md ( #2146 )
2024-10-16 12:49:10 +08:00
Kun Lu
db308e7535
feat: add campplus merge_thr ( #2135 )
2024-10-15 17:52:10 +08:00
pointerhacker
70645e4807
Fix model training errors that data parallelism may cause ( #2139 )
...
* fix: fix errors that may occur during data-parallel training
* fix: fix errors that may occur during data-parallel training
* fix: fix data parallelism (not need tensor)
---------
Co-authored-by: zhaochaojin <zhaochaojin@didiglobal.com>
2024-10-15 17:50:51 +08:00
游雁
5c28b4d612
whisper-large-v3-turbo
2024-10-14 00:21:24 +08:00
游雁
cd68458099
whisper-large-v3-turbo
2024-10-11 16:10:04 +08:00
游雁
7511595b94
Merge branch 'main' of github.com:alibaba-damo-academy/FunASR
...
merge
2024-10-11 14:39:34 +08:00
游雁
6d932da239
whisper-large-v3-turbo
2024-10-11 14:37:27 +08:00
雾聪
1480dcf5d5
add GetInputNames GetOutputNames
2024-10-10 17:45:45 +08:00
雾聪
bef2d3a391
Merge branch 'main' of https://github.com/alibaba-damo-academy/FunASR into main
2024-10-10 15:44:58 +08:00
雾聪
4d8e96f695
fix memory leak of GetInputOutputInfo
2024-10-10 15:44:44 +08:00