FunASR

mirror of https://github.com/modelscope/FunASR synced 2025-09-15 14:48:36 +08:00

Author	SHA1	Message	Date
游雁	58830eca40	License Agreement	2024-11-20 11:27:45 +08:00
游雁	6a19d111ed	License Agreement	2024-11-20 11:22:20 +08:00
Steve Li	8cd0d4aab7	Add bounds check for postprocess_utils.py abbr_dispose() (#2209) File "/Users/{USER}/.pyenv/versions/funasr_usage/lib/python3.12/site-packages/funasr/utils/postprocess_utils.py", line 127, in abbr_dispose: end = time_stamp[ts_nums[num]][1] IndexError: list index out of range	2024-11-15 11:03:31 +08:00
Yuekai Zhang	ef1d7b3f12	Merge pull request #2206 from yijinsheng/triton_gpu: fix paraformer_large_offline Triton runtime bug	2024-11-12 17:23:51 +08:00
yijinsheng	2b747626c8	Fix paraformer_large_offline Triton runtime bug	2024-11-11 23:52:48 +08:00
游雁	5e100f1244	ds stage0	2024-11-08 13:51:50 +08:00
游雁	fc515cbe2e	refactor(deepspeed_conf): remove old config files	2024-11-08 13:30:48 +08:00
游雁	1a0de67a08	SenseVoice docs	2024-11-07 13:58:20 +08:00
zhifu gao	5f25e809c5	Update version.txt	2024-11-05 16:33:27 +08:00
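Djraemon	7e9696f156	Fix audio format 2.0 (#2186) * Add a patch that checks whether the audio file extension is .mp3 and converts mp3 files to wav * Add a patch that detects whether an audio file is in mp3 format * Improve the audio file extension check: files whose extension is not .wav are converted to wav * Add an audio file extension check; raise an error when the audio file is invalid * Add the audio file extension check to the paraformer and vad models and convert non-wav formats to wav * Change the data type of wav_path so that the demo runs correctly	2024-11-04 11:04:52 +08:00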
游雁	6224003492	modelscope	2024-11-01 13:55:14 +08:00
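游雁	53a06e3c1a	fix(register): change how duplicate key registration is handled. Update the registration system so that registering an existing key prints a notice and re-registers instead of raising an exception.	2024-11-01 09:41:16 +08:00
游雁	9118496192	docs: update model registration tutorial text	2024-10-31 18:51:55 +08:00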
zhifu gao	cc88b1b317	Update finetune.sh	2024-10-31 18:49:42 +08:00
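游雁	0572a434e6	docs: add language-switch links to the document title section	2024-10-31 18:47:38 +08:00
游雁	d2fb3a8fad	docs(tutorial): update tables configuration docs	2024-10-31 18:44:58 +08:00
游雁	811c516932	docs: update tutorial documentation links	2024-10-31 18:39:34 +08:00
游雁	e6f58e7bc7	docs(tutorial): add tutorial on registering new models	2024-10-31 18:38:05 +08:00
游雁	949a95986c	docs: remove Paraformer model example code	2024-10-31 16:29:40 +08:00
游雁	567bf98954	fix(model): adjust the upsampling logic in the Codec submodel and fix the offset in accuracy calculation. This commit fixes the upsampling logic in the Codec submodel and adjusts the label offset used when computing accuracy.	2024-10-31 16:22:18 +08:00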
雾聪	2aa7d91822	Merge branch 'main' of https://github.com/alibaba-damo-academy/FunASR into main	2024-10-29 16:49:18 +08:00
雾聪	c540f7c831	update readme	2024-10-29 16:48:59 +08:00
游雁	7699a35d2c	v1.1.12	2024-10-29 15:13:14 +08:00
游雁	7edad6fba3	Merge branch 'main' of github.com:alibaba-damo-academy/FunASR merge	2024-10-29 15:11:54 +08:00
游雁	17ed90966a	minmo-s2t	2024-10-29 15:11:33 +08:00
雾聪	4f87e0b8f8	update readme	2024-10-29 15:06:40 +08:00
雾聪	3a10179542	Merge branch 'main' of https://github.com/alibaba-damo-academy/FunASR into main	2024-10-29 11:40:27 +08:00
雾聪	1819303f5e	support SenseVoiceSmall in 2pass mode	2024-10-29 11:40:18 +08:00
Vignesh Skanda	c3e667b217	Update run_evaluate.py (#2175)	2024-10-28 21:22:39 +08:00
Truco	1a45b647a8	perf(models/FsmnVADStreaming): optimize GetFrameState and PopDataToOutputBuf (#2177) - In GetFrameState(), pass generator to sum() instead of generating a list, ~10% gain in a 21s sample - In GetFrameState(), cast `sum_score` (a tensor) to float to reduce calling to tensor lib, ~13% gain in a 23s example - In PopDataToOutputBuf(), remove unused `out_pos` and related calculation, ~10% gain in a 27s sample	2024-10-28 13:41:38 +08:00
StevenH	1254e8aee1	optimize ComputeDecibel in fsmn-vad model by using numpy (#2174)	2024-10-26 12:19:07 +08:00
Vignesh Skanda	2a296ab511	Create Contribution.md (#2167)	2024-10-24 11:10:09 +08:00
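Djraemon	a76f15c785	Fix audio format (#2159) * Add a patch that checks whether the audio file extension is .mp3 and converts mp3 files to wav * Add a patch that detects whether an audio file is in mp3 format * Improve the audio file extension check: files whose extension is not .wav are converted to wav * Add an audio file extension check; raise an error when the audio file is invalid	2024-10-21 13:30:45 +08:00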
Wu Can	757d20b3e8	Fix typo (#2158) * doc: Correct html5 download path error * docs: fix typo Co-authored-by: wucan <awesomecancanz@gmail.com> Co-authored-by: WuCan <wucan@haocang.com>	2024-10-21 13:30:04 +08:00
Vignesh Skanda	f99e5fc706	Update README.md	2024-10-19 22:41:02 +05:30
zhifu gao	ed143ec57c	Update README.md	2024-10-18 11:21:01 +08:00
游雁	98e2c546a0	funasr tables	2024-10-16 15:22:05 +08:00
游雁	6e6475cd2a	funasr tables	2024-10-16 14:35:56 +08:00
游雁	7900433640	funasr tables	2024-10-16 14:31:31 +08:00
Vignesh Skanda	e6fe606577	Update register.py (#2145)	2024-10-16 13:45:22 +08:00
Vignesh Skanda	9a70dac239	Update README.md (#2146)	2024-10-16 12:49:10 +08:00
Kun Lu	db308e7535	feat: add campplus merge_thr (#2135)	2024-10-15 17:52:10 +08:00
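pointerhacker	70645e4807	Fix model training errors that data parallelism may cause (#2139) * fix: fix errors that may occur during data-parallel training * fix: fix errors that may occur during data-parallel training * fix: fix data-parallel ... not need tensor Co-authored-by: zhaochaojin <zhaochaojin@didiglobal.com>	2024-10-15 17:50:51 +08:00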
游雁	5c28b4d612	whisper-large-v3-turbo	2024-10-14 00:21:24 +08:00
游雁	cd68458099	whisper-large-v3-turbo	2024-10-11 16:10:04 +08:00
游雁	7511595b94	Merge branch 'main' of github.com:alibaba-damo-academy/FunASR merge	2024-10-11 14:39:34 +08:00
游雁	6d932da239	whisper-large-v3-turbo	2024-10-11 14:37:27 +08:00
雾聪	1480dcf5d5	add GetInputNames GetOutputNames	2024-10-10 17:45:45 +08:00
雾聪	bef2d3a391	Merge branch 'main' of https://github.com/alibaba-damo-academy/FunASR into main	2024-10-10 15:44:58 +08:00
雾聪	4d8e96f695	fix memory leak of GetInputOutputInfo	2024-10-10 15:44:44 +08:00

4801 Commits