FunASR

mirror of https://github.com/modelscope/FunASR synced 2025-09-15 14:48:36 +08:00

Author	SHA1	Message	Date
zhifu gao	c0b2079fb1	Update README.md	2025-09-09 19:36:55 +08:00
游雁	4402e95b0f	v1.2.7	2025-08-15 15:22:18 +08:00
游雁	f5051c55cd	trust_remote_code	2025-08-15 15:10:37 +08:00
majic31	5115a066c9	fix #2587 : Resolve VAD multithreading issue (#2613 ) * Fix crash in ASR tasks when lm is set to none in #2237 * fix #2587: Resolve VAD multithreading issue * Update funasr/models/fsmn_vad_streaming/model.py ok Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> --------- Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2025-08-14 16:08:44 +08:00
Kyle He	82a07e2f6e	fix: funasr_onnx pass run_options to ort (#2632 )	2025-08-14 16:07:54 +08:00
topjgc	68dcef510a	Update SECURITY.md (#2639 )	2025-08-14 16:07:24 +08:00
topjgc	4675344c13	Create SECURITY.md (#2638 )	2025-08-14 15:56:12 +08:00
ming030890	b3fb4c0acd	Allow one to set a custom progress callback (#2609 ) * Allow one to set a custom progress callback so that they can show it own progrss bar * Uncomment an existing test * restore indentation --------- Co-authored-by: Tony Mak <tony@Tonys-MacBook-Air-1802.local>	2025-08-05 17:48:10 +08:00
Jeffrey Guan	8316fc4197	docs(readme): update onnx example import path and cache location (#2607 ) The example code in README files was updated to use the correct import path for Paraformer from the runtime package instead of the top-level module. The wav path was also modified to use Path.home() for cross-platform compatibility and to include the missing 'models' directory in the cache path. These changes ensure the examples work correctly with the current package structure.	2025-07-23 14:56:31 +08:00
ming030890	a750595594	Fix a few issues found during fine-tuning (#2582 ) * Fix wandb log * fix validation loss is not logged batch_idx got reset for each epoch. use the global step counter instead * LR should only be updated per step, not per step+ per epoch * add early stopping * Fix bf16 handling scaler is only needed for fp16 * more logs --------- Co-authored-by: Tony Mak <tony@Tonys-MacBook-Air-1800.local>	2025-07-04 14:25:54 +08:00
nianjiuhuiyi	05c8eba11c	Fix: 修复c++后端服务因为空数组的异常退出，以及c++的http服务在收到Ctrl+C信号后无法正常退出 (#2571 ) * Fix: 修复c++后端，在持续接收请求过程中运行不定时长后，服务会因为segmentation fault挂掉添加msg_stamp是否为空的判断，在某些特定请求中，msg_stamp可能为空，避免msg_stamp为空时，直接对其进行取值操作。 * Fix: 修复c++的http服务在收到Ctrl+C信号后无法正常退出的问题在程序主动退出时，添加工作守卫的移除，确保在捕获Ctrl+C信号后，程序能正退退出。	2025-06-27 09:57:45 +08:00
kmn1024	443bc09c11	Bugfix: Only allow rank==0 to clean up old checkpoints (#2558 ) Fixes bug: https://github.com/modelscope/FunASR/issues/2557	2025-06-25 16:34:30 +08:00
Thirteen	a3d6e48fe1	Update server.py (#2556 ) Fix the error that occurs when there are no text elements in the audio and an exception is thrown.	2025-06-21 22:45:24 +08:00
lyblsgo	3ee9e8e3ba	update runtime docs	2025-06-16 19:23:20 +08:00
yuGAN6	3445cd9652	sensevoice2jsonl.py punctuation matching fix (#2533 ) * fix sensevoice2jsonl.py punctuation check * fix sensevoice2jsonl.py punc check	2025-05-28 10:33:26 +08:00
Logan	ab2148ec18	更新go client 的原生实现 (#2532 )	2025-05-26 14:34:47 +08:00
chengligen	8b0fb74bde	feat: add 'words' key aligned with timestamps in sensevoice model output (#2531 )	2025-05-26 14:11:33 +08:00
王梦迪	9038340be7	修复Fsmn_vad_online多线程调用报错 (#2528 ) * 缓存cmvn file加载结果，避免多次实例化WavFrontend时重复加载 * 修复Fsmn_vad_online并发调用报错 --------- Co-authored-by: wangmengdi06 <wangmengdi06@58.com>	2025-05-26 14:11:02 +08:00
TouchZZZ	eccf746847	fix: msg_stamp.size() is unsigned,should convert to int (#2523 ) Co-authored-by: zjw <zhangjinwei@1586906817702817.onaliyun.com>	2025-05-22 17:47:28 +08:00
王梦迪	561bdbdfc0	通过缓存seg_dict，加快seaco_paraformer推理 (#2520 ) Co-authored-by: wangmengdi06 <wangmengdi06@58.com>	2025-05-22 11:27:01 +08:00
王梦迪	fe588bc508	Fsmn_vad支持多线程并发调用 (#2519 ) * 修复WavFrontend.fbank多线程调用下共享fbank_fn导致的bug * Fsmn_vad支持多线程并发调用 --------- Co-authored-by: wangmengdi06 <wangmengdi06@58.com>	2025-05-20 16:10:59 +08:00
zhaomingwork	ae013cf597	fix bug for core dump in http, use libboost as parse (#2509 )	2025-05-08 23:52:09 +08:00
zhifu gao	2c2fb5e1eb	Update README.md (#2487 )	2025-04-22 09:56:44 +08:00
zhifu gao	4bfcfd7f13	Update README_zh.md	2025-04-22 09:53:18 +08:00
xmx0632	e7237d8cb4	add mac m1 mps support (#2477 )	2025-04-14 13:40:12 +08:00
Akshay Shinde	8c336fed79	docs: add MIT license file (#2472 ) (#2476 ) Co-authored-by: Akshay Popat Shinde <akshaypopat.shinde@ad.infosys.com>	2025-04-12 00:20:17 +08:00
AldarisX	d43d0853dc	add intel xpu support (#2468 )	2025-04-07 21:20:31 +08:00
yijinsheng	9afa40520f	本地模型加载 (#2453 )	2025-04-07 00:29:17 +08:00
hanydd	5799831104	修复模型仓库链接 (#2464 )	2025-04-01 23:21:20 +08:00
Isuxiz Slidder	3df109adfc	Update model.py to fix "IndexError: index 1 is out of bounds for dimension 1 with size 0" (#2454 ) * Update model.py Avoid exception of "IndexError: index 1 is out of bounds for dimension 1 with size 0" * Update model.py Add return word in timestamps * Revert "Update model.py" This reverts commit `bc736df302`.	2025-03-31 17:51:52 +08:00
zhifu gao	d629b1084b	Update README_zh.md	2025-03-28 17:12:09 +08:00
天地	e24dbdc496	感觉应该从文件读取更合适，因为上面判断了文件存在，且可以读取，如果本身是文本的话，下面也会有逻辑进行处理 (#2452 ) Co-authored-by: tiandiweizun <qq1274949542@163.com>	2025-03-26 13:44:41 +08:00
passerbya	5ee2f382b3	FIX 'NoneType' object has no attribute 'isalpha' (#2440 ) Traceback (most recent call last): File "/root/miniconda3/envs/sensevoice/lib/python3.10/multiprocessing/process.py", line 314, in _bootstrap self.run() File "/root/miniconda3/envs/sensevoice/lib/python3.10/multiprocessing/process.py", line 108, in run self._target(self._args, self._kwargs) File "/usr/local/data2/workspace/egs_vocal_extractor/data/speech_det.py", line 156, in process_audio_task res = model.generate( File "/root/miniconda3/envs/sensevoice/lib/python3.10/site-packages/funasr/auto/auto_model.py", line 306, in generate return self.inference_with_vad(input, input_len=input_len, cfg) File "/root/miniconda3/envs/sensevoice/lib/python3.10/site-packages/funasr/auto/auto_model.py", line 464, in inference_with_vad results = self.inference( File "/root/miniconda3/envs/sensevoice/lib/python3.10/site-packages/funasr/auto/auto_model.py", line 345, in inference res = model.inference(batch, *kwargs) File "/root/miniconda3/envs/sensevoice/lib/python3.10/site-packages/funasr/models/sense_voice/model.py", line 950, in inference timestamp = self.post(timestamp) File "/root/miniconda3/envs/sensevoice/lib/python3.10/site-packages/funasr/models/sense_voice/model.py", line 973, in post elif prev_word.isalpha() and prev_word.isascii() and word.isalpha() and word.isascii(): AttributeError: 'NoneType' object has no attribute 'isalpha'	2025-03-20 23:01:05 +08:00
nianjiuhuiyi	0d6e6a2fb8	Fix: Spelling causes hotwords to not take effect (#2444 ) The hotword passed by the client cannot take effect because of a spelling error.	2025-03-20 22:59:42 +08:00
天地	6e69d784e4	1. bug fix:list(mean)和list(var)，由于mean和var是numpy，导致写入到文件的格式错误，参考上面的话，大概率是list(mean.tolist())，其实外层list没有必要 (#2437 ) 2. 删除不必要的代码list(numpy_array.tolist())-->numpy_array.tolist() 3. 性能优化：replace没有必要，性能慢，性能为O(nm)，n是源字符串长度，m是需要替换的字符串长度，虽然这里的m长度是1，且list转字符串的"[]",只有首尾有，直接拼接即可。 Co-authored-by: tiandiweizun <qq1274949542@163.com>	2025-03-19 23:10:13 +08:00
天地	3903d7c775	Update README_zh.md (#2436 ) 修正推理参数笔误	2025-03-18 14:36:27 +08:00
Han Zhang	3c349ac053	fix: use converted token_ids for alignment for sensevoice model with timestamp output (#2429 ) * fix: use converted token_ids for alignment BPE doesn't guarantee converted ids (subwords) are revertible. which means `tokens` converted back is not always the same as `token_int`. A easy fix is to directly use the converted ids for alignment. Since they are from the same text, it shouldn't matter. * fix: handle empty string to index an empty string is to raise an exception. 这里没有判空。	2025-03-18 11:45:37 +08:00
lyblsgo	77db489a8f	update dingding.png	2025-03-13 19:28:25 +08:00
游雁	93c701bab6	v1.2.6	2025-03-11 14:26:35 +08:00
Shi Xian	027ad5c2a1	Merge pull request #2418 from modelscope/revert-2382-dev_clean Revert "# 增强说话人分离功能的时间戳支持"	2025-03-11 13:54:42 +08:00
Shi Xian	700cb827f5	Revert "# 增强说话人分离功能的时间戳支持"	2025-03-11 13:54:23 +08:00
hohaiuhsx	6fe10a8dbf	修复当选用SenseVoice模型处理长音频（同时开启vad和output_timestamp）时的异常 (#2413 )	2025-03-10 23:16:22 +08:00
游雁	9c67d9b969	v1.2.5	2025-03-07 23:41:54 +08:00
zhifu gao	37fb08962d	Update README_zh.md	2025-03-05 14:16:42 +08:00
游雁	82e5ca37a8	Large-Scale Data Training	2025-02-25 14:28:34 +08:00
Shi Xian	57e023e5cf	Merge pull request #2382 from msgk239/dev_clean # 增强说话人分离功能的时间戳支持	2025-02-18 15:15:40 +08:00
msgk	a8591060d3	fix(spk): 修复 speaker embedding 集群后的重新排序问题 - 增加了对时间戳支持的检查 - 初始化 punc_res 变量以处理不同情况 - 根据模型情况设置 punc_res，包括内部标点模型、外部标点模型和仅时间戳情况 - 修复了缺少标点模型时的错误处理	2025-02-14 14:16:51 +08:00
游雁	53ac0cb401	v1.2.4	2025-02-13 14:16:05 +08:00
游雁	1eeb5067ab	Merge branch 'main' of github.com:alibaba-damo-academy/FunASR merge	2025-02-13 14:06:33 +08:00
游雁	604ae30fdb	oom fix	2025-02-13 14:06:03 +08:00

1 2 3 4 5 ...

4802 Commits